Cross-attention enables deep learning on limited omics-imaging-clinical data of 130 lung cancer patients.

CP: Cancer biology CP: Systems biology CT scan attention deep learning gene expression imaging lung cancer multi-modal model radiogenomics small samples survival

Journal

Cell reports methods
ISSN: 2667-2375
Titre abrégé: Cell Rep Methods
Pays: United States
ID NLM: 9918227360606676

Informations de publication

Date de publication:
05 Jul 2024
Historique:
received: 29 08 2023
revised: 18 04 2024
accepted: 17 06 2024
medline: 10 7 2024
pubmed: 10 7 2024
entrez: 9 7 2024
Statut: aheadofprint

Résumé

Deep-learning tools that extract prognostic factors derived from multi-omics data have recently contributed to individualized predictions of survival outcomes. However, the limited size of integrated omics-imaging-clinical datasets poses challenges. Here, we propose two biologically interpretable and robust deep-learning architectures for survival prediction of non-small cell lung cancer (NSCLC) patients, learning simultaneously from computed tomography (CT) scan images, gene expression data, and clinical information. The proposed models integrate patient-specific clinical, transcriptomic, and imaging data and incorporate Kyoto Encyclopedia of Genes and Genomes (KEGG) and Reactome pathway information, adding biological knowledge within the learning process to extract prognostic gene biomarkers and molecular pathways. While both models accurately stratify patients in high- and low-risk groups when trained on a dataset of only 130 patients, introducing a cross-attention mechanism in a sparse autoencoder significantly improves the performance, highlighting tumor regions and NSCLC-related genes as potential biomarkers and thus offering a significant methodological advancement when learning from small imaging-omics-clinical samples.

Identifiants

pubmed: 38981473
pii: S2667-2375(24)00182-6
doi: 10.1016/j.crmeth.2024.100817
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

100817

Informations de copyright

Copyright © 2024 The Author(s). Published by Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of interests The authors declare no competing interests.

Auteurs

Suraj Verma (S)

School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.

Giuseppe Magazzù (G)

York St John University, York, UK.

Noushin Eftekhari (N)

The Alan Turing Institute, London, UK.

Thai Lou (T)

Gateshead Health NHS Foundation Trust, Gateshead, UK.

Alex Gilhespy (A)

South Tyneside and Sunderland NHS Foundation Trust, Sunderland, UK.

Annalisa Occhipinti (A)

School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK.

Claudio Angione (C)

School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK; Centre for Digital Innovation, Teesside University, Middlesbrough, UK; National Horizons Centre, Teesside University, Darlington, UK. Electronic address: c.angione@tees.ac.uk.

Classifications MeSH