Deep learning with evolutionary and genomic profiles for identifying cancer subtypes.


Journal

Journal of bioinformatics and computational biology
ISSN: 1757-6334
Titre abrégé: J Bioinform Comput Biol
Pays: Singapore
ID NLM: 101187344

Informations de publication

Date de publication:
06 2019
Historique:
entrez: 11 7 2019
pubmed: 11 7 2019
medline: 16 7 2020
Statut: ppublish

Résumé

Cancer subtype identification is an unmet need in precision diagnosis. Recently, evolutionary conservation has been indicated to contain informative signatures for functional significance in cancers. However, the importance of evolutionary conservation in distinguishing cancer subtypes remains largely unclear. Here, we identified the evolutionarily conserved genes (i.e. core genes) and observed that they are primarily involved in cellular pathways relevant to cell growth and metabolisms. By using these core genes, we developed two novel strategies, namely a feature-based strategy (FES) and an image-based strategy (IMS) by integrating their evolutionary and genomic profiles with the deep learning algorithm. In comparison with the FES using the random set and the strategy using the PAM50 classifier, the core gene set-based FES achieved a higher accuracy for identifying breast cancer subtypes. The IMS and FES using the core gene set yielded better performances than the other strategies, in terms of classifying both breast cancer subtypes and multiple cancer types. Moreover, the IMS is reproducible even using different gene expression data (i.e. RNA-seq and microarray). Comprehensive analysis of eight cancer types demonstrates that our evolutionary conservation-based models represent a valid and helpful approach for identifying cancer subtypes and the core gene set offers distinguishable clues of cancer subtypes.

Identifiants

pubmed: 31288637
doi: 10.1142/S0219720019400055
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1940005

Subventions

Organisme : NIAID NIH HHS
ID : R01 AI111965
Pays : United States

Auteurs

Chun-Yu Lin (CY)

* Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 6110011, Japan.

Peiying Ruan (P)

† NVIDIA AI Technology Center, NVIDIA Corporation Japan, Tokyo 1070052, Japan.

Ruiming Li (R)

* Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 6110011, Japan.

Jinn-Moon Yang (JM)

‡ Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsinchu 300, Taiwan.

Simon See (S)

§ NVIDIA AI Technology Center, NVIDIA Corporation Singapore, Singapore 138522, Singapore.

Jiangning Song (J)

¶ Monash Biomedicine Discovery Institute and Department of Biochemistry & Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.

Tatsuya Akutsu (T)

* Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 6110011, Japan.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH