Multiomics Topic Modeling for Breast Cancer Classification.

chr14q32 miRNA expression regulation miRNAs multiomics stochastic block modeling topic modeling

Journal

Cancers
ISSN: 2072-6694
Titre abrégé: Cancers (Basel)
Pays: Switzerland
ID NLM: 101526829

Informations de publication

Date de publication:
23 Feb 2022
Historique:
received: 11 02 2022
accepted: 18 02 2022
entrez: 10 3 2022
pubmed: 11 3 2022
medline: 11 3 2022
Statut: epublish

Résumé

The integration of transcriptional data with other layers of information, such as the post-transcriptional regulation mediated by microRNAs, can be crucial to identify the driver genes and the subtypes of complex and heterogeneous diseases such as cancer. This paper presents an approach based on topic modeling to accomplish this integration task. More specifically, we show how an algorithm based on a hierarchical version of stochastic block modeling can be naturally extended to integrate any combination of 'omics data. We test this approach on breast cancer samples from the TCGA database, integrating data on messenger RNA, microRNAs, and copy number variations. We show that the inclusion of the microRNA layer significantly improves the accuracy of subtype classification. Moreover, some of the hidden structures or "topics" that the algorithm extracts actually correspond to genes and microRNAs involved in breast cancer development and are associated to the survival probability.

Identifiants

pubmed: 35267458
pii: cancers14051150
doi: 10.3390/cancers14051150
pmc: PMC8909787
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : Italian Ministry of Education, University and Research (MIUR) (L.232/2016)
ID : Departments of Excellence 2018--2022

Références

Methods. 2015 Mar;74:83-9
pubmed: 25484339
Nat Commun. 2016 Jun 16;7:11863
pubmed: 27306566
Oncogene. 2006 Apr 6;25(15):2273-84
pubmed: 16288205
NPJ Breast Cancer. 2021 Oct 12;7(1):136
pubmed: 34642313
Phys Rev E. 2020 Jul;102(1-1):012305
pubmed: 32794904
Lancet. 2017 Mar 18;389(10074):1134-1150
pubmed: 27865536
Front Genet. 2014 Oct 06;5:345
pubmed: 25339974
Sci Adv. 2018 Jul 18;4(7):eaaq1360
pubmed: 30035215
Cancers (Basel). 2021 Jun 15;13(12):
pubmed: 34203763
PLoS Comput Biol. 2019 Mar 5;15(3):e1006701
pubmed: 30835723
BMC Cancer. 2019 Aug 20;19(1):824
pubmed: 31429720
Springerplus. 2016 Sep 20;5(1):1608
pubmed: 27652181
PLoS Genet. 2017 Mar 23;13(3):e1006599
pubmed: 28333934
Proc Natl Acad Sci U S A. 2004 Mar 2;101(9):2999-3004
pubmed: 14973191
Oncogene. 2013 Sep 5;32(36):4294-303
pubmed: 23001043
Nucleic Acids Res. 2009 Jan;37(Database issue):D155-8
pubmed: 18957447
Nature. 2002 Jan 31;415(6871):530-6
pubmed: 11823860
Cancers (Basel). 2020 Dec 16;12(12):
pubmed: 33339347
BMC Cancer. 2014 Jul 26;14:538
pubmed: 25064703
Breast Cancer Res Treat. 2012 Aug;135(1):301-6
pubmed: 22752290
Front Mol Neurosci. 2014 Feb 04;7:2
pubmed: 24550773
Genome Biol. 2018 Feb 6;19(1):15
pubmed: 29409532
Cancer Res. 2008 May 1;68(9):3108-14
pubmed: 18451135
Nature. 2012 Oct 4;490(7418):61-70
pubmed: 23000897
Nature. 2000 Aug 17;406(6797):747-52
pubmed: 10963602
Cancer Res. 2008 Nov 15;68(22):9532-40
pubmed: 19010930
Phys Rev E. 2020 Sep;102(3-1):032309
pubmed: 33075933
BMC Bioinformatics. 2021 Nov 30;22(1):576
pubmed: 34847879
Epigenomics. 2019 Nov;11(14):1581-1599
pubmed: 31693439
Nat Rev Genet. 2016 Aug 16;17(9):507-22
pubmed: 27528417
Psychother Psychosom. 2014;83(2):89-105
pubmed: 24458030
Cancer Cell. 2009 Dec 8;16(6):533-46
pubmed: 19962671
Sci Rep. 2019 Jan 23;9(1):337
pubmed: 30674955
EMBO J. 2011 Sep 23;30(20):4299-308
pubmed: 21946562
Funct Integr Genomics. 2019 Jul;19(4):645-658
pubmed: 30859354
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
Phys Rev E. 2017 Jan;95(1-1):012317
pubmed: 28208453
Nucleic Acids Res. 2018 Jan 4;46(D1):D360-D370
pubmed: 29194489
Nucleic Acids Res. 2019 Mar 18;47(5):2205-2215
pubmed: 30657980
Proc Natl Acad Sci U S A. 2001 Sep 11;98(19):10869-74
pubmed: 11553815
Sci Rep. 2017 Oct 19;7(1):13534
pubmed: 29051564
Nucleic Acids Res. 2016 May 5;44(8):e71
pubmed: 26704973
Nat Genet. 2013 Oct;45(10):1113-20
pubmed: 24071849
Cancer Cell. 2018 Apr 9;33(4):690-705.e9
pubmed: 29622464
BMC Bioinformatics. 2008 Dec 29;9:559
pubmed: 19114008
Phys Rev Lett. 2003 Feb 28;90(8):088102
pubmed: 12633463
Int J Mol Sci. 2019 Jun 26;20(13):
pubmed: 31247897
Theranostics. 2015 Jul 13;5(10):1122-43
pubmed: 26199650
Mol Oncol. 2011 Feb;5(1):5-23
pubmed: 21147047
Sci Rep. 2015 Dec 07;5:17386
pubmed: 26639632
Phys Rev E Stat Nonlin Soft Matter Phys. 2014 Jan;89(1):012804
pubmed: 24580278
Front Biosci (Landmark Ed). 2017 Jun 1;22(10):1774-1791
pubmed: 28410145
Nature. 2012 Apr 18;486(7403):346-52
pubmed: 22522925

Auteurs

Filippo Valle (F)

Physics Department, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy.

Matteo Osella (M)

Physics Department, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy.

Michele Caselle (M)

Physics Department, University of Turin and INFN, via P. Giuria 1, 10125 Turin, Italy.

Classifications MeSH