Speos: an ensemble graph representation learning framework to predict core gene candidates for complex diseases.
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
08 11 2023
08 11 2023
Historique:
received:
27
03
2023
accepted:
27
10
2023
medline:
9
11
2023
pubmed:
8
11
2023
entrez:
8
11
2023
Statut:
epublish
Résumé
Understanding phenotype-to-genotype relationships is a grand challenge of 21st century biology with translational implications. The recently proposed "omnigenic" model postulates that effects of genetic variation on traits are mediated by core-genes and -proteins whose activities mechanistically influence the phenotype, whereas peripheral genes encode a regulatory network that indirectly affects phenotypes via core gene products. Here, we develop a positive-unlabeled graph representation-learning ensemble-approach based on a nested cross-validation to predict core-like genes for diverse diseases using Mendelian disorder genes for training. Employing mouse knockout phenotypes for external validations, we demonstrate that core-like genes display several key properties of core genes: Mouse knockouts of genes corresponding to our most confident predictions give rise to relevant mouse phenotypes at rates on par with the Mendelian disorder genes, and all candidates exhibit core gene properties like transcriptional deregulation in disease and loss-of-function intolerance. Moreover, as predicted for core genes, our candidates are enriched for drug targets and druggable proteins. In contrast to Mendelian disorder genes the new core-like genes are enriched for druggable yet untargeted gene products, which are therefore attractive targets for drug development. Interpretation of the underlying deep learning model suggests plausible explanations for our core gene predictions in form of molecular mechanisms and physical interactions. Our results demonstrate the potential of graph representation learning for the interpretation of biological complexity and pave the way for studying core gene properties and future drug development.
Identifiants
pubmed: 37938585
doi: 10.1038/s41467-023-42975-z
pii: 10.1038/s41467-023-42975-z
pmc: PMC10632370
doi:
Substances chimiques
RNA
63231-63-0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
7206Informations de copyright
© 2023. The Author(s).
Références
Immunity. 2002 Mar;16(3):479-92
pubmed: 11911831
Nucleic Acids Res. 2014 Jan;42(Database issue):D358-63
pubmed: 24234451
Nat Genet. 2000 May;25(1):25-9
pubmed: 10802651
PLoS Comput Biol. 2015 Jul 09;11(7):e1004259
pubmed: 26158728
Nat Genet. 2008 Apr;40(4):395-402
pubmed: 18311140
Nucleic Acids Res. 2021 Jan 8;49(D1):D1311-D1320
pubmed: 33045747
Nat Genet. 2018 Sep;50(9):1318-1326
pubmed: 30104760
EMBO J. 1998 Feb 16;17(4):877-85
pubmed: 9463366
Curr Drug Targets. 2021;22(7):760-769
pubmed: 33475057
Mucosal Immunol. 2013 Sep;6(5):886-99
pubmed: 23250276
Nat Genet. 2021 Aug;53(8):1243-1249
pubmed: 34326547
Elife. 2021 Feb 15;10:
pubmed: 33587031
PLoS Comput Biol. 2015 Apr 17;11(4):e1004219
pubmed: 25885710
Cardiovasc Res. 2020 Mar 1;116(3):698-707
pubmed: 31135876
PLoS One. 2014 May 09;9(5):e97079
pubmed: 24816822
Cell. 2021 May 27;184(11):3022-3040.e28
pubmed: 33961781
Am J Hum Genet. 2018 Oct 4;103(4):535-552
pubmed: 30290150
Cell. 2018 Jun 14;173(7):1573-1580
pubmed: 29906445
Science. 2019 Dec 20;366(6472):
pubmed: 31857451
Nucleic Acids Res. 2019 Jan 8;47(D1):D801-D806
pubmed: 30407599
Nat Methods. 2017 Mar;14(3):297-301
pubmed: 28099430
J Biol Chem. 2021 Jan-Jun;296:100630
pubmed: 33823154
Nature. 2011 Feb 10;470(7333):163-5
pubmed: 21307913
Interdiscip Sci. 2022 Sep;14(3):775-785
pubmed: 35536538
J Immunol. 2003 Jun 1;170(11):5571-7
pubmed: 12759435
Saudi J Gastroenterol. 2018 Jul-Aug;24(4):201-210
pubmed: 29873318
Sci Transl Med. 2017 Mar 29;9(383):
pubmed: 28356508
Comput Biol Chem. 2015 Aug;57:21-8
pubmed: 25736609
Cell. 2016 Dec 15;167(7):1853-1866.e17
pubmed: 27984732
Nucleic Acids Res. 2015 Jan;43(Database issue):D789-98
pubmed: 25428349
J Crohns Colitis. 2014 Oct;8(10):1315-26
pubmed: 24835165
Patterns (N Y). 2022 Dec 06;4(1):100651
pubmed: 36699743
J Mach Learn Res. 2019;20:
pubmed: 34335110
PLoS Genet. 2018 Sep 10;14(9):e1007458
pubmed: 30199539
Nat Commun. 2021 May 14;12(1):2851
pubmed: 33990562
Nucleic Acids Res. 2018 Jan 4;46(D1):D1068-D1073
pubmed: 29156001
Expert Rev Clin Immunol. 2022 Jun;18(6):551-555
pubmed: 35507314
Int J Immunogenet. 2016 Feb;43(1):18-24
pubmed: 26566691
Bioinformatics. 2019 Feb 1;35(3):497-505
pubmed: 30020411
J Int Med Res. 2020 Oct;48(10):300060520961675
pubmed: 33026276
Elife. 2017 Sep 22;6:
pubmed: 28936969
Nature. 2016 Aug 17;536(7616):285-91
pubmed: 27535533
Nat Commun. 2020 Jan 31;11(1):655
pubmed: 32005800
Nature. 2012 Sep 6;489(7414):91-100
pubmed: 22955619
PLoS One. 2015 May 01;10(5):e0125138
pubmed: 25933025
PLoS Genet. 2015 Feb 13;11(2):e1004967
pubmed: 25679399
Front Immunol. 2021 Feb 11;12:631472
pubmed: 33643317
Nature. 2017 Oct 11;550(7675):204-213
pubmed: 29022597
Structure. 2011 Feb 9;19(2):162-71
pubmed: 21300286
Science. 2008 Oct 3;322(5898):104-10
pubmed: 18719252
Nucleic Acids Res. 2021 Jan 8;49(D1):D97-D103
pubmed: 33151298
Nat Commun. 2023 Nov 8;14(1):7206
pubmed: 37938585
Nucleic Acids Res. 2021 Jan 8;49(D1):D325-D334
pubmed: 33290552
Cell Syst. 2018 Apr 25;6(4):484-495.e5
pubmed: 29605183
Nat Rev Methods Primers. 2022;2(1):
pubmed: 37214176
PLoS Comput Biol. 2010 Jan 15;6(1):e1000641
pubmed: 20090828
Database (Oxford). 2021 Feb 18;2021:
pubmed: 33599246
Am J Hum Genet. 2019 Sep 5;105(3):456-476
pubmed: 31402091
Cell. 2017 Jun 15;169(7):1177-1186
pubmed: 28622505
Proteomics. 2012 May;12(10):1499-518
pubmed: 22589225
Nat Genet. 2003 Mar;33 Suppl:228-37
pubmed: 12610532
Nat Methods. 2009 Jan;6(1):83-90
pubmed: 19060904
Cancer Res. 2004 Feb 1;64(3):1122-9
pubmed: 14871847
Biol Chem. 2014 Mar;395(3):335-46
pubmed: 24184991
Nat Commun. 2022 Jul 6;13(1):3895
pubmed: 35794153
Cell. 2019 May 2;177(4):1022-1034.e6
pubmed: 31051098
Nat Commun. 2019 Jun 3;10(1):2417
pubmed: 31160569
J Biomed Inform. 2021 Mar;115:103688
pubmed: 33545331
Ann Rheum Dis. 2020 Nov;79(11):1446-1452
pubmed: 32732242
Front Genet. 2021 Nov 25;12:779186
pubmed: 34899863
Nature. 2020 Apr;580(7803):402-408
pubmed: 32296183
Nat Genet. 2015 Dec;47(12):1385-92
pubmed: 26523775
J Biol Chem. 2005 May 6;280(18):18056-62
pubmed: 15760905
Nucleic Acids Res. 2021 Jan 8;49(D1):D605-D612
pubmed: 33237311
Nat Commun. 2022 Nov 10;13(1):6794
pubmed: 36357376
KDD. 2016 Aug;2016:855-864
pubmed: 27853626
Cell. 2019 Jan 10;176(1-2):377-390.e19
pubmed: 30612741
Nat Biomed Eng. 2022 Dec;6(12):1353-1369
pubmed: 36316368
Am J Hum Genet. 2008 Apr;82(4):949-58
pubmed: 18371930
Proteins. 2006 May 15;63(3):490-500
pubmed: 16450363
Bioinformatics. 2010 Apr 15;26(8):1057-63
pubmed: 20185403
Nucleic Acids Res. 2018 Jan 4;46(D1):D649-D655
pubmed: 29145629
Nat Biotechnol. 2018 Mar;36(3):272-281
pubmed: 29457794
Bioinformatics. 2010 May 1;26(9):1219-24
pubmed: 20215462
Nat Genet. 2021 Sep;53(9):1300-1310
pubmed: 34475573
BMC Bioinformatics. 2016 Sep 13;17(1):365
pubmed: 27618934
Nat Genet. 2021 Nov;53(11):1527-1533
pubmed: 34711957
BMC Bioinformatics. 2022 Jan 4;23(1):5
pubmed: 34983367
Am J Hum Genet. 2016 Jul 7;99(1):139-53
pubmed: 27346688
PLoS One. 2021 Oct 15;16(10):e0258626
pubmed: 34653225
Protein Sci. 2021 Jan;30(1):187-200
pubmed: 33070389
Cell. 2015 Apr 23;161(3):661-673
pubmed: 25910213
IEEE/ACM Trans Comput Biol Bioinform. 2022 Mar-Apr;19(2):819-829
pubmed: 32809944
Genome Biol. 2021 Jan 26;22(1):49
pubmed: 33499903
BMC Bioinformatics. 2006 Mar 20;7 Suppl 1:S2
pubmed: 16723005