Benchmarking network propagation methods for disease gene identification.


Journal

PLoS computational biology
ISSN: 1553-7358
Abbreviated title: PLoS Comput Biol
Country: United States
NLM ID: 101238922

Publication information

Publication date:
09 2019
History:
received: 24 01 2019
accepted: 16 07 2019
revised: 13 09 2019
pubmed: 4 9 2019
medline: 21 1 2020
entrez: 4 9 2019
Status: epublish

Abstract

In-silico identification of potential target genes for disease is an essential aspect of drug target discovery. Recent studies suggest that successful targets can be found by leveraging genetic, genomic and protein interaction information. Here, we systematically tested the ability of 12 varied algorithms, based on network propagation, to identify genes that have been targeted by any drug, on gene-disease data from 22 common non-cancerous diseases in OpenTargets. We considered two biological networks and six performance metrics, and compared two types of input gene-disease association scores. The impact of the design factors on performance was quantified through additive explanatory models. Standard cross-validation led to over-optimistic performance estimates due to the presence of protein complexes. To obtain realistic estimates, we introduced two novel protein complex-aware cross-validation schemes. When seeding biological networks with known drug targets, machine learning and diffusion-based methods found around 2-4 true targets within the top 20 suggestions. Seeding the networks with genes associated with disease by genetics decreased performance to below 1 true hit on average. The use of a larger network, although noisier, improved overall performance. We conclude that diffusion-based prioritisers and machine learning applied to diffusion-based features are suited for drug discovery in practice and improve over simpler neighbour-voting methods. We also demonstrate the large impact of choosing an adequate validation strategy and of the definition of seed disease genes.
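As a rough illustration of what seeding a network and ranking genes by propagated score can look like, the sketch below runs a random walk with restart on a toy network and counts true targets among the top-ranked non-seed genes. Random walk with restart is used here only as a generic example of a diffusion-based prioritiser; it is not necessarily one of the 12 benchmarked algorithms. The function names, the restart value, the toy network and the target labels are all assumptions made for illustration.

```python
import numpy as np


def propagate(adjacency, seeds, restart=0.5, tol=1e-10, max_iter=1000):
    """Random walk with restart on an undirected network (illustrative only).

    adjacency: symmetric (n, n) array of edge weights.
    seeds: length-n array, non-zero for seed genes.
    Returns a length-n score vector that sums to 1.
    """
    adjacency = np.asarray(adjacency, dtype=float)
    degree = adjacency.sum(axis=0)
    degree[degree == 0] = 1.0          # avoid division by zero for isolated nodes
    transition = adjacency / degree    # divide each column by its node's degree
    p0 = np.asarray(seeds, dtype=float)
    p0 = p0 / p0.sum()                 # restart distribution over the seeds
    p = p0.copy()
    for _ in range(max_iter):
        p_next = (1 - restart) * (transition @ p) + restart * p0
        if np.abs(p_next - p).sum() < tol:
            return p_next
        p = p_next
    return p


def hits_in_top_k(scores, is_true_target, exclude, k):
    """Count true targets among the k best-scoring genes, skipping excluded ones."""
    ranked = [i for i in np.argsort(-scores) if not exclude[i]]
    return int(sum(bool(is_true_target[i]) for i in ranked[:k]))


# Hypothetical toy network: a path 0-1-2-3-4-5, seeded at gene 0.
n = 6
adjacency = np.zeros((n, n))
for i, j in [(0, 1), (1, 2), (2, 3), (3, 4), (4, 5)]:
    adjacency[i, j] = adjacency[j, i] = 1.0

seeds = np.array([1, 0, 0, 0, 0, 0])
true_targets = np.array([False, True, False, False, True, False])  # made-up labels

scores = propagate(adjacency, seeds)
top2_hits = hits_in_top_k(scores, true_targets, exclude=seeds.astype(bool), k=2)
print(top2_hits)
```

In a benchmark of this kind, the same hit count would be computed on held-out genes only, with the held-out folds chosen so that members of one protein complex do not end up split between training and validation.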

Identifiers

pubmed: 31479437
doi: 10.1371/journal.pcbi.1007276
pii: PCOMPBIOL-D-19-00113
pmc: PMC6743778

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

e1007276

Conflict of interest statement

I have read the journal’s policy and the authors of this manuscript have the following competing interests: SJB, DRW, AG, and BHD are paid employees and shareholders of GlaxoSmithKline. The commercial affiliation of SJB, DRW, AG, and BHD does not alter our adherence to PLOS policies.

Authors

Sergio Picart-Armada (S)

B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, CIBER-BBN, Barcelona, Spain.
Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain.
Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Spain.

Steven J Barrett (SJ)

Research Statistics, GSK, Stevenage, United Kingdom.

David R Willé (DR)

Research Statistics, GSK, Stevenage, United Kingdom.

Alexandre Perera-Lluna (A)

B2SLab, Departament d'Enginyeria de Sistemes, Automàtica i Informàtica Industrial, Universitat Politècnica de Catalunya, CIBER-BBN, Barcelona, Spain.
Networking Biomedical Research Centre in the subject area of Bioengineering, Biomaterials and Nanomedicine (CIBER-BBN), Madrid, Spain.
Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, Esplugues de Llobregat, Spain.

Alex Gutteridge (A)

Computational Biology and Statistics, GSK, Stevenage, United Kingdom.

Benoit H Dessailly (BH)

GSK Vaccines, Rixensart, Belgium.
