Drug-target interaction prediction with tree-ensemble learning and output space reconstruction.

Drug-target networks Interaction prediction Network reconstruction Tree-ensembles multi-output prediction

Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
07 Feb 2020
Historique:
received: 21 06 2019
accepted: 21 01 2020
entrez: 9 2 2020
pubmed: 9 2 2020
medline: 4 4 2020
Statut: epublish

Résumé

Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework. We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions. We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting.

Sections du résumé

BACKGROUND BACKGROUND
Computational prediction of drug-target interactions (DTI) is vital for drug discovery. The experimental identification of interactions between drugs and target proteins is very onerous. Modern technologies have mitigated the problem, leveraging the development of new drugs. However, drug development remains extremely expensive and time consuming. Therefore, in silico DTI predictions based on machine learning can alleviate the burdensome task of drug development. Many machine learning approaches have been proposed over the years for DTI prediction. Nevertheless, prediction accuracy and efficiency are persisting problems that still need to be tackled. Here, we propose a new learning method which addresses DTI prediction as a multi-output prediction task by learning ensembles of multi-output bi-clustering trees (eBICT) on reconstructed networks. In our setting, the nodes of a DTI network (drugs and proteins) are represented by features (background information). The interactions between the nodes of a DTI network are modeled as an interaction matrix and compose the output space in our problem. The proposed approach integrates background information from both drug and target protein spaces into the same global network framework.
RESULTS RESULTS
We performed an empirical evaluation, comparing the proposed approach to state of the art DTI prediction methods and demonstrated the effectiveness of the proposed approach in different prediction settings. For evaluation purposes, we used several benchmark datasets that represent drug-protein networks. We show that output space reconstruction can boost the predictive performance of tree-ensemble learning methods, yielding more accurate DTI predictions.
CONCLUSIONS CONCLUSIONS
We proposed a new DTI prediction method where bi-clustering trees are built on reconstructed networks. Building tree-ensemble learning models with output space reconstruction leads to superior prediction results, while preserving the advantages of tree-ensembles, such as scalability, interpretability and inductive setting.

Identifiants

pubmed: 32033537
doi: 10.1186/s12859-020-3379-z
pii: 10.1186/s12859-020-3379-z
pmc: PMC7006075
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

49

Références

Bioinformatics. 2015 Jun 15;31(12):i221-9
pubmed: 26072486
PLoS Comput Biol. 2007 Jun;3(6):e116
pubmed: 17604446
Health Policy. 2011 Apr;100(1):4-17
pubmed: 21256615
Methods. 2015 Jul 15;83:98-104
pubmed: 25957673
BMC Syst Biol. 2018 Dec 31;12(Suppl 9):136
pubmed: 30598094
Bioinformatics. 2011 Nov 1;27(21):3036-43
pubmed: 21893517
Annu Rev Pharmacol Toxicol. 2012;52:361-79
pubmed: 22017683
Bioinformatics. 2008 Jul 1;24(13):i232-40
pubmed: 18586719
BMC Bioinformatics. 2017 Oct 4;18(1):440
pubmed: 28978313
BMC Bioinformatics. 2019 Oct 28;20(1):525
pubmed: 31660848
Bioinformatics. 2013 Jan 15;29(2):238-45
pubmed: 23162055
Brief Bioinform. 2017 Mar 1;18(2):333-347
pubmed: 26944082
Bioinformatics. 2017 Nov 15;33(22):3610-3618
pubmed: 29036404
Brief Bioinform. 2015 Mar;16(2):325-37
pubmed: 24723570
Mol Biosyst. 2015 Aug;11(8):2116-25
pubmed: 26008881
BMC Bioinformatics. 2016 Dec 22;17(Suppl 19):509
pubmed: 28155697
Molecules. 2017 Nov 25;22(12):
pubmed: 29186828
Nat Rev Drug Discov. 2004 Aug;3(8):673-83
pubmed: 15286734
Curr Protein Pept Sci. 2018;19(5):488-497
pubmed: 27829347
J Med Syst. 2012 Aug;36(4):2431-48
pubmed: 21537851
Nature. 2009 Nov 12;462(7270):175-81
pubmed: 19881490
BMC Bioinformatics. 2007;8 Suppl 10:S8
pubmed: 18269702
Bioinformatics. 2016 Jun 15;32(12):i18-i27
pubmed: 27307615
Curr Drug Metab. 2019;20(3):194-202
pubmed: 30129407
J Proteome Res. 2017 Apr 7;16(4):1401-1409
pubmed: 28264154
Drug Discov Today. 2007 Jan;12(1-2):34-42
pubmed: 17198971
Drug Discov Today. 2012 Jan;17(1-2):10-22
pubmed: 21777691
Brief Bioinform. 2016 Jan;17(1):2-12
pubmed: 25832646
Nat Chem Biol. 2008 Nov;4(11):682-90
pubmed: 18936753
IEEE J Biomed Health Inform. 2017 Mar;21(2):561-572
pubmed: 26731781
Nat Rev Drug Discov. 2010 Mar;9(3):203-14
pubmed: 20168317
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):24-45
pubmed: 17048406
IEEE J Biomed Health Inform. 2019 May;23(3):1336-1345
pubmed: 29994408
Bioinformatics. 2016 Apr 1;32(7):1057-64
pubmed: 26614126
Brief Bioinform. 2019 Jul 19;20(4):1337-1357
pubmed: 29377981
Nature. 2012 Jun 10;486(7403):361-7
pubmed: 22722194
BMC Bioinformatics. 2015 Nov 04;16:365
pubmed: 26537615
J Am Chem Soc. 2014 Aug 20;136(33):11556-65
pubmed: 25061983
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):646-656
pubmed: 26890921
IEEE/ACM Trans Comput Biol Bioinform. 2019 Nov 04;:
pubmed: 31689203
Bioinformatics. 2017 Aug 1;33(15):2337-2344
pubmed: 28430977
Bioinformatics. 2007 Jul 1;23(13):i57-65
pubmed: 17646345
PLoS Comput Biol. 2016 Feb 12;12(2):e1004760
pubmed: 26872142
Bioinformatics. 2018 Apr 1;34(7):1164-1173
pubmed: 29186331
Brief Bioinform. 2016 Jul;17(4):696-712
pubmed: 26283676
PLoS One. 2015 Mar 04;10(3):e0118432
pubmed: 25738806

Auteurs

Konstantinos Pliakos (K)

KU Leuven, Campus KULAK, Faculty of Medicine, Kortrijk, Belgium. konstantinos.pliakos@kuleuven.be.
ITEC, imec research group at KU Leuven, Kortrijk, Belgium. konstantinos.pliakos@kuleuven.be.

Celine Vens (C)

KU Leuven, Campus KULAK, Faculty of Medicine, Kortrijk, Belgium.
ITEC, imec research group at KU Leuven, Kortrijk, Belgium.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Humans Meta-Analysis as Topic Sample Size Models, Statistical Computer Simulation

Classifications MeSH