Novel Consensus Architecture To Improve Performance of Large-Scale Multitask Deep Learning QSAR Models.
Journal
Journal of chemical information and modeling
ISSN: 1549-960X
Titre abrégé: J Chem Inf Model
Pays: United States
ID NLM: 101230060
Informations de publication
Date de publication:
25 11 2019
25 11 2019
Historique:
pubmed:
5
10
2019
medline:
8
10
2020
entrez:
5
10
2019
Statut:
ppublish
Résumé
Advances in the development of high-throughput screening and automated chemistry have rapidly accelerated the production of chemical and biological data, much of them freely accessible through literature aggregator services such as ChEMBL and PubChem. Here, we explore how to use this comprehensive mapping of chemical biology space to support the development of large-scale quantitative structure-activity relationship (QSAR) models. We propose a new deep learning consensus architecture (DLCA) that combines consensus and multitask deep learning approaches together to generate large-scale QSAR models. This method improves knowledge transfer across different target/assays while also integrating contributions from models based on different descriptors. The proposed approach was validated and compared with proteochemometrics, multitask deep learning, and Random Forest methods paired with various descriptors types. DLCA models demonstrated improved prediction accuracy for both regression and classification tasks. The best models together with their modeling sets are provided through publicly available web services at https://predictor.ncats.io .
Identifiants
pubmed: 31584270
doi: 10.1021/acs.jcim.9b00526
pmc: PMC8381874
mid: NIHMS1729328
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
4613-4624Subventions
Organisme : Intramural NIH HHS
ID : ZIA TR000058
Pays : United States
Références
J Chem Inf Model. 2015 Feb 23;55(2):263-74
pubmed: 25635324
J Chem Inf Model. 2017 Nov 27;57(11):2672-2685
pubmed: 29019671
Bioinformatics. 2016 Jan 1;32(1):85-95
pubmed: 26351271
J Cheminform. 2017 Aug 14;9(1):45
pubmed: 29086168
Integr Biol (Camb). 2014 Nov;6(11):1023-33
pubmed: 25255469
J Chem Inf Model. 2006 Sep-Oct;46(5):1924-36
pubmed: 16995723
Sci Adv. 2018 Jul 25;4(7):eaap7885
pubmed: 30050984
Mol Inform. 2011 Mar 14;30(2-3):241-50
pubmed: 27466777
J Chem Inf Model. 2010 Dec 27;50(12):2094-111
pubmed: 21033656
J Chem Inf Model. 2010 May 24;50(5):742-54
pubmed: 20426451
J Chem Inf Model. 2017 Aug 28;57(8):2077-2088
pubmed: 28651433
Nucleic Acids Res. 2014 Jan;42(Database issue):D18-25
pubmed: 24271396
J Chem Inf Model. 2009 Jan;49(1):133-44
pubmed: 19125628
J Chem Inf Model. 2008 Apr;48(4):766-84
pubmed: 18311912
Environ Health Perspect. 2016 Jul;124(7):1023-33
pubmed: 26908244
Chem Sci. 2018 Jun 6;9(24):5441-5451
pubmed: 30155234
J Chem Inf Model. 2017 Oct 23;57(10):2490-2504
pubmed: 28872869
J Chem Inf Model. 2008 Sep;48(9):1733-46
pubmed: 18729318
Biomolecules. 2018 Oct 30;8(4):
pubmed: 30380783
J Chem Inf Model. 2013 Aug 26;53(8):1957-66
pubmed: 23829430
J Med Chem. 2006 Mar 9;49(5):1536-48
pubmed: 16509572
Mol Pharm. 2016 Feb 1;13(2):545-56
pubmed: 26669717
Chem Sci. 2017 Oct 31;9(2):513-530
pubmed: 29629118
Mol Biosyst. 2012 Sep;8(9):2373-84
pubmed: 22751809
SAR QSAR Environ Res. 2015;26(10):783-93
pubmed: 26305108
Brief Bioinform. 2017 Nov 1;18(6):1057-1070
pubmed: 27542402
Nucleic Acids Res. 2012 Jan;40(Database issue):D1100-7
pubmed: 21948594
J Chem Inf Model. 2017 Aug 28;57(8):2068-2076
pubmed: 28692267
J Comput Aided Mol Des. 2016 Aug;30(8):595-608
pubmed: 27558503
J Chem Inf Model. 2015 Jun 22;55(6):1246-60
pubmed: 25995041
J Mol Graph Model. 2002 Jan;20(4):269-76
pubmed: 11858635
J Chem Inf Model. 2014 Mar 24;54(3):705-12
pubmed: 24524735