Novel Consensus Architecture To Improve Performance of Large-Scale Multitask Deep Learning QSAR Models.


Journal

Journal of chemical information and modeling
ISSN: 1549-960X
Titre abrégé: J Chem Inf Model
Pays: United States
ID NLM: 101230060

Informations de publication

Date de publication:
25 11 2019
Historique:
pubmed: 5 10 2019
medline: 8 10 2020
entrez: 5 10 2019
Statut: ppublish

Résumé

Advances in the development of high-throughput screening and automated chemistry have rapidly accelerated the production of chemical and biological data, much of them freely accessible through literature aggregator services such as ChEMBL and PubChem. Here, we explore how to use this comprehensive mapping of chemical biology space to support the development of large-scale quantitative structure-activity relationship (QSAR) models. We propose a new deep learning consensus architecture (DLCA) that combines consensus and multitask deep learning approaches together to generate large-scale QSAR models. This method improves knowledge transfer across different target/assays while also integrating contributions from models based on different descriptors. The proposed approach was validated and compared with proteochemometrics, multitask deep learning, and Random Forest methods paired with various descriptors types. DLCA models demonstrated improved prediction accuracy for both regression and classification tasks. The best models together with their modeling sets are provided through publicly available web services at https://predictor.ncats.io .

Identifiants

pubmed: 31584270
doi: 10.1021/acs.jcim.9b00526
pmc: PMC8381874
mid: NIHMS1729328
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

4613-4624

Subventions

Organisme : Intramural NIH HHS
ID : ZIA TR000058
Pays : United States

Références

J Chem Inf Model. 2015 Feb 23;55(2):263-74
pubmed: 25635324
J Chem Inf Model. 2017 Nov 27;57(11):2672-2685
pubmed: 29019671
Bioinformatics. 2016 Jan 1;32(1):85-95
pubmed: 26351271
J Cheminform. 2017 Aug 14;9(1):45
pubmed: 29086168
Integr Biol (Camb). 2014 Nov;6(11):1023-33
pubmed: 25255469
J Chem Inf Model. 2006 Sep-Oct;46(5):1924-36
pubmed: 16995723
Sci Adv. 2018 Jul 25;4(7):eaap7885
pubmed: 30050984
Mol Inform. 2011 Mar 14;30(2-3):241-50
pubmed: 27466777
J Chem Inf Model. 2010 Dec 27;50(12):2094-111
pubmed: 21033656
J Chem Inf Model. 2010 May 24;50(5):742-54
pubmed: 20426451
J Chem Inf Model. 2017 Aug 28;57(8):2077-2088
pubmed: 28651433
Nucleic Acids Res. 2014 Jan;42(Database issue):D18-25
pubmed: 24271396
J Chem Inf Model. 2009 Jan;49(1):133-44
pubmed: 19125628
J Chem Inf Model. 2008 Apr;48(4):766-84
pubmed: 18311912
Environ Health Perspect. 2016 Jul;124(7):1023-33
pubmed: 26908244
Chem Sci. 2018 Jun 6;9(24):5441-5451
pubmed: 30155234
J Chem Inf Model. 2017 Oct 23;57(10):2490-2504
pubmed: 28872869
J Chem Inf Model. 2008 Sep;48(9):1733-46
pubmed: 18729318
Biomolecules. 2018 Oct 30;8(4):
pubmed: 30380783
J Chem Inf Model. 2013 Aug 26;53(8):1957-66
pubmed: 23829430
J Med Chem. 2006 Mar 9;49(5):1536-48
pubmed: 16509572
Mol Pharm. 2016 Feb 1;13(2):545-56
pubmed: 26669717
Chem Sci. 2017 Oct 31;9(2):513-530
pubmed: 29629118
Mol Biosyst. 2012 Sep;8(9):2373-84
pubmed: 22751809
SAR QSAR Environ Res. 2015;26(10):783-93
pubmed: 26305108
Brief Bioinform. 2017 Nov 1;18(6):1057-1070
pubmed: 27542402
Nucleic Acids Res. 2012 Jan;40(Database issue):D1100-7
pubmed: 21948594
J Chem Inf Model. 2017 Aug 28;57(8):2068-2076
pubmed: 28692267
J Comput Aided Mol Des. 2016 Aug;30(8):595-608
pubmed: 27558503
J Chem Inf Model. 2015 Jun 22;55(6):1246-60
pubmed: 25995041
J Mol Graph Model. 2002 Jan;20(4):269-76
pubmed: 11858635
J Chem Inf Model. 2014 Mar 24;54(3):705-12
pubmed: 24524735

Auteurs

Alexey V Zakharov (AV)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Tongan Zhao (T)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Dac-Trung Nguyen (DT)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Tyler Peryea (T)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Timothy Sheils (T)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Adam Yasgar (A)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Ruili Huang (R)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Noel Southall (N)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Anton Simeonov (A)

National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH