Towards reproducible computational drug discovery.

Bioinformatics Cheminformatics Data science Data sharing Drug design Drug discovery Open data Open science Reproducibility Reproducible research

Journal

Journal of cheminformatics
ISSN: 1758-2946
Titre abrégé: J Cheminform
Pays: England
ID NLM: 101516718

Informations de publication

Date de publication:
28 Jan 2020
Historique:
received: 17 07 2019
accepted: 02 01 2020
entrez: 12 1 2021
pubmed: 13 1 2021
medline: 13 1 2021
Statut: epublish

Résumé

The reproducibility of experiments has been a long standing impediment for further scientific progress. Computational methods have been instrumental in drug discovery efforts owing to its multifaceted utilization for data collection, pre-processing, analysis and inference. This article provides an in-depth coverage on the reproducibility of computational drug discovery. This review explores the following topics: (1) the current state-of-the-art on reproducible research, (2) research documentation (e.g. electronic laboratory notebook, Jupyter notebook, etc.), (3) science of reproducible research (i.e. comparison and contrast with related concepts as replicability, reusability and reliability), (4) model development in computational drug discovery, (5) computational issues on model development and deployment, (6) use case scenarios for streamlining the computational drug discovery protocol. In computational disciplines, it has become common practice to share data and programming codes used for numerical calculations as to not only facilitate reproducibility, but also to foster collaborations (i.e. to drive the project further by introducing new ideas, growing the data, augmenting the code, etc.). It is therefore inevitable that the field of computational drug design would adopt an open approach towards the collection, curation and sharing of data/code.

Identifiants

pubmed: 33430992
doi: 10.1186/s13321-020-0408-x
pii: 10.1186/s13321-020-0408-x
pmc: PMC6988305
doi:

Types de publication

Journal Article Review

Langues

eng

Pagination

9

Subventions

Organisme : Thailand Research Fund
ID : RSA6280075

Références

J Chem Inf Model. 2012 Nov 26;52(11):2864-75
pubmed: 23088335
J Cheminform. 2015 Sep 15;7:46
pubmed: 26379782
Curr Pharm Des. 2012;18(9):1266-91
pubmed: 22316153
Nucleic Acids Res. 2019 Jan 8;47(D1):D1102-D1109
pubmed: 30371825
Curr Top Med Chem. 2002 Dec;2(12):1321-32
pubmed: 12470283
J Mol Graph Model. 2000 Aug-Oct;18(4-5):464-77
pubmed: 11143563
PLoS Comput Biol. 2018 Jun 15;14(6):e1006220
pubmed: 29906293
Nature. 2009 Nov 12;462(7270):175-81
pubmed: 19881490
Bioinformatics. 2012 Jun 1;28(11):1525-6
pubmed: 22500002
PLoS One. 2013 Jul 23;8(7):e67332
pubmed: 23935830
Comb Chem High Throughput Screen. 2011 Dec;14(10):861-71
pubmed: 21843145
Bioinformatics. 2010 Dec 1;26(23):3000-1
pubmed: 20889496
Environ Health Perspect. 2003 Aug;111(10):1391-401
pubmed: 12896862
Database (Oxford). 2015 Sep 16;2015:
pubmed: 26384374
J Am Med Inform Assoc. 2014 Nov-Dec;21(6):957-8
pubmed: 25008006
Nature. 2008 Sep 18;455(7211):273
pubmed: 18800097
Mini Rev Med Chem. 2003 Dec;3(8):861-75
pubmed: 14529504
Mol Inform. 2014 Dec;33(11-12):749-56
pubmed: 27485421
J Chem Inf Model. 2015 Jan 26;55(1):19-25
pubmed: 25493610
Prog Biophys Mol Biol. 2015 Jan;117(1):99-106
pubmed: 25433232
J Natl Cancer Inst. 2004 Mar 17;96(6):434-42
pubmed: 15026468
Elife. 2017 Sep 05;6:
pubmed: 28873054
Nat Biotechnol. 2007 Feb;25(2):197-206
pubmed: 17287757
J Chem Inf Model. 2006 May-Jun;46(3):1060-8
pubmed: 16711725
Chem Cent J. 2007 Jul 05;1:19
pubmed: 17880750
IEEE Trans Biomed Eng. 2016 Oct;63(10):1999-2006
pubmed: 27295645
Gigascience. 2017 Aug 1;6(8):1-7
pubmed: 28854616
Nat Biotechnol. 2004 Oct;22(10):1253-9
pubmed: 15470465
Brief Bioinform. 2015 Sep;16(5):901-3
pubmed: 25433467
CPT Pharmacometrics Syst Pharmacol. 2019 Apr;8(4):205-210
pubmed: 30697975
PLoS Comput Biol. 2019 Apr 8;15(4):e1006650
pubmed: 30958812
Bioinformatics. 2014 Jan 15;30(2):298-300
pubmed: 24262214
J Comput Chem. 2008 May;29(7):1019-31
pubmed: 18072177
Eur J Hum Genet. 2015 Oct;23(10):1271-8
pubmed: 25248396
IEEE Trans Biomed Eng. 2016 Oct;63(10):2015-20
pubmed: 27429432
Med Chem. 2005 Nov;1(6):649-55
pubmed: 16787349
PLoS Biol. 2015 Mar 13;13(3):e1002106
pubmed: 25768323
J Health Econ. 2016 May;47:20-33
pubmed: 26928437
J Chem Inf Model. 2018 Mar 26;58(3):673-682
pubmed: 29425037
Nature. 2000 Jun 15;405(6788):847-56
pubmed: 10866211
Nature. 2016 Apr 28;532(7600):459-64
pubmed: 27074502
Bioinformatics. 2007 Mar 15;23(6):769-70
pubmed: 17237072
Bioinformatics. 2006 Jul 15;22(14):1710-6
pubmed: 16632493
Infect Immun. 2010 Dec;78(12):4972-5
pubmed: 20876290
Curr Top Med Chem. 2014;14(3):294-303
pubmed: 24283973
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D689-91
pubmed: 16381960
Nat Biotechnol. 2010 Sep;28(9):935-42
pubmed: 20829833
Nucleic Acids Res. 2014 Jan;42(Database issue):D1091-7
pubmed: 24203711
J Exp Psychol Gen. 2014 Apr;143(2):534-47
pubmed: 23855496
Gigascience. 2015 Oct 15;4:47
pubmed: 26473029
PLoS Comput Biol. 2009 Jul;5(7):e1000424
pubmed: 19649301
J Chem Inf Comput Sci. 1999 Sep-Oct;39(5):897-902
pubmed: 10529988
J Cheminform. 2017 Mar 6;9:15
pubmed: 28316653
BMC Bioinformatics. 2017 Jul 12;18(1):337
pubmed: 28701218
Bioinformatics. 2018 Feb 1;34(3):514-515
pubmed: 28968637
J Cheminform. 2017 Mar 7;9:17
pubmed: 28316655
Bioinformatics. 2012 Oct 1;28(19):2520-2
pubmed: 22908215
J Cheminform. 2016 Aug 10;8:39
pubmed: 27516811
Sci Data. 2016 Mar 15;3:160018
pubmed: 26978244
Curr Protein Pept Sci. 2007 Aug;8(4):381-411
pubmed: 17696871
BMC Bioinformatics. 2018 Oct 15;19(Suppl 10):349
pubmed: 30367595
Bioinformatics. 2017 Aug 15;33(16):2580-2582
pubmed: 28379341
J Comput Aided Mol Des. 2011 Jun;25(6):533-54
pubmed: 21660515
J Cheminform. 2015 Dec 09;7:60
pubmed: 26664458
J Cheminform. 2010 Jun 30;2(1):5
pubmed: 20591161
Genome Res. 2005 Oct;15(10):1451-5
pubmed: 16169926
J Biomed Semantics. 2015 Mar 21;6:10
pubmed: 25815161
Nature. 2005 Jul 7;436(7047):20-1
pubmed: 16001034
J Mol Graph Model. 2008 Jun;26(8):1315-26
pubmed: 18328754
Assay Drug Dev Technol. 2010 Apr;8(2):152-74
pubmed: 20070233
Nature. 2016 May 25;533(7604):452-4
pubmed: 27225100
PLoS One. 2017 May 11;12(5):e0177459
pubmed: 28494014
Am J Epidemiol. 2008 Aug 15;168(4):374-83; discussion 384-90
pubmed: 18611956
Arch Toxicol. 2004 Oct;78(10):549-64
pubmed: 15170526
J Comput Aided Mol Des. 2013 Apr;27(4):321-36
pubmed: 23615761
Bioinformation. 2010 Mar 31;4(9):417-20
pubmed: 20975892
Nucleic Acids Res. 2017 Jan 4;45(D1):D380-D388
pubmed: 27924025
J Med Chem. 2012 Aug 9;55(15):6832-48
pubmed: 22780961
J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):493-500
pubmed: 12653513
Elife. 2015 Dec 23;4:
pubmed: 26701910
Trends Pharmacol Sci. 2010 Mar;31(3):115-23
pubmed: 20117850
Chem Res Toxicol. 2005 Sep;18(9):1420-6
pubmed: 16167834
J Chem Inf Model. 2007 Nov-Dec;47(6):2345-57
pubmed: 17880194
J Cheminform. 2016 Jun 21;8:34
pubmed: 27330567
J Pharmacol Pharmacother. 2011 Apr;2(2):138-9
pubmed: 21772785
Mol Inform. 2011 Dec;30(11-12):960-72
pubmed: 27468151
Expert Opin Drug Discov. 2017 Aug;12(8):757-767
pubmed: 28602100
J Cheminform. 2017 May 4;9(1):27
pubmed: 29086046
PLoS Comput Biol. 2014 Apr 10;10(4):e1003537
pubmed: 24722319
Science. 2011 Dec 2;334(6060):1226-7
pubmed: 22144613
Nat Biotechnol. 1997 Aug;15(8):799-800
pubmed: 9255798
Genome Biol. 2010;11(8):R86
pubmed: 20738864
Chem Res Toxicol. 2007 Jul;20(7):1019-30
pubmed: 17555332
F1000Res. 2016 Jan 04;5:2
pubmed: 26835004
J Chem Inf Comput Sci. 2001 May-Jun;41(3):607-13
pubmed: 11410036
Database (Oxford). 2013 Jun 21;2013:bat044
pubmed: 23794735
SAR QSAR Environ Res. 2011 Mar;22(1-2):89-106
pubmed: 21391143
CPT Pharmacometrics Syst Pharmacol. 2015 Jun;4(6):316-9
pubmed: 26225259
Front Neurosci. 2013 Feb 06;7:9
pubmed: 23390412
Brief Bioinform. 2019 May 21;20(3):1004-1010
pubmed: 29228189
J Comput Aided Mol Des. 2016 Mar;30(3):237-49
pubmed: 26897747
J Chem Inf Comput Sci. 2001 May-Jun;41(3):663-70
pubmed: 11410044
J Androl. 2006 May-Jun;27(3):313-5
pubmed: 16474014
J Cheminform. 2019 Jan 10;11(1):4
pubmed: 30631996
Nature. 2014 Mar 27;507(7493):523-5
pubmed: 24678534
J Chem Inf Model. 2008 Oct;48(10):2081-94
pubmed: 18826208
J Cheminform. 2019 Apr 8;11(1):29
pubmed: 30963287
J Mol Biol. 2016 Feb 22;428(4):720-725
pubmed: 26410586
Cancer Biol Ther. 2009 Feb;8(3):233-5
pubmed: 19333009
Mol Inform. 2010 Jul 12;29(6-7):476-88
pubmed: 27463326
Proc (Bayl Univ Med Cent). 2000 Oct;13(4):421-3
pubmed: 16389357
BMC Res Notes. 2015 Nov 02;8:628
pubmed: 26526344
Nature. 2017 May 29;546(7656):173-174
pubmed: 28569835
J Comput Aided Mol Des. 2007 Jan-Mar;21(1-3):23-32
pubmed: 17253117
F1000Res. 2017 Jul 17;6:1136
pubmed: 28928948
Nucleic Acids Res. 2016 Jan 4;44(D1):D1045-53
pubmed: 26481362
Mol Divers. 2006 Aug;10(3):283-99
pubmed: 17031533
Methods Enzymol. 2003;374:461-91
pubmed: 14696385
Genome Biol. 2004;5(10):R80
pubmed: 15461798
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940
pubmed: 30398643
Nucleic Acids Res. 2019 Jul 2;47(W1):W225-W233
pubmed: 31131402
Bioinformatics. 2010 Nov 1;26(21):2778-9
pubmed: 20847218
Expert Opin Drug Discov. 2015 Apr;10(4):321-9
pubmed: 25693813
Curr Top Med Chem. 2012;12(18):1965-79
pubmed: 23110532
J Chem Inf Comput Sci. 2004 Sep-Oct;44(5):1526-39
pubmed: 15446810
Angew Chem Int Ed Engl. 2009;48(7):1198-229
pubmed: 19173328
BMC Bioinformatics. 2011;12 Suppl 15:S2
pubmed: 22373175
PLoS Comput Biol. 2017 Apr 13;13(4):e1005412
pubmed: 28407023
Eur J Med Chem. 2006 Feb;41(2):166-75
pubmed: 16368163
Br J Pharmacol. 2011 Mar;162(6):1239-49
pubmed: 21091654
Bioinform Biol Insights. 2015 Sep 10;9:125-8
pubmed: 26401099
Nat Rev Genet. 2010 Sep;11(9):647-57
pubmed: 20717155
Bioinformatics. 2012 May 1;28(9):1278-9
pubmed: 22437851
Forensic Sci Int Genet. 2015 Mar;15:2-7
pubmed: 25457631
Nucleic Acids Res. 2014 Jul;42(Web Server issue):W252-8
pubmed: 24782522
Nat Biotechnol. 2017 Apr 11;35(4):316-319
pubmed: 28398311
J Cheminform. 2011 Jul 28;3:28
pubmed: 21798025
Green Chem. 2016 Aug 21;18(16):4348-4360
pubmed: 28503093
Expert Opin Drug Discov. 2010 Jul;5(7):633-54
pubmed: 22823204
Bioinformatics. 2009 Jul 1;25(13):1709-10
pubmed: 19429600
PLoS Comput Biol. 2010 Jun 24;6(6):e1000809
pubmed: 20589079
Mol Inform. 2015 Feb;34(2-3):171-8
pubmed: 27490039
Mol Inform. 2015 May;34(5):276-83
pubmed: 27490273
Proteins. 1990;8(3):195-202
pubmed: 2281083
PLoS Biol. 2014 Jan;12(1):e1001745
pubmed: 24415924
ACS Omega. 2017 Jun 30;2(6):2805-2812
pubmed: 28691113
Open Med Chem J. 2017 Nov 30;11:212-221
pubmed: 29387275
Nucleic Acids Res. 2017 Jan 4;45(D1):D932-D939
pubmed: 27789690
Gigascience. 2016 Jul 11;5(1):30
pubmed: 27401684
J Cheminform. 2015 Aug 28;7:45
pubmed: 26322135
Biosystems. 2018 Sep;171:74-79
pubmed: 30053414
J Cheminform. 2014 May 14;6:25
pubmed: 24910716
J Cheminform. 2017 Jun 6;9(1):33
pubmed: 29086040
PLoS Comput Biol. 2013 Oct;9(10):e1003285
pubmed: 24204232
Methods Mol Biol. 2015;1260:119-47
pubmed: 25502379
Expert Opin Drug Metab Toxicol. 2012 Nov;8(11):1435-46
pubmed: 22849616
J Chem Inf Model. 2017 Aug 28;57(8):1735-1740
pubmed: 28737911
Environ Mutagen. 1985;7(6):919-21
pubmed: 4065064
J Cheminform. 2017 Jun 14;9(1):40
pubmed: 29086066
Drug Discov Today. 2006 Aug;11(15-16):700-7
pubmed: 16846797
ACS Cent Sci. 2017 Apr 26;3(4):283-293
pubmed: 28470045
BMC Bioinformatics. 2010 Mar 29;11:159
pubmed: 20346188
SIAM J Sci Comput. 2016;38(3):C179-C202
pubmed: 28190948
Curr Opin Chem Biol. 2007 Apr;11(2):182-7
pubmed: 17307018
J Comput Aided Mol Des. 2012 Jul;26(7):801-4
pubmed: 22644661
Curr Top Med Chem. 2016;16(30):3646-3656
pubmed: 27334200
Nat Biotechnol. 2010 Nov;28(11):1181-5
pubmed: 21057489
J Biomol Screen. 2006 Oct;11(7):864-9
pubmed: 16973922
Nat Biotechnol. 2008 Aug;26(8):889-96
pubmed: 18688244
Pharmacol Rev. 2013 Dec 31;66(1):334-95
pubmed: 24381236
J Med Chem. 1979 Oct;22(10):1238-44
pubmed: 513071
J Chem Inf Model. 2017 Feb 27;57(2):115-121
pubmed: 28125221
Bioinformatics. 2003 Mar 1;19(4):524-31
pubmed: 12611808
Saudi Pharm J. 2015 Jul;23(3):223-9
pubmed: 26106269
Nat Rev Drug Discov. 2016 Jun 30;15(7):447
pubmed: 27357013
J Med Chem. 2006 Oct 5;49(20):5912-31
pubmed: 17004707
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082
pubmed: 29126136
Nature. 2017 Oct 18;550(7677):451-453
pubmed: 29072289
PLoS Comput Biol. 2019 Jul 25;15(7):e1007007
pubmed: 31344036
Nat Biotechnol. 2004 Feb;22(2):177-83
pubmed: 14755292
J Mol Graph Model. 2012 Sep;38:360-2
pubmed: 23085175
J Med Chem. 2007 May 17;50(10):2385-90
pubmed: 17447748
J Cheminform. 2018 Jan 16;10(1):1
pubmed: 29340790
J Comput Aided Mol Des. 2001 May;15(5):411-28
pubmed: 11394736
J Cheminform. 2016 Nov 24;8:67
pubmed: 27942268
J Comput Aided Mol Des. 2002 Aug-Sep;16(8-9):653-81
pubmed: 12602956
Nature. 2009 Sep 10;461(7261):168-70
pubmed: 19741685
J Lab Autom. 2011 Feb;16(1):90-8
pubmed: 21609689
PLoS Comput Biol. 2015 Sep 10;11(9):e1004385
pubmed: 26356732
Mutat Res. 1991 May;257(3):229-306
pubmed: 1707500
Curr Protoc Mol Biol. 2010 Jan;Chapter 19:Unit 19.10.1-21
pubmed: 20069535
Curr Top Med Chem. 2010;10(1):46-54
pubmed: 19929827
Future Med Chem. 2016 Oct;8(15):1825-1839
pubmed: 27643715
J Cheminform. 2015 Jun 25;7:32
pubmed: 26110025

Auteurs

Nalini Schaduangrat (N)

Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, 10700, Bangkok, Thailand.

Samuel Lampa (S)

Department of Pharmaceutical Biosciences, Uppsala University, 751 24, Uppsala, Sweden.

Saw Simeon (S)

Interdisciplinary Graduate Program in Bioscience, Faculty of Science, Kasetsart University, 10900, Bangkok, Thailand.

Matthew Paul Gleeson (MP)

Department of Biomedical Engineering, Faculty of Engineering, King Mongkut's Institute of Technology Ladkrabang, 10520, Bangkok, Thailand. paul.gl@kmitl.ac.th.

Ola Spjuth (O)

Department of Pharmaceutical Biosciences, Uppsala University, 751 24, Uppsala, Sweden. ola.spjuth@farmbio.uu.se.

Chanin Nantasenamat (C)

Center of Data Mining and Biomedical Informatics, Faculty of Medical Technology, Mahidol University, 10700, Bangkok, Thailand. chanin.nan@mahidol.edu.

Classifications MeSH