Benefiting from big data in natural products: importance of preserving foundational skills and prioritizing data quality.
Journal
Natural product reports
ISSN: 1460-4752
Titre abrégé: Nat Prod Rep
Pays: England
ID NLM: 8502408
Informations de publication
Date de publication:
17 11 2021
17 11 2021
Historique:
pubmed:
5
11
2021
medline:
15
3
2022
entrez:
4
11
2021
Statut:
epublish
Résumé
Systematic, large-scale, studies at the genomic, metabolomic, and functional level have transformed the natural product sciences. Improvements in technology and reduction in cost for obtaining spectroscopic, chromatographic, and genomic data coupled with the creation of readily accessible curated and functionally annotated data sets have altered the practices of virtually all natural product research laboratories. Gone are the days when the natural products researchers were expected to devote themselves exclusively to the isolation, purification, and structure elucidation of small molecules. We now also engage with big data in taxonomic, genomic, proteomic, and/or metabolomic collections, and use these data to generate and test hypotheses. While the oft stated aim for the use of large-scale -omics data in the natural products sciences is to achieve a rapid increase in the rate of discovery of new drugs, this has not yet come to pass. At the same time, new technologies have provided unexpected opportunities for natural products chemists to ask and answer new and different questions. With this viewpoint, we discuss the evolution of big data as a part of natural products research and provide a few examples of how discoveries have been enabled by access to big data. We also draw attention to some of the limitations in our existing engagement with large datasets and consider what would be necessary to overcome them.
Identifiants
pubmed: 34734219
doi: 10.1039/d1np00061f
pmc: PMC8597707
doi:
Substances chimiques
Biological Products
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
1947-1953Subventions
Organisme : NCCIH NIH HHS
ID : U41 AT008718
Pays : United States
Références
Nature. 1984 May 31-Jun 6;309(5967):462-4
pubmed: 6328317
Nat Microbiol. 2018 Apr;3(4):415-422
pubmed: 29434326
Anal Chim Acta. 2018 Aug 27;1021:69-77
pubmed: 29681286
Nat Protoc. 2020 Jun;15(6):1954-1991
pubmed: 32405051
Chem Soc Rev. 2020 Jun 7;49(11):3297-3314
pubmed: 32393943
Metabolites. 2018 May 10;8(2):
pubmed: 29748461
J Cheminform. 2021 Jan 10;13(1):2
pubmed: 33423696
Anal Chem. 2017 Oct 3;89(19):10397-10406
pubmed: 28914531
Proc Natl Acad Sci U S A. 2021 May 11;118(19):
pubmed: 33941694
J Nat Prod. 2021 Mar 26;84(3):824-835
pubmed: 33666420
Nat Prod Rep. 2019 Jan 1;36(1):35-107
pubmed: 30003207
J Am Chem Soc. 1971 May 5;93(9):2325-7
pubmed: 5553076
ACS Cent Sci. 2019 Nov 27;5(11):1824-1833
pubmed: 31807684
Anal Chem. 2014 Jul 15;86(14):6812-7
pubmed: 24927477
Angew Chem Int Ed Engl. 2016 Aug 22;55(35):10210-26
pubmed: 27488942
Cell. 2020 Apr 2;181(1):29-45
pubmed: 32197064
J Chem Inf Model. 2020 Jul 27;60(7):3376-3386
pubmed: 32538625
Nat Methods. 2020 Sep;17(9):901-904
pubmed: 32807955
Anal Chim Acta. 2008 Oct 3;627(1):71-81
pubmed: 18790129
Proc Natl Acad Sci U S A. 2020 Jul 21;117(29):17195-17203
pubmed: 32606248
J Chem Inf Model. 2021 Jun 28;61(6):2560-2571
pubmed: 34042443
J Am Chem Soc. 2019 Mar 6;141(9):3910-3919
pubmed: 30735616
Elife. 2022 May 26;11:
pubmed: 35616633
Gigascience. 2021 Jan 13;10(1):
pubmed: 33438731