MINE 2.0: enhanced biochemical coverage for peak identification in untargeted metabolomics.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
27 06 2022
Historique:
received: 23 08 2021
revised: 25 03 2022
accepted: 16 05 2022
pubmed: 21 5 2022
medline: 15 11 2022
entrez: 20 5 2022
Statut: ppublish

Résumé

Although advances in untargeted metabolomics have made it possible to gather data on thousands of cellular metabolites in parallel, identification of novel metabolites from these datasets remains challenging. To address this need, Metabolic in silico Network Expansions (MINEs) were developed. A MINE is an expansion of known biochemistry which can be used as a list of potential structures for unannotated metabolomics peaks. Here, we present MINE 2.0, which utilizes a new set of biochemical transformation rules that covers 93% of MetaCyc reactions (compared to 25% in MINE 1.0). This results in a 17-fold increase in database size and a 40% increase in MINE database compounds matching unannotated peaks from an untargeted metabolomics dataset. MINE 2.0 is thus a significant improvement to this community resource. The MINE 2.0 website can be accessed at https://minedatabase.ci.northwestern.edu. The MINE 2.0 web API documentation can be accessed at https://mine-api.readthedocs.io/en/latest/. The data and code underlying this article are available in the MINE-2.0-Paper repository at https://github.com/tyo-nu/MINE-2.0-Paper. MINE 2.0 source code can be accessed at https://github.com/tyo-nu/MINE-Database (MINE construction), https://github.com/tyo-nu/MINE-Server (backend web API) and https://github.com/tyo-nu/MINE-app (web app). Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 35595247
pii: 6589885
doi: 10.1093/bioinformatics/btac331
pmc: PMC9237697
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

3484-3487

Subventions

Organisme : NIGMS NIH HHS
ID : T32 GM008449
Pays : United States
Organisme : National Science Foundation
ID : MCB-1614953
Organisme : U.S. Department of Energy
ID : DE-SC0018249

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Références

PLoS Comput Biol. 2014 Feb 20;10(2):e1003483
pubmed: 24586134
Anal Chem. 2017 Oct 3;89(19):10171-10180
pubmed: 28876899
Anal Chem. 2015 Oct 20;87(20):10619-26
pubmed: 26415007
Metab Eng. 2021 May;65:79-87
pubmed: 33662575
Metab Eng. 2018 Jan;45:223-236
pubmed: 29278749
ACS Synth Biol. 2021 Apr 16;10(4):724-736
pubmed: 33764057
Bioinformatics. 2018 Jun 15;34(12):2096-2102
pubmed: 29447341
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12580-5
pubmed: 26392543
Metabolites. 2020 Apr 21;10(4):
pubmed: 32326153
Nucleic Acids Res. 2021 Jan 8;49(D1):D1388-D1395
pubmed: 33151290
J Cheminform. 2016 Jan 29;8:3
pubmed: 26834843
Microb Cell Fact. 2019 Jun 13;18(1):109
pubmed: 31196115
Nucleic Acids Res. 2021 Jan 8;49(D1):D498-D508
pubmed: 33211880
Nucleic Acids Res. 2018 Jan 4;46(D1):D633-D639
pubmed: 29059334
Metabolites. 2019 Apr 13;9(4):
pubmed: 31013937
Nucleic Acids Res. 2022 Jan 7;50(D1):D603-D609
pubmed: 34850162
J Am Chem Soc. 2020 May 20;142(20):9097-9105
pubmed: 32275430
Nat Methods. 2017 Feb;14(2):187-194
pubmed: 27941785
J Cheminform. 2019 Jan 5;11(1):2
pubmed: 30612223
J Pharm Biomed Anal. 2018 May 30;154:138-149
pubmed: 29547800
J Cheminform. 2019 Aug 9;11(1):55
pubmed: 31399811
J Cheminform. 2015 Aug 28;7:44
pubmed: 26322134
Nat Methods. 2018 Jan;15(1):53-56
pubmed: 29176591
J Mass Spectrom. 2010 Jul;45(7):703-14
pubmed: 20623627
Anal Chem. 2021 Aug 31;93(34):11692-11700
pubmed: 34403256
Bioinformatics. 2005 Apr 15;21(8):1603-9
pubmed: 15613400

Auteurs

Jonathan Strutz (J)

Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA.
Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA.

Kevin M Shebek (KM)

Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA.
Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA.

Linda J Broadbelt (LJ)

Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA.

Keith E J Tyo (KEJ)

Department of Chemical and Biological Engineering, Northwestern University, Evanston, IL 60208, USA.
Center for Synthetic Biology, Northwestern University, Evanston, IL 60208, USA.
Chemistry of Life Processes Institute, Northwestern University, Evanston, IL 60208, USA.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH