MetaboAnalystR 4.0: a unified LC-MS workflow for global metabolomics.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
01 May 2024
Historique:
received: 15 09 2023
accepted: 18 04 2024
medline: 2 5 2024
pubmed: 2 5 2024
entrez: 1 5 2024
Statut: epublish

Résumé

The wide applications of liquid chromatography - mass spectrometry (LC-MS) in untargeted metabolomics demand an easy-to-use, comprehensive computational workflow to support efficient and reproducible data analysis. However, current tools were primarily developed to perform specific tasks in LC-MS based metabolomics data analysis. Here we introduce MetaboAnalystR 4.0 as a streamlined pipeline covering raw spectra processing, compound identification, statistical analysis, and functional interpretation. The key features of MetaboAnalystR 4.0 includes an auto-optimized feature detection and quantification algorithm for LC-MS1 spectra processing, efficient MS2 spectra deconvolution and compound identification for data-dependent or data-independent acquisition, and more accurate functional interpretation through integrated spectral annotation. Comprehensive validation studies using LC-MS1 and MS2 spectra obtained from standards mixtures, dilution series and clinical metabolomics samples have shown its excellent performance across a wide range of common tasks such as peak picking, spectral deconvolution, and compound identification with good computing efficiency. Together with its existing statistical analysis utilities, MetaboAnalystR 4.0 represents a significant step toward a unified, end-to-end workflow for LC-MS based global metabolomics in the open-source R environment.

Identifiants

pubmed: 38693118
doi: 10.1038/s41467-024-48009-6
pii: 10.1038/s41467-024-48009-6
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

3675

Informations de copyright

© 2024. The Author(s).

Références

Wishart, D. S. Metabolomics for investigating physiological and pathophysiological processes. Physiol. Rev. 99, 1819–1875 (2019).
pubmed: 31434538 doi: 10.1152/physrev.00035.2018
Shen, X. et al. Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics. Nat. Commun. 10, 1516 (2019).
pubmed: 30944337 pmcid: 6447530 doi: 10.1038/s41467-019-09550-x
Vermeulen, R., Schymanski, E. L., Barabasi, A. L. & Miller, G. W. The exposome and health: where chemistry meets biology. Science 367, 392–396 (2020).
pubmed: 31974245 pmcid: 7227413 doi: 10.1126/science.aay3164
Chang, H. Y. et al. A practical guide to metabolomics software development. Anal. Chem. 93, 1912–1923 (2021).
pubmed: 33467846 pmcid: 7859930 doi: 10.1021/acs.analchem.0c03581
Tsugawa, H. et al. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods 12, 523–526 (2015).
pubmed: 25938372 pmcid: 4449330 doi: 10.1038/nmeth.3393
Tautenhahn, R., Patti, G. J., Rinehart, D. & Siuzdak, G. XCMS online: a web-based platform to process untargeted metabolomic data. Anal. Chem. 84, 5035–5039 (2012).
pubmed: 22533540 pmcid: 3703953 doi: 10.1021/ac300698c
Li, S., Siddiqa, A., Thapa, M., Chi, Y. & Zheng, S. Trackable and scalable LC-MS metabolomics data processing using asari. Nat. Commun. 14, 4113 (2023).
pubmed: 37433854 pmcid: 10336130 doi: 10.1038/s41467-023-39889-1
Smith, C. A., Want, E. J., O’Maille, G., Abagyan, R. & Siuzdak, G. XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal. Chem. 78, 779–787 (2006).
pubmed: 16448051 doi: 10.1021/ac051437y
Schmid, R. et al. Integrative analysis of multimodal mass spectrometry data in MZmine 3. Nat. Biotechnol. 41, 447–449 (2023).
Aron, A. T. et al. Reproducible molecular networking of untargeted mass spectrometry data using GNPS. Nat. Protoc. 15, 1954–1991 (2020).
pubmed: 32405051 doi: 10.1038/s41596-020-0317-5
Lai, Z. et al. Identifying metabolites by integrating metabolome databases with mass spectrometry cheminformatics. Nat. Methods 15, 53–56 (2018).
pubmed: 29176591 doi: 10.1038/nmeth.4512
Dührkop, K. et al. SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information. Nat. Methods 16, 299–302 (2019).
pubmed: 30886413 doi: 10.1038/s41592-019-0344-8
Xia, J., Psychogios, N., Young, N. & Wishart, D. S. MetaboAnalyst: a web server for metabolomic data analysis and interpretation. Nucleic Acids Res. 37, W652–W660 (2009).
pubmed: 19429898 pmcid: 2703878 doi: 10.1093/nar/gkp356
Stanstrup, J. et al. The metaRbolomics Toolbox in Bioconductor and beyond. Metabolites 9, 200 (2019).
Li, S. et al. Predicting network activity from high throughput metabolomics. PLoS Comput. Biol. 9, e1003123 (2013).
pubmed: 23861661 pmcid: 3701697 doi: 10.1371/journal.pcbi.1003123
Lu, Y., Pang, Z. & Xia, J. Comprehensive investigation of pathway enrichment methods for functional interpretation of LC-MS global metabolomics data. Brief Bioinform. 24, bbac553 (2023).
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51, D587–d592 (2023).
pubmed: 36300620 doi: 10.1093/nar/gkac963
Stancliffe, E., Schwaiger-Haber, M., Sindelar, M. & Patti, G. J. DecoID improves identification rates in metabolomics through database-assisted MS/MS deconvolution. Nat. Methods 18, 779–787 (2021).
pubmed: 34239103 pmcid: 9302972 doi: 10.1038/s41592-021-01195-3
Alka, O. et al. DIAMetAlyzer allows automated false-discovery rate-controlled analysis for data-independent acquisition in metabolomics. Nat. Commun. 13, 1347 (2022).
pubmed: 35292629 pmcid: 8924252 doi: 10.1038/s41467-022-29006-z
Raetz, M., Bonner, R. & Hopfgartner, G. SWATH-MS for metabolomics and lipidomics: critical aspects of qualitative and quantitative analysis. Metabolomics 16, 71 (2020).
pubmed: 32504120 doi: 10.1007/s11306-020-01692-0
Tsugawa, H. et al. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 38, 1159–1163 (2020).
pubmed: 32541957 doi: 10.1038/s41587-020-0531-2
Wang, M. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 34, 828–837 (2016).
pubmed: 27504778 pmcid: 5321674 doi: 10.1038/nbt.3597
Yin, Y., Wang, R., Cai, Y., Wang, Z. & Zhu, Z.-J. DecoMetDIA: deconvolution of multiplexed MS/MS spectra for metabolite identification in SWATH-MS-based untargeted metabolomics. Anal. Chem. 91, 11897–11904 (2019).
pubmed: 31436405 doi: 10.1021/acs.analchem.9b02655
Xing, S. et al. Recognizing contamination fragment ions in liquid chromatography-tandem mass spectrometry data. J. Am. Soc. Mass Spectrom. 32, 2296–2305 (2021).
pubmed: 33739814 doi: 10.1021/jasms.0c00478
Chong, J. & Xia, J. MetaboAnalystR: an R package for flexible and reproducible analysis of metabolomics data. Bioinformatics 34, 4313–4314 (2018).
pubmed: 29955821 pmcid: 6289126 doi: 10.1093/bioinformatics/bty528
Chong, J., Yamamoto, M. & Xia, J. MetaboAnalystR 2.0: from raw spectra to biological insights. Metabolites 9, 57 (2019).
Pang, Z., Chong, J., Li, S. & Xia, J. MetaboAnalystR 3.0: toward an optimized workflow for global metabolomics. Metabolites 10, 186 (2020).
pubmed: 32392884 pmcid: 7281575 doi: 10.3390/metabo10050186
Chen, L. et al. Metabolite discovery through global annotation of untargeted metabolomics data. Nat. Methods 18, 1377–1385 (2021).
pubmed: 34711973 pmcid: 8733904 doi: 10.1038/s41592-021-01303-3
Li, Y. et al. Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification. Nat. Methods 18, 1524–1531 (2021).
pubmed: 34857935 doi: 10.1038/s41592-021-01331-z
Aisporna, A. et al. Neutral loss mass spectral data enhances molecular similarity analysis in METLIN. J. Am. Soc. Mass Spectrom. 33, 530–534 (2022).
pubmed: 35174708 pmcid: 10131246 doi: 10.1021/jasms.1c00343
Wishart, D. S. et al. HMDB 5.0: the human metabolome database for 2022. Nucleic Acids Res. 50, D622–d631 (2022).
pubmed: 34986597 doi: 10.1093/nar/gkab1062
Horai, H. et al. MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 45, 703–714 (2010).
pubmed: 20623627 doi: 10.1002/jms.1777
Kind, T. et al. LipidBlast in silico tandem mass spectrometry database for lipid identification. Nat. Methods 10, 755–758 (2013).
pubmed: 23817071 pmcid: 3731409 doi: 10.1038/nmeth.2551
Watanabe, K., Yasugi, E. & Oshima, M. How to search the glycolipid data in “LIPIDBANK for Web”, the newly developed lipid database in Japan. Trends Glycosci. Glycotechnol. 12, 175–184 (2000).
doi: 10.4052/tigg.12.175
Jeffryes, J. G. et al. MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics. J. Cheminform. 7, 44 (2015).
pubmed: 26322134 pmcid: 4550642 doi: 10.1186/s13321-015-0087-1
Fahy, E. et al. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 50, S9–S14 (2009).
pubmed: 19098281 pmcid: 2674711 doi: 10.1194/jlr.R800095-JLR200
Xing, S., Shen, S., Xu, B., Li, X. & Huan, T. BUDDY: molecular formula discovery via bottom-up MS/MS interrogation. Nat. Methods 20, 881–890 (2023).
pubmed: 37055660 doi: 10.1038/s41592-023-01850-x
Karp, P. D. et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief. Bioinform. 20, 1085–1093 (2017).
pmcid: 6781571 doi: 10.1093/bib/bbx085
Chang, L., Zhou, G., Ou, H. & Xia, J. mGWAS-explorer: linking SNPs, genes, metabolites, and diseases for functional insights. Metabolites 12, 526 (2022).
pubmed: 35736459 pmcid: 9230867 doi: 10.3390/metabo12060526
Narayanaswamy, P. et al. MetaboKit: a comprehensive data extraction tool for untargeted metabolomics. Mol. Omics 16, 436–447 (2020).
pubmed: 32519713 doi: 10.1039/D0MO00030B
Phapale, P. et al. Public LC-orbitrap tandem mass spectral library for metabolite identification. J. Proteome Res. 20, 2089–2097 (2021).
pubmed: 33529026 doi: 10.1021/acs.jproteome.0c00930
Wu, P. et al. The trans-omics landscape of COVID-19. Nat. Commun. 12, 4543 (2021).
pubmed: 34315889 pmcid: 8316550 doi: 10.1038/s41467-021-24482-1
Sameh, M. et al. Integrated multiomics analysis to infer COVID-19 biological insights. Sci. Rep. 13, 1802 (2023).
pubmed: 36720931 pmcid: 9888750 doi: 10.1038/s41598-023-28816-5
Stringer, K. A. et al. Whole blood reveals more metabolic detail of the human metabolome than serum as measured by 1H-NMR spectroscopy: implications for sepsis metabolomics. Shock 44, 200–208 (2015).
pubmed: 26009817 pmcid: 4537695 doi: 10.1097/SHK.0000000000000406
Thomas, T. et al. Evidence of structural protein damage and membrane lipid remodeling in red blood cells from COVID-19 patients. J. Proteome Res. 19, 4455–4469 (2020).
pubmed: 33103907 pmcid: 7640979 doi: 10.1021/acs.jproteome.0c00606
Chamberlain, C. A., Rubio, V. Y. & Garrett, T. J. Impact of matrix effects and ionization efficiency in non-quantitative untargeted metabolomics. Metabolomics 15, 135 (2019).
pubmed: 31584114 doi: 10.1007/s11306-019-1597-z
Ripon, M. A. R., Bhowmik, D. R., Amin, M. T. & Hossain, M. S. Role of arachidonic cascade in COVID-19 infection: a review. Prostaglandins Other Lipid Mediat. 154, 106539 (2021).
pubmed: 33592322 pmcid: 7882227 doi: 10.1016/j.prostaglandins.2021.106539
Bae, J. H., Choe, H. J., Holick, M. F. & Lim, S. Association of vitamin D status with COVID-19 and its severity: Vitamin D and COVID-19: a narrative review. Rev. Endocr. Metab. Disord. 23, 579–599 (2022).
pubmed: 34982377 pmcid: 8724612 doi: 10.1007/s11154-021-09705-6
Xu, Y. et al. The importance of vitamin D metabolism as a potential prophylactic, immunoregulatory and neuroprotective treatment for COVID-19. J. Transl. Med. 18, 322 (2020).
pubmed: 32847594 pmcid: 7447609 doi: 10.1186/s12967-020-02488-5
Abu-Farha, M. et al. The role of lipid metabolism in COVID-19 virus infection and as a drug target. Int. J. Mol. Sci. 21, 3544 (2020).
Yoo A. B., Jette M. A., Grondona M. SLURM: Simple Linux Utility for Resource Management. In Job Scheduling Strategies for Parallel Processing (eds Feitelson, D., Rudolph, L. & Schwiegelshohn, U.) (Springer, Berlin, 2003).
Pang, Z. et al. MetaboAnalyst 6.0: towards a unified platform for metabolomics data processing, analysis and interpretation. Nucleic Acids Res. https://doi.org/10.1093/nar/gkae253 (2024).
doi: 10.1093/nar/gkae253 pubmed: 38587201 pmcid: 11014258
Guijas, C. et al. METLIN: a technology platform for identifying knowns and unknowns. Anal. Chem. 90, 3156–3164 (2018).
pubmed: 29381867 pmcid: 5933435 doi: 10.1021/acs.analchem.7b04424
Wang, L. et al. Spatially resolved isotope tracing reveals tissue metabolic activity. Nat. Methods 19, 223–230 (2022).
pubmed: 35132243 pmcid: 10926149 doi: 10.1038/s41592-021-01378-y
Li, S. & Zheng, S. Generalized tree structure to annotate untargeted metabolomics and stable isotope tracing data. Anal. Chem. 95, 6212–6217 (2023).
pubmed: 37018697 pmcid: 10117393 doi: 10.1021/acs.analchem.2c05810
Graca, G. et al. Automated annotation of untargeted all-ion fragmentation LC-MS metabolomics data with MetaboAnnotatoR. Anal. Chem. 94, 3446–3455 (2022).
pubmed: 35180347 pmcid: 8892435 doi: 10.1021/acs.analchem.1c03032
Santa-Rios, A., Barst, B. D. & Basu, N. Mercury speciation in whole blood and dried blood spots from capillary and venous sources. Anal. Chem. 92, 3605–3612 (2020).
pubmed: 32056420 doi: 10.1021/acs.analchem.9b04407
Johnson, J. M., Yu, T., Strobel, F. H. & Jones, D. P. A practical approach to detect unique metabolic patterns for personalized medicine. Analyst 135, 2864–2870 (2010).
pubmed: 20838665 pmcid: 3069708 doi: 10.1039/c0an00333f
Go, Y. M. et al. Reference standardization for mass spectrometry and high-resolution metabolomics applications to exposome research. Toxicol. Sci. 148, 531–543 (2015).
pubmed: 26358001 pmcid: 4675836 doi: 10.1093/toxsci/kfv198
Giné, R. et al. HERMES: a molecular-formula-oriented method to target the metabolome. Nat. Methods 18, 1370–1376 (2021).
pubmed: 34725482 pmcid: 9284938 doi: 10.1038/s41592-021-01307-z
Ten-Domenech, I. et al. Comparing Targeted vs. Untargeted MS(2) Data-Dependent Acquisition for Peak Annotation in LC-MS Metabolomics. Metabolites 10, 126 (2020).
pubmed: 32225041 pmcid: 7241085 doi: 10.3390/metabo10040126
Tenenbaum, D. & Maintainer, B. KEGGREST: Client-side REST access to the Kyoto Encyclopedia of Genes and Genomes (KEGG). R package version 1, (2021).
Djoumbou Feunang, Y. et al. ClassyFire: automated chemical classification with a comprehensive, computable taxonomy. J. Cheminform. 8, 61 (2016).
pubmed: 27867422 pmcid: 5096306 doi: 10.1186/s13321-016-0174-y
Fahy, E. & Subramaniam, S. RefMet: a reference nomenclature for metabolomics. Nat. Methods 17, 1173–1174 (2020).
pubmed: 33199890 doi: 10.1038/s41592-020-01009-y
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–d361 (2017).
pubmed: 27899662 doi: 10.1093/nar/gkw1092
Wishart, D. S. et al. MiMeDB: the Human Microbial Metabolome Database. Nucleic Acids Res. 51, D611–d620 (2023).
pubmed: 36215042 doi: 10.1093/nar/gkac868
Wishart, D. et al. T3DB: the toxic exposome database. Nucleic Acids Res. 43, D928–D934 (2015).
pubmed: 25378312 doi: 10.1093/nar/gku1004
Rothwell, J. A. et al. Phenol-Explorer 3.0: a major update of the Phenol-Explorer database to incorporate data on the effects of food processing on polyphenol content. Database 2013, bat070 (2013).
pubmed: 24103452 pmcid: 3792339 doi: 10.1093/database/bat070
Neveu, V. et al. Exposome-Explorer: a manually-curated database on biomarkers of exposure to dietary and environmental factors. Nucleic Acids Res. 45, D979–d984 (2017).
pubmed: 27924041 doi: 10.1093/nar/gkw980
Mohammed Taha, H. et al. The NORMAN Suspect List Exchange (NORMAN-SLE): facilitating European and worldwide collaboration on suspect screening in high resolution mass spectrometry. Environ. Sci. Eur. 34, 104 (2022).
pubmed: 36284750 pmcid: 9587084 doi: 10.1186/s12302-022-00680-6
Gabriela VCF, David, K., Matías, S.-B. & Ezequiel, S. Robust elastic net estimators for variable selection and identification of proteomic biomarkers. Ann. Appl. Stat. 13, 2065–2090 (2019).
Kepplinger, D. Robust variable selection and estimation via adaptive elastic net S-estimators for linear regression. Comput. Stat. Data. Anal. 183, 107730 (2023).
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
pubmed: 23051804 pmcid: 3471674 doi: 10.1038/nbt.2377
Du, X., Smirnov, A., Pluskal, T., Jia, W. & Sumner, S. Metabolomics data preprocessing Using ADAP and MZmine 2. Methods Mol. Biol. 2104, 25–48 (2020).
pubmed: 31953811 pmcid: 8359540 doi: 10.1007/978-1-0716-0239-3_3

Auteurs

Zhiqiang Pang (Z)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.

Lei Xu (L)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.

Charles Viau (C)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.

Yao Lu (Y)

Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada.

Reza Salavati (R)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.

Niladri Basu (N)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada.

Jianguo Xia (J)

Faculty of Agricultural and Environmental Sciences, McGill University, Ste-Anne-de-Bellevue, QC, Canada. jeff.xia@mcgill.ca.
Department of Microbiology and Immunology, McGill University, Montreal, QC, Canada. jeff.xia@mcgill.ca.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH