MetaboAnalystR 4.0: a unified LC-MS workflow for global metabolomics.
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
01 May 2024
01 May 2024
Historique:
received:
15
09
2023
accepted:
18
04
2024
medline:
2
5
2024
pubmed:
2
5
2024
entrez:
1
5
2024
Statut:
epublish
Résumé
The wide applications of liquid chromatography - mass spectrometry (LC-MS) in untargeted metabolomics demand an easy-to-use, comprehensive computational workflow to support efficient and reproducible data analysis. However, current tools were primarily developed to perform specific tasks in LC-MS based metabolomics data analysis. Here we introduce MetaboAnalystR 4.0 as a streamlined pipeline covering raw spectra processing, compound identification, statistical analysis, and functional interpretation. The key features of MetaboAnalystR 4.0 includes an auto-optimized feature detection and quantification algorithm for LC-MS1 spectra processing, efficient MS2 spectra deconvolution and compound identification for data-dependent or data-independent acquisition, and more accurate functional interpretation through integrated spectral annotation. Comprehensive validation studies using LC-MS1 and MS2 spectra obtained from standards mixtures, dilution series and clinical metabolomics samples have shown its excellent performance across a wide range of common tasks such as peak picking, spectral deconvolution, and compound identification with good computing efficiency. Together with its existing statistical analysis utilities, MetaboAnalystR 4.0 represents a significant step toward a unified, end-to-end workflow for LC-MS based global metabolomics in the open-source R environment.
Identifiants
pubmed: 38693118
doi: 10.1038/s41467-024-48009-6
pii: 10.1038/s41467-024-48009-6
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
3675Informations de copyright
© 2024. The Author(s).
Références
Wishart, D. S. Metabolomics for investigating physiological and pathophysiological processes. Physiol. Rev. 99, 1819–1875 (2019).
pubmed: 31434538
doi: 10.1152/physrev.00035.2018
Shen, X. et al. Metabolic reaction network-based recursive metabolite annotation for untargeted metabolomics. Nat. Commun. 10, 1516 (2019).
pubmed: 30944337
pmcid: 6447530
doi: 10.1038/s41467-019-09550-x
Vermeulen, R., Schymanski, E. L., Barabasi, A. L. & Miller, G. W. The exposome and health: where chemistry meets biology. Science 367, 392–396 (2020).
pubmed: 31974245
pmcid: 7227413
doi: 10.1126/science.aay3164
Chang, H. Y. et al. A practical guide to metabolomics software development. Anal. Chem. 93, 1912–1923 (2021).
pubmed: 33467846
pmcid: 7859930
doi: 10.1021/acs.analchem.0c03581
Tsugawa, H. et al. MS-DIAL: data-independent MS/MS deconvolution for comprehensive metabolome analysis. Nat. Methods 12, 523–526 (2015).
pubmed: 25938372
pmcid: 4449330
doi: 10.1038/nmeth.3393
Tautenhahn, R., Patti, G. J., Rinehart, D. & Siuzdak, G. XCMS online: a web-based platform to process untargeted metabolomic data. Anal. Chem. 84, 5035–5039 (2012).
pubmed: 22533540
pmcid: 3703953
doi: 10.1021/ac300698c
Li, S., Siddiqa, A., Thapa, M., Chi, Y. & Zheng, S. Trackable and scalable LC-MS metabolomics data processing using asari. Nat. Commun. 14, 4113 (2023).
pubmed: 37433854
pmcid: 10336130
doi: 10.1038/s41467-023-39889-1
Smith, C. A., Want, E. J., O’Maille, G., Abagyan, R. & Siuzdak, G. XCMS: processing mass spectrometry data for metabolite profiling using nonlinear peak alignment, matching, and identification. Anal. Chem. 78, 779–787 (2006).
pubmed: 16448051
doi: 10.1021/ac051437y
Schmid, R. et al. Integrative analysis of multimodal mass spectrometry data in MZmine 3. Nat. Biotechnol. 41, 447–449 (2023).
Aron, A. T. et al. Reproducible molecular networking of untargeted mass spectrometry data using GNPS. Nat. Protoc. 15, 1954–1991 (2020).
pubmed: 32405051
doi: 10.1038/s41596-020-0317-5
Lai, Z. et al. Identifying metabolites by integrating metabolome databases with mass spectrometry cheminformatics. Nat. Methods 15, 53–56 (2018).
pubmed: 29176591
doi: 10.1038/nmeth.4512
Dührkop, K. et al. SIRIUS 4: a rapid tool for turning tandem mass spectra into metabolite structure information. Nat. Methods 16, 299–302 (2019).
pubmed: 30886413
doi: 10.1038/s41592-019-0344-8
Xia, J., Psychogios, N., Young, N. & Wishart, D. S. MetaboAnalyst: a web server for metabolomic data analysis and interpretation. Nucleic Acids Res. 37, W652–W660 (2009).
pubmed: 19429898
pmcid: 2703878
doi: 10.1093/nar/gkp356
Stanstrup, J. et al. The metaRbolomics Toolbox in Bioconductor and beyond. Metabolites 9, 200 (2019).
Li, S. et al. Predicting network activity from high throughput metabolomics. PLoS Comput. Biol. 9, e1003123 (2013).
pubmed: 23861661
pmcid: 3701697
doi: 10.1371/journal.pcbi.1003123
Lu, Y., Pang, Z. & Xia, J. Comprehensive investigation of pathway enrichment methods for functional interpretation of LC-MS global metabolomics data. Brief Bioinform. 24, bbac553 (2023).
Kanehisa, M., Furumichi, M., Sato, Y., Kawashima, M. & Ishiguro-Watanabe, M. KEGG for taxonomy-based analysis of pathways and genomes. Nucleic Acids Res. 51, D587–d592 (2023).
pubmed: 36300620
doi: 10.1093/nar/gkac963
Stancliffe, E., Schwaiger-Haber, M., Sindelar, M. & Patti, G. J. DecoID improves identification rates in metabolomics through database-assisted MS/MS deconvolution. Nat. Methods 18, 779–787 (2021).
pubmed: 34239103
pmcid: 9302972
doi: 10.1038/s41592-021-01195-3
Alka, O. et al. DIAMetAlyzer allows automated false-discovery rate-controlled analysis for data-independent acquisition in metabolomics. Nat. Commun. 13, 1347 (2022).
pubmed: 35292629
pmcid: 8924252
doi: 10.1038/s41467-022-29006-z
Raetz, M., Bonner, R. & Hopfgartner, G. SWATH-MS for metabolomics and lipidomics: critical aspects of qualitative and quantitative analysis. Metabolomics 16, 71 (2020).
pubmed: 32504120
doi: 10.1007/s11306-020-01692-0
Tsugawa, H. et al. A lipidome atlas in MS-DIAL 4. Nat. Biotechnol. 38, 1159–1163 (2020).
pubmed: 32541957
doi: 10.1038/s41587-020-0531-2
Wang, M. et al. Sharing and community curation of mass spectrometry data with Global Natural Products Social Molecular Networking. Nat. Biotechnol. 34, 828–837 (2016).
pubmed: 27504778
pmcid: 5321674
doi: 10.1038/nbt.3597
Yin, Y., Wang, R., Cai, Y., Wang, Z. & Zhu, Z.-J. DecoMetDIA: deconvolution of multiplexed MS/MS spectra for metabolite identification in SWATH-MS-based untargeted metabolomics. Anal. Chem. 91, 11897–11904 (2019).
pubmed: 31436405
doi: 10.1021/acs.analchem.9b02655
Xing, S. et al. Recognizing contamination fragment ions in liquid chromatography-tandem mass spectrometry data. J. Am. Soc. Mass Spectrom. 32, 2296–2305 (2021).
pubmed: 33739814
doi: 10.1021/jasms.0c00478
Chong, J. & Xia, J. MetaboAnalystR: an R package for flexible and reproducible analysis of metabolomics data. Bioinformatics 34, 4313–4314 (2018).
pubmed: 29955821
pmcid: 6289126
doi: 10.1093/bioinformatics/bty528
Chong, J., Yamamoto, M. & Xia, J. MetaboAnalystR 2.0: from raw spectra to biological insights. Metabolites 9, 57 (2019).
Pang, Z., Chong, J., Li, S. & Xia, J. MetaboAnalystR 3.0: toward an optimized workflow for global metabolomics. Metabolites 10, 186 (2020).
pubmed: 32392884
pmcid: 7281575
doi: 10.3390/metabo10050186
Chen, L. et al. Metabolite discovery through global annotation of untargeted metabolomics data. Nat. Methods 18, 1377–1385 (2021).
pubmed: 34711973
pmcid: 8733904
doi: 10.1038/s41592-021-01303-3
Li, Y. et al. Spectral entropy outperforms MS/MS dot product similarity for small-molecule compound identification. Nat. Methods 18, 1524–1531 (2021).
pubmed: 34857935
doi: 10.1038/s41592-021-01331-z
Aisporna, A. et al. Neutral loss mass spectral data enhances molecular similarity analysis in METLIN. J. Am. Soc. Mass Spectrom. 33, 530–534 (2022).
pubmed: 35174708
pmcid: 10131246
doi: 10.1021/jasms.1c00343
Wishart, D. S. et al. HMDB 5.0: the human metabolome database for 2022. Nucleic Acids Res. 50, D622–d631 (2022).
pubmed: 34986597
doi: 10.1093/nar/gkab1062
Horai, H. et al. MassBank: a public repository for sharing mass spectral data for life sciences. J. Mass Spectrom. 45, 703–714 (2010).
pubmed: 20623627
doi: 10.1002/jms.1777
Kind, T. et al. LipidBlast in silico tandem mass spectrometry database for lipid identification. Nat. Methods 10, 755–758 (2013).
pubmed: 23817071
pmcid: 3731409
doi: 10.1038/nmeth.2551
Watanabe, K., Yasugi, E. & Oshima, M. How to search the glycolipid data in “LIPIDBANK for Web”, the newly developed lipid database in Japan. Trends Glycosci. Glycotechnol. 12, 175–184 (2000).
doi: 10.4052/tigg.12.175
Jeffryes, J. G. et al. MINEs: open access databases of computationally predicted enzyme promiscuity products for untargeted metabolomics. J. Cheminform. 7, 44 (2015).
pubmed: 26322134
pmcid: 4550642
doi: 10.1186/s13321-015-0087-1
Fahy, E. et al. Update of the LIPID MAPS comprehensive classification system for lipids. J. Lipid Res. 50, S9–S14 (2009).
pubmed: 19098281
pmcid: 2674711
doi: 10.1194/jlr.R800095-JLR200
Xing, S., Shen, S., Xu, B., Li, X. & Huan, T. BUDDY: molecular formula discovery via bottom-up MS/MS interrogation. Nat. Methods 20, 881–890 (2023).
pubmed: 37055660
doi: 10.1038/s41592-023-01850-x
Karp, P. D. et al. The BioCyc collection of microbial genomes and metabolic pathways. Brief. Bioinform. 20, 1085–1093 (2017).
pmcid: 6781571
doi: 10.1093/bib/bbx085
Chang, L., Zhou, G., Ou, H. & Xia, J. mGWAS-explorer: linking SNPs, genes, metabolites, and diseases for functional insights. Metabolites 12, 526 (2022).
pubmed: 35736459
pmcid: 9230867
doi: 10.3390/metabo12060526
Narayanaswamy, P. et al. MetaboKit: a comprehensive data extraction tool for untargeted metabolomics. Mol. Omics 16, 436–447 (2020).
pubmed: 32519713
doi: 10.1039/D0MO00030B
Phapale, P. et al. Public LC-orbitrap tandem mass spectral library for metabolite identification. J. Proteome Res. 20, 2089–2097 (2021).
pubmed: 33529026
doi: 10.1021/acs.jproteome.0c00930
Wu, P. et al. The trans-omics landscape of COVID-19. Nat. Commun. 12, 4543 (2021).
pubmed: 34315889
pmcid: 8316550
doi: 10.1038/s41467-021-24482-1
Sameh, M. et al. Integrated multiomics analysis to infer COVID-19 biological insights. Sci. Rep. 13, 1802 (2023).
pubmed: 36720931
pmcid: 9888750
doi: 10.1038/s41598-023-28816-5
Stringer, K. A. et al. Whole blood reveals more metabolic detail of the human metabolome than serum as measured by 1H-NMR spectroscopy: implications for sepsis metabolomics. Shock 44, 200–208 (2015).
pubmed: 26009817
pmcid: 4537695
doi: 10.1097/SHK.0000000000000406
Thomas, T. et al. Evidence of structural protein damage and membrane lipid remodeling in red blood cells from COVID-19 patients. J. Proteome Res. 19, 4455–4469 (2020).
pubmed: 33103907
pmcid: 7640979
doi: 10.1021/acs.jproteome.0c00606
Chamberlain, C. A., Rubio, V. Y. & Garrett, T. J. Impact of matrix effects and ionization efficiency in non-quantitative untargeted metabolomics. Metabolomics 15, 135 (2019).
pubmed: 31584114
doi: 10.1007/s11306-019-1597-z
Ripon, M. A. R., Bhowmik, D. R., Amin, M. T. & Hossain, M. S. Role of arachidonic cascade in COVID-19 infection: a review. Prostaglandins Other Lipid Mediat. 154, 106539 (2021).
pubmed: 33592322
pmcid: 7882227
doi: 10.1016/j.prostaglandins.2021.106539
Bae, J. H., Choe, H. J., Holick, M. F. & Lim, S. Association of vitamin D status with COVID-19 and its severity: Vitamin D and COVID-19: a narrative review. Rev. Endocr. Metab. Disord. 23, 579–599 (2022).
pubmed: 34982377
pmcid: 8724612
doi: 10.1007/s11154-021-09705-6
Xu, Y. et al. The importance of vitamin D metabolism as a potential prophylactic, immunoregulatory and neuroprotective treatment for COVID-19. J. Transl. Med. 18, 322 (2020).
pubmed: 32847594
pmcid: 7447609
doi: 10.1186/s12967-020-02488-5
Abu-Farha, M. et al. The role of lipid metabolism in COVID-19 virus infection and as a drug target. Int. J. Mol. Sci. 21, 3544 (2020).
Yoo A. B., Jette M. A., Grondona M. SLURM: Simple Linux Utility for Resource Management. In Job Scheduling Strategies for Parallel Processing (eds Feitelson, D., Rudolph, L. & Schwiegelshohn, U.) (Springer, Berlin, 2003).
Pang, Z. et al. MetaboAnalyst 6.0: towards a unified platform for metabolomics data processing, analysis and interpretation. Nucleic Acids Res. https://doi.org/10.1093/nar/gkae253 (2024).
doi: 10.1093/nar/gkae253
pubmed: 38587201
pmcid: 11014258
Guijas, C. et al. METLIN: a technology platform for identifying knowns and unknowns. Anal. Chem. 90, 3156–3164 (2018).
pubmed: 29381867
pmcid: 5933435
doi: 10.1021/acs.analchem.7b04424
Wang, L. et al. Spatially resolved isotope tracing reveals tissue metabolic activity. Nat. Methods 19, 223–230 (2022).
pubmed: 35132243
pmcid: 10926149
doi: 10.1038/s41592-021-01378-y
Li, S. & Zheng, S. Generalized tree structure to annotate untargeted metabolomics and stable isotope tracing data. Anal. Chem. 95, 6212–6217 (2023).
pubmed: 37018697
pmcid: 10117393
doi: 10.1021/acs.analchem.2c05810
Graca, G. et al. Automated annotation of untargeted all-ion fragmentation LC-MS metabolomics data with MetaboAnnotatoR. Anal. Chem. 94, 3446–3455 (2022).
pubmed: 35180347
pmcid: 8892435
doi: 10.1021/acs.analchem.1c03032
Santa-Rios, A., Barst, B. D. & Basu, N. Mercury speciation in whole blood and dried blood spots from capillary and venous sources. Anal. Chem. 92, 3605–3612 (2020).
pubmed: 32056420
doi: 10.1021/acs.analchem.9b04407
Johnson, J. M., Yu, T., Strobel, F. H. & Jones, D. P. A practical approach to detect unique metabolic patterns for personalized medicine. Analyst 135, 2864–2870 (2010).
pubmed: 20838665
pmcid: 3069708
doi: 10.1039/c0an00333f
Go, Y. M. et al. Reference standardization for mass spectrometry and high-resolution metabolomics applications to exposome research. Toxicol. Sci. 148, 531–543 (2015).
pubmed: 26358001
pmcid: 4675836
doi: 10.1093/toxsci/kfv198
Giné, R. et al. HERMES: a molecular-formula-oriented method to target the metabolome. Nat. Methods 18, 1370–1376 (2021).
pubmed: 34725482
pmcid: 9284938
doi: 10.1038/s41592-021-01307-z
Ten-Domenech, I. et al. Comparing Targeted vs. Untargeted MS(2) Data-Dependent Acquisition for Peak Annotation in LC-MS Metabolomics. Metabolites 10, 126 (2020).
pubmed: 32225041
pmcid: 7241085
doi: 10.3390/metabo10040126
Tenenbaum, D. & Maintainer, B. KEGGREST: Client-side REST access to the Kyoto Encyclopedia of Genes and Genomes (KEGG). R package version 1, (2021).
Djoumbou Feunang, Y. et al. ClassyFire: automated chemical classification with a comprehensive, computable taxonomy. J. Cheminform. 8, 61 (2016).
pubmed: 27867422
pmcid: 5096306
doi: 10.1186/s13321-016-0174-y
Fahy, E. & Subramaniam, S. RefMet: a reference nomenclature for metabolomics. Nat. Methods 17, 1173–1174 (2020).
pubmed: 33199890
doi: 10.1038/s41592-020-01009-y
Kanehisa, M., Furumichi, M., Tanabe, M., Sato, Y. & Morishima, K. KEGG: new perspectives on genomes, pathways, diseases and drugs. Nucleic Acids Res. 45, D353–d361 (2017).
pubmed: 27899662
doi: 10.1093/nar/gkw1092
Wishart, D. S. et al. MiMeDB: the Human Microbial Metabolome Database. Nucleic Acids Res. 51, D611–d620 (2023).
pubmed: 36215042
doi: 10.1093/nar/gkac868
Wishart, D. et al. T3DB: the toxic exposome database. Nucleic Acids Res. 43, D928–D934 (2015).
pubmed: 25378312
doi: 10.1093/nar/gku1004
Rothwell, J. A. et al. Phenol-Explorer 3.0: a major update of the Phenol-Explorer database to incorporate data on the effects of food processing on polyphenol content. Database 2013, bat070 (2013).
pubmed: 24103452
pmcid: 3792339
doi: 10.1093/database/bat070
Neveu, V. et al. Exposome-Explorer: a manually-curated database on biomarkers of exposure to dietary and environmental factors. Nucleic Acids Res. 45, D979–d984 (2017).
pubmed: 27924041
doi: 10.1093/nar/gkw980
Mohammed Taha, H. et al. The NORMAN Suspect List Exchange (NORMAN-SLE): facilitating European and worldwide collaboration on suspect screening in high resolution mass spectrometry. Environ. Sci. Eur. 34, 104 (2022).
pubmed: 36284750
pmcid: 9587084
doi: 10.1186/s12302-022-00680-6
Gabriela VCF, David, K., Matías, S.-B. & Ezequiel, S. Robust elastic net estimators for variable selection and identification of proteomic biomarkers. Ann. Appl. Stat. 13, 2065–2090 (2019).
Kepplinger, D. Robust variable selection and estimation via adaptive elastic net S-estimators for linear regression. Comput. Stat. Data. Anal. 183, 107730 (2023).
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
pubmed: 23051804
pmcid: 3471674
doi: 10.1038/nbt.2377
Du, X., Smirnov, A., Pluskal, T., Jia, W. & Sumner, S. Metabolomics data preprocessing Using ADAP and MZmine 2. Methods Mol. Biol. 2104, 25–48 (2020).
pubmed: 31953811
pmcid: 8359540
doi: 10.1007/978-1-0716-0239-3_3