Pathway analysis in metabolomics: Recommendations for the use of over-representation analysis.


Journal

PLoS computational biology
ISSN: 1553-7358
Abbreviated title: PLoS Comput Biol
Country: United States
NLM ID: 101238922

Publication information

Publication date:
September 2021
History:
received: 14 May 2021
accepted: 23 August 2021
revised: 17 September 2021
pubmed: 8 September 2021
medline: 15 December 2021
entrez: 7 September 2021
Status: epublish

Abstract

Over-representation analysis (ORA) is one of the commonest pathway analysis approaches used for the functional interpretation of metabolomics datasets. Despite the widespread use of ORA in metabolomics, the community lacks guidelines detailing its best-practice use. Many factors have a pronounced impact on the results, but to date their effects have received little systematic attention. Using five publicly available datasets, we demonstrated that changes in parameters such as the background set, differential metabolite selection methods, and pathway database used can result in profoundly different ORA results. The use of a non-assay-specific background set, for example, resulted in large numbers of false-positive pathways. Pathway database choice, evaluated using three of the most popular metabolic pathway databases (KEGG, Reactome, and BioCyc), led to vastly different results in both the number and function of significantly enriched pathways. Factors that are specific to metabolomics data, such as the reliability of compound identification and the chemical bias of different analytical platforms, also impacted ORA results. Simulated metabolite misidentification rates as low as 4% resulted in both gain of false-positive pathways and loss of truly significant pathways across all datasets. Our results have several practical implications for ORA users, as well as those using alternative pathway analysis methods. We offer a set of recommendations for the use of ORA in metabolomics, alongside a set of minimal reporting guidelines, as a first step towards the standardisation of pathway analysis in metabolomics.
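
To illustrate why the choice of background set matters, the sketch below computes ORA for a single pathway as a right-tailed hypergeometric test (equivalent to a one-sided Fisher's exact test), which is a common way to implement ORA. It is not the authors' code: the function name, the compound identifiers, and the toy sets are hypothetical, and only the Python standard library is used.

```python
from math import comb


def ora_p_value(background, differential, pathway):
    """Right-tailed hypergeometric p-value for one pathway (illustrative)."""
    background = set(background)
    # Restrict both lists to the background so every count shares one universe.
    differential = set(differential) & background
    pathway = set(pathway) & background
    N, K, n = len(background), len(pathway), len(differential)
    k = len(differential & pathway)  # differential compounds in the pathway
    # P(X >= k) when drawing n compounds without replacement from N.
    tail = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, min(K, n) + 1))
    return tail / comb(N, n)


# Hypothetical toy data: 20 compounds measurable by the assay,
# versus a generic 200-compound "all known compounds" background.
assay_background = {f"c{i}" for i in range(1, 21)}
generic_background = {f"c{i}" for i in range(1, 201)}
pathway = {f"c{i}" for i in range(1, 6)} | {f"c{i}" for i in range(101, 106)}
differential = {"c1", "c2", "c3", "c11"}

p_assay = ora_p_value(assay_background, differential, pathway)
p_generic = ora_p_value(generic_background, differential, pathway)
print(f"assay-specific background: p = {p_assay:.4g}")
print(f"generic background:        p = {p_generic:.4g}")
```

In this toy case the generic background gives a much smaller p-value for the same differential list, because it inflates the universe with compounds the assay could never have detected. That is in line with the abstract's observation that non-assay-specific backgrounds can produce false-positive pathways, although the size of the effect in real data will depend on the dataset and database.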

Identifiers

pubmed: 34492007
doi: 10.1371/journal.pcbi.1009105
pii: PCOMPBIOL-D-21-00895
pmc: PMC8448349

Publication types

Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

e1009105

Grants

Agency: NHLBI NIH HHS
ID: R01 HL133932
Country: United States
Agency: Medical Research Council
ID: MR/R008922/1
Country: United Kingdom
Agency: Wellcome Trust
Country: United Kingdom
Agency: Medical Research Council
ID: MR/S019669/1
Country: United Kingdom
Agency: Wellcome Trust
ID: 222837/Z/21/Z
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/T007974/1
Country: United Kingdom

Conflict of interest statement

The authors have declared that no competing interests exist.

Authors

Cecilia Wieder (C)

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, United Kingdom.

Clément Frainay (C)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Nathalie Poupin (N)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Pablo Rodríguez-Mier (P)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Florence Vinson (F)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Juliette Cooke (J)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.

Rachel PJ Lai (RP)

Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, United Kingdom.

Jacob G Bundy (JG)

Section of Biomolecular Medicine, Division of Systems Medicine, Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, United Kingdom.

Fabien Jourdan (F)

Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse, France.
MetaToul-MetaboHUB, National Infrastructure of Metabolomics and Fluxomics, Toulouse, France.

Timothy Ebbels (T)

Section of Bioinformatics, Division of Systems Medicine, Department of Metabolism, Digestion, and Reproduction, Faculty of Medicine, Imperial College London, London, United Kingdom.
