Enrichment on steps, not genes, improves inference of differentially expressed pathways.


Journal

PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922

Informations de publication

Date de publication:
25 Mar 2024
Historique:
received: 13 09 2023
accepted: 05 03 2024
medline: 25 3 2024
pubmed: 25 3 2024
entrez: 25 3 2024
Statut: aheadofprint

Résumé

Enrichment analysis is frequently used in combination with differential expression data to investigate potential commonalities amongst lists of genes and generate hypotheses for further experiments. However, current enrichment analysis approaches on pathways ignore the functional relationships between genes in a pathway, particularly OR logic that occurs when a set of proteins can each individually perform the same step in a pathway. As a result, these approaches miss pathways with large or multiple sets because of an inflation of pathway size (when measured as the total gene count) relative to the number of steps. We address this problem by enriching on step-enabling entities in pathways. We treat sets of protein-coding genes as single entities, and we also weight sets to account for the number of genes in them using the multivariate Fisher's noncentral hypergeometric distribution. We then show three examples of pathways that are recovered with this method and find that the results have significant proportions of pathways not found in gene list enrichment analysis.

Identifiants

pubmed: 38527066
doi: 10.1371/journal.pcbi.1011968
pii: PCOMPBIOL-D-23-01455
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1011968

Informations de copyright

Copyright: © 2024 Markarian et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Auteurs

Nicholas Markarian (N)

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America.
Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America.

Kimberly M Van Auken (KM)

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America.

Dustin Ebert (D)

Division of Bioinformatics, Department of Population and Public Health Sciences, Keck School of Medicine, University of Southern California, Los Angeles, California, United States of America.

Paul W Sternberg (PW)

Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America.

Classifications MeSH