MousiPLIER: A Mouse Pathway-Level Information Extractor Model.


Journal

eNeuro
ISSN: 2373-2822
Titre abrégé: eNeuro
Pays: United States
ID NLM: 101647362

Informations de publication

Date de publication:
24 May 2024
Historique:
received: 23 08 2023
revised: 10 05 2024
accepted: 16 05 2024
medline: 25 5 2024
pubmed: 25 5 2024
entrez: 24 5 2024
Statut: aheadofprint

Résumé

High throughput gene expression profiling measures individual gene expression across conditions. However, genes are regulated in complex networks, not as individual entities, limiting the interpretability of gene expression data. Machine learning models that incorporate prior biological knowledge are a powerful tool to extract meaningful biology from gene expression data. Pathway-level information extractor (PLIER) is an unsupervised machine learning method that defines biological pathways by leveraging the vast amount of published transcriptomic data. PLIER converts gene expression data into known pathway gene sets, termed latent variables (LVs), to substantially reduce data dimensionality and improve interpretability. In the current study, we trained the first mouse PLIER model on 190,111 mouse brain RNA-sequencing samples, the greatest amount of training data ever used by PLIER. We then validated the mousiPLIER approach in a study of microglia and astrocyte gene expression across mouse brain aging. mousiPLIER identified biological pathways that are significantly associated with aging, including one latent variable (LV41) corresponding to striatal signal. To gain further insight into the genes contained in LV41, we performed k-means clustering on the training data to identify studies that respond strongly to LV41. We found that the variable was relevant to striatum and aging across the scientific literature. Finally, we built a web server (http://mousiplier.greenelab.com/) for users to easily explore the learned latent variables. Taken together this study defines mousiPLIER as a method to uncover meaningful biological processes in mouse brain transcriptomic studies.

Identifiants

pubmed: 38789274
pii: ENEURO.0313-23.2024
doi: 10.1523/ENEURO.0313-23.2024
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

Copyright © 2024 Zhang et al.

Auteurs

Shuo Zhang (S)

Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Philadelphia, PA 19104, USA.
Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Benjamin J Heil (BJ)

Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Wayne Mao (W)

Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA.

Maria Chikina (M)

Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15260, USA.

Casey S Greene (CS)

Department of Pharmacology, University of Colorado School of Medicine, Denver, CO 80045, USA casey.s.greene@cuanschutz.edu eheller@pennmedicine.upenn.edu.
Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Denver, CO 80045, USA.

Elizabeth A Heller (EA)

Department of Systems Pharmacology and Translational Therapeutics, University of Pennsylvania, Philadelphia, PA 19104, USA casey.s.greene@cuanschutz.edu eheller@pennmedicine.upenn.edu.
Penn Epigenetics Institute, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.

Classifications MeSH