MousiPLIER: A Mouse Pathway-Level Information Extractor Model.
Journal
eNeuro
ISSN: 2373-2822
Titre abrégé: eNeuro
Pays: United States
ID NLM: 101647362
Informations de publication
Date de publication:
24 May 2024
24 May 2024
Historique:
received:
23
08
2023
revised:
10
05
2024
accepted:
16
05
2024
medline:
25
5
2024
pubmed:
25
5
2024
entrez:
24
5
2024
Statut:
aheadofprint
Résumé
High throughput gene expression profiling measures individual gene expression across conditions. However, genes are regulated in complex networks, not as individual entities, limiting the interpretability of gene expression data. Machine learning models that incorporate prior biological knowledge are a powerful tool to extract meaningful biology from gene expression data. Pathway-level information extractor (PLIER) is an unsupervised machine learning method that defines biological pathways by leveraging the vast amount of published transcriptomic data. PLIER converts gene expression data into known pathway gene sets, termed latent variables (LVs), to substantially reduce data dimensionality and improve interpretability. In the current study, we trained the first mouse PLIER model on 190,111 mouse brain RNA-sequencing samples, the greatest amount of training data ever used by PLIER. We then validated the mousiPLIER approach in a study of microglia and astrocyte gene expression across mouse brain aging. mousiPLIER identified biological pathways that are significantly associated with aging, including one latent variable (LV41) corresponding to striatal signal. To gain further insight into the genes contained in LV41, we performed k-means clustering on the training data to identify studies that respond strongly to LV41. We found that the variable was relevant to striatum and aging across the scientific literature. Finally, we built a web server (http://mousiplier.greenelab.com/) for users to easily explore the learned latent variables. Taken together this study defines mousiPLIER as a method to uncover meaningful biological processes in mouse brain transcriptomic studies.
Identifiants
pubmed: 38789274
pii: ENEURO.0313-23.2024
doi: 10.1523/ENEURO.0313-23.2024
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
Copyright © 2024 Zhang et al.