EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data.
Journal
IEEE/ACM transactions on computational biology and bioinformatics
ISSN: 1557-9964
Titre abrégé: IEEE/ACM Trans Comput Biol Bioinform
Pays: United States
ID NLM: 101196755
Informations de publication
Date de publication:
Historique:
pubmed:
9
9
2021
medline:
22
1
2022
entrez:
8
9
2021
Statut:
ppublish
Résumé
The advancements of single-cell RNA sequencing (scRNA-seq) technologies have provided us unprecedented opportunities to characterize cellular states and investigate the mechanisms of complex diseases. Due to technical issues such as dropout events, scRNA-seq data contains excess of false zero counts, which has a substantial impact on the downstream analyses. Although several computational approaches have been proposed to impute dropout events in scRNA-seq data, there is no strong consensus on which is the best approach. In this study, we propose a novel weighted ensemble learning method, named EnTSSR, to impute dropout events in scRNA-seq data. By using a multi-view two-side sparse self-representation framework, our model can exploit the consensus similarities between genes and between cells based on the imputed results of various imputation methods. Moreover, we introduce a weighted ensemble strategy to leverage the information captured by various imputation methods effectively. Down-sampling experiments, clustering analysis, differential expression analysis and cell trajectory inference are carried out to evaluate the performance of our proposed model. Experiment results demonstrate that our EnTSSR can effectively recover the true expression pattern of scRNA-seq data.
Identifiants
pubmed: 34495837
doi: 10.1109/TCBB.2021.3110850
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM