scDC: single cell differential composition analysis.
Composition analysis
RNA-seq
Single cell
scRNA-seq
Journal
BMC bioinformatics
ISSN: 1471-2105
Abbreviated title: BMC Bioinformatics
Country: England
NLM ID: 100965194
Publication information
Publication date:
24 Dec 2019
History:
received: 07 Nov 2019
accepted: 12 Nov 2019
entrez: 25 Dec 2019
pubmed: 25 Dec 2019
medline: 27 Feb 2020
Status:
epublish
Abstract
Abstract sections
BACKGROUND
Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advances in single cell sequencing technologies enable cell types to be identified at the single cell level, so the cell-type composition of tissues can now be studied in fine detail. However, two challenges remain for cell-type composition analysis: no existing method identifies cell types perfectly, and every single cell experiment carries variability from cell sampling. This necessitates the development of a method for estimating uncertainty in cell-type composition.
RESULTS
We developed a novel single cell differential composition (scDC) analysis method that performs differential cell-type composition analysis via bootstrap resampling. scDC captures the uncertainty associated with cell-type proportions of each subject via bias-corrected and accelerated bootstrap confidence intervals. We assessed the performance of our method using a number of simulated datasets and synthetic datasets curated from publicly available single cell datasets. In simulated datasets, scDC correctly recovered the true cell-type proportions. In synthetic datasets, the cell-type compositions returned by scDC were highly concordant with reference cell-type compositions from the original data. Since the majority of datasets tested in this study have only 2 to 5 subjects per condition, the addition of confidence intervals enabled better comparisons of compositional differences between subjects and across conditions.
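The bias-corrected and accelerated (BCa) bootstrap interval mentioned above can be illustrated with a minimal sketch. It is not the scdney implementation: the function name `bca_proportion_ci`, its parameters, and the toy labels are hypothetical, and the sketch resamples fixed cell-type labels for one subject only, without any re-clustering or cell-type reassignment between bootstrap rounds.

```python
import random
from statistics import NormalDist

_STD_NORMAL = NormalDist()


def bca_proportion_ci(labels, cell_type, n_boot=2000, alpha=0.05, seed=0):
    """BCa bootstrap interval for the proportion of `cell_type` in `labels`.

    Hypothetical helper for illustration; assumes at least two cells.
    """
    rng = random.Random(seed)
    n = len(labels)
    hits = [1 if lab == cell_type else 0 for lab in labels]
    x = sum(hits)
    theta_hat = x / n

    # Bootstrap: resample cells with replacement and recompute the proportion.
    boot = sorted(sum(rng.choices(hits, k=n)) / n for _ in range(n_boot))

    # Bias correction z0: share of replicates below the estimate.
    # Ties count half and the share is clamped away from 0 and 1
    # (both are choices made for this sketch, since proportions are discrete).
    below = sum(b < theta_hat for b in boot)
    ties = sum(b == theta_hat for b in boot)
    frac = (below + 0.5 * ties) / n_boot
    frac = min(max(frac, 1 / (n_boot + 1)), n_boot / (n_boot + 1))
    z0 = _STD_NORMAL.inv_cdf(frac)

    # Acceleration from the jackknife (leave-one-cell-out proportions).
    loo_hit = (x - 1) / (n - 1)   # a cell of the target type was left out
    loo_miss = x / (n - 1)        # a cell of another type was left out
    jack = [loo_hit] * x + [loo_miss] * (n - x)
    jack_mean = sum(jack) / n
    num = sum((jack_mean - j) ** 3 for j in jack)
    den = 6 * sum((jack_mean - j) ** 2 for j in jack) ** 1.5
    accel = num / den if den > 0 else 0.0

    def adjusted_quantile(p):
        # Shift the nominal quantile level p by z0 and accel, then read it
        # off the sorted bootstrap distribution.
        z = _STD_NORMAL.inv_cdf(p)
        level = _STD_NORMAL.cdf(z0 + (z0 + z) / (1 - accel * (z0 + z)))
        idx = min(max(int(level * n_boot), 0), n_boot - 1)
        return boot[idx]

    return theta_hat, adjusted_quantile(alpha / 2), adjusted_quantile(1 - alpha / 2)


# Toy example: 100 cells from one subject, 30 of them labelled "T cell".
labels = ["T cell"] * 30 + ["B cell"] * 70
estimate, lower, upper = bca_proportion_ci(labels, "T cell")
print(f"proportion={estimate:.2f}, 95% BCa interval=({lower:.2f}, {upper:.2f})")
```

With only 2 to 5 subjects per condition, an interval of this kind for each subject indicates how much of an apparent difference in composition could be explained by cell sampling alone.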
CONCLUSIONS
scDC is a novel statistical method for performing differential cell-type composition analysis for scRNA-seq data. It uses bootstrap resampling to estimate the standard errors associated with cell-type proportion estimates and performs significance testing through GLM and GLMM models. We have made this method available to the scientific community as part of the scdney (Single Cell Data Integrative Analysis) R package, available from https://github.com/SydneyBioX/scdney.
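As a rough illustration of GLM-based significance testing for a composition difference, the sketch below fits a two-parameter binomial GLM (logit link) for a single cell type and reports a Wald test for the condition effect. The abstract does not specify the model formulas used by scDC, so this is an assumed stand-in rather than the authors' model: the function name `binomial_glm_condition_test`, the 0/1 condition coding, and the toy counts are all hypothetical, and the sketch has no subject-level random effect (the GLMM case) and does not pool results across bootstrap replicates.

```python
import math
from statistics import NormalDist


def _score_and_information(b0, b1, counts, totals, condition):
    """Score vector and Fisher information for logit(p) = b0 + b1 * x."""
    g0 = g1 = h00 = h01 = h11 = 0.0
    for y, n, x in zip(counts, totals, condition):
        p = 1.0 / (1.0 + math.exp(-(b0 + b1 * x)))
        w = n * p * (1.0 - p)   # binomial variance weight
        r = y - n * p           # observed minus expected count
        g0 += r
        g1 += r * x
        h00 += w
        h01 += w * x
        h11 += w * x * x
    return g0, g1, h00, h01, h11


def binomial_glm_condition_test(counts, totals, condition, n_iter=50, tol=1e-10):
    """Fit the GLM by Newton-Raphson and Wald-test the condition coefficient.

    counts[i]    cells of the cell type of interest in subject i
    totals[i]    all cells sequenced for subject i
    condition[i] 0 for control, 1 for treated (hypothetical coding)
    """
    b0 = b1 = 0.0
    for _ in range(n_iter):
        g0, g1, h00, h01, h11 = _score_and_information(b0, b1, counts, totals, condition)
        det = h00 * h11 - h01 * h01
        # Newton step: solve the 2x2 system (information) * delta = score.
        d0 = (h11 * g0 - h01 * g1) / det
        d1 = (h00 * g1 - h01 * g0) / det
        b0 += d0
        b1 += d1
        if abs(d0) + abs(d1) < tol:
            break

    # Standard error of b1 from the inverse information at the fitted values.
    _, _, h00, h01, h11 = _score_and_information(b0, b1, counts, totals, condition)
    se_b1 = math.sqrt(h00 / (h00 * h11 - h01 * h01))
    z = b1 / se_b1
    p_value = 2.0 * (1.0 - NormalDist().cdf(abs(z)))
    return b1, se_b1, z, p_value


# Toy data: three control and three treated subjects, 100 cells each.
counts = [30, 35, 25, 50, 55, 45]
totals = [100] * 6
condition = [0, 0, 0, 1, 1, 1]
b1, se_b1, z, p_value = binomial_glm_condition_test(counts, totals, condition)
print(f"log-odds change={b1:.3f}, SE={se_b1:.3f}, z={z:.2f}, p={p_value:.2g}")
```

A fixed-effects fit like this treats every cell as independent, which tends to understate uncertainty when subjects differ from one another; that is the situation a GLMM with a subject-level random effect is meant to handle.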
Identifiers
pubmed: 31870280
doi: 10.1186/s12859-019-3211-9
pii: 10.1186/s12859-019-3211-9
pmc: PMC6929335
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
721