Chromatin-accessibility estimation from single-cell ATAC-seq data with scOpen.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
04 11 2021
Historique:
received: 17 05 2021
accepted: 04 10 2021
entrez: 5 11 2021
pubmed: 6 11 2021
medline: 28 12 2021
Statut: epublish

Résumé

A major drawback of single-cell ATAC-seq (scATAC-seq) is its sparsity, i.e., open chromatin regions with no reads due to loss of DNA material during the scATAC-seq protocol. Here, we propose scOpen, a computational method based on regularized non-negative matrix factorization for imputing and quantifying the open chromatin status of regulatory regions from sparse scATAC-seq experiments. We show that scOpen improves crucial downstream analysis steps of scATAC-seq data as clustering, visualization, cis-regulatory DNA interactions, and delineation of regulatory features. We demonstrate the power of scOpen to dissect regulatory changes in the development of fibrosis in the kidney. This identifies a role of Runx1 and target genes by promoting fibroblast to myofibroblast differentiation driving kidney fibrosis.

Identifiants

pubmed: 34737275
doi: 10.1038/s41467-021-26530-2
pii: 10.1038/s41467-021-26530-2
pmc: PMC8568974
doi:

Substances chimiques

Chromatin 0
DNA 9007-49-2

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

6386

Informations de copyright

© 2021. The Author(s).

Références

Buenrostro, J. D., Giresi, P. G., Zaba, L. C., Chang, H. Y. & Greenleaf, W. J. Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position. Nat. Methods 10, 1213–1218 (2013).
pubmed: 24097267 pmcid: 3959825 doi: 10.1038/nmeth.2688
Corces, M. R. et al. The chromatin accessibility landscape of primary human cancers. Science 362, eaav1898 (2018).
pubmed: 30361341 pmcid: 6408149 doi: 10.1126/science.aav1898
Schep, A. N. et al. Structured nucleosome fingerprints enable high-resolution mapping of chromatin architecture within regulatory regions. Genome Res. 25, 1757–1770 (2015).
pubmed: 26314830 pmcid: 4617971 doi: 10.1101/gr.192294.115
Li, Z. et al. Identification of transcription factor binding sites using ATAC-seq. Genome Biol. 20, 45 (2019).
pubmed: 30808370 pmcid: 6391789 doi: 10.1186/s13059-019-1642-2
Buenrostro, J. D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486 (2015).
pubmed: 26083756 pmcid: 4685948 doi: 10.1038/nature14590
Buenrostro, J. D. et al. Integrated single-cell analysis maps the continuous regulatory landscape of human hematopoietic differentiation. Cell 173, 1535–1548 (2018).
pubmed: 29706549 pmcid: 5989727 doi: 10.1016/j.cell.2018.03.074
Domcke, S. et al. A human cell atlas of fetal chromatin accessibility. Science 370, eaba7612 (2020).
Satpathy, A. T. et al. Transcript-indexed ATAC-seq for precision immune profiling. Nat. Med. 24, 580–590 (2018).
pubmed: 29686426 pmcid: 5948148 doi: 10.1038/s41591-018-0008-8
Zamanighomi, M. et al. Unsupervised clustering and epigenetic classification of single cells. Nat. Commun. 9, 2410 (2018).
pubmed: 29925875 pmcid: 6010417 doi: 10.1038/s41467-018-04629-3
Bravo González-Blas, C. et al. cisTopic: cis-regulatory topic modeling on single-cell ATAC-seq data. Nat. Methods 16, 397–400 (2019).
pubmed: 30962623 doi: 10.1038/s41592-019-0367-1
Fang, R. et al. Comprehensive analysis of single cell atac-seq data with snapatac. Nat. Commun. 12, 1–15 (2021).
doi: 10.1038/s41467-021-21583-9
Schep, A. N., Wu, B., Buenrostro, J. D. & Greenleaf, W. J. chromVAR: inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods 14, 975 (2017).
pubmed: 28825706 pmcid: 5623146 doi: 10.1038/nmeth.4401
Pliner, H. A. et al. Cicero predicts cis-regulatory DNA interactions from single-cell chromatin accessibility data. Mol. Cell 71, 858–871 (2018).
pubmed: 30078726 pmcid: 6582963 doi: 10.1016/j.molcel.2018.06.044
Van Dijk, D. et al. Recovering gene interactions from single-cell data using data diffusion. Cell 174, 716–729 (2018).
pubmed: 29961576 pmcid: 6771278 doi: 10.1016/j.cell.2018.05.061
Gong, W., Kwak, I.-Y., Pota, P., Koyano-Nakagawa, N. & Garry, D. J. DrImpute: imputing dropout events in single cell RNA sequencing data. BMC Bioinforma. 19, 220 (2018).
doi: 10.1186/s12859-018-2226-y
Becht, E. et al. Dimensionality reduction for visualizing single-cell data using UMAP. Nat. Biotechnol. 37, 38 (2019).
doi: 10.1038/nbt.4314
Li, W. V. & Li, J. J. An accurate and robust imputation method scImpute for single-cell RNA-seq data. Nat. Commun. 9, 997 (2018).
pubmed: 29520097 pmcid: 5843666 doi: 10.1038/s41467-018-03405-7
Eraslan, G., Simon, L. M., Mircea, M., Mueller, N. S. & Theis, F. J. Single-cell RNA-seq denoising using a deep count autoencoder. Nat. Commun. 10, 390 (2019).
pubmed: 30674886 pmcid: 6344535 doi: 10.1038/s41467-018-07931-2
Huang, M. et al. SAVER: gene expression recovery for single-cell RNA sequencing. Nat. Methods 15, 539 (2018).
pubmed: 29941873 pmcid: 6030502 doi: 10.1038/s41592-018-0033-z
Xiong, L. et al. SCALE method for single-cell ATAC-seq analysis via latent feature extraction. Nat. Commun. 10, 1–10 (2019).
doi: 10.1038/s41467-019-12630-7
Chen, H. et al. Assessment of computational methods for the analysis of single-cell atac-seq data. Genome Biol. 20, 1–25 (2019).
doi: 10.1186/s13059-019-1854-5
Cichocki, A. & Phan, A.-H. Fast local algorithms for large scale nonnegative matrix and tensor factorizations. IEICE Trans. Fundam. 92, 708–721 (2009).
doi: 10.1587/transfun.E92.A.708
Satopaa, V., Albrecht, J., Irwin, D. & Raghavan, B. Finding a “ kneedle” in a haystack: detecting knee points in system behavior. In 2011 31st International Conference on Distributed Computing Systems Workshops, (ed. Lisa, C.) 166–171 (IEEE, 2011).
Hou, W., Ji, Z., Ji, H. & Hicks, S. C. A systematic evaluation of single-cell RNA-sequencing imputation methods. Genome Biol. 21, 218 (2020).
pubmed: 32854757 pmcid: 7450705 doi: 10.1186/s13059-020-02132-x
Li, R. & Quon, G. scBFA: modeling detection patterns to mitigate technical noise in large-scale single-cell genomics data. Genome Biol. 20, 193 (2019).
pubmed: 31500668 pmcid: 6734238 doi: 10.1186/s13059-019-1806-0
Josse, J. & Husson, F. et al. missMDA: a package for handling missing values in multivariate data analysis. J. Stat. Softw. 70, 1–31 (2016).
doi: 10.18637/jss.v070.i01
Hubert, L. & Arabie, P. Comparing partitions. J. Classif. 2, 193–218 (1985).
doi: 10.1007/BF01908075
Becht, E. et al. Dimensionality reduction for visualizing single-cell data using umap. Nat. Biotechnol. 37, 38–44 (2019).
doi: 10.1038/nbt.4314
Kramann, R. et al. Pharmacological GLI2 inhibition prevents myofibroblast cell-cycle progression and reduces kidney fibrosis. J. Clin. Investig. 125, 2935–2951 (2015).
pubmed: 26193634 pmcid: 4563736 doi: 10.1172/JCI74929
Kramann, R. et al. Perivascular Gli1+ progenitors are key contributors to injury-induced organ fibrosis. Cell Stem Cell 16, 51–66 (2015).
pubmed: 25465115 doi: 10.1016/j.stem.2014.11.004
Korsunsky, I. et al. Fast, sensitive and accurate integration of single-cell data with harmony. Nat. Methods 16, 1289–1296 (2019).
pubmed: 31740819 pmcid: 6884693 doi: 10.1038/s41592-019-0619-0
Wu, H., Kirita, Y., Donnelly, E. L. & Humphreys, B. D. Advantages of Single-nucleus over single-cell RNA sequencing of adult kidney: rare cell types and novel cell states revealed in fibrosis. J. Am. Soc. Nephrol. 30, 23–32 (2019).
pubmed: 30510133 doi: 10.1681/ASN.2018090912
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 (2019).
pubmed: 31178118 pmcid: 6687398 doi: 10.1016/j.cell.2019.05.031
Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021).
pubmed: 33633365 pmcid: 8012210 doi: 10.1038/s41588-021-00790-6
Bábíčková, J. et al. Regardless of etiology, progressive renal disease causes ultrastructural and functional alterations of peritubular capillaries. Kidney Int. 91, 70–85 (2017).
pubmed: 27678159 doi: 10.1016/j.kint.2016.07.038
Kramann, R. et al. Parabiosis and single-cell RNA sequencing reveal a limited contribution of monocytes to myofibroblasts in kidney fibrosis. JCI Insight 3, e99561 (2018).
Vaidya, V. S., Ramirez, V., Ichimura, T., Bobadilla, N. A. & Bonventre, J. V. Urinary kidney injury molecule-1: a sensitive quantitative biomarker for early detection of kidney tubular injury. Am. J. Physiol. Ren. Physiol. 290, F517–29 (2006).
doi: 10.1152/ajprenal.00291.2005
Sugawara, A., Sanno, N., Takahashi, N., Osamura, R. Y. & Abe, K. Retinoid X receptors in the kidney: their protein expression and functional significance. Endocrinology 138, 3175–80 (1997).
pubmed: 9231765 doi: 10.1210/endo.138.8.5351
Marable, S. S., Chung, E., Adam, M., Potter, S. S. & Park, J.-S. Hnf4a deletion in the mouse kidney phenocopies Fanconi renotubular syndrome. JCI Insight 3, 354–80 (2018).
doi: 10.1172/jci.insight.97497
Kramann, R., DiRocco, D. P. & Humphreys, B. D. Understanding the origin, activation and regulation of matrix-producing myofibroblasts for treatment of fibrotic disease. J. Pathol. 231, 273–289 (2013).
pubmed: 24006178 doi: 10.1002/path.4253
Kirita, Y., Wu, H., Uchimura, K., Wilson, P. C. & Humphreys, B. D. Cell profiling of mouse acute kidney injury reveals conserved cellular responses to injury. Proc. Natl Acad. Sci. USA 117, 15874–15883 (2020).
pubmed: 32571916 pmcid: 7355049 doi: 10.1073/pnas.2005477117
Markó, L. et al. Tubular epithelial NF-κB activity regulates ischemic AKI. J. Am. Soc. Nephrol. 27, 2658–2669 (2016).
pubmed: 26823548 pmcid: 5004652 doi: 10.1681/ASN.2015070748
Kuppe, C. et al. Decoding myofibroblast origins in human kidney fibrosis. Nature 589, 281–286 (2021).
pubmed: 33176333 doi: 10.1038/s41586-020-2941-1
Muhl, L. et al. Single-cell analysis uncovers fibroblast heterogeneity and criteria for fibroblast and mural cell identification and discrimination. Nat. Commun. 11, 3953 (2020).
pubmed: 32769974 pmcid: 7414220 doi: 10.1038/s41467-020-17740-1
de Bruijn, M. & Dzierzak, E. Runx transcription factors in the development and function of the definitive hematopoietic system. Blood 129, 2061–2069 (2017).
pubmed: 28179276 doi: 10.1182/blood-2016-12-689109
Chan, S. C. et al. Mechanism of fibrosis in HNF1B-related autosomal dominant tubulointerstitial kidney disease. J. Am. Soc. Nephrol. 29, 2493–2509 (2018).
pubmed: 30097458 pmcid: 6171276 doi: 10.1681/ASN.2018040437
Henderson, N. C. et al. Targeting of αv integrin depletion identifies a core, targetable molecular pathway that regulates fibrosis across solid organs. Nat. Med. 19, 1617–1624 (2013).
pubmed: 24216753 doi: 10.1038/nm.3282
Buenrostro, J. D., Wu, B., Chang, H. Y. & Greenleaf, W. J. ATAC- seq: a method for assaying chromatin accessibility genome-wide. Curr. Protoc. Mol. Biol. 109, 21–29 (2015).
pubmed: 25559105 pmcid: 4374986 doi: 10.1002/0471142727.mb2129s109
Stuart, T., Srivastava, A., Lareau, C. & Satija, R. Multimodal single-cell chromatin analysis with Signac. Preprint at https://doi.org/10.1101/2020.11.09.373613 (2020).
Andrews, T. S. & Hemberg, M. False signals induced by single-cell imputation. F1000Research 7, 1740 (2018).
pubmed: 30906525 doi: 10.12688/f1000research.16613.1
Welch, J. D. et al. Single-cell multi-omic integration compares and contrasts features of brain cell identity. Cell 177, 1873–1887 (2019).
pubmed: 31178122 pmcid: 6716797 doi: 10.1016/j.cell.2019.05.006
Lal, A. et al. Deep learning-based enhancement of epigenomics data with AtacWorks. Nat. Commun. 12, 1507 (2021).
pubmed: 33686069 pmcid: 7940635 doi: 10.1038/s41467-021-21765-5
Rotem, A. et al. Single-cell ChIP-seq reveals cell subpopulations defined by chromatin state. Nat. Biotechnol. 33, 1165–1172 (2015).
pubmed: 26458175 pmcid: 4636926 doi: 10.1038/nbt.3383
Bartosovic, M., Kabbe, M. & Castelo-Branco, G. Single-cell CUT&Tag profiles histone modifications and transcription factors in complex tissues. Nat. Biotechnol. 39, 825–835 (2021).
pubmed: 33846645 pmcid: 7611252 doi: 10.1038/s41587-021-00869-9
Kaya-Okur, H. S. et al. CUT&Tag for efficient epigenomic profiling of small samples and single cells. Nat. Commun. 10, 1930 (2019).
pubmed: 31036827 pmcid: 6488672 doi: 10.1038/s41467-019-09982-5
Smallwood, S. A. et al. Single-cell genome-wide bisulfite sequencing for assessing epigenetic heterogeneity. Nat. Methods 11, 817–820 (2014).
pubmed: 25042786 pmcid: 4117646 doi: 10.1038/nmeth.3035
Zhou, T. et al. Runt-related transcription factor 1 (RUNX1) promotes TGF-β-induced renal tubular epithelial-to-mesenchymal transition (EMT) and renal fibrosis through the PI3K subunit p110δ. EBioMedicine 31, 217–225 (2018).
pubmed: 29759484 pmcid: 6013935 doi: 10.1016/j.ebiom.2018.04.023
Kim, W. et al. RUNX1 is essential for mesenchymal stem cell proliferation and myofibroblast differentiation. PNAS 111, 16389–16394 (2014).
pubmed: 25313057 pmcid: 4246299 doi: 10.1073/pnas.1407097111
Koth, J. et al. Runx1 promotes scar deposition and inhibits myocardial proliferation and survival during zebrafish heart regeneration. Development 147, dev186569 (2020).
Hsieh, C. -J. & Dhillon, I. S. Fast coordinate descent methods with variable selection for non-negative matrix factorization. SIGKDD 1064–1072 (2011).
Corces, M. R. et al. Lineage-specific and single-cell chromatin accessibility charts human hematopoiesis and leukemia evolution. Nat. Genet. 48, 1193–1203 (2016).
pubmed: 27526324 pmcid: 5042844 doi: 10.1038/ng.3646
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet. J. 17, 10–12 (2011).
doi: 10.14806/ej.17.1.200
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357 (2012).
pubmed: 22388286 pmcid: 3322381 doi: 10.1038/nmeth.1923
Li, H. et al. The sequence alignment/map format and samtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352
Zhang, Y. et al. Model-based analysis of ChIP-Seq (MACS). Genome Biol. 9, R137 (2008).
pubmed: 18798982 pmcid: 2592715 doi: 10.1186/gb-2008-9-9-r137
Davis, J. & Goadrich, M. The relationship between precision-recall and roc curves. In Proceedings of the 23rd International Conference on Machine Learning (eds. William, C. & Andrew, M.) 233–240 (ACM, 2006).
Raimundo, F., Vallot, C. & Vert, J.-P. Tuning parameters of dimensionality reduction methods for single-cell rna-seq analysis. Genome Biol. 21, 1–17 (2020).
doi: 10.1186/s13059-020-02128-7
Maaten, Lvd & Hinton, G. Visualizing data using t-SNE. JMLR 9, 2579–2605 (2008).
Cusanovich, D. A. et al. A single-cell atlas of in vivo mammalian chromatin accessibility. Cell 174, 1309–1324 (2018).
pubmed: 30078704 pmcid: 6158300 doi: 10.1016/j.cell.2018.06.052
Khan, A. et al. JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework. Nucleic Acids Res. 46, D260–D266 (2018).
pubmed: 29140473 doi: 10.1093/nar/gkx1126
Cairns, J. et al. Chicago: robust detection of dna looping interactions in capture hi-c data. Genome Biol. 17, 127 (2016).
pubmed: 27306882 pmcid: 4908757 doi: 10.1186/s13059-016-0992-2
Grau, J., Grosse, I. & Keilwagen, J. Prroc: computing and visualizing precision-recall and receiver operating characteristic curves in r. Bioinformatics 31, 2595–2597 (2015).
pubmed: 25810428 pmcid: 4514923 doi: 10.1093/bioinformatics/btv153
Fornes, O. et al. Jaspar 2020: update of the open-access database of transcription factor binding profiles. Nucleic Acids Res. 48, D87–D92 (2019).
pmcid: 7145627
Angerer, P. et al. destiny: diffusion maps for large-scale single-cell data in r. Bioinformatics 32, 1241–1243 (2016).
pubmed: 26668002 doi: 10.1093/bioinformatics/btv715
Patel, H. et al. nf-core/rnaseq: nf-core/rnaseq v3.0—Silver Shark https://doi.org/10.5281/zenodo.4323183 (2020).
Dobin, A. et al. Star: ultrafast universal rna-seq aligner. Bioinformatics 29, 15–21 (2013).
doi: 10.1093/bioinformatics/bts635 pubmed: 23104886
Patro, R., Duggal, G., Love, M. I., Irizarry, R. A. & Kingsford, C. Salmon provides fast and bias-aware quantification of transcript expression. Nat. Methods 14, 417–419 (2017).
pubmed: 28263959 pmcid: 5600148 doi: 10.1038/nmeth.4197
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for rna-seq data with deseq2. Genome Biol. 15, 1–21 (2014).
doi: 10.1186/s13059-014-0550-8
Li, Z. Chromatin-accessibility Estimation from Single-cell Atac Data with Scopen https://doi.org/10.5281/zenodo.5513693 (2021).

Auteurs

Zhijian Li (Z)

Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, 52074, Aachen, Germany.

Christoph Kuppe (C)

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University Medical School, 52074, Aachen, Germany.
Division of Nephrology and Clinical Immunology, RWTH Aachen University, 52074, Aachen, Germany.

Susanne Ziegler (S)

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University Medical School, 52074, Aachen, Germany.

Mingbo Cheng (M)

Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, 52074, Aachen, Germany.

Nazanin Kabgani (N)

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University Medical School, 52074, Aachen, Germany.

Sylvia Menzel (S)

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University Medical School, 52074, Aachen, Germany.

Martin Zenke (M)

Department of Cell Biology, Institute of Biomedical Engineering, RWTH Aachen University Medical School, 52074, Aachen, Germany.
Helmholtz Institute for Biomedical Engineering, RWTH Aachen University, Aachen, Germany.

Rafael Kramann (R)

Institute of Experimental Medicine and Systems Biology, RWTH Aachen University Medical School, 52074, Aachen, Germany. rkramann@gmx.net.
Division of Nephrology and Clinical Immunology, RWTH Aachen University, 52074, Aachen, Germany. rkramann@gmx.net.
Department of Internal Medicine, Nephrology and Transplantation, Erasmus Medical Center, 3015GD, Rotterdam, The Netherlands. rkramann@gmx.net.

Ivan G Costa (IG)

Institute for Computational Genomics, Joint Research Center for Computational Biomedicine, RWTH Aachen University Medical School, 52074, Aachen, Germany. ivan.costa@rwth-aachen.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Animals Epigenesis, Genetic DNA Methylation Skates, Fish CpG Islands

Classifications MeSH