Benchmarking of methods for DNA methylome deconvolution.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
16 May 2024
Historique:
received: 20 10 2023
accepted: 30 04 2024
medline: 17 5 2024
pubmed: 17 5 2024
entrez: 16 5 2024
Statut: epublish

Résumé

Defining the number and abundance of different cell types in tissues is important for understanding disease mechanisms as well as for diagnostic and prognostic purposes. Typically, this is achieved by immunohistological analyses, cell sorting, or single-cell RNA-sequencing. Alternatively, cell-specific DNA methylome information can be leveraged to deconvolve cell fractions from a bulk DNA mixture. However, comprehensive benchmarking of deconvolution methods and modalities was not yet performed. Here we evaluate 16 deconvolution algorithms, developed either specifically for DNA methylome data or more generically. We assess the performance of these algorithms, and the effect of normalization methods, while modeling variables that impact deconvolution performance, including cell abundance, cell type similarity, reference panel size, method for methylome profiling (array or sequencing), and technical variation. We observe differences in algorithm performance depending on each these variables, emphasizing the need for tailoring deconvolution analyses. The complexity of the reference, marker selection method, number of marker loci and, for sequencing-based assays, sequencing depth have a marked influence on performance. By developing handles to select the optimal analysis configuration, we provide a valuable source of information for studies aiming to deconvolve array- or sequencing-based methylation data.

Identifiants

pubmed: 38755121
doi: 10.1038/s41467-024-48466-z
pii: 10.1038/s41467-024-48466-z
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

4134

Informations de copyright

© 2024. The Author(s).

Références

Pardoll, D. M. The blockade of immune checkpoints in cancer immunotherapy. Nat. Rev. Cancer 12, 252–264 (2012).
pubmed: 22437870 pmcid: 4856023 doi: 10.1038/nrc3239
Lambrechts, D. et al. Phenotype molding of stromal cells in the lung tumor microenvironment. Nat. Med. 24, 1277–1289 (2018).
pubmed: 29988129 doi: 10.1038/s41591-018-0096-5
Jain, R. K. Normalization of tumor vasculature: an emerging concept in antiangiogenic therapy. Science 307, 58–62 (2005).
pubmed: 15637262 doi: 10.1126/science.1104819
Ausserwöger, H. et al. Non-specificity as the sticky problem in therapeutic antibody development. Nat. Rev. Chem. 6, 844–861 (2022).
pubmed: 37117703 doi: 10.1038/s41570-022-00438-x
Schonbrunn, A. Editorial: antibody can get it right: confronting problems of antibody specificity and irreproducibility. Mol. Endocrinol. 28, 1403–1407 (2014).
pubmed: 25184858 pmcid: 4154242 doi: 10.1210/me.2014-1230
Hewitt, S. M., Baskin, D. G., Frevert, C. W., Stahl, W. L. & Rosa-Molinar, E. Controls for immunohistochemistry: the Histochemical Society’s standards of practice for validation of immunohistochemical assays. J. Histochem. Cytochem. 62, 693–697 (2014).
pubmed: 25023613 pmcid: 4212362 doi: 10.1369/0022155414545224
Szilágyi, M. et al. Circulating cell-free nucleic acids: main characteristics and clinical application. Int. J. Mol. Sci. 21, 6827 (2020).
pubmed: 32957662 pmcid: 7555669 doi: 10.3390/ijms21186827
Schwarzenbach, H., Hoon, D. S. & Pantel, K. Cell-free nucleic acids as biomarkers in cancer patients. Nat. Rev. Cancer 11, 426–437 (2011).
pubmed: 21562580 doi: 10.1038/nrc3066
Chan, A. K., Chiu, R. W. & Lo, Y. M., Biochemists, C. S. R. C. O. T. A. O. C. Cell-free nucleic acids in plasma, serum and urine: a new tool in molecular diagnosis. Ann. Clin. Biochem. 40, 122–130 (2003).
pubmed: 12662399 doi: 10.1258/000456303763046030
Swarup, V. & Rajeswari, M. R. Circulating (cell-free) nucleic acids–a promising, non-invasive tool for early detection of several human diseases. FEBS Lett. 581, 795–799 (2007).
pubmed: 17289032 doi: 10.1016/j.febslet.2007.01.051
Huiwen, C., Kate, S., Tatjana, J., Bernard, T. & Joris Robert, V. Expanded knowledge of cell-free DNA biology: potential to broaden the clinical utility. Extracell. Vesicles Circ. Nucleic Acids 3, 216–234 (2022).
Poon, L. L. M., Leung, T. N., Lau, T. K. & Lo, Y. M. D. Presence of fetal RNA in maternal plasma. Clin. Chem. 46, 1832–1834 (2000).
pubmed: 11067820 doi: 10.1093/clinchem/46.11.1832
Wang, X., Park, J., Susztak, K., Zhang, N. R. & Li, M. Bulk tissue cell type deconvolution with multi-subject single-cell expression reference. Nat. Commun. 10, 380 (2019).
pubmed: 30670690 pmcid: 6342984 doi: 10.1038/s41467-018-08023-x
Titus, A. J., Gallimore, R. M., Salas, L. A. & Christensen, B. C. Cell-type deconvolution from DNA methylation: a review of recent applications. Hum. Mol. Genet. 26, R216–R224 (2017).
pubmed: 28977446 pmcid: 5886462 doi: 10.1093/hmg/ddx275
Shen-Orr, S. S. & Gaujoux, R. Computational deconvolution: extracting cell type-specific information from heterogeneous samples. Curr. Opin. Immunol. 25, 571–578 (2013).
pubmed: 24148234 doi: 10.1016/j.coi.2013.09.015
Teschendorff, A. E. & Zheng, S. C. Cell-type deconvolution in epigenome-wide association studies: a review and recommendations. Epigenomics 9, 757–768 (2017).
pubmed: 28517979 doi: 10.2217/epi-2016-0153
Avila Cobos, F., Alquicira-Hernandez, J., Powell, J. E., Mestdagh, P. & De Preter, K. Benchmarking of cell type deconvolution pipelines for transcriptomics data. Nat. Commun. 11, 5650 (2020).
pubmed: 33159064 pmcid: 7648640 doi: 10.1038/s41467-020-19015-1
Cheng, W. C. et al. Intra- and inter-individual variance of gene expression in clinical studies. PLoS ONE 7, e38650 (2012).
pubmed: 22723873 pmcid: 3377725 doi: 10.1371/journal.pone.0038650
Ozsolak, F. & Milos, P. M. RNA sequencing: advances, challenges and opportunities. Nat. Rev. Genet. 12, 87–98 (2011).
pubmed: 21191423 doi: 10.1038/nrg2934
Marguerat, S. & Bahler, J. Coordinating genome expression with cell size. Trends Genet. 28, 560–565 (2012).
pubmed: 22863032 doi: 10.1016/j.tig.2012.07.003
Merbs, S. L. et al. Cell-specific DNA methylation patterns of retina-specific genes. PLoS ONE 7, e32602 (2012).
pubmed: 22403679 pmcid: 3293830 doi: 10.1371/journal.pone.0032602
Hudon Thibeault, A. A. & Laprise, C. Cell-specific DNA methylation signatures in asthma. Genes 10, 932 (2019).
pubmed: 31731604 pmcid: 6896152 doi: 10.3390/genes10110932
Grigoriu, A. et al. Cell specific patterns of methylation in the human placenta. Epigenetics 6, 368–379 (2011).
pubmed: 21131778 pmcid: 3092685 doi: 10.4161/epi.6.3.14196
Bibikova, M. et al. High-throughput DNA methylation profiling using universal bead arrays. Genome Res. 16, 383–393 (2006).
pubmed: 16449502 pmcid: 1415217 doi: 10.1101/gr.4410706
Smith, Z. D., Gu, H., Bock, C., Gnirke, A. & Meissner, A. High-throughput bisulfite sequencing in mammalian genomes. Methods 48, 226–232 (2009).
pubmed: 19442738 pmcid: 2864123 doi: 10.1016/j.ymeth.2009.05.003
Jeong, Y. et al. Systematic evaluation of cell-type deconvolution pipelines for sequencing-based bulk DNA methylomes. Brief. Bioinform. 23, bbac248 (2022).
pubmed: 35794707 pmcid: 9294431 doi: 10.1093/bib/bbac248
Song, J. & Kuan, P.-F. A systematic assessment of cell type deconvolution algorithms for DNA methylation data. Brief. Bioinform. 23, bbac449 (2022).
pubmed: 36242584 doi: 10.1093/bib/bbac449
Mullen, K. M. bvls R-Package. https://CRAN.R-project.org/package=bvls (2013).
Hastie, T., Qian, J. & Tay, K. An Introduction to glmnet. https://CRAN.R-project.org/package=glmnet (2023).
Zhang, H., Cai, R., Dai, J. & Sun, W. EMeth: an EM algorithm for cell type decomposition based on DNA methylation data. Sci. Rep. 11, 5717 (2021).
pubmed: 33707472 pmcid: 7952399 doi: 10.1038/s41598-021-84864-9
Teschendorff, A. E., Breeze, C. E., Zheng, S. C. & Beck, S. A comparison of reference-based algorithms for correcting cell-type heterogeneity in Epigenome-Wide Association Studies. BMC Bioinform. 18, 105 (2017).
doi: 10.1186/s12859-017-1511-5
Moss, J. et al. Comprehensive human cell-type methylation atlas reveals origins of circulating cell-free DNA in health and disease. Nat. Commun. 9, 5068 (2018).
pubmed: 30498206 pmcid: 6265251 doi: 10.1038/s41467-018-07466-6
Arneson, D., Yang, X. & Wang, K. MethylResolver-a method for deconvoluting bulk DNA methylation profiles into known and unknown cell contents. Commun. Biol. 3, 422 (2020).
pubmed: 32747663 pmcid: 7400544 doi: 10.1038/s42003-020-01146-2
Aryee, M. J. et al. Minfi: a flexible and comprehensive Bioconductor package for the analysis of Infinium DNA methylation microarrays. Bioinformatics 30, 1363–1369 (2014).
pubmed: 24478339 pmcid: 4016708 doi: 10.1093/bioinformatics/btu049
Mullen, K. M. & van Stokkum, I. H. M. nnls: The Lawson-Hanson Algorithm for Non-Negative Least Squares (NNLS). https://CRAN.R-project.org/package=nnls (2012).
Hao, Y., Yan, M., Heath, B. R., Lei, Y. L. & Xie, Y. Fast and robust deconvolution of tumor infiltrating lymphocyte from expression profiles using least trimmed squares. PLoS Comput. Biol. 15, e1006976 (2019).
pubmed: 31059559 pmcid: 6522071 doi: 10.1371/journal.pcbi.1006976
Wilson, D. R., Jin, C., Ibrahim, J. G. & Sun, W. Iced-t provides accurate estimates of immune cell abundance in tumor samples by allowing for aberrant gene expression patterns. J. Am. Stat. Assoc. 115, 1055–1065 (2020).
pubmed: 33012900 doi: 10.1080/01621459.2019.1654874
Danziger, S. A. et al. ADAPTS: automated deconvolution augmentation of profiles for tissue specific cells. PLoS ONE 14, e0224693 (2019).
pubmed: 31743345 pmcid: 6863530 doi: 10.1371/journal.pone.0224693
Luo, Q. et al. A meta-analysis of immune-cell fractions at high resolution reveals novel associations with common phenotypes and health outcomes. Genome Med. 15, 59 (2023).
pubmed: 37525279 pmcid: 10388560 doi: 10.1186/s13073-023-01211-5
Koestler, D. C. et al. Improving cell mixture deconvolution by identifying optimal DNA methylation libraries (IDOL). BMC Bioinforma. 17, 1–21 (2016).
doi: 10.1186/s12859-016-0943-7
Houtepen, L. C. et al. Genome-wide DNA methylation levels and altered cortisol stress reactivity following childhood trauma in humans. Nat. Commun. 7, 10967 (2016).
pubmed: 26997371 pmcid: 4802173 doi: 10.1038/ncomms10967
Pidsley, R. et al. Critical evaluation of the Illumina MethylationEPIC BeadChip microarray for whole-genome DNA methylation profiling. Genome Biol. 17, 208 (2016).
pubmed: 27717381 pmcid: 5055731 doi: 10.1186/s13059-016-1066-1
Salas, L. A. et al. An optimized library for reference-based deconvolution of whole-blood biospecimens assayed using the Illumina HumanMethylationEPIC BeadArray. Genome Biol. 19, 64 (2018).
pubmed: 29843789 pmcid: 5975716 doi: 10.1186/s13059-018-1448-7
Eckhardt, F. et al. DNA methylation profiling of human chromosomes 6, 20 and 22. Nat. Genet. 38, 1378–1385 (2006).
pubmed: 17072317 pmcid: 3082778 doi: 10.1038/ng1909
Galle, E. et al. DNA methylation-driven EMT is a common mechanism of resistance to various therapeutic agents in cancer. Clin. Epigenetics 12, 19 (2020).
doi: 10.1186/s13148-020-0821-z
De Borre, M., et al. Cell-free DNA methylome analysis for early preeclampsia prediction. Nat. Med.29, 1–10 (2023).
Houseman, E. A. et al. Reference-free deconvolution of DNA methylation data and mediation by cell composition effects. BMC Bioinform. 17, 1–15 (2016).
doi: 10.1186/s12859-016-1140-4
Li, Z. & Wu, H. TOAST: improving reference-free cell composition estimation by cross-cell type differential analysis. Genome Biol. 20, 1–17 (2019).
doi: 10.1186/s13059-019-1778-0
Scherer, M. et al. Reference-free deconvolution, visualization and interpretation of complex DNA methylation data using DecompPipeline, MeDeCom and FactorViz. Nat. Protoc. 15, 3240–3263 (2020).
pubmed: 32978601 doi: 10.1038/s41596-020-0369-6
Keukeleire, P., Makrodimitris, S. & Reinders, M. Cell type deconvolution of methylated cell-free DNA at the resolution of individual reads. NAR Genom. Bioinform. 5, lqad048 (2023).
pubmed: 37274121 pmcid: 10236360 doi: 10.1093/nargab/lqad048
Li, S. et al. Comprehensive tissue deconvolution of cell-free DNA by deep learning for disease diagnosis and monitoring. Proc. Natl Acad. Sci. USA 120, e2305236120 (2023).
pubmed: 37399400 pmcid: 10334733 doi: 10.1073/pnas.2305236120
Pedersen, B. S., Eyring, K., De, S., Yang, I. V. & Schwartz, D. A. Fast and accurate alignment of long bisulfite-seq reads. Preprint at https://arxiv.org/abs/1401.1129 (2014).
Picard toolkit. Broad Institute, GitHub repository https://broadinstitute.github.io/picard/ (2019).
MethylDackel https://github.com/dpryan79/MethylDackel (2016)
Krueger, F. Trim Galore. The Babraham Institute https://github.com/FelixKrueger/TrimGalore (2012)
Ben, B. preprocessCore: A Collection of Pre-Processing Functions. https://github.com/bmbolstad/preprocessCore (2023)
De Ridder, K. Benchmarking of methods for DNA methylome deconvolution, FunctionalEpigeneticsLab/DNAme-deconvolution-benchmarking, https://doi.org/10.5281/zenodo.10987211 (2024).

Auteurs

Kobe De Ridder (K)

Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium.

Huiwen Che (H)

Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium.

Kaat Leroy (K)

Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium.

Bernard Thienpont (B)

Laboratory for Functional Epigenetics, Department of Human Genetics, KU Leuven, 3000, Leuven, Belgium. bernard.thienpont@kuleuven.be.
KU Leuven Institute for Single Cell Omics (LISCO), KU Leuven, 3000, Leuven, Belgium. bernard.thienpont@kuleuven.be.
KU Leuven Cancer Institute (LKI), KU Leuven, 3000, Leuven, Belgium. bernard.thienpont@kuleuven.be.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH