Geographic variation of mutagenic exposures in kidney cancer genomes.
Journal
Nature
ISSN: 1476-4687
Titre abrégé: Nature
Pays: England
ID NLM: 0410462
Informations de publication
Date de publication:
01 May 2024
01 May 2024
Historique:
received:
04
05
2023
accepted:
28
03
2024
medline:
2
5
2024
pubmed:
2
5
2024
entrez:
1
5
2024
Statut:
aheadofprint
Résumé
International differences in the incidence of many cancer types indicate the existence of carcinogen exposures that have not yet been identified by conventional epidemiology make a substantial contribution to cancer burden
Identifiants
pubmed: 38693263
doi: 10.1038/s41586-024-07368-2
pii: 10.1038/s41586-024-07368-2
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© 2024. The Author(s).
Références
Brennan, P. & Davey-Smith, G. Identifying novel causes of cancers to enhance cancer prevention: new strategies are needed. J. Natl Cancer Inst. 114, 353–360 (2022).
pubmed: 34743211
doi: 10.1093/jnci/djab204
Hsieh, J. J. et al. Renal cell carcinoma. Nat. Rev. Dis. Primers 3, 17009 (2017).
pubmed: 28276433
pmcid: 5936048
doi: 10.1038/nrdp.2017.9
Koh, G., Degasperi, A., Zou, X., Momen, S. & Nik-Zainal, S. Mutational signatures: emerging concepts, caveats and clinical applications. Nat. Rev. Cancer 21, 619–637 (2021).
pubmed: 34316057
doi: 10.1038/s41568-021-00377-7
Alexandrov, L. B. et al. The repertoire of mutational signatures in human cancer. Nature 578, 94–101 (2020).
pubmed: 32025018
pmcid: 7054213
doi: 10.1038/s41586-020-1943-3
Scelo, G. et al. Variation in genomic landscape of clear cell renal cell carcinoma across Europe. Nat. Commun. 5, 5135 (2014).
pubmed: 25351205
doi: 10.1038/ncomms6135
Mitchell, T. J. et al. Timing the landmark events in the evolution of clear cell renal cell cancer: TRACERx renal. Cell 173, 611–623.e17 (2018).
pubmed: 29656891
pmcid: 5927631
doi: 10.1016/j.cell.2018.02.020
Campbell, P. J. et al. Pan-cancer analysis of whole genomes. Nature 578, 82–93 (2020).
doi: 10.1038/s41586-020-1969-6
Degasperi, A. et al. A practical framework and online tool for mutational signature analyses show intertissue variation and driver dependencies. Nat. Cancer 1, 249–263 (2020).
pubmed: 32118208
pmcid: 7048622
doi: 10.1038/s43018-020-0027-5
The Cancer Genome Atlas Research Network. Comprehensive molecular characterization of clear cell renal cell carcinoma. Nature 499, 43–49 (2013).
pmcid: 3771322
doi: 10.1038/nature12222
Mutographs. Cancer Grand Challenges https://cancergrandchallenges.org/teams (2023).
Sung, H. et al. Global Cancer Statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA 71, 209–249 (2021).
pubmed: 33538338
Nik-Zainal, S. et al. Mutational processes molding the genomes of 21 breast cancers. Cell 149, 979–993 (2012).
pubmed: 22608084
pmcid: 3414841
doi: 10.1016/j.cell.2012.04.024
Alexandrov, L. B. et al. Signatures of mutational processes in human cancer. Nature 500, 415–421 (2013).
pubmed: 23945592
pmcid: 3776390
doi: 10.1038/nature12477
Drost, J. et al. Use of CRISPR-modified human stem cell organoids to study the origin of mutational signatures in cancer. Science 358, 234–238 (2017).
pubmed: 28912133
pmcid: 6038908
doi: 10.1126/science.aao3130
Hoang, M. L. et al. Mutational signature of aristolochic acid exposure as revealed by whole-exome sequencing. Sci. Transl. Med. 5, 197ra102 (2013).
pubmed: 23926200
pmcid: 3973132
doi: 10.1126/scitranslmed.3006200
Poon, S. L. et al. Genome-wide mutational signatures of aristolochic acid and its application as a screening tool. Sci. Transl. Med. 5, 197ra101 (2013).
pubmed: 23926199
doi: 10.1126/scitranslmed.3006086
Grollman, A. P. Aristolochic acid nephropathy: harbinger of a global iatrogenic disease. Environ. Mol. Mutagen. 54, 1–7 (2013).
pubmed: 23238808
doi: 10.1002/em.21756
Turesky, R. J. et al. Aristolochic acid exposure in Romania and implications for renal cell carcinoma. Br. J. Cancer 114, 76–80 (2016).
pubmed: 26657656
doi: 10.1038/bjc.2015.402
Wang, X.-M. et al. Integrative genomic study of Chinese clear cell renal cell carcinoma reveals features associated with thrombus. Nat. Commun. 11, 739 (2020).
pubmed: 32029730
pmcid: 7005298
doi: 10.1038/s41467-020-14601-9
Stefanovic, V. & Radovanovic, Z. Balkan endemic nephropathy and associated urothelial cancer. Nat. Clin. Pract. Urol. 5, 105–112 (2008).
pubmed: 18259188
doi: 10.1038/ncpuro1019
Huang, M. N. et al. Genome-scale mutational signatures of aflatoxin in cells, mice, and human tumors. Genome Res. 27, 1475–1486 (2017).
pubmed: 28739859
pmcid: 5580708
doi: 10.1101/gr.220038.116
Haradhvala, N. J. et al. Mutational strand asymmetries in cancer genomes reveal mechanisms of DNA damage and repair. Cell 164, 538–549 (2016).
pubmed: 26806129
pmcid: 4753048
doi: 10.1016/j.cell.2015.12.050
Otlu, B. et al. Topography of mutational signatures in human cancer. Cell Rep. 42, 112930 (2023).
pubmed: 37540596
pmcid: 10507738
doi: 10.1016/j.celrep.2023.112930
Nik-Zainal, S. The genome as a record of environmental exposure. Mutagenesis 30, 763–770 (2015).
pubmed: 26443852
pmcid: 4637815
Sato, Y. et al. Integrated molecular analysis of clear-cell renal cell carcinoma. Nat. Genet. 45, 860–867 (2013).
pubmed: 23797736
doi: 10.1038/ng.2699
Dempsey, D. et al. Nicotine metabolite ratio as an index of cytochrome P450 2A6 metabolic activity. Clin. Pharmacol. Ther. 76, 64–72 (2004).
pubmed: 15229465
doi: 10.1016/j.clpt.2004.02.011
Velenosi, T. J. et al. Untargeted metabolomics reveals N,N,N-trimethyl-L-alanyl-L-proline betaine (TMAP) as a novel biomarker of kidney function. Sci. Rep. 9, 6831 (2019).
pubmed: 31048706
pmcid: 6497643
doi: 10.1038/s41598-019-42992-3
Nik-Zainal, S. et al. The life history of 21 breast cancers. Cell 149, 994–1007 (2012).
pubmed: 22608083
pmcid: 3428864
doi: 10.1016/j.cell.2012.04.023
Dentro, S. C., Wedge, D. C. & van Loo, P. Principles of reconstructing the subclonal architecture of cancers. Cold Spring Harb. Perspect. Med. 7, a026625 (2017).
pubmed: 28270531
pmcid: 5538405
doi: 10.1101/cshperspect.a026625
Shearer, J. J. et al. Serum concentrations of per- and polyfluoroalkyl substances and risk of renal cell carcinoma. J. Natl Cancer Inst. 113, 580–587 (2021).
pubmed: 32944748
doi: 10.1093/jnci/djaa143
Kucab, J. E. et al. A compendium of mutational signatures of environmental agents. Cell 177, 821–836.e16 (2019).
pubmed: 30982602
pmcid: 6506336
doi: 10.1016/j.cell.2019.03.001
Gabriel, A. A. G. et al. Genetic analysis of lung cancer and the germline impact on somatic mutation burden. J. Natl Cancer Inst. 114, 1159–1166 (2022).
pubmed: 35511172
pmcid: 9360465
doi: 10.1093/jnci/djac087
Liu, Y., Gusev, A., Heng, Y. J., Alexandrov, L. B. & Kraft, P. Somatic mutational profiles and germline polygenic risk scores in human cancer. Genome Med. 14, 14 (2022).
pubmed: 35144655
pmcid: 8832866
doi: 10.1186/s13073-022-01016-y
Moody, S. et al. Mutational signatures in esophageal squamous cell carcinoma from eight countries with varying incidence. Nat. Genet. 53, 1553–1563 (2021).
pubmed: 34663923
doi: 10.1038/s41588-021-00928-6
Abascal, F. et al. Somatic mutation landscapes at single-molecule resolution. Nature 593, 405–410 (2021).
pubmed: 33911282
doi: 10.1038/s41586-021-03477-4
Robinson, P. S. et al. Increased somatic mutation burdens in normal human cells due to defective DNA polymerases. Nat. Genet. 53, 1434–1442 (2021).
pubmed: 34594041
pmcid: 8492474
doi: 10.1038/s41588-021-00930-y
Robinson, P. S. et al. Inherited MUTYH mutations cause elevated somatic mutation rates and distinctive mutational signatures in normal human cells. Nat. Commun. 13, 3949 (2022).
pubmed: 35803914
pmcid: 9270427
doi: 10.1038/s41467-022-31341-0
Fowler, J. C. & Jones, P. H. Somatic mutation: what shapes the mutational landscape of normal epithelia? Cancer Discov. 12, 1642–1655 (2022).
pubmed: 35397477
pmcid: 7613026
doi: 10.1158/2159-8290.CD-22-0145
Whalley, J. P. et al. Framework for quality assessment of whole genome cancer sequences. Nat. Commun. 11, 5040 (2020).
pubmed: 33028839
pmcid: 7541455
doi: 10.1038/s41467-020-18688-y
Bergmann, E. A., Chen, B.-J., Arora, K., Vacic, V. & Zody, M. C. Conpair: concordance and contamination estimator for matched tumor–normal pairs. Bioinformatics 32, 3196–3198 (2016).
pubmed: 27354699
pmcid: 5048070
doi: 10.1093/bioinformatics/btw389
Van Loo, P. et al. Allele-specific copy number analysis of tumors. Proc. Natl Acad. Sci. USA 107, 16910–16915 (2010).
pubmed: 20837533
pmcid: 2947907
doi: 10.1073/pnas.1009843107
Jones, D. et al. cgpCaVEManWrapper: simple execution of CaVEMan in order to detect somatic single nucleotide variants in NGS data. Curr. Protoc. Bioinform. 56, 15.10.1–15.10.18 (2016).
doi: 10.1002/cpbi.20
Raine, K. M. et al. cgpPindel: identifying somatically acquired insertion and deletion events from paired end sequencing. Curr. Protoc. Bioinform. 52, 15.7.1–15.7.12 (2015).
doi: 10.1002/0471250953.bi1507s52
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
pubmed: 30013048
doi: 10.1038/s41592-018-0051-x
Bergstrom, E. N. et al. SigProfilerMatrixGenerator: a tool for visualizing and exploring patterns of small mutational events. BMC Genom. 20, 685 (2019).
doi: 10.1186/s12864-019-6041-2
Liu, M., Wu, Y., Jiang, N., Boot, A. & Rozen, S. G. mSigHdp: hierarchical Dirichlet process mixture modeling for mutational signature discovery. NAR Genom. Bioinform. 5, lqad005 (2023).
pubmed: 36694663
pmcid: 9869330
doi: 10.1093/nargab/lqad005
Islam, S. M. A. et al. Uncovering novel mutational signatures by de novo extraction with SigProfilerExtractor. Cell Genom. 2, 100179 (2022).
pubmed: 36388765
pmcid: 9646490
doi: 10.1016/j.xgen.2022.100179
Senkin, S. MSA: reproducible mutational signature attribution with confidence based on simulations. BMC Bioinform. 22, 540 (2021).
doi: 10.1186/s12859-021-04450-8
Martincorena, I. et al. Universal patterns of selection in cancer and somatic tissues. Cell 171, 1029–1041.e21 (2017).
pubmed: 29056346
pmcid: 5720395
doi: 10.1016/j.cell.2017.09.042
Tamborero, D. et al. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 10, 25 (2018).
pubmed: 29592813
pmcid: 5875005
doi: 10.1186/s13073-018-0531-8
Gao, J. et al. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 6, pl1 (2013).
pubmed: 23550210
pmcid: 4160307
doi: 10.1126/scisignal.2004088
Díaz-Gay, M. et al. Assigning mutational signatures to individual samples and individual somatic mutations with SigProfilerAssignment. Bioinformatics 39, btad756 (2023).
pubmed: 38096571
pmcid: 10746860
doi: 10.1093/bioinformatics/btad756
Wilcoxon, F. Individual comparisons by ranking methods. Biometrics Bull. 1, 80 (1945).
doi: 10.2307/3001968
Benjamini, Y. & Hochberg, Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J. R. Stat. Soc. B 57, 289–300 (1995).
doi: 10.1111/j.2517-6161.1995.tb02031.x
Dušek, L. et al. Epidemiology of Malignant Tumours in the Czech Republic, Version 7.0. Masaryk University http://www.svod.cz (2007).
Liu, M. et al. Association studies of up to 1.2 million individuals yield new insights into the genetic etiology of tobacco and alcohol use. Nat. Genet. 51, 237–244 (2019).
pubmed: 30643251
pmcid: 6358542
doi: 10.1038/s41588-018-0307-5
Yengo, L. et al. Meta-analysis of genome-wide association studies for height and body mass index in ∼700000 individuals of European ancestry. Hum. Mol. Genet. 27, 3641–3649 (2018).
pubmed: 30124842
pmcid: 6488973
doi: 10.1093/hmg/ddy271
Lagou, V. et al. Sex-dimorphic genetic effects and novel loci for fasting glucose and insulin variability. Nat. Commun. 12, 24 (2021).
pubmed: 33402679
pmcid: 7785747
doi: 10.1038/s41467-020-19366-9
Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018).
pubmed: 30305743
pmcid: 6786975
doi: 10.1038/s41586-018-0579-z
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
pubmed: 19648217
pmcid: 2752134
doi: 10.1101/gr.094052.109
Chang, C. C. et al. Second-generation PLINK: rising to the challenge of larger and richer datasets. Gigascience 4, 7 (2015).
pubmed: 25722852
pmcid: 4342193
doi: 10.1186/s13742-015-0047-8
Shim, H. et al. A multivariate genome-wide association analysis of 10 LDL subfractions, and their response to statin treatment, in 1868 Caucasians. PLoS ONE 10, e0120758 (2015).
pubmed: 25898129
pmcid: 4405269
doi: 10.1371/journal.pone.0120758
Choi, S. W. & O’Reilly, P. F. PRSice-2: polygenic risk score software for biobank-scale data. Gigascience 8, giz082 (2019).
pubmed: 31307061
pmcid: 6629542
doi: 10.1093/gigascience/giz082
Loftfield, E. et al. Novel biomarkers of habitual alcohol intake and associations with risk of pancreatic and liver cancers and liver disease mortality. J. Natl Cancer Inst. 113, 1542–1550 (2021).
pubmed: 34010397
pmcid: 8562969
doi: 10.1093/jnci/djab078
Gao, J., Meyer, K., Borucki, K. & Ueland, P. M. Multiplex Immuno-MALDI-TOF MS for targeted quantification of protein biomarkers and their proteoforms related to inflammation and renal dysfunction. Anal. Chem. 90, 3366–3373 (2018).
pubmed: 29420882
doi: 10.1021/acs.analchem.7b04975
R Core Team. R: A Language and Environment for Statistical Computing. https://www.r-project.org/ (R Foundation for Statistical Computing, 2022).
Van Rossum, G. & Drake, F. L. Python 3 Reference Manual (CreateSpace, 2009).
Bates, D., Mächler, M., Bolker, B. & Walker, S. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, https://doi.org/10.18637/jss.v067.i01 (2015).
Bengtsson H. matrixStats: Functions that apply to rows and columns of matrices (and to vectors). https://cran.r-project.org/web/packages/matrixStats/index.html (2023).
Bates, D. et al. Matrix: Sparse and dense matrix classes and methods. https://matrix.r-forge.r-project.org/ (2023).
Chamberlain, S., Teucher, A. & Mahoney, M. geojsonio. https://github.com/ropensci/geojsonio (2023).
Hijmans, R. J. raster. https://github.com/rspatial/raster (2024).
Bivand, R. & Rundel, C. rgeos. https://github.com/cran/rgeos/ (2023).
Pebesma, E. & Bivand, R. Spatial Data Science: With Applications in R (Chapman and Hall/CRC, 2023).
Tennekes, M. tmaptools: Thematic map tools. https://github.com/r-tmap/tmaptools (2021).
Lin Pedersen, T. patchwork: The composer of plots. https://github.com/thomasp85/patchwork (2024).
Cheng, J. Leaflet. https://github.com/rstudio/leaflet (2023).
Wickham, H., François, R., Henry, L., Müller, K. & Vaughan, D. dplyr: A grammar of data manipulation. https://dplyr.tidyverse.org (2023).
Wickham, H., Miller, E. & Smith, D. haven: Import and export ‘SPSS’, ‘Stata’ and ‘SAS’ files. https://haven.tidyverse.org (2023).
Harrell Jr F. Hmisc: Harrell Miscellaneous. R package version 5.1-1. https://hbiostat.org/r/hmisc/ (2023).
Schauberger, P. & Walker, A. openxlsx: Read, write and edit xlsx files. https://github.com/ycphs/openxlsx (2022).
Bivand, R., Keitt, T. & Rowlingson, B. rgdal: Bindings for the ‘Geospatial’ data abstraction library. https://rgdal.r-forge.r-project.org (2023).
Wickham, H., Pedersen, T. L. & Seidel, D. scales: Scale functions for visualization. https://scales.r-lib.org (2023).
Wickham, H. stringr: Simple, consistent wrappers for common string operations. https://stringr.tidyverse.org (2023).
Wickham, H., Vaughan, D. & Girlich, M. tidyr: Tidy messy data. https://tidyr.tidyverse.org (2024).
Müller, K. & Wickham, H. tibble: Simple data frames. https://tibble.tidyverse.org/ (2023).
Dragulescu, A. & Arendt, C. xlsx: Read, write, format Excel 2007 and Excel 97/2000/XP/2003 files. https://github.com/colearendt/xlsx (2022).
Archer, E. rfPermute. https://github.com/EricArcher/rfPermute (2023).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
doi: 10.1023/A:1010933404324
Wickham, H. forcats: Tools for working with categorical variables (factors). https://forcats.tidyverse.org/ (2023).
The pandas development team. pandas-dev/pandas: Pandas. https://doi.org/10.5281/zenodo.7093122 (2022).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
pubmed: 32939066
pmcid: 7759461
doi: 10.1038/s41586-020-2649-2
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
pubmed: 32015543
pmcid: 7056644
doi: 10.1038/s41592-019-0686-2
Seabold, S. & Perktold, J. statsmodels: Econometric and statistical modeling with Python. In 9th Python in Science Conference https://doi.org/10.25080/Majora-92bf1922-011 (2010).
Luo, J. firthlogist. https://github.com/jzluo/firthlogist (2022).
Smith, N. J. et al. pydata/patsy. https://doi.org/10.5281/zenodo.5529350 (2021).
Kluyver, T. et al. in Positioning and Power in Academic Publishing: Players, Agents and Agendas (eds Loizides, F. & Schmidt, B.) 87–90 (2016).
Wickham, H. ggplot2: Elegant graphics for data analysis. https://ggplot2.tidyverse.org (2016).
Campitelli, E. ggnewscale: Multiple fill and color scales in ggplot2. https://doi.org/10.5281/zenodo.7971612 (2023).
Mike, F. C., Davis, T. L. & ggplot2 authors. ggpattern: ‘ggplot2’ pattern geoms. https://github.com/trevorld/ggpattern (2022).
Slowikowski, K. ggrepel: Automatically position non-overlapping text labels with ‘ggplot2’. https://github.com/slowkow/ggrepel (2024).
Yutani, H. ggsflabel. https://yutannihilation.github.io/ggsflabel/ (2023).
Dunnington, D. ggspatial: Spatial data framework for ggplot2. https://paleolimbot.github.io/ggspatial/ (2023).
Kassambara, A. ggpubr: ‘ggplot2’ based publication ready plots. https://rpkgs.datanovia.com/ggpubr/ (2023).
Wilke, C. O. cowplot: Streamlined plot theme and plot annotations for ‘ggplot2’. https://wilkelab.org/cowplot/ (2024).
Hunter, J. D. Matplotlib: A 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
doi: 10.1109/MCSE.2007.55
Waskom, M. L. seaborn: statistical data visualization. J. Open Source Softw. 6, 3021 (2021).
doi: 10.21105/joss.03021
He, Y. TMB_plotter. https://github.com/AlexandrovLab/TMB_plotter (2020).
Global Administrative Areas. GADM v4.1. https://gadm.org (2022).
Patterson, T. & Nathaniel, V. World Countries, 1:10 million. Natural Earth v5.1.1. https://www.naturalearthdata.com (2022).