MutationalPatterns: the one stop shop for the analysis of mutational processes.

Base substitutions Indels Mutagenic processes Mutational signatures R Regional mutation patterns Somatic mutations

Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
15 Feb 2022
Historique:
received: 08 11 2021
accepted: 01 02 2022
entrez: 16 2 2022
pubmed: 17 2 2022
medline: 19 2 2022
Statut: epublish

Résumé

The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data. Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights. This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns .

Sections du résumé

BACKGROUND BACKGROUND
The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data.
RESULTS RESULTS
Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights.
CONCLUSIONS CONCLUSIONS
This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns .

Identifiants

pubmed: 35168570
doi: 10.1186/s12864-022-08357-3
pii: 10.1186/s12864-022-08357-3
pmc: PMC8845394
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

134

Subventions

Organisme : Nederlandse Organisatie voor Wetenschappelijk Onderzoek
ID : 016.Vidi.171.023

Informations de copyright

© 2022. The Author(s).

Références

Nat Commun. 2020 Nov 4;11(1):5584
pubmed: 33149131
Genome Res. 2018 Nov;28(11):1747-1756
pubmed: 30341162
Nature. 2020 Feb;578(7793):82-93
pubmed: 32025007
Nature. 2018 Mar 15;555(7696):371-376
pubmed: 29489755
Cell. 2019 Mar 21;177(1):101-114
pubmed: 30901533
Genome Biol. 2015 Mar 24;16:56
pubmed: 25887522
Cell Stem Cell. 2021 Oct 7;28(10):1726-1739.e6
pubmed: 34496298
Genome Biol. 2016 Feb 22;17:31
pubmed: 26899170
Nat Med. 2017 Apr;23(4):517-525
pubmed: 28288110
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):152
pubmed: 30999866
PLoS Comput Biol. 2021 Jun 28;17(6):e1009119
pubmed: 34181655
Nature. 2016 Oct 13;538(7624):260-264
pubmed: 27698416
Sci Rep. 2020 Feb 26;10(1):3501
pubmed: 32103116
Nat Commun. 2019 Jul 5;10(1):2969
pubmed: 31278357
Cell Rep. 2013 Jan 31;3(1):246-59
pubmed: 23318258
N Engl J Med. 2009 Jul 9;361(2):123-34
pubmed: 19553641
Nature. 2020 Apr;580(7802):269-273
pubmed: 32106218
Immunology. 2020 Jul;160(3):233-247
pubmed: 32031242
Bioinformatics. 2015 Nov 15;31(22):3673-5
pubmed: 26163694
Nat Cancer. 2020 Feb;1(2):249-263
pubmed: 32118208
Nat Genet. 2015 Dec;47(12):1402-7
pubmed: 26551669
Nature. 2005 Apr 14;434(7035):913-7
pubmed: 15829966
Nat Commun. 2020 Nov 17;11(1):5834
pubmed: 33203900
Mol Cell Biol. 2007 Dec;27(24):8442-53
pubmed: 17923696
Nature. 2020 Jul;583(7818):699-710
pubmed: 32728249
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688
pubmed: 31691826
Nature. 2020 Feb;578(7793):94-101
pubmed: 32025018
Annu Rev Biochem. 2007;76:1-22
pubmed: 17328676
Science. 2017 Oct 13;358(6360):234-238
pubmed: 28912133
Nature. 2019 Nov;575(7781):210-216
pubmed: 31645765
Nature. 2020 Jul;583(7815):265-270
pubmed: 32581361
Nucleic Acids Res. 2021 Jan 8;49(D1):D10-D17
pubmed: 33095870
Life Sci Alliance. 2019 Jul 2;2(4):
pubmed: 31266885
Cell. 2019 May 2;177(4):821-836.e16
pubmed: 30982602
Genome Med. 2018 Apr 25;10(1):33
pubmed: 29695279
Nature. 2015 Feb 19;518(7539):360-364
pubmed: 25693567
Nat Rev Genet. 2014 Sep;15(9):585-98
pubmed: 24981601
Antioxid Redox Signal. 2013 Jun 20;18(18):2429-43
pubmed: 23311711
Science. 2019 Jun 28;364(6447):
pubmed: 31249028
Bioinformatics. 2017 Jan 1;33(1):8-16
pubmed: 27591080
Genome Res. 2019 Jul;29(7):1067-1077
pubmed: 31221724

Auteurs

Freek Manders (F)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Arianne M Brandsma (AM)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Jurrian de Kanter (J)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Mark Verheul (M)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Rurika Oka (R)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Markus J van Roosmalen (MJ)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.

Bastiaan van der Roest (B)

Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.
Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.
Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.

Arne van Hoeck (A)

Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.
Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.

Edwin Cuppen (E)

Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands.
Center for Molecular Medicine, University Medical Center Utrecht, Utrecht University, Universiteitsweg 100, 3584 CG, Utrecht, The Netherlands.

Ruben van Boxtel (R)

Princess Máxima Center for Pediatric Oncology, Heidelberglaan 25, 3584CS, Utrecht, The Netherlands. R.vanBoxtel@prinsesmaximacentrum.nl.
Oncode Institute, Jaarbeursplein 6, 3521 AL, Utrecht, The Netherlands. R.vanBoxtel@prinsesmaximacentrum.nl.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH