MutationalPatterns: the one stop shop for the analysis of mutational processes.
Base substitutions
Indels
Mutagenic processes
Mutational signatures
R
Regional mutation patterns
Somatic mutations
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
15 Feb 2022
15 Feb 2022
Historique:
received:
08
11
2021
accepted:
01
02
2022
entrez:
16
2
2022
pubmed:
17
2
2022
medline:
19
2
2022
Statut:
epublish
Résumé
The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data. Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights. This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns .
Sections du résumé
BACKGROUND
BACKGROUND
The collective of somatic mutations in a genome represents a record of mutational processes that have been operative in a cell. These processes can be investigated by extracting relevant mutational patterns from sequencing data.
RESULTS
RESULTS
Here, we present the next version of MutationalPatterns, an R/Bioconductor package, which allows in-depth mutational analysis of catalogues of single and double base substitutions as well as small insertions and deletions. Major features of the package include the possibility to perform regional mutation spectra analyses and the possibility to detect strand asymmetry phenomena, such as lesion segregation. On top of this, the package also contains functions to determine how likely it is that a signature can cause damaging mutations (i.e., mutations that affect protein function). This updated package supports stricter signature refitting on known signatures in order to prevent overfitting. Using simulated mutation matrices containing varied signature contributions, we showed that reliable refitting can be achieved even when only 50 mutations are present per signature. Additionally, we incorporated bootstrapped signature refitting to assess the robustness of the signature analyses. Finally, we applied the package on genome mutation data of cell lines in which we deleted specific DNA repair processes and on large cancer datasets, to show how the package can be used to generate novel biological insights.
CONCLUSIONS
CONCLUSIONS
This novel version of MutationalPatterns allows for more comprehensive analyses and visualization of mutational patterns in order to study the underlying processes. Ultimately, in-depth mutational analyses may contribute to improved biological insights in mechanisms of mutation accumulation as well as aid cancer diagnostics. MutationalPatterns is freely available at http://bioconductor.org/packages/MutationalPatterns .
Identifiants
pubmed: 35168570
doi: 10.1186/s12864-022-08357-3
pii: 10.1186/s12864-022-08357-3
pmc: PMC8845394
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
134Subventions
Organisme : Nederlandse Organisatie voor Wetenschappelijk Onderzoek
ID : 016.Vidi.171.023
Informations de copyright
© 2022. The Author(s).
Références
Nat Commun. 2020 Nov 4;11(1):5584
pubmed: 33149131
Genome Res. 2018 Nov;28(11):1747-1756
pubmed: 30341162
Nature. 2020 Feb;578(7793):82-93
pubmed: 32025007
Nature. 2018 Mar 15;555(7696):371-376
pubmed: 29489755
Cell. 2019 Mar 21;177(1):101-114
pubmed: 30901533
Genome Biol. 2015 Mar 24;16:56
pubmed: 25887522
Cell Stem Cell. 2021 Oct 7;28(10):1726-1739.e6
pubmed: 34496298
Genome Biol. 2016 Feb 22;17:31
pubmed: 26899170
Nat Med. 2017 Apr;23(4):517-525
pubmed: 28288110
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):152
pubmed: 30999866
PLoS Comput Biol. 2021 Jun 28;17(6):e1009119
pubmed: 34181655
Nature. 2016 Oct 13;538(7624):260-264
pubmed: 27698416
Sci Rep. 2020 Feb 26;10(1):3501
pubmed: 32103116
Nat Commun. 2019 Jul 5;10(1):2969
pubmed: 31278357
Cell Rep. 2013 Jan 31;3(1):246-59
pubmed: 23318258
N Engl J Med. 2009 Jul 9;361(2):123-34
pubmed: 19553641
Nature. 2020 Apr;580(7802):269-273
pubmed: 32106218
Immunology. 2020 Jul;160(3):233-247
pubmed: 32031242
Bioinformatics. 2015 Nov 15;31(22):3673-5
pubmed: 26163694
Nat Cancer. 2020 Feb;1(2):249-263
pubmed: 32118208
Nat Genet. 2015 Dec;47(12):1402-7
pubmed: 26551669
Nature. 2005 Apr 14;434(7035):913-7
pubmed: 15829966
Nat Commun. 2020 Nov 17;11(1):5834
pubmed: 33203900
Mol Cell Biol. 2007 Dec;27(24):8442-53
pubmed: 17923696
Nature. 2020 Jul;583(7818):699-710
pubmed: 32728249
Nucleic Acids Res. 2020 Jan 8;48(D1):D682-D688
pubmed: 31691826
Nature. 2020 Feb;578(7793):94-101
pubmed: 32025018
Annu Rev Biochem. 2007;76:1-22
pubmed: 17328676
Science. 2017 Oct 13;358(6360):234-238
pubmed: 28912133
Nature. 2019 Nov;575(7781):210-216
pubmed: 31645765
Nature. 2020 Jul;583(7815):265-270
pubmed: 32581361
Nucleic Acids Res. 2021 Jan 8;49(D1):D10-D17
pubmed: 33095870
Life Sci Alliance. 2019 Jul 2;2(4):
pubmed: 31266885
Cell. 2019 May 2;177(4):821-836.e16
pubmed: 30982602
Genome Med. 2018 Apr 25;10(1):33
pubmed: 29695279
Nature. 2015 Feb 19;518(7539):360-364
pubmed: 25693567
Nat Rev Genet. 2014 Sep;15(9):585-98
pubmed: 24981601
Antioxid Redox Signal. 2013 Jun 20;18(18):2429-43
pubmed: 23311711
Science. 2019 Jun 28;364(6447):
pubmed: 31249028
Bioinformatics. 2017 Jan 1;33(1):8-16
pubmed: 27591080
Genome Res. 2019 Jul;29(7):1067-1077
pubmed: 31221724