mSigHdp: hierarchical Dirichlet process mixture modeling for mutational signature discovery.


Journal

NAR genomics and bioinformatics
ISSN: 2631-9268
Titre abrégé: NAR Genom Bioinform
Pays: England
ID NLM: 101756213

Informations de publication

Date de publication:
Mar 2023
Historique:
received: 25 10 2022
accepted: 19 01 2023
entrez: 25 1 2023
pubmed: 26 1 2023
medline: 26 1 2023
Statut: epublish

Résumé

Mutational signatures are characteristic patterns of mutations caused by endogenous or exogenous mutational processes. These signatures can be discovered by analyzing mutations in large sets of samples-usually somatic mutations in tumor samples. Most programs for discovering mutational signatures are based on non-negative matrix factorization (NMF). Alternatively, signatures can be discovered using hierarchical Dirichlet process (HDP) mixture models, an approach that has been less explored. These models assign mutations to clusters and view each cluster as being generated from the signature of a particular mutational process. Here, we describe mSigHdp, an improved approach to using HDP mixture models to discover mutational signatures. We benchmarked mSigHdp and state-of-the-art NMF-based approaches on four realistic synthetic data sets. These data sets encompassed 18 cancer types. In total, they contained 3.5 × 10

Identifiants

pubmed: 36694663
doi: 10.1093/nargab/lqad005
pii: lqad005
pmc: PMC9869330
doi:

Types de publication

Journal Article

Langues

eng

Pagination

lqad005

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

Références

Bioinformatics. 2018 Jan 15;34(2):330-337
pubmed: 29028923
Proc Natl Acad Sci U S A. 2022 Jan 25;119(4):
pubmed: 35058360
Nature. 2020 Feb;578(7793):94-101
pubmed: 32025018
Genes Chromosomes Cancer. 2021 May;60(5):314-331
pubmed: 33222322
Bioinformatics. 2019 Jul 15;35(14):i492-i500
pubmed: 31510643
PLoS Comput Biol. 2019 Feb 22;15(2):e1006799
pubmed: 30794536
Sci Rep. 2022 Jan 10;12(1):390
pubmed: 35013428
Genome Biol. 2016 Feb 22;17:31
pubmed: 26899170
Bioinformatics. 2017 Jan 1;33(1):8-16
pubmed: 27591080
Nat Med. 2017 Apr;23(4):517-525
pubmed: 28288110
BMC Bioinformatics. 2019 Sep 2;20(1):450
pubmed: 31477009
BMC Bioinformatics. 2019 Apr 18;20(Suppl 4):152
pubmed: 30999866
Science. 2022 Apr 22;376(6591):
pubmed: 35949260
PLoS One. 2019 Sep 12;14(9):e0221235
pubmed: 31513583
Nature. 2020 Feb;578(7793):112-121
pubmed: 32025012
Cell. 2012 May 25;149(5):979-93
pubmed: 22608084
Cell Rep. 2013 Jan 31;3(1):246-59
pubmed: 23318258
Nat Cancer. 2020 Feb;1(2):249-263
pubmed: 32118208
Nature. 2016 May 02;534(7605):47-54
pubmed: 27135926
Nature. 2019 Oct;574(7779):538-542
pubmed: 31645727
Commun Biol. 2021 Mar 29;4(1):424
pubmed: 33782531
Nat Commun. 2018 May 1;9(1):1746
pubmed: 29717118
Nat Genet. 2016 Jun;48(6):600-606
pubmed: 27111033
Genome Res. 2020 Jun;30(6):803-813
pubmed: 32661091
Sci Transl Med. 2017 Oct 18;9(412):
pubmed: 29046434
Cell Genom. 2022 Nov 09;2(11):None
pubmed: 36388765
BMC Bioinformatics. 2021 Nov 4;22(1):540
pubmed: 34736398
Science. 2015 Sep 25;349(6255):1483-9
pubmed: 26404825
BMC Genomics. 2022 Feb 15;23(1):134
pubmed: 35168570
Nature. 2013 Aug 22;500(7463):415-21
pubmed: 23945592
Nat Commun. 2015 Dec 07;6:8866
pubmed: 26638776

Auteurs

Mo Liu (M)

Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, 169857 Singapore.
Centre for Computational Biology, Duke-NUS Medical School, 169857 Singapore.

Yang Wu (Y)

Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, 169857 Singapore.
Centre for Computational Biology, Duke-NUS Medical School, 169857 Singapore.

Nanhai Jiang (N)

Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, 169857 Singapore.
Centre for Computational Biology, Duke-NUS Medical School, 169857 Singapore.

Arnoud Boot (A)

Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, 169857 Singapore.
Centre for Computational Biology, Duke-NUS Medical School, 169857 Singapore.

Steven G Rozen (SG)

Programme in Cancer & Stem Cell Biology, Duke-NUS Medical School, 169857 Singapore.
Centre for Computational Biology, Duke-NUS Medical School, 169857 Singapore.

Classifications MeSH