Comprehensive identification of somatic nucleotide variants in human brain tissue.
Journal
Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660
Informations de publication
Date de publication:
29 03 2021
29 03 2021
Historique:
received:
22
09
2020
accepted:
01
02
2021
entrez:
30
3
2021
pubmed:
31
3
2021
medline:
12
1
2022
Statut:
epublish
Résumé
Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
Sections du résumé
BACKGROUND
Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells.
RESULTS
Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees.
CONCLUSIONS
This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.
Identifiants
pubmed: 33781308
doi: 10.1186/s13059-021-02285-3
pii: 10.1186/s13059-021-02285-3
pmc: PMC8006362
doi:
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
92Subventions
Organisme : NINDS NIH HHS
ID : R01 NS032457
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106883
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR001863
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106876
Pays : United States
Organisme : NHGRI NIH HHS
ID : P50 HG007735
Pays : United States
Organisme : NIGMS NIH HHS
ID : T32 GM007544
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106874
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106892
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH108898
Pays : United States
Organisme : NIMH NIH HHS
ID : F31 MH124393
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106884
Pays : United States
Organisme : NHGRI NIH HHS
ID : T32 HG002295
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106891
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106882
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106893
Pays : United States
Références
Nature. 2016 Oct 13;538(7624):260-264
pubmed: 27698416
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Nucleic Acids Res. 2018 Feb 28;46(4):e20
pubmed: 29186545
Bioinformatics. 2011 Nov 1;27(21):2957-63
pubmed: 21903629
PLoS Genet. 2016 Sep 15;12(9):e1006245
pubmed: 27632392
Proc Natl Acad Sci U S A. 2012 Sep 4;109(36):14508-13
pubmed: 22853953
Nat Commun. 2019 Aug 29;10(1):3908
pubmed: 31467286
Cell. 2012 Oct 26;151(3):483-96
pubmed: 23101622
Nat Neurosci. 2016 Dec;19(12):1583-1591
pubmed: 27618310
Nature. 2011 Mar 3;471(7336):63-7
pubmed: 21368825
Nat Genet. 2012 May 06;44(6):642-50
pubmed: 22561516
Cell Rep. 2014 Sep 11;8(5):1280-9
pubmed: 25159146
Nucleic Acids Res. 2017 Jun 2;45(10):e76
pubmed: 28132024
Science. 2018 Feb 2;359(6375):550-555
pubmed: 29217587
BMC Bioinformatics. 2008 May 29;9:253
pubmed: 18510760
Nat Rev Genet. 2018 Nov;19(11):688-704
pubmed: 30232369
Genome Res. 2019 Apr;29(4):635-645
pubmed: 30894395
Stem Cells. 2012 Mar;30(3):435-40
pubmed: 22162363
Sci Rep. 2017 Nov 15;7(1):15677
pubmed: 29142202
Nucleic Acids Res. 2020 Feb 20;48(3):1146-1163
pubmed: 31853540
Nat Biotechnol. 2013 Mar;31(3):213-9
pubmed: 23396013
Nature. 2020 May;581(7809):434-443
pubmed: 32461654
Nat Biotechnol. 2014 Mar;32(3):246-51
pubmed: 24531798
Nature. 2012 Dec 20;492(7429):438-42
pubmed: 23160490
Neuron. 2015 Jan 7;85(1):49-59
pubmed: 25569347
Science. 2018 Feb 2;359(6375):555-559
pubmed: 29217584
Nature. 2011 Oct 30;479(7374):534-7
pubmed: 22037309
Science. 2013 Nov 1;342(6158):632-7
pubmed: 24179226
Nat Biotechnol. 2020 Mar;38(3):314-319
pubmed: 31907404
Nucleic Acids Res. 2012 Aug;40(15):e115
pubmed: 22730293
Nat Biotechnol. 2016 Mar;34(3):303-11
pubmed: 26829319
Bioinformatics. 2014 Mar 1;30(5):614-20
pubmed: 24142950
Nat Rev Genet. 2018 May;19(5):269-285
pubmed: 29576615
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Hum Mutat. 2008 Sep;29(9):1118-24
pubmed: 18570184
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nat Neurosci. 2021 Feb;24(2):186-196
pubmed: 33432196
Nat Methods. 2018 Aug;15(8):591-594
pubmed: 30013048
Nat Genet. 2016 Nov;48(11):1443-1448
pubmed: 27694958
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Genome Res. 2017 Apr;27(4):512-523
pubmed: 28235832
Nat Genet. 2012 May 06;44(6):651-8
pubmed: 22561519
Science. 2017 Apr 28;356(6336):
pubmed: 28450582
Nat Protoc. 2014 Nov;9(11):2586-606
pubmed: 25299156
Nature. 2018 Jul;559(7714):350-355
pubmed: 29995854
Nat Methods. 2017 May;14(5):491-493
pubmed: 28319112
Genome Res. 2012 Mar;22(3):568-76
pubmed: 22300766
Proc Natl Acad Sci U S A. 2012 Oct 30;109(44):18018-23
pubmed: 23043118
PLoS Genet. 2016 Oct 27;12(10):e1006385
pubmed: 27788131
Science. 2015 Oct 2;350(6256):94-98
pubmed: 26430121
Am J Hum Genet. 2008 Mar;82(3):763-71
pubmed: 18304490