Comprehensive identification of somatic nucleotide variants in human brain tissue.


Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
29 03 2021
Historique:
received: 22 09 2020
accepted: 01 02 2021
entrez: 30 3 2021
pubmed: 31 3 2021
medline: 12 1 2022
Statut: epublish

Résumé

Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells. Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees. This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.

Sections du résumé

BACKGROUND
Post-zygotic mutations incurred during DNA replication, DNA repair, and other cellular processes lead to somatic mosaicism. Somatic mosaicism is an established cause of various diseases, including cancers. However, detecting mosaic variants in DNA from non-cancerous somatic tissues poses significant challenges, particularly if the variants only are present in a small fraction of cells.
RESULTS
Here, the Brain Somatic Mosaicism Network conducts a coordinated, multi-institutional study to examine the ability of existing methods to detect simulated somatic single-nucleotide variants (SNVs) in DNA mixing experiments, generate multiple replicates of whole-genome sequencing data from the dorsolateral prefrontal cortex, other brain regions, dura mater, and dural fibroblasts of a single neurotypical individual, devise strategies to discover somatic SNVs, and apply various approaches to validate somatic SNVs. These efforts lead to the identification of 43 bona fide somatic SNVs that range in variant allele fractions from ~ 0.005 to ~ 0.28. Guided by these results, we devise best practices for calling mosaic SNVs from 250× whole-genome sequencing data in the accessible portion of the human genome that achieve 90% specificity and sensitivity. Finally, we demonstrate that analysis of multiple bulk DNA samples from a single individual allows the reconstruction of early developmental cell lineage trees.
CONCLUSIONS
This study provides a unified set of best practices to detect somatic SNVs in non-cancerous tissues. The data and methods are freely available to the scientific community and should serve as a guide to assess the contributions of somatic SNVs to neuropsychiatric diseases.

Identifiants

pubmed: 33781308
doi: 10.1186/s13059-021-02285-3
pii: 10.1186/s13059-021-02285-3
pmc: PMC8006362
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

92

Subventions

Organisme : NINDS NIH HHS
ID : R01 NS032457
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106883
Pays : United States
Organisme : NCATS NIH HHS
ID : UL1 TR001863
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106876
Pays : United States
Organisme : NHGRI NIH HHS
ID : P50 HG007735
Pays : United States
Organisme : NIGMS NIH HHS
ID : T32 GM007544
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106874
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106892
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH108898
Pays : United States
Organisme : NIMH NIH HHS
ID : F31 MH124393
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106884
Pays : United States
Organisme : NHGRI NIH HHS
ID : T32 HG002295
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106891
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106882
Pays : United States
Organisme : NIMH NIH HHS
ID : U01 MH106893
Pays : United States

Références

Nature. 2016 Oct 13;538(7624):260-264
pubmed: 27698416
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Nucleic Acids Res. 2018 Feb 28;46(4):e20
pubmed: 29186545
Bioinformatics. 2011 Nov 1;27(21):2957-63
pubmed: 21903629
PLoS Genet. 2016 Sep 15;12(9):e1006245
pubmed: 27632392
Proc Natl Acad Sci U S A. 2012 Sep 4;109(36):14508-13
pubmed: 22853953
Nat Commun. 2019 Aug 29;10(1):3908
pubmed: 31467286
Cell. 2012 Oct 26;151(3):483-96
pubmed: 23101622
Nat Neurosci. 2016 Dec;19(12):1583-1591
pubmed: 27618310
Nature. 2011 Mar 3;471(7336):63-7
pubmed: 21368825
Nat Genet. 2012 May 06;44(6):642-50
pubmed: 22561516
Cell Rep. 2014 Sep 11;8(5):1280-9
pubmed: 25159146
Nucleic Acids Res. 2017 Jun 2;45(10):e76
pubmed: 28132024
Science. 2018 Feb 2;359(6375):550-555
pubmed: 29217587
BMC Bioinformatics. 2008 May 29;9:253
pubmed: 18510760
Nat Rev Genet. 2018 Nov;19(11):688-704
pubmed: 30232369
Genome Res. 2019 Apr;29(4):635-645
pubmed: 30894395
Stem Cells. 2012 Mar;30(3):435-40
pubmed: 22162363
Sci Rep. 2017 Nov 15;7(1):15677
pubmed: 29142202
Nucleic Acids Res. 2020 Feb 20;48(3):1146-1163
pubmed: 31853540
Nat Biotechnol. 2013 Mar;31(3):213-9
pubmed: 23396013
Nature. 2020 May;581(7809):434-443
pubmed: 32461654
Nat Biotechnol. 2014 Mar;32(3):246-51
pubmed: 24531798
Nature. 2012 Dec 20;492(7429):438-42
pubmed: 23160490
Neuron. 2015 Jan 7;85(1):49-59
pubmed: 25569347
Science. 2018 Feb 2;359(6375):555-559
pubmed: 29217584
Nature. 2011 Oct 30;479(7374):534-7
pubmed: 22037309
Science. 2013 Nov 1;342(6158):632-7
pubmed: 24179226
Nat Biotechnol. 2020 Mar;38(3):314-319
pubmed: 31907404
Nucleic Acids Res. 2012 Aug;40(15):e115
pubmed: 22730293
Nat Biotechnol. 2016 Mar;34(3):303-11
pubmed: 26829319
Bioinformatics. 2014 Mar 1;30(5):614-20
pubmed: 24142950
Nat Rev Genet. 2018 May;19(5):269-285
pubmed: 29576615
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Hum Mutat. 2008 Sep;29(9):1118-24
pubmed: 18570184
Nature. 2015 Oct 1;526(7571):68-74
pubmed: 26432245
Nat Neurosci. 2021 Feb;24(2):186-196
pubmed: 33432196
Nat Methods. 2018 Aug;15(8):591-594
pubmed: 30013048
Nat Genet. 2016 Nov;48(11):1443-1448
pubmed: 27694958
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Genome Res. 2017 Apr;27(4):512-523
pubmed: 28235832
Nat Genet. 2012 May 06;44(6):651-8
pubmed: 22561519
Science. 2017 Apr 28;356(6336):
pubmed: 28450582
Nat Protoc. 2014 Nov;9(11):2586-606
pubmed: 25299156
Nature. 2018 Jul;559(7714):350-355
pubmed: 29995854
Nat Methods. 2017 May;14(5):491-493
pubmed: 28319112
Genome Res. 2012 Mar;22(3):568-76
pubmed: 22300766
Proc Natl Acad Sci U S A. 2012 Oct 30;109(44):18018-23
pubmed: 23043118
PLoS Genet. 2016 Oct 27;12(10):e1006385
pubmed: 27788131
Science. 2015 Oct 2;350(6256):94-98
pubmed: 26430121
Am J Hum Genet. 2008 Mar;82(3):763-71
pubmed: 18304490

Auteurs

Yifan Wang (Y)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI, 48109, USA.

Taejeong Bae (T)

Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA.

Jeremy Thorpe (J)

Program in Biochemistry, Cellular and Molecular Biology, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.

Maxwell A Sherman (MA)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
MIT Department of Electrical Engineering and Computer Science, Cambridge, MA, USA.

Attila G Jones (AG)

Department of Cell, Developmental and Regenerative Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.

Sean Cho (S)

Department of Neurology, Kennedy Krieger Institute, Baltimore, MD, 21205, USA.
Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.
Present Address: Arcus Biosciences, Hayward, CA, 94545, USA.

Kenneth Daily (K)

Sage Bionetworks, Seattle, WA, USA.

Yanmei Dou (Y)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Javier Ganz (J)

Division of Genetics and Genomics, Manton Center for Orphan Disease, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, 02115, USA.
Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Alon Galor (A)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Irene Lobon (I)

Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), PRBB, 08003, Barcelona, Catalonia, Spain.
Department of Cell Biology, Physiology and Immunology, and Institute of Neurosciences, University of Barcelona, 08028, Barcelona, Spain.

Reenal Pattni (R)

Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94305, USA.
Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.

Chaggai Rosenbluh (C)

Department of Cell, Developmental and Regenerative Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.

Simone Tomasi (S)

Child Study Center, Yale University, New Haven, CT, 06520, USA.

Livia Tomasini (L)

Child Study Center, Yale University, New Haven, CT, 06520, USA.

Xiaoxu Yang (X)

Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.
Rady Children's Institute for Genomic Medicine, San Diego, CA, USA.

Bo Zhou (B)

Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94305, USA.
Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.

Schahram Akbarian (S)

Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Department of Psychiatry, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

Laurel L Ball (LL)

Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.
Rady Children's Institute for Genomic Medicine, San Diego, CA, USA.

Sara Bizzotto (S)

Division of Genetics and Genomics, Manton Center for Orphan Disease, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, 02115, USA.
Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Sarah B Emery (SB)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

Ryan Doan (R)

Division of Genetics and Genomics, Manton Center for Orphan Disease, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, 02115, USA.
Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Liana Fasching (L)

Child Study Center, Yale University, New Haven, CT, 06520, USA.

Yeongjun Jang (Y)

Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA.

David Juan (D)

Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), PRBB, 08003, Barcelona, Catalonia, Spain.

Esther Lizano (E)

Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), PRBB, 08003, Barcelona, Catalonia, Spain.

Lovelace J Luquette (LJ)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

John B Moldovan (JB)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

Rujuta Narurkar (R)

Lieber Institute for Brain Development, Baltimore, MD, 21205, USA.

Matthew T Oetjens (MT)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

Rachel E Rodin (RE)

Division of Genetics and Genomics, Manton Center for Orphan Disease, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, 02115, USA.
Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Shobana Sekar (S)

Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA.

Joo Heon Shin (JH)

Lieber Institute for Brain Development, Baltimore, MD, 21205, USA.
Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MD, USA.

Eduardo Soriano (E)

Department of Cell Biology, Physiology and Immunology, and Institute of Neurosciences, University of Barcelona, 08028, Barcelona, Spain.
Vall d'Hebron Institut de Recerca, 08035, Barcelona, Spain.
Centro de Investigación en Red sobre Enfermedades Neurodegenerativas (CIBERNED), 28031, Madrid, Spain.
ICREA Academia, 08010 Barcelona, Spain.

Richard E Straub (RE)

Lieber Institute for Brain Development, Baltimore, MD, 21205, USA.

Weichen Zhou (W)

Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI, 48109, USA.

Andrew Chess (A)

Department of Cell, Developmental and Regenerative Biology, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY, 10029, USA.
Friedman Brain Institute, Icahn School of Medicine at Mount Sinai, New York, NY, USA.
Icahn Institute for Data Science and Genomic Technologies, Icahn School of Medicine at Mount Sinai, New York, NY, USA.

Joseph G Gleeson (JG)

Department of Neurosciences, University of California San Diego, La Jolla, CA, USA.
Rady Children's Institute for Genomic Medicine, San Diego, CA, USA.

Tomas Marquès-Bonet (T)

Institut de Biologia Evolutiva (CSIC-Universitat Pompeu Fabra), PRBB, 08003, Barcelona, Catalonia, Spain.
Catalan Institution of Research and Advanced Studies (ICREA), 08010, Barcelona, Spain.
CNAG-CRG, Centre for Genomic Regulation, Barcelona Institute of Science and Technology (BIST), 08036, Barcelona, Spain.
Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, 08193, Cerdanyola del Vallès, Barcelona, Spain.

Peter J Park (PJ)

Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.

Mette A Peters (MA)

Sage Bionetworks, Seattle, WA, USA.

Jonathan Pevsner (J)

Department of Neurology, Kennedy Krieger Institute, Baltimore, MD, 21205, USA.
Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.

Christopher A Walsh (CA)

Division of Genetics and Genomics, Manton Center for Orphan Disease, and Howard Hughes Medical Institute, Boston Children's Hospital, Boston, MA, 02115, USA.
Departments of Neurology and Pediatrics, Harvard Medical School, Boston, MA, 02115, USA.
Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.

Daniel R Weinberger (DR)

Department of Psychiatry and Behavioral Sciences, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.
Lieber Institute for Brain Development, Baltimore, MD, 21205, USA.
Department of Neurology, Johns Hopkins School of Medicine, Baltimore, MD, USA.
Department of Genetic Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA.
Department of Neuroscience, Johns Hopkins School of Medicine, Baltimore, MD, USA.

Flora M Vaccarino (FM)

Child Study Center, Yale University, New Haven, CT, 06520, USA.
Department of Neuroscience, Yale University, New Haven, 06520, CT, USA.

John V Moran (JV)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Department of Internal Medicine, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.

Alexander E Urban (AE)

Department of Psychiatry and Behavioral Sciences, Stanford University School of Medicine, Stanford, CA, 94305, USA.
Department of Genetics, Stanford University School of Medicine, Stanford, CA, 94305, USA.
Tashia and John Morgridge Faculty Scholar, Stanford Child Health Research Institute, Stanford, CA, 94305, USA.

Jeffrey M Kidd (JM)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI, 48109, USA.

Ryan E Mills (RE)

Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI, 48109, USA.
Department of Computational Medicine and Bioinformatics, University of Michigan Medical School, 100 Washtenaw Avenue, Ann Arbor, MI, 48109, USA.

Alexej Abyzov (A)

Department of Health Sciences Research, Center for Individualized Medicine, Mayo Clinic, Rochester, MN, 55905, USA. Abyzov.Alexej@mayo.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH