Familial long-read sequencing increases yield of de novo mutations.
autism
de novo mutation
genome sequencing
long-read sequencing
Journal
American journal of human genetics
ISSN: 1537-6605
Titre abrégé: Am J Hum Genet
Pays: United States
ID NLM: 0370475
Informations de publication
Date de publication:
07 04 2022
07 04 2022
Historique:
received:
29
09
2021
accepted:
16
02
2022
pubmed:
16
3
2022
medline:
13
4
2022
entrez:
15
3
2022
Statut:
ppublish
Résumé
Studies of de novo mutation (DNM) have typically excluded some of the most repetitive and complex regions of the genome because these regions cannot be unambiguously mapped with short-read sequencing data. To better understand the genome-wide pattern of DNM, we generated long-read sequence data from an autism parent-child quad with an affected female where no pathogenic variant had been discovered in short-read Illumina sequence data. We deeply sequenced all four individuals by using three sequencing platforms (Illumina, Oxford Nanopore, and Pacific Biosciences) and three complementary technologies (Strand-seq, optical mapping, and 10X Genomics). Using long-read sequencing, we initially discovered and validated 171 DNMs across two children-a 20% increase in the number of de novo single-nucleotide variants (SNVs) and indels when compared to short-read callsets. The number of DNMs further increased by 5% when considering a more complete human reference (T2T-CHM13) because of the recovery of events in regions absent from GRCh38 (e.g., three DNMs in heterochromatic satellites). In total, we validated 195 de novo germline mutations and 23 potential post-zygotic mosaic mutations across both children; the overall true substitution rate based on this integrated callset is at least 1.41 × 10
Identifiants
pubmed: 35290762
pii: S0002-9297(22)00065-9
doi: 10.1016/j.ajhg.2022.02.014
pmc: PMC9069071
pii:
doi:
Substances chimiques
Nucleotides
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
631-646Subventions
Organisme : NCI NIH HHS
ID : P30 CA045508
Pays : United States
Organisme : NIMH NIH HHS
ID : R01 MH101221
Pays : United States
Organisme : NCI NIH HHS
ID : R50 CA243890
Pays : United States
Organisme : NHGRI NIH HHS
ID : UM1 HG008901
Pays : United States
Informations de copyright
Copyright © 2022 American Society of Human Genetics. Published by Elsevier Inc. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of interests E.E.E. is a scientific advisory board (SAB) member of Variant Bio, Inc.
Références
Cell. 2017 Oct 19;171(3):710-722.e12
pubmed: 28965761
Nat Genet. 2021 Aug;53(8):1125-1134
pubmed: 34312540
Proc Natl Acad Sci U S A. 2015 Feb 17;112(7):2109-14
pubmed: 25646453
Nat Methods. 2012 Nov;9(11):1107-12
pubmed: 23042453
Genome Res. 2016 Nov;26(11):1565-1574
pubmed: 27646535
Am J Hum Genet. 2021 Aug 5;108(8):1409-1422
pubmed: 34237280
Neuron. 2015 Sep 23;87(6):1215-1233
pubmed: 26402605
Bioinformatics. 2021 Jan 05;:
pubmed: 33399819
NPJ Genom Med. 2018 Jan 22;3:3
pubmed: 29367880
Eur J Hum Genet. 2021 Apr;29(4):637-648
pubmed: 33257779
Nature. 2017 Sep 28;549(7673):519-522
pubmed: 28959963
PLoS Comput Biol. 2021 Jun 21;17(6):e1009078
pubmed: 34153026
Nature. 2021 Jan;589(7841):246-250
pubmed: 33442040
Nat Methods. 2010 Aug;7(8):576-7
pubmed: 20676076
Nat Genet. 2011 Jun 12;43(7):712-4
pubmed: 21666693
Bioinformatics. 2014 Oct 15;30(20):2843-51
pubmed: 24974202
Neuron. 2010 Oct 21;68(2):192-5
pubmed: 20955926
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
Genet Med. 2018 Jan;20(1):159-163
pubmed: 28640241
Neuron. 2011 Jun 9;70(5):886-97
pubmed: 21658582
Nat Biotechnol. 2018 Nov;36(10):983-987
pubmed: 30247488
Nat Genet. 2014 Aug;46(8):912-918
pubmed: 25017105
Nat Commun. 2017 Nov 3;8(1):1293
pubmed: 29101320
Nat Protoc. 2017 Jun;12(6):1151-1176
pubmed: 28492527
Proc Natl Acad Sci U S A. 2020 Feb 4;117(5):2560-2569
pubmed: 31964835
Genome Biol. 2019 May 20;20(1):97
pubmed: 31104630
Genome Med. 2020 May 27;12(1):49
pubmed: 32460841
Science. 2015 Sep 11;349(6253):aab3761
pubmed: 26249230
Bioinformatics. 2014 Jul 15;30(14):2070-2
pubmed: 24642064
Science. 2019 Jan 25;363(6425):
pubmed: 30679340
Nat Genet. 2016 Aug;48(8):935-9
pubmed: 27322544
Nature. 2012 Aug 23;488(7412):471-5
pubmed: 22914163
Nat Methods. 2021 Feb;18(2):170-175
pubmed: 33526886
Am J Hum Genet. 2021 Apr 1;108(4):597-607
pubmed: 33675682
Nature. 2014 Nov 13;515(7526):216-21
pubmed: 25363768
Nat Genet. 2015 Jul;47(7):822-826
pubmed: 25985141
Nat Methods. 2018 Aug;15(8):591-594
pubmed: 30013048
Bioinformatics. 2016 Jul 1;32(13):2029-31
pubmed: 27153727
Nat Genet. 2015 Jun;47(6):582-8
pubmed: 25961944
Science. 2021 Apr 2;372(6537):
pubmed: 33632895
Nat Commun. 2019 Apr 16;10(1):1784
pubmed: 30992455
Bioinformatics. 2011 Mar 15;27(6):764-70
pubmed: 21217122
Proc Natl Acad Sci U S A. 2019 Nov 12;116(46):23243-23253
pubmed: 31659027
Bioinformatics. 2010 Nov 15;26(22):2867-73
pubmed: 20926424
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Genome Biol. 2020 Apr 28;21(1):102
pubmed: 32345345
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):11901-11906
pubmed: 27702888