Long-read sequencing and optical mapping generates near T2T assemblies that resolves a centromeric translocation.
Journal
Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288
Informations de publication
Date de publication:
18 Apr 2024
18 Apr 2024
Historique:
received:
28
11
2023
accepted:
13
04
2024
medline:
19
4
2024
pubmed:
19
4
2024
entrez:
18
4
2024
Statut:
epublish
Résumé
Long-read genome sequencing (lrGS) is a promising method in genetic diagnostics. Here we investigate the potential of lrGS to detect a disease-associated chromosomal translocation between 17p13 and the 19 centromere. We constructed two sets of phased and non-phased de novo assemblies; (i) based on lrGS only and (ii) hybrid assemblies combining lrGS with optical mapping using lrGS reads with a median coverage of 34X. Variant calling detected both structural variants (SVs) and small variants and the accuracy of the small variant calling was compared with those called with short-read genome sequencing (srGS). The de novo and hybrid assemblies had high quality and contiguity with N50 of 62.85 Mb, enabling a near telomere to telomere assembly with less than a 100 contigs per haplotype. Notably, we successfully identified the centromeric breakpoint of the translocation. A concordance of 92% was observed when comparing small variant calling between srGS and lrGS. In summary, our findings underscore the remarkable potential of lrGS as a comprehensive and accurate solution for the analysis of SVs and small variants. Thus, lrGS could replace a large battery of genetic tests that were used for the diagnosis of a single symptomatic translocation carrier, highlighting the potential of lrGS in the realm of digital karyotyping.
Identifiants
pubmed: 38637641
doi: 10.1038/s41598-024-59683-3
pii: 10.1038/s41598-024-59683-3
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
9000Subventions
Organisme : the Swedish Research Council
ID : 2017-02936
Informations de copyright
© 2024. The Author(s).
Références
Nature. 2023 Jul;619(7968):112-121
pubmed: 37316654
Nat Genet. 2019 Feb;51(2):354-362
pubmed: 30643257
PLoS Genet. 2019 Feb 8;15(2):e1007858
pubmed: 30735495
Genome Med. 2019 Nov 7;11(1):68
pubmed: 31694722
Bioinformatics. 2018 Mar 1;34(5):867-868
pubmed: 29096012
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Bioinformatics. 2013 Apr 15;29(8):1072-5
pubmed: 23422339
Trends Genet. 2009 Jul;25(7):298-307
pubmed: 19560228
Nat Rev Genet. 2020 Oct;21(10):597-614
pubmed: 32504078
Cell Genom. 2022 May 11;2(5):
pubmed: 35720974
F1000Res. 2017 May 10;6:664
pubmed: 28781756
Nat Biotechnol. 2023 Oct;41(10):1474-1482
pubmed: 36797493
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
Nat Commun. 2024 Jan 29;15(1):837
pubmed: 38281971
Nat Genet. 2017 Jan;49(1):36-45
pubmed: 27841880
Genet Med. 2013 Jun;15(6):450-7
pubmed: 23238528
Nat Biotechnol. 2018 Nov;36(10):983-987
pubmed: 30247488
Nat Rev Genet. 2018 May;19(5):253-268
pubmed: 29398702
Nat Methods. 2023 Aug;20(8):1143-1158
pubmed: 37386186
Sci Rep. 2022 Oct 9;12(1):16945
pubmed: 36210382
Nat Biotechnol. 2019 Oct;37(10):1155-1162
pubmed: 31406327
Genome Res. 2022 Apr;32(4):599-607
pubmed: 35361624
Nat Methods. 2021 Feb;18(2):170-175
pubmed: 33526886
Nat Biotechnol. 2021 Mar;39(3):309-312
pubmed: 33288905
Nat Biotechnol. 2019 Aug;37(8):866-868
pubmed: 31375808
Hum Mutat. 2022 Nov;43(11):1531-1544
pubmed: 36086952
Comput Struct Biotechnol J. 2020 Aug 01;18:2051-2062
pubmed: 32802277
Int J Mol Sci. 2022 Aug 20;23(16):
pubmed: 36012658
Trends Genet. 2022 Nov;38(11):1134-1146
pubmed: 35820967
Nat Rev Genet. 2015 Nov;16(11):627-40
pubmed: 26442640
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Genome Biol. 2019 Jun 3;20(1):117
pubmed: 31159850
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278