Long-read sequencing and optical mapping generates near T2T assemblies that resolves a centromeric translocation.


Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
18 Apr 2024
Historique:
received: 28 11 2023
accepted: 13 04 2024
medline: 19 4 2024
pubmed: 19 4 2024
entrez: 18 4 2024
Statut: epublish

Résumé

Long-read genome sequencing (lrGS) is a promising method in genetic diagnostics. Here we investigate the potential of lrGS to detect a disease-associated chromosomal translocation between 17p13 and the 19 centromere. We constructed two sets of phased and non-phased de novo assemblies; (i) based on lrGS only and (ii) hybrid assemblies combining lrGS with optical mapping using lrGS reads with a median coverage of 34X. Variant calling detected both structural variants (SVs) and small variants and the accuracy of the small variant calling was compared with those called with short-read genome sequencing (srGS). The de novo and hybrid assemblies had high quality and contiguity with N50 of 62.85 Mb, enabling a near telomere to telomere assembly with less than a 100 contigs per haplotype. Notably, we successfully identified the centromeric breakpoint of the translocation. A concordance of 92% was observed when comparing small variant calling between srGS and lrGS. In summary, our findings underscore the remarkable potential of lrGS as a comprehensive and accurate solution for the analysis of SVs and small variants. Thus, lrGS could replace a large battery of genetic tests that were used for the diagnosis of a single symptomatic translocation carrier, highlighting the potential of lrGS in the realm of digital karyotyping.

Identifiants

pubmed: 38637641
doi: 10.1038/s41598-024-59683-3
pii: 10.1038/s41598-024-59683-3
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

9000

Subventions

Organisme : the Swedish Research Council
ID : 2017-02936

Informations de copyright

© 2024. The Author(s).

Références

Nature. 2023 Jul;619(7968):112-121
pubmed: 37316654
Nat Genet. 2019 Feb;51(2):354-362
pubmed: 30643257
PLoS Genet. 2019 Feb 8;15(2):e1007858
pubmed: 30735495
Genome Med. 2019 Nov 7;11(1):68
pubmed: 31694722
Bioinformatics. 2018 Mar 1;34(5):867-868
pubmed: 29096012
Bioinformatics. 2016 Oct 1;32(19):3047-8
pubmed: 27312411
Bioinformatics. 2013 Apr 15;29(8):1072-5
pubmed: 23422339
Trends Genet. 2009 Jul;25(7):298-307
pubmed: 19560228
Nat Rev Genet. 2020 Oct;21(10):597-614
pubmed: 32504078
Cell Genom. 2022 May 11;2(5):
pubmed: 35720974
F1000Res. 2017 May 10;6:664
pubmed: 28781756
Nat Biotechnol. 2023 Oct;41(10):1474-1482
pubmed: 36797493
Science. 2022 Apr;376(6588):44-53
pubmed: 35357919
Nat Commun. 2024 Jan 29;15(1):837
pubmed: 38281971
Nat Genet. 2017 Jan;49(1):36-45
pubmed: 27841880
Genet Med. 2013 Jun;15(6):450-7
pubmed: 23238528
Nat Biotechnol. 2018 Nov;36(10):983-987
pubmed: 30247488
Nat Rev Genet. 2018 May;19(5):253-268
pubmed: 29398702
Nat Methods. 2023 Aug;20(8):1143-1158
pubmed: 37386186
Sci Rep. 2022 Oct 9;12(1):16945
pubmed: 36210382
Nat Biotechnol. 2019 Oct;37(10):1155-1162
pubmed: 31406327
Genome Res. 2022 Apr;32(4):599-607
pubmed: 35361624
Nat Methods. 2021 Feb;18(2):170-175
pubmed: 33526886
Nat Biotechnol. 2021 Mar;39(3):309-312
pubmed: 33288905
Nat Biotechnol. 2019 Aug;37(8):866-868
pubmed: 31375808
Hum Mutat. 2022 Nov;43(11):1531-1544
pubmed: 36086952
Comput Struct Biotechnol J. 2020 Aug 01;18:2051-2062
pubmed: 32802277
Int J Mol Sci. 2022 Aug 20;23(16):
pubmed: 36012658
Trends Genet. 2022 Nov;38(11):1134-1146
pubmed: 35820967
Nat Rev Genet. 2015 Nov;16(11):627-40
pubmed: 26442640
Bioinformatics. 2018 Sep 15;34(18):3094-3100
pubmed: 29750242
Genome Biol. 2019 Jun 3;20(1):117
pubmed: 31159850
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278

Auteurs

Esmee Ten Berk de Boer (E)

Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76, Stockholm, Sweden.
Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 76, Stockholm, Sweden.
Science for Life Laboratory, Karolinska Institutet Science Park, 171 65, Solna, Sweden.

Adam Ameur (A)

Department of Immunology, Genetics and Pathology, Uppsala University, 752 36, Uppsala, Sweden.

Ignas Bunikis (I)

Department of Immunology, Genetics and Pathology, Uppsala University, 752 36, Uppsala, Sweden.

Marlene Ek (M)

Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76, Stockholm, Sweden.
Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 76, Stockholm, Sweden.

Eva-Lena Stattin (EL)

Department of Immunology, Genetics and Pathology, Uppsala University, 752 36, Uppsala, Sweden.

Lars Feuk (L)

Department of Immunology, Genetics and Pathology, Uppsala University, 752 36, Uppsala, Sweden.

Jesper Eisfeldt (J)

Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76, Stockholm, Sweden. jesper.eisfeldt@scilifelab.se.
Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 76, Stockholm, Sweden. jesper.eisfeldt@scilifelab.se.
Science for Life Laboratory, Karolinska Institutet Science Park, 171 65, Solna, Sweden. jesper.eisfeldt@scilifelab.se.

Anna Lindstrand (A)

Department of Molecular Medicine and Surgery, Karolinska Institutet, 171 76, Stockholm, Sweden.
Department of Clinical Genetics and Genomics, Karolinska University Hospital, 171 76, Stockholm, Sweden.

Classifications MeSH