Visualization and analysis of medically relevant tandem repeats in nanopore sequencing of control cohorts with pathSTR.
Journal
Genome research
ISSN: 1549-5469
Abbreviated title: Genome Res
Country: United States
NLM ID: 9518021
Publication information
Publication date:
15 Aug 2024
History:
received: 04 Mar 2024
accepted: 02 Aug 2024
medline: 16 Aug 2024
pubmed: 16 Aug 2024
entrez: 15 Aug 2024
Status:
aheadofprint
Abstract
The lack of population-scale databases hampers research and diagnostics for medically relevant tandem repeats and repeat expansions. We attempt to fill this gap using our pathSTR web tool, which leverages long-read sequencing of large cohorts to determine repeat length and sequence composition in a healthy population. The current version includes 1040 individuals of the 1000 Genomes Project cohort sequenced on the Oxford Nanopore Technologies PromethION. A comprehensive set of medically relevant tandem repeats was genotyped using STRdust and LongTR to determine the tandem repeat length and sequence composition. PathSTR provides rich visualizations of this dataset and the option to upload one's own data for comparison alongside the control cohort. We demonstrate the use of this application with data from targeted nanopore sequencing of a patient with myotonic dystrophy type 1. This resource will enable the genetics community to obtain a more complete overview of normal variation in tandem repeat length and sequence composition and, as such, to better assess rare tandem repeat alleles observed in patients.
Identifiers
pubmed: 39147583
pii: gr.279265.124
doi: 10.1101/gr.279265.124
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Copyright information
Published by Cold Spring Harbor Laboratory Press.