Structural variation across 138,134 samples in the TOPMed consortium.
NGS
Population
Structural Variants
TOPMed
Journal
bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187
Informations de publication
Date de publication:
25 Jan 2023
25 Jan 2023
Historique:
pubmed:
8
2
2023
medline:
8
2
2023
entrez:
7
2
2023
Statut:
epublish
Résumé
Ever larger Structural Variant (SV) catalogs highlighting the diversity within and between populations help researchers better understand the links between SVs and disease. The identification of SVs from DNA sequence data is non-trivial and requires a balance between comprehensiveness and precision. Here we present a catalog of 355,667 SVs (59.34% novel) across autosomes and the X chromosome (50bp+) from 138,134 individuals in the diverse TOPMed consortium. We describe our methodologies for SV inference resulting in high variant quality and >90% allele concordance compared to long-read de-novo assemblies of well-characterized control samples. We demonstrate utility through significant associations between SVs and important various cardio-metabolic and hemotologic traits. We have identified 690 SV hotspots and deserts and those that potentially impact the regulation of medically relevant genes. This catalog characterizes SVs across multiple populations and will serve as a valuable tool to understand the impact of SV on disease development and progression.
Identifiants
pubmed: 36747810
doi: 10.1101/2023.01.25.525428
pmc: PMC9900832
pii:
doi:
Types de publication
Preprint
Langues
eng
Subventions
Organisme : NHLBI NIH HHS
ID : P01 HL132825
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL105756
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL123915
Pays : United States
Organisme : NHLBI NIH HHS
ID : R01 HL155742
Pays : United States