The Genomic landscape of short tandem repeats across multiple ancestries.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2023
Historique:
received: 10 08 2022
accepted: 07 12 2022
entrez: 26 1 2023
pubmed: 27 1 2023
medline: 31 1 2023
Statut: epublish

Résumé

Short Tandem Repeats (STRs) have been found to play a role in a myriad of complex traits and genetic diseases. We examined the variability in the lengths of over 850,000 STR loci in 996 children with suspected genetic disorders and 1,178 parents across six separate ancestral groups: Africans, Europeans, East Asians, Admixed Americans, Non-admixed Americans, and Pacific Islanders. For each STR locus we compared allele length between and within each ancestry group. In relation to Europeans, admixed Americans had the most similar STR lengths with only 623 positions either significantly expanded or contracted, while the divergence was highest in Africans, with 4,933 chromosomal positions contracted or expanded. We also examined probands to identify STR expansions at known pathogenic loci. The genes TCF4, AR, and DMPK showed significant expansions with lengths 250% greater than their various average allele lengths in 49, 162, and 11 individuals respectively. All 49 individuals containing an expansion in TCF4 and six individuals containing an expansion in DMPK presented with allele lengths longer than the known pathogenic length for these genes. Next, we identified individuals with significant expansions in highly conserved loci across all ancestries. Eighty loci in conserved regions met criteria for divergence. Two of these individuals were found to have exonic STR expansions: one in ZBTB4 and the other in SLC9A7, which is associated with X-linked mental retardation. Finally, we used parent-child trios to detect and analyze de novo mutations. In total, we observed 3,219 de novo expansions, where proband allele lengths are greater than twice the longest parental allele length. This work helps lay the foundation for understanding STR lengths genome-wide across ancestries and may help identify new disease genes and novel mechanisms of pathogenicity in known disease genes.

Identifiants

pubmed: 36701310
doi: 10.1371/journal.pone.0279430
pii: PONE-D-22-22440
pmc: PMC9879404
doi:

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0279430

Subventions

Organisme : NHLBI NIH HHS
ID : R01 HL145175
Pays : United States

Informations de copyright

Copyright: © 2023 Vijayaraghavan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

Annu Rev Genomics Hum Genet. 2008;9:403-33
pubmed: 18593304
Nat Rev Genet. 2005 Oct;6(10):743-55
pubmed: 16205714
Bioinformatics. 2021 May 5;37(5):731-733
pubmed: 32805020
Nature. 2016 Oct 13;538(7624):201-206
pubmed: 27654912
NAR Genom Bioinform. 2020 Jun;2(2):lqaa032
pubmed: 32500119
Clin Ophthalmol. 2014 Sep 09;8:1761-6
pubmed: 25228793
Genomics Proteomics Bioinformatics. 2007 Feb;5(1):7-14
pubmed: 17572359
Forensic Sci Int Genet. 2009 Sep;3(4):251-4
pubmed: 19647710
Nucleic Acids Res. 1989 Aug 25;17(16):6463-71
pubmed: 2780284
Nature. 2021 Jan;589(7841):246-250
pubmed: 33442040
Am J Hum Genet. 2016 May 5;98(5):919-933
pubmed: 27126583
Nucleic Acids Res. 2019 Sep 5;47(15):e90
pubmed: 31194863
Nucleic Acids Res. 2014 May;42(9):5728-41
pubmed: 24682812
Am J Hum Genet. 2017 Nov 2;101(5):700-715
pubmed: 29100084
Am J Ophthalmol. 1967 Dec;64(6):1155-8
pubmed: 6072991

Auteurs

Prashanth Vijayaraghavan (P)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Sergey Batalov (S)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Yan Ding (Y)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Erica Sanford (E)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.
Cedars-Sinai Medical Center, Los Angeles, CA, United States of America.

Stephen F Kingsmore (SF)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

David Dimmock (D)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Charlotte Hobbs (C)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Matthew Bainbridge (M)

Rady Children's Institute for Genomic Medicine, San Diego, CA, United States of America.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH