Dyads of GGC and GCC form hotspot colonies that coincide with the evolution of human and other great apes.

(GCC)2 (GGC)2 Colony Evolution Great ape Human Recombination hotspot

Journal

BMC genomic data
ISSN: 2730-6844
Titre abrégé: BMC Genom Data
Pays: England
ID NLM: 101775394

Informations de publication

Date de publication:
21 Feb 2024
Historique:
received: 31 07 2023
accepted: 11 02 2024
medline: 22 2 2024
pubmed: 22 2 2024
entrez: 21 2 2024
Statut: epublish

Résumé

GGC and GCC short tandem repeats (STRs) are of various evolutionary, biological, and pathological implications. However, the fundamental two-repeats (dyads) of these STRs are widely unexplored. On a genome-wide scale, we mapped (GGC)2 and (GCC)2 dyads in human, and found monumental colonies (distance between each dyad < 500 bp) of extraordinary density, and in some instances periodicity. The largest (GCC)2 and (GGC)2 colonies were intergenic, homogeneous, and human-specific, consisting of 219 (GCC)2 on chromosome 2 (probability < 1.545E-219) and 70 (GGC)2 on chromosome 9 (probability = 1.809E-148). We also found that several colonies were shared in other great apes, and directionally increased in density and complexity in human, such as a colony of 99 (GCC)2 on chromosome 20, that specifically expanded in great apes, and reached maximum complexity in human (probability 1.545E-220). Numerous other colonies of evolutionary relevance in human were detected in other largely overlooked regions of the genome, such as chromosome Y and pseudogenes. Several of the genes containing or nearest to those colonies were divergently expressed in human. In conclusion, (GCC)2 and (GGC)2 form unprecedented genomic colonies that coincide with the evolution of human and other great apes. The extent of the genomic rearrangements leading to those colonies support overlooked recombination hotspots, shared across great apes. The identified colonies deserve to be studied in mechanistic, evolutionary, and functional platforms.

Sections du résumé

BACKGROUND BACKGROUND
GGC and GCC short tandem repeats (STRs) are of various evolutionary, biological, and pathological implications. However, the fundamental two-repeats (dyads) of these STRs are widely unexplored.
RESULTS RESULTS
On a genome-wide scale, we mapped (GGC)2 and (GCC)2 dyads in human, and found monumental colonies (distance between each dyad < 500 bp) of extraordinary density, and in some instances periodicity. The largest (GCC)2 and (GGC)2 colonies were intergenic, homogeneous, and human-specific, consisting of 219 (GCC)2 on chromosome 2 (probability < 1.545E-219) and 70 (GGC)2 on chromosome 9 (probability = 1.809E-148). We also found that several colonies were shared in other great apes, and directionally increased in density and complexity in human, such as a colony of 99 (GCC)2 on chromosome 20, that specifically expanded in great apes, and reached maximum complexity in human (probability 1.545E-220). Numerous other colonies of evolutionary relevance in human were detected in other largely overlooked regions of the genome, such as chromosome Y and pseudogenes. Several of the genes containing or nearest to those colonies were divergently expressed in human.
CONCLUSION CONCLUSIONS
In conclusion, (GCC)2 and (GGC)2 form unprecedented genomic colonies that coincide with the evolution of human and other great apes. The extent of the genomic rearrangements leading to those colonies support overlooked recombination hotspots, shared across great apes. The identified colonies deserve to be studied in mechanistic, evolutionary, and functional platforms.

Identifiants

pubmed: 38383300
doi: 10.1186/s12863-024-01207-z
pii: 10.1186/s12863-024-01207-z
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

21

Informations de copyright

© 2024. The Author(s).

Références

Khamse S, Arabfard M, Salesi M, Behmard E, Jafarian Z, Afshar H, et al. Predominant monomorphism of the RIT2 and GPM6B exceptionally long GA blocks in human and enriched divergent alleles in the disease compartment. Genetica. 2022;150:27–40. https://doi.org/10.1007/s10709-021-00143-5 .
doi: 10.1007/s10709-021-00143-5 pubmed: 34984576
Khamse S, Alizadeh S, Bernhart SH, Afshar H, Delbari A, Ohadi M. A (GCC) repeat in SBF1 reveals a novel biological phenomenon in human and links to late onset neurocognitive disorder. Sci Rep. 2022;12:15480. https://doi.org/10.1038/s41598-022-19878-y .
doi: 10.1038/s41598-022-19878-y pubmed: 36104480 pmcid: 9474449
Jafarian Z, Khamse S, Afshar H, Khorshid HRK, Delbari A, Ohadi M. Natural selection at the RASGEF1C (GGC) repeat in human and divergent genotypes in late-onset neurocognitive disorder. Sci Rep. 2021;11:19235. https://doi.org/10.1038/s41598-021-98725-y .
doi: 10.1038/s41598-021-98725-y pubmed: 34584172 pmcid: 8479062
Fotsing SF, Margoliash J, Wang C, Saini S, Yanicky R, Shleizer-Burko S, et al. The impact of short tandem repeat variation on gene expression. Nat Genet. 2019;51:1652–9. https://doi.org/10.1038/s41588-019-0521-9 .
doi: 10.1038/s41588-019-0521-9 pubmed: 31676866 pmcid: 6917484
Hannan AJ. Tandem repeats mediating genetic plasticity in health and disease. Nat Rev Genet. 2018;19:286–98. https://doi.org/10.1038/nrg.2017.115 .
doi: 10.1038/nrg.2017.115 pubmed: 29398703
Maddi AMA, Kavousi K, Arabfard M, Ohadi H, Ohadi M. Tandem repeats ubiquitously flank and contribute to translation initiation sites. BMC Genom Data. 2022;23:59. https://doi.org/10.1186/s12863-022-01075-5 .
doi: 10.1186/s12863-022-01075-5 pubmed: 35896982 pmcid: 9331589
Arabfard M, Salesi M, Nourian YH, Arabipour I, Maddi AA, Kavousi K, et al. Global abundance of short tandem repeats is non-random in rodents and primates. BMC Genom Data. 2022;23:77. https://doi.org/10.1186/s12863-022-01092-4 .
doi: 10.1186/s12863-022-01092-4 pubmed: 36329409 pmcid: 9635179
Ohadi M, Valipour E, Ghadimi-Haddadan S, Namdar-Aligoodarzi P, Bagheri A, Kowsari A, et al. Core promoter short tandem repeats as evolutionary switch codes for primate speciation. Am J Primatol. 2015;77:34–43. https://doi.org/10.1002/ajp.22308 .
doi: 10.1002/ajp.22308 pubmed: 25099915
Ranathunge C, Pramod S, Renaut S, Wheeler GL, Perkins AD, Rieseberg LH, et al. Microsatellites as agents of adaptive change: an RNA-Seq-based comparative study of transcriptomes from five helianthus species. Symmetry. 2021;13:933.
doi: 10.3390/sym13060933
Watts PC, Kallio ER, Koskela E, Lonn E, Mappes T, Mokkonen M. Stabilizing selection on microsatellite allele length at arginine vasopressin 1a receptor and oxytocin receptor loci. Proceed Royal Society B: Biol Sci. 2017;284:20171896. https://doi.org/10.1098/rspb.2017.1896 .
doi: 10.1098/rspb.2017.1896
Press MO, Hall AN, Morton EA, Queitsch C. Substitutions are boring: some arguments about parallel mutations and high mutation rates. Trends Genet. 2019;35:253–64. https://doi.org/10.1016/j.tig.2019.01.002 .
doi: 10.1016/j.tig.2019.01.002 pubmed: 30797597 pmcid: 6435258
Arabfard M, Kavousi K, Delbari A, Ohadi M. Link between short tandem repeats and translation initiation site selection. Hum Genomics. 2018;12:47. https://doi.org/10.1186/s40246-018-0181-3 .
doi: 10.1186/s40246-018-0181-3 pubmed: 30373661 pmcid: 6206671
Jakubosky D, D'Antonio M, Bonder MJ, Smail C, Donovan MKR, Young Greenwald WW, et al. Properties of structural variants and short tandem repeats associated with gene expression and complex traits. Nat Commun. 2020;11(1):2927. https://doi.org/10.1038/s41467-020-16482-4 .
Annear DJ, Vandeweyer G, Elinck E, Sanchis-Juan A, French CE, Raymond L, et al. Abundancy of polymorphic CGG repeats in the human genome suggest a broad involvement in neurological disease. Sci Rep. 2021;11:2515. https://doi.org/10.1038/s41598-021-82050-5 .
doi: 10.1038/s41598-021-82050-5 pubmed: 33510257 pmcid: 7844047
Sawaya S, Bagshaw A, Buschiazzo E, Kumar P, Chowdhury S, Black MA, et al. Microsatellite tandem repeats are abundant in human promoters and are associated with regulatory elements. PLoS ONE. 2013;8: e54710. https://doi.org/10.1371/journal.pone.0054710 .
doi: 10.1371/journal.pone.0054710 pubmed: 23405090 pmcid: 3566118
Khamse S, Jafarian Z, Bozorgmehr A, Tavakoli M, Afshar H, Keshavarz M, et al. Novel implications of a strictly monomorphic (GCC) repeat in the human PRKACB gene. Sci Rep. 2021;11:20629. https://doi.org/10.1038/s41598-021-99932-3 .
doi: 10.1038/s41598-021-99932-3 pubmed: 34667254 pmcid: 8526596
Alizadeh S, Khamse S, Bernhart S, Vahedi M, Afshar H, Rezaei O, et al. A primate-specific (GCC) repeat in SMAD9 undergoes natural selection in humans and harbors unambiguous genotypes in late-onset neurocognitive disorder. Research Square; 2022.
Braida C, Stefanatos RK, Adam B, Mahajan N, Smeets HJ, Niel F, et al. Variant CCG and GGC repeats within the CTG expansion dramatically modify mutational dynamics and likely contribute toward unusual symptoms in some myotonic dystrophy type 1 patients. Hum Mol Genet. 2010;19:1399–412. https://doi.org/10.1093/hmg/ddq015 .
doi: 10.1093/hmg/ddq015 pubmed: 20080938
Tang H, Kirkness EF, Lippert C, Biggs WH, Fabani M, Guzman E, et al. Profiling of short-tandem-repeat disease alleles in 12,632 human whole genomes. Am J Hum Genet. 2017;101:700–15. https://doi.org/10.1016/j.ajhg.2017.09.013 .
doi: 10.1016/j.ajhg.2017.09.013 pubmed: 29100084 pmcid: 5673627
Fan Y, Shen S, Yang J, Yao D, Li M, Mao C, et al. GIPC1 CGG repeat expansion is associated with movement disorders. Ann Neurol. 2022;91:704–15. https://doi.org/10.1002/ana.26325 .
doi: 10.1002/ana.26325 pubmed: 35152460
Marques-Bonet T, Eichler EE. The evolution of human segmental duplications and the core duplicon hypothesis. Cold Spring Harb Symp Quant Biol. 2009;74:355–62. https://doi.org/10.1101/sqb.2009.74.011 .
doi: 10.1101/sqb.2009.74.011 pubmed: 19717539 pmcid: 4114149
Bailey JA, Gu Z, Clark RA, Reinert K, Samonte RV, Schwartz S, et al. Recent segmental duplications in the human genome. Science. 2002;297:1003–7. https://doi.org/10.1126/science.1072047 .
doi: 10.1126/science.1072047 pubmed: 12169732
Mehan MR, Freimer NB, Ophoff RA. A genome-wide survey of segmental duplications that mediate common human genetic variation of chromosomal architecture. Hum Genomics. 2004;1:335–44. https://doi.org/10.1186/1479-7364-1-5-335 .
doi: 10.1186/1479-7364-1-5-335 pubmed: 15588494 pmcid: 3525102
Sharp AJ, Locke DP, McGrath SD, Cheng Z, Bailey JA, Vallente RU, et al. Segmental duplications and copy-number variation in the human genome. Am J Hum Genet. 2005;77:78–88. https://doi.org/10.1086/431652 .
doi: 10.1086/431652 pubmed: 15918152 pmcid: 1226196
Winckler W, Myers SR, Richter DJ, Onofrio RC, McDonald GJ, Bontrop RE, et al. Comparison of fine-scale recombination rates in humans and chimpanzees. Science. 2005;308:107–11. https://doi.org/10.1126/science.1105322 .
doi: 10.1126/science.1105322 pubmed: 15705809
Mohammadparast S, Bayat H, Biglarian A, Ohadi M. Exceptional expansion and conservation of a CT-repeat complex in the core promoter of PAXBP1 in primates. Am J Primatol. 2014;76:747–56. https://doi.org/10.1002/ajp.22266 .
doi: 10.1002/ajp.22266 pubmed: 24573656
Nelson CE, Hersh BM, Carroll SB. The regulatory content of intergenic DNA shapes genome architecture. Genome Biol. 2004;5:R25. https://doi.org/10.1186/gb-2004-5-4-r25 .
doi: 10.1186/gb-2004-5-4-r25 pubmed: 15059258 pmcid: 395784
Closser M, Guo Y, Wang P, Patel T, Jang S, Hammelman J, et al. An expansion of the non-coding genome and its regulatory potential underlies vertebrate neuronal diversity. Neuron. 2022;110:70-85.e6. https://doi.org/10.1016/j.neuron.2021.10.014 .
doi: 10.1016/j.neuron.2021.10.014 pubmed: 34727520
Thierry-Mieg D, Thierry-Mieg J. AceView: a comprehensive cDNA-supported gene and transcripts annotation. Genome Biol. 2006;7:S12. https://doi.org/10.1186/gb-2006-7-s1-s12 .
doi: 10.1186/gb-2006-7-s1-s12 pmcid: 1810549
Rose AB. Introns as gene regulators: a brick on the accelerator. Front Genet. 2018;9:672. https://doi.org/10.3389/fgene.2018.00672 .
doi: 10.3389/fgene.2018.00672 pubmed: 30792737
Baier T, Jacobebbinghaus N, Einhaus A, Lauersen KJ, Kruse O. Introns mediate post-transcriptional enhancement of nuclear gene expression in the green microalga Chlamydomonas reinhardtii. PLoS Genet. 2020;16: e1008944. https://doi.org/10.1371/journal.pgen.1008944 .
doi: 10.1371/journal.pgen.1008944 pubmed: 32730252 pmcid: 7419008
Gallegos JE, Rose AB. An intron-derived motif strongly increases gene expression from transcribed sequences through a splicing independent mechanism in Arabidopsis thaliana. Sci Rep. 2019;9:13777. https://doi.org/10.1038/s41598-019-50389-5 .
doi: 10.1038/s41598-019-50389-5 pubmed: 31551463 pmcid: 6760150
Zattera ML, Bruschi DP. Transposable elements as a source of novel repetitive DNA in the eukaryote genome. Cells. 2022;11:3373.
doi: 10.3390/cells11213373 pubmed: 36359770 pmcid: 9659126
Zhao S, Zhang X, Chen S, Zhang S. Long noncoding RNAs: fine-tuners hidden in the cancer signaling network. Cell Death Discov. 2021;7:283. https://doi.org/10.1038/s41420-021-00678-8 .
doi: 10.1038/s41420-021-00678-8 pubmed: 34635646 pmcid: 8505617
Glavan D, Gheorman V, Gresita A, Hermann DM, Udristoiu I, Popa-Wagner A. Identification of transcriptome alterations in the prefrontal cortex, hippocampus, amygdala and hippocampus of suicide victims. Sci Rep. 2021;11:18853. https://doi.org/10.1038/s41598-021-98210-6 .
doi: 10.1038/s41598-021-98210-6 pubmed: 34552157 pmcid: 8458545
Zheng LL, Zhou KR, Liu S, Zhang DY, Wang ZL, Chen ZR, et al. dreamBase: DNA modification, RNA regulation and protein binding of expressed pseudogenes in human health and disease. Nucleic Acids Res. 2018;46:D85-d91. https://doi.org/10.1093/nar/gkx972 .
doi: 10.1093/nar/gkx972 pubmed: 29059382
Milligan MJ, Harvey E, Yu A, Morgan AL, Smith DL, Zhang E, et al. Global intersection of long non-coding RNAs with processed and unprocessed pseudogenes in the human genome. Front Genet. 2016;7:26. https://doi.org/10.3389/fgene.2016.00026 .
doi: 10.3389/fgene.2016.00026 pubmed: 27047535 pmcid: 4805607

Auteurs

M Arabfard (M)

Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran.

N Tajeddin (N)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.
Department of Biology, Central Tehran Branch, Islamic Azad University, Tehran, Iran.

S Alizadeh (S)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.

M Salesi (M)

Chemical Injuries Research Center, Systems Biology and Poisonings Institute, Baqiyatallah University of Medical Sciences, Tehran, Iran.
Research Center for Prevention of Oral and Dental Diseases, Baqiyatallah University of Medical Sciences, Tehran, Iran.

H Bayat (H)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.

H R Khorram Khorshid (HR)

Personalized Medicine and Genometabolomics Research Center, Hope Generation Foundation, Tehran, Iran.

S Khamse (S)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.

A Delbari (A)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran.

M Ohadi (M)

Iranian Research Center on Aging, University of Social Welfare and Rehabilitation Sciences, Tehran, Iran. mi.ohadi@uswr.ac.ir.

Classifications MeSH