DNA polymerase stalling at structured DNA constrains the expansion of short tandem repeats.
DNA secondary structure
Genome evolution
Genome instability
Polymerase stalling
Short tandem repeat
Journal
Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660
Informations de publication
Date de publication:
21 08 2020
21 08 2020
Historique:
received:
19
03
2020
accepted:
28
07
2020
entrez:
22
8
2020
pubmed:
21
8
2020
medline:
13
7
2021
Statut:
epublish
Résumé
Short tandem repeats (STRs) contribute significantly to de novo mutagenesis, driving phenotypic diversity and genetic disease. Although highly diverse, their repetitive sequences induce DNA polymerase slippage and stalling, leading to length and sequence variation. However, current studies of DNA synthesis through STRs are restricted to a handful of selected sequences, limiting our broader understanding of their evolutionary behaviour and hampering the characterisation of the determinants of their abundance and stability in eukaryotic genomes. We perform a comprehensive analysis of DNA synthesis at all STR permutations and interrogate the impact of STR sequence and secondary structure on their genomic representation and mutability. To do this, we developed a high-throughput primer extension assay that allows monitoring of the kinetics and fidelity of DNA synthesis through 20,000 sequences comprising all STR permutations in different lengths. By combining these measurements with population-scale genomic data, we show that the response of a model replicative DNA polymerase to variously structured DNA is sufficient to predict the complex genomic behaviour of STRs, including abundance and mutational constraints. We demonstrate that DNA polymerase stalling at DNA structures induces error-prone DNA synthesis, which constrains STR expansion. Our data support a model in which STR length in eukaryotic genomes results from a balance between expansion due to polymerase slippage at repeated DNA sequences and point mutations caused by error-prone DNA synthesis at DNA structures.
Sections du résumé
BACKGROUND
Short tandem repeats (STRs) contribute significantly to de novo mutagenesis, driving phenotypic diversity and genetic disease. Although highly diverse, their repetitive sequences induce DNA polymerase slippage and stalling, leading to length and sequence variation. However, current studies of DNA synthesis through STRs are restricted to a handful of selected sequences, limiting our broader understanding of their evolutionary behaviour and hampering the characterisation of the determinants of their abundance and stability in eukaryotic genomes.
RESULTS
We perform a comprehensive analysis of DNA synthesis at all STR permutations and interrogate the impact of STR sequence and secondary structure on their genomic representation and mutability. To do this, we developed a high-throughput primer extension assay that allows monitoring of the kinetics and fidelity of DNA synthesis through 20,000 sequences comprising all STR permutations in different lengths. By combining these measurements with population-scale genomic data, we show that the response of a model replicative DNA polymerase to variously structured DNA is sufficient to predict the complex genomic behaviour of STRs, including abundance and mutational constraints. We demonstrate that DNA polymerase stalling at DNA structures induces error-prone DNA synthesis, which constrains STR expansion.
CONCLUSIONS
Our data support a model in which STR length in eukaryotic genomes results from a balance between expansion due to polymerase slippage at repeated DNA sequences and point mutations caused by error-prone DNA synthesis at DNA structures.
Identifiants
pubmed: 32819438
doi: 10.1186/s13059-020-02124-x
pii: 10.1186/s13059-020-02124-x
pmc: PMC7441554
doi:
Substances chimiques
DNA
9007-49-2
DNA-Directed DNA Polymerase
EC 2.7.7.7
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
209Subventions
Organisme : Medical Research Council
ID : MC_U105178808
Pays : United Kingdom
Organisme : Medical Research Council
ID : U105178808
Pays : United Kingdom
Références
Cell. 1995 May 19;81(4):533-40
pubmed: 7758107
Trends Mol Med. 2013 Nov;19(11):655-63
pubmed: 23953480
Biochemistry. 2004 Nov 9;43(44):14218-27
pubmed: 15518572
Cold Spring Harb Perspect Biol. 2013 Jun 01;5(6):
pubmed: 23732474
Nat Rev Genet. 2011 Nov 29;13(1):36-46
pubmed: 22124482
J Biol Chem. 2004 Oct 1;279(40):41715-26
pubmed: 15292165
Proc Natl Acad Sci U S A. 1999 Feb 16;96(4):1504-9
pubmed: 9990053
Proc Natl Acad Sci U S A. 2014 Mar 18;111(11):4073-8
pubmed: 24591606
Nature. 2007 Jun 21;447(7147):932-40
pubmed: 17581576
Nat Rev Genet. 2018 May;19(5):286-298
pubmed: 29398703
Nucleic Acids Res. 2016 Jul 8;44(W1):W160-5
pubmed: 27079975
J Mol Biol. 1962 Jul;5:109-18
pubmed: 14470099
Elife. 2017 Apr 22;6:
pubmed: 28432790
Nucleic Acids Res. 2001 Jan 1;29(1):308-11
pubmed: 11125122
Angew Chem Int Ed Engl. 2018 Feb 19;57(8):2165-2169
pubmed: 29266664
Crit Rev Biochem Mol Biol. 2015 Mar-Apr;50(2):142-67
pubmed: 25608779
Nucleic Acids Res. 2005 Jul 08;33(12):3785-98
pubmed: 16006624
BMC Genomics. 2019 Feb 22;20(1):153
pubmed: 30795733
J Biol Chem. 1989 Apr 15;264(11):6447-58
pubmed: 2703498
Genome Res. 2018 Sep;28(9):1264-1271
pubmed: 30104284
Genome Biol. 2020 Aug 21;21(1):209
pubmed: 32819438
Nat Struct Mol Biol. 2018 Aug;25(8):669-676
pubmed: 30061600
Nucleic Acids Res. 1998 Sep 1;26(17):4056-62
pubmed: 9705519
Genet Res. 1973 Oct;22(2):201-4
pubmed: 4777279
Nat Genet. 2019 Nov;51(11):1652-1659
pubmed: 31676866
Nucleic Acids Res. 2020 Jan 8;48(D1):D155-D159
pubmed: 31599331
Nucleic Acids Res. 2015 Sep 30;43(17):8590-600
pubmed: 26253741
Genome Res. 2010 Nov;20(11):1469-81
pubmed: 20841430
Nucleic Acids Res. 2003 Jul 1;31(13):3406-15
pubmed: 12824337
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Nucleic Acids Res. 2014 Nov 10;42(20):12367-79
pubmed: 25336616
Proc Natl Acad Sci U S A. 1998 Sep 1;95(18):10774-8
pubmed: 9724780
Nucleic Acids Res. 1992 Jan 25;20(2):211-5
pubmed: 1741246
Nat Genet. 2012 Oct;44(10):1161-5
pubmed: 22922873
Bioinformatics. 2015 Nov 15;31(22):3718-20
pubmed: 26209431
Nat Genet. 2016 Jan;48(1):22-9
pubmed: 26642241
Proc Natl Acad Sci U S A. 1999 Mar 2;96(5):1823-5
pubmed: 10051552
EMBO J. 2014 Nov 3;33(21):2521-33
pubmed: 25193968
Mol Biol Evol. 1987 May;4(3):203-21
pubmed: 3328815
Mol Biotechnol. 2010 Mar;44(3):250-66
pubmed: 20012711
Nucleic Acids Res. 2017 Aug 21;45(14):e132
pubmed: 28586438
Nucleic Acids Res. 2016 Feb 29;44(4):1746-59
pubmed: 26792894
Science. 2019 Feb 22;363(6429):
pubmed: 30679383
Nat Rev Genet. 2004 Jun;5(6):435-45
pubmed: 15153996
Nat Genet. 2017 Oct;49(10):1495-1501
pubmed: 28892063
Bioinformatics. 2009 Aug 15;25(16):2078-9
pubmed: 19505943