Repetitive DNA content in the maize genome is uncoupled from population stratification at SNP loci.
Plant breeding
Population structure
Repetitive DNA
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
30 Jan 2020
30 Jan 2020
Historique:
received:
25
04
2019
accepted:
20
01
2020
entrez:
1
2
2020
pubmed:
1
2
2020
medline:
2
10
2020
Statut:
epublish
Résumé
Repetitive DNA is a major component of plant genomes and is thought to be a driver of evolutionary novelty. Describing variation in repeat content among individuals and between populations is key to elucidating the evolutionary significance of repetitive DNA. However, the cost of producing references genomes has limited large-scale intraspecific comparisons to a handful of model organisms where multiple reference genomes are available. We examine repeat content variation in the genomes of 94 elite inbred maize lines using graph-based repeat clustering, a reference-free and rapid assay of repeat content. We examine population structure using genome-wide repeat profiles, and demonstrate the stiff-stalk and non-stiff-stalk heterotic populations are homogenous with regard to global repeat content. In contrast, and similar to previously reported results, the same individuals show clear differentiation, and aggregate into two populations when examining population structure using genome-wide SNPs. Additionally, we develop a novel kmer based technique to examine the chromosomal distribution of repeat clusters in silico and show a cluster dependent association with gene density. Our results indicate global repeat content variation in the heterotic populations of maize has not diverged, and is uncoupled from population stratification at SNP loci. We show that repeat families exhibit divergent patterns with regard to chromosomal distribution, some repeat clusters accumulate in regions of high gene density, whereas others aggregate in regions of low gene density.
Sections du résumé
BACKGROUND
BACKGROUND
Repetitive DNA is a major component of plant genomes and is thought to be a driver of evolutionary novelty. Describing variation in repeat content among individuals and between populations is key to elucidating the evolutionary significance of repetitive DNA. However, the cost of producing references genomes has limited large-scale intraspecific comparisons to a handful of model organisms where multiple reference genomes are available.
RESULTS
RESULTS
We examine repeat content variation in the genomes of 94 elite inbred maize lines using graph-based repeat clustering, a reference-free and rapid assay of repeat content. We examine population structure using genome-wide repeat profiles, and demonstrate the stiff-stalk and non-stiff-stalk heterotic populations are homogenous with regard to global repeat content. In contrast, and similar to previously reported results, the same individuals show clear differentiation, and aggregate into two populations when examining population structure using genome-wide SNPs. Additionally, we develop a novel kmer based technique to examine the chromosomal distribution of repeat clusters in silico and show a cluster dependent association with gene density.
CONCLUSION
CONCLUSIONS
Our results indicate global repeat content variation in the heterotic populations of maize has not diverged, and is uncoupled from population stratification at SNP loci. We show that repeat families exhibit divergent patterns with regard to chromosomal distribution, some repeat clusters accumulate in regions of high gene density, whereas others aggregate in regions of low gene density.
Identifiants
pubmed: 32000670
doi: 10.1186/s12864-020-6517-0
pii: 10.1186/s12864-020-6517-0
pmc: PMC6993463
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
98Références
Genome Biol Evol. 2013;5(4):769-82
pubmed: 23542206
PLoS One. 2012;7(5):e36963
pubmed: 22606317
New Phytol. 2015 Oct;208(2):596-607
pubmed: 26061193
Mol Biol Evol. 2011 Oct;28(10):2843-54
pubmed: 21512105
Syst Biol. 2015 Jan;64(1):112-26
pubmed: 25261464
Annu Rev Plant Biol. 2014;65:505-30
pubmed: 24579996
Nat Biotechnol. 2012 May 13;30(6):555-61
pubmed: 22580951
Bioinformatics. 2011 Mar 15;27(6):764-70
pubmed: 21217122
Nat Genet. 2003 Jan;33(1):102-6
pubmed: 12483211
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Mob DNA. 2015 Jun 02;6:11
pubmed: 26045719
Plant Cell. 2010 Jun;22(6):1667-85
pubmed: 20581308
Science. 2009 Nov 20;326(5956):1112-5
pubmed: 19965430
Nature. 2017 Jun 22;546(7659):524-527
pubmed: 28605751
Curr Opin Plant Biol. 2017 Apr;36:135-141
pubmed: 28343122
Proc Natl Acad Sci U S A. 2012 Jul 31;109(31):12420-5
pubmed: 22802642
PLoS One. 2017 Dec 13;12(12):e0189277
pubmed: 29236738
Trends Genet. 2000 Jun;16(6):276-7
pubmed: 10827456
Bioinformatics. 2013 Mar 15;29(6):792-3
pubmed: 23376349
Genome Biol Evol. 2016 Jul 02;8(6):1940-7
pubmed: 27289095
Phys Rev E Stat Nonlin Soft Matter Phys. 2004 Dec;70(6 Pt 2):066111
pubmed: 15697438
Proc Natl Acad Sci U S A. 2004 Sep 14;101(37):13554-9
pubmed: 15342909
PLoS Genet. 2015 Jan 08;11(1):e1004915
pubmed: 25569788
BMC Genomics. 2007 Nov 21;8:427
pubmed: 18031571
New Phytol. 2013 Jul;199(2):550-8
pubmed: 23614816
Proc Natl Acad Sci U S A. 2002 Jul 9;99(14):9573-8
pubmed: 12060715
Plant J. 2013 Jun;74(5):829-39
pubmed: 23517128
Genome Biol Evol. 2011;3:219-29
pubmed: 21296765