The genomic basis of geographic differentiation and fiber improvement in cultivated cotton.
Journal
Nature genetics
ISSN: 1546-1718
Titre abrégé: Nat Genet
Pays: United States
ID NLM: 9216904
Informations de publication
Date de publication:
06 2021
06 2021
Historique:
received:
19
11
2020
accepted:
15
03
2021
pubmed:
17
4
2021
medline:
21
7
2021
entrez:
16
4
2021
Statut:
ppublish
Résumé
Large-scale genomic surveys of crop germplasm are important for understanding the genetic architecture of favorable traits. The genomic basis of geographic differentiation and fiber improvement in cultivated cotton is poorly understood. Here, we analyzed 3,248 tetraploid cotton genomes and confirmed that the extensive chromosome inversions on chromosomes A06 and A08 underlies the geographic differentiation in cultivated Gossypium hirsutum. We further revealed that the haplotypic diversity originated from landraces, which might be essential for understanding adaptative evolution in cultivated cotton. Introgression and association analyses identified new fiber quality-related loci and demonstrated that the introgressed alleles from two diploid cottons had a large effect on fiber quality improvement. These loci provided the potential power to overcome the bottleneck in fiber quality improvement. Our study uncovered several critical genomic signatures generated by historical breeding effects in cotton and a wealth of data that enrich genomic resources for the research community.
Identifiants
pubmed: 33859417
doi: 10.1038/s41588-021-00844-9
pii: 10.1038/s41588-021-00844-9
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
916-924Références
Lubbers, E. L. & Chee, P. W. in Genetics and Genomics of Cotton Part I (ed. Paterson, A. H.) 23–52 (Springer, 2009).
Brubaker, C. L. & Wendel, J. F. Reevaluating the origin of domesticated cotton (Gossypium hirsutum; Malvaceae) using nuclear restriction fragment length polymorphisms (RFLPs). Am. J. Bot. 81, 1309–1326 (1994).
doi: 10.1002/j.1537-2197.1994.tb11453.x
Yuan, D. et al. Parallel and intertwining threads of domestication in allopolyploid cotton. Adv. Sci. https://doi.org/10.1002/advs.202003634 (2021).
Wendel, J. F., Brubaker, C. L. & Seelanan, T. in Physiology of Cotton (eds Stewart, J. M. et al.) 1–18 (Springer, 2010).
Hoffmann, A. A. & Rieseberg, L. H. Revisiting the impact of inversions in evolution: from population genetic markers to drivers of adaptive shifts and speciation? Annu. Rev. Ecol. Evol. Syst. 39, 21–42 (2008).
pubmed: 20419035
pmcid: 2858385
doi: 10.1146/annurev.ecolsys.39.110707.173532
Wellenreuther, M. & Bernatchez, L. Eco-evolutionary genomics of chromosomal inversions. Trends Ecol. Evol. 33, 427–440 (2018).
pubmed: 29731154
doi: 10.1016/j.tree.2018.04.002
Fang, L. et al. Genomic analyses in cotton identify signatures of selection and loci associated with fiber quality and yield traits. Nat. Genet. 49, 1089–1098 (2017).
pubmed: 28581501
doi: 10.1038/ng.3887
Wang, M. et al. Asymmetric subgenome selection and cis-regulatory divergence during cotton domestication. Nat. Genet. 49, 579–587 (2017).
pubmed: 28263319
doi: 10.1038/ng.3807
Jia, G. et al. A haplotype map of genomic variations and genome-wide association studies of agronomic traits in foxtail millet (Setaria italica). Nat. Genet. 45, 957–961 (2013).
pubmed: 23793027
doi: 10.1038/ng.2673
Zhou, Z. et al. Resequencing 302 wild and cultivated accessions identifies genes related to domestication and improvement in soybean. Nat. Biotechnol. 33, 408–414 (2015).
pubmed: 25643055
doi: 10.1038/nbt.3096
Dai, P. et al. Extensive haplotypes are associated with population differentiation and environmental adaptability in upland cotton (Gossypium hirsutum). Theor. Appl. Genet. 133, 3273–3285 (2020).
pubmed: 32844253
doi: 10.1007/s00122-020-03668-z
Yang, Z. et al. Extensive intraspecific gene order and gene structural variations in upland cotton cultivars. Nat. Commun. 10, 2989 (2019).
Huang, G. et al. Genome sequence of Gossypium herbaceum and genome updates of Gossypium arboreum and Gossypium hirsutum provide insights into cotton A-genome evolution. Nat. Genet. 52, 516–524 (2020).
pubmed: 32284579
pmcid: 7203013
doi: 10.1038/s41588-020-0607-4
Chen, Z. J. et al. Genomic diversifications of five Gossypium allopolyploid species and their impact on cotton improvement. Nat. Genet. 52, 525–533 (2020).
pubmed: 32313247
pmcid: 7203012
doi: 10.1038/s41588-020-0614-5
Ma, Z. et al. Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield. Nat. Genet. 50, 803–813 (2018).
pubmed: 29736016
doi: 10.1038/s41588-018-0119-7
Mascher, M. et al. Genebank genomics bridges the gap between the conservation of crop diversity and plant breeding. Nat. Genet. 51, 1076–1081 (2019).
pubmed: 31253974
doi: 10.1038/s41588-019-0443-6
Wang, W. et al. Genomic variation in 3,010 diverse accessions of Asian cultivated rice. Nature 557, 43–49 (2018).
pubmed: 29695866
pmcid: 6784863
doi: 10.1038/s41586-018-0063-9
He, F. et al. Exome sequencing highlights the role of wild-relative introgression in shaping the adaptive landscape of the wheat genome. Nat. Genet. 51, 896–904 (2019).
pubmed: 31043759
doi: 10.1038/s41588-019-0382-2
Zhao, G. et al. A comprehensive genome variation map of melon identifies multiple domestication events and loci influencing agronomic traits. Nat. Genet. 51, 1607–1615 (2019).
pubmed: 31676864
doi: 10.1038/s41588-019-0522-8
Jia, Y., Sun, J. & Du X. in World Cotton Germplasm Resources (ed. Abdurakhmonov, I. Y.) 35–53 (IntechOpen, 2014).
Wendel, J. F., Rowley, R. & Stewart, J. M. Genetic diversity in and phylogenetic relationships of the Brazilian endemic cotton, Gossypium mustelinum (Malvaceae). Plant Syst. Evol. 192, 49–59 (1994).
doi: 10.1007/BF00985907
Hutchinson, J. B. Intra-specific differentiation in Gossypium hirsutum. Heredity 5, 161–193 (1951).
doi: 10.1038/hdy.1951.19
He, S. et al. Introgression leads to genomic divergence and responsible for important traits in upland cotton. Front. Plant Sci. 11, 929 (2020).
pubmed: 32774337
pmcid: 7381389
doi: 10.3389/fpls.2020.00929
Zamir, D. Improving plant breeding with exotic genetic libraries. Nat. Rev. Genet. 2, 983–989 (2001).
pubmed: 11733751
doi: 10.1038/35103590
Wendel, J. F., Brubaker, C. L. & Percival, A. E. Genetic diversity in Gossypium hirsutum and the origin of upland cotton. Am. J. Bot. 79, 1291–1310 (1992).
doi: 10.1002/j.1537-2197.1992.tb13734.x
Beasley, J. O. The origin of American tetraploid Gossypium species. Am. Nat. 74, 285–286 (1940).
doi: 10.1086/280895
Wang, L. et al. Alien genomic introgressions enhanced fiber strength in upland cotton (Gossypium hirsutum L.). Ind. Crop. Prod. 159, 113028 (2021).
doi: 10.1016/j.indcrop.2020.113028
Campbell, B. T. et al. Genetic improvement of the Pee Dee cotton germplasm collection following seventy years of plant breeding. Crop Sci. 51, 955–968 (2011).
doi: 10.2135/cropsci2010.09.0545
Thyssen, G. N. et al. Whole genome sequencing of a MAGIC population identified genomic loci and candidate genes for major fiber quality traits in upland cotton (Gossypium hirsutum L.). Theor. Appl. Genet. 132, 989–999 (2019).
pubmed: 30506522
doi: 10.1007/s00122-018-3254-8
Whittaker, D. J. & Triplett, B. A. Gene-specific changes in alpha-tubulin transcript accumulation in developing cotton fibers. Plant Physiol. 121, 181–188 (1999).
pubmed: 10482673
pmcid: 59366
doi: 10.1104/pp.121.1.181
Jones, F. C. et al. The genomic basis of adaptive evolution in threespine sticklebacks. Nature 484, 55–61 (2012).
pubmed: 22481358
pmcid: 3322419
doi: 10.1038/nature10944
Cheng, C. et al. Ecological genomics of Anopheles gambiae along a latitudinal cline: a population-resequencing approach. Genetics 190, 1417–1432 (2012).
pubmed: 22209907
pmcid: 3316653
doi: 10.1534/genetics.111.137794
Küpper, C. et al. A supergene determines highly divergent male reproductive morphs in the ruff. Nat. Genet. 48, 79–83 (2016).
pubmed: 26569125
doi: 10.1038/ng.3443
Wang, J. et al. A Y-like social chromosome causes alternative colony organization in fire ants. Nature 493, 664–668 (2013).
pubmed: 23334415
doi: 10.1038/nature11832
Kirubakaran, T. G. et al. Two adjacent inversions maintain genomic differentiation between migratory and stationary ecotypes of Atlantic cod. Mol. Ecol. 25, 2130–2143 (2016).
pubmed: 26923504
doi: 10.1111/mec.13592
Berg, P. R. et al. Trans-oceanic genomic divergence of Atlantic cod ecotypes is associated with large inversions. Heredity 119, 418–428 (2017).
pubmed: 28930288
pmcid: 5677996
doi: 10.1038/hdy.2017.54
Lowry, D. B. & Willis, J. H. A widespread chromosomal inversion polymorphism contributes to a major life-history transition, local adaptation, and reproductive isolation. PLoS Biol. 8, e1000500 (2010).
pubmed: 20927411
pmcid: 2946948
doi: 10.1371/journal.pbio.1000500
Fang, Z. et al. Megabase-scale inversion polymorphism in the wild ancestor of maize. Genetics 191, 883–894 (2012).
pubmed: 22542971
pmcid: 3389981
doi: 10.1534/genetics.112.138578
Lee, C. R. et al. Young inversion with multiple linked QTLs under selection in a hybrid zone. Nat. Ecol. Evol. 1, 119 (2017).
pubmed: 28812690
pmcid: 5607633
doi: 10.1038/s41559-017-0119
Rieseberg, L. H. Chromosomal rearrangements and speciation. Trends Ecol. Evol. 16, 351–358 (2001).
pubmed: 11403867
doi: 10.1016/S0169-5347(01)02187-5
Todesco, M. et al. Massive haplotypes underlie ecotypic differentiation in sunflowers. Nature 584, 602–607 (2020).
pubmed: 32641831
doi: 10.1038/s41586-020-2467-6
Minka, T. P. & Deckmyn, A. maps: draw geographical maps. R package version 3.3.0 https://cran.r-project.org/web/packages/maps/ (2018).
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2019).
Fick, S. E. & Hijmans, R. J. WorldClim 2: new 1‐km spatial resolution climate surfaces for global land areas. Int. J. Climatol. 37, 4302–4315 (2017).
doi: 10.1002/joc.5086
QGIS Geographic Information System v.3.16.3 (Open Source Geospatial Foundation Project, 2020).
Paterson, A. H., Brubaker, C. L. & Wendel, J. F. A rapid method for extraction of cotton (Gossypium spp.) genomic DNA suitable for RFLP or PCR analysis. Plant Mol. Biol. Rep. 11, 122–127 (1993).
doi: 10.1007/BF02670470
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
pubmed: 19451168
pmcid: 2705234
doi: 10.1093/bioinformatics/btp324
McKenna, A. et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 20, 1297–1303 (2010).
pubmed: 20644199
pmcid: 2928508
doi: 10.1101/gr.107524.110
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Danecek, P. et al. The variant call format and VCFtools. Bioinformatics 27, 2156–2158 (2011).
pubmed: 21653522
pmcid: 3137218
doi: 10.1093/bioinformatics/btr330
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol. Biol. Evol. 26, 1641–1650 (2009).
pubmed: 19377059
pmcid: 2693737
doi: 10.1093/molbev/msp077
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38, 904–909 (2006).
pubmed: 16862161
doi: 10.1038/ng1847
Alexander, D. H., Novembre, J. & Lange, K. Fast model-based estimation of ancestry in unrelated individuals. Genome Res. 19, 1655–1664 (2009).
pubmed: 19648217
pmcid: 2752134
doi: 10.1101/gr.094052.109
Lieberman-Aiden, E. et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science 326, 289–293 (2009).
pubmed: 19815776
pmcid: 2858594
doi: 10.1126/science.1181369
Koren, S. et al. Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation. Genome Res. 27, 722–736 (2017).
pubmed: 28298431
pmcid: 5411767
doi: 10.1101/gr.215087.116
Dudchenko, O. et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science 356, 92–95 (2017).
pubmed: 28336562
pmcid: 5635820
doi: 10.1126/science.aal3327
Bates, D., Mächler, M., Bolker, B. M. & Walker, S. C. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48 (2014).
Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat. Genet. 42, 348–354 (2010).
pubmed: 20208533
pmcid: 3092069
doi: 10.1038/ng.548
Li, B. & Dewey, C. N. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics 12, 323 (2011).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the [Formula: see text] method. Methods 25, 402–408 (2001).
Clough, S. J. & Bent, A. F. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 16, 735–743 (1998).
pubmed: 10069079
doi: 10.1046/j.1365-313x.1998.00343.x