Double-digest restriction-associated DNA sequencing-based genotyping and its applications in sesame germplasm management.


Journal

The plant genome
ISSN: 1940-3372
Titre abrégé: Plant Genome
Pays: United States
ID NLM: 101273919

Informations de publication

Date de publication:
17 Apr 2024
Historique:
revised: 29 02 2024
received: 27 09 2023
accepted: 03 03 2024
medline: 17 4 2024
pubmed: 17 4 2024
entrez: 17 4 2024
Statut: aheadofprint

Résumé

Sesame (Sesamum indicum L.) is an ancient oilseed crop belonging to the family Pedaliaceae and a globally cultivated crop for its use as oil and food. In this study, 2496 sesame accessions, being conserved at the National Genebank of ICAR-National Bureau of Plant Genetic Resources (NBPGR), were genotyped using genomics-assisted double-digest restriction-associated DNA sequencing (ddRAD-seq) approach. A total of 64,910 filtered single-nucleotide polymorphisms (SNPs) were utilized to assess the genome-scale diversity. Applications of this genome-scale information (reduced representation using restriction enzymes) are demonstrated through the development of a molecular core collection (CC) representing maximal SNP diversity. This information is also applied in developing a mid-density panel (MDP) comprising 2515 hyper-variable SNPs, representing almost equally the genic and non-genic regions. The sesame CC comprising 384 accessions, a representative set of accessions with maximal diversity, was identified using multiple criteria such as k-mer (subsequence of length "k" in a sequence read) diversity, observed heterozygosity, CoreHunter3, GenoCore, and genetic differentiation. The coreset constituted around 15% of the total accessions studied, and this small subset had captured >60% SNP diversity of the entire population. In the coreset, the admixture analysis shows reduced genetic complexity, increased nucleotide diversity (π), and is geographically distributed without any repetitiveness in the CC germplasm. Within the CC, India-originated accessions exhibit higher diversity (as expected based on the center of diversity concept), than those accessions that were procured from various other countries. The identified CC set and the MDP will be a valuable resource for genomics-assisted accelerated sesame improvement program.

Identifiants

pubmed: 38628142
doi: 10.1002/tpg2.20447
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e20447

Subventions

Organisme : Department of Biotechnology, Ministry of Science and Technology, India
ID : 16113200037-1012166

Informations de copyright

© 2024 The Authors. The Plant Genome published by Wiley Periodicals LLC on behalf of Crop Science Society of America.

Références

Andrews, S. (2015). A quality control tool for highthroughput sequence data. https://www.bioinformatics.babraham.ac.uk/projects/fastqc/
Arbelaez, J. D., Dwiyanti, M. S., Tandayu, E., Llantada, K., Jarana, A., Ignacio, J. C., Platten, J. D., Cobb, J., Rutkoski, J. E., Thomson, M. J., & Kretzschmar, T. (2019). 1k‐RiCA (1K‐Rice Custom Amplicon) a novel genotyping amplicon‐based SNP assay for genetics and breeding applications in rice. Rice, 12(1), 1–15.
Basak, M., Uzun, B., & Yol, E. (2019). Genetic diversity and population structure of the Mediterranean sesame core collection with use of genome‐wide SNPs developed by double digest RAD‐Seq. PloS One, 14(10), e0223757.
Bedigian, D. (2003). Evolution of sesame revisited: Domestication, diversity and prospects. Genetic Resources and Crop Evolution, 50, 779–787.
Bhat, V. K., Babrekar, P. P., & Lakhanpaul, S. (1999). Study of genetic diversity in Indian and exotic sesame (Sesamum indicum L.) germplasm using random amplified polymorphic DNA (RAPD) markers. Euphytica, 110, 21–34.
Bhunia, R. K., Chakraborty, A., Kaur, R., Gayatri, T., Bhat, K. V., Basu, A., Maiti, K., & Sen, S. K. (2015). Analysis of fatty acid and lignan composition of Indian germplasm of sesame to evaluate their nutritional merits. Journal of the American Oil Chemists' Society, 92(1), 65–76.
Bisht, I., Bhat, K., Lakhanpaul, S., Biswas, B., Pandiyan, M., & Hanchinal, R. (2004). Broadening the genetic base of sesame (Sesamum indicum L.) through germplasm enhancement. Plant Genetic Resources Characterization and Utilization, 2(3), 143–151.
Bisht, I. S., Mahajan, R. K., Loknathan, T., & Agrawal, R. (1998). Diversity in Indian sesame collection and stratification of germplasm accessions in different diversity groups. Genetic Resources and Crop Evolution, 45, 325–335.
Bolger, A. M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics, 30(15), 2114–2120.
Brown, A. (1989a). The case for core collections. In A. Brown, O. Frankel, D. Marshall, & J. T. Williams (Eds.), The use of plant genetic resources (pp. 136–156). Cambridge University Press.
Brown, A. (1989b). Core collections: A practical approach to genetic resources management. Genome, 31(2), 818–824.
Bukowski, R., Guo, X., Lu, Y., Zou, C., He, B., Rong, Z., & Xie, C. (2018). Construction of the third‐generation Zea mays haplotype map. Gigascience, 7(4), gix134.
Catchen, J., Hohenlohe, P. A., Bassham, S., Amores, A., & Cresko, W. A. (2013). Stacks: An analysis tool set for population genomics. Molecular Ecology, 22(11), 3124–3140.
Cingolani, P., Platts, A., Wang, L. L., Coon, M., Nguyen, T., Wang, L., Land, S. J., Lu, X., & Ruden, D. M. (2012). A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso‐2; iso‐3. Fly, 6(2), 80–92.
Danecek, P., Auton, A., Abecasis, G., Albers, C. A., Banks, E., DePristo, M. A., Handsaker, R. E., Lunter, G., Marth, G. T., Sherry, S. T., McVean, G., & Durbin, R. (2011). The variant call format and VCFtools. Bioinformatics, 27(15), 2156–2158.
Danecek, P., Bonfield, J. K., Liddle, J., Marshall, J., Ohan, V., Pollard, M. O., Whitwham, A., Keane, T., McCarthy, S. A., Davies, R. M., & Li, H. (2021). Twelve years of SAMtools and BCFtools. Gigascience, 10(2), giab008. 10.1093/gigascience/giab008
de Beukelaer, H., Davenport, G. F., & Fack, V. (2018). Core hunter 3: Flexible core subset selection. BMC Bioinformatics, 19, 1–12.
Deokar, A. A., Ramsay, L., Sharpe, A. G., Diapari, M., Sindhu, A., Bett, K., Warkentin, T. D., & Tar'an, B. (2014). Genome wide SNP identification in chickpea for use in development of a high density genetic map and improvement of chickpea reference genome assembly. BMC Genomics, 15, 1–19.
Elshire, R. J., Glaubitz, J. C., Sun, Q., Poland, J. A., Kawamoto, K., Buckler, E. S., & Mitchell, S. E. (2011). A robust, simple genotyping‐by‐sequencing (GBS) approach for high diversity species. PloS One, 6(5), e19379.
Ewels, P., Magnusson, M., Lundin, S., & Käller, M. (2016). MultiQC: Summarize analysis results for multiple tools and samples in a single report. Bioinformatics, 32(19), 3047–3048.
Franco‐Duran, J., Crossa, J., Chen, J., & Hearne, S. J. (2019). The impact of sample selection strategies on genetic diversity and representativeness in germplasm bank collections. BMC Plant Biology, 19(1), 1–17.
Holbrook, C. C., Timper, P., & Xue, H. (2000). Evaluation of the core collection approach for identifying resistance to Meloidogyne arenaria in peanut. Crop Science, 40(4), 1172–1175.
Jeong, S., Kim, J.‐Y., Jeong, S.‐C., Kang, S.‐T., Moon, J.‐K., & Kim, N. (2017). GenoCore: A simple and fast algorithm for core subset selection from large genotype datasets. PloS One, 12(7), e0181420.
Kim, K.‐W., Chung, H.‐K., Cho, G.‐T., Ma, K.‐H., Chandrabalan, D., Gwag, J.‐G., & Park, Y.‐J. (2007). PowerCore: A program applying the advanced M strategy with a heuristic search for establishing core sets. Bioinformatics, 23(16), 2155–2162.
Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., & Marra, M. A. (2009). Circos: An information aesthetic for comparative genomics. Genome Research, 19(9), 1639–1645.
Kuo, H.‐I., Dai, H.‐Y., Wu, Y.‐P., & Tseng, Y.‐C. (2021). Peanut germplasm evaluation for agronomic traits and disease resistance under a two‐season cropping system in Taiwan. Agriculture, 11(12), 1277.
Letunic, I., & Bork, P. (2019). Interactive tree of life (iTOL) v4: Recent updates and new developments. Nucleic Acids Research, 47(W1), W256–W259.
Li, H., & Durbin, R. (2010). Fast and accurate long‐read alignment with Burrows–Wheeler transform. Bioinformatics, 26(5), 589–595.
Li, H., Handsaker, B., Wysoker, A., Fennell, T., Ruan, J., Homer, N., Marth, G., Abecasis, G., Durbin, R., & Subgroup, G. P. D. P. (2009). The sequence alignment/map format and SAMtools. Bioinformatics, 25(16), 2078–2079.
Liu, C.‐C., Shringarpure, S., Lange, K., & Novembre, J. (2020). Exploring population structure with admixture models and principal component analysis. Methods Molecular Biology, 2090, 67–86.
Magalhaes, J. V., Liu, J., Guimaraes, C. T., Lana, U. G., Alves, V. M., Wang, Y.‐H., Schaffert, R. E., Hoekenga, O. A., Piñeros, M. A., Shaff, J. E., Klein, P. E., Carneiro, N. P., Coelho, C. M., Trick, H. N., & Kochian, L. V. (2007). A gene in the multidrug and toxic compound extrusion (MATE) family confers aluminum tolerance in sorghum. Nature Genetics, 39(9), 1156–1161.
Marçais, G., & Kingsford, C. (2011). A fast, lock‐free approach for efficient parallel counting of occurrences of k‐mers. Bioinformatics, 27(6), 764–770.
Mondal, N., Bhat, K., Srivastava, P., & Sen, S. (2016). Effects of domestication bottleneck and selection on fatty acid desaturases in Indian sesame germplasm. Plant Genetic Resources Characterization and Utilization, 14(2), 81–90.
Nawade, B., Kumar, A., Maurya, R., Subramani, R., Yadav, R., Singh, K., & Rangan, P. (2022). Longer duration of active oil biosynthesis during seed development is crucial for high oil yield—Lessons from genome‐wide in silico mining and RNA‐Seq validation in sesame. Plants, 11(21), 2980.
Okonechnikov, K., Conesa, A., & García‐Alcalde, F. (2016). Qualimap 2: Advanced multi‐sample quality control for high‐throughput sequencing data. Bioinformatics, 32(2), 292–294.
Oliveira, M. F., Nelson, R. L., Geraldi, I. O., Cruz, C. D., & de Toledo, J. F. F. (2010). Establishing a soybean germplasm core collection. Field Crops Research, 119(2‐3), 277–289.
Pathak, N., Bhaduri, A., Bhat, K., & Rai, A. (2015). Tracking sesamin synthase gene expression through seed maturity in wild and cultivated sesame species—A domestication footprint. Plant Biology, 17(5), 1039–1046.
Pathak, N., Rai, A. K., Kumari, R., Thapa, A., & Bhat, K. V. (2014). Sesame crop: An underexploited oilseed holds tremendous potential for enhanced food value. Agricultural Sciences, 5(6), 46023.
Peterson, B. K., Weber, J. N., Kay, E. H., Fisher, H. S., & Hoekstra, H. E. (2012). Double digest RADseq: An inexpensive method for de novo SNP discovery and genotyping in model and non‐model species. PloS One, 7(5), e37135.
Pootakham, W. (2023). Genotyping by sequencing (GBS) for genome‐wide SNP identification in plants. In Y. Shavrukov (Ed.), Plant genotyping: Methods and protocols (pp. 1–8). Springer Nature.
Pootakham, W., Sonthirod, C., Naktang, C., Jomchai, N., Sangsrakru, D., & Tangphatsornruang, S. (2016). Effects of methylation‐sensitive enzymes on the enrichment of genic SNPs and the degree of genome complexity reduction in a two‐enzyme genotyping‐by‐sequencing (GBS) approach: a case study in oil palm (Elaeis guineensis). Molecular Breeding, 36, 1–7.
Rangan, P., Henry, R., Wambugu, P., & Periyannan, S. (2023). Plant genetic and genomic resources for sustained crop improvement. Frontiers in Plant Science, 14, 1266698.
Rangan, P., Pradheep, K., Archak, S., Smýkal, P., & Henry, R. (2023). Genomics and phenomics of crop wild relatives (CWRs) for crop improvement. Frontiers in Plant Science, 14, 1221601.
Ruperao, P., Bajaj, P., Subramani, R., Yadav, R., Reddy Lachagari, V. B., Lekkala, S. P., Rathore, A., Archak, S., Angadi, U. B., Singh, R., Singh, K., Mayes, S., & Rangan, P. (2023). A pilot‐scale comparison between single and double‐digest RAD markers generated using GBS strategy in sesame (Sesamum indicum L.). PloS One, 18(6), e0286599.
Semalaiyappan, J., Selvanayagam, S., Rathore, A., Gupta, S. K., Chakraborty, A., Gujjula, K. R., Haktan, S., Viswanath, A., Malipatil, R., Shah, P., Govindaraj, M., Ignacio, J. C., Reddy, S., & Singh, A. K. (2023). Development of a new AgriSeq 4K mid‐density SNP genotyping panel and its utility in pearl millet breeding. Frontiers in Plant Science, 13, 1068883.
Shannon, C. E. (1948). A mathematical theory of communication. The Bell System Technical Journal, 27(3), 379–423.
Stehno, Z., Faberová, I., Dotlačil, L., Martynov, S., & Dobrotvorskaya, T. (2006). Genealogical analysis in the Czech spring wheat collection and its use for the creation of core collection. Czech J Genet Plant Breed, 42, 117–125.
Thachuk, C., Crossa, J., Franco, J., Dreisigacker, S., Warburton, M., & Davenport, G. F. (2009). Core hunter: An algorithm for sampling genetic resources based on multiple genetic measures. BMC Bioinformatics, 10, 1–13.
Upadhyaya, H. D., Dronavalli, N., Dwivedi, S. L., Kashiwagi, J., Krishnamurthy, L., Pande, S., Sharma, H. C., Vadez, V., Singh, S., Varshney, R. L., & Gowda, C. L. L. (2013). Mini core collection as a resource to identify new sources of variation. Crop Science, 53(6), 2506–2517.
Upadhyaya, H., Dwivedi, S., Vadez, V., Hamidou, F., Singh, S., Varshney, R., & Liao, B. (2014). Multiple resistant and nutritionally dense germplasm identified from mini core collection in peanut. Crop Science, 54(2), 679–693.
Upadhyaya, H., Pundir, R., Dwivedi, S., Gowda, C., Reddy, V. G., & Singh, S. (2009). Developing a mini core collection of sorghum for diversified utilization of germplasm. Crop Science, 49(5), 1769–1780.
Uzun, B., Arslan, Ç., & Furat, Ş. (2008). Variation in fatty acid compositions, oil content and oil yield in a germplasm collection of sesame (Sesamum indicum L.). Journal of the American Oil Chemists' Society, 85(12), 1135–1142.
Van Rheenen, H. (1980). Aspects of natural cross‐fertilization in sesame (Sesamum indicum L.). Tropical agriculture. Trinidad and Tobago, 57(1), 53–59.
Wang, J.‐C., Jin, H., Zhang, C.‐f., & Zhang, S. (2007). Assessment on evaluating parameters of rice core collections constructed by genotypic values and molecular marker information. Rice Science, 14(2), 101–110.
Wang, L., Xia, Q., Zhang, Y., Zhu, X., Zhu, X., Li, D., Ni, X., Gao, Y., Xiang, H., Wei, X., Yu, J., Quan, Z., & Zhang, X. (2016). Updated sesame genome assembly and fine mapping of plant height and seed coat color QTLs using a new high‐density genetic map. BMC Genomics, 17(1), e31.
Williams, J. T., & Holden, J. H. (Eds.). (1984). Crop genetic resources: Conservation & evaluation. George Allen & Unwin.
Xiurong, Z., Yingzhong, Z., Yong, C., Xiangyun, F., Qingyuan, G., Mingde, Z., & Hodgkin, T. (2000). Establishment of sesame germplasm core collection in China. Genetic Resources and Crop Evolution, 47, 273–279.
Yadav, R., Kalia, S., Rangan, P., Pradheep, K., Rao, G. P., Kaur, V., Pandey, R., Rai, V., Vasimalla, C. C., Langyan, S., Sharma, S., Thangavel, B., Rana, V. S., Vishwakarma, H., Shah, A., Saxena, A., Kumar, A., & Singh, K. (2022). Current research trends and prospects for yield and quality improvement in sesame, an important oilseed crop. Frontiers in Plant Science, 13, 863521.
Yol, E., Toker, R., Golukcu, M., & Uzun, B. (2015). Oil content and fatty acid characteristics in Mediterranean sesame core collection. Crop Science, 55(5), 2177–2185.
Yol, E., & Uzun, B. (2012). Geographical patterns of sesame accessions grown under Mediterranean environmental conditions, and establishment of a core collection. Crop Science, 52(5), 2206–2214.
Zhang, H., Zhang, D., Wang, M., Sun, J., Qi, Y., Li, J., Wei, X., Han, L., Qiu, Z., Tang, S., & Li, Z. (2011). A core collection and mini core collection of Oryza sativa L. in China. Theoretical and Applied Genetics, 122, 49–61.
Zhang, P., Li, J., Li, X., Liu, X., Zhao, X., & Lu, Y. (2011). Population structure and genetic diversity in a rice core collection (Oryza sativa L.) investigated with SSR markers. PLoS One, 6(12), e27565.

Auteurs

Pradeep Ruperao (P)

Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.

Prasad Bajaj (P)

Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.

Rashmi Yadav (R)

ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India.

Mahalingam Angamuthu (M)

TNAU-Regional Research Station, Vriddhachalam, India.

Rajkumar Subramani (R)

ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India.

Vandana Rai (V)

ICAR-National Institute of Plant Biotechnology, PUSA Campus, New Delhi, India.

Kapil Tiwari (K)

Sardarkrushinagar Dantiwada Agricultural University, Sardarkrushinagar, India.

Abhishek Rathore (A)

Excellence in Breeding Platform, CIMMYT, Hyderabad, India.

Kuldeep Singh (K)

Genebank, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.

Gyanendra Pratap Singh (GP)

ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India.

Ulavappa B Angadi (UB)

ICAR-Indian Agricultural Statistical Research Institute, New Delhi, India.

Sean Mayes (S)

Center of Excellence in Genomics and Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, India.

Parimalan Rangan (P)

ICAR-National Bureau of Plant Genetic Resources, PUSA Campus, New Delhi, India.
Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, St. Lucia, Queensland, Australia.

Classifications MeSH