Chromosome-level genome assembly and population genomic resource to accelerate orphan crop lablab breeding.
Journal
Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555
Informations de publication
Date de publication:
17 04 2023
17 04 2023
Historique:
received:
21
05
2022
accepted:
17
03
2023
medline:
19
4
2023
entrez:
17
4
2023
pubmed:
18
4
2023
Statut:
epublish
Résumé
Under-utilised orphan crops hold the key to diversified and climate-resilient food systems. Here, we report on orphan crop genomics using the case of Lablab purpureus (L.) Sweet (lablab) - a legume native to Africa and cultivated throughout the tropics for food and forage. Our Africa-led plant genome collaboration produces a high-quality chromosome-scale assembly of the lablab genome. Our assembly highlights the genome organisation of the trypsin inhibitor genes - an important anti-nutritional factor in lablab. We also re-sequence cultivated and wild lablab accessions from Africa confirming two domestication events. Finally, we examine the genetic and phenotypic diversity in a comprehensive lablab germplasm collection and identify genomic loci underlying variation of important agronomic traits in lablab. The genomic data generated here provide a valuable resource for lablab improvement. Our inclusive collaborative approach also presents an example that can be explored by other researchers sequencing indigenous crops, particularly from low and middle-income countries (LMIC).
Identifiants
pubmed: 37069152
doi: 10.1038/s41467-023-37489-7
pii: 10.1038/s41467-023-37489-7
pmc: PMC10110558
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1915Informations de copyright
© 2023. The Author(s).
Références
FAO. Faostat: FAO Statistical Databases. (Food & Agriculture Organization of the United Nations (FAO), 2000).
The war in Ukraine is exposing gaps in the world’s food-systems research. Nature 604, 217–218 (2022).
Chapman, M. A., He, Y. & Zhou, M. Beyond a reference genome: pangenomes and population genomics of underutilized and orphan crops for future food and nutrition security. N. Phytol. 234, 1583–1597 (2022).
doi: 10.1111/nph.18021
Maass, B. L. et al. Lablab purpureus—a crop lost for Africa? Trop. Plant Biol. 3, 123–135 (2010).
pubmed: 20835399
pmcid: 2933844
doi: 10.1007/s12042-010-9046-1
Habib, H. M., Theuri, S. W., Kheadr, E. E. & Mohamed, F. E. Functional, bioactive, biochemical, and physicochemical properties of the Dolichos lablab bean. Food Funct. 8, 872–880 (2017).
pubmed: 27918057
doi: 10.1039/C6FO01162D
Weldeyesus, G. Forage productivity system evaluation through station screening and intercropping of lablab forage legume with maize under Irrigated lands of smallholder farmers. Afr. J. Agric. Res. 12, 1841–1847 (2017).
doi: 10.5897/AJAR2016.11989
Minde, J. J., Venkataramana, P. B. & Matemu, A. O. Dolichos Lablab-an underutilized crop with future potentials for food and nutrition security: a review. Crit. Rev. Food Sci. Nutr. 61, 2249–2261 (2021).
pubmed: 32530295
doi: 10.1080/10408398.2020.1775173
Liu, Y.-M. et al. A carbohydrate-binding protein from the edible Lablab beans effectively blocks the infections of influenza viruses and SARS-CoV-2. Cell Rep. 32, 108016 (2020).
pubmed: 32755598
pmcid: 7380208
doi: 10.1016/j.celrep.2020.108016
Missanga, J. S., Venkataramana, P. B. & Ndakidemi, P. A. Recent developments in Lablab purpureus genomics: a focus on drought stress tolerance and use of genomic resources to develop stress‐resilient varieties. Legume Sci. 3, e99 (2021).
Marks, R. A., Hotaling, S., Frandsen, P. B. & VanBuren, R. Representation and participation across 20 years of plant genome sequencing. Nat. Plants 7, 1571–1578 (2021).
pubmed: 34845350
pmcid: 8677620
doi: 10.1038/s41477-021-01031-8
Chang, Y. et al. The draft genomes of five agriculturally important African orphan crops. Gigascience 8, giy152 (2019).
pubmed: 30535374
doi: 10.1093/gigascience/giy152
Ebenezer, T. E. et al. Africa: sequence 100,000 species to safeguard biodiversity. Nature 603, 388–392 (2022).
pubmed: 35292740
doi: 10.1038/d41586-022-00712-4
Ghazal, H. et al. Plant genomics in Africa: present and prospects. Plant J. 107, 21–36 (2021).
pubmed: 33837593
doi: 10.1111/tpj.15272
Padmarasu, S., Himmelbach, A., Mascher, M. & Stein, N. In situ Hi-C for plants: an improved method to detect long-range chromatin interactions. In: Plant Long Non-Coding RNAs: Methods and Protocols (eds. Chekanova, J. A. & Wang, H.-L. V.) vol. 1933 441–472 (Springer New York, 2019).
Schmutz, J. et al. A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. 46, 707–713 (2014).
pubmed: 24908249
pmcid: 7048698
doi: 10.1038/ng.3008
Lonardi, S. et al. The genome of cowpea (Vigna unguiculata [L.] Walp.). Plant J. 98, 767–782 (2019).
pubmed: 31017340
pmcid: 6852540
doi: 10.1111/tpj.14349
Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126 (2018).
pubmed: 30107434
pmcid: 6265445
Rodríguez-Sifuentes, L., Marszalek, J. E., Chuck-Hernández, C. & Serna-Saldívar, S. O. Legumes protease inhibitors as biopesticides and their defense mechanisms against Biotic Factors. Int. J. Mol. Sci. 21, 3322 (2020).
pubmed: 32397104
pmcid: 7246880
doi: 10.3390/ijms21093322
Guretzki, S. & Papenbrock, J. Characterization of Lablab purpureus regarding drought tolerance, trypsin inhibitor activity and cyanogenic potential for selection in breeding programmes. J. Agron. Crop Sci. 200, 24–35 (2014).
doi: 10.1111/jac.12043
Maass, B. L., Jamnadass, R. H., Hanson, J. & Pengelly, B. C. Determining sources of diversity in cultivated and wild lablab purpureus related to provenance of germplasm by using amplified fragment length polymorphism. Genet. Resour. Crop Evol. 52, 683–695 (2005).
doi: 10.1007/s10722-003-6019-3
Maass, B. L., Robotham, O. & Chapman, M. A. Evidence for two domestication events of hyacinth bean (Lablab purpureus (L.) Sweet): a comparative analysis of population genetic data. Genet. Resour. Crop Evol. 64, 1221–1230 (2017).
doi: 10.1007/s10722-016-0431-y
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Nei, M. Genetic distance between populations. Am. Nat. 106, 283–292 (1972).
Pengelly, B. C. & Maass, B. L. Lablab purpureus (L.) Sweet–diversity, potential use and determination of a core collection of this multi-purpose tropical legume. Genet. Resour. Crop Evol. 48, 261–272 (2001).
doi: 10.1023/A:1011286111384
Linder, H. P. The evolution of African plant diversity. Front. Ecol. Evol. 2, 38 (2014).
Phytozome v13. http://phytozome.jgi.doe.gov .
Schmutz, J. et al. Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183 (2010).
pubmed: 20075913
doi: 10.1038/nature08670
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
pubmed: 17721507
doi: 10.1038/nature06148
Argout, X. et al. The genome of Theobroma cacao. Nat. Genet 43, 101–108 (2011).
pubmed: 21186351
doi: 10.1038/ng.736
Houston, K., Tucker, M. R., Chowdhury, J., Shirley, N. & Little, A. The plant cell wall: a complex and dynamic structure as revealed by the responses of genes under stress conditions. Front. Plant Sci. 7, 984 (2016).
pubmed: 27559336
pmcid: 4978735
doi: 10.3389/fpls.2016.00984
Wang, L. et al. A role for a cell wall localized glycine-rich protein in dehydration and rehydration of the resurrection plant Boea hygrometrica. Plant Biol. 11, 837–848 (2009).
pubmed: 19796361
doi: 10.1111/j.1438-8677.2008.00187.x
Xie, M. et al. A reference-grade wild soybean genome. Nat. Commun. 10, 1216 (2019).
pubmed: 30872580
pmcid: 6418295
doi: 10.1038/s41467-019-09142-9
Hu, G. et al. Two divergent haplotypes from a highly heterozygous lychee genome suggest independent domestication events for early and late-maturing cultivars. Nat. Genet 54, 73–83 (2022).
pubmed: 34980919
pmcid: 8755541
doi: 10.1038/s41588-021-00971-3
Zhang, K. et al. Resequencing of global Tartary buckwheat accessions reveals multiple domestication events and key loci associated with agronomic traits. Genome Biol. 22, 23 (2021).
pubmed: 33430931
pmcid: 7802136
doi: 10.1186/s13059-020-02217-7
Civáň, P., Craig, H., Cox, C. J. & Brown, T. A. Three geographically separate domestications of Asian rice. Nat. Plants 1, 15164 (2015).
pubmed: 27251535
pmcid: 4900444
doi: 10.1038/nplants.2015.164
Morrell, P. L. & Clegg, M. T. Genetic evidence for a second domestication of barley (Hordeum vulgare) east of the fertile crescent. Proc. Natl Acad. Sci. U.S.A. 104, 3289–3294 (2007).
pubmed: 17360640
pmcid: 1805597
doi: 10.1073/pnas.0611377104
Sultana, Ozaki & Okubo The use of RAPD markers in lablab bean (Lablab purpureus (L.) Sweet) phylogeny. Bull. Inst. Trop. Agric. Kyushu Univ. 23, 45–51 (2000). & Others.
Liu, C. J. Genetic diversity and relationships among Lablab purpureus genotypes evaluated using RAPD as markers. Euphytica 90, 115–119 (1996).
doi: 10.1007/BF00025167
Bohra, A., Jha, U. C., Kishor, P. B. K., Pandey, S. & Singh, N. P. Genomics and molecular breeding in lesser explored pulse crops: current trends and future opportunities. Biotechnol. Adv. 32, 1410–1428 (2014).
pubmed: 25196916
doi: 10.1016/j.biotechadv.2014.09.001
Jamnadass, R. et al. Enhancing African orphan crops with genomics. Nat. Genet. 52, 356–360 (2020).
pubmed: 32203464
doi: 10.1038/s41588-020-0601-x
Habte, E., Gari, A., Lire, H. & Jones, C. Field trial of Lablab (Lablab purpureus) genotypes under rain fed conditions in Ethiopia. https://cgspace.cgiar.org/bitstream/handle/10568/116652/lablab_report.pdf?sequence=2 (2021).
Di Tommaso, P. et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 35, 316–319 (2017).
pubmed: 28398311
doi: 10.1038/nbt.3820
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
pubmed: 30936562
doi: 10.1038/s41587-019-0072-8
Kundu, R., Casey, J. & Sung, W.-K. HyPo: Super fast & accurate polisher for long read genome assemblies. bioRxiv 2019.12.19.882506 https://doi.org/10.1101/2019.12.19.882506 (2019).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
pubmed: 23422339
pmcid: 3624806
doi: 10.1093/bioinformatics/btt086
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717
doi: 10.1093/bioinformatics/btv351
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
pubmed: 20080505
pmcid: 2828108
doi: 10.1093/bioinformatics/btp698
Faust, G. G. & Hall, I. M. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 30, 2503–2505 (2014).
pubmed: 24812344
pmcid: 4147885
doi: 10.1093/bioinformatics/btu314
Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).
pubmed: 29233850
doi: 10.1104/pp.17.01310
Li, H. et al. The sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943
pmcid: 2723002
doi: 10.1093/bioinformatics/btp352
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
pubmed: 27467250
pmcid: 5596920
doi: 10.1016/j.cels.2015.07.012
Bickhart, D. M. et al. Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome. Nat. Genet. 49, 643–650 (2017).
pubmed: 28263316
pmcid: 5909822
doi: 10.1038/ng.3802
Burton, J. N. et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 31, 1119–1125 (2013).
pubmed: 24185095
pmcid: 4117202
doi: 10.1038/nbt.2727
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).
doi: 10.1186/1471-2105-10-421
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).
pubmed: 22217600
pmcid: 3326336
doi: 10.1093/nar/gkr1293
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
pubmed: 21572440
pmcid: 3571712
doi: 10.1038/nbt.1883
Palmer, J. & Stajich, J. nextgenusfs/funannotate: funannotate v1.5.3. https://doi.org/10.5281/zenodo.2604804 (2019).
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
pubmed: 14500829
pmcid: 206470
doi: 10.1093/nar/gkg770
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 32, W309–W312 (2004).
pubmed: 15215400
pmcid: 441517
doi: 10.1093/nar/gkh379
Korf, I. Gene finding in novel genomes. BMC Bioinform. 5, 59 (2004).
doi: 10.1186/1471-2105-5-59
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
pubmed: 15145805
doi: 10.1093/bioinformatics/bth315
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).
pubmed: 18190707
pmcid: 2395244
doi: 10.1186/gb-2008-9-1-r7
Chan, P. P. & Lowe, T. M. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol. Biol. 1962, 1–14 (2019).
pubmed: 31020551
pmcid: 6768409
doi: 10.1007/978-1-4939-9173-0_1
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
pubmed: 31375807
pmcid: 7605509
doi: 10.1038/s41587-019-0201-4
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
pubmed: 25690850
pmcid: 4643835
doi: 10.1038/nbt.3122
Gremme, G., Brendel, V., Sparks, M. E. & Kurtz, S. Engineering a software tool for gene structure prediction in higher organisms. Inform. Softw. Technol. 47, 965–978 (2005).
doi: 10.1016/j.infsof.2005.09.005
Haas, Papanicolaou & Others. TransDecoder (find coding regions within transcripts). http://transdecoder.github.io .
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37 (2011).
pubmed: 21593126
pmcid: 3125773
doi: 10.1093/nar/gkr367
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
pubmed: 20110278
pmcid: 2832824
doi: 10.1093/bioinformatics/btq033
Gremme, G., Steinbiss, S. & Kurtz, S. GenomeTools: a comprehensive software library for efficient processing of structured genome annotations. IEEE/ACM Trans. Comput. Biol. Bioinform. 10, 645–656 (2013).
pubmed: 24091398
doi: 10.1109/TCBB.2013.68
Mistry, J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
pubmed: 33125078
doi: 10.1093/nar/gkaa913
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).
pubmed: 33156333
doi: 10.1093/nar/gkaa977
Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 49, D325–D334 (2021).
doi: 10.1093/nar/gkaa1113
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Ou, S. et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 20, 275 (2019).
pubmed: 31843001
pmcid: 6913007
doi: 10.1186/s13059-019-1905-y
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 9, 18 (2008).
doi: 10.1186/1471-2105-9-18
Ou, S. & Jiang, N. LTR_FINDER_parallel: parallelization of LTR_FINDER enabling rapid identification of long terminal repeat retrotransposons. Mob. DNA 10, 48 (2019).
pubmed: 31857828
pmcid: 6909508
doi: 10.1186/s13100-019-0193-0
Su, W., Gu, X. & Peterson, T. TIR-Learner, a new ensemble method for tir transposable element annotation, provides evidence for abundant new transposable elements in the maize genome. Mol. Plant 12, 447–460 (2019).
pubmed: 30802553
doi: 10.1016/j.molp.2019.02.008
Xiong, W., He, L., Lai, J., Dooner, H. K. & Du, C. HelitronScanner uncovers a large overlooked cache of Helitron transposons in many plant genomes. Proc. Natl Acad. Sci. U.S.A.111, 10263–10268 (2014).
pubmed: 24982153
pmcid: 4104883
doi: 10.1073/pnas.1410068111
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl Acad. Sci. U.S.A. 117, 9451–9457 (2020).
pubmed: 32300014
pmcid: 7196820
doi: 10.1073/pnas.1921046117
Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open-4.0. http://www.repeatmasker.org (2013−2015).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/ (2019).
Wickham, H. et al. Welcome to the tidyverse. J. Open Source Softw. 4, 1686 (2019).
doi: 10.21105/joss.01686
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
pubmed: 9862982
pmcid: 148217
doi: 10.1093/nar/27.2.573
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
pubmed: 31727128
pmcid: 6857279
doi: 10.1186/s13059-019-1832-y
Mendes, F. K., Vanderpool, D., Fulton, B. & Hahn, M. W. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics https://doi.org/10.1093/bioinformatics/btaa1022 (2020).
De Bie, T., Cristianini, N., Demuth, J. P. & Hahn, M. W. CAFE: a computational tool for the study of gene family evolution. Bioinformatics 22, 1269–1271 (2006).
pubmed: 16543274
doi: 10.1093/bioinformatics/btl097
Alexa & Rahnenfuhrer. topGO: enrichment analysis for gene ontology. R package version (2010).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
pubmed: 24451626
pmcid: 3998142
doi: 10.1093/bioinformatics/btu031
Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 5, 113 (2004).
doi: 10.1186/1471-2105-5-113
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One 5, e9490 (2010).
pubmed: 20224823
pmcid: 2835736
doi: 10.1371/journal.pone.0009490
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
pubmed: 33885785
pmcid: 8265157
doi: 10.1093/nar/gkab301
Hao, Z. et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. 6, e251 (2020).
pubmed: 33816903
pmcid: 7924719
doi: 10.7717/peerj-cs.251
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
pubmed: 30423086
pmcid: 6129281
doi: 10.1093/bioinformatics/bty560
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
pubmed: 27043002
doi: 10.1038/nbt.3519
Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res 4, 1521 (2015).
pubmed: 26925227
doi: 10.12688/f1000research.7563.1
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
pubmed: 25516281
pmcid: 4302049
doi: 10.1186/s13059-014-0550-8
Kolde. Pheatmap: pretty heatmaps. R package version.
Doyle, J. J. Isolation of plant DNA from fresh tissue. Focus 12, 13–15 (1990).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
pubmed: 24695404
pmcid: 4103590
doi: 10.1093/bioinformatics/btu170
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286
pmcid: 3322381
doi: 10.1038/nmeth.1923
Picard toolkit. Broad Institute, GitHub repository https://broadinstitute.github.io/picard/ (2019).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Linck, E. & Battey, C. J. Minor allele frequency thresholds strongly affect population structure inference with genomic data sets. Mol. Ecol. Resour. 19, 639–647 (2019).
pubmed: 30659755
doi: 10.1111/1755-0998.12995
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
pubmed: 15969739
doi: 10.1111/j.1365-294X.2005.02553.x
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
doi: 10.1007/s12686-011-9548-7
Kilian, A. et al. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol. Biol. 888, 67–89 (2012).
pubmed: 22665276
doi: 10.1007/978-1-61779-870-2_5
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
pubmed: 17701901
pmcid: 1950838
doi: 10.1086/519795
Kamvar, Z. N., Tabima, J. F. & Grünwald, N. J. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2, e281 (2014).
pubmed: 24688859
pmcid: 3961149
doi: 10.7717/peerj.281
Jombart, T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).
pubmed: 18397895
doi: 10.1093/bioinformatics/btn129
Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290 (2004).
pubmed: 14734327
doi: 10.1093/bioinformatics/btg412
Wiedow, C. Morphological and Agronomical Characterization of Lablab Purpureus in Ethiopia. (Diploma thesis (Dipl.-Ing. agr.), University of Rostock, 2001).
Wang, J. & Zhang, Z. GAPIT Version 3: boosting power and accuracy for genomic association and prediction. Genom. Proteom. Bioinform. 19, 629–640 (2021).
doi: 10.1016/j.gpb.2021.08.005
Arend, D. et al. e!DAL–a framework to store, share and publish research data. BMC Bioinform. 15, 214 (2014).
doi: 10.1186/1471-2105-15-214