Chromosome-level genome assembly and population genomic resource to accelerate orphan crop lablab breeding.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
17 04 2023
Historique:
received: 21 05 2022
accepted: 17 03 2023
medline: 19 4 2023
entrez: 17 4 2023
pubmed: 18 4 2023
Statut: epublish

Résumé

Under-utilised orphan crops hold the key to diversified and climate-resilient food systems. Here, we report on orphan crop genomics using the case of Lablab purpureus (L.) Sweet (lablab) - a legume native to Africa and cultivated throughout the tropics for food and forage. Our Africa-led plant genome collaboration produces a high-quality chromosome-scale assembly of the lablab genome. Our assembly highlights the genome organisation of the trypsin inhibitor genes - an important anti-nutritional factor in lablab. We also re-sequence cultivated and wild lablab accessions from Africa confirming two domestication events. Finally, we examine the genetic and phenotypic diversity in a comprehensive lablab germplasm collection and identify genomic loci underlying variation of important agronomic traits in lablab. The genomic data generated here provide a valuable resource for lablab improvement. Our inclusive collaborative approach also presents an example that can be explored by other researchers sequencing indigenous crops, particularly from low and middle-income countries (LMIC).

Identifiants

pubmed: 37069152
doi: 10.1038/s41467-023-37489-7
pii: 10.1038/s41467-023-37489-7
pmc: PMC10110558
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1915

Informations de copyright

© 2023. The Author(s).

Références

FAO. Faostat: FAO Statistical Databases. (Food & Agriculture Organization of the United Nations (FAO), 2000).
The war in Ukraine is exposing gaps in the world’s food-systems research. Nature 604, 217–218 (2022).
Chapman, M. A., He, Y. & Zhou, M. Beyond a reference genome: pangenomes and population genomics of underutilized and orphan crops for future food and nutrition security. N. Phytol. 234, 1583–1597 (2022).
doi: 10.1111/nph.18021
Maass, B. L. et al. Lablab purpureus—a crop lost for Africa? Trop. Plant Biol. 3, 123–135 (2010).
pubmed: 20835399 pmcid: 2933844 doi: 10.1007/s12042-010-9046-1
Habib, H. M., Theuri, S. W., Kheadr, E. E. & Mohamed, F. E. Functional, bioactive, biochemical, and physicochemical properties of the Dolichos lablab bean. Food Funct. 8, 872–880 (2017).
pubmed: 27918057 doi: 10.1039/C6FO01162D
Weldeyesus, G. Forage productivity system evaluation through station screening and intercropping of lablab forage legume with maize under Irrigated lands of smallholder farmers. Afr. J. Agric. Res. 12, 1841–1847 (2017).
doi: 10.5897/AJAR2016.11989
Minde, J. J., Venkataramana, P. B. & Matemu, A. O. Dolichos Lablab-an underutilized crop with future potentials for food and nutrition security: a review. Crit. Rev. Food Sci. Nutr. 61, 2249–2261 (2021).
pubmed: 32530295 doi: 10.1080/10408398.2020.1775173
Liu, Y.-M. et al. A carbohydrate-binding protein from the edible Lablab beans effectively blocks the infections of influenza viruses and SARS-CoV-2. Cell Rep. 32, 108016 (2020).
pubmed: 32755598 pmcid: 7380208 doi: 10.1016/j.celrep.2020.108016
Missanga, J. S., Venkataramana, P. B. & Ndakidemi, P. A. Recent developments in Lablab purpureus genomics: a focus on drought stress tolerance and use of genomic resources to develop stress‐resilient varieties. Legume Sci. 3, e99 (2021).
Marks, R. A., Hotaling, S., Frandsen, P. B. & VanBuren, R. Representation and participation across 20 years of plant genome sequencing. Nat. Plants 7, 1571–1578 (2021).
pubmed: 34845350 pmcid: 8677620 doi: 10.1038/s41477-021-01031-8
Chang, Y. et al. The draft genomes of five agriculturally important African orphan crops. Gigascience 8, giy152 (2019).
pubmed: 30535374 doi: 10.1093/gigascience/giy152
Ebenezer, T. E. et al. Africa: sequence 100,000 species to safeguard biodiversity. Nature 603, 388–392 (2022).
pubmed: 35292740 doi: 10.1038/d41586-022-00712-4
Ghazal, H. et al. Plant genomics in Africa: present and prospects. Plant J. 107, 21–36 (2021).
pubmed: 33837593 doi: 10.1111/tpj.15272
Padmarasu, S., Himmelbach, A., Mascher, M. & Stein, N. In situ Hi-C for plants: an improved method to detect long-range chromatin interactions. In: Plant Long Non-Coding RNAs: Methods and Protocols (eds. Chekanova, J. A. & Wang, H.-L. V.) vol. 1933 441–472 (Springer New York, 2019).
Schmutz, J. et al. A reference genome for common bean and genome-wide analysis of dual domestications. Nat. Genet. 46, 707–713 (2014).
pubmed: 24908249 pmcid: 7048698 doi: 10.1038/ng.3008
Lonardi, S. et al. The genome of cowpea (Vigna unguiculata [L.] Walp.). Plant J. 98, 767–782 (2019).
pubmed: 31017340 pmcid: 6852540 doi: 10.1111/tpj.14349
Ou, S., Chen, J. & Jiang, N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 46, e126 (2018).
pubmed: 30107434 pmcid: 6265445
Rodríguez-Sifuentes, L., Marszalek, J. E., Chuck-Hernández, C. & Serna-Saldívar, S. O. Legumes protease inhibitors as biopesticides and their defense mechanisms against Biotic Factors. Int. J. Mol. Sci. 21, 3322 (2020).
pubmed: 32397104 pmcid: 7246880 doi: 10.3390/ijms21093322
Guretzki, S. & Papenbrock, J. Characterization of Lablab purpureus regarding drought tolerance, trypsin inhibitor activity and cyanogenic potential for selection in breeding programmes. J. Agron. Crop Sci. 200, 24–35 (2014).
doi: 10.1111/jac.12043
Maass, B. L., Jamnadass, R. H., Hanson, J. & Pengelly, B. C. Determining sources of diversity in cultivated and wild lablab purpureus related to provenance of germplasm by using amplified fragment length polymorphism. Genet. Resour. Crop Evol. 52, 683–695 (2005).
doi: 10.1007/s10722-003-6019-3
Maass, B. L., Robotham, O. & Chapman, M. A. Evidence for two domestication events of hyacinth bean (Lablab purpureus (L.) Sweet): a comparative analysis of population genetic data. Genet. Resour. Crop Evol. 64, 1221–1230 (2017).
doi: 10.1007/s10722-016-0431-y
Pritchard, J. K., Stephens, M. & Donnelly, P. Inference of population structure using multilocus genotype data. Genetics 155, 945–959 (2000).
Nei, M. Genetic distance between populations. Am. Nat. 106, 283–292 (1972).
Pengelly, B. C. & Maass, B. L. Lablab purpureus (L.) Sweet–diversity, potential use and determination of a core collection of this multi-purpose tropical legume. Genet. Resour. Crop Evol. 48, 261–272 (2001).
doi: 10.1023/A:1011286111384
Linder, H. P. The evolution of African plant diversity. Front. Ecol. Evol. 2, 38 (2014).
Phytozome v13. http://phytozome.jgi.doe.gov .
Schmutz, J. et al. Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183 (2010).
pubmed: 20075913 doi: 10.1038/nature08670
Jaillon, O. et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature 449, 463–467 (2007).
pubmed: 17721507 doi: 10.1038/nature06148
Argout, X. et al. The genome of Theobroma cacao. Nat. Genet 43, 101–108 (2011).
pubmed: 21186351 doi: 10.1038/ng.736
Houston, K., Tucker, M. R., Chowdhury, J., Shirley, N. & Little, A. The plant cell wall: a complex and dynamic structure as revealed by the responses of genes under stress conditions. Front. Plant Sci. 7, 984 (2016).
pubmed: 27559336 pmcid: 4978735 doi: 10.3389/fpls.2016.00984
Wang, L. et al. A role for a cell wall localized glycine-rich protein in dehydration and rehydration of the resurrection plant Boea hygrometrica. Plant Biol. 11, 837–848 (2009).
pubmed: 19796361 doi: 10.1111/j.1438-8677.2008.00187.x
Xie, M. et al. A reference-grade wild soybean genome. Nat. Commun. 10, 1216 (2019).
pubmed: 30872580 pmcid: 6418295 doi: 10.1038/s41467-019-09142-9
Hu, G. et al. Two divergent haplotypes from a highly heterozygous lychee genome suggest independent domestication events for early and late-maturing cultivars. Nat. Genet 54, 73–83 (2022).
pubmed: 34980919 pmcid: 8755541 doi: 10.1038/s41588-021-00971-3
Zhang, K. et al. Resequencing of global Tartary buckwheat accessions reveals multiple domestication events and key loci associated with agronomic traits. Genome Biol. 22, 23 (2021).
pubmed: 33430931 pmcid: 7802136 doi: 10.1186/s13059-020-02217-7
Civáň, P., Craig, H., Cox, C. J. & Brown, T. A. Three geographically separate domestications of Asian rice. Nat. Plants 1, 15164 (2015).
pubmed: 27251535 pmcid: 4900444 doi: 10.1038/nplants.2015.164
Morrell, P. L. & Clegg, M. T. Genetic evidence for a second domestication of barley (Hordeum vulgare) east of the fertile crescent. Proc. Natl Acad. Sci. U.S.A. 104, 3289–3294 (2007).
pubmed: 17360640 pmcid: 1805597 doi: 10.1073/pnas.0611377104
Sultana, Ozaki & Okubo The use of RAPD markers in lablab bean (Lablab purpureus (L.) Sweet) phylogeny. Bull. Inst. Trop. Agric. Kyushu Univ. 23, 45–51 (2000). & Others.
Liu, C. J. Genetic diversity and relationships among Lablab purpureus genotypes evaluated using RAPD as markers. Euphytica 90, 115–119 (1996).
doi: 10.1007/BF00025167
Bohra, A., Jha, U. C., Kishor, P. B. K., Pandey, S. & Singh, N. P. Genomics and molecular breeding in lesser explored pulse crops: current trends and future opportunities. Biotechnol. Adv. 32, 1410–1428 (2014).
pubmed: 25196916 doi: 10.1016/j.biotechadv.2014.09.001
Jamnadass, R. et al. Enhancing African orphan crops with genomics. Nat. Genet. 52, 356–360 (2020).
pubmed: 32203464 doi: 10.1038/s41588-020-0601-x
Habte, E., Gari, A., Lire, H. & Jones, C. Field trial of Lablab (Lablab purpureus) genotypes under rain fed conditions in Ethiopia. https://cgspace.cgiar.org/bitstream/handle/10568/116652/lablab_report.pdf?sequence=2 (2021).
Di Tommaso, P. et al. Nextflow enables reproducible computational workflows. Nat. Biotechnol. 35, 316–319 (2017).
pubmed: 28398311 doi: 10.1038/nbt.3820
Kolmogorov, M., Yuan, J., Lin, Y. & Pevzner, P. A. Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol. 37, 540–546 (2019).
pubmed: 30936562 doi: 10.1038/s41587-019-0072-8
Kundu, R., Casey, J. & Sung, W.-K. HyPo: Super fast & accurate polisher for long read genome assemblies. bioRxiv 2019.12.19.882506 https://doi.org/10.1101/2019.12.19.882506 (2019).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
pubmed: 23422339 pmcid: 3624806 doi: 10.1093/bioinformatics/btt086
Simão, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212 (2015).
pubmed: 26059717 doi: 10.1093/bioinformatics/btv351
Li, H. & Durbin, R. Fast and accurate long-read alignment with Burrows-Wheeler transform. Bioinformatics 26, 589–595 (2010).
pubmed: 20080505 pmcid: 2828108 doi: 10.1093/bioinformatics/btp698
Faust, G. G. & Hall, I. M. SAMBLASTER: fast duplicate marking and structural variant read extraction. Bioinformatics 30, 2503–2505 (2014).
pubmed: 24812344 pmcid: 4147885 doi: 10.1093/bioinformatics/btu314
Ou, S. & Jiang, N. LTR_retriever: a highly accurate and sensitive program for identification of long terminal repeat retrotransposons. Plant Physiol. 176, 1410–1422 (2018).
pubmed: 29233850 doi: 10.1104/pp.17.01310
Li, H. et al. The sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352
Durand, N. C. et al. Juicebox provides a visualization system for Hi-C contact maps with unlimited zoom. Cell Syst. 3, 99–101 (2016).
pubmed: 27467250 pmcid: 5596920 doi: 10.1016/j.cels.2015.07.012
Bickhart, D. M. et al. Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome. Nat. Genet. 49, 643–650 (2017).
pubmed: 28263316 pmcid: 5909822 doi: 10.1038/ng.3802
Burton, J. N. et al. Chromosome-scale scaffolding of de novo genome assemblies based on chromatin interactions. Nat. Biotechnol. 31, 1119–1125 (2013).
pubmed: 24185095 pmcid: 4117202 doi: 10.1038/nbt.2727
Camacho, C. et al. BLAST+: architecture and applications. BMC Bioinform. 10, 421 (2009).
doi: 10.1186/1471-2105-10-421
Wang, Y. et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 40, e49 (2012).
pubmed: 22217600 pmcid: 3326336 doi: 10.1093/nar/gkr1293
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
pubmed: 21572440 pmcid: 3571712 doi: 10.1038/nbt.1883
Palmer, J. & Stajich, J. nextgenusfs/funannotate: funannotate v1.5.3. https://doi.org/10.5281/zenodo.2604804 (2019).
Haas, B. J. et al. Improving the Arabidopsis genome annotation using maximal transcript alignment assemblies. Nucleic Acids Res. 31, 5654–5666 (2003).
pubmed: 14500829 pmcid: 206470 doi: 10.1093/nar/gkg770
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Res. 32, W309–W312 (2004).
pubmed: 15215400 pmcid: 441517 doi: 10.1093/nar/gkh379
Korf, I. Gene finding in novel genomes. BMC Bioinform. 5, 59 (2004).
doi: 10.1186/1471-2105-5-59
Majoros, W. H., Pertea, M. & Salzberg, S. L. TigrScan and GlimmerHMM: two open source ab initio eukaryotic gene-finders. Bioinformatics 20, 2878–2879 (2004).
pubmed: 15145805 doi: 10.1093/bioinformatics/bth315
Haas, B. J. et al. Automated eukaryotic gene structure annotation using EVidenceModeler and the Program to Assemble Spliced Alignments. Genome Biol. 9, R7 (2008).
pubmed: 18190707 pmcid: 2395244 doi: 10.1186/gb-2008-9-1-r7
Chan, P. P. & Lowe, T. M. tRNAscan-SE: searching for tRNA genes in genomic sequences. Methods Mol. Biol. 1962, 1–14 (2019).
pubmed: 31020551 pmcid: 6768409 doi: 10.1007/978-1-4939-9173-0_1
Kim, D., Paggi, J. M., Park, C., Bennett, C. & Salzberg, S. L. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat. Biotechnol. 37, 907–915 (2019).
pubmed: 31375807 pmcid: 7605509 doi: 10.1038/s41587-019-0201-4
Pertea, M. et al. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295 (2015).
pubmed: 25690850 pmcid: 4643835 doi: 10.1038/nbt.3122
Gremme, G., Brendel, V., Sparks, M. E. & Kurtz, S. Engineering a software tool for gene structure prediction in higher organisms. Inform. Softw. Technol. 47, 965–978 (2005).
doi: 10.1016/j.infsof.2005.09.005
Haas, Papanicolaou & Others. TransDecoder (find coding regions within transcripts). http://transdecoder.github.io .
Finn, R. D., Clements, J. & Eddy, S. R. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 39, W29–W37 (2011).
pubmed: 21593126 pmcid: 3125773 doi: 10.1093/nar/gkr367
Quinlan, A. R. & Hall, I. M. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 26, 841–842 (2010).
pubmed: 20110278 pmcid: 2832824 doi: 10.1093/bioinformatics/btq033
Gremme, G., Steinbiss, S. & Kurtz, S. GenomeTools: a comprehensive software library for efficient processing of structured genome annotations. IEEE/ACM Trans. Comput. Biol. Bioinform. 10, 645–656 (2013).
pubmed: 24091398 doi: 10.1109/TCBB.2013.68
Mistry, J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
pubmed: 33125078 doi: 10.1093/nar/gkaa913
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).
pubmed: 33156333 doi: 10.1093/nar/gkaa977
Gene Ontology Consortium. The Gene Ontology resource: enriching a GOld mine. Nucleic Acids Res. 49, D325–D334 (2021).
doi: 10.1093/nar/gkaa1113
Ashburner, M. et al. Gene Ontology: tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).
Ou, S. et al. Benchmarking transposable element annotation methods for creation of a streamlined, comprehensive pipeline. Genome Biol. 20, 275 (2019).
pubmed: 31843001 pmcid: 6913007 doi: 10.1186/s13059-019-1905-y
Ellinghaus, D., Kurtz, S. & Willhoeft, U. LTRharvest, an efficient and flexible software for de novo detection of LTR retrotransposons. BMC Bioinform. 9, 18 (2008).
doi: 10.1186/1471-2105-9-18
Ou, S. & Jiang, N. LTR_FINDER_parallel: parallelization of LTR_FINDER enabling rapid identification of long terminal repeat retrotransposons. Mob. DNA 10, 48 (2019).
pubmed: 31857828 pmcid: 6909508 doi: 10.1186/s13100-019-0193-0
Su, W., Gu, X. & Peterson, T. TIR-Learner, a new ensemble method for tir transposable element annotation, provides evidence for abundant new transposable elements in the maize genome. Mol. Plant 12, 447–460 (2019).
pubmed: 30802553 doi: 10.1016/j.molp.2019.02.008
Xiong, W., He, L., Lai, J., Dooner, H. K. & Du, C. HelitronScanner uncovers a large overlooked cache of Helitron transposons in many plant genomes. Proc. Natl Acad. Sci. U.S.A.111, 10263–10268 (2014).
pubmed: 24982153 pmcid: 4104883 doi: 10.1073/pnas.1410068111
Flynn, J. M. et al. RepeatModeler2 for automated genomic discovery of transposable element families. Proc. Natl Acad. Sci. U.S.A. 117, 9451–9457 (2020).
pubmed: 32300014 pmcid: 7196820 doi: 10.1073/pnas.1921046117
Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open-4.0. http://www.repeatmasker.org (2013−2015).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/ (2019).
Wickham, H. et al. Welcome to the tidyverse. J. Open Source Softw. 4, 1686 (2019).
doi: 10.21105/joss.01686
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
pubmed: 9862982 pmcid: 148217 doi: 10.1093/nar/27.2.573
Emms, D. M. & Kelly, S. OrthoFinder: phylogenetic orthology inference for comparative genomics. Genome Biol. 20, 238 (2019).
pubmed: 31727128 pmcid: 6857279 doi: 10.1186/s13059-019-1832-y
Mendes, F. K., Vanderpool, D., Fulton, B. & Hahn, M. W. CAFE 5 models variation in evolutionary rates among gene families. Bioinformatics https://doi.org/10.1093/bioinformatics/btaa1022 (2020).
De Bie, T., Cristianini, N., Demuth, J. P. & Hahn, M. W. CAFE: a computational tool for the study of gene family evolution. Bioinformatics 22, 1269–1271 (2006).
pubmed: 16543274 doi: 10.1093/bioinformatics/btl097
Alexa & Rahnenfuhrer. topGO: enrichment analysis for gene ontology. R package version (2010).
Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
pubmed: 24451626 pmcid: 3998142 doi: 10.1093/bioinformatics/btu031
Edgar, R. C. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinform. 5, 113 (2004).
doi: 10.1186/1471-2105-5-113
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2–approximately maximum-likelihood trees for large alignments. PLoS One 5, e9490 (2010).
pubmed: 20224823 pmcid: 2835736 doi: 10.1371/journal.pone.0009490
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
pubmed: 33885785 pmcid: 8265157 doi: 10.1093/nar/gkab301
Hao, Z. et al. RIdeogram: drawing SVG graphics to visualize and map genome-wide data on the idiograms. PeerJ Comput. Sci. 6, e251 (2020).
pubmed: 33816903 pmcid: 7924719 doi: 10.7717/peerj-cs.251
Chen, S., Zhou, Y., Chen, Y. & Gu, J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 34, i884–i890 (2018).
pubmed: 30423086 pmcid: 6129281 doi: 10.1093/bioinformatics/bty560
Bray, N. L., Pimentel, H., Melsted, P. & Pachter, L. Near-optimal probabilistic RNA-seq quantification. Nat. Biotechnol. 34, 525–527 (2016).
pubmed: 27043002 doi: 10.1038/nbt.3519
Soneson, C., Love, M. I. & Robinson, M. D. Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences. F1000Res 4, 1521 (2015).
pubmed: 26925227 doi: 10.12688/f1000research.7563.1
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
pubmed: 25516281 pmcid: 4302049 doi: 10.1186/s13059-014-0550-8
Kolde. Pheatmap: pretty heatmaps. R package version.
Doyle, J. J. Isolation of plant DNA from fresh tissue. Focus 12, 13–15 (1990).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
pubmed: 24695404 pmcid: 4103590 doi: 10.1093/bioinformatics/btu170
Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nat. Methods 9, 357–359 (2012).
pubmed: 22388286 pmcid: 3322381 doi: 10.1038/nmeth.1923
Picard toolkit. Broad Institute, GitHub repository https://broadinstitute.github.io/picard/ (2019).
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Linck, E. & Battey, C. J. Minor allele frequency thresholds strongly affect population structure inference with genomic data sets. Mol. Ecol. Resour. 19, 639–647 (2019).
pubmed: 30659755 doi: 10.1111/1755-0998.12995
Evanno, G., Regnaut, S. & Goudet, J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol. Ecol. 14, 2611–2620 (2005).
pubmed: 15969739 doi: 10.1111/j.1365-294X.2005.02553.x
Earl, D. A. & vonHoldt, B. M. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv. Genet. Resour. 4, 359–361 (2012).
doi: 10.1007/s12686-011-9548-7
Kilian, A. et al. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol. Biol. 888, 67–89 (2012).
pubmed: 22665276 doi: 10.1007/978-1-61779-870-2_5
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
pubmed: 17701901 pmcid: 1950838 doi: 10.1086/519795
Kamvar, Z. N., Tabima, J. F. & Grünwald, N. J. Poppr: an R package for genetic analysis of populations with clonal, partially clonal, and/or sexual reproduction. PeerJ 2, e281 (2014).
pubmed: 24688859 pmcid: 3961149 doi: 10.7717/peerj.281
Jombart, T. adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics 24, 1403–1405 (2008).
pubmed: 18397895 doi: 10.1093/bioinformatics/btn129
Paradis, E., Claude, J. & Strimmer, K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics 20, 289–290 (2004).
pubmed: 14734327 doi: 10.1093/bioinformatics/btg412
Wiedow, C. Morphological and Agronomical Characterization of Lablab Purpureus in Ethiopia. (Diploma thesis (Dipl.-Ing. agr.), University of Rostock, 2001).
Wang, J. & Zhang, Z. GAPIT Version 3: boosting power and accuracy for genomic association and prediction. Genom. Proteom. Bioinform. 19, 629–640 (2021).
doi: 10.1016/j.gpb.2021.08.005
Arend, D. et al. e!DAL–a framework to store, share and publish research data. BMC Bioinform. 15, 214 (2014).
doi: 10.1186/1471-2105-15-214

Auteurs

Isaac Njaci (I)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya.
John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK.

Bernice Waweru (B)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya.

Nadia Kamal (N)

Helmholtz Zentrum München, Plant Genome and Systems Biology, Ingolstädter Landstr. 1, 85764, Neuherberg, Germany.

Meki Shehabu Muktar (MS)

International Livestock Research Institute, Addis Ababa, Ethiopia.

David Fisher (D)

University of Southampton, School of Biological Sciences, Southampton, SO17 1BJ, UK.

Heidrun Gundlach (H)

Helmholtz Zentrum München, Plant Genome and Systems Biology, Ingolstädter Landstr. 1, 85764, Neuherberg, Germany.

Collins Muli (C)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya.

Lucy Muthui (L)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya.

Mary Maranga (M)

Department of Biochemistry, Jomo Kenyatta University of Agriculture and Technology, Nairobi, 00200, Kenya.

Davies Kiambi (D)

Bioscience Research Centre (PUBReC), Pwani University, P.O Box 195-80108, Kilifi, Kenya.

Brigitte L Maass (BL)

Department of Crop Sciences, Georg-August-University Göttingen, Grisebachstr 6, 37077, Göttingen, Germany.

Peter M F Emmrich (PMF)

John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK.
Department for International Development, University of East Anglia, Norwich, NR4 7TJ, UK.

Jean-Baka Domelevo Entfellner (JB)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya.

Manuel Spannagl (M)

Helmholtz Zentrum München, Plant Genome and Systems Biology, Ingolstädter Landstr. 1, 85764, Neuherberg, Germany.

Mark A Chapman (MA)

University of Southampton, School of Biological Sciences, Southampton, SO17 1BJ, UK. m.chapman@soton.ac.uk.

Oluwaseyi Shorinola (O)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya. shorinolao@gmail.com.
John Innes Centre, Norwich Research Park, Norwich, NR4 7UH, UK. shorinolao@gmail.com.

Chris S Jones (CS)

International Livestock Research Institute, PO Box 30709-00100, Nairobi, Kenya. c.jones@cgiar.org.

Articles similaires

Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Genome Size Genome, Plant Magnoliopsida Evolution, Molecular Arabidopsis
Genome, Bacterial Virulence Phylogeny Genomics Plant Diseases
Zea mays Triticum China Seasons Crops, Agricultural

Classifications MeSH