HASCH - A high-throughput amplicon-based SNP-platform for medicinal cannabis and industrial hemp genotyping applications.
Cannabis sativa
Genetic fingerprints
Genotyping
Integer linear programming
Plant breeding
Quantitative trait loci
SNP
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
29 Aug 2024
29 Aug 2024
Historique:
received:
20
02
2024
accepted:
22
08
2024
medline:
31
8
2024
pubmed:
31
8
2024
entrez:
29
8
2024
Statut:
epublish
Résumé
Cannabis sativa is seeing a global resurgence as a food, fiber and medicinal crop for industrial hemp and medicinal Cannabis industries respectively. However, a widespread moratorium on the use and research of C. sativa throughout most of the 20th century has seen the development of improved cultivars for specific end uses lag behind that of conventional crops. While C. sativa research and development has seen significant investments in the recent past, resulting in a suite of publicly available genomic resources and tools, a versatile and cost-effective mid-density genotyping platform for applied purposes in breeding and pre-breeding is lacking. Here we report on a first mid-density fixed-target SNP platform for C. sativa. The High-throughput Amplicon-based SNP-platform for medicinal Cannabis and industrial Hemp (HASCH) was designed using a combination of filtering and Integer Linear Programming on publicly available whole-genome sequencing and RNA sequencing data, supplemented with in-house generated genotyping-by-sequencing (GBS) data. HASCH contains 1,504 genome-wide targets of high call rate (97% mean) and even distribution across the genome, designed to be highly informative (> 0.3 minor allele frequency) across both medicinal cannabis and industrial hemp gene pools. Average numbers of mismatch SNP between any two accessions were 251 for medicinal cannabis (N = 116) and 272 for industrial hemp (N = 87). Comparing HASCH data with corresponding GBS data on a collection of diverse C. sativa accessions demonstrated high concordance and resulted in comparable phylogenies and genetic distance matrices. Using HASCH on a segregating F2 population derived from a cross between a tetrahydrocannabinol (THC)-dominant and a cannabidiol (CBD)-dominant accession resulted in a genetic map consisting of 310 markers, comprising 10 linkage groups and a total size of 582.7 cM. Quantitative Trait Locus (QTL) mapping identified a major QTL for CBD content on chromosome 7, consistent with previous findings. HASCH constitutes a versatile, easy to use and cost-effective genotyping solution for the rapidly growing Cannabis research community. It provides consistent genetic fingerprints of 1504 SNPs with wide applicability genetic resource management, quantitative genetics and breeding.
Sections du résumé
BACKGROUND
BACKGROUND
Cannabis sativa is seeing a global resurgence as a food, fiber and medicinal crop for industrial hemp and medicinal Cannabis industries respectively. However, a widespread moratorium on the use and research of C. sativa throughout most of the 20th century has seen the development of improved cultivars for specific end uses lag behind that of conventional crops. While C. sativa research and development has seen significant investments in the recent past, resulting in a suite of publicly available genomic resources and tools, a versatile and cost-effective mid-density genotyping platform for applied purposes in breeding and pre-breeding is lacking. Here we report on a first mid-density fixed-target SNP platform for C. sativa.
RESULTS
RESULTS
The High-throughput Amplicon-based SNP-platform for medicinal Cannabis and industrial Hemp (HASCH) was designed using a combination of filtering and Integer Linear Programming on publicly available whole-genome sequencing and RNA sequencing data, supplemented with in-house generated genotyping-by-sequencing (GBS) data. HASCH contains 1,504 genome-wide targets of high call rate (97% mean) and even distribution across the genome, designed to be highly informative (> 0.3 minor allele frequency) across both medicinal cannabis and industrial hemp gene pools. Average numbers of mismatch SNP between any two accessions were 251 for medicinal cannabis (N = 116) and 272 for industrial hemp (N = 87). Comparing HASCH data with corresponding GBS data on a collection of diverse C. sativa accessions demonstrated high concordance and resulted in comparable phylogenies and genetic distance matrices. Using HASCH on a segregating F2 population derived from a cross between a tetrahydrocannabinol (THC)-dominant and a cannabidiol (CBD)-dominant accession resulted in a genetic map consisting of 310 markers, comprising 10 linkage groups and a total size of 582.7 cM. Quantitative Trait Locus (QTL) mapping identified a major QTL for CBD content on chromosome 7, consistent with previous findings.
CONCLUSION
CONCLUSIONS
HASCH constitutes a versatile, easy to use and cost-effective genotyping solution for the rapidly growing Cannabis research community. It provides consistent genetic fingerprints of 1504 SNPs with wide applicability genetic resource management, quantitative genetics and breeding.
Identifiants
pubmed: 39210290
doi: 10.1186/s12864-024-10734-z
pii: 10.1186/s12864-024-10734-z
pmc: PMC11363669
doi:
Substances chimiques
Medical Marijuana
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
818Subventions
Organisme : Southern Cross University
ID : PhD stipend
Organisme : Australian Research Council
ID : LP210200606
Informations de copyright
© 2024. The Author(s).
Références
Sci Rep. 2019 Nov 12;9(1):16583
pubmed: 31719627
Mach Learn. 2020;109(2):251-277
pubmed: 32174648
Breed Sci. 2023 Sep;73(4):415-420
pubmed: 38106505
Sci Rep. 2018 Oct 8;8(1):14920
pubmed: 30297917
Bioinformatics. 2017 Jul 01;33(13):2037-2039
pubmed: 28205675
Nat Genet. 2005 Nov;37(11):1217-23
pubmed: 16244653
Bioinformatics. 2005 Jan 15;21(2):263-5
pubmed: 15297300
Plant Cell Physiol. 2014 Jan;55(1):e8
pubmed: 24363285
Genome Res. 2019 Jan;29(1):146-156
pubmed: 30409771
Plant Physiol. 2020 Sep;184(1):130-147
pubmed: 32591428
Plant J. 2020 Jan;101(1):37-56
pubmed: 31469934
Plant Physiol. 2019 Aug;180(4):1877-1897
pubmed: 31138625
PLoS One. 2015 Mar 18;10(3):e0119873
pubmed: 25785447
Proc Natl Acad Sci U S A. 1987 Apr;84(8):2363-7
pubmed: 3470801
Bioinformatics. 2003 May 1;19(7):889-90
pubmed: 12724300
PLoS One. 2011 May 04;6(5):e19379
pubmed: 21573248
Bioinformatics. 2007 Oct 1;23(19):2633-5
pubmed: 17586829
Genomics. 1992 Nov;14(3):604-10
pubmed: 1427888
DNA Res. 2011 Aug;18(4):221-32
pubmed: 21669962
Plant Sci. 2016 Jan;242:131-139
pubmed: 26566831
Mol Plant. 2014 Mar;7(3):541-53
pubmed: 24121292
Rice (N Y). 2019 Jul 26;12(1):55
pubmed: 31350673
Rice (N Y). 2017 Aug 30;10(1):40
pubmed: 28856618
PLoS Genet. 2013 Apr;9(4):e1003449
pubmed: 23637621
PLoS One. 2014 Feb 28;9(2):e90346
pubmed: 24587335
Trends Plant Sci. 2017 Nov;22(11):917-929
pubmed: 28886910
BMC Plant Biol. 2021 Jan 6;21(1):18
pubmed: 33407135
Nucleic Acids Res. 2012 Aug;40(15):e115
pubmed: 22730293
Nutrients. 2020 Jun 29;12(7):
pubmed: 32610691
Nat Commun. 2016 Feb 04;7:10532
pubmed: 26842267
Genetics. 2021 Oct 2;219(2):
pubmed: 34173826
Sci Adv. 2021 Jul 16;7(29):
pubmed: 34272249
Sci Rep. 2019 May 6;9(1):6904
pubmed: 31061473
BMC Bioinformatics. 2022 Jul 8;23(1):265
pubmed: 35804290
Front Pharmacol. 2023 Jun 15;14:1200269
pubmed: 37397476
New Phytol. 2015 Dec;208(4):1241-50
pubmed: 26189495
Nat Methods. 2019 Jan;16(1):59-62
pubmed: 30559431
J Pharm Biomed Anal. 2018 Feb 20;150:208-219
pubmed: 29247961
Artif Intell Med. 2004 Jul;31(3):183-96
pubmed: 15302085
New Phytol. 2021 May;230(4):1665-1679
pubmed: 33521943
Sci Rep. 2016 Feb 08;6:20631
pubmed: 26853558
Genome Biol. 2011 Oct 20;12(10):R102
pubmed: 22014239
Bioinformatics. 2018 Nov 15;34(22):3924-3925
pubmed: 29868827
Bioinformatics. 2016 Aug 1;32(15):2306-12
pubmed: 27153730
J Hered. 2002 Jan-Feb;93(1):77-8
pubmed: 12011185
BMC Genomics. 2021 Nov 18;22(1):835
pubmed: 34794394