The Carthamus tinctorius L. genome sequence provides insights into synthesis of unsaturated fatty acids.
Evolutionary history
Fatty acid biosynthesis
Safflower
Whole genome duplication
Journal
BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258
Informations de publication
Date de publication:
23 May 2024
23 May 2024
Historique:
received:
19
09
2023
accepted:
10
05
2024
medline:
24
5
2024
pubmed:
24
5
2024
entrez:
23
5
2024
Statut:
epublish
Résumé
Domesticated safflower (Carthamus tinctorius L.) is a widely cultivated edible oil crop. However, despite its economic importance, the genetic basis underlying key traits such as oil content, resistance to biotic and abiotic stresses, and flowering time remains poorly understood. Here, we present the genome assembly for C. tinctorius variety Jihong01, which was obtained by integrating Oxford Nanopore Technologies (ONT) and BGI-SEQ500 sequencing results. The assembled genome was 1,061.1 Mb, and consisted of 32,379 protein-coding genes, 97.71% of which were functionally annotated. Safflower had a recent whole genome duplication (WGD) event in evolution history and diverged from sunflower approximately 37.3 million years ago. Through comparative genomic analysis at five seed development stages, we unveiled the pivotal roles of fatty acid desaturase 2 (FAD2) and fatty acid desaturase 6 (FAD6) in linoleic acid (LA) biosynthesis. Similarly, the differential gene expression analysis further reinforced the significance of these genes in regulating LA accumulation. Moreover, our investigation of seed fatty acid composition at different seed developmental stages unveiled the crucial roles of FAD2 and FAD6 in LA biosynthesis. These findings offer important insights into enhancing breeding programs for the improvement of quality traits and provide reference resource for further research on the natural properties of safflower.
Identifiants
pubmed: 38783193
doi: 10.1186/s12864-024-10405-z
pii: 10.1186/s12864-024-10405-z
doi:
Substances chimiques
Fatty Acids, Unsaturated
0
Fatty Acid Desaturases
EC 1.14.19.-
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
510Informations de copyright
© 2024. The Author(s).
Références
Chapman MA, Burke JM. DNA sequence diversity and the origin of cultivated safflower (Carthamus tinctorius L.; Asteraceae). BMC Plant Biol. 2007;7:60.
pubmed: 17986334
pmcid: 2241619
doi: 10.1186/1471-2229-7-60
Dajue L, Mündel H-H. Safflower, Carthamus Tinctorius L. Volume 7. Bioversity International; 1996.
Panahi B, Ghorbanzadeh Neghab M. Genetic characterization of Iranian safflower (Carthamus tinctorius) using inter simple sequence repeats (ISSR) markers. Physiol Mol Biology Plants: Int J Funct Plant Biology. 2013;19(2):239–43.
doi: 10.1007/s12298-012-0155-1
McPherson MA, Good AG, Topinka AK, Yang RC, McKenzie RH, Cathcart RJ, Christianson JA, Strobeck C, Hall LM. Pollen-mediated gene flow from transgenic safflower (Carthamus tinctorius L.) intended for plant molecular farming to conventional safflower. Environ Biosaf Res. 2009;8(1):19–32.
doi: 10.1051/ebr/2008023
Ahmad N, Li T, Liu Y, Hoang NQV, Ma X, Zhang X, Liu J, Yao N, Liu X, Li H. Molecular and biochemical rhythms in dihydroflavonol 4-reductase-mediated regulation of leucoanthocyanidin biosynthesis in Carthamus tinctorius L. Industrial Crops Prod. 2020;156:112838.
doi: 10.1016/j.indcrop.2020.112838
Cao S, Zhou XR, Wood CC, Green AG, Singh SP, Liu L, Liu Q. A large and functionally diverse family of Fad2 genes in safflower (Carthamus tinctorius L). BMC Plant Biol. 2013;13:5.
pubmed: 23289946
pmcid: 3554562
doi: 10.1186/1471-2229-13-5
Matthaus B, Ozcan MM, Al Juhaimi FY. Fatty acid composition and tocopherol profiles of safflower (Carthamus tinctorius L.) seed oils. Nat Prod Res. 2015;29(2):193–6.
pubmed: 25329876
doi: 10.1080/14786419.2014.971316
Hou Y, Wang Y, Liu X, Ahmad N, Wang N, Jin L, Yao N, Liu X. A cinnamate 4-HYDROXYLASE1 from Safflower promotes flavonoids Accumulation and stimulates antioxidant Defense System in Arabidopsis. Int J Mol Sci. 2023;24(6):5393.
pubmed: 36982470
pmcid: 10049626
doi: 10.3390/ijms24065393
Rapson S, Wu M, Okada S, Das A, Shrestha P, Zhou XR, Wood C, Green A, Singh S, Liu Q. A case study on the genetic origin of the high oleic acid trait through FAD2-1 DNA sequence variation in safflower (Carthamus tinctorius L). Front Plant Sci. 2015;6:691.
pubmed: 26442008
pmcid: 4563165
doi: 10.3389/fpls.2015.00691
Li R, Beaudoin F, Ammah AA, Bissonnette N, Benchaar C, Zhao X, Lei C, Ibeagha-Awemu EM. Deep sequencing shows microRNA involvement in bovine mammary gland adaptation to diets supplemented with linseed oil or safflower oil. BMC Genomics. 2015;16:884.
pubmed: 26519053
pmcid: 4628385
doi: 10.1186/s12864-015-1965-7
Kazuma K, Takahashi T, Sato K, Takeuchi H, Matsumoto T, Okuno T. Quinochalcones and flavonoids from fresh florets in different cultivars of Carthamus tinctorius L. Biosci Biotechnol Biochem. 2000;64(8):1588–99.
pubmed: 10993143
doi: 10.1271/bbb.64.1588
Nguyen VC, Nakamura Y, Kanehara K. Membrane lipid polyunsaturation mediated by FATTY ACID DESATURASE 2 (FAD2) is involved in endoplasmic reticulum stress tolerance in Arabidopsis thaliana. Plant Journal: Cell Mol Biology. 2019;99(3):478–93.
doi: 10.1111/tpj.14338
Sehgal D, Rajpal VR, Raina SN, Sasanuma T, Sasakuma T. Assaying polymorphism at DNA level for genetic diversity diagnostics of the safflower (Carthamus tinctorius L.) world germplasm resources. Genetica. 2009;135(3):457–70.
pubmed: 18649115
doi: 10.1007/s10709-008-9292-4
Lu C, Shen Q, Yang J, Wang B, Song C. The complete chloroplast genome sequence of Safflower (Carthamus tinctorius L). Mitochondrial DNA Part DNA Mapp Sequencing Anal. 2016;27(5):3351–3.
doi: 10.3109/19401736.2015.1018217
Chen J, Tang X, Ren C, Wei B, Wu Y, Wu Q, Pei J. Full-length transcriptome sequences and the identification of putative genes for flavonoid biosynthesis in safflower. BMC Genomics. 2018;19(1):548.
pubmed: 30041604
pmcid: 6057038
doi: 10.1186/s12864-018-4946-9
Bowers JE, Pearl SA, Burke JM. Genetic Mapping of Millions of SNPs in Safflower (Carthamus tinctorius L.) via Whole-Genome Resequencing. G3 2016, 6(7):2203–2211.
Pearl SA, Bowers JE, Reyes-Chin-Wo S, Michelmore RW, Burke JM. Genetic analysis of safflower domestication. BMC Plant Biol. 2014;14:43.
pubmed: 24502326
pmcid: 3925122
doi: 10.1186/1471-2229-14-43
Mayerhofer M, Mayerhofer R, Topinka D, Christianson J, Good AG. Introgression potential between safflower (Carthamus tinctorius) and wild relatives of the genus Carthamus. BMC Plant Biol. 2011;11:47.
pubmed: 21401959
pmcid: 3068944
doi: 10.1186/1471-2229-11-47
Pearl SA, Burke JM. Genetic diversity in Carthamus tinctorius (Asteraceae; safflower), an underutilized oilseed crop. Am J Bot. 2014;101(10):1640–50.
pubmed: 25096251
doi: 10.3732/ajb.1400079
Lee GA, Sung JS, Lee SY, Chung JW, Yi JY, Kim YG, Lee MC. Genetic assessment of safflower (Carthamus tinctorius L.) collection with microsatellite markers acquired via pyrosequencing method. Mol Ecol Resour. 2014;14(1):69–78.
pubmed: 23875976
doi: 10.1111/1755-0998.12146
Chapman MA, Hvala J, Strever J, Matvienko M, Kozik A, Michelmore RW, Tang S, Knapp SJ, Burke JM. Development, polymorphism, and cross-taxon utility of EST–SSR markers from safflower (Carthamus tinctorius L). Theor Appl Genet. 2009;120(1):85–91.
pubmed: 19820913
doi: 10.1007/s00122-009-1161-8
Yang Y-X, Wu W, Zheng Y-L, Chen L, Liu R-J, Huang C-Y. Genetic diversity and relationships among safflower (Carthamus tinctorius L.) analyzed by inter-simple sequence repeats (ISSRs). Genet Resour Crop Evol. 2007;54(5):1043–51.
doi: 10.1007/s10722-006-9192-3
Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV. Selection in the evolution of gene duplications. Genome Biol. 2002;3(2):Research0008.
pubmed: 11864370
pmcid: 65685
doi: 10.1186/gb-2002-3-2-research0008
Panchy N, Lehti-Shiu M, Shiu SH. Evolution of gene duplication in plants. Plant Physiol. 2016;171(4):2294–316.
pubmed: 27288366
pmcid: 4972278
doi: 10.1104/pp.16.00523
Wu Z, Liu H, Zhan W, Yu Z, Qin E, Liu S, Yang T, Xiang N, Kudrna D, Chen Y, et al. The chromosome-scale reference genome of safflower (Carthamus tinctorius) provides insights into linoleic acid and flavonoid biosynthesis. Plant Biotechnol J. 2021;19(9):1725–42.
pubmed: 33768699
pmcid: 8428823
doi: 10.1111/pbi.13586
Walker BJ, Abeel T, Shea T, Priest M, Abouelliel A, Sakthikumar S, Cuomo CA, Zeng Q, Wortman J, Young SK, et al. Pilon: an integrated tool for comprehensive microbial variant detection and genome assembly improvement. PLoS ONE. 2014;9(11):e112963.
pubmed: 25409509
pmcid: 4237348
doi: 10.1371/journal.pone.0112963
Huang S, Kang M, Xu A. HaploMerger2: rebuilding both haploid sub-assemblies from high-heterozygosity diploid genome assembly. Bioinformatics. 2017;33(16):2577–9.
pubmed: 28407147
pmcid: 5870766
doi: 10.1093/bioinformatics/btx220
Simao FA, Waterhouse RM, Ioannidis P, Kriventseva EV, Zdobnov EM. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics. 2015;31(19):3210–2.
pubmed: 26059717
doi: 10.1093/bioinformatics/btv351
Ou S, Chen J, Jiang N. Assessing genome assembly quality using the LTR Assembly Index (LAI). Nucleic Acids Res. 2018;46(21):e126.
pubmed: 30107434
pmcid: 6265445
Scaglione D, Reyes-Chin-Wo S, Acquadro A, Froenicke L, Portis E, Beitel C, Tirone M, Mauro R, Lo Monaco A, Mauromicale G, et al. The genome sequence of the outbreeding globe artichoke constructed de novo incorporating a phase-aware low-pass sequencing strategy of F1 progeny. Sci Rep. 2016;6:19427.
pubmed: 26786968
pmcid: 4726258
doi: 10.1038/srep19427
Badouin H, Gouzy J, Grassa CJ, Murat F, Staton SE, Cottret L, Lelandais-Briere C, Owens GL, Carrere S, Mayjonade B, et al. The sunflower genome provides insights into oil metabolism, flowering and Asterid evolution. Nature. 2017;546(7656):148–52.
pubmed: 28538728
doi: 10.1038/nature22380
Reyes-Chin-Wo S, Wang Z, Yang X, Kozik A, Arikit S, Song C, Xia L, Froenicke L, Lavelle DO, Truco MJ, et al. Genome assembly with in vitro proximity ligation data and whole-genome triplication in lettuce. Nat Commun. 2017;8:14953.
pubmed: 28401891
pmcid: 5394340
doi: 10.1038/ncomms14953
Barreda VD, Palazzesi L, Tellería MC, Olivero EB, Raine JI, Forest F. Early evolution of the angiosperm clade Asteraceae in the Cretaceous of Antarctica. Proceedings of the National Academy of Sciences 2015, 112(35):10989–10994.
Barker MS, Kane NC, Matvienko M, Kozik A, Michelmore RW, Knapp SJ, Rieseberg LH. Multiple paleopolyploidizations during the evolution of the Compositae reveal parallel patterns of duplicate gene retention after millions of years. Mol Biol Evol. 2008;25(11):2445–55.
pubmed: 18728074
pmcid: 2727391
doi: 10.1093/molbev/msn187
Li L, Stoeckert CJ Jr., Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13(9):2178–89.
pubmed: 12952885
pmcid: 403725
doi: 10.1101/gr.1224503
De Bie T, Cristianini N, Demuth JP, Hahn MW. CAFE: a computational tool for the study of gene family evolution. Bioinformatics. 2006;22(10):1269–71.
pubmed: 16543274
doi: 10.1093/bioinformatics/btl097
Zwaenepoel A, Van de Peer Y. Wgd-simple command line tools for the analysis of ancient whole-genome duplications. Bioinformatics. 2019;35(12):2153–5.
pubmed: 30398564
doi: 10.1093/bioinformatics/bty915
Wang Z, Hobson N, Galindo L, Zhu S, Shi D, McDill J, Yang L, Hawkins S, Neutelings G, Datla R, et al. The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads. Plant Journal: Cell Mol Biology. 2012;72(3):461–73.
doi: 10.1111/j.1365-313X.2012.05093.x
Kumar L. Mfuzz: a software package for soft clustering of microarray data. Bioinformation. 2007;2(1):5–7.
pubmed: 18084642
pmcid: 2139991
doi: 10.6026/97320630002005
Garnatje T, Garcia S, Vilatersana R, Vallès J. Genome size variation in the genus Carthamus (Asteraceae, Cardueae): systematic implications and additive changes during allopolyploidization. Ann Bot. 2006;97(3):461–7.
pubmed: 16390843
pmcid: 2803645
doi: 10.1093/aob/mcj050
Sabzalian MR, Saeidi G, Mirlohi A. Oil content and fatty acid composition in seeds of three safflower species. J Am Oil Chem Soc. 2008;85(8):717–21.
doi: 10.1007/s11746-008-1254-6
Wei X, Liu K, Zhang Y, Feng Q, Wang L, Zhao Y, Li D, Zhao Q, Zhu X, Zhu X, et al. Genetic discovery for oil production and quality in sesame. Nat Commun. 2015;6:8609.
pubmed: 26477832
doi: 10.1038/ncomms9609
Bozan B, Temelli F. Chemical composition and oxidative stability of flax, safflower and poppy seed and seed oils. Bioresour Technol. 2008;99(14):6354–9.
pubmed: 18198133
doi: 10.1016/j.biortech.2007.12.009
Ma DW, Wierzbicki AA, Field CJ, Clandinin MT. Preparation of conjugated linoleic acid from safflower oil. J Am Oil Chem Soc. 1999;76(6):729–30.
doi: 10.1007/s11746-999-0167-3
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, et al. The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007;449(7161):463–7.
pubmed: 17721507
doi: 10.1038/nature06148
Kim S, Park M, Yeom S-I, Kim Y-M, Lee JM, Lee H-A, Seo E, Choi J, Cheong K, Kim K-T. Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat Genet. 2014;46(3):270–8.
pubmed: 24441736
doi: 10.1038/ng.2877
Arabidopsis Genome I. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000;408(6814):796–815.
doi: 10.1038/35048692
Roth C, Rastogi S, Arvestad L, Dittmar K, Light S, Ekman D, Liberles DA. Evolution after gene duplication: models, mechanisms, sequences, systems, and organisms. J Experimental Zool Part B Mol Dev Evol. 2007;308(1):58–73.
doi: 10.1002/jez.b.21124
Kim S, Park M, Yeom SI, Kim YM, Lee JM, Lee HA, Seo E, Choi J, Cheong K, Kim KT, et al. Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat Genet. 2014;46(3):270–8.
pubmed: 24441736
doi: 10.1038/ng.2877
Ibarra-Laclette E, Lyons E, Hernandez-Guzman G, Perez-Torres CA, Carretero-Paulet L, Chang TH, Lan T, Welch AJ, Juarez MJ, Simpson J, et al. Architecture and evolution of a minute plant genome. Nature. 2013;498(7452):94–8.
pubmed: 23665961
pmcid: 4972453
doi: 10.1038/nature12132
Li Y, Beisson F, Pollard M, Ohlrogge J. Oil content of Arabidopsis seeds: the influence of seed anatomy, light and plant-to-plant variation. Phytochemistry. 2006;67(9):904–15.
pubmed: 16600316
doi: 10.1016/j.phytochem.2006.02.015
Marcais G, Kingsford C. A fast, lock-free approach for efficient parallel counting of occurrences of k-mers. Bioinformatics. 2011;27(6):764–70.
pubmed: 21217122
pmcid: 3051319
doi: 10.1093/bioinformatics/btr011
Vurture GW, Sedlazeck FJ, Nattestad M, Underwood CJ, Fang H, Gurtowski J, Schatz MC. GenomeScope: fast reference-free genome profiling from short reads. Bioinformatics. 2017;33(14):2202–4.
pubmed: 28369201
pmcid: 5870704
doi: 10.1093/bioinformatics/btx153
Servant N, Varoquaux N, Lajoie BR, Viara E, Chen CJ, Vert JP, Heard E, Dekker J, Barillot E. HiC-Pro: an optimized and flexible pipeline for Hi-C data processing. Genome Biol. 2015;16:259.
pubmed: 26619908
pmcid: 4665391
doi: 10.1186/s13059-015-0831-x
Durand NC, Shamim MS, Machol I, Rao SS, Huntley MH, Lander ES, Aiden EL. Juicer provides a one-click system for analyzing Loop-Resolution Hi-C experiments. Cell Syst. 2016;3(1):95–8.
pubmed: 27467249
pmcid: 5846465
doi: 10.1016/j.cels.2016.07.002
Dudchenko O, Batra SS, Omer AD, Nyquist SK, Hoeger M, Durand NC, Shamim MS, Machol I, Lander ES, Aiden AP, et al. De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds. Science. 2017;356:92–5.
pubmed: 28336562
pmcid: 5635820
doi: 10.1126/science.aal3327
RepeatModeler. Open-1.0 [ http://www.repeatmasker.org ].
Xu Z, Wang H. LTR_FINDER: an efficient tool for the prediction of full-length LTR retrotransposons. Nucleic Acids Res 2007, 35(Web Server issue):W265–268.
Benson G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999;27(2):573–80.
pubmed: 9862982
pmcid: 148217
doi: 10.1093/nar/27.2.573
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7.
pubmed: 15034147
pmcid: 390337
doi: 10.1093/nar/gkh340
Hu TT, Pattyn P, Bakker EG, Cao J, Cheng JF, Clark RM, Fahlgren N, Fawcett JA, Grimwood J, Gundlach H, et al. The Arabidopsis lyrata genome sequence and the basis of rapid genome size change. Nat Genet. 2011;43(5):476–81.
pubmed: 21478890
pmcid: 3083492
doi: 10.1038/ng.807
Ossowski S, Schneeberger K, Lucas-Lledo JI, Warthmann N, Clark RM, Shaw RG, Weigel D, Lynch M. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science. 2010;327(5961):92–4.
pubmed: 20044577
doi: 10.1126/science.1180677
Kent WJ. BLAT–the BLAST-like alignment tool. Genome Res. 2002;12(4):656–64.
pubmed: 11932250
pmcid: 187518
Birney E, Clamp M, Durbin R. GeneWise and Genomewise. Genome Res. 2004;14(5):988–95.
pubmed: 15123596
pmcid: 479130
doi: 10.1101/gr.1865504
Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12(4):357–60.
pubmed: 25751142
pmcid: 4655817
doi: 10.1038/nmeth.3317
Pertea M, Pertea GM, Antonescu CM, Chang TC, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5.
pubmed: 25690850
pmcid: 4643835
doi: 10.1038/nbt.3122
Elsik CG, Mackey AJ, Reese JT, Milshina NV, Roos DS, Weinstock GM. Creating a honey bee consensus gene set. Genome Biol. 2007;8(1):R13.
pubmed: 17241472
pmcid: 1839126
doi: 10.1186/gb-2007-8-1-r13
Jones P, Binns D, Chang HY, Fraser M, Li W, McAnulla C, McWilliam H, Maslen J, Mitchell A, Nuka G, et al. InterProScan 5: genome-scale protein function classification. Bioinformatics. 2014;30(9):1236–40.
pubmed: 24451626
pmcid: 3998142
doi: 10.1093/bioinformatics/btu031
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30(9):1312–3.
pubmed: 24451623
pmcid: 3998144
doi: 10.1093/bioinformatics/btu033
Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24(8):1586–91.
pubmed: 17483113
doi: 10.1093/molbev/msm088
Tang H, Bowers JE, Wang X, Ming R, Alam M, Paterson AH. Synteny and Collinearity in Plant genomes. Science. 2008;320:486–8.
pubmed: 18436778
doi: 10.1126/science.1153917
Ecker J, Scherer M, Schmitz G, Liebisch G. A rapid GC-MS method for quantification of positional and geometric isomers of fatty acid methyl esters. J Chromatogr B Analyt Technol Biomed Life Sci. 2012;897:98–104.
pubmed: 22542399
doi: 10.1016/j.jchromb.2012.04.015