Transcriptome-based prediction for polygenic traits in rice using different gene subsets.


Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
01 Oct 2024
Historique:
received: 24 06 2024
accepted: 13 09 2024
medline: 2 10 2024
pubmed: 2 10 2024
entrez: 1 10 2024
Statut: epublish

Résumé

Transcriptome-based prediction of complex phenotypes is a relatively new statistical method that links genetic variation to phenotypic variation. The selection of large-effect genes based on a priori biological knowledge is beneficial for predicting oligogenic traits; however, such a simple gene selection method is not applicable to polygenic traits because causal genes or large-effect loci are often unknown. Here, we used several gene-level features and tested whether it was possible to select a gene subset that resulted in better predictive ability than using all genes for predicting a polygenic trait. Using the phenotypic values of shoot and root traits and transcript abundances in leaves and roots of 57 rice accessions, we evaluated the predictive abilities of the transcriptome-based prediction models. Leaf transcripts predicted shoot phenotypes, such as plant height, more accurately than root transcripts, whereas root transcripts predicted root phenotypes, such as crown root length, more accurately than leaf transcripts. Furthermore, we used the following three features to train the prediction model: (1) tissue specificity of the transcripts, (2) ontology annotations, and (3) co-expression modules for selecting gene subsets. Although models trained by a gene subset often resulted in lower predictive abilities than the model trained by all genes, some gene subsets showed improved predictive ability. For example, using genes expressed in roots but not in leaves, the predictive ability for crown root diameter was improved by more than 10% (R Our results highlight both the possibility and difficulty of selecting an appropriate gene subset to predict polygenic traits from transcript abundance, given the current biological knowledge and information. Further integration of multiple sources of information, as well as improvements in gene characterization, may enable the selection of an optimal gene set for the prediction of polygenic phenotypes.

Sections du résumé

BACKGROUND BACKGROUND
Transcriptome-based prediction of complex phenotypes is a relatively new statistical method that links genetic variation to phenotypic variation. The selection of large-effect genes based on a priori biological knowledge is beneficial for predicting oligogenic traits; however, such a simple gene selection method is not applicable to polygenic traits because causal genes or large-effect loci are often unknown. Here, we used several gene-level features and tested whether it was possible to select a gene subset that resulted in better predictive ability than using all genes for predicting a polygenic trait.
RESULTS RESULTS
Using the phenotypic values of shoot and root traits and transcript abundances in leaves and roots of 57 rice accessions, we evaluated the predictive abilities of the transcriptome-based prediction models. Leaf transcripts predicted shoot phenotypes, such as plant height, more accurately than root transcripts, whereas root transcripts predicted root phenotypes, such as crown root length, more accurately than leaf transcripts. Furthermore, we used the following three features to train the prediction model: (1) tissue specificity of the transcripts, (2) ontology annotations, and (3) co-expression modules for selecting gene subsets. Although models trained by a gene subset often resulted in lower predictive abilities than the model trained by all genes, some gene subsets showed improved predictive ability. For example, using genes expressed in roots but not in leaves, the predictive ability for crown root diameter was improved by more than 10% (R
CONCLUSIONS CONCLUSIONS
Our results highlight both the possibility and difficulty of selecting an appropriate gene subset to predict polygenic traits from transcript abundance, given the current biological knowledge and information. Further integration of multiple sources of information, as well as improvements in gene characterization, may enable the selection of an optimal gene set for the prediction of polygenic phenotypes.

Identifiants

pubmed: 39354337
doi: 10.1186/s12864-024-10803-3
pii: 10.1186/s12864-024-10803-3
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

915

Informations de copyright

© 2024. The Author(s).

Références

Meuwissen TH, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):1819–29. https://doi.org/10.1093/genetics/157.4.1819 .
doi: 10.1093/genetics/157.4.1819 pubmed: 11290733 pmcid: 1461589
Hickey JM, Chiurugwi T, Mackay I, Powell W. Implementing genomic selection in CGIAR breeding programs workshop participants. Genomic prediction unifies animal and plant breeding programs to form platforms for biological discovery. Nat Genet. 2017;49(9):1297–303. https://doi.org/10.1038/ng.3920 .
doi: 10.1038/ng.3920 pubmed: 28854179
Voss-Fels KP, Cooper M, Hayes BJ. Accelerating crop genetic gains with genomic selection. Theor Appl Genet. 2019;132(3):669–86. https://doi.org/10.1007/s00122-018-3270-8 .
doi: 10.1007/s00122-018-3270-8 pubmed: 30569365
Gianola D, de los Campos G, Hill WG, Manfredi E, Fernando R. Additive genetic variability and the bayesian alphabet. Genetics. 2009;183(1):347–63. https://doi.org/10.1534/genetics.109.103952 .
doi: 10.1534/genetics.109.103952 pubmed: 19620397 pmcid: 2746159
Varona L, Legarra A, Toro MA, Vitezica ZG. Non-additive effects in genomic selection. Front Genet. 2018;9:78. https://doi.org/10.3389/fgene.2018.00078 .
doi: 10.3389/fgene.2018.00078 pubmed: 29559995 pmcid: 5845743
Ishimori M, Hattori T, Yamazaki K, Takanashi H, Fujimoto M, Kajiya-Kanegae H, et al. Impacts of dominance effects on genomic prediction of sorghum hybrid performance. Breed Sci. 2020;70(5):605–16. https://doi.org/10.1270/jsbbs.20042 .
doi: 10.1270/jsbbs.20042 pubmed: 33603557 pmcid: 7878944
Ramstein GP, Larsson SJ, Cook JP, Edwards JW, Ersoz ES, Flint-Garcia S, et al. Dominance effects and functional enrichments improve prediction of agronomic traits in hybrid maize. Genetics. 2020;215(1):215–30. https://doi.org/10.1534/genetics.120.303025 .
doi: 10.1534/genetics.120.303025 pubmed: 32152047 pmcid: 7198274
Azodi CB, Pardo J, VanBuren R, de Los Campos G, Shiu SH. Transcriptome-based prediction of complex traits in maize. Plant Cell. 2020;32(1):139–51. https://doi.org/10.1105/tpc.19.00332 .
doi: 10.1105/tpc.19.00332 pubmed: 31641024
Frisch M, Thiemann A, Fu J, Schrag TA, Scholten S, Melchinger AE. Transcriptome-based distance measures for grouping of germplasm and prediction of hybrid performance in maize. Theor Appl Genet. 2010;120(2):441–50. https://doi.org/10.1007/s00122-009-1204-1 .
doi: 10.1007/s00122-009-1204-1 pubmed: 19911157
Guo Z, Magwire MM, Basten CJ, Xu Z, Wang D. Evaluation of the utility of gene expression and metabolic information for genomic prediction in maize. Theor Appl Genet. 2016;129(12):2413–27. https://doi.org/10.1007/s00122-016-2780-5 .
doi: 10.1007/s00122-016-2780-5 pubmed: 27586153
Schrag TA, Westhues M, Schipprack W, Seifert F, Thiemann A, Scholten S, et al. Beyond genomic prediction: combining different types of omics data can improve prediction of hybrid performance in maize. Genetics. 2018;208(4):1373–85. https://doi.org/10.1534/genetics.117.300374 .
doi: 10.1534/genetics.117.300374 pubmed: 29363551 pmcid: 5887136
Xu S, Xu Y, Gong L, Zhang Q. Metabolomic prediction of yield in hybrid rice. Plant J. 2016;88(2):219–27. https://doi.org/10.1111/tpj.13242 .
doi: 10.1111/tpj.13242 pubmed: 27311694
Knoch D, Werner CR, Meyer RC, Riewe D, Abbadi A, Lücke S, et al. Multi-omics-based prediction of hybrid performance in canola. Theor Appl Genet. 2021;134(4):1147–65. https://doi.org/10.1007/s00122-020-03759-x .
doi: 10.1007/s00122-020-03759-x pubmed: 33523261 pmcid: 7973648
Hu H, Campbell MT, Yeats TH, Zheng X, Runcie DE, Covarrubias-Pazaran D, et al. Multi-omics prediction of oat agronomic and seed nutritional traits across environments and in distantly related populations. Theor Appl Genet. 2021;134(12):4043–54. https://doi.org/10.1007/s00122-021-03946-4 .
doi: 10.1007/s00122-021-03946-4 pubmed: 34643760 pmcid: 8580906
Hershberger J, Tanaka R, Wood JC, Kaczmar N, Wu D, Hamilton JP, et al. Transcriptome-wide association and prediction for carotenoids and tocochromanols in fresh sweet corn kernels. Plant Genome. 2022;15(2):e20197. https://doi.org/10.1002/tpg2.20197 .
doi: 10.1002/tpg2.20197 pubmed: 35262278
Wade AR, Duruflé H, Sanchez L, Segura V. eQTLs are key players in the integration of genomic and transcriptomic data for phenotype prediction. BMC Genomics. 2022;23(1):476. https://doi.org/10.1186/s12864-022-08690-7 .
doi: 10.1186/s12864-022-08690-7 pubmed: 35764918 pmcid: 9238188
Chantaraamporn J, Phumikhet P, Nguantad S, Techo T, Charoensawan V. Machine learning applications for transcription level and phenotype predictions. IUBMB Life. 2022;4(12):1273–87. https://doi.org/10.1002/iub.2693 .
doi: 10.1002/iub.2693
Cheng CY, Li Y, Varala K, Bubert J, Huang J, Kim GJ, et al. Evolutionarily informed machine learning enhances the power of predictive gene-to-phenotype relationships. Nat Commun. 2021;12(1):5627. https://doi.org/10.1038/s41467-021-25893-w .
doi: 10.1038/s41467-021-25893-w pubmed: 34561450 pmcid: 8463701
Bernardo R. Genomewide selection when major genes are known. Crop Sci. 2014;54:68–75. https://doi.org/10.2135/cropsci2013.05.0315 .
doi: 10.2135/cropsci2013.05.0315
Spindel JE, Begum H, Akdemir D, Collard B, Redoña E, Jannink JL, et al. Genome-wide prediction models that incorporate de novo GWAS are a powerful new tool for tropical rice improvement. Heredity. 2016;116(4):395–408. https://doi.org/10.1038/hdy.2015.113 .
doi: 10.1038/hdy.2015.113 pubmed: 26860200 pmcid: 4806696
Tanaka R, Wu D, Li X, Tibbs-Cortes LE, Wood JC, Magallanes-Lundback M, et al. Leveraging prior biological knowledge improves prediction of tocochromanols in maize grain. Plant Genome. 2023;16(4):e20276. https://doi.org/10.1002/tpg2.20276 .
doi: 10.1002/tpg2.20276 pubmed: 36321716
Morota G, Abdollahi-Arpanahi R, Kranis A, Gianola D. Genome-enabled prediction of quantitative traits in chickens using genomic annotation. BMC Genomics. 2014;15:109. https://doi.org/10.1186/1471-2164-15-109 .
doi: 10.1186/1471-2164-15-109 pubmed: 24502227 pmcid: 3922252
Chateigner A, Lesage-Descauses MC, Rogier O, Jorge V, Leplé JC, Brunaud V, et al. Gene expression predictions and networks in natural populations supports the omnigenic theory. BMC Genomics. 2020;21(1):416. https://doi.org/10.1186/s12864-020-06809-2 .
doi: 10.1186/s12864-020-06809-2 pubmed: 32571208 pmcid: 7310122
Sakai H, Lee SS, Tanaka T, Numa H, Kim J, Kawahara Y, et al. Rice annotation project database (RAP-DB): an integrative and interactive database for rice genomics. Plant Cell Physiol. 2013;54(2):e6. https://doi.org/10.1093/pcp/pcs183 .
doi: 10.1093/pcp/pcs183 pubmed: 23299411 pmcid: 3583025
Yamazaki Y, Sakaniwa S, Tsuchiya R, Nonomura KI, Kurata N. Oryzabase: an integrated information resource for rice science. Breed Sci. 2010;60(5):544–8. https://doi.org/10.1270/jsbbs.60.544 .
doi: 10.1270/jsbbs.60.544
Kojima Y, Ebana K, Fukuoka S, Nagamine T, Kawase M. Development of an RFLP-based rice diversity research set of germplasm. Breed Sci. 2005;55(4):431–40. https://doi.org/10.1270/jsbbs.55.431 .
doi: 10.1270/jsbbs.55.431
Tanaka N, Shenton M, Kawahara Y, Kumagai M, Sakai H, Kanamori H, et al. Whole-genome sequencing of the NARO world rice core collection (WRC) as the basis for diversity and association studies. Plant Cell Physiol. 2020;61(5):922–32. https://doi.org/10.1093/pcp/pcaa019 .
doi: 10.1093/pcp/pcaa019 pubmed: 32101292 pmcid: 7426033
Kawakatsu T, Teramoto S, Takayasu S, Maruyama N, Nishijima R, Kitomi Y, et al. The transcriptomic landscapes of rice cultivars with diverse root system architectures grown in upland field conditions. Plant J. 2021;106(4):1177–90. https://doi.org/10.1111/tpj.15226 .
doi: 10.1111/tpj.15226 pubmed: 33751672
Wei S, Tanaka R, Kawakatsu T, Teramoto S, Tanaka N, Shenton M, et al. Genome- and transcriptome-wide association studies to discover candidate genes for diverse root phenotypes in cultivated rice. Rice. 2023;16(1):55. https://doi.org/10.1186/s12284-023-00672-x .
doi: 10.1186/s12284-023-00672-x pubmed: 38063928 pmcid: 10709265
Teramoto S, Kitomi Y, Nishijima R, Takayasu S, Maruyama N, Uga Y. Backhoe-assisted monolith method for plant root phenotyping under upland conditions. Breed Sci. 2019;69(3):508–13. https://doi.org/10.1270/jsbbs.19019 .
doi: 10.1270/jsbbs.19019 pubmed: 31598085 pmcid: 6776139
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40. https://doi.org/10.1093/bioinformatics/btp616 .
doi: 10.1093/bioinformatics/btp616 pubmed: 19910308
VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci. 2008;91(11):4414–23. https://doi.org/10.3168/jds.2007-0980 .
doi: 10.3168/jds.2007-0980 pubmed: 18946147
Endelman JB. Ridge regression and other kernels for genomic selection with R package rrBLUP. Plant Genome. 2011;4(3):250–5. https://doi.org/10.3835/plantgenome2011.08.0024 .
doi: 10.3835/plantgenome2011.08.0024
Akdemir D, Godfrey OU. EMMREML: Fitting Mixed Models with Known Covariance Structures. 2015. https://CRAN.R-project.org/package=EMMREML
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9:559. https://doi.org/10.1186/1471-2105-9-559 .
doi: 10.1186/1471-2105-9-559 pubmed: 19114008 pmcid: 2631488
Yu G, Wang LG, Han Y, He QY. clusterProfiler: an R package for comparing biological themes among gene clusters. OMICS. 2012;16(5):284–7. https://doi.org/10.1089/omi.2011.0118 .
doi: 10.1089/omi.2011.0118 pubmed: 22455463 pmcid: 3339379
Wu T, Hu E, Xu S, Chen M, Guo P, Dai Z, et al. clusterProfiler 4.0: a universal enrichment tool for interpreting omics data. Innov. 2021;l2(3):100141. https://doi.org/10.1016/j.xinn.2021.100141 .
doi: 10.1016/j.xinn.2021.100141
Tanimoto E. Tall or short? Slender or thick? A plant strategy for regulating elongation growth of roots by low concentrations of gibberellin. Ann Bot. 2012;110(2):373–81. https://doi.org/10.1093/aob/mcs049 .
doi: 10.1093/aob/mcs049 pubmed: 22437663 pmcid: 3394641
Shtin M, Dello Ioio R, Del Bianco M. It’s time for a change: the role of gibberellin in root meristem development. Front Plant Sci. 2022;13:882517. https://doi.org/10.3389/fpls.2022.882517 .
doi: 10.3389/fpls.2022.882517 pubmed: 35592570 pmcid: 9112047
Cao Y, Ai H, Jain A, Wu X, Zhang L, Pei W, et al. Identification and expression analysis of OsLPR family revealed the potential roles of OsLPR3 and 5 in maintaining phosphate homeostasis in rice. BMC Plant Biol. 2016;16(1):210. https://doi.org/10.1186/s12870-016-0853-x .
doi: 10.1186/s12870-016-0853-x pubmed: 27716044 pmcid: 5048653
Sun L, Tian J, Zhang H, Liao H. Phytohormone regulation of root growth triggered by P deficiency or Al toxicity. J Exp Bot. 2016;67(12):3655–64. https://doi.org/10.1093/jxb/erw188 .
doi: 10.1093/jxb/erw188 pubmed: 27190050
Wang J, Zhou L, Shi H, Chern M, Yu H, Yi H, et al. A single transcription factor promotes both yield and immunity in rice. Science. 2018;361(6406):1026–8. https://doi.org/10.1126/science.aat7675 .
doi: 10.1126/science.aat7675 pubmed: 30190406
Todaka D, Nakashima K, Maruyama K, Kidokoro S, Osakabe Y, Ito Y, et al. Rice phytochrome-interacting factor-like protein OsPIL1 functions as a key regulator of internode elongation and induces a morphological response to drought stress. Proc Natl Acad Sci. 2012;109(39):15947–52. https://doi.org/10.1073/pnas.1207324109 .
doi: 10.1073/pnas.1207324109 pubmed: 22984180 pmcid: 3465374
Li C, Liu Y, Shen WH, Yu Y, Dong A. Chromatin-remodeling factor OsINO80 is involved in regulation of gibberellin biosynthesis and is crucial for rice plant growth and development. J Integr Plant Biol. 2018;60(2):144–59. https://doi.org/10.1111/jipb.12603 .
doi: 10.1111/jipb.12603 pubmed: 29045007
Sui P, Jin J, Ye S, Mu C, Gao J, Feng H, et al. H3K36 methylation is critical for brassinosteroid-regulated plant growth and development in rice. Plant J. 2012;70(2):340–7. https://doi.org/10.1111/j.1365-313x.2011.04873.x .
doi: 10.1111/j.1365-313x.2011.04873.x pubmed: 22136623
Kadambari G, Vemireddy LR, Srividhya A, Nagireddy R, Jena SS, Gandikota M, et al. QTL-Seq-based genetic analysis identifies a major genomic region governing dwarfness in rice (Oryza sativa L). Plant Cell Rep. 2018;37(4):677–87. https://doi.org/10.1007/s00299-018-2260-2 .
doi: 10.1007/s00299-018-2260-2 pubmed: 29387899
Kubo FC, Yasui Y, Ohmori Y, Kumamaru T, Tanaka W, Hirano HY. DWARF WITH SLENDER LEAF1 encoding a histone deacetylase plays diverse roles in rice development. Plant Cell Physiol. 2020;61(3):457–69. https://doi.org/10.1093/pcp/pcz210 .
doi: 10.1093/pcp/pcz210 pubmed: 31697317
Nemoto K, Morita S, Baba T. Shoot and root development in rice related to the phyllochron. Crop Sci. 1995;35(1):24–9. https://doi.org/10.2135/cropsci1995.0011183X003500010005x .
doi: 10.2135/cropsci1995.0011183X003500010005x
Rebouillat J, Dievart A, Verdeil JL, Escoute J, Giese G, Breitler JC, et al. Molecular genetics of rice root development. Rice. 2009;2:15–34. https://doi.org/10.1007/s12284-008-9016-5 .
doi: 10.1007/s12284-008-9016-5
Coudert Y, Périn C, Courtois B, Khong NG, Gantet P. Genetic control of root development in rice, the model cereal. Trends Plant Sci. 2010;15(4):219–26. https://doi.org/10.1016/j.tplants.2010.01.008 .
doi: 10.1016/j.tplants.2010.01.008 pubmed: 20153971
Nobuta K, Venu RC, Lu C, Beló A, Vemaraju K, Kulkarni K, et al. An expression atlas of rice mRNAs and small RNAs. Nat Biotechnol. 2007;25(4):473–7. https://doi.org/10.1038/nbt1291 .
doi: 10.1038/nbt1291 pubmed: 17351617
Fujita M, Horiuchi Y, Ueda Y, Mizuta Y, Kubo T, Yano K, et al. Rice expression atlas in reproductive development. Plant Cell Physiol. 2010;51(12):2060–81. https://doi.org/10.1093/pcp/pcq165 .
doi: 10.1093/pcp/pcq165 pubmed: 21062870
Wang L, Xie W, Chen Y, Tang W, Yang J, Ye R, et al. A dynamic gene expression atlas covering the entire life cycle of rice. Plant J. 2010;61(5):752–66. https://doi.org/10.1111/j.1365-313X.2009.04100.x .
doi: 10.1111/j.1365-313X.2009.04100.x pubmed: 20003165
Zaidem ML, Groen SC, Purugganan MD. Evolutionary and ecological functional genomics, from lab to the wild. Plant J. 2019;97(1):40–55. https://doi.org/10.1111/tpj.14167 .
doi: 10.1111/tpj.14167 pubmed: 30444573
Hashida Y, Tezuka A, Nomura Y, Kamitani M, Kashima M, et al. Fillable and unfillable gaps in plant transcriptome under field and controlled environments. Plant Cell Environ. 2022;45(8):2410–27. https://doi.org/10.1111/pce.14367 .
doi: 10.1111/pce.14367 pubmed: 35610174 pmcid: 9544781
Nagano AJ, Sato Y, Mihara M, Antonio BA, Motoyama R, et al. Deciphering and prediction of transcriptome dynamics under fluctuating field conditions. Cell. 2012;151(6):1358–69. https://doi.org/10.1016/j.cell.2012.10.048 .
doi: 10.1016/j.cell.2012.10.048 pubmed: 23217716
Nagano AJ, Kawagoe T, Sugisaka J, Honjo MN, Iwayama K, et al. Annual transcriptome dynamics in natural environments reveals plant seasonal adaptation. Nat Plants. 2019;5(1):74–83. https://doi.org/10.1038/s41477-018-0338-z .
doi: 10.1038/s41477-018-0338-z pubmed: 30617252
Quintana-Murci L, Alcaïs A, Abel L, Casanova JL, Immunology. Naturanatura: clinical, epidemiological and evolutionary genetics of infectious diseases. Nat Immunol. 2007;8(11):1165–71. https://doi.org/10.1038/ni1535 .
doi: 10.1038/ni1535 pubmed: 17952041

Auteurs

Ryokei Tanaka (R)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan. tanakar015@affrc.go.jp.

Tsubasa Kawai (T)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan.

Taiji Kawakatsu (T)

Institute of Agrobiological Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8604, Japan.

Nobuhiro Tanaka (N)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan.

Matthew Shenton (M)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan.

Shiori Yabe (S)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan.

Yusaku Uga (Y)

Institute of Crop Sciences, National Agriculture & Food Research Organization, Tsukuba, Ibaraki, 305-8518, Japan.

Articles similaires

Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family
Genome, Viral Ralstonia Composting Solanum lycopersicum Bacteriophages
Semiconductors Photosynthesis Polymers Carbon Dioxide Bacteria
Arabidopsis Arabidopsis Proteins Osmotic Pressure Cytoplasm RNA, Messenger

Classifications MeSH