GenRCA: a user-friendly rare codon analysis tool for comprehensive evaluation of codon usage preferences based on coding sequences in genomes.

Codon usage Gene design Protein expression Rare codon analysis

Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
27 Sep 2024
Historique:
received: 20 12 2023
accepted: 17 09 2024
medline: 28 9 2024
pubmed: 28 9 2024
entrez: 28 9 2024
Statut: epublish

Résumé

The study of codon usage bias is important for understanding gene expression, evolution and gene design, providing critical insights into the molecular processes that govern the function and regulation of genes. Codon Usage Bias (CUB) indices are valuable metrics for understanding codon usage patterns across different organisms without extensive experiments. Considering that there is no one-fits-all index for all species, a comprehensive platform supporting the calculation and analysis of multiple CUB indices for codon optimization is greatly needed. Here, we release GenRCA, an updated version of our previous Rare Codon Analysis Tool, as a free and user-friendly website for all-inclusive evaluation of codon usage preferences of coding sequences. In this study, we manually reviewed and implemented up to 31 codon preference indices, with 65 expression host organisms covered and batch processing of multiple gene sequences supported, aiming to improve the user experience and provide more comprehensive and efficient analysis. Our website fills a gap in the availability of comprehensive tools for species-specific CUB calculations, enabling researchers to thoroughly assess the protein expression level based on a comprehensive list of 31 indices and further guide the codon optimization.

Sections du résumé

BACKGROUND BACKGROUND
The study of codon usage bias is important for understanding gene expression, evolution and gene design, providing critical insights into the molecular processes that govern the function and regulation of genes. Codon Usage Bias (CUB) indices are valuable metrics for understanding codon usage patterns across different organisms without extensive experiments. Considering that there is no one-fits-all index for all species, a comprehensive platform supporting the calculation and analysis of multiple CUB indices for codon optimization is greatly needed.
RESULTS RESULTS
Here, we release GenRCA, an updated version of our previous Rare Codon Analysis Tool, as a free and user-friendly website for all-inclusive evaluation of codon usage preferences of coding sequences. In this study, we manually reviewed and implemented up to 31 codon preference indices, with 65 expression host organisms covered and batch processing of multiple gene sequences supported, aiming to improve the user experience and provide more comprehensive and efficient analysis.
CONCLUSIONS CONCLUSIONS
Our website fills a gap in the availability of comprehensive tools for species-specific CUB calculations, enabling researchers to thoroughly assess the protein expression level based on a comprehensive list of 31 indices and further guide the codon optimization.

Identifiants

pubmed: 39333857
doi: 10.1186/s12859-024-05934-z
pii: 10.1186/s12859-024-05934-z
doi:

Substances chimiques

Codon 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

309

Informations de copyright

© 2024. The Author(s).

Références

Bahiri-Elitzur S, Tuller T. Codon-based indices for modeling gene expression and transcript evolution. Comput Struct Biotechnol J. 2021;19:2646–63.
doi: 10.1016/j.csbj.2021.04.042 pubmed: 34025951 pmcid: 8122159
Hershberg R, Petrov DA. Selection on codon bias. Annu Rev Genet. 2008;42:287–99.
doi: 10.1146/annurev.genet.42.110807.091442 pubmed: 18983258
Parvathy ST, Udayasuriyan V, Bhadana V. Codon usage bias. Mol Biol Rep. 2022;49:539–65.
doi: 10.1007/s11033-021-06749-4 pubmed: 34822069
Liu Y. A code within the genetic code: codon usage regulates co-translational protein folding. Cell Commun Signal. 2020;18:145.
doi: 10.1186/s12964-020-00642-6 pubmed: 32907610 pmcid: 7488015
Athey J, et al. A new and updated resource for codon usage tables. BMC Bioinf. 2017;18:391.
doi: 10.1186/s12859-017-1793-7
Quax TEF, Claassens NJ, Söll D, van der Oost J. Codon bias as a means to fine-tune gene expression. Mol Cell. 2015;59:149–61.
doi: 10.1016/j.molcel.2015.05.035 pubmed: 26186290 pmcid: 4794256
Sharp PM, Li WH. An evolutionary perspective on synonymous codon usage in unicellular organisms. J Mol Evol. 1986;24:28–38.
doi: 10.1007/BF02099948 pubmed: 3104616
Wright F. The ‘effective number of codons’ used in a gene. Gene. 1990;87:23–9.
doi: 10.1016/0378-1119(90)90491-9 pubmed: 2110097
Satapathy SS, Sahoo AK, Ray SK, Ghosh TC. Codon degeneracy and amino acid abundance influence the measures of codon usage bias: improved Nc ( N̂
doi: 10.1111/gtc.12474 pubmed: 28185367
Roymondal U, Das S, Sahoo S. Predicting gene expression level from relative codon usage bias: an application to Escherichia coli genome. DNA Res. 2009;16:13–30.
doi: 10.1093/dnares/dsn029 pubmed: 19131380 pmcid: 2646356
Sabi R, Tuller T. Modelling the efficiency of codon–tRNA interactions based on codon usage bias. DNA Res. 2014;21:511–26.
doi: 10.1093/dnares/dsu017 pubmed: 24906480 pmcid: 4195497
Zhang Z, et al. Codon Deviation Coefficient: a novel measure for estimating codon usage bias and its statistical significance. BMC Bioinf. 2012;13:43.
doi: 10.1186/1471-2105-13-43
Supek F, Vlahoviček K. Comparison of codon usage measures and their applicability in prediction of microbial gene expressivity. BMC Bioinf. 2005;6:182.
doi: 10.1186/1471-2105-6-182
Freire-Picos MA, et al. Codon usage in Kluyveromyces lactis and in yeast cytochrome c-encoding genes. Gene. 1994;139:43–9.
doi: 10.1016/0378-1119(94)90521-5 pubmed: 8112587
Wan X-F, Xu D, Kleinhofs A, Zhou J. Quantitative relationship between synonymous codon usage bias and GC composition across unicellular genomes. BMC Evol Biol. 2004;4:19.
doi: 10.1186/1471-2148-4-19 pubmed: 15222899 pmcid: 476735
Wan X-F, Zhou J, Xu D. CodonO: a new informatics method for measuring synonymous codon usage bias within and across genomes. Int J Gen Syst. 2006;35:109–25.
doi: 10.1080/03081070500502967
Suzuki H, Saito R, Tomita M. The ‘weighted sum of relative entropy’: a new index for synonymous codon usage bias. Gene. 2004;335:19–23.
doi: 10.1016/j.gene.2004.03.001 pubmed: 15194186
Gribskov M, Devereux J, Burgess RR. The codon preference plot: graphic analysis of protein coding sequences and prediction of gene expression. Nucleic Acids Res. 1984;12:539–49.
doi: 10.1093/nar/12.1Part2.539 pubmed: 6694906 pmcid: 321069
Urrutia AO, Hurst LD. Codon usage bias covaries with expression breadth and the rate of synonymous evolution in humans, but this is not evidence for selection. Genetics. 2001;159:1191–9.
doi: 10.1093/genetics/159.3.1191 pubmed: 11729162 pmcid: 1461876
Sharp PM, Li WH. The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucleic Acids Res. 1987;15:1281–95.
doi: 10.1093/nar/15.3.1281 pubmed: 3547335 pmcid: 340524
Ikemura T. Correlation between the abundance of Escherichia coli transfer RNAs and the occurrence of the respective codons in its protein genes: a proposal for a synonymous codon choice that is optimal for the E. coli translational system. J. Mol. Biol. 1981;151, 389–409.
Ikemura T. Correlation between the abundance of yeast transfer RNAs and the occurrence of the respective codons in protein genes. J Mol Biol. 1982;158:573–97.
doi: 10.1016/0022-2836(82)90250-9 pubmed: 6750137
Bourret J, Alizon S, Bravo IG. COUSIN (COdon usage similarity INdex): a normalized measure of codon usage preferences. Genome Biol Evol. 2019;11:3523–8.
doi: 10.1093/gbe/evz262 pubmed: 31800035 pmcid: 6934141
Bennetzen JL, Hall BD. Codon selection in yeast. J Biol Chem. 1982;257:3026–31.
doi: 10.1016/S0021-9258(19)81068-2 pubmed: 7037777
Suzuki H, Saito R, Tomita M. Measure of synonymous codon usage diversity among genes in bacteria. BMC Bioinf. 2009;10:167.
doi: 10.1186/1471-2105-10-167
Fox JM, Erill I. Relative codon adaptation: a generic codon bias index for prediction of gene expression. DNA Res. 2010;17:185–96.
doi: 10.1093/dnares/dsq012 pubmed: 20453079 pmcid: 2885275
Diament A, Pinter RY, Tuller T. Three-dimensional eukaryotic genomic organization is strongly correlated with codon usage expression and function. Nat Commun. 2014;5:5876.
doi: 10.1038/ncomms6876 pubmed: 25510862
Karlin S, Mrázek J, Campbell AM. Codon usages in different gene classes of the Escherichia coli genome. Mol Microbiol. 1998;29:1341–55.
doi: 10.1046/j.1365-2958.1998.01008.x pubmed: 9781873
Reis Md. Solving the riddle of codon usage preferences: a test for translational selection. Nucleic Acids Res. 2004;32, 5036–5044.
Anwar AM, et al. gtAI: an improved species-specific tRNA adaptation index using the genetic algorithm. Front Mol Biosci. 2023;10:1218518.
doi: 10.3389/fmolb.2023.1218518 pubmed: 37469707 pmcid: 10352787
Gouy M, Gautier C. Codon usage in bacteria: correlation with gene expressivity. Nucleic Acids Res. 1982;10:7055–74.
doi: 10.1093/nar/10.22.7055 pubmed: 6760125 pmcid: 326988
Stenico M, Lloyd AT, Sharp PM. Codon usage in Caenorhabditis elegans: delineation of translational selection and mutational biases. Nucleic Acids Res. 1994;22:2437–46.
doi: 10.1093/nar/22.13.2437 pubmed: 8041603 pmcid: 308193
Alexaki A, et al. Codon and codon-pair usage tables (CoCoPUTs): facilitating genetic variation analyses and recombinant gene design. J Mol Biol. 2019;431:2434–41.
doi: 10.1016/j.jmb.2019.04.021 pubmed: 31029701
Kunec D, Osterrieder N. Codon pair bias is a direct consequence of dinucleotide bias. Cell Rep. 2016;14:55–67.
doi: 10.1016/j.celrep.2015.12.011 pubmed: 26725119
Coleman JR, et al. Virus attenuation by genome-scale changes in codon pair bias. Science. 2008;320:1784–7.
doi: 10.1126/science.1155761 pubmed: 18583614 pmcid: 2754401
Plotkin JB, Dushoff J, Fraser HB. Detecting selection using a single genome sequence of M. tuberculosis and P. falciparum. Nature 428, 942–945 (2004).
Ghaemmaghami S, et al. Global analysis of protein expression in yeast. Nature. 2003;425:737–41.
doi: 10.1038/nature02046 pubmed: 14562106
Baycin-Hizal D, et al. Proteomic analysis of Chinese hamster ovary cells. J Proteome Res. 2012;11:5265–76.
doi: 10.1021/pr300476w pubmed: 22971049 pmcid: 3772721
Lu P, Vogel C, Wang R, Yao X, Marcotte EM. Absolute protein expression profiling estimates the relative contributions of transcriptional and translational regulation. Nat Biotechnol. 2007;25:117–24.
doi: 10.1038/nbt1270 pubmed: 17187058
Schwanhäusser B, et al. Global quantification of mammalian gene expression control. Nature. 2011;473:337–42.
doi: 10.1038/nature10098 pubmed: 21593866
Welch M, et al. Design parameters to control synthetic gene expression in Escherichia coli. PLoS ONE. 2009;4: e7002.
doi: 10.1371/journal.pone.0007002 pubmed: 19759823 pmcid: 2736378
Kudla G, Murray AW, Tollervey D, Plotkin JB. Coding-sequence determinants of gene expression in Escherichia coli. Science. 2009;324:255–8.
doi: 10.1126/science.1170160 pubmed: 19359587 pmcid: 3902468
Friberg M, von Rohr P, Gonnet G. Limitations of codon adaptation index and other coding DNA-based features for prediction of protein expression inSaccharomyces cerevisiae. Yeast. 2004;21:1083–93.
doi: 10.1002/yea.1150 pubmed: 15484285

Auteurs

Kunjie Fan (K)

Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai, China.

Yuanyuan Li (Y)

Production and R&D Center I of LSS, GenScript Biotech Corporation, Nanjing, China.

Zhiwei Chen (Z)

Production and R&D Center I of LSS, GenScript Biotech Corporation, Nanjing, China.

Long Fan (L)

Production and R&D Center I of LSS, GenScript (Shanghai) Biotech Co., Ltd., Shanghai, China. leo.fan@genscript.com.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
Cephalometry Humans Anatomic Landmarks Software Internet
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH