The ribosome profiling landscape of yeast reveals a high diversity in pervasive translation.

De novo coding products Genome evolution Non-canonical translation signals Noncoding genome Pervasive translation

Journal

Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660

Informations de publication

Date de publication:
14 Oct 2024
Historique:
received: 11 09 2023
accepted: 26 09 2024
medline: 15 10 2024
pubmed: 15 10 2024
entrez: 14 10 2024
Statut: epublish

Résumé

Pervasive translation is a widespread phenomenon that plays a critical role in the emergence of novel microproteins, but the diversity of translation patterns contributing to their generation remains unclear. Based on 54 ribosome profiling (Ribo-Seq) datasets, we investigated the yeast Ribo-Seq landscape using a representation framework that allows the comprehensive inventory and classification of the entire diversity of Ribo-Seq signals, including non-canonical ones. We show that if coding regions occupy specific areas of the Ribo-Seq landscape, noncoding regions encompass a wide diversity of Ribo-Seq signals and, conversely, populate the entire landscape. Our results show that pervasive translation can, nevertheless, be associated with high specificity, with 1055 noncoding ORFs exhibiting canonical Ribo-Seq signals. Using mass spectrometry under standard conditions or proteasome inhibition with an in-house analysis protocol, we report 239 microproteins originating from noncoding ORFs that display canonical but also non-canonical Ribo-Seq signals. Each condition yields dozens of additional microprotein candidates with comparable translation properties, suggesting a larger population of volatile microproteins that are challenging to detect. Our findings suggest that non-canonical translation signals may harbor valuable information and underscore the significance of considering them in proteogenomic studies. Finally, we show that the translation outcome of a noncoding ORF is primarily determined by the initiating codon and the codon distribution in its two alternative frames, rather than features indicative of functionality. Our results enable us to propose a topology of a species' Ribo-Seq landscape, opening the way to comparative analyses of this translation landscape under different conditions.

Sections du résumé

BACKGROUND BACKGROUND
Pervasive translation is a widespread phenomenon that plays a critical role in the emergence of novel microproteins, but the diversity of translation patterns contributing to their generation remains unclear. Based on 54 ribosome profiling (Ribo-Seq) datasets, we investigated the yeast Ribo-Seq landscape using a representation framework that allows the comprehensive inventory and classification of the entire diversity of Ribo-Seq signals, including non-canonical ones.
RESULTS RESULTS
We show that if coding regions occupy specific areas of the Ribo-Seq landscape, noncoding regions encompass a wide diversity of Ribo-Seq signals and, conversely, populate the entire landscape. Our results show that pervasive translation can, nevertheless, be associated with high specificity, with 1055 noncoding ORFs exhibiting canonical Ribo-Seq signals. Using mass spectrometry under standard conditions or proteasome inhibition with an in-house analysis protocol, we report 239 microproteins originating from noncoding ORFs that display canonical but also non-canonical Ribo-Seq signals. Each condition yields dozens of additional microprotein candidates with comparable translation properties, suggesting a larger population of volatile microproteins that are challenging to detect. Our findings suggest that non-canonical translation signals may harbor valuable information and underscore the significance of considering them in proteogenomic studies. Finally, we show that the translation outcome of a noncoding ORF is primarily determined by the initiating codon and the codon distribution in its two alternative frames, rather than features indicative of functionality.
CONCLUSION CONCLUSIONS
Our results enable us to propose a topology of a species' Ribo-Seq landscape, opening the way to comparative analyses of this translation landscape under different conditions.

Identifiants

pubmed: 39402662
doi: 10.1186/s13059-024-03403-7
pii: 10.1186/s13059-024-03403-7
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

268

Informations de copyright

© 2024. The Author(s).

Références

Hanada K, Zhang X, Borevitz JO, Li W-H, Shiu S-H. A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. Genome Res. 2007;17:632–40.
pubmed: 17395691 pmcid: 1855179 doi: 10.1101/gr.5836207
Yang X, Tschaplinski TJ, Hurst GB, Jawdy S, Abraham PE, Lankford PK, et al. Discovery and annotation of small proteins using genomics, proteomics, and computational approaches. Genome Res. 2011;21:634–41.
pubmed: 21367939 pmcid: 3065711 doi: 10.1101/gr.109280.110
Couso J-P, Patraquim P. Classification and function of small open reading frames. Nat Rev Mol Cell Biol. 2017;18:575–89.
pubmed: 28698598 doi: 10.1038/nrm.2017.58
Orr MW, Mao Y, Storz G, Qian S-B. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res. 2020;48:1029–42.
pubmed: 31504789 doi: 10.1093/nar/gkz734
Guerra-Almeida D, Nunes-da-Fonseca R. Small open reading frames: how important are they for molecular evolution? Front Genet. 2020;11: 574737.
pubmed: 33193682 pmcid: 7606980 doi: 10.3389/fgene.2020.574737
Guerra-Almeida D, Tschoeke DA, Nunes-da-Fonseca R. Understanding small ORF diversity through a comprehensive transcription feature classification. DNA Res. 2021;28: dsab007.
pubmed: 34240112 pmcid: 8435553 doi: 10.1093/dnares/dsab007
Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–9.
pubmed: 11988577 doi: 10.1126/science.1068597
Ingolia NT, Ghaemmaghami S, Newman JR, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–23.
pubmed: 19213877 pmcid: 2746483 doi: 10.1126/science.1168978
Clark MB, Amaral PP, Schlesinger FJ, Dinger ME, Taft RJ, Rinn JL, et al. The reality of pervasive transcription. PLoS Biol. 2011;9: e1000625.
pubmed: 21765801 pmcid: 3134446 doi: 10.1371/journal.pbio.1000625
Carvunis A-R, Rolland T, Wapinski I, Calderwood MA, Yildirim MA, Simonis N, et al. Proto-genes and de novo gene birth. Nature. 2012;487:370–4.
pubmed: 22722833 pmcid: 3401362 doi: 10.1038/nature11184
Jensen TH, Jacquier A, Libri D. Dealing with pervasive transcription. Mol Cell. 2013;52:473–84.
pubmed: 24267449 doi: 10.1016/j.molcel.2013.10.032
Chew G-L, Pauli A, Rinn JL, Regev A, Schier AF, Valen E. Ribosome profiling reveals resemblance between long non-coding RNAs and 5′ leaders of coding RNAs. Development. 2013;140:2828–34.
pubmed: 23698349 pmcid: 3678345 doi: 10.1242/dev.098343
Aspden JL, Eyre-Walker YC, Phillips RJ, Amin U, Mumtaz MAS, Brocard M, et al. Extensive translation of small open reading frames revealed by Poly-Ribo-Seq. Elife. 2014;3: e03528.
pubmed: 25144939 pmcid: 4359375 doi: 10.7554/eLife.03528
Bazzini AA, Johnstone TG, Christiano R, Mackowiak SD, Obermayer B, Fleming ES, et al. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 2014;33:981–93.
pubmed: 24705786 pmcid: 4193932 doi: 10.1002/embj.201488411
Smith JE, Alvarez-Dominguez JR, Kline N, Huynh NJ, Geisler S, Hu W, et al. Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae. Cell Rep. 2014;7:1858–66.
pubmed: 24931603 pmcid: 4105149 doi: 10.1016/j.celrep.2014.05.023
Ingolia NT, Brar GA, Stern-Ginossar N, Harris MS, Talhouarne GJ, Jackson SE, et al. Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes. Cell Rep. 2014;8:1365–79.
pubmed: 25159147 pmcid: 4216110 doi: 10.1016/j.celrep.2014.07.045
Ruiz-Orera J, Verdaguer-Grau P, Villanueva-Cañas J, Messeguer X, Albà MM. Translation of neutrally evolving peptides provides a basis for de novo gene evolution. Nat Ecol Evol. 2018;2:890–6.
pubmed: 29556078 doi: 10.1038/s41559-018-0506-6
Ruiz-Orera J, Albà MM. Translation of small open reading frames: roles in regulation and evolutionary innovation. Trends Genet. 2019;35:186–98.
pubmed: 30606460 doi: 10.1016/j.tig.2018.12.003
Chen J, Brunner A-D, Cogan JZ, Nuñez JK, Fields AP, Adamson B, et al. Pervasive functional translation of noncanonical human open reading frames. Science. 2020;367:1140–6.
pubmed: 32139545 pmcid: 7289059 doi: 10.1126/science.aay0262
Blevins WR, Ruiz-Orera J, Messeguer X, Blasco-Moreno B, Villanueva-Cañas JL, Espinar L, et al. Uncovering de novo gene birth in yeast using deep transcriptomics. Nat Commun. 2021;12:1–13.
doi: 10.1038/s41467-021-20911-3
Papadopoulos C, Callebaut I, Gelly J-C, Hatin I, Namy O, Renard M, et al. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res. 2021;31:2303–15.
pubmed: 34810219 pmcid: 8647833 doi: 10.1101/gr.275638.121
Wacholder A, Parikh SB, Coelho NC, Acar O, Houghton C, Chou L, et al. A vast evolutionarily transient translatome contributes to phenotype and fitness. Cell Syst. 2023;14:363–381.e8.
pubmed: 37164009 pmcid: 10348077 doi: 10.1016/j.cels.2023.04.002
Smith C, Canestrari JG, Wang AJ, Champion MM, Derbyshire KM, Gray TA, et al. Pervasive translation in Mycobacterium tuberculosis. Kana BD, Shell S, editors. eLife. 2022;11:e73980.
pubmed: 35343439 pmcid: 9094748 doi: 10.7554/eLife.73980
Parikh SB, Houghton C, Van Oss SB, Wacholder A, Carvunis A. Origins, evolution, and physiological implications of de novo genes in yeast. Yeast. 2022;39:471–81.
pubmed: 35959631 doi: 10.1002/yea.3810
Mudge JM, Ruiz-Orera J, Prensner JR, Brunet MA, Calvet F, Jungreis I, et al. Standardized annotation of translated open reading frames. Nat Biotechnol. 2022;40:994–9.
pubmed: 35831657 pmcid: 9757701 doi: 10.1038/s41587-022-01369-0
Patraquim P, Magny EG, Pueyo JI, Platero AI, Couso JP. Translation and natural selection of micropeptides from long non-canonical RNAs. Nat Commun. 2022;13:6515.
pubmed: 36316320 pmcid: 9622821 doi: 10.1038/s41467-022-34094-y
Slavoff SA, Mitchell AJ, Schwaid AG, Cabili MN, Ma J, Levin JZ, et al. Peptidomic discovery of short open reading frame–encoded peptides in human cells. Nat Chem Biol. 2013;9:59.
pubmed: 23160002 doi: 10.1038/nchembio.1120
Prabakaran S, Hemberg M, Chauhan R, Winter D, Tweedie-Cullen RY, Dittrich C, et al. Quantitative profiling of peptides from RNAs classified as noncoding. Nat Commun. 2014;5:1–10.
doi: 10.1038/ncomms6429
Ma J, Ward CC, Jungreis I, Slavoff SA, Schwaid AG, Neveu J, et al. Discovery of human sORF-encoded polypeptides (SEPs) in cell lines and tissue. J Proteome Res. 2014;13:1757–65.
pubmed: 24490786 pmcid: 3993966 doi: 10.1021/pr401280w
Hsu PY, Benfey PN. Small but mighty: functional peptides encoded by small ORFs in plants. Proteomics. 2018;18: 1700038.
doi: 10.1002/pmic.201700038
Cao X, Khitun A, Na Z, Dumitrescu DG, Kubica M, Olatunji E, et al. Comparative Proteomic Profiling of Unannotated Microproteins and Alternative Proteins in Human Cell Lines. J Proteome Res. 2020;19:3418–26.
pubmed: 32449352 pmcid: 7429271 doi: 10.1021/acs.jproteome.0c00254
Cuevas MVR, Hardy M-P, Hollý J, Bonneil É, Durette C, Courcelles M, et al. Most non-canonical proteins uniquely populate the proteome or immunopeptidome. Cell Rep. 2021;34: 108815.
pmcid: 8040094 doi: 10.1016/j.celrep.2021.108815
Zheng EB, Zhao L. Protein evidence of unannotated ORFs in Drosophila reveals diversity in the evolution and properties of young proteins. Levine MT, Przeworski M, editors. eLife. 2022;11:e78772.
pubmed: 36178469 pmcid: 9560153 doi: 10.7554/eLife.78772
van Heesch S, Witte F, Schneider-Lunitz V, Schulz JF, Adami E, Faber AB, et al. The translational landscape of the human heart. Cell. 2019;178:242–260.e29.
pubmed: 31155234 doi: 10.1016/j.cell.2019.05.010
Sandmann C-L, Schulz JF, Ruiz-Orera J, Kirchner M, Ziehm M, Adami E, et al. Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames. Mol Cell. 2023;83:994–1011.e18.
pubmed: 36806354 pmcid: 10032668 doi: 10.1016/j.molcel.2023.01.023
Begun DJ, Lindfors HA, Kern AD, Jones CD. Evidence for de novo evolution of testis-expressed genes in the Drosophila yakuba/Drosophila erecta Clade. Genetics. 2007;176:1131–7.
pubmed: 17435230 pmcid: 1894579 doi: 10.1534/genetics.106.069245
Levine MT, Jones CD, Kern AD, Lindfors HA, Begun DJ. Novel genes derived from noncoding DNA in Drosophila melanogaster are frequently X-linked and exhibit testis-biased expression. Proc Natl Acad Sci. 2006;103:9935–9.
pubmed: 16777968 pmcid: 1502557 doi: 10.1073/pnas.0509809103
Cai J, Zhao R, Jiang H, Wang W. De novo origination of a new protein-coding gene in Saccharomyces cerevisiae. Genetics. 2008;179:487–96.
pubmed: 18493065 pmcid: 2390625 doi: 10.1534/genetics.107.084491
Zhou Q, Zhang G, Zhang Y, Xu S, Zhao R, Zhan Z, et al. On the origin of new genes in Drosophila. Genome Res. 2008;18:1446–55.
pubmed: 18550802 pmcid: 2527705 doi: 10.1101/gr.076588.108
Knowles DG, McLysaght A. Recent de novo origin of human protein-coding genes. Genome Res. 2009;19:1752–9.
pubmed: 19726446 pmcid: 2765279 doi: 10.1101/gr.095026.109
Siepel A. Darwinian alchemy: Human genes from noncoding DNA. Genome Res. 2009;19:1693–5.
pubmed: 19797681 pmcid: 2765273 doi: 10.1101/gr.098376.109
Tautz D, Domazet-Lošo T. The evolutionary origin of orphan genes. Nat Rev Genet. 2011;12:692–702.
pubmed: 21878963 doi: 10.1038/nrg3053
Wu D-D, Irwin DM, Zhang Y-P. De novo origin of human protein-coding genes. PLoS Genet. 2011;7: e1002379.
pubmed: 22102831 pmcid: 3213175 doi: 10.1371/journal.pgen.1002379
Wissler L, Godmann L, Bornberg-Bauer E. Evolutionary dynamics of simple sequence repeats across long evolutionary time scale in genus Drosophila. Trends in Evolutionary Biology. 2012;4:e7–e7.
doi: 10.4081/eb.2012.e7
Murphy DN, McLysaght A. De novo origin of protein-coding genes in murine rodents. PLoS ONE. 2012;7: e48650.
pubmed: 23185269 pmcid: 3504067 doi: 10.1371/journal.pone.0048650
Zhao L, Saelao P, Jones CD, Begun DJ. Origin and spread of de novo genes in Drosophila melanogaster populations. Science. 2014;343:769–72.
pubmed: 24457212 pmcid: 4391638 doi: 10.1126/science.1248286
Schlötterer C. Genes from scratch–the evolutionary fate of de novo genes. Trends Genet. 2015;31:215–9.
pubmed: 25773713 pmcid: 4383367 doi: 10.1016/j.tig.2015.02.007
Bornberg-Bauer E, Schmitz J, Heberlein M. Emergence of de novo proteins from ‘dark genomic matter’by ‘grow slow and moult.’ Biochem Soc Trans. 2015;43:867–73.
pubmed: 26517896 doi: 10.1042/BST20150089
Li Z-W, Chen X, Wu Q, Hagmann J, Han T-S, Zou Y-P, et al. On the origin of de novo genes in Arabidopsis thaliana populations. Genome Biol Evol. 2016;8:2190–202.
pubmed: 27401176 pmcid: 4987118 doi: 10.1093/gbe/evw164
Wilson BA, Foy SG, Neme R, Masel J. Young genes are highly disordered as predicted by the preadaptation hypothesis of de novo gene birth. Nat Ecol Evol. 2017;1:1–6.
doi: 10.1038/s41559-017-0146
Gubala AM, Schmitz JF, Kearns MJ, Vinh TT, Bornberg-Bauer E, Wolfner MF, et al. The goddard and saturn genes are essential for Drosophila male fertility and may have arisen de novo. Mol Biol Evol. 2017;34:1066–82.
pubmed: 28104747 pmcid: 5400382
Vakirlis N, Hebert AS, Opulente DA, Achaz G, Hittinger CT, Fischer G, et al. A molecular portrait of de novo genes in yeasts. Mol Biol Evol. 2018;35:631–45.
pubmed: 29220506 doi: 10.1093/molbev/msx315
Schmitz JF, Ullrich KK, Bornberg-Bauer E. Incipient de novo genes can evolve from frozen accidents that escaped rapid transcript turnover. Nature Ecol Evol. 2018;2:1626–32.
doi: 10.1038/s41559-018-0639-7
Van Oss SB, Carvunis AR. De novo gene birth. PLoS Genet. 2019;15:e1008160.
pubmed: 31120894 pmcid: 6542195 doi: 10.1371/journal.pgen.1008160
Zhang L, Ren Y, Yang T, Li G, Chen J, Gschwend AR, et al. Rapid evolution of protein diversity by de novo origination in Oryza. Nature Ecol Evol. 2019;3:679–90.
doi: 10.1038/s41559-019-0822-5
Prabh N, Rödelsperger C. De novo, divergence, and mixed origin contribute to the emergence of orphan genes in pristionchus nematodes. G3: Genes, Genomes, Genetics. 2019;9:2277–86.
pubmed: 31088903 doi: 10.1534/g3.119.400326
Vakirlis N, Acar O, Hsu B, Coelho NC, Van Oss SB, Wacholder A, et al. De novo emergence of adaptive membrane proteins from thymine-rich genomic sequences. Nat Commun. 2020;11:1–18.
doi: 10.1038/s41467-020-14500-z
Vakirlis N, Carvunis AR, McLysaght A. Synteny-based analyses indicate that sequence divergence is not the main source of orphan genes. eLife. 2020;9:e53500.
pubmed: 32066524 pmcid: 7028367 doi: 10.7554/eLife.53500
Heames B, Schmitz J, Bornberg-Bauer E. A continuum of evolving de novo genes drives protein-coding novelty in Drosophila. J Mol Evol. 2020;88:382–98.
pubmed: 32253450 pmcid: 7162840 doi: 10.1007/s00239-020-09939-z
Lange A, Patel PH, Heames B, Damry AM, Saenger T, Jackson CJ, et al. Structural and functional characterization of a putative de novo gene in Drosophila. Nat Commun. 2021;12:1–13.
doi: 10.1038/s41467-021-21667-6
Bornberg-Bauer E, Hlouchova K, Lange A. Structure and function of naturally evolved de novo proteins. Curr Opin Struct Biol. 2021;68:175–83.
pubmed: 33567396 doi: 10.1016/j.sbi.2020.11.010
Reinhardt JA, Wanjiru BM, Brant AT, Saelao P, Begun DJ, Jones CD. De Novo ORFs in drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences. PLoS Genet. 2013;9: e1003860.
pubmed: 24146629 pmcid: 3798262 doi: 10.1371/journal.pgen.1003860
Xie C, Bekpen C, Künzel S, Keshavarz M, Krebs-Wheaton R, Skrabar N, et al. A de novo evolved gene in the house mouse regulates female pregnancy cycles. Perry GH, Weigel D, Perry GH, Menke DB, editors. eLife. 2019;8:e44392.
pubmed: 31436535 pmcid: 6760900 doi: 10.7554/eLife.44392
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, et al. What can ribo-seq, immunopeptidomics, and proteomics tell us about the noncanonical proteome? Mol Cell Proteomics. 2023;22. https://doi.org/10.1016/j.mcpro.2023.100631 .
Verbruggen S, Gessulat S, Gabriels R, Matsaroki A, Van de Voorde H, Kuster B, et al. Spectral prediction features as a solution for the search space size problem in proteogenomics. Mol Cell Proteomics. 2021;20. https://doi.org/10.1016/j.mcpro.2021.100076 .
Wacholder A, Carvunis A-R. Biological factors and statistical limitations prevent detection of most noncanonical proteins by mass spectrometry. PLoS Biol. 2023;21: e3002409.
pubmed: 38048358 pmcid: 10721188 doi: 10.1371/journal.pbio.3002409
Wang M, Weiss M, Simonovic M, Haertinger G, Schrimpf SP, Hengartner MO, et al. PaxDb, a database of protein abundance averages across all three domains of life. Mol Cell Proteomics. 2012;11:492–500.
pubmed: 22535208 pmcid: 3412977 doi: 10.1074/mcp.O111.014704
He C, Jia C, Zhang Y, Xu P. Enrichment-based proteogenomics identifies microproteins, missing proteins, and novel smORFs in Saccharomyces cerevisiae. J Proteome Res. 2018;17:2335–44.
pubmed: 29897761 doi: 10.1021/acs.jproteome.8b00032
Hinnebusch AG. Molecular mechanism of scanning and start codon selection in eukaryotes. Microbiol Mol Biol Rev. 2011;75:434–67.
pubmed: 21885680 pmcid: 3165540 doi: 10.1128/MMBR.00008-11
Kearse MG, Wilusz JE. Non-AUG translation: a new start for protein synthesis in eukaryotes. Genes Dev. 2017;31:1717–31.
pubmed: 28982758 pmcid: 5666671 doi: 10.1101/gad.305250.117
Cao X, Slavoff SA. Non-AUG start codons: Expanding and regulating the small and alternative ORFeome. Exp Cell Res. 2020;391: 111973.
pubmed: 32209305 pmcid: 7256928 doi: 10.1016/j.yexcr.2020.111973
Gardner LB. Hypoxic inhibition of nonsense-mediated RNA decay regulates gene expression and the integrated stress response. Mol Cell Biol. 2008;28:3729–41.
pubmed: 18362164 pmcid: 2423288 doi: 10.1128/MCB.02284-07
Zetoune AB, Fontanière S, Magnin D, Anczuków O, Buisson M, Zhang CX, et al. Comparison of nonsense-mediated mRNA decay efficiency in various murine tissues. BMC Genet. 2008;9:1–11.
doi: 10.1186/1471-2156-9-83
Heinen TJ, Staubach F, Häming D, Tautz D. Emergence of a new gene from an intergenic region. Curr Biol. 2009;19:1527–31.
pubmed: 19733073 doi: 10.1016/j.cub.2009.07.049
Xie C, Zhang YE, Chen JY, Liu CJ, Zhou WZ, Li Y, et al. Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs. PLoS Genetics. 2012;8:e1002942.
pubmed: 23028352 pmcid: 3441637 doi: 10.1371/journal.pgen.1002942
Papadopoulos C, Chevrollier N, Lopes A. Exploring the peptide potential of genomes. In: Simonson T, editor. Computational peptide science methods in molecular biology. New York: Springer; 2022. p. 63–82.
doi: 10.1007/978-1-0716-1855-4_3
Cherry JM, Hong EL, Amundsen C, Balakrishnan R, Binkley G, Chan ET, et al. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 2012;40:D700–5.
pubmed: 22110037 doi: 10.1093/nar/gkr1029
Leinonen R, Sugawara H, Shumway M, on behalf of the International Nucleotide Sequence Database Collaboration. The sequence read archive. Nucleic Acids Research. 2011;39:D19–21.
pubmed: 21062823 doi: 10.1093/nar/gkq1019
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–2.
doi: 10.14806/ej.17.1.200
Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37:907–15.
pubmed: 31375807 pmcid: 7605509 doi: 10.1038/s41587-019-0201-4
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
pubmed: 22388286 pmcid: 3322381 doi: 10.1038/nmeth.1923
Lauria F, Tebaldi T, Bernabò P, Groen EJN, Gillingwater TH, Viero G. riboWaltz: Optimization of ribosome P-site positioning in ribosome profiling data. PLoS Comput Biol. 2018;14: e1006169.
pubmed: 30102689 pmcid: 6112680 doi: 10.1371/journal.pcbi.1006169
Faure G, Callebaut I. Identification of hidden relationships from the coupling of hydrophobic cluster analysis and domain architecture information. Bioinformatics. 2013;29:1726–33.
pubmed: 23677940 doi: 10.1093/bioinformatics/btt271
Faure G, Callebaut I. Comprehensive repertoire of foldable regions within whole genomes. PLoS Comput Biol. 2013;9:e1003280.
pubmed: 24204229 pmcid: 3812050 doi: 10.1371/journal.pcbi.1003280
Bitard-Feildel T, Callebaut I. HCAtk and pyHCA: a toolkit and python API for the hydrophobic cluster analysis of protein sequences. bioRxiv. 2018;249995. https://doi.org/10.1101/249995 .
Lamiable A, Bitard-Feildel T, Rebehmed J, Quintus F, Schoentgen F, Mornon J-P, et al. A topology-based investigation of protein interaction sites using Hydrophobic Cluster Analysis. Biochimie. 2019;167:68–80.
pubmed: 31525399 doi: 10.1016/j.biochi.2019.09.009
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
pubmed: 2231712 doi: 10.1016/S0022-2836(05)80360-2
Hedges SB, Dudley J, Kumar S. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 2006;22:2971–2.
pubmed: 17021158 doi: 10.1093/bioinformatics/btl505
Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. TimeTree 5: an expanded resource for species divergence times. Mol Biol Evol. 2022;39: msac174.
pubmed: 35932227 pmcid: 9400175 doi: 10.1093/molbev/msac174
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60.
pubmed: 25402007 doi: 10.1038/nmeth.3176
Yang Z. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol Biol Evol. 2007;24:1586–91.
pubmed: 17483113 doi: 10.1093/molbev/msm088
Team R Core RC. R: A language and environment for statistical computing. 2020. Available from: https://www.R-project.org/ .
Lin M, Lucas HC Jr, Shmueli G. Research commentary—too big to fail: large samples and the p-value problem. Inf Syst Res. 2013;24:906–17.
doi: 10.1287/isre.2013.0480
Szabó Á, Papin C, Cornu D, Chélot E, Lipinszki Z, Udvardy A, et al. Ubiquitylation dynamics of the clock cell proteome and TIMELESS during a circadian cycle. Cell Rep. 2018;23:2273–82.
pubmed: 29791839 doi: 10.1016/j.celrep.2018.04.064
Perkins D, Pappin D, Creasy D, Cottrell J. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20:3551–67.
pubmed: 10612281 doi: 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2
Jouffret V, Miotello G, Culotta K, Ayrault S, Pible O, Armengaud J. Increasing the power of interpretation for soil metaproteomics data. Microbiome. 2021;9:195.
pubmed: 34587999 pmcid: 8482631 doi: 10.1186/s40168-021-01139-1
Fancello L, Burger T. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics. Genome Biol. 2022;23:132.
pubmed: 35725496 pmcid: 9208142 doi: 10.1186/s13059-022-02701-2
Elias JE, Gygi SP. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat Methods. 2007;4:207–14.
pubmed: 17327847 doi: 10.1038/nmeth1019
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc: Ser B (Methodol). 1995;57:289–300.
doi: 10.1111/j.2517-6161.1995.tb02031.x
Papadopoulos C, Arbes H, Cornu D, Chevrollier N, Blanchet S, Roginski P, et al. The Ribosome Profiling landscape of yeast reveals a high diversity in pervasive translation [Data set]. Zenodo; 2024. https://doi.org/10.5281/zenodo.13734541 .
Perez-Riverol Y, Bai J, Bandla C, García-Seisdedos D, Hewapathirana S, Kamatchinathan S, et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 2022;50:D543–52.
pubmed: 34723319 doi: 10.1093/nar/gkab1038
Cornu D. Diversity in pervasive translation. A new translational landscape of yeast. PRIDE. 2023. https://doi.org/10.6019/PXD040766 .
doi: 10.6019/PXD040766
Papadopoulos C, Chevrollier N, Arbes H, Roginski P, Lopes A. ORFmine. 2023. Available from: https://github.com/i2bc/ORFmine .
Christiano R, Nagaraj N, Fröhlich F, Walther TC. Global Proteome turnover analyses of the yeasts S. cerevisiae and S. pombe. Cell Rep. 2014;9:1959–65.
pubmed: 25466257 pmcid: 4526151 doi: 10.1016/j.celrep.2014.10.065
Mackowiak SD, Zauber H, Bielow C, Thiel D, Kutz K, Calviello L, et al. Extensive identification and analysis of conserved small ORFs in animals. Genome Biol. 2015;16:179.
pubmed: 26364619 pmcid: 4568590 doi: 10.1186/s13059-015-0742-x
Fesenko I, Kirov I, Kniazev A, Khazigaleeva R, Lazarev V, Kharlampieva D, et al. Distinct types of short open reading frames are translated in plant cells. Genome Res. 2019;29:1464–77.
pubmed: 31387879 pmcid: 6724668 doi: 10.1101/gr.253302.119

Auteurs

Chris Papadopoulos (C)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.
Hospital del Mar Research Institute, Barcelona, Spain.

Hugo Arbes (H)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

David Cornu (D)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Nicolas Chevrollier (N)

Independent Investigator, Paris, France.

Sandra Blanchet (S)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Paul Roginski (P)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Camille Rabier (C)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Safiya Atia (S)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Olivier Lespinet (O)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Olivier Namy (O)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France.

Anne Lopes (A)

Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Univ. Paris-Sud, Université Paris-Saclay, Gif-sur-Yvette, Cedex, 91198, France. anne.lopes@i2bc.paris-saclay.fr.

Articles similaires

Saccharomyces cerevisiae Aldehydes Biotransformation Flavoring Agents Lipoxygenase
1.00
Saccharomyces cerevisiae Lysine Cell Nucleolus RNA, Ribosomal Saccharomyces cerevisiae Proteins
Metabolic Networks and Pathways Saccharomyces cerevisiae Computational Biology Synthetic Biology Computer Simulation

Classifications MeSH