The ribosome profiling landscape of yeast reveals a high diversity in pervasive translation.
De novo coding products
Genome evolution
Non-canonical translation signals
Noncoding genome
Pervasive translation
Journal
Genome biology
ISSN: 1474-760X
Titre abrégé: Genome Biol
Pays: England
ID NLM: 100960660
Informations de publication
Date de publication:
14 Oct 2024
14 Oct 2024
Historique:
received:
11
09
2023
accepted:
26
09
2024
medline:
15
10
2024
pubmed:
15
10
2024
entrez:
14
10
2024
Statut:
epublish
Résumé
Pervasive translation is a widespread phenomenon that plays a critical role in the emergence of novel microproteins, but the diversity of translation patterns contributing to their generation remains unclear. Based on 54 ribosome profiling (Ribo-Seq) datasets, we investigated the yeast Ribo-Seq landscape using a representation framework that allows the comprehensive inventory and classification of the entire diversity of Ribo-Seq signals, including non-canonical ones. We show that if coding regions occupy specific areas of the Ribo-Seq landscape, noncoding regions encompass a wide diversity of Ribo-Seq signals and, conversely, populate the entire landscape. Our results show that pervasive translation can, nevertheless, be associated with high specificity, with 1055 noncoding ORFs exhibiting canonical Ribo-Seq signals. Using mass spectrometry under standard conditions or proteasome inhibition with an in-house analysis protocol, we report 239 microproteins originating from noncoding ORFs that display canonical but also non-canonical Ribo-Seq signals. Each condition yields dozens of additional microprotein candidates with comparable translation properties, suggesting a larger population of volatile microproteins that are challenging to detect. Our findings suggest that non-canonical translation signals may harbor valuable information and underscore the significance of considering them in proteogenomic studies. Finally, we show that the translation outcome of a noncoding ORF is primarily determined by the initiating codon and the codon distribution in its two alternative frames, rather than features indicative of functionality. Our results enable us to propose a topology of a species' Ribo-Seq landscape, opening the way to comparative analyses of this translation landscape under different conditions.
Sections du résumé
BACKGROUND
BACKGROUND
Pervasive translation is a widespread phenomenon that plays a critical role in the emergence of novel microproteins, but the diversity of translation patterns contributing to their generation remains unclear. Based on 54 ribosome profiling (Ribo-Seq) datasets, we investigated the yeast Ribo-Seq landscape using a representation framework that allows the comprehensive inventory and classification of the entire diversity of Ribo-Seq signals, including non-canonical ones.
RESULTS
RESULTS
We show that if coding regions occupy specific areas of the Ribo-Seq landscape, noncoding regions encompass a wide diversity of Ribo-Seq signals and, conversely, populate the entire landscape. Our results show that pervasive translation can, nevertheless, be associated with high specificity, with 1055 noncoding ORFs exhibiting canonical Ribo-Seq signals. Using mass spectrometry under standard conditions or proteasome inhibition with an in-house analysis protocol, we report 239 microproteins originating from noncoding ORFs that display canonical but also non-canonical Ribo-Seq signals. Each condition yields dozens of additional microprotein candidates with comparable translation properties, suggesting a larger population of volatile microproteins that are challenging to detect. Our findings suggest that non-canonical translation signals may harbor valuable information and underscore the significance of considering them in proteogenomic studies. Finally, we show that the translation outcome of a noncoding ORF is primarily determined by the initiating codon and the codon distribution in its two alternative frames, rather than features indicative of functionality.
CONCLUSION
CONCLUSIONS
Our results enable us to propose a topology of a species' Ribo-Seq landscape, opening the way to comparative analyses of this translation landscape under different conditions.
Identifiants
pubmed: 39402662
doi: 10.1186/s13059-024-03403-7
pii: 10.1186/s13059-024-03403-7
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
268Informations de copyright
© 2024. The Author(s).
Références
Hanada K, Zhang X, Borevitz JO, Li W-H, Shiu S-H. A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. Genome Res. 2007;17:632–40.
pubmed: 17395691
pmcid: 1855179
doi: 10.1101/gr.5836207
Yang X, Tschaplinski TJ, Hurst GB, Jawdy S, Abraham PE, Lankford PK, et al. Discovery and annotation of small proteins using genomics, proteomics, and computational approaches. Genome Res. 2011;21:634–41.
pubmed: 21367939
pmcid: 3065711
doi: 10.1101/gr.109280.110
Couso J-P, Patraquim P. Classification and function of small open reading frames. Nat Rev Mol Cell Biol. 2017;18:575–89.
pubmed: 28698598
doi: 10.1038/nrm.2017.58
Orr MW, Mao Y, Storz G, Qian S-B. Alternative ORFs and small ORFs: shedding light on the dark proteome. Nucleic Acids Res. 2020;48:1029–42.
pubmed: 31504789
doi: 10.1093/nar/gkz734
Guerra-Almeida D, Nunes-da-Fonseca R. Small open reading frames: how important are they for molecular evolution? Front Genet. 2020;11: 574737.
pubmed: 33193682
pmcid: 7606980
doi: 10.3389/fgene.2020.574737
Guerra-Almeida D, Tschoeke DA, Nunes-da-Fonseca R. Understanding small ORF diversity through a comprehensive transcription feature classification. DNA Res. 2021;28: dsab007.
pubmed: 34240112
pmcid: 8435553
doi: 10.1093/dnares/dsab007
Kapranov P, Cawley SE, Drenkow J, Bekiranov S, Strausberg RL, Fodor SP, et al. Large-scale transcriptional activity in chromosomes 21 and 22. Science. 2002;296:916–9.
pubmed: 11988577
doi: 10.1126/science.1068597
Ingolia NT, Ghaemmaghami S, Newman JR, Weissman JS. Genome-wide analysis in vivo of translation with nucleotide resolution using ribosome profiling. Science. 2009;324:218–23.
pubmed: 19213877
pmcid: 2746483
doi: 10.1126/science.1168978
Clark MB, Amaral PP, Schlesinger FJ, Dinger ME, Taft RJ, Rinn JL, et al. The reality of pervasive transcription. PLoS Biol. 2011;9: e1000625.
pubmed: 21765801
pmcid: 3134446
doi: 10.1371/journal.pbio.1000625
Carvunis A-R, Rolland T, Wapinski I, Calderwood MA, Yildirim MA, Simonis N, et al. Proto-genes and de novo gene birth. Nature. 2012;487:370–4.
pubmed: 22722833
pmcid: 3401362
doi: 10.1038/nature11184
Jensen TH, Jacquier A, Libri D. Dealing with pervasive transcription. Mol Cell. 2013;52:473–84.
pubmed: 24267449
doi: 10.1016/j.molcel.2013.10.032
Chew G-L, Pauli A, Rinn JL, Regev A, Schier AF, Valen E. Ribosome profiling reveals resemblance between long non-coding RNAs and 5′ leaders of coding RNAs. Development. 2013;140:2828–34.
pubmed: 23698349
pmcid: 3678345
doi: 10.1242/dev.098343
Aspden JL, Eyre-Walker YC, Phillips RJ, Amin U, Mumtaz MAS, Brocard M, et al. Extensive translation of small open reading frames revealed by Poly-Ribo-Seq. Elife. 2014;3: e03528.
pubmed: 25144939
pmcid: 4359375
doi: 10.7554/eLife.03528
Bazzini AA, Johnstone TG, Christiano R, Mackowiak SD, Obermayer B, Fleming ES, et al. Identification of small ORFs in vertebrates using ribosome footprinting and evolutionary conservation. EMBO J. 2014;33:981–93.
pubmed: 24705786
pmcid: 4193932
doi: 10.1002/embj.201488411
Smith JE, Alvarez-Dominguez JR, Kline N, Huynh NJ, Geisler S, Hu W, et al. Translation of small open reading frames within unannotated RNA transcripts in Saccharomyces cerevisiae. Cell Rep. 2014;7:1858–66.
pubmed: 24931603
pmcid: 4105149
doi: 10.1016/j.celrep.2014.05.023
Ingolia NT, Brar GA, Stern-Ginossar N, Harris MS, Talhouarne GJ, Jackson SE, et al. Ribosome profiling reveals pervasive translation outside of annotated protein-coding genes. Cell Rep. 2014;8:1365–79.
pubmed: 25159147
pmcid: 4216110
doi: 10.1016/j.celrep.2014.07.045
Ruiz-Orera J, Verdaguer-Grau P, Villanueva-Cañas J, Messeguer X, Albà MM. Translation of neutrally evolving peptides provides a basis for de novo gene evolution. Nat Ecol Evol. 2018;2:890–6.
pubmed: 29556078
doi: 10.1038/s41559-018-0506-6
Ruiz-Orera J, Albà MM. Translation of small open reading frames: roles in regulation and evolutionary innovation. Trends Genet. 2019;35:186–98.
pubmed: 30606460
doi: 10.1016/j.tig.2018.12.003
Chen J, Brunner A-D, Cogan JZ, Nuñez JK, Fields AP, Adamson B, et al. Pervasive functional translation of noncanonical human open reading frames. Science. 2020;367:1140–6.
pubmed: 32139545
pmcid: 7289059
doi: 10.1126/science.aay0262
Blevins WR, Ruiz-Orera J, Messeguer X, Blasco-Moreno B, Villanueva-Cañas JL, Espinar L, et al. Uncovering de novo gene birth in yeast using deep transcriptomics. Nat Commun. 2021;12:1–13.
doi: 10.1038/s41467-021-20911-3
Papadopoulos C, Callebaut I, Gelly J-C, Hatin I, Namy O, Renard M, et al. Intergenic ORFs as elementary structural modules of de novo gene birth and protein evolution. Genome Res. 2021;31:2303–15.
pubmed: 34810219
pmcid: 8647833
doi: 10.1101/gr.275638.121
Wacholder A, Parikh SB, Coelho NC, Acar O, Houghton C, Chou L, et al. A vast evolutionarily transient translatome contributes to phenotype and fitness. Cell Syst. 2023;14:363–381.e8.
pubmed: 37164009
pmcid: 10348077
doi: 10.1016/j.cels.2023.04.002
Smith C, Canestrari JG, Wang AJ, Champion MM, Derbyshire KM, Gray TA, et al. Pervasive translation in Mycobacterium tuberculosis. Kana BD, Shell S, editors. eLife. 2022;11:e73980.
pubmed: 35343439
pmcid: 9094748
doi: 10.7554/eLife.73980
Parikh SB, Houghton C, Van Oss SB, Wacholder A, Carvunis A. Origins, evolution, and physiological implications of de novo genes in yeast. Yeast. 2022;39:471–81.
pubmed: 35959631
doi: 10.1002/yea.3810
Mudge JM, Ruiz-Orera J, Prensner JR, Brunet MA, Calvet F, Jungreis I, et al. Standardized annotation of translated open reading frames. Nat Biotechnol. 2022;40:994–9.
pubmed: 35831657
pmcid: 9757701
doi: 10.1038/s41587-022-01369-0
Patraquim P, Magny EG, Pueyo JI, Platero AI, Couso JP. Translation and natural selection of micropeptides from long non-canonical RNAs. Nat Commun. 2022;13:6515.
pubmed: 36316320
pmcid: 9622821
doi: 10.1038/s41467-022-34094-y
Slavoff SA, Mitchell AJ, Schwaid AG, Cabili MN, Ma J, Levin JZ, et al. Peptidomic discovery of short open reading frame–encoded peptides in human cells. Nat Chem Biol. 2013;9:59.
pubmed: 23160002
doi: 10.1038/nchembio.1120
Prabakaran S, Hemberg M, Chauhan R, Winter D, Tweedie-Cullen RY, Dittrich C, et al. Quantitative profiling of peptides from RNAs classified as noncoding. Nat Commun. 2014;5:1–10.
doi: 10.1038/ncomms6429
Ma J, Ward CC, Jungreis I, Slavoff SA, Schwaid AG, Neveu J, et al. Discovery of human sORF-encoded polypeptides (SEPs) in cell lines and tissue. J Proteome Res. 2014;13:1757–65.
pubmed: 24490786
pmcid: 3993966
doi: 10.1021/pr401280w
Hsu PY, Benfey PN. Small but mighty: functional peptides encoded by small ORFs in plants. Proteomics. 2018;18: 1700038.
doi: 10.1002/pmic.201700038
Cao X, Khitun A, Na Z, Dumitrescu DG, Kubica M, Olatunji E, et al. Comparative Proteomic Profiling of Unannotated Microproteins and Alternative Proteins in Human Cell Lines. J Proteome Res. 2020;19:3418–26.
pubmed: 32449352
pmcid: 7429271
doi: 10.1021/acs.jproteome.0c00254
Cuevas MVR, Hardy M-P, Hollý J, Bonneil É, Durette C, Courcelles M, et al. Most non-canonical proteins uniquely populate the proteome or immunopeptidome. Cell Rep. 2021;34: 108815.
pmcid: 8040094
doi: 10.1016/j.celrep.2021.108815
Zheng EB, Zhao L. Protein evidence of unannotated ORFs in Drosophila reveals diversity in the evolution and properties of young proteins. Levine MT, Przeworski M, editors. eLife. 2022;11:e78772.
pubmed: 36178469
pmcid: 9560153
doi: 10.7554/eLife.78772
van Heesch S, Witte F, Schneider-Lunitz V, Schulz JF, Adami E, Faber AB, et al. The translational landscape of the human heart. Cell. 2019;178:242–260.e29.
pubmed: 31155234
doi: 10.1016/j.cell.2019.05.010
Sandmann C-L, Schulz JF, Ruiz-Orera J, Kirchner M, Ziehm M, Adami E, et al. Evolutionary origins and interactomes of human, young microproteins and small peptides translated from short open reading frames. Mol Cell. 2023;83:994–1011.e18.
pubmed: 36806354
pmcid: 10032668
doi: 10.1016/j.molcel.2023.01.023
Begun DJ, Lindfors HA, Kern AD, Jones CD. Evidence for de novo evolution of testis-expressed genes in the Drosophila yakuba/Drosophila erecta Clade. Genetics. 2007;176:1131–7.
pubmed: 17435230
pmcid: 1894579
doi: 10.1534/genetics.106.069245
Levine MT, Jones CD, Kern AD, Lindfors HA, Begun DJ. Novel genes derived from noncoding DNA in Drosophila melanogaster are frequently X-linked and exhibit testis-biased expression. Proc Natl Acad Sci. 2006;103:9935–9.
pubmed: 16777968
pmcid: 1502557
doi: 10.1073/pnas.0509809103
Cai J, Zhao R, Jiang H, Wang W. De novo origination of a new protein-coding gene in Saccharomyces cerevisiae. Genetics. 2008;179:487–96.
pubmed: 18493065
pmcid: 2390625
doi: 10.1534/genetics.107.084491
Zhou Q, Zhang G, Zhang Y, Xu S, Zhao R, Zhan Z, et al. On the origin of new genes in Drosophila. Genome Res. 2008;18:1446–55.
pubmed: 18550802
pmcid: 2527705
doi: 10.1101/gr.076588.108
Knowles DG, McLysaght A. Recent de novo origin of human protein-coding genes. Genome Res. 2009;19:1752–9.
pubmed: 19726446
pmcid: 2765279
doi: 10.1101/gr.095026.109
Siepel A. Darwinian alchemy: Human genes from noncoding DNA. Genome Res. 2009;19:1693–5.
pubmed: 19797681
pmcid: 2765273
doi: 10.1101/gr.098376.109
Tautz D, Domazet-Lošo T. The evolutionary origin of orphan genes. Nat Rev Genet. 2011;12:692–702.
pubmed: 21878963
doi: 10.1038/nrg3053
Wu D-D, Irwin DM, Zhang Y-P. De novo origin of human protein-coding genes. PLoS Genet. 2011;7: e1002379.
pubmed: 22102831
pmcid: 3213175
doi: 10.1371/journal.pgen.1002379
Wissler L, Godmann L, Bornberg-Bauer E. Evolutionary dynamics of simple sequence repeats across long evolutionary time scale in genus Drosophila. Trends in Evolutionary Biology. 2012;4:e7–e7.
doi: 10.4081/eb.2012.e7
Murphy DN, McLysaght A. De novo origin of protein-coding genes in murine rodents. PLoS ONE. 2012;7: e48650.
pubmed: 23185269
pmcid: 3504067
doi: 10.1371/journal.pone.0048650
Zhao L, Saelao P, Jones CD, Begun DJ. Origin and spread of de novo genes in Drosophila melanogaster populations. Science. 2014;343:769–72.
pubmed: 24457212
pmcid: 4391638
doi: 10.1126/science.1248286
Schlötterer C. Genes from scratch–the evolutionary fate of de novo genes. Trends Genet. 2015;31:215–9.
pubmed: 25773713
pmcid: 4383367
doi: 10.1016/j.tig.2015.02.007
Bornberg-Bauer E, Schmitz J, Heberlein M. Emergence of de novo proteins from ‘dark genomic matter’by ‘grow slow and moult.’ Biochem Soc Trans. 2015;43:867–73.
pubmed: 26517896
doi: 10.1042/BST20150089
Li Z-W, Chen X, Wu Q, Hagmann J, Han T-S, Zou Y-P, et al. On the origin of de novo genes in Arabidopsis thaliana populations. Genome Biol Evol. 2016;8:2190–202.
pubmed: 27401176
pmcid: 4987118
doi: 10.1093/gbe/evw164
Wilson BA, Foy SG, Neme R, Masel J. Young genes are highly disordered as predicted by the preadaptation hypothesis of de novo gene birth. Nat Ecol Evol. 2017;1:1–6.
doi: 10.1038/s41559-017-0146
Gubala AM, Schmitz JF, Kearns MJ, Vinh TT, Bornberg-Bauer E, Wolfner MF, et al. The goddard and saturn genes are essential for Drosophila male fertility and may have arisen de novo. Mol Biol Evol. 2017;34:1066–82.
pubmed: 28104747
pmcid: 5400382
Vakirlis N, Hebert AS, Opulente DA, Achaz G, Hittinger CT, Fischer G, et al. A molecular portrait of de novo genes in yeasts. Mol Biol Evol. 2018;35:631–45.
pubmed: 29220506
doi: 10.1093/molbev/msx315
Schmitz JF, Ullrich KK, Bornberg-Bauer E. Incipient de novo genes can evolve from frozen accidents that escaped rapid transcript turnover. Nature Ecol Evol. 2018;2:1626–32.
doi: 10.1038/s41559-018-0639-7
Van Oss SB, Carvunis AR. De novo gene birth. PLoS Genet. 2019;15:e1008160.
pubmed: 31120894
pmcid: 6542195
doi: 10.1371/journal.pgen.1008160
Zhang L, Ren Y, Yang T, Li G, Chen J, Gschwend AR, et al. Rapid evolution of protein diversity by de novo origination in Oryza. Nature Ecol Evol. 2019;3:679–90.
doi: 10.1038/s41559-019-0822-5
Prabh N, Rödelsperger C. De novo, divergence, and mixed origin contribute to the emergence of orphan genes in pristionchus nematodes. G3: Genes, Genomes, Genetics. 2019;9:2277–86.
pubmed: 31088903
doi: 10.1534/g3.119.400326
Vakirlis N, Acar O, Hsu B, Coelho NC, Van Oss SB, Wacholder A, et al. De novo emergence of adaptive membrane proteins from thymine-rich genomic sequences. Nat Commun. 2020;11:1–18.
doi: 10.1038/s41467-020-14500-z
Vakirlis N, Carvunis AR, McLysaght A. Synteny-based analyses indicate that sequence divergence is not the main source of orphan genes. eLife. 2020;9:e53500.
pubmed: 32066524
pmcid: 7028367
doi: 10.7554/eLife.53500
Heames B, Schmitz J, Bornberg-Bauer E. A continuum of evolving de novo genes drives protein-coding novelty in Drosophila. J Mol Evol. 2020;88:382–98.
pubmed: 32253450
pmcid: 7162840
doi: 10.1007/s00239-020-09939-z
Lange A, Patel PH, Heames B, Damry AM, Saenger T, Jackson CJ, et al. Structural and functional characterization of a putative de novo gene in Drosophila. Nat Commun. 2021;12:1–13.
doi: 10.1038/s41467-021-21667-6
Bornberg-Bauer E, Hlouchova K, Lange A. Structure and function of naturally evolved de novo proteins. Curr Opin Struct Biol. 2021;68:175–83.
pubmed: 33567396
doi: 10.1016/j.sbi.2020.11.010
Reinhardt JA, Wanjiru BM, Brant AT, Saelao P, Begun DJ, Jones CD. De Novo ORFs in drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences. PLoS Genet. 2013;9: e1003860.
pubmed: 24146629
pmcid: 3798262
doi: 10.1371/journal.pgen.1003860
Xie C, Bekpen C, Künzel S, Keshavarz M, Krebs-Wheaton R, Skrabar N, et al. A de novo evolved gene in the house mouse regulates female pregnancy cycles. Perry GH, Weigel D, Perry GH, Menke DB, editors. eLife. 2019;8:e44392.
pubmed: 31436535
pmcid: 6760900
doi: 10.7554/eLife.44392
Prensner JR, Abelin JG, Kok LW, Clauser KR, Mudge JM, Ruiz-Orera J, et al. What can ribo-seq, immunopeptidomics, and proteomics tell us about the noncanonical proteome? Mol Cell Proteomics. 2023;22. https://doi.org/10.1016/j.mcpro.2023.100631 .
Verbruggen S, Gessulat S, Gabriels R, Matsaroki A, Van de Voorde H, Kuster B, et al. Spectral prediction features as a solution for the search space size problem in proteogenomics. Mol Cell Proteomics. 2021;20. https://doi.org/10.1016/j.mcpro.2021.100076 .
Wacholder A, Carvunis A-R. Biological factors and statistical limitations prevent detection of most noncanonical proteins by mass spectrometry. PLoS Biol. 2023;21: e3002409.
pubmed: 38048358
pmcid: 10721188
doi: 10.1371/journal.pbio.3002409
Wang M, Weiss M, Simonovic M, Haertinger G, Schrimpf SP, Hengartner MO, et al. PaxDb, a database of protein abundance averages across all three domains of life. Mol Cell Proteomics. 2012;11:492–500.
pubmed: 22535208
pmcid: 3412977
doi: 10.1074/mcp.O111.014704
He C, Jia C, Zhang Y, Xu P. Enrichment-based proteogenomics identifies microproteins, missing proteins, and novel smORFs in Saccharomyces cerevisiae. J Proteome Res. 2018;17:2335–44.
pubmed: 29897761
doi: 10.1021/acs.jproteome.8b00032
Hinnebusch AG. Molecular mechanism of scanning and start codon selection in eukaryotes. Microbiol Mol Biol Rev. 2011;75:434–67.
pubmed: 21885680
pmcid: 3165540
doi: 10.1128/MMBR.00008-11
Kearse MG, Wilusz JE. Non-AUG translation: a new start for protein synthesis in eukaryotes. Genes Dev. 2017;31:1717–31.
pubmed: 28982758
pmcid: 5666671
doi: 10.1101/gad.305250.117
Cao X, Slavoff SA. Non-AUG start codons: Expanding and regulating the small and alternative ORFeome. Exp Cell Res. 2020;391: 111973.
pubmed: 32209305
pmcid: 7256928
doi: 10.1016/j.yexcr.2020.111973
Gardner LB. Hypoxic inhibition of nonsense-mediated RNA decay regulates gene expression and the integrated stress response. Mol Cell Biol. 2008;28:3729–41.
pubmed: 18362164
pmcid: 2423288
doi: 10.1128/MCB.02284-07
Zetoune AB, Fontanière S, Magnin D, Anczuków O, Buisson M, Zhang CX, et al. Comparison of nonsense-mediated mRNA decay efficiency in various murine tissues. BMC Genet. 2008;9:1–11.
doi: 10.1186/1471-2156-9-83
Heinen TJ, Staubach F, Häming D, Tautz D. Emergence of a new gene from an intergenic region. Curr Biol. 2009;19:1527–31.
pubmed: 19733073
doi: 10.1016/j.cub.2009.07.049
Xie C, Zhang YE, Chen JY, Liu CJ, Zhou WZ, Li Y, et al. Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs. PLoS Genetics. 2012;8:e1002942.
pubmed: 23028352
pmcid: 3441637
doi: 10.1371/journal.pgen.1002942
Papadopoulos C, Chevrollier N, Lopes A. Exploring the peptide potential of genomes. In: Simonson T, editor. Computational peptide science methods in molecular biology. New York: Springer; 2022. p. 63–82.
doi: 10.1007/978-1-0716-1855-4_3
Cherry JM, Hong EL, Amundsen C, Balakrishnan R, Binkley G, Chan ET, et al. Saccharomyces Genome Database: the genomics resource of budding yeast. Nucleic Acids Res. 2012;40:D700–5.
pubmed: 22110037
doi: 10.1093/nar/gkr1029
Leinonen R, Sugawara H, Shumway M, on behalf of the International Nucleotide Sequence Database Collaboration. The sequence read archive. Nucleic Acids Research. 2011;39:D19–21.
pubmed: 21062823
doi: 10.1093/nar/gkq1019
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–2.
doi: 10.14806/ej.17.1.200
Kim D, Paggi JM, Park C, Bennett C, Salzberg SL. Graph-based genome alignment and genotyping with HISAT2 and HISAT-genotype. Nat Biotechnol. 2019;37:907–15.
pubmed: 31375807
pmcid: 7605509
doi: 10.1038/s41587-019-0201-4
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
pubmed: 22388286
pmcid: 3322381
doi: 10.1038/nmeth.1923
Lauria F, Tebaldi T, Bernabò P, Groen EJN, Gillingwater TH, Viero G. riboWaltz: Optimization of ribosome P-site positioning in ribosome profiling data. PLoS Comput Biol. 2018;14: e1006169.
pubmed: 30102689
pmcid: 6112680
doi: 10.1371/journal.pcbi.1006169
Faure G, Callebaut I. Identification of hidden relationships from the coupling of hydrophobic cluster analysis and domain architecture information. Bioinformatics. 2013;29:1726–33.
pubmed: 23677940
doi: 10.1093/bioinformatics/btt271
Faure G, Callebaut I. Comprehensive repertoire of foldable regions within whole genomes. PLoS Comput Biol. 2013;9:e1003280.
pubmed: 24204229
pmcid: 3812050
doi: 10.1371/journal.pcbi.1003280
Bitard-Feildel T, Callebaut I. HCAtk and pyHCA: a toolkit and python API for the hydrophobic cluster analysis of protein sequences. bioRxiv. 2018;249995. https://doi.org/10.1101/249995 .
Lamiable A, Bitard-Feildel T, Rebehmed J, Quintus F, Schoentgen F, Mornon J-P, et al. A topology-based investigation of protein interaction sites using Hydrophobic Cluster Analysis. Biochimie. 2019;167:68–80.
pubmed: 31525399
doi: 10.1016/j.biochi.2019.09.009
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.
pubmed: 2231712
doi: 10.1016/S0022-2836(05)80360-2
Hedges SB, Dudley J, Kumar S. TimeTree: a public knowledge-base of divergence times among organisms. Bioinformatics. 2006;22:2971–2.
pubmed: 17021158
doi: 10.1093/bioinformatics/btl505
Kumar S, Suleski M, Craig JM, Kasprowicz AE, Sanderford M, Li M, et al. TimeTree 5: an expanded resource for species divergence times. Mol Biol Evol. 2022;39: msac174.
pubmed: 35932227
pmcid: 9400175
doi: 10.1093/molbev/msac174
Buchfink B, Xie C, Huson DH. Fast and sensitive protein alignment using DIAMOND. Nat Methods. 2015;12:59–60.
pubmed: 25402007
doi: 10.1038/nmeth.3176
Yang Z. PAML 4: Phylogenetic Analysis by Maximum Likelihood. Mol Biol Evol. 2007;24:1586–91.
pubmed: 17483113
doi: 10.1093/molbev/msm088
Team R Core RC. R: A language and environment for statistical computing. 2020. Available from: https://www.R-project.org/ .
Lin M, Lucas HC Jr, Shmueli G. Research commentary—too big to fail: large samples and the p-value problem. Inf Syst Res. 2013;24:906–17.
doi: 10.1287/isre.2013.0480
Szabó Á, Papin C, Cornu D, Chélot E, Lipinszki Z, Udvardy A, et al. Ubiquitylation dynamics of the clock cell proteome and TIMELESS during a circadian cycle. Cell Rep. 2018;23:2273–82.
pubmed: 29791839
doi: 10.1016/j.celrep.2018.04.064
Perkins D, Pappin D, Creasy D, Cottrell J. Probability-based protein identification by searching sequence databases using mass spectrometry data. Electrophoresis. 1999;20:3551–67.
pubmed: 10612281
doi: 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2
Jouffret V, Miotello G, Culotta K, Ayrault S, Pible O, Armengaud J. Increasing the power of interpretation for soil metaproteomics data. Microbiome. 2021;9:195.
pubmed: 34587999
pmcid: 8482631
doi: 10.1186/s40168-021-01139-1
Fancello L, Burger T. An analysis of proteogenomics and how and when transcriptome-informed reduction of protein databases can enhance eukaryotic proteomics. Genome Biol. 2022;23:132.
pubmed: 35725496
pmcid: 9208142
doi: 10.1186/s13059-022-02701-2
Elias JE, Gygi SP. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat Methods. 2007;4:207–14.
pubmed: 17327847
doi: 10.1038/nmeth1019
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Roy Stat Soc: Ser B (Methodol). 1995;57:289–300.
doi: 10.1111/j.2517-6161.1995.tb02031.x
Papadopoulos C, Arbes H, Cornu D, Chevrollier N, Blanchet S, Roginski P, et al. The Ribosome Profiling landscape of yeast reveals a high diversity in pervasive translation [Data set]. Zenodo; 2024. https://doi.org/10.5281/zenodo.13734541 .
Perez-Riverol Y, Bai J, Bandla C, García-Seisdedos D, Hewapathirana S, Kamatchinathan S, et al. The PRIDE database resources in 2022: a hub for mass spectrometry-based proteomics evidences. Nucleic Acids Res. 2022;50:D543–52.
pubmed: 34723319
doi: 10.1093/nar/gkab1038
Cornu D. Diversity in pervasive translation. A new translational landscape of yeast. PRIDE. 2023. https://doi.org/10.6019/PXD040766 .
doi: 10.6019/PXD040766
Papadopoulos C, Chevrollier N, Arbes H, Roginski P, Lopes A. ORFmine. 2023. Available from: https://github.com/i2bc/ORFmine .
Christiano R, Nagaraj N, Fröhlich F, Walther TC. Global Proteome turnover analyses of the yeasts S. cerevisiae and S. pombe. Cell Rep. 2014;9:1959–65.
pubmed: 25466257
pmcid: 4526151
doi: 10.1016/j.celrep.2014.10.065
Mackowiak SD, Zauber H, Bielow C, Thiel D, Kutz K, Calviello L, et al. Extensive identification and analysis of conserved small ORFs in animals. Genome Biol. 2015;16:179.
pubmed: 26364619
pmcid: 4568590
doi: 10.1186/s13059-015-0742-x
Fesenko I, Kirov I, Kniazev A, Khazigaleeva R, Lazarev V, Kharlampieva D, et al. Distinct types of short open reading frames are translated in plant cells. Genome Res. 2019;29:1464–77.
pubmed: 31387879
pmcid: 6724668
doi: 10.1101/gr.253302.119