Proteotranscriptomics assisted gene annotation and spatial proteomics of Bombyx mori BmN4 cell line.


Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
06 Oct 2020
Historique:
received: 14 04 2020
accepted: 21 09 2020
entrez: 7 10 2020
pubmed: 8 10 2020
medline: 13 4 2021
Statut: epublish

Résumé

The process of identifying all coding regions in a genome is crucial for any study at the level of molecular biology, ranging from single-gene cloning to genome-wide measurements using RNA-seq or mass spectrometry. While satisfactory annotation has been made feasible for well-studied model organisms through great efforts of big consortia, for most systems this kind of data is either absent or not adequately precise. Combining in-depth transcriptome sequencing and high resolution mass spectrometry, we here use proteotranscriptomics to improve gene annotation of protein-coding genes in the Bombyx mori cell line BmN4 which is an increasingly used tool for the analysis of piRNA biogenesis and function. Using this approach we provide the exact coding sequence and evidence for more than 6200 genes on the protein level. Furthermore using spatial proteomics, we establish the subcellular localization of thousands of these proteins. We show that our approach outperforms current Bombyx mori annotation attempts in terms of accuracy and coverage. We show that proteotranscriptomics is an efficient, cost-effective and accurate approach to improve previous annotations or generate new gene models. As this technique is based on de-novo transcriptome assembly, it provides the possibility to study any species also in the absence of genome sequence information for which proteogenomics would be impossible.

Sections du résumé

BACKGROUND BACKGROUND
The process of identifying all coding regions in a genome is crucial for any study at the level of molecular biology, ranging from single-gene cloning to genome-wide measurements using RNA-seq or mass spectrometry. While satisfactory annotation has been made feasible for well-studied model organisms through great efforts of big consortia, for most systems this kind of data is either absent or not adequately precise.
RESULTS RESULTS
Combining in-depth transcriptome sequencing and high resolution mass spectrometry, we here use proteotranscriptomics to improve gene annotation of protein-coding genes in the Bombyx mori cell line BmN4 which is an increasingly used tool for the analysis of piRNA biogenesis and function. Using this approach we provide the exact coding sequence and evidence for more than 6200 genes on the protein level. Furthermore using spatial proteomics, we establish the subcellular localization of thousands of these proteins. We show that our approach outperforms current Bombyx mori annotation attempts in terms of accuracy and coverage.
CONCLUSIONS CONCLUSIONS
We show that proteotranscriptomics is an efficient, cost-effective and accurate approach to improve previous annotations or generate new gene models. As this technique is based on de-novo transcriptome assembly, it provides the possibility to study any species also in the absence of genome sequence information for which proteogenomics would be impossible.

Identifiants

pubmed: 33023468
doi: 10.1186/s12864-020-07088-7
pii: 10.1186/s12864-020-07088-7
pmc: PMC7541253
doi:

Substances chimiques

Insect Proteins 0
Proteome 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

690

Commentaires et corrections

Type : ErratumIn

Références

Environ Sci Pollut Res Int. 2018 Dec;25(35):35048-35054
pubmed: 30374720
DNA Res. 2004 Feb 29;11(1):27-35
pubmed: 15141943
Proc Jpn Acad Ser B Phys Biol Sci. 2018;94(5):205-216
pubmed: 29760316
Nat Commun. 2019 Jan 18;10(1):331
pubmed: 30659192
Nat Biotechnol. 2008 Dec;26(12):1367-72
pubmed: 19029910
Bioinformatics. 2013 Jan 1;29(1):15-21
pubmed: 23104886
J Mol Biol. 2001 Jan 19;305(3):567-80
pubmed: 11152613
Nucleic Acids Res. 2010 Jan;38(Database issue):D453-6
pubmed: 19793867
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
DNA Res. 2018 Jan 17;:
pubmed: 29360973
Chembiochem. 2019 Jun 14;20(12):1479-1486
pubmed: 30648812
Drug Discov Ther. 2015 Apr;9(2):133-5
pubmed: 25994065
Nat Biotechnol. 2011 May 15;29(7):644-52
pubmed: 21572440
Genome Res. 2010 Sep;20(9):1297-303
pubmed: 20644199
Science. 2004 Dec 10;306(5703):1937-40
pubmed: 15591204
Genetica. 2008 Jul;133(3):269-82
pubmed: 17901928
Insect Biochem Mol Biol. 2019 Apr;107:53-62
pubmed: 30802494
Genome Res. 2016 Aug;26(8):1134-44
pubmed: 27252236
Insect Biochem Mol Biol. 2008 Dec;38(12):1036-45
pubmed: 19121390
Viruses. 2019 Apr 01;11(4):
pubmed: 30939808
Nat Protoc. 2007;2(8):1896-906
pubmed: 17703201
Nat Biotechnol. 2019 Apr;37(4):420-423
pubmed: 30778233
Bioinformatics. 2017 Nov 1;33(21):3387-3395
pubmed: 29036616
PLoS Comput Biol. 2011 Oct;7(10):e1002195
pubmed: 22039361
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Nucleic Acids Res. 2019 Jan 8;47(D1):D442-D450
pubmed: 30395289
Fly (Austin). 2012 Apr-Jun;6(2):80-92
pubmed: 22728672
Bioinformatics. 2005 May 1;21(9):1859-75
pubmed: 15728110
Bioinformatics. 2014 May 1;30(9):1322-4
pubmed: 24413670
Mol Cell Proteomics. 2014 Sep;13(9):2513-26
pubmed: 24942700
Nat Protoc. 2006;1(6):2856-60
pubmed: 17406544
Cell Rep. 2017 Jan 17;18(3):762-776
pubmed: 28099853
RNA. 2009 Jul;15(7):1258-64
pubmed: 19460866
Bioinformatics. 2010 Mar 15;26(6):841-2
pubmed: 20110278
Mol Biol Evol. 2018 Mar 1;35(3):543-548
pubmed: 29220515
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Nucleic Acids Res. 2007;35(9):3100-8
pubmed: 17452365
Gigascience. 2015 Oct 19;4:48
pubmed: 26500767
Nature. 1967 Nov 11;216(5115):613
pubmed: 4868367
Nucleic Acids Res. 2020 Jan 8;48(D1):D749-D755
pubmed: 31642484

Auteurs

Michal Levin (M)

Institute of Molecular Biology (IMB), Ackermannweg 4, 55128, Mainz, Germany. m.levin@imb.de.

Marion Scheibe (M)

Institute of Molecular Biology (IMB), Ackermannweg 4, 55128, Mainz, Germany.

Falk Butter (F)

Institute of Molecular Biology (IMB), Ackermannweg 4, 55128, Mainz, Germany. f.butter@imb.de.

Articles similaires

Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice
Animals Tail Swine Behavior, Animal Animal Husbandry

Classifications MeSH