Integrative Proteogenomics Using ProteomeGenerator2.
comprehensive proteomics
genomics
proteogenomics
transcriptomics
Journal
Journal of proteome research
ISSN: 1535-3907
Titre abrégé: J Proteome Res
Pays: United States
ID NLM: 101128775
Informations de publication
Date de publication:
04 08 2023
04 08 2023
Historique:
medline:
7
8
2023
pubmed:
7
7
2023
entrez:
7
7
2023
Statut:
ppublish
Résumé
Recent advances in nucleic acid sequencing now permit rapid and genome-scale analysis of genetic variation and transcription, enabling population-scale studies of human biology, disease, and diverse organisms. Likewise, advances in mass spectrometry proteomics now permit highly sensitive and accurate studies of protein expression at the whole proteome-scale. However, most proteomic studies rely on consensus databases to match spectra to peptide and protein sequences, and thus remain limited to the analysis of canonical protein sequences. Here, we develop ProteomeGenerator2 (PG2), based on the scalable and modular ProteomeGenerator framework. PG2 integrates genome and transcriptome sequencing to incorporate protein variants containing amino acid substitutions, insertions, and deletions, as well as noncanonical reading frames, exons, and other variants caused by genomic and transcriptomic variation. We benchmarked PG2 using synthetic data and genomic, transcriptomic, and proteomic analysis of human leukemia cells. PG2 can be integrated with current and emerging sequencing technologies, assemblers, variant callers, and mass spectral analysis algorithms, and is available open-source from https://github.com/kentsisresearchgroup/ProteomeGenerator2.
Identifiants
pubmed: 37418425
doi: 10.1021/acs.jproteome.3c00005
doi:
Substances chimiques
Peptides
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
2750-2764Subventions
Organisme : NCI NIH HHS
ID : R21 CA188881
Pays : United States
Organisme : NCI NIH HHS
ID : R01 CA204396
Pays : United States
Organisme : NCI NIH HHS
ID : P30 CA008748
Pays : United States
Commentaires et corrections
Type : UpdateOf