Ionmob: a Python package for prediction of peptide collisional cross-section values.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
02 09 2023
Historique:
received: 07 02 2023
revised: 30 06 2023
accepted: 03 08 2023
medline: 4 10 2023
pubmed: 4 8 2023
entrez: 4 8 2023
Statut: ppublish

Résumé

Including ion mobility separation (IMS) into mass spectrometry proteomics experiments is useful to improve coverage and throughput. Many IMS devices enable linking experimentally derived mobility of an ion to its collisional cross-section (CCS), a highly reproducible physicochemical property dependent on the ion's mass, charge and conformation in the gas phase. Thus, known peptide ion mobilities can be used to tailor acquisition methods or to refine database search results. The large space of potential peptide sequences, driven also by posttranslational modifications of amino acids, motivates an in silico predictor for peptide CCS. Recent studies explored the general performance of varying machine-learning techniques, however, the workflow engineering part was of secondary importance. For the sake of applicability, such a tool should be generic, data driven, and offer the possibility to be easily adapted to individual workflows for experimental design and data processing. We created ionmob, a Python-based framework for data preparation, training, and prediction of collisional cross-section values of peptides. It is easily customizable and includes a set of pretrained, ready-to-use models and preprocessing routines for training and inference. Using a set of ≈21 000 unique phosphorylated peptides and ≈17 000 MHC ligand sequences and charge state pairs, we expand upon the space of peptides that can be integrated into CCS prediction. Lastly, we investigate the applicability of in silico predicted CCS to increase confidence in identified peptides by applying methods of re-scoring and demonstrate that predicted CCS values complement existing predictors for that task. The Python package is available at github: https://github.com/theGreatHerrLebert/ionmob.

Identifiants

pubmed: 37540201
pii: 7237255
doi: 10.1093/bioinformatics/btad486
pmc: PMC10521631
pii:
doi:

Substances chimiques

Peptides 0
Ions 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2023. Published by Oxford University Press.

Références

Mol Cell Proteomics. 2021;20:100138
pubmed: 34416385
J Am Soc Mass Spectrom. 2017 Feb;28(2):294-302
pubmed: 27975328
J Proteome Res. 2021 Jun 16;:
pubmed: 34133192
J Proteome Res. 2015 Dec 4;14(12):5378-87
pubmed: 26538118
Anal Chem. 2012 Aug 21;84(16):7124-30
pubmed: 22845859
J Am Soc Mass Spectrom. 2017 May;28(5):947-959
pubmed: 28211014
Mol Cell Proteomics. 2018 Dec;17(12):2534-2545
pubmed: 30385480
Anal Chem. 2018 Sep 18;90(18):10881-10888
pubmed: 30114359
J Am Soc Mass Spectrom. 2019 Nov;30(11):2185-2195
pubmed: 31493234
Mol Cell Proteomics. 2022 Aug;21(8):100266
pubmed: 35803561
Nat Methods. 2019 Jun;16(6):509-518
pubmed: 31133760
Mass Spectrom (Tokyo). 2021;10:A0093
pubmed: 33552826
J Am Soc Mass Spectrom. 2016 Nov;27(11):1719-1727
pubmed: 27572102
Nat Commun. 2022 Nov 24;13(1):7238
pubmed: 36433986
Nat Biotechnol. 2014 Mar;32(3):223-6
pubmed: 24727771
Mass Spectrom Rev. 2019 May;38(3):291-320
pubmed: 30707468
BMC Bioinformatics. 2013;14 Suppl 8:S9
pubmed: 23815343
Nat Protoc. 2019 Jun;14(6):1687-1707
pubmed: 31092913
J Proteome Res. 2021 Apr 2;20(4):2122-2129
pubmed: 33724840
Biomolecules. 2021 Dec 19;11(12):
pubmed: 34944547
Mol Syst Biol. 2022 Mar;18(3):e10798
pubmed: 35226415
Nat Biotechnol. 2008 Dec;26(12):1367-72
pubmed: 19029910
Nat Methods. 2020 Mar;17(3):261-272
pubmed: 32015543
Nat Commun. 2021 Feb 19;12(1):1185
pubmed: 33608539
J Proteome Res. 2011 May 6;10(5):2318-29
pubmed: 21417239
Mol Cell Proteomics. 2020 Jun;19(6):1058-1069
pubmed: 32156793
Expert Rev Proteomics. 2005 Aug;2(4):553-65
pubmed: 16097888
J Comput Chem. 2013 Jul 30;34(20):1707-18
pubmed: 23609240
Elife. 2022 Mar 22;11:
pubmed: 35314027
Nucleic Acids Res. 2017 Jan 4;45(D1):D1107-D1111
pubmed: 27899654
Nat Biotechnol. 2021 Dec;39(12):1563-1573
pubmed: 34239088
BMC Bioinformatics. 2010 Apr 11;11:182
pubmed: 20380738
Curr Opin Chem Biol. 2018 Feb;42:51-59
pubmed: 29154177

Auteurs

David Teschner (D)

Institute of Computer Science, Johannes Gutenberg University, 55128 Mainz, Germany.

David Gomez-Zepeda (D)

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.
Immunoproteomics Unit, Helmholtz-Institute for Translational Oncology (HI-TRON), 55131 Mainz, Germany.

Arthur Declercq (A)

VIB-UGent Center for Medical Biotechnology, VIB, 9052 Gent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium.

Mateusz K Łącki (MK)

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.

Seymen Avci (S)

Institute of Computer Science, Johannes Gutenberg University, 55128 Mainz, Germany.

Konstantin Bob (K)

Institute of Computer Science, Johannes Gutenberg University, 55128 Mainz, Germany.

Ute Distler (U)

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.

Thomas Michna (T)

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.
Immunoproteomics Unit, Helmholtz-Institute for Translational Oncology (HI-TRON), 55131 Mainz, Germany.

Lennart Martens (L)

VIB-UGent Center for Medical Biotechnology, VIB, 9052 Gent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium.

Stefan Tenzer (S)

Institute for Immunology, University Medical Center of the Johannes Gutenberg University, 55128 Mainz, Germany.
Immunoproteomics Unit, Helmholtz-Institute for Translational Oncology (HI-TRON), 55131 Mainz, Germany.

Andreas Hildebrandt (A)

Institute of Computer Science, Johannes Gutenberg University, 55128 Mainz, Germany.

Articles similaires

Animals Hemiptera Insect Proteins Phylogeny Insecticides

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Humans Artificial Intelligence Neoplasms Prognosis Image Processing, Computer-Assisted

Classifications MeSH