Ionmob: a Python package for prediction of peptide collisional cross-section values.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944
Informations de publication
Date de publication:
02 09 2023
02 09 2023
Historique:
received:
07
02
2023
revised:
30
06
2023
accepted:
03
08
2023
medline:
4
10
2023
pubmed:
4
8
2023
entrez:
4
8
2023
Statut:
ppublish
Résumé
Including ion mobility separation (IMS) into mass spectrometry proteomics experiments is useful to improve coverage and throughput. Many IMS devices enable linking experimentally derived mobility of an ion to its collisional cross-section (CCS), a highly reproducible physicochemical property dependent on the ion's mass, charge and conformation in the gas phase. Thus, known peptide ion mobilities can be used to tailor acquisition methods or to refine database search results. The large space of potential peptide sequences, driven also by posttranslational modifications of amino acids, motivates an in silico predictor for peptide CCS. Recent studies explored the general performance of varying machine-learning techniques, however, the workflow engineering part was of secondary importance. For the sake of applicability, such a tool should be generic, data driven, and offer the possibility to be easily adapted to individual workflows for experimental design and data processing. We created ionmob, a Python-based framework for data preparation, training, and prediction of collisional cross-section values of peptides. It is easily customizable and includes a set of pretrained, ready-to-use models and preprocessing routines for training and inference. Using a set of ≈21 000 unique phosphorylated peptides and ≈17 000 MHC ligand sequences and charge state pairs, we expand upon the space of peptides that can be integrated into CCS prediction. Lastly, we investigate the applicability of in silico predicted CCS to increase confidence in identified peptides by applying methods of re-scoring and demonstrate that predicted CCS values complement existing predictors for that task. The Python package is available at github: https://github.com/theGreatHerrLebert/ionmob.
Identifiants
pubmed: 37540201
pii: 7237255
doi: 10.1093/bioinformatics/btad486
pmc: PMC10521631
pii:
doi:
Substances chimiques
Peptides
0
Ions
0
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Informations de copyright
© The Author(s) 2023. Published by Oxford University Press.
Références
Mol Cell Proteomics. 2021;20:100138
pubmed: 34416385
J Am Soc Mass Spectrom. 2017 Feb;28(2):294-302
pubmed: 27975328
J Proteome Res. 2021 Jun 16;:
pubmed: 34133192
J Proteome Res. 2015 Dec 4;14(12):5378-87
pubmed: 26538118
Anal Chem. 2012 Aug 21;84(16):7124-30
pubmed: 22845859
J Am Soc Mass Spectrom. 2017 May;28(5):947-959
pubmed: 28211014
Mol Cell Proteomics. 2018 Dec;17(12):2534-2545
pubmed: 30385480
Anal Chem. 2018 Sep 18;90(18):10881-10888
pubmed: 30114359
J Am Soc Mass Spectrom. 2019 Nov;30(11):2185-2195
pubmed: 31493234
Mol Cell Proteomics. 2022 Aug;21(8):100266
pubmed: 35803561
Nat Methods. 2019 Jun;16(6):509-518
pubmed: 31133760
Mass Spectrom (Tokyo). 2021;10:A0093
pubmed: 33552826
J Am Soc Mass Spectrom. 2016 Nov;27(11):1719-1727
pubmed: 27572102
Nat Commun. 2022 Nov 24;13(1):7238
pubmed: 36433986
Nat Biotechnol. 2014 Mar;32(3):223-6
pubmed: 24727771
Mass Spectrom Rev. 2019 May;38(3):291-320
pubmed: 30707468
BMC Bioinformatics. 2013;14 Suppl 8:S9
pubmed: 23815343
Nat Protoc. 2019 Jun;14(6):1687-1707
pubmed: 31092913
J Proteome Res. 2021 Apr 2;20(4):2122-2129
pubmed: 33724840
Biomolecules. 2021 Dec 19;11(12):
pubmed: 34944547
Mol Syst Biol. 2022 Mar;18(3):e10798
pubmed: 35226415
Nat Biotechnol. 2008 Dec;26(12):1367-72
pubmed: 19029910
Nat Methods. 2020 Mar;17(3):261-272
pubmed: 32015543
Nat Commun. 2021 Feb 19;12(1):1185
pubmed: 33608539
J Proteome Res. 2011 May 6;10(5):2318-29
pubmed: 21417239
Mol Cell Proteomics. 2020 Jun;19(6):1058-1069
pubmed: 32156793
Expert Rev Proteomics. 2005 Aug;2(4):553-65
pubmed: 16097888
J Comput Chem. 2013 Jul 30;34(20):1707-18
pubmed: 23609240
Elife. 2022 Mar 22;11:
pubmed: 35314027
Nucleic Acids Res. 2017 Jan 4;45(D1):D1107-D1111
pubmed: 27899654
Nat Biotechnol. 2021 Dec;39(12):1563-1573
pubmed: 34239088
BMC Bioinformatics. 2010 Apr 11;11:182
pubmed: 20380738
Curr Opin Chem Biol. 2018 Feb;42:51-59
pubmed: 29154177