OpenPepXL: An Open-Source Tool for Sensitive Identification of Cross-Linked Peptides in XL-MS.


Journal

Molecular & cellular proteomics : MCP
ISSN: 1535-9484
Titre abrégé: Mol Cell Proteomics
Pays: United States
ID NLM: 101125647

Informations de publication

Date de publication:
12 2020
Historique:
received: 20 06 2020
revised: 15 09 2020
pubmed: 18 10 2020
medline: 12 8 2021
entrez: 17 10 2020
Statut: ppublish

Résumé

Cross-linking MS (XL-MS) has been recognized as an effective source of information about protein structures and interactions. In contrast to regular peptide identification, XL-MS has to deal with a quadratic search space, where peptides from every protein could potentially be cross-linked to any other protein. To cope with this search space, most tools apply different heuristics for search space reduction. We introduce a new open-source XL-MS database search algorithm, OpenPepXL, which offers increased sensitivity compared with other tools. OpenPepXL searches the full search space of an XL-MS experiment without using heuristics to reduce it. Because of efficient data structures and built-in parallelization OpenPepXL achieves excellent runtimes and can also be deployed on large compute clusters and cloud services while maintaining a slim memory footprint. We compared OpenPepXL to several other commonly used tools for identification of noncleavable labeled and label-free cross-linkers on a diverse set of XL-MS experiments. In our first comparison, we used a data set from a fraction of a cell lysate with a protein database of 128 targets and 128 decoys. At 5% FDR, OpenPepXL finds from 7% to over 50% more unique residue pairs (URPs) than other tools. On data sets with available high-resolution structures for cross-link validation OpenPepXL reports from 7% to over 40% more structurally validated URPs than other tools. Additionally, we used a synthetic peptide data set that allows objective validation of cross-links without relying on structural information and found that OpenPepXL reports at least 12% more validated URPs than other tools. It has been built as part of the OpenMS suite of tools and supports Windows, macOS, and Linux operating systems. OpenPepXL also supports the MzIdentML 1.2 format for XL-MS identification results. It is freely available under a three-clause BSD license at https://openms.org/openpepxl.

Identifiants

pubmed: 33067342
pii: S1535-9476(20)60018-4
doi: 10.1074/mcp.TIR120.002186
pmc: PMC7710140
pii:
doi:

Substances chimiques

Cross-Linking Reagents 0
Peptides 0

Banques de données

PDB
['3GJX', '4F5S']

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

2157-2168

Informations de copyright

© 2020 Netz et al.

Déclaration de conflit d'intérêts

Conflict of interest—The authors declare that they have no conflicts of interest with the contents of this article.

Références

Bioinformatics. 2019 Jan 15;35(2):251-257
pubmed: 30649350
J Struct Biol. 2015 Mar;189(3):177-83
pubmed: 25661704
Mol Cell Proteomics. 2014 Feb;13(2):420-34
pubmed: 24335475
J Am Soc Mass Spectrom. 1995 Apr;6(4):229-33
pubmed: 24214167
Nat Struct Mol Biol. 2018 Nov;25(11):1000-1008
pubmed: 30374081
Curr Opin Chem Biol. 2019 Feb;48:8-18
pubmed: 30172868
Nat Protoc. 2014 Jan;9(1):120-37
pubmed: 24356771
Bioinformatics. 2019 Sep 1;35(17):3169-3170
pubmed: 30629147
Anal Chem. 2017 May 16;89(10):5311-5318
pubmed: 28402676
Nat Methods. 2017 May;14(5):513-520
pubmed: 28394336
Anal Chem. 2017 Apr 4;89(7):3829-3833
pubmed: 28267312
J Am Soc Mass Spectrom. 2012 Jan;23(1):76-87
pubmed: 22038510
J Comput Chem. 2004 Oct;25(13):1605-12
pubmed: 15264254
Nucleic Acids Res. 2017 Jan 4;45(D1):D1100-D1106
pubmed: 27924013
Protein Sci. 2015 Aug;24(8):1193-209
pubmed: 25970732
Curr Opin Struct Biol. 2015 Dec;35:100-8
pubmed: 26615471
Nat Methods. 2012 Sep;9(9):901-3
pubmed: 22772729
Trends Biochem Sci. 2016 Jan;41(1):20-32
pubmed: 26654279
Nat Methods. 2016 Aug 30;13(9):741-8
pubmed: 27575624
J Proteome Res. 2015 May 1;14(5):2190-8
pubmed: 25812159
Nat Commun. 2019 Jul 30;10(1):3404
pubmed: 31363125
Nucleic Acids Res. 2019 Jan 8;47(D1):D442-D450
pubmed: 30395289
Anal Chem. 2019 Jun 4;91(11):6953-6961
pubmed: 31045356
Nat Commun. 2020 Feb 6;11(1):742
pubmed: 32029734
Nat Methods. 2008 Apr;5(4):315-8
pubmed: 18327264
Mol Cell Proteomics. 2017 Jul;16(7):1275-1285
pubmed: 28515314
Science. 2009 May 22;324(5930):1087-91
pubmed: 19389996

Auteurs

Eugen Netz (E)

Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany; Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Applied Bioinformatics, Dept. of Computer Science, University of Tübingen, Tübingen, Germany. Electronic address: eugen.netz@tuebingen.mpg.de.

Tjeerd M H Dijkstra (TMH)

Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany; Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Applied Bioinformatics, Dept. of Computer Science, University of Tübingen, Tübingen, Germany; Center for Women's Health, University Clinic Tübingen, Tübingen, Germany.

Timo Sachsenberg (T)

Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Applied Bioinformatics, Dept. of Computer Science, University of Tübingen, Tübingen, Germany.

Lukas Zimmermann (L)

Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany; Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany.

Mathias Walzer (M)

Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.

Thomas Monecke (T)

X-Ray Crystallography Facility, Institute of Pharmaceutical Biotechnology, University of Ulm, Ulm, Germany; Department of Molecular Structural BiologyInstitute for Microbiology and GeneticsGZMB, Georg-August-University Göttingen, Göttingen, Germany.

Ralf Ficner (R)

Department of Molecular Structural BiologyInstitute for Microbiology and GeneticsGZMB, Georg-August-University Göttingen, Göttingen, Germany.

Olexandr Dybkov (O)

Department for Cellular BiochemistryMax Planck Institute for Biophysical Chemistry, Göttingen, Germany.

Henning Urlaub (H)

Bioanalytical Mass SpectrometryMax Planck Institute for Biophysical Chemistry, Göttingen, Germany; BioanalyticsInstitute for Clinical Chemistry, University Medical Center, Göttingen, Germany.

Oliver Kohlbacher (O)

Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany; Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany; Applied Bioinformatics, Dept. of Computer Science, University of Tübingen, Tübingen, Germany; Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany; Quantitative Biology Center, University of Tübingen, Tübingen, Germany. Electronic address: oliver.kohlbacher@uni-tuebingen.de.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH