Generating high quality libraries for DIA MS with empirically corrected peptide predictions.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
25 03 2020
Historique:
received: 21 08 2019
accepted: 28 02 2020
entrez: 28 3 2020
pubmed: 28 3 2020
medline: 15 7 2020
Statut: epublish

Résumé

Data-independent acquisition approaches typically rely on experiment-specific spectrum libraries, requiring offline fractionation and tens to hundreds of injections. We demonstrate a library generation workflow that leverages fragmentation and retention time prediction to build libraries containing every peptide in a proteome, and then refines those libraries with empirical data. Our method specifically enables rapid, experiment-specific library generation for non-model organisms, which we demonstrate using the malaria parasite Plasmodium falciparum, and non-canonical databases, which we show by detecting missense variants in HeLa.

Identifiants

pubmed: 32214105
doi: 10.1038/s41467-020-15346-1
pii: 10.1038/s41467-020-15346-1
pmc: PMC7096433
doi:

Substances chimiques

Peptide Library 0
Peptides 0
Proteome 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1548

Subventions

Organisme : NIAID NIH HHS
ID : K25 AI119229
Pays : United States
Organisme : NIGMS NIH HHS
ID : R01 GM133981
Pays : United States

Références

Gillet, L. C. et al. Targeted data extraction of the MS/MS spectra generated by data-independent acquisition: a new concept for consistent and accurate proteome analysis. Mol. Cell. Proteomics 11, O111.016717 (2012).
doi: 10.1074/mcp.O111.016717 pubmed: 3433915 pmcid: 3433915
Doerr, A. DIA mass spectrometry. Nat. Methods 12, 35 (2014).
doi: 10.1038/nmeth.3234
Weisbrod, C. R., Eng, J. K., Hoopmann, M. R., Baker, T. & Bruce, J. E. Accurate peptide fragment mass analysis: multiplexed peptide identification and quantification. J. Proteome Res. 11, 1621–1632 (2012).
doi: 10.1021/pr2008175 pubmed: 3319072 pmcid: 3319072
Röst, H. L. et al. OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 32, 219–223 (2014).
doi: 10.1038/nbt.2841
Wang, J. et al. MSPLIT-DIA: sensitive peptide identification for data-independent acquisition. Nat. Methods 12, 1106–1108 (2015).
doi: 10.1038/nmeth.3655 pubmed: 4857761 pmcid: 4857761
Schubert, O. T. et al. Building high-quality assay libraries for targeted analysis of SWATH MS data. Nat. Protoc. 10, 426–441 (2015).
doi: 10.1038/nprot.2015.015
Ting, Y. S. et al. Peptide-centric proteome analysis: an alternative strategy for the analysis of tandem mass spectrometry data. Mol. Cell. Proteomics 14, 2301–2307 (2015).
doi: 10.1074/mcp.O114.047035 pubmed: 4563716 pmcid: 4563716
Rosenberger, G. et al. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat. Methods 14, 921–927 (2017).
doi: 10.1038/nmeth.4398 pubmed: 5581544 pmcid: 5581544
Bruderer, R. et al. Optimization of experimental parameters in data-independent mass spectrometry significantly increases depth and reproducibility of results. Mol. Cell. Proteomics 16, 2296–2309 (2017).
doi: 10.1074/mcp.RA117.000314 pubmed: 5724188 pmcid: 5724188
Ting, Y. S. et al. PECAN: library-free peptide detection for data-independent acquisition tandem mass spectrometry data. Nat. Methods 14, 903–908 (2017).
doi: 10.1038/nmeth.4390 pubmed: 5578911 pmcid: 5578911
Tsou, C.-C. et al. DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics. Nat. Methods 12, 258 (2015).
doi: 10.1038/nmeth.3255 pubmed: 4399776 pmcid: 4399776
Panchaud, A. et al. Precursor acquisition independent from ion count: how to dive deeper into the proteomics ocean. Anal. Chem. 81, 6481–6488 (2009).
doi: 10.1021/ac900888s pubmed: 3086478 pmcid: 3086478
Searle, B. C. et al. Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry. Nat. Commun. 9, 5128 (2018).
doi: 10.1038/s41467-018-07454-w pubmed: 6277451 pmcid: 6277451
Gessulat, S. et al. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 16, 509–518 (2019).
doi: 10.1038/s41592-019-0426-7 pubmed: 31133760 pmcid: 31133760
Tiwary, S. et al. High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. Nat. Methods 16, 519–525 (2019).
doi: 10.1038/s41592-019-0427-6
Searle, B. C., Egertson, J. D., Bollinger, J. G., Stergachis, A. B. & MacCoss, M. J. Using Data Independent Acquisition (DIA) to model high-responding peptides for targeted proteomics experiments. Mol. Cell. Proteomics 14, 2331–2340 (2015).
doi: 10.1074/mcp.M115.051300 pubmed: 4563719 pmcid: 4563719
Egertson, J. D., MacLean, B., Johnson, R., Xuan, Y. & MacCoss, M. J. Multiplexed peptide analysis using data-independent acquisition and Skyline. Nat. Protoc. 10, 887–903 (2015).
doi: 10.1038/nprot.2015.055 pubmed: 5127711 pmcid: 5127711
Amodei, D. et al. Improving precursor selectivity in data-independent acquisition using overlapping windows. J. Am. Soc. Mass Spectrom. 30, 669–684 (2019).
doi: 10.1007/s13361-018-2122-8 pubmed: 6445824 pmcid: 6445824
Röst, H. L. Deep learning adds an extra dimension to peptide fragmentation. Nat. Methods 16, 469–470 (2019).
doi: 10.1038/s41592-019-0428-5
Peterson, A. C., Russell, J. D., Bailey, D. J., Westphall, M. S. & Coon, J. J. Parallel reaction monitoring for high resolution and high mass accuracy quantitative, targeted proteomics. Mol. Cell. Proteomics 11, 1475–1488 (2012).
doi: 10.1074/mcp.O112.020131 pubmed: 3494192 pmcid: 3494192
Zolg, D. P. et al. PROCAL: a set of 40 peptide standards for retention time indexing, column performance monitoring, and collision energy calibration. Proteomics 17, https://doi.org/10.1002/pmic.201700263 (2017).
Rosenberger, G. et al. A repository of assays to quantify 10,000 human proteins by SWATH-MS. Sci. Data 1, 140031 (2014).
doi: 10.1038/sdata.2014.31 pubmed: 4322573 pmcid: 4322573
Zolg, D. P. et al. Building ProteomeTools based on a complete synthetic human proteome. Nat. Methods 14, 259–262 (2017).
doi: 10.1038/nmeth.4153 pubmed: 5868332 pmcid: 5868332
Wang, M. et al. Assembling the community-scale discoverable human proteome. Cell Syst. 7, 412–421.e5 (2018).
doi: 10.1016/j.cels.2018.08.004 pubmed: 6279426 pmcid: 6279426
Liu, Y. et al. Multi-omic measurements of heterogeneity in HeLa cells across laboratories. Nat. Biotechnol. 37, 314–322 (2019).
doi: 10.1038/s41587-019-0037-y
Tate, J. G. et al. COSMIC: the Catalogue Of Somatic Mutations In Cancer. Nucleic Acids Res. 47, D941–D947 (2019).
doi: 10.1093/nar/gky1015
Robin, T., Bairoch, A., Müller, M., Lisacek, F. & Lane, L. Large-scale reanalysis of publicly available HeLa cell proteomics data in the context of the Human Proteome Project. J. Proteome Res. 17, 4160–4170 (2018).
doi: 10.1021/acs.jproteome.8b00392
Garboczi, D. N. et al. Structure of the complex between human T-cell receptor, viral peptide and HLA-A2. Nature 384, 134–141 (1996).
doi: 10.1038/384134a0 pubmed: 8906788 pmcid: 8906788
Rosenberger, G. et al. Inference and quantification of peptidoforms in large sample cohorts by SWATH-MS. Nat. Biotechnol. 35, 781–788 (2017).
doi: 10.1038/nbt.3908 pubmed: 5593115 pmcid: 5593115
Searle, B. C., Lawrence, R. T., MacCoss, M. J. & Villén, J. Thesaurus: quantifying phosphopeptide positional isomers. Nat. Methods 16, 703–706 (2019).
doi: 10.1038/s41592-019-0498-4 pubmed: 7012383 pmcid: 7012383
Lasonder, E. et al. Integrated transcriptomic and proteomic analyses of P. falciparum gametocytes: molecular insight into sex-specific processes and translational repression. Nucleic Acids Res. 44, 6087–6101 (2016).
doi: 10.1093/nar/gkw536 pubmed: 5291273 pmcid: 5291273
Pino, L. K. et al. Matrix-matched calibration curves for assessing analytical figures of merit in quantitative proteomics. J. Proteome Res. 19, 1147–1153 (2020).
doi: 10.1021/acs.jproteome.9b00666
Tyanova, S., Temu, T. & Cox, J. The MaxQuant computational platform for mass spectrometry-based shotgun proteomics. Nat. Protoc. 11, 2301–2319 (2016).
doi: 10.1038/nprot.2016.136
Aurrecoechea, C. et al. PlasmoDB: a functional genomic database for malaria parasites. Nucleic Acids Res. 37, D539–D543 (2009).
doi: 10.1093/nar/gkn814
Bansal, A., Molina-Cruz, A., Brzostowski, J., Mu, J. & Miller, L. H. Plasmodium falciparum calcium-dependent protein kinase 2 is critical for male gametocyte exflagellation but not essential for asexual proliferation. MBio 8, e01656-17 (2017).
Nepomuceno, A. I. et al. Dual electrospray ionization source for confident generation of accurate mass tags using liquid chromatography Fourier transform ion cyclotron resonance mass spectrometry. Anal. Chem. 75, 3411–3418 (2003).
doi: 10.1021/ac0342471
Pasa-Tolić, L., Masselon, C., Barry, R. C., Shen, Y. & Smith, R. D. Proteomic analyses using an accurate mass and time tag strategy. Biotechniques 37, 621–624, 626–633, 636 passim (2004).
doi: 10.2144/04374RV01
Meier, F., Geyer, P. E., Virreira Winter, S., Cox, J. & Mann, M. BoxCar acquisition method enables single-shot proteomics at a depth of 10,000 proteins in 100 minutes. Nat. Methods 15, 440–448 (2018).
doi: 10.1038/s41592-018-0003-5
Lim, M. Y., Paulo, J. A. & Gygi, S. P. Evaluating false transfer rates from the match-between-runs algorithm with a two-proteome model. J. Proteome Res. 18, 4020–4026 (2019).
doi: 10.1021/acs.jproteome.9b00492
MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 (2010).
doi: 10.1093/bioinformatics/btq054 pubmed: 2844992 pmcid: 2844992
Wessel, D. & Flügge, U. I. A method for the quantitative recovery of protein in dilute solution in the presence of detergents and lipids. Anal. Biochem. 138, 141–143 (1984).
doi: 10.1016/0003-2697(84)90782-6
Gardner, M. J. et al. Genome sequence of the human malaria parasite Plasmodium falciparum. Nature 419, 498–511 (2002).
doi: 10.1038/nature01097
Orsburn, B. Normalized collision energy calculation for Q Exactive. http://proteomicsnews.blogspot.com/2014/06/normalized-collision-energy-calculation.html (2014).
Chambers, M. C. et al. A cross-platform toolkit for mass spectrometry and proteomics. Nat. Biotechnol. 30, 918–920 (2012).
doi: 10.1038/nbt.2377 pubmed: 3471674 pmcid: 3471674
Eng, J. K., Jahan, T. A. & Hoopmann, M. R. Comet: an open-source MS/MS sequence database search tool. Proteomics 13, 22–24 (2013).
doi: 10.1002/pmic.201200439
Keller, A., Nesvizhskii, A. I., Kolker, E. & Aebersold, R. Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 74, 5383–5392 (2002).
doi: 10.1021/ac025747h
Deutsch, E. W. et al. Trans-proteomic pipeline, a standardized data processing pipeline for large-scale reproducible proteomics informatics. Proteom. Clin. Appl. 9, 745–754 (2015).
doi: 10.1002/prca.201400164
Frewen, B. E., Merrihew, G. E., Wu, C. C., Noble, W. S. & MacCoss, M. J. Analysis of peptide MS/MS spectra from large-scale proteomics experiments using spectrum libraries. Anal. Chem. 78, 5678–5684 (2006).
doi: 10.1021/ac060279n
Pino, L. K. et al. The Skyline ecosystem: informatics for quantitative mass spectrometry proteomics. Mass Spectrom. Rev. https://doi.org/10.1002/mas.21540 (2017).
doi: 10.1002/mas.21540 pubmed: 5799042 pmcid: 5799042
Elias, J. E. & Gygi, S. P. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods 4, 207–214 (2007).
doi: 10.1038/nmeth1019
Käll, L., Canterbury, J. D., Weston, J., Noble, W. S. & MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923–925 (2007).
doi: 10.1038/nmeth1113
The, M., MacCoss, M. J., Noble, W. S. & Käll, L. Fast and Accurate protein false discovery rates on large-scale proteomics data sets with Percolator 3.0. J. Am. Soc. Mass Spectrom. 27, 1719–1727 (2016).
doi: 10.1007/s13361-016-1460-7 pubmed: 5059416 pmcid: 5059416
Savitzky, A. & Golay, M. J. E. Smoothing and differentiation of data by simplified least squares procedures. Anal. Chem. 36, 1627–1639 (1964).
doi: 10.1021/ac60214a047

Auteurs

Brian C Searle (BC)

Institute for Systems Biology, Seattle, WA, USA. bsearle@systemsbiology.org.
Proteome Software, Inc., Portland, OR, USA. bsearle@systemsbiology.org.

Kristian E Swearingen (KE)

Institute for Systems Biology, Seattle, WA, USA.

Christopher A Barnes (CA)

Novo Nordisk Research Center Seattle, Inc., Seattle, WA, USA.

Tobias Schmidt (T)

Technical University of Munich, Freising, Germany.

Siegfried Gessulat (S)

Technical University of Munich, Freising, Germany.
SAP SE, Potsdam, Germany.

Bernhard Küster (B)

Technical University of Munich, Freising, Germany.
Bavarian Center for Biomolecular Mass Spectrometry, Freising, Germany.

Mathias Wilhelm (M)

Technical University of Munich, Freising, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH