MaxDIA enables library-based and library-free data-independent acquisition proteomics.

Peptide Library Proteome / analysis Proteomics / methods Software

Journal

Nature biotechnology

ISSN: 1546-1696

Titre abrégé: Nat Biotechnol

Pays: United States

ID NLM: 9604648

Informations de publication

Date de publication:
12 2021

Historique:

received: 17 11 2020

accepted: 27 05 2021

pubmed: 10 7 2021

medline: 20 4 2022

entrez: 9 7 2021

Statut: ppublish

Résumé

MaxDIA is a software platform for analyzing data-independent acquisition (DIA) proteomics data within the MaxQuant software environment. Using spectral libraries, MaxDIA achieves deep proteome coverage with substantially better coefficients of variation in protein quantification than other software. MaxDIA is equipped with accurate false discovery rate (FDR) estimates on both library-to-DIA match and protein levels, including when using whole-proteome predicted spectral libraries. This is the foundation of discovery DIA-hypothesis-free analysis of DIA samples without library and with reliable FDR control. MaxDIA performs three- or four-dimensional feature detection of fragment data, and scoring of matches is augmented by machine learning on the features of an identification. MaxDIA's bootstrap DIA workflow performs multiple rounds of matching with increasing quality of recalibration and stringency of matching to the library. Combining MaxDIA with two new technologies-BoxCar acquisition and trapped ion mobility spectrometry-both lead to deep and accurate proteome quantification.

Identifiants

DOI: 10.1038/s41587-021-00968-7 PMID: 34239088 PMC: PMC8668435

pubmed: 34239088

doi: 10.1038/s41587-021-00968-7

pii: 10.1038/s41587-021-00968-7

pmc: PMC8668435

doi:

Substances chimiques

Peptide Library 0

Proteome 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

Pagination

1563-1573

Subventions

Organisme : Biotechnology and Biological Sciences Research Council

ID : BB/P024599/1

Pays : United Kingdom

Commentaires et corrections

Type : CommentIn

Informations de copyright

Références

Doerr, A. DIA mass spectrometry. Nat. Methods 12, 35–35 (2014).

doi: 10.1038/nmeth.3234

Navarro, P. et al. A multicenter study benchmarks software tools for label-free proteome quantification. Nat. Biotechnol. 34, 1130–1136 (2016).

pubmed: 27701404 pmcid: 5120688 doi: 10.1038/nbt.3685

Cox, J. & Mann, M. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification. Nat. Biotechnol. 26, 1367–1372 (2008).

pubmed: 19029910 doi: 10.1038/nbt.1511

Azvolinsky, A., DeFrancesco, L., Waltz, E. & Webb, S. 20 years of Nature Biotechnology research tools. Nat. Biotechnol. 34, 256–261 (2016).

pubmed: 26963546 doi: 10.1038/nbt.3507

Sinitcyn, P., Rudolph, J. D. & Cox, J. Computational methods for understanding mass spectrometry-based shotgun proteomics. Annu. Rev. Biomed. Data Sci. 1, 207–234 (2018).

doi: 10.1146/annurev-biodatasci-080917-013516

Sinitcyn, P. et al. MaxQuant goes Linux. Nat. Methods 15, 401 (2018).

pubmed: 29855570 doi: 10.1038/s41592-018-0018-y

Röst, H. L. et al. OpenSWATH enables automated, targeted analysis of data-independent acquisition MS data. Nat. Biotechnol. 32, 219–223 (2014).

pubmed: 24727770 doi: 10.1038/nbt.2841

MacLean, B. et al. Skyline: an open source document editor for creating and analyzing targeted proteomics experiments. Bioinformatics 26, 966–968 (2010).

pubmed: 20147306 pmcid: 2844992 doi: 10.1093/bioinformatics/btq054

Bruderer, R. et al. Extending the limits of quantitative proteome profiling with data-independent acquisition and application to acetaminophen-treated three-dimensional liver microtissues. Mol. Cell. Proteomics 14, 1400–1410 (2015).

pubmed: 25724911 pmcid: 4424408 doi: 10.1074/mcp.M114.044305

Demichev, V., Messner, C. B., Vernardis, S. I., Lilley, K. S. & Ralser, M. DIA-NN: neural networks and interference correction enable deep proteome coverage in high throughput. Nat. Methods 14, 41–44 (2020).

doi: 10.1038/s41592-019-0638-x

Cox, J. et al. Accurate proteome-wide label-free quantification by delayed normalization and maximal peptide ratio extraction, termed MaxLFQ. Mol. Cell. Proteomics 13, 2513–2526 (2014).

pubmed: 24942700 pmcid: 4159666 doi: 10.1074/mcp.M113.031591

Rosenberger, G. et al. Statistical control of peptide and protein error rates in large-scale targeted data-independent acquisition analyses. Nat. Methods 14, 921–927 (2017).

pubmed: 28825704 pmcid: 5581544 doi: 10.1038/nmeth.4398

Elias, J. E. & Gygi, S. P. Target-decoy search strategy for increased confidence in large-scale protein identifications by mass spectrometry. Nat. Methods 4, 207–214 (2007).

pubmed: 17327847 doi: 10.1038/nmeth1019

Tsou, C.-C. et al. DIA-Umpire: comprehensive computational framework for data-independent acquisition proteomics. Nat. Methods 12, 258–264 (2015).

pubmed: 25599550 pmcid: 4399776 doi: 10.1038/nmeth.3255

Tiwary, S. et al. High quality MS/MS spectrum prediction for data-dependent and -independent acquisition data analysis. Nat. Methods 16, 519–525 (2019).

Gessulat, S. et al. Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat. Methods 16, 509–518 (2019).

pubmed: 31133760 doi: 10.1038/s41592-019-0426-7

Yang, Y. et al. In silico spectral libraries by deep learning facilitate data-independent acquisition proteomics. Nat. Commun. 11, 146 (2020).

pubmed: 31919359 pmcid: 6952453 doi: 10.1038/s41467-019-13866-z

Searle, B. C. et al. Generating high quality libraries for DIA MS with empirically corrected peptide predictions. Nat. Commun. 11, 1548 (2020).

pubmed: 32214105 pmcid: 7096433 doi: 10.1038/s41467-020-15346-1

Lou, R. et al. Hybrid spectral library combining DIA-MS data and a targeted virtual library substantially deepens the proteome coverage. iScience 23, 100903 (2020).

pubmed: 32109675 pmcid: 7044796 doi: 10.1016/j.isci.2020.100903

Tran, N. H. et al. Deep learning enables de novo peptide sequencing from data-independent-acquisition mass spectrometry. Nat. Methods 16, 62–66 (2019).

doi: 10.1038/s41592-018-0260-3

Graves, A. et al. A novel connectionist system for unconstrained handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31, 855–868 (2009).

pubmed: 19299860 doi: 10.1109/TPAMI.2008.137

Chen, T. & Guestrin, C. XGBoost: reliable large-scale tree boosting system. Preprint at https://arxiv.org/abs/1603.02754 (2016).

Prianichnikov, N. et al. MaxQuant software for ion mobility enhanced shotgun proteomics. Mol. Cell. Proteomics 19, 1058–1069 (2020).

pubmed: 32156793 pmcid: 7261821 doi: 10.1074/mcp.TIR119.001720

Meier, F., Geyer, P. E., Virreira Winter, S., Cox, J. & Mann, M. BoxCar acquisition method enables single-shot proteomics at a depth of 10,000 proteins in 100 minutes. Nat. Methods 15, 440–448 (2018).

pubmed: 29735998 doi: 10.1038/s41592-018-0003-5

Fernandez-Lima, F., Kaplan, D. A., Suetering, J. & Park, M. A. Gas-phase separation using a trapped ion mobility spectrometer. Int. J. Ion Mobil. Spectrom. https://doi.org/10.1007/s12127-011-0067-8 (2011).

Silveira, J. A., Ridgeway, M. E. & Park, M. A. High resolution trapped ion mobility spectrometery of peptides. Anal. Chem. 86, 5624–5627 (2014).

pubmed: 24862843 doi: 10.1021/ac501261h

Meier, F. et al. Online parallel accumulation–serial fragmentation (PASEF) with a novel trapped ion mobility mass spectrometer. Mol. Cell. Proteomics 17, 2534–2545 (2018).

Perez-Riverol, Y. et al. The PRIDE database and related tools and resources in 2019: improving support for quantification data. Nucleic Acids Res. 47, D442–D450 (2019).

pubmed: 30395289 doi: 10.1093/nar/gky1106

Griss, J. et al. The mzTab data exchange format: communicating mass-spectrometry-based proteomics and metabolomics experimental results to a wider audience. Mol. Cell. Proteomics 13, 2765–2775 (2014).

pubmed: 24980485 pmcid: 4189001 doi: 10.1074/mcp.O113.036681

Martens, L. et al. mzML—a community standard for mass spectrometry data. Mol. Cell. Proteomics 10, R110 000133 (2011).

pubmed: 20716697 doi: 10.1074/mcp.R110.000133

Cox, J., Michalski, A. & Mann, M. Software lock mass by two-dimensional minimization of peptide mass errors. J. Am. Soc. Mass. Spectrom. 22, 1373–1380 (2011).

pubmed: 21953191 pmcid: 3231580 doi: 10.1007/s13361-011-0142-8

Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).

doi: 10.1023/A:1010933404324

Käll, L., Canterbury, J. D., Weston, J., Noble, W. S. & MacCoss, M. J. Semi-supervised learning for peptide identification from shotgun proteomics datasets. Nat. Methods 4, 923–925 (2007).

pubmed: 17952086 doi: 10.1038/nmeth1113

Bruderer, R. et al. Optimization of experimental parameters in data-independent mass spectrometry significantly increases depth and reproducibility of results. Mol. Cell. Proteomics 16, 2296–2309 (2017).

Ludwig, C. et al. Data‐independent acquisition‐based SWATH‐MS for quantitative proteomics: a tutorial. Mol. Syst. Biol. 14, e8126 (2018).

pubmed: 30104418 pmcid: 6088389 doi: 10.15252/msb.20178126

Mortazavi, A., Williams, B. A., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-seq. Nat. Methods 5, 621–628 (2008).

pubmed: 18516045 doi: 10.1038/nmeth.1226

Selbach, M. et al. Widespread changes in protein synthesis induced by microRNAs. Nature 455, 58–63 (2008).

pubmed: 18668040 doi: 10.1038/nature07228

Buccitelli, C. & Selbach, M. mRNAs, proteins and the emerging principles of gene expression control. Nat. Rev. Genet. 21, 630–644 (2020).

pubmed: 32709985 doi: 10.1038/s41576-020-0258-4

UniProt: the universal protein knowledgebase. Nucleic Acids Res. 45, D158–D169 (2017).

Tsai, T. H. et al. Selection of features with consistent profiles improves relative protein quantification in mass spectrometry experiments. Mol. Cell. Proteomics 19, 944–959 (2020).

pubmed: 32234965 pmcid: 7261813 doi: 10.1074/mcp.RA119.001792

Vaca Jacome, A. S. et al. Avant-garde: an automated data-driven DIA data curation tool. Nat. Methods 17, 1237–1244 (2020).

pubmed: 33199889 doi: 10.1038/s41592-020-00986-4

Searle, B. C. et al. Chromatogram libraries improve peptide detection and quantification by data independent acquisition mass spectrometry. Nat. Commun. 9, 5128 (2018).

pubmed: 30510204 pmcid: 6277451 doi: 10.1038/s41467-018-07454-w

Teo, G. et al. MapDIA: preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry. J. Proteomics 129, 108–120 (2015).

pubmed: 26381204 pmcid: 4630088 doi: 10.1016/j.jprot.2015.09.013

Hebenstreit, D. et al. RNA sequencing reveals two major classes of gene expression levels in metazoan cells. Mol. Syst. Biol. 7, 497 (2011).

pubmed: 21654674 pmcid: 3159973 doi: 10.1038/msb.2011.28

Bekker-Jensen, D. B. et al. Rapid and site-specific deep phosphoproteome profiling by data-independent acquisition without the need for spectral libraries. Nat. Commun. 11, 787 (2020).

pubmed: 32034161 pmcid: 7005859 doi: 10.1038/s41467-020-14609-1

Müller, F., Kolbowski, L., Bernhardt, O. M., Reiter, L. & Rappsilber, J. Data-independent acquisition improves quantitative cross-linking mass spectrometry. Mol. Cell. Proteomics 18, 786–795 (2019).

pubmed: 30651306 pmcid: 6442367 doi: 10.1074/mcp.TIR118.001276

Rappsilber, J., Ishihama, Y. & Mann, M. Stop and go extraction tips for matrix-assisted laser desorption/ionization, nanoelectrospray, and LC/MS sample pretreatment in proteomics. Anal. Chem. 75, 663–670 (2003).

pubmed: 12585499 doi: 10.1021/ac026117i

Fonslow, B. R. et al. Digestion and depletion of abundant proteins improves proteomic coverage. Nat. Methods 10, 54–56 (2013).

pubmed: 23160281 doi: 10.1038/nmeth.2250

Wiśniewski, J. R., Zougman, A., Nagaraj, N. & Mann, M. Universal sample preparation method for proteome analysis. Nat. Methods 6, 359–362 (2009).

pubmed: 19377485 doi: 10.1038/nmeth.1322

Distler, U., Kuharev, J., Navarro, P. & Tenzer, S. Label-free quantification in ion mobility-enhanced data-independent acquisition proteomics. Nat. Protoc. 11, 795–812 (2016).

pubmed: 27010757 doi: 10.1038/nprot.2016.042

Mertins, P. et al. Reproducible workflow for multiplexed deep-scale proteome and phosphoproteome analysis of tumor tissues by liquid chromatography–mass spectrometry. Nat. Protocols 13, 1632–1661 (2018).

pubmed: 29988108 doi: 10.1038/s41596-018-0006-9

Djebali, S. et al. Landscape of transcription in human cells. Nature 489, 101–108 (2012).

pubmed: 22955620 pmcid: 3684276 doi: 10.1038/nature11233

Thul, P. J. et al. A subcellular map of the human proteome. Science 356, eaal3321 (2017).

pubmed: 28495876 doi: 10.1126/science.aal3321

Maglott, D., Ostell, J., Pruitt, K. D. & Tatusova, T. Entrez Gene: gene-centered information at NCBI. Nucleic Acids Res. 39, D52–D57 (2011).

pubmed: 21115458 doi: 10.1093/nar/gkq1237

Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2115–2120 (2014).

doi: 10.1093/bioinformatics/btu170

Dobin, A. et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

pubmed: 23104886 doi: 10.1093/bioinformatics/bts635

Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).

pubmed: 19505943 pmcid: 2723002 doi: 10.1093/bioinformatics/btp352

Tyanova, S. et al. The Perseus computational platform for comprehensive analysis of (prote)omics data. Nat. Methods 13, 731–740 (2016).

pubmed: 27348712 doi: 10.1038/nmeth.3901

MaxDIA enables library-based and library-free data-independent acquisition proteomics.

Journal

Informations de publication

Résumé

Identifiants

Substances chimiques

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Commentaires et corrections

Informations de copyright

Références

Auteurs

Pavel Sinitcyn (P)

Hamid Hamzeiy (H)

Favio Salinas Soto (F)

Daniel Itzhak (D)

Frank McCarthy (F)

Christoph Wichmann (C)

Martin Steger (M)

Uli Ohmayer (U)

Ute Distler (U)

Stephanie Kaspar-Schoenefeld (S)

Nikita Prianichnikov (N)

Şule Yılmaz (Ş)

Jan Daniel Rudolph (JD)

Stefan Tenzer (S)

Yasset Perez-Riverol (Y)

Nagarjuna Nagaraj (N)

Sean J Humphrey (SJ)

Jürgen Cox (J)

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Accuracy of web-based automated versus digital manual cephalometric landmark identification.

An arithmetic operation P system based on symmetric ternary system.

Classifications MeSH