Quality Control for the Target Decoy Approach for Peptide Identification.

Bioconductor TDA assumptions diagnostic plots false discovery rate mass spectrometry peptide identification peptide-to-spectrum match proteomics data analysis quality control target decoy approach

Journal

Journal of proteome research
ISSN: 1535-3907
Titre abrégé: J Proteome Res
Pays: United States
ID NLM: 101128775

Informations de publication

Date de publication:
03 02 2023
Historique:
pubmed: 18 1 2023
medline: 7 2 2023
entrez: 17 1 2023
Statut: ppublish

Résumé

Reliable peptide identification is key in mass spectrometry (MS) based proteomics. To this end, the target decoy approach (TDA) has become the cornerstone for extracting a set of reliable peptide-to-spectrum matches (PSMs) that will be used in downstream analysis. Indeed, TDA is now the default method to estimate the false discovery rate (FDR) for a given set of PSMs, and users typically view it as a universal solution for assessing the FDR in the peptide identification step. However, the TDA also relies on a minimal set of assumptions, which are typically never verified in practice. We argue that a violation of these assumptions can lead to poor FDR control, which can be detrimental to any downstream data analysis. We here therefore first clearly spell out these TDA assumptions, and introduce TargetDecoy, a Bioconductor package with all the necessary functionality to control the TDA quality and its underlying assumptions for a given set of PSMs.

Identifiants

pubmed: 36648107
doi: 10.1021/acs.jproteome.2c00423
doi:

Substances chimiques

Peptides 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

350-358

Auteurs

Elke Debrie (E)

Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.

Milan Malfait (M)

Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.
Statistics and Decision Sciences, Janssen Pharmaceutical Companies of Johnson and Johnson, 2340Beerse, Belgium.

Ralf Gabriels (R)

VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium.

Arthur Declerq (A)

VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium.

Adriaan Sticker (A)

Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.
VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium.

Lennart Martens (L)

VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.
Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium.

Lieven Clement (L)

Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
1.00
Humans Magnetic Resonance Imaging Brain Infant, Newborn Infant, Premature
Humans Algorithms Software Artificial Intelligence Computer Simulation

Classifications MeSH