PERCEPTRON: an open-source GPU-accelerated proteoform identification pipeline for top-down proteomics.
Journal
Nucleic acids research
ISSN: 1362-4962
Abbreviated title: Nucleic Acids Res
Country: England
NLM ID: 0411011
Publication information
Publication date:
02 07 2021
History:
accepted: 25 04 2021
revised: 10 04 2021
received: 11 03 2021
pubmed: 18 05 2021
medline: 20 07 2021
entrez: 17 05 2021
Status:
ppublish
Abstract
PERCEPTRON is a next-generation, freely available, web-based proteoform identification and characterization platform for top-down proteomics (TDP). The PERCEPTRON search pipeline brings together algorithms for (i) intact protein mass tuning, (ii) de novo sequence tag-based filtering, (iii) characterization of terminal as well as post-translational modifications, (iv) identification of truncated proteoforms, (v) in silico spectral comparison, and (vi) weight-based candidate protein scoring. High-throughput performance is achieved by executing optimized code in parallel across multiple threads on graphics processing units (GPUs), using the NVIDIA Compute Unified Device Architecture (CUDA) framework. An intuitive graphical web interface lets users set up search parameters and visualize results. The accuracy and performance of the tool have been validated on several TDP datasets and against available TDP software. Specifically, results obtained from searching two published TDP datasets demonstrate that PERCEPTRON outperforms all other tools by up to 135% in terms of reported proteins and 10-fold in terms of runtime. In conclusion, the proposed tool significantly enhances the state of the art in TDP search software and is publicly available at https://perceptron.lums.edu.pk. Users can also create in-house deployments of the tool by building the code available in the GitHub repository (http://github.com/BIRL/Perceptron).
Identifiers
pubmed: 33999207
pii: 6276909
doi: 10.1093/nar/gkab368
pmc: PMC8262694
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
W510-W515
Copyright information
© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.