PERCEPTRON: an open-source GPU-accelerated proteoform identification pipeline for top-down proteomics.


Journal

Nucleic acids research
ISSN: 1362-4962
Abbreviated title: Nucleic Acids Res
Country: England
NLM ID: 0411011

Publication information

Publication date:
02 07 2021
History:
accepted: 25 04 2021
revised: 10 04 2021
received: 11 03 2021
pubmed: 18 5 2021
medline: 20 7 2021
entrez: 17 5 2021
Status: ppublish

Abstract

PERCEPTRON is a next-generation, freely available, web-based proteoform identification and characterization platform for top-down proteomics (TDP). The PERCEPTRON search pipeline brings together algorithms for (i) intact protein mass tuning, (ii) de novo sequence tag-based filtering, (iii) characterization of terminal as well as post-translational modifications, (iv) identification of truncated proteoforms, (v) in silico spectral comparison, and (vi) weight-based candidate protein scoring. High-throughput performance is achieved by executing optimized code in parallel across multiple threads on graphics processing units (GPUs) using the NVIDIA Compute Unified Device Architecture (CUDA) framework. An intuitive graphical web interface lets users set up search parameters and visualize results. The accuracy and performance of the tool have been validated on several TDP datasets and against available TDP software. Specifically, results from searching two published TDP datasets show that PERCEPTRON outperforms all other tools by up to 135% in terms of reported proteins and 10-fold in terms of runtime. In conclusion, the proposed tool significantly advances the state of the art in TDP search software and is publicly available at https://perceptron.lums.edu.pk. Users can also create in-house deployments of the tool by building the code available in the GitHub repository (http://github.com/BIRL/Perceptron).
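
To make two of the pipeline stages named in the abstract more concrete, the following is a minimal Python sketch of sequence tag-based filtering followed by weight-based candidate scoring. It is an illustration under stated assumptions, not PERCEPTRON's implementation: the `Candidate` fields, the `tag_score` and `rank_candidates` helpers, the 0 to 1 score ranges, and the default weights are all hypothetical, and the real tool runs its search on GPUs via CUDA rather than in plain Python.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    """A candidate protein with hypothetical, pre-computed component scores."""
    accession: str
    sequence: str
    mass_score: float      # agreement with the intact mass, 0..1 (assumed scale)
    spectral_score: float  # in silico spectral comparison, 0..1 (assumed scale)


def tag_score(sequence, tags):
    """Fraction of de novo sequence tags found as substrings of the sequence."""
    if not tags:
        return 0.0
    return sum(tag in sequence for tag in tags) / len(tags)


def rank_candidates(candidates, tags, weights=(0.2, 0.3, 0.5)):
    """Drop candidates matching no tag, then rank the rest by a weighted sum.

    The weights (mass, tag, spectral) are illustrative defaults only.
    """
    w_mass, w_tag, w_spec = weights
    scored = []
    for c in candidates:
        t = tag_score(c.sequence, tags)
        if t == 0.0:
            continue  # tag-based filtering: no tag matched this candidate
        total = w_mass * c.mass_score + w_tag * t + w_spec * c.spectral_score
        scored.append((c.accession, total))
    # Highest combined score first
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

Calling `rank_candidates(candidates, tags)` with a list of `Candidate` objects and a list of tag strings returns `(accession, score)` pairs in descending score order. Filtering before scoring mirrors the ordering of stages in the abstract: candidates that match no tag are discarded before the more expensive comparisons would be combined.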

Identifiers

pubmed: 33999207
pii: 6276909
doi: 10.1093/nar/gkab368
pmc: PMC8262694

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

W510-W515

Copyright information

© The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research.

Authors

Muhammad Farhan Khalid (MF)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Kanzal Iman (K)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Amna Ghafoor (A)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Mujtaba Saboor (M)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Ahsan Ali (A)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Urwa Muaz (U)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Abdul Rehman Basharat (AR)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Taha Tahir (T)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Muhammad Abubakar (M)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Momina Amer Akhter (MA)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.

Waqar Nabi (W)

School of Computing Science, University of Glasgow, Glasgow, G12 8QQ, UK.

Wim Vanderbauwhede (W)

School of Computing Science, University of Glasgow, Glasgow, G12 8QQ, UK.

Fayyaz Ahmad (F)

Department of Statistics, University of Gujrat, Gujrat, Pakistan.

Bilal Wajid (B)

Department of Electrical Engineering, University of Engineering and Technology, Lahore, Pakistan.
Department of Computer Science, University of Management and Technology, Lahore, Pakistan.
Division of Research and Development, Sabz-Qalam, Lahore, Pakistan.

Safee Ullah Chaudhary (SU)

Biomedical Informatics Research Laboratory, Department of Biology, Lahore University of Management Sciences, Lahore, Pakistan.
