Computational assessment of the feasibility of protonation-based protein sequencing.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2020
Historique:
received: 22 11 2019
accepted: 20 08 2020
entrez: 11 9 2020
pubmed: 12 9 2020
medline: 31 10 2020
Statut: epublish

Résumé

Recent advances in DNA sequencing methods revolutionized biology by providing highly accurate reads, with high throughput or high read length. These read data are being used in many biological and medical applications. Modern DNA sequencing methods have no equivalent in protein sequencing, severely limiting the widespread application of protein data. Recently, several optical protein sequencing methods have been proposed that rely on the fluorescent labeling of amino acids. Here, we introduce the reprotonation-deprotonation protein sequencing method. Unlike other methods, this proposed technique relies on the measurement of an electrical signal and requires no fluorescent labeling. In reprotonation-deprotonation protein sequencing, the terminal amino acid is identified through its unique protonation signal, and by repeatedly cleaving the terminal amino acids one-by-one, each amino acid in the peptide is measured. By means of simulations, we show that, given a reference database of known proteins, reprotonation-deprotonation sequencing has the potential to correctly identify proteins in a sample. Our simulations provide target values for the signal-to-noise ratios that sensor devices need to attain in order to detect reprotonation-deprotonation events, as well as suitable pH values and required measurement times per amino acid. For instance, an SNR of 10 is required for a 61.71% proteome recovery rate with 100 ms measurement time per amino acid.

Identifiants

pubmed: 32915813
doi: 10.1371/journal.pone.0238625
pii: PONE-D-19-32444
pmc: PMC7485799
doi:

Substances chimiques

Amino Acids 0
Fluorescent Dyes 0
Peptides 0
Proteins 0
Proteome 0
Protons 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0238625

Déclaration de conflit d'intérêts

The authors have the following interests: This work is funded by imec vzw. Giles Miclotte, Koen Martens and Jan Fostier are employees of imec vzw, Belgium; Giles Miclotte and Jan Fostier are employees of Ghent University, Ghent, Belgium. There are no patents, products in development or marketed products to declare. This does not alter the authors’ adherence to all the PLOS ONE policies on sharing data and materials.

Références

Phys Biol. 2015 Aug 12;12(5):055003
pubmed: 26266455
Proc Natl Acad Sci U S A. 2013 Mar 5;110(10):3847-52
pubmed: 23407163
Nat Rev Drug Discov. 2002 Sep;1(9):683-95
pubmed: 12209149
PLoS One. 2019 Mar 28;14(3):e0212868
pubmed: 30921350
Nat Nanotechnol. 2011 Feb;6(2):126-32
pubmed: 21258331
Nano Lett. 2013 Feb 13;13(2):625-31
pubmed: 23323846
Protein Sci. 2016 Feb;25(2):479-86
pubmed: 26540340
J Am Chem Soc. 2015 Mar 4;137(8):3093-101
pubmed: 25665463
Nature. 2008 Nov 6;456(7218):53-9
pubmed: 18987734
Proc Natl Acad Sci U S A. 2008 Jan 29;105(4):1176-81
pubmed: 18216253
J Am Chem Soc. 2004 May 5;126(17):5370-1
pubmed: 15113203
Biomark Med. 2012 Jun;6(3):259-71
pubmed: 22731899
Microbiology (Reading). 2010 Feb;156(Pt 2):287-301
pubmed: 19910409
PLoS Comput Biol. 2015 Feb 25;11(2):e1004080
pubmed: 25714988
J Am Chem Soc. 2013 May 29;135(21):7861-8
pubmed: 23631749
J Am Chem Soc. 2013 May 29;135(21):7855-60
pubmed: 23631761
Genomics Proteomics Bioinformatics. 2016 Oct;14(5):265-279
pubmed: 27646134
Nat Biotechnol. 2003 May;21(5):566-9
pubmed: 12665801
Nano Lett. 2008 Apr;8(4):1066-70
pubmed: 18311939
ACS Appl Mater Interfaces. 2017 Sep 13;9(36):30561-30566
pubmed: 28825461
Genomics Proteomics Bioinformatics. 2015 Oct;13(5):278-89
pubmed: 26542840
Clin Chim Acta. 2005 Jul 24;357(2):140-50
pubmed: 15946657
Science. 2012 Jan 20;335(6066):319-24
pubmed: 22267809
Nat Commun. 2014 Sep 03;5:4785
pubmed: 25183057
Proteins. 2009 Nov 15;77(3):570-88
pubmed: 19533744

Auteurs

Giles Miclotte (G)

IDLab, Ghent University-Imec, Ghent, Belgium.

Koen Martens (K)

Imec, Leuven, Belgium.

Jan Fostier (J)

IDLab, Ghent University-Imec, Ghent, Belgium.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Humans Magnetic Resonance Imaging Phantoms, Imaging Infant, Newborn Signal-To-Noise Ratio
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Classifications MeSH