Signal Peptides Generated by Attention-Based Neural Networks.


Journal

ACS synthetic biology
ISSN: 2161-5063
Abbreviated title: ACS Synth Biol
Country: United States
NLM ID: 101575075

Publication information

Publication date:
21 Aug 2020
History:
pubmed: 11 Jul 2020
medline: 28 May 2021
entrez: 11 Jul 2020
Status: ppublish

Abstract

Short (15-30 residue) chains of amino acids at the amino termini of expressed proteins known as signal peptides (SPs) specify secretion in living cells. We trained an attention-based neural network, the Transformer model, on data from all available organisms in Swiss-Prot to generate SP sequences. Experimental testing demonstrates that the model-generated SPs are functional: when appended to enzymes expressed in an industrial

Identifiers

pubmed: 32649182
doi: 10.1021/acssynbio.0c00219

Chemical substances

Bacterial Proteins 0
Protein Sorting Signals 0

Publication types

Journal Article
Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

2154-2161

Authors

Zachary Wu (Z)

Department of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States.

Kevin K Yang (KK)

Department of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States.

Michael J Liszka (MJ)

BASF Enzymes, San Diego, California 92121, United States.

Alycia Lee (A)

Department of Computational and Mathematical Sciences, California Institute of Technology, Pasadena, California 91125, United States.

Alina Batzilla (A)

BASF Enzymes, San Diego, California 92121, United States.

David Wernick (D)

BASF Enzymes, San Diego, California 92121, United States.

David P Weiner (DP)

BASF Enzymes, San Diego, California 92121, United States.

Frances H Arnold (FH)

Department of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States.
