Protein A-like Peptide Design Based on Diffusion and ESM2 Models.


Journal

Molecules (Basel, Switzerland)
ISSN: 1420-3049
Titre abrégé: Molecules
Pays: Switzerland
ID NLM: 100964009

Informations de publication

Date de publication:
21 Oct 2024
Historique:
received: 09 09 2024
revised: 02 10 2024
accepted: 15 10 2024
medline: 26 10 2024
pubmed: 26 10 2024
entrez: 26 10 2024
Statut: epublish

Résumé

Proteins are the foundation of life, and designing functional proteins remains a key challenge in biotechnology. Before the development of AlphaFold2, the focus of design was primarily on structure-centric approaches such as using the well-known open-source software Rosetta3. Following the development of AlphaFold2, deep-learning techniques for protein design gained prominence. This study proposes a new method to generate functional proteins using the diffusion model and ESM2 protein language model. Diffusion models, which are widely used in image and natural language generation, are used here for protein design, facilitating the controlled generation of new sequences. The ESM2 model, trained on the basis of large-scale protein sequence data, provides a deep understanding of the context of the sequence, thus improving the model's ability to generate biologically relevant proteins. In this study, we used the Protein A-like peptide as a model study object, combined the diffusion model and the ESM2 model to generate new peptide sequences from minimal input data, and verified their biological activities through experiments such as the BLI affinity test. In conclusion, we developed a new method for protein design that provides a novel strategy to meet the challenges of generic protein generation.

Identifiants

pubmed: 39459333
pii: molecules29204965
doi: 10.3390/molecules29204965
pii:
doi:

Substances chimiques

Peptides 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Auteurs

Long Zhao (L)

Department of Pharmaceutics, Beijing Institute of Petrochemical Technology, Beijing 102627, China.
Department of Computer Science, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Qiang He (Q)

Department of Pharmaceutics, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Huijia Song (H)

Department of Computer Science, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Tianqian Zhou (T)

Department of Computer Science, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

An Luo (A)

Department of Pharmaceutics, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Zhenguo Wen (Z)

Department of Pharmaceutics, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Teng Wang (T)

Department of Pharmaceutics, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Xiaozhu Lin (X)

Department of Computer Science, Beijing Institute of Petrochemical Technology, Beijing 102627, China.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Animals Hemiptera Insect Proteins Phylogeny Insecticides

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software
alpha-Synuclein Humans Animals Mice Lewy Body Disease

Classifications MeSH