Protein NMR assignment by isotope pattern recognition.


Journal

Science advances
ISSN: 2375-2548
Titre abrégé: Sci Adv
Pays: United States
ID NLM: 101653440

Informations de publication

Date de publication:
06 Sep 2024
Historique:
medline: 4 9 2024
pubmed: 4 9 2024
entrez: 4 9 2024
Statut: ppublish

Résumé

The current standard method for amino acid signal identification in protein NMR spectra is sequential assignment using triple-resonance experiments. Good software and elaborate heuristics exist, but the process remains laboriously manual. Machine learning does help, but its training databases need millions of samples that cover all relevant physics and every kind of instrumental artifact. In this communication, we offer a solution to this problem. We propose polyadic decompositions to store millions of simulated three-dimensional NMR spectra, on-the-fly generation of artifacts during training, a probabilistic way to incorporate prior and posterior information, and integration with the industry standard CcpNmr software framework. The resulting neural nets take [

Identifiants

pubmed: 39231223
doi: 10.1126/sciadv.ado0403
doi:

Substances chimiques

Proteins 0
Amino Acids 0
Isotopes 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

eado0403

Auteurs

Uluk Rasulov (U)

School of Chemistry, University of Southampton, University Road, Southampton SO17 1BJ, UK.

Harrison K Wang (HK)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Thibault Viennet (T)

Department of Chemistry and iNANO, Aarhus University, Langelandsgade 140, 8000 Aarhus C, Denmark.

Maxim A Droemer (MA)

Faculty for Chemistry and Pharmacy, Ludwig-Maximilians-Universität München, Munich, Germany.

Srđan Matosin (S)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Sebastian Schindler (S)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Zhen-Yu J Sun (ZJ)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Luca Mureddu (L)

Department of Molecular and Cell Biology, Institute for Structural and Chemical Biology, University of Leicester, Lancaster Road, Leicester LE1 7HB, UK.

Geerten W Vuister (GW)

Department of Molecular and Cell Biology, Institute for Structural and Chemical Biology, University of Leicester, Lancaster Road, Leicester LE1 7HB, UK.

Scott A Robson (SA)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Haribabu Arthanari (H)

Department of Biochemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA 02115, USA.
Dana-Farber Cancer Institute, 450 Brookline Ave, Boston, MA 02215, USA.

Ilya Kuprov (I)

School of Chemistry, University of Southampton, University Road, Southampton SO17 1BJ, UK.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis

Classifications MeSH