LectinOracle: A Generalizable Deep Learning Model for Lectin-Glycan Binding Prediction.

bioinformatics carbohydrate computational biology glycobiology machine learning

Journal

Advanced science (Weinheim, Baden-Wurttemberg, Germany)
ISSN: 2198-3844
Titre abrégé: Adv Sci (Weinh)
Pays: Germany
ID NLM: 101664569

Informations de publication

Date de publication:
01 2022
Historique:
revised: 03 11 2021
received: 30 08 2021
pubmed: 5 12 2021
medline: 15 3 2022
entrez: 4 12 2021
Statut: ppublish

Résumé

Ranging from bacterial cell adhesion over viral cell entry to human innate immunity, glycan-binding proteins or lectins are abound in nature. Widely used as staining and characterization reagents in cell biology and crucial for understanding the interactions in biological systems, lectins are a focal point of study in glycobiology. Yet the sheer breadth and depth of specificity for diverse oligosaccharide motifs has made studying lectins a largely piecemeal approach, with few options to generalize. Here, LectinOracle, a model combining transformer-based representations for proteins and graph convolutional neural networks for glycans to predict their interaction, is presented. Using a curated data set of 564,647 unique protein-glycan interactions, it is shown that LectinOracle predictions agree with literature-annotated specificities for a wide range of lectins. Using a range of specialized glycan arrays, it is shown that LectinOracle predictions generalize to new glycans and lectins, with qualitative and quantitative agreement with experimental data. It is further demonstrated that LectinOracle can be used to improve lectin classification, accelerate lectin directed evolution, predict epidemiological outcomes in the context of influenza virus, and analyze whole lectomes in host-microbe interactions. It is envisioned that the herein presented platform will advance both the study of lectins and their role in (glyco)biology.

Identifiants

pubmed: 34862760
doi: 10.1002/advs.202103807
pmc: PMC8728848
doi:

Substances chimiques

Lectins 0
Polysaccharides 0

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e2103807

Informations de copyright

© 2021 The Authors. Advanced Science published by Wiley-VCH GmbH.

Références

J Fungi (Basel). 2018 May 20;4(2):
pubmed: 29783771
J Biol Chem. 2012 Jun 8;287(24):20313-20
pubmed: 22493425
Glycoconj J. 2020 Oct;37(5):533-551
pubmed: 32860551
Sci Adv. 2021 Jun 9;7(24):
pubmed: 34108208
Cell Host Microbe. 2021 Jan 13;29(1):132-144.e3
pubmed: 33120114
Chem. 2021 Dec 9;7(12):3393-3411
pubmed: 34993358
Biochemistry. 2009 May 12;48(18):3822-7
pubmed: 19292456
Curr Opin Struct Biol. 2015 Oct;34:26-34
pubmed: 26163333
ACS Cent Sci. 2021 Jun 23;7(6):1009-1018
pubmed: 34235261
Biochem Biophys Res Commun. 2019 May 21;513(1):287-290
pubmed: 30954224
Int J Mol Sci. 2021 May 26;22(11):
pubmed: 34073266
Nucleic Acids Res. 2019 Jan 8;47(D1):D1236-D1244
pubmed: 30239928
J Biochem. 1990 Feb;107(2):190-6
pubmed: 2193930
Glycobiology. 2021 Nov 18;31(10):1240-1244
pubmed: 34192308
Proc Natl Acad Sci U S A. 2001 Jun 19;98(13):7158-63
pubmed: 11404456
Int J Infect Dis. 2009 Sep;13(5):589-99
pubmed: 19111494
Anal Chem. 2013 Jun 4;85(11):5397-404
pubmed: 23627407
NPJ Biofilms Microbiomes. 2021 Jun 15;7(1):49
pubmed: 34131152
Glycobiology. 2012 Jan;22(1):160-9
pubmed: 21875884
BMC Bioinformatics. 2020 Feb 4;21(1):42
pubmed: 32019496
Nat Chem Biol. 2021 Jul;17(7):806-816
pubmed: 33958792
Mol Cell Proteomics. 2013 Apr;12(4):1026-35
pubmed: 23399549
Food Chem Toxicol. 2019 Dec;134:110827
pubmed: 31542433
Adv Sci (Weinh). 2022 Jan;9(1):e2103807
pubmed: 34862760
J Biomol Struct Dyn. 1998 Apr;15(5):853-60
pubmed: 9619508
Curr Opin Struct Biol. 2014 Oct;28:14-22
pubmed: 25102772
Oncotarget. 2017 Apr 18;8(16):26896-26910
pubmed: 28460472
Nat Protoc. 2007;2(10):2529-37
pubmed: 17947995
Nature. 2021 Aug;596(7873):583-589
pubmed: 34265844
Sci Adv. 2019 Feb 13;5(2):eaav2554
pubmed: 30788437
J Biol Chem. 2006 Nov 17;281(46):35263-71
pubmed: 16987809
Nucleic Acids Res. 2021 Jul 2;49(W1):W293-W296
pubmed: 33885785
Proc Natl Acad Sci U S A. 2019 Feb 5;116(6):1958-1967
pubmed: 30670663
Front Mol Biosci. 2021 May 07;8:656439
pubmed: 34026832
Mol Cancer Ther. 2018 Jan;17(1):183-195
pubmed: 28939555
Front Mol Biosci. 2021 Sep 22;8:755577
pubmed: 34631801
Cell. 2006 Feb 24;124(4):715-27
pubmed: 16497583
Bioinformatics. 2014 Mar 1;30(5):706-11
pubmed: 24135264
J Clin Invest. 2005 Nov;115(11):3256-64
pubmed: 16239964
Nucleic Acids Res. 2021 Jul 2;49(W1):W469-W475
pubmed: 34038555
Sci Rep. 2017 Mar 06;7:43560
pubmed: 28262709
J Biol Chem. 1987 Feb 5;262(4):1596-601
pubmed: 3805045
PLoS One. 2023 Mar 16;18(3):e0282689
pubmed: 36928239
Biochem J. 2002 May 15;364(Pt 1):173-80
pubmed: 11988090
PLoS One. 2018 May 10;13(5):e0196727
pubmed: 29746492
Int J Mol Sci. 2017 Jul 17;18(7):
pubmed: 28714909
Nat Commun. 2021 Jun 11;12(1):3573
pubmed: 34117223
Environ Microbiol. 2009 Nov;11(11):2821-39
pubmed: 19650882
Proc Natl Acad Sci U S A. 2021 Apr 13;118(15):
pubmed: 33876751
Sci Rep. 2018 Sep 3;8(1):13139
pubmed: 30177739
Faraday Discuss. 2019 Oct 30;219(0):90-111
pubmed: 31338503
J Gen Virol. 2014 Feb;95(Pt 2):263-277
pubmed: 24225499
Nature. 2008 Dec 4;456(7222):648-52
pubmed: 18971931
Glycobiology. 2014 Jan;24(1):17-25
pubmed: 24056723
Nucleic Acids Res. 2021 Jan 8;49(D1):D1548-D1554
pubmed: 33174598
Chembiochem. 2020 Feb 3;21(3):423-427
pubmed: 31317590
Glycobiology. 2004 Nov;14(11):53R-62R
pubmed: 15229195
Cell Rep. 2021 Jun 15;35(11):109251
pubmed: 34133929
J Biol Chem. 2011 Sep 9;286(36):31610-22
pubmed: 21757734
J Biol Chem. 2006 Jul 21;281(29):20171-80
pubmed: 16704980
Glycobiology. 2021 Dec 18;31(11):1543-1556
pubmed: 34192315
ACS Chem Biol. 2022 Nov 18;17(11):2993-3012
pubmed: 35084820
Glycobiology. 2017 Jan;27(1):3-49
pubmed: 27558841

Auteurs

Jon Lundstrøm (J)

Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, 41390, Sweden.
Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 41390, Sweden.

Emma Korhonen (E)

Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, 41390, Sweden.
Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 41390, Sweden.

Frédérique Lisacek (F)

Swiss Institute of Bioinformatics, Geneva, 1227, Switzerland.
Computer Science Department, UniGe, Geneva, 1227, Switzerland.
Section of Biology, UniGe, Geneva, 1205, Switzerland.

Daniel Bojar (D)

Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg, 41390, Sweden.
Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg, 41390, Sweden.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Adenosine Triphosphate Adenosine Diphosphate Mitochondrial ADP, ATP Translocases Binding Sites Mitochondria

Conservation of the cooling agent binding pocket within the TRPM subfamily.

Kate Huffer, Matthew C S Denley, Elisabeth V Oskoui et al.
1.00
TRPM Cation Channels Animals Binding Sites Mice Pyrimidinones
Humans Breast Neoplasms Female Deep Learning Ultrasonography, Mammary

Classifications MeSH