Redesigning plant specialized metabolism with supervised machine learning using publicly available reactome data.

Encoding reactome data; Neural-network encoders; Plant specialized metabolism; Predicting enzyme promiscuity; Predicting reaction-feasibility; Reactome data-mining; Retrobiosynthesis; Supervised machine learning

Journal

Computational and structural biotechnology journal
ISSN: 2001-0370
Abbreviated title: Comput Struct Biotechnol J
Country: Netherlands
NLM ID: 101585369

Publication information

Publication date:
2023
History:
received: 2022-10-27
revised: 2023-01-12
accepted: 2023-01-12
entrez: 2023-03-06
pubmed: 2023-03-07
medline: 2023-03-07
Status: epublish

Abstract

The immense structural diversity of the products and intermediates of plant specialized metabolism (specialized metabolites) makes them rich sources of therapeutics, nutrients, and other useful materials. With reactome data accumulating rapidly in biological and chemical databases, and with recent advances in machine learning, this review outlines how supervised machine learning can exploit these data to design new compounds and pathways. We first examine the sources from which reactome data can be obtained, then explain the different methods for encoding reactome data for machine learning. Finally, we discuss current developments in supervised machine learning that can be applied to different aspects of redesigning plant specialized metabolism.

Identifiers

pubmed: 36874159
doi: 10.1016/j.csbj.2023.01.013
pii: S2001-0370(23)00013-2
pmc: PMC9976193

Publication types

Journal Article Review

Languages

eng

Pagination

1639-1650

Copyright information

© 2023 The Author(s).

Conflict of interest statement

The authors declare no conflict of interest.

Authors

Peng Ken Lim (PK)

School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.

Irene Julca (I)

School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.

Marek Mutwil (M)

School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.

MeSH classifications