Identification of plant vacuole proteins by using graph neural network and contact maps.
AlphaFold2
Graph convolutional neural network
Peroxisomal proteins
Plant vacuole proteins
SeqVec
Journal
BMC bioinformatics
ISSN: 1471-2105
Abbreviated title: BMC Bioinformatics
Country: England
NLM ID: 100965194
Publication information
Publication date:
22 Sep 2023
History:
received: 13 May 2023
accepted: 12 Sep 2023
medline: 25 Sep 2023
pubmed: 23 Sep 2023
entrez: 22 Sep 2023
Status:
epublish
Abstract
Plant vacuoles are essential organelles in the growth and development of plants, and accurate identification of their proteins is crucial for understanding their biological properties. In this study, we developed a novel model called GraphIdn for the identification of plant vacuole proteins. The model uses SeqVec, a deep representation learning model, to initialize the amino acid sequence. We utilized the AlphaFold2 algorithm to obtain the structural information of corresponding plant vacuole proteins, and then fed the calculated contact maps into a graph convolutional neural network. GraphIdn achieved accuracy values of 88.51% and 89.93% in independent testing and fivefold cross-validation, respectively, outperforming previous state-of-the-art predictors. As far as we know, this is the first model to use predicted protein topology structure graphs to identify plant vacuole proteins. Furthermore, we assessed the effectiveness and generalization capability of our GraphIdn model by applying it to identify and locate peroxisomal proteins, which yielded promising outcomes. The source code and datasets can be accessed at https://github.com/SJNNNN/GraphIdn.
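The pipeline described in the abstract (per-residue embeddings as node features, a contact map as the graph, a graph convolution on top) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation, which is available in the linked repository. The 8 Å C-alpha distance threshold, the 1024-dimensional feature size, the single layer, the mean-pooling readout, and the function names `contact_map` and `gcn_layer` are all assumptions made for illustration, and the coordinates are random toy data rather than an AlphaFold2 prediction.

```python
import numpy as np

def contact_map(coords, threshold=8.0):
    """Binary contact map from (L, 3) C-alpha coordinates.

    The 8 angstrom threshold is an assumed, commonly used cutoff;
    the abstract does not state the value used by GraphIdn.
    """
    diff = coords[:, None, :] - coords[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=-1))
    # The diagonal distance is 0, so every residue contacts itself (self-loops).
    return (dist < threshold).astype(np.float32)

def gcn_layer(adj, features, weight):
    """One graph-convolution step: ReLU(D^-1/2 A D^-1/2 X W)."""
    deg = adj.sum(axis=1)  # >= 1 for every node because of the self-loops
    d_inv_sqrt = 1.0 / np.sqrt(deg)
    norm_adj = adj * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]
    return np.maximum(norm_adj @ features @ weight, 0.0)

rng = np.random.default_rng(0)
n_residues = 50

# Toy random-walk chain standing in for predicted C-alpha coordinates.
coords = np.cumsum(rng.normal(scale=2.0, size=(n_residues, 3)), axis=0)
adj = contact_map(coords)

# Random stand-in for per-residue sequence embeddings (assumed 1024-dim).
features = rng.normal(size=(n_residues, 1024))
weight = rng.normal(scale=0.05, size=(1024, 64))

hidden = gcn_layer(adj, features, weight)  # (n_residues, 64) node states
graph_embedding = hidden.mean(axis=0)      # assumed mean-pool readout, shape (64,)
```

In a real classifier the pooled vector would feed a trained output layer; here the weights are random, so the values themselves carry no meaning and only the data flow is illustrated.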
Identifiers
pubmed: 37740195
doi: 10.1186/s12859-023-05475-x
pii: 10.1186/s12859-023-05475-x
pmc: PMC10517492
Chemical substances
Plant Proteins
0
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
357
Grants
Agency: Natural Science Foundation of Shandong Province
ID: ZR2021MF036
Agency: National Natural Science Foundation of China
ID: 31872415
Copyright information
© 2023. BioMed Central Ltd., part of Springer Nature.