Identification of Vesicle Transport Proteins

hypergraph learning local hyperplane membrane proteins protein function prediction transport proteins

Journal

Frontiers in genetics
ISSN: 1664-8021
Titre abrégé: Front Genet
Pays: Switzerland
ID NLM: 101560621

Informations de publication

Date de publication:
2022
Historique:
received: 02 06 2022
accepted: 22 06 2022
entrez: 1 8 2022
pubmed: 2 8 2022
medline: 2 8 2022
Statut: epublish

Résumé

The prediction of protein function is a common topic in the field of bioinformatics. In recent years, advances in machine learning have inspired a growing number of algorithms for predicting protein function. A large number of parameters and fairly complex neural networks are often used to improve the prediction performance, an approach that is time-consuming and costly. In this study, we leveraged traditional features and machine learning classifiers to boost the performance of vesicle transport protein identification and make the prediction process faster. We adopt the pseudo position-specific scoring matrix (PsePSSM) feature and our proposed new classifier hypergraph regularized k-local hyperplane distance nearest neighbour (HG-HKNN) to classify vesicular transport proteins. We address dataset imbalances with random undersampling. The results show that our strategy has an area under the receiver operating characteristic curve (AUC) of 0.870 and a Matthews correlation coefficient (MCC) of 0.53 on the benchmark dataset, outperforming all state-of-the-art methods on the same dataset, and other metrics of our model are also comparable to existing methods.

Identifiants

pubmed: 35910197
doi: 10.3389/fgene.2022.960388
pii: 960388
pmc: PMC9326258
doi:

Types de publication

Journal Article

Langues

eng

Pagination

960388

Informations de copyright

Copyright © 2022 Fan, Suo and Ding.

Déclaration de conflit d'intérêts

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Références

Brief Bioinform. 2021 Nov 5;22(6):
pubmed: 34308472
Brief Bioinform. 2021 Mar 22;22(2):1902-1917
pubmed: 32363401
Brief Bioinform. 2022 Mar 10;23(2):
pubmed: 35183059
Brief Bioinform. 2020 Mar 23;21(2):621-636
pubmed: 30649171
Nucleic Acids Res. 2021 Jul 2;49(W1):W5-W14
pubmed: 33893803
Brief Bioinform. 2022 Mar 10;23(2):
pubmed: 35134117
IEEE/ACM Trans Comput Biol Bioinform. 2021 Dec 09;PP:
pubmed: 34882559
Brief Bioinform. 2022 Mar 10;23(2):
pubmed: 35018418
J Theor Biol. 2019 Feb 7;462:230-239
pubmed: 30452958
Nucleic Acids Res. 2022 Jan 7;50(D1):D1417-D1431
pubmed: 34747471
Interdiscip Sci. 2021 Sep;13(3):349-361
pubmed: 33772722
Brain. 2021 Apr 12;144(3):909-923
pubmed: 33638639
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D258-61
pubmed: 14681407
Comput Struct Biotechnol J. 2021 Jul 19;19:4123-4131
pubmed: 34527186
Curr Opin Struct Biol. 2022 Feb;72:114-126
pubmed: 34649044
Aging Cell. 2021 May;20(5):e13365
pubmed: 33909313
Brief Bioinform. 2021 Nov 5;22(6):
pubmed: 34410342
Brief Bioinform. 2021 Jul 20;22(4):
pubmed: 33320936
Nat Protoc. 2022 Jan;17(1):129-151
pubmed: 34952956
Front Neurosci. 2021 Oct 25;15:773208
pubmed: 34759797
Bioinformatics. 2021 Mar 15;:
pubmed: 33720331
Brief Bioinform. 2020 Jul 15;21(4):1437-1447
pubmed: 31504150
Comput Math Methods Med. 2021 Jan 7;2021:6664362
pubmed: 33505515
IEEE/ACM Trans Comput Biol Bioinform. 2021 Nov-Dec;18(6):2768-2774
pubmed: 33481716
Nat Commun. 2021 Jun 17;12(1):3712
pubmed: 34140507
BMC Genomics. 2021 Jan 15;22(1):56
pubmed: 33451286
Cell Rep. 2021 Jan 12;34(2):108623
pubmed: 33440152
Brief Bioinform. 2021 Jul 20;22(4):
pubmed: 33099604
Nucleic Acids Res. 2022 Jan 7;50(D1):D1398-D1407
pubmed: 34718717
Nucleic Acids Res. 2019 Jan 8;47(D1):D506-D515
pubmed: 30395287
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1986-1995
pubmed: 31751248
IEEE J Biomed Health Inform. 2020 Oct;24(10):3012-3019
pubmed: 32142462
Comput Struct Biotechnol J. 2019 Oct 25;17:1245-1254
pubmed: 31921391
Brief Bioinform. 2020 Sep 25;21(5):1825-1836
pubmed: 31860715
Comput Math Methods Med. 2020 Oct 19;2020:8926750
pubmed: 33133228
Biochem Biophys Res Commun. 2007 Aug 24;360(2):339-45
pubmed: 17586467
Comput Biol Chem. 2020 Dec;89:107369
pubmed: 33099120
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1831-1840
pubmed: 31985437
Nucleic Acids Res. 2008 Jul 1;36(Web Server issue):W5-9
pubmed: 18440982
Brief Bioinform. 2020 May 21;21(3):1058-1068
pubmed: 31157371
Bioinformatics. 2022 Jan 06;:
pubmed: 34999771

Auteurs

Rui Fan (R)

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China.
Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.

Bing Suo (B)

Beidahuang Industry Group General Hospital, Harbin, China.

Yijie Ding (Y)

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou, China.

Classifications MeSH