Fuzzy kernel evidence Random Forest for identifying pseudouridine sites.

RNA sequences evidence Random Forest fuzzy feature set kernel method pseudouridine sites

Journal

Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837

Informations de publication

Date de publication:
27 Mar 2024
Historique:
received: 18 01 2024
revised: 27 03 2024
accepted: 31 03 2024
medline: 16 4 2024
pubmed: 16 4 2024
entrez: 15 4 2024
Statut: ppublish

Résumé

Pseudouridine is an RNA modification that is widely distributed in both prokaryotes and eukaryotes, and plays a critical role in numerous biological activities. Despite its importance, the precise identification of pseudouridine sites through experimental approaches poses significant challenges, requiring substantial time and resources.Therefore, there is a growing need for computational techniques that can reliably and quickly identify pseudouridine sites from vast amounts of RNA sequencing data. In this study, we propose fuzzy kernel evidence Random Forest (FKeERF) to identify pseudouridine sites. This method is called PseU-FKeERF, which demonstrates high accuracy in identifying pseudouridine sites from RNA sequencing data. The PseU-FKeERF model selected four RNA feature coding schemes with relatively good performance for feature combination, and then input them into the newly proposed FKeERF method for category prediction. FKeERF not only uses fuzzy logic to expand the original feature space, but also combines kernel methods that are easy to interpret in general for category prediction. Both cross-validation tests and independent tests on benchmark datasets have shown that PseU-FKeERF has better predictive performance than several state-of-the-art methods. This new method not only improves the accuracy of pseudouridine site identification, but also provides a certain reference for disease control and related drug development in the future.

Identifiants

pubmed: 38622357
pii: 7645840
doi: 10.1093/bib/bbae169
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : National Natural Science Foundation of China
ID : 32270786
Organisme : Zhejiang Provincial Natural Science Foundation of China
ID : LY23F020003
Organisme : Municipal Government of Quzhou
ID : 2023D036

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Auteurs

Mingshuai Chen (M)

Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu 611731, China.
Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324003, China.

Mingai Sun (M)

Beidahuang Industry Group General Hospital, Harbin 150001, China.

Xi Su (X)

Foshan Women and Children Hospital, Foshan 528000, China.

Prayag Tiwari (P)

School of Information Technology, Halmstad University, Sweden.

Yijie Ding (Y)

Yangtze Delta Region Institute (Quzhou), University of Electronic Science and Technology of China, Quzhou 324003, China.

Classifications MeSH