Fuzzy kernel evidence Random Forest for identifying pseudouridine sites.
RNA sequences
evidence Random Forest
fuzzy feature set
kernel method
pseudouridine sites
Journal
Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837
Informations de publication
Date de publication:
27 Mar 2024
27 Mar 2024
Historique:
received:
18
01
2024
revised:
27
03
2024
accepted:
31
03
2024
medline:
16
4
2024
pubmed:
16
4
2024
entrez:
15
4
2024
Statut:
ppublish
Résumé
Pseudouridine is an RNA modification that is widely distributed in both prokaryotes and eukaryotes, and plays a critical role in numerous biological activities. Despite its importance, the precise identification of pseudouridine sites through experimental approaches poses significant challenges, requiring substantial time and resources.Therefore, there is a growing need for computational techniques that can reliably and quickly identify pseudouridine sites from vast amounts of RNA sequencing data. In this study, we propose fuzzy kernel evidence Random Forest (FKeERF) to identify pseudouridine sites. This method is called PseU-FKeERF, which demonstrates high accuracy in identifying pseudouridine sites from RNA sequencing data. The PseU-FKeERF model selected four RNA feature coding schemes with relatively good performance for feature combination, and then input them into the newly proposed FKeERF method for category prediction. FKeERF not only uses fuzzy logic to expand the original feature space, but also combines kernel methods that are easy to interpret in general for category prediction. Both cross-validation tests and independent tests on benchmark datasets have shown that PseU-FKeERF has better predictive performance than several state-of-the-art methods. This new method not only improves the accuracy of pseudouridine site identification, but also provides a certain reference for disease control and related drug development in the future.
Identifiants
pubmed: 38622357
pii: 7645840
doi: 10.1093/bib/bbae169
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : National Natural Science Foundation of China
ID : 32270786
Organisme : Zhejiang Provincial Natural Science Foundation of China
ID : LY23F020003
Organisme : Municipal Government of Quzhou
ID : 2023D036
Informations de copyright
© The Author(s) 2024. Published by Oxford University Press.