PseU-Pred: An ensemble model for accurate identification of pseudouridine sites.

Bioinformatics Cross-validation Genomics Proteomics Sequence analysis Statistical moments

Journal

Analytical biochemistry
ISSN: 1096-0309
Titre abrégé: Anal Biochem
Pays: United States
ID NLM: 0370535

Informations de publication

Date de publication:
01 09 2023
Historique:
received: 09 03 2023
revised: 25 06 2023
accepted: 08 07 2023
medline: 27 7 2023
pubmed: 13 7 2023
entrez: 12 7 2023
Statut: ppublish

Résumé

Pseudouridine (ψ) is reported to occur frequently in all types of RNA. This uridine modification has been shown to be essential for processes such as RNA stability and stress response. Also, it is linked to a few human diseases, such as prostate cancer, anemia, etc. A few laboratory techniques, such as Pseudo-seq and N3-CMC-enriched Pseudouridine sequencing (CeU-Seq) are used for detecting ψ sites. However, these are laborious and drawn-out methods. The convenience of sequencing data has enabled the development of computationally intelligent models for improving ψ site identification methods. The proposed work provides a prediction model for the identification of ψ sites through popular ensemble methods such as stacking, bagging, and boosting. Features were obtained through a novel feature extraction mechanism with the assimilation of statistical moments, which were used to train ensemble models. The cross-validation test and independent set test were used to evaluate the precision of the trained models. The proposed model outperformed the preexisting predictors and revealed 87% accuracy, 0.90 specificity, 0.85 sensitivity, and a 0.75 Matthews correlation coefficient. A web server has been built and is available publicly for the researchers at https://taseersuleman-y-test-pseu-pred-c2wmtj.streamlit.app/.

Identifiants

pubmed: 37437648
pii: S0003-2697(23)00212-9
doi: 10.1016/j.ab.2023.115247
pii:
doi:

Substances chimiques

Pseudouridine 1445-07-4
RNA 63231-63-0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

115247

Informations de copyright

Copyright © 2023 Elsevier Inc. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Muhammad Taseer Suleman (MT)

Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, 54770, Pakistan. Electronic address: s2018288002@umt.edu.pk.

Yaser Daanial Khan (YD)

Department of Computer Science, School of Systems and Technology, University of Management and Technology, Lahore, 54770, Pakistan. Electronic address: yaser.khan@umt.edu.pk.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH