AScirRNA: A novel computational approach to discover abiotic stress-responsive circular RNAs in plant genome.

Abiotic stress Agriculture Circular RNAs Computational biology Deep learning Machine learning

Journal

Computational biology and chemistry
ISSN: 1476-928X
Titre abrégé: Comput Biol Chem
Pays: England
ID NLM: 101157394

Informations de publication

Date de publication:
06 Sep 2024
Historique:
received: 19 03 2024
revised: 12 07 2024
accepted: 04 09 2024
medline: 13 9 2024
pubmed: 13 9 2024
entrez: 12 9 2024
Statut: aheadofprint

Résumé

In the realm of plant biology, understanding the intricate regulatory mechanisms governing stress responses stands as a pivotal pursuit. Circular RNAs (circRNAs), emerging as critical players in gene regulation, have garnered attention in recent days for their potential roles in abiotic stress adaptation. A comprehensive grasp of circRNAs' functions in stress response offers avenues for breeders to manipulating plants to develop abiotic stress resistant crop cultivars to thrive in challenging climates. This study pioneers a machine learning-based model for predicting abiotic stress-responsive circRNAs. The K-tuple nucleotide composition (KNC) and Pseudo KNC (PKNC) features were utilized to numerically represent circRNAs. Three different feature selection strategies were employed to select relevant and non-redundant features. Eight shallow and four deep learning algorithms were evaluated to build the final predictive model. Following five-fold cross-validation process, XGBoost learning algorithm demonstrated superior performance with LightGBM-chosen 260 KNC features (Accuracy: 74.55 %, auROC: 81.23 %, auPRC: 76.52 %) and 160 PKNC features (Accuracy: 74.32 %, auROC: 81.04 %, auPRC: 76.43 %), over other combinations of learning algorithms and feature selection techniques. Further, the robustness of the developed models were evaluated using an independent test dataset, where the overall accuracy, auROC and auPRC were found to be 73.13 %, 72.34 % and 72.68 % for KNC feature set and 73.52 %, 79.53 % and 73.09 % for PKNC feature set, respectively. This computational approach was also integrated into an online prediction tool, AScirRNA (https://iasri-sg.icar.gov.in/ascirna/) for easy prediction by the users. Both the proposed model and the developed tool are poised to augment ongoing efforts in identifying stress-responsive circRNAs in plants.

Identifiants

pubmed: 39265460
pii: S1476-9271(24)00193-2
doi: 10.1016/j.compbiolchem.2024.108205
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

108205

Informations de copyright

Copyright © 2024 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Upendra Kumar Pradhan (UK)

Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: upendra.pradhan@icar.gov.in.

Prasanjit Behera (P)

Department of Bioinformatics, Odisha University of Agriculture & Technology, Bhubaneswar, Odisha 751003, India. Electronic address: prasanjit.behera20@gmail.com.

Ritwika Das (R)

Division of Agricultural Bioinformatics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: ritwika.das@icar.gov.in.

Sanchita Naha (S)

Division of Computer Applications, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: sanchita.naha@icar.gov.in.

Ajit Gupta (A)

Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: ajit@icar.gov.in.

Rajender Parsad (R)

ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: rajender.parsad@icar.gov.in.

Sukanta Kumar Pradhan (SK)

Department of Bioinformatics, Odisha University of Agriculture & Technology, Bhubaneswar, Odisha 751003, India. Electronic address: ksukantapradhan@gmail.com.

Prabina Kumar Meher (PK)

Division of Statistical Genetics, ICAR-Indian Agricultural Statistics Research Institute, PUSA, New Delhi 110012, India. Electronic address: prabina.meher@icar.gov.in.

Classifications MeSH