A Heterogeneous Information Network Model for Long Non-Coding RNA Function Prediction.


Journal

IEEE/ACM transactions on computational biology and bioinformatics
ISSN: 1557-9964
Titre abrégé: IEEE/ACM Trans Comput Biol Bioinform
Pays: United States
ID NLM: 101196755

Informations de publication

Date de publication:
Historique:
pubmed: 6 8 2020
medline: 19 2 2022
entrez: 6 8 2020
Statut: ppublish

Résumé

Exciting information on the functional roles played by long non-coding RNA (lncRNA) has drawn substantial research attention these days. With the advent of techniques such as RNA-Seq, thousands of lncRNAs are identified in very short time spans. However, due to the poor annotation rate, only a few of them are functionally characterised. The wet lab experiments to elucidate lncRNA functions are challenging, slow progressing and sometimes prohibitively expensive. This work attempts to solve the crucial problem of developing computational methods to predict lncRNA functions. The model presented here, predicts the functions of lncRNAs by making use of a meta-path based measure, AvgSim on a Heterogeneous Information Network (HIN). The network is constructed from existing protein and function association data of lncRNAs, lncRNA co-expression data and protein protein interaction data. Out of the 2,758 lncRNA considered for the experiment, the proposed method predicts possible functions for 2,695 lncRNAs with an accuracy of 73.68 percent and found to perform better than the other state-of-the-art approaches for an independent test set. A case study of two well-known lncRNAs (HOTAIR and H19) is conducted and the associated functions are identified. The results were validated using experimental evidence from the literature. The script and data used for the implementation of the model is freely available at: http://bdbl.nitc.ac.in/LncFunPred/index.html.

Identifiants

pubmed: 32750859
doi: 10.1109/TCBB.2020.3000518
doi:

Substances chimiques

Proteins 0
RNA, Long Noncoding 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

255-266

Auteurs

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Colorectal Neoplasms Biomarkers, Tumor Prognosis Gene Expression Regulation, Neoplastic
Metabolic Networks and Pathways Saccharomyces cerevisiae Computational Biology Synthetic Biology Computer Simulation

Classifications MeSH