circDeep: deep learning approach for circular RNA classification from other long non-coding RNA.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 01 2020
Historique:
received: 19 12 2018
revised: 13 06 2019
accepted: 01 07 2019
pubmed: 4 7 2019
medline: 29 8 2020
entrez: 4 7 2019
Statut: ppublish

Résumé

Over the past two decades, a circular form of RNA (circular RNA), produced through alternative splicing, has become the focus of scientific studies due to its major role as a microRNA (miRNA) activity modulator and its association with various diseases including cancer. Therefore, the detection of circular RNAs is vital to understanding their biogenesis and purpose. Prediction of circular RNA can be achieved in three steps: distinguishing non-coding RNAs from protein coding gene transcripts, separating short and long non-coding RNAs and predicting circular RNAs from other long non-coding RNAs (lncRNAs). However, the available tools are less than 80 percent accurate for distinguishing circular RNAs from other lncRNAs due to difficulty of classification. Therefore, the availability of a more accurate and fast machine learning method for the identification of circular RNAs, which considers the specific features of circular RNA, is essential to the development of systematic annotation. Here we present an End-to-End deep learning framework, circDeep, to classify circular RNA from other lncRNA. circDeep fuses an RCM descriptor, ACNN-BLSTM sequence descriptor and a conservation descriptor into high level abstraction descriptors, where the shared representations across different modalities are integrated. The experiments show that circDeep is not only faster than existing tools but also performs at an unprecedented level of accuracy by achieving a 12 percent increase in accuracy over the other tools. https://github.com/UofLBioinformatics/circDeep. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 31268128
pii: 5527751
doi: 10.1093/bioinformatics/btz537
pmc: PMC6956777
doi:

Substances chimiques

RNA, Circular 0
RNA, Long Noncoding 0

Types de publication

Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

73-80

Subventions

Organisme : NIGMS NIH HHS
ID : P20 GM103436
Pays : United States
Organisme : NIGMS NIH HHS
ID : R15 GM126446
Pays : United States

Informations de copyright

© The Author(s) 2019. Published by Oxford University Press.

Références

Cell Rep. 2015 Jan 13;10(2):170-7
pubmed: 25558066
Nucleic Acids Res. 2013 Jan;41(Database issue):D983-6
pubmed: 23175614
Mol Cell. 2017 Apr 6;66(1):22-37.e9
pubmed: 28344082
Curr Genomics. 2015 Oct;16(5):312-8
pubmed: 27047251
Genome Res. 2012 Sep;22(9):1775-89
pubmed: 22955988
RNA. 2014 Dec;20(12):1829-42
pubmed: 25404635
PLoS One. 2015 Nov 10;10(11):e0141287
pubmed: 26555596
Mol Biosyst. 2015 Aug;11(8):2219-26
pubmed: 26028480
Cell. 2009 Feb 20;136(4):777-93
pubmed: 19239895
Nature. 2013 Mar 21;495(7441):333-8
pubmed: 23446348
J Clin Endocrinol Metab. 2006 Jul;91(7):2689-95
pubmed: 16636128
Gene. 1995 Dec 29;167(1-2):245-8
pubmed: 8566785
Nature. 2013 Mar 21;495(7441):384-8
pubmed: 23446346
Bioinformatics. 2016 Jun 15;32(12):i121-i127
pubmed: 27307608
Mol Genet Genomics. 2018 Feb;293(1):137-149
pubmed: 28913654
Nature. 2003 May 15;423(6937):293-8
pubmed: 12714972
Mol Cell. 2017 Apr 6;66(1):1-2
pubmed: 28388436
Nat Rev Genet. 2009 Mar;10(3):155-9
pubmed: 19188922
Bioinformatics. 2017 Jul 15;33(14):i92-i101
pubmed: 28881969
PLoS Genet. 2010 Dec 02;6(12):e1001233
pubmed: 21151960
Cell. 2014 Sep 25;159(1):134-147
pubmed: 25242744
Front Genet. 2013 Dec 31;4:307
pubmed: 24427167
Sci Rep. 2015 Jan 27;5:8057
pubmed: 25624062
Mol Cell. 2017 Apr 6;66(1):9-21.e7
pubmed: 28344080
Hum Mol Genet. 2006 Apr 15;15 Spec No 1:R17-29
pubmed: 16651366

Auteurs

Mohamed Chaabane (M)

Department of Computer Engineering and Computer Science, Louisville, KY 40208, USA.

Robert M Williams (RM)

Department of Computer Engineering and Computer Science, Louisville, KY 40208, USA.

Austin T Stephens (AT)

Department of Computer Engineering and Computer Science, Louisville, KY 40208, USA.

Juw Won Park (JW)

Department of Computer Engineering and Computer Science, Louisville, KY 40208, USA.
KBRIN Bioinformatics Core, University of Louisville, Louisville, KY 40208, USA.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Humans Middle Aged Female Male Surveys and Questionnaires
Adolescent Child Female Humans Male

Classifications MeSH