A learning-based method to predict LncRNA-disease associations by combining CNN and ELM.
Association prediction
CNN
Disease
ELM
lncRNA
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
22 Mar 2022
22 Mar 2022
Historique:
received:
02
10
2021
accepted:
07
10
2021
entrez:
23
3
2022
pubmed:
24
3
2022
medline:
25
3
2022
Statut:
epublish
Résumé
lncRNAs play a critical role in numerous biological processes and life activities, especially diseases. Considering that traditional wet experiments for identifying uncovered lncRNA-disease associations is limited in terms of time consumption and labor cost. It is imperative to construct reliable and efficient computational models as addition for practice. Deep learning technologies have been proved to make impressive contributions in many areas, but the feasibility of it in bioinformatics has not been adequately verified. In this paper, a machine learning-based model called LDACE was proposed to predict potential lncRNA-disease associations by combining Extreme Learning Machine (ELM) and Convolutional Neural Network (CNN). Specifically, the representation vectors are constructed by integrating multiple types of biology information including functional similarity and semantic similarity. Then, CNN is applied to mine both local and global features. Finally, ELM is chosen to carry out the prediction task to detect the potential lncRNA-disease associations. The proposed method achieved remarkable Area Under Receiver Operating Characteristic Curve of 0.9086 in Leave-one-out cross-validation and 0.8994 in fivefold cross-validation, respectively. In addition, 2 kinds of case studies based on lung cancer and endometrial cancer indicate the robustness and efficiency of LDACE even in a real environment. Substantial results demonstrated that the proposed model is expected to be an auxiliary tool to guide and assist biomedical research, and the close integration of deep learning and biology big data will provide life sciences with novel insights.
Sections du résumé
BACKGROUND
BACKGROUND
lncRNAs play a critical role in numerous biological processes and life activities, especially diseases. Considering that traditional wet experiments for identifying uncovered lncRNA-disease associations is limited in terms of time consumption and labor cost. It is imperative to construct reliable and efficient computational models as addition for practice. Deep learning technologies have been proved to make impressive contributions in many areas, but the feasibility of it in bioinformatics has not been adequately verified.
RESULTS
RESULTS
In this paper, a machine learning-based model called LDACE was proposed to predict potential lncRNA-disease associations by combining Extreme Learning Machine (ELM) and Convolutional Neural Network (CNN). Specifically, the representation vectors are constructed by integrating multiple types of biology information including functional similarity and semantic similarity. Then, CNN is applied to mine both local and global features. Finally, ELM is chosen to carry out the prediction task to detect the potential lncRNA-disease associations. The proposed method achieved remarkable Area Under Receiver Operating Characteristic Curve of 0.9086 in Leave-one-out cross-validation and 0.8994 in fivefold cross-validation, respectively. In addition, 2 kinds of case studies based on lung cancer and endometrial cancer indicate the robustness and efficiency of LDACE even in a real environment.
CONCLUSIONS
CONCLUSIONS
Substantial results demonstrated that the proposed model is expected to be an auxiliary tool to guide and assist biomedical research, and the close integration of deep learning and biology big data will provide life sciences with novel insights.
Identifiants
pubmed: 35317723
doi: 10.1186/s12859-022-04611-3
pii: 10.1186/s12859-022-04611-3
pmc: PMC8941737
doi:
Substances chimiques
RNA, Long Noncoding
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
622Subventions
Organisme : National Key R&D Program of China
ID : 2018YFA0902600
Organisme : National Key R&D Program of China
ID : 2018AAA0100100
Organisme : National Natural Science Foundation of China
ID : 61732012
Organisme : National Natural Science Foundation of China
ID : 61772370
Organisme : National Natural Science Foundation of China
ID : 61932008
Organisme : National Natural Science Foundation of China
ID : 61772357
Organisme : National Natural Science Foundation of China
ID : 62002297
Organisme : National Natural Science Foundation of China
ID : 62002266
Organisme : National Natural Science Foundation of China
ID : 62073231
Informations de copyright
© 2022. The Author(s).
Références
Lung Cancer. 2015 May;88(2):147-53
pubmed: 25758555
Nucleic Acids Res. 2016 Jan 4;44(D1):D980-5
pubmed: 26481356
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1743-1751
pubmed: 32946398
Bioinformatics. 2013 Oct 15;29(20):2617-24
pubmed: 24002109
Brief Funct Genomics. 2019 Feb 14;18(1):58-82
pubmed: 30247501
IEEE/ACM Trans Comput Biol Bioinform. 2020 Sep-Oct;17(5):1605-1612
pubmed: 30969931
Nucleic Acids Res. 2018 Jan 4;46(D1):D371-D374
pubmed: 29106639
Bioinformatics. 2018 Oct 1;34(19):3357-3364
pubmed: 29718113
Nucleic Acids Res. 2013 Jan;41(Database issue):D983-6
pubmed: 23175614
IEEE Trans Cybern. 2020 May;50(5):1798-1809
pubmed: 30969935
Nature. 2012 Sep 6;489(7414):101-8
pubmed: 22955620
IEEE Trans Inf Technol Biomed. 2009 Jul;13(4):599-607
pubmed: 19369170
IEEE Trans Image Process. 2012 Oct;21(10):4492-7
pubmed: 22711773
IEEE Trans Neural Netw. 2008 Dec;19(12):2099-115
pubmed: 19054734
Nucleic Acids Res. 2005 Jan 1;33(Database issue):D112-5
pubmed: 15608158
Science. 2004 Dec 24;306(5705):2242-6
pubmed: 15539566
Brief Bioinform. 2021 Jul 20;22(4):
pubmed: 33005921
IEEE Trans Cybern. 2020 May;50(5):1844-1855
pubmed: 30835233
Brief Bioinform. 2017 Jul 1;18(4):558-576
pubmed: 27345524
Neurobiol Dis. 2012 May;46(2):245-54
pubmed: 22202438
Bioinformatics. 2018 Jan 1;34(1):33-40
pubmed: 28968797
Database (Oxford). 2017 Jan 1;2017:
pubmed: 29220444
Brief Bioinform. 2022 Jan 17;23(1):
pubmed: 34471921
Tumour Biol. 2013 Apr;34(2):613-20
pubmed: 23359273
iScience. 2019 Sep 27;19:786-795
pubmed: 31494494
IEEE Trans Image Process. 2012 Nov;21(11):4667-72
pubmed: 22875247
Bioinformatics. 2010 Nov 1;26(21):2744-51
pubmed: 20817744
IEEE Trans Neural Netw Learn Syst. 2019 Feb;30(2):601-614
pubmed: 30004892
PLoS One. 2014 Jan 02;9(1):e84408
pubmed: 24392133
Bioinformatics. 2005 Jun;21 Suppl 1:i38-46
pubmed: 15961482
Neural Netw. 2019 Apr;112:1-14
pubmed: 30716617
BMC Med Genomics. 2018 Dec 31;11(Suppl 6):114
pubmed: 30598113
Bioinformatics. 2010 Jul 1;26(13):1644-50
pubmed: 20439255
Database (Oxford). 2017 Jan 1;2017:
pubmed: 29220470
Cell Stem Cell. 2014 Jun 5;14(6):752-61
pubmed: 24905165
Nucleic Acids Res. 2019 Jan 8;47(D1):D1034-D1037
pubmed: 30285109
Mol Biosyst. 2012 Jul 6;8(7):1970-8
pubmed: 22538619
Brief Bioinform. 2016 Jul;17(4):696-712
pubmed: 26283676
Bioinformatics. 2018 Dec 15;34(24):4256-4265
pubmed: 29939227
Brief Bioinform. 2021 Sep 2;22(5):
pubmed: 33498086
Brief Bioinform. 2019 Mar 22;20(2):515-539
pubmed: 29045685
IEEE Trans Cybern. 2021 Jul 08;PP:
pubmed: 34236991
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1793-1800
pubmed: 32960766
Sci Rep. 2015 Jun 10;5:11338
pubmed: 26061969
Mol Biosyst. 2015 Mar;11(3):760-9
pubmed: 25502053
Nucleic Acids Res. 2018 Jan 4;46(D1):D1074-D1082
pubmed: 29126136
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul 07;PP:
pubmed: 32750884