Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.
deep belief networks (DBN)
deep learning
feature selection
similarity searching
virtual screening (VS)
Journal
Molecules (Basel, Switzerland)
ISSN: 1420-3049
Titre abrégé: Molecules
Pays: Switzerland
ID NLM: 100964009
Informations de publication
Date de publication:
29 Dec 2020
29 Dec 2020
Historique:
received:
19
11
2020
revised:
24
12
2020
accepted:
25
12
2020
entrez:
1
1
2021
pubmed:
2
1
2021
medline:
11
9
2021
Statut:
epublish
Résumé
Virtual screening (VS) is a computational practice applied in drug discovery research. VS is popularly applied in a computer-based search for new lead molecules based on molecular similarity searching. In chemical databases similarity searching is used to identify molecules that have similarities to a user-defined reference structure and is evaluated by quantitative measures of intermolecular structural similarity. Among existing approaches, 2D fingerprints are widely used. The similarity of a reference structure and a database structure is measured by the computation of association coefficients. In most classical similarity approaches, it is assumed that the molecular features in both biological and non-biologically-related activity carry the same weight. However, based on the chemical structure, it has been found that some distinguishable features are more important than others. Hence, this difference should be taken consideration by placing more weight on each important fragment. The main aim of this research is to enhance the performance of similarity searching by using multiple descriptors. In this paper, a deep learning method known as deep belief networks (DBN) has been used to reweight the molecule features. Several descriptors have been used for the MDL Drug Data Report (MDDR) dataset each of which represents different important features. The proposed method has been implemented with each descriptor individually to select the important features based on a new weight, with a lower error rate, and merging together all new features from all descriptors to produce a new descriptor for similarity searching. Based on the extensive experiments conducted, the results show that the proposed method outperformed several existing benchmark similarity methods, including Bayesian inference networks (BIN), the Tanimoto similarity method (TAN), adapted similarity measure of text processing (ASMTP) and the quantum-based similarity method (SQB). The results of this proposed multi-descriptor-based on Stack of deep belief networks method (SDBN) demonstrated a higher accuracy compared to existing methods on structurally heterogeneous datasets.
Identifiants
pubmed: 33383976
pii: molecules26010128
doi: 10.3390/molecules26010128
pmc: PMC7795308
pii:
doi:
Substances chimiques
Pharmaceutical Preparations
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : Ministry of Higher Education (MOHE) and the Research Management Centre (RMC) at the Universiti Teknologi Malaysia (UTM)
ID : VOT Q.J130000.2528.16H74 and R.J130000.7828.4F985
Références
Neural Comput. 2006 Jul;18(7):1527-54
pubmed: 16764513
J Comput Aided Mol Des. 2012 Mar;26(3):279-87
pubmed: 22249773
J Chem Inf Model. 2010 Jun 28;50(6):1012-20
pubmed: 20504032
Neurology. 2002 Apr 23;58(8):1214-20
pubmed: 11971089
J Comput Aided Mol Des. 2012 Nov;26(11):1247-66
pubmed: 23065321
Molecules. 2016 Apr 13;21(4):476
pubmed: 27089312
J Chem Inf Model. 2013 Jan 28;53(1):1-10
pubmed: 23297768
ScientificWorldJournal. 2012;2012:410914
pubmed: 22623895
Philos Trans A Math Phys Eng Sci. 2016 Apr 13;374(2065):20150202
pubmed: 26953178
Comb Chem High Throughput Screen. 2002 Mar;5(2):155-66
pubmed: 11966424
J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):435-42
pubmed: 12653506
Biomed Opt Express. 2013 Jan 1;4(1):1-14
pubmed: 23304643
Chem Biol Drug Des. 2018 Jan;91(1):137-152
pubmed: 28656625
ChemMedChem. 2009 Feb;4(2):210-8
pubmed: 19072820
J Biomol Screen. 2011 Oct;16(9):1081-8
pubmed: 21862688
J Chem Inf Model. 2014 Jan 27;54(1):30-6
pubmed: 24392938
Drug Discov Today Technol. 2013 Sep;10(3):e395-401
pubmed: 24050136
Molecules. 2015 Oct 02;20(10):18107-27
pubmed: 26445039
Science. 2006 Jul 28;313(5786):504-7
pubmed: 16873662
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3957-60
pubmed: 25570858
Drug Des Devel Ther. 2014 Sep 02;8:1195-210
pubmed: 25214764
Neural Comput. 2002 Aug;14(8):1771-800
pubmed: 12180402
J Med Chem. 2004 May 20;47(11):2743-9
pubmed: 15139752
BMC Bioinformatics. 2010 Nov 18;11:567
pubmed: 21087509
J Chem Inf Model. 2011 Jan 24;51(1):25-32
pubmed: 21155550
J Chem Inf Comput Sci. 2002 Nov-Dec;42(6):1407-14
pubmed: 12444738
Brain Struct Funct. 2016 Jun;221(5):2569-87
pubmed: 25993900