Improved Deep Learning Based Method for Molecular Similarity Searching Using Stack of Deep Belief Networks.


Journal

Molecules (Basel, Switzerland)
ISSN: 1420-3049
Titre abrégé: Molecules
Pays: Switzerland
ID NLM: 100964009

Informations de publication

Date de publication:
29 Dec 2020
Historique:
received: 19 11 2020
revised: 24 12 2020
accepted: 25 12 2020
entrez: 1 1 2021
pubmed: 2 1 2021
medline: 11 9 2021
Statut: epublish

Résumé

Virtual screening (VS) is a computational practice applied in drug discovery research. VS is popularly applied in a computer-based search for new lead molecules based on molecular similarity searching. In chemical databases similarity searching is used to identify molecules that have similarities to a user-defined reference structure and is evaluated by quantitative measures of intermolecular structural similarity. Among existing approaches, 2D fingerprints are widely used. The similarity of a reference structure and a database structure is measured by the computation of association coefficients. In most classical similarity approaches, it is assumed that the molecular features in both biological and non-biologically-related activity carry the same weight. However, based on the chemical structure, it has been found that some distinguishable features are more important than others. Hence, this difference should be taken consideration by placing more weight on each important fragment. The main aim of this research is to enhance the performance of similarity searching by using multiple descriptors. In this paper, a deep learning method known as deep belief networks (DBN) has been used to reweight the molecule features. Several descriptors have been used for the MDL Drug Data Report (MDDR) dataset each of which represents different important features. The proposed method has been implemented with each descriptor individually to select the important features based on a new weight, with a lower error rate, and merging together all new features from all descriptors to produce a new descriptor for similarity searching. Based on the extensive experiments conducted, the results show that the proposed method outperformed several existing benchmark similarity methods, including Bayesian inference networks (BIN), the Tanimoto similarity method (TAN), adapted similarity measure of text processing (ASMTP) and the quantum-based similarity method (SQB). The results of this proposed multi-descriptor-based on Stack of deep belief networks method (SDBN) demonstrated a higher accuracy compared to existing methods on structurally heterogeneous datasets.

Identifiants

pubmed: 33383976
pii: molecules26010128
doi: 10.3390/molecules26010128
pmc: PMC7795308
pii:
doi:

Substances chimiques

Pharmaceutical Preparations 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Ministry of Higher Education (MOHE) and the Research Management Centre (RMC) at the Universiti Teknologi Malaysia (UTM)
ID : VOT Q.J130000.2528.16H74 and R.J130000.7828.4F985

Références

Neural Comput. 2006 Jul;18(7):1527-54
pubmed: 16764513
J Comput Aided Mol Des. 2012 Mar;26(3):279-87
pubmed: 22249773
J Chem Inf Model. 2010 Jun 28;50(6):1012-20
pubmed: 20504032
Neurology. 2002 Apr 23;58(8):1214-20
pubmed: 11971089
J Comput Aided Mol Des. 2012 Nov;26(11):1247-66
pubmed: 23065321
Molecules. 2016 Apr 13;21(4):476
pubmed: 27089312
J Chem Inf Model. 2013 Jan 28;53(1):1-10
pubmed: 23297768
ScientificWorldJournal. 2012;2012:410914
pubmed: 22623895
Philos Trans A Math Phys Eng Sci. 2016 Apr 13;374(2065):20150202
pubmed: 26953178
Comb Chem High Throughput Screen. 2002 Mar;5(2):155-66
pubmed: 11966424
J Chem Inf Comput Sci. 2003 Mar-Apr;43(2):435-42
pubmed: 12653506
Biomed Opt Express. 2013 Jan 1;4(1):1-14
pubmed: 23304643
Chem Biol Drug Des. 2018 Jan;91(1):137-152
pubmed: 28656625
ChemMedChem. 2009 Feb;4(2):210-8
pubmed: 19072820
J Biomol Screen. 2011 Oct;16(9):1081-8
pubmed: 21862688
J Chem Inf Model. 2014 Jan 27;54(1):30-6
pubmed: 24392938
Drug Discov Today Technol. 2013 Sep;10(3):e395-401
pubmed: 24050136
Molecules. 2015 Oct 02;20(10):18107-27
pubmed: 26445039
Science. 2006 Jul 28;313(5786):504-7
pubmed: 16873662
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:3957-60
pubmed: 25570858
Drug Des Devel Ther. 2014 Sep 02;8:1195-210
pubmed: 25214764
Neural Comput. 2002 Aug;14(8):1771-800
pubmed: 12180402
J Med Chem. 2004 May 20;47(11):2743-9
pubmed: 15139752
BMC Bioinformatics. 2010 Nov 18;11:567
pubmed: 21087509
J Chem Inf Model. 2011 Jan 24;51(1):25-32
pubmed: 21155550
J Chem Inf Comput Sci. 2002 Nov-Dec;42(6):1407-14
pubmed: 12444738
Brain Struct Funct. 2016 Jun;221(5):2569-87
pubmed: 25993900

Auteurs

Maged Nasser (M)

School of Computing, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia.

Naomie Salim (N)

School of Computing, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia.

Hentabli Hamza (H)

School of Computing, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia.

Faisal Saeed (F)

College of Computer Science and Engineering, Taibah University, Medina 344, Saudi Arabia.

Idris Rabiu (I)

School of Computing, Universiti Teknologi Malaysia, Johor Bahru 81310, Malaysia.

Articles similaires

Humans Pharmaceutical Preparations Drug Utilization Prescription Drugs
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Alzheimer Disease Humans Regression Analysis Quantitative Structure-Activity Relationship Drug Design

Unsupervised learning for real-time and continuous gait phase detection.

Dollaporn Anopas, Yodchanan Wongsawat, Jetsada Arnin
1.00
Humans Gait Neural Networks, Computer Unsupervised Machine Learning Walking

Classifications MeSH