Interpretation of Ligand-Based Activity Cliff Prediction Models Using the Matched Molecular Pair Kernel.
SHapley Additive exPlanations
activity-cliff
chemoinformatics
matched molecular pair
model interpretation
support vector machine
Journal
Molecules (Basel, Switzerland)
ISSN: 1420-3049
Titre abrégé: Molecules
Pays: Switzerland
ID NLM: 100964009
Informations de publication
Date de publication:
13 Aug 2021
13 Aug 2021
Historique:
received:
16
07
2021
revised:
09
08
2021
accepted:
10
08
2021
entrez:
27
8
2021
pubmed:
28
8
2021
medline:
28
8
2021
Statut:
epublish
Résumé
Activity cliffs (ACs) are formed by two structurally similar compounds with a large difference in potency. Accurate AC prediction is expected to help researchers' decisions in the early stages of drug discovery. Previously, predictive models based on matched molecular pair (MMP) cliffs have been proposed. However, the proposed methods face a challenge of interpretability due to the black-box character of the predictive models. In this study, we developed interpretable MMP fingerprints and modified a model-specific interpretation approach for models based on a support vector machine (SVM) and MMP kernel. We compared important features highlighted by this SVM-based interpretation approach and the SHapley Additive exPlanations (SHAP) as a major model-independent approach. The model-specific approach could capture the difference between AC and non-AC, while SHAP assigned high weights to the features not present in the test instances. For specific MMPs, the feature weights mapped by the SVM-based interpretation method were in agreement with the previously confirmed binding knowledge from X-ray co-crystal structures, indicating that this method is able to interpret the AC prediction model in a chemically intuitive manner.
Identifiants
pubmed: 34443503
pii: molecules26164916
doi: 10.3390/molecules26164916
pmc: PMC8401777
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Références
J Chem Theory Comput. 2019 Mar 12;15(3):1884-1895
pubmed: 30776226
J Chem Inf Model. 2010 Mar 22;50(3):339-48
pubmed: 20121045
Mol Inform. 2020 Dec;39(12):e2000103
pubmed: 32830451
J Chem Inf Model. 2016 Sep 26;56(9):1631-40
pubmed: 27564682
J Med Chem. 2020 Aug 27;63(16):8761-8777
pubmed: 31512867
J Mol Biol. 2010 Apr 9;397(4):1042-54
pubmed: 20156458
J Chem Inf Model. 2010 May 24;50(5):742-54
pubmed: 20426451
J Comput Aided Mol Des. 2020 Oct;34(10):1013-1026
pubmed: 32361862
J Chem Inf Model. 2015 Jun 22;55(6):1136-47
pubmed: 25988274
J Med Chem. 2011 Apr 28;54(8):2944-51
pubmed: 21443196
J Chem Inf Model. 2006 Jul-Aug;46(4):1535
pubmed: 16859285
J Med Chem. 2012 Apr 12;55(7):2932-42
pubmed: 22236250
J Chem Inf Model. 2019 Jun 24;59(6):2656-2663
pubmed: 31059251
Chem Biol Drug Des. 2015 Dec;86(6):1458-65
pubmed: 26094578
J Chem Inf Model. 2019 Sep 23;59(9):3679-3691
pubmed: 31381335
Neural Netw. 2005 Oct;18(8):1093-110
pubmed: 16157471
J Chem Inf Model. 2012 May 25;52(5):1138-45
pubmed: 22489665
Nucleic Acids Res. 2019 Jan 8;47(D1):D930-D940
pubmed: 30398643
Comput Struct Biotechnol J. 2016 Dec 13;15:86-90
pubmed: 28066532
J Chem Inf Model. 2012 Sep 24;52(9):2354-65
pubmed: 22894655
J Comput Aided Mol Des. 2021 Mar 19;:
pubmed: 33740200