DEL-Dock: Molecular Docking-Enabled Modeling of DNA-Encoded Libraries.


Journal

Journal of chemical information and modeling
ISSN: 1549-960X
Titre abrégé: J Chem Inf Model
Pays: United States
ID NLM: 101230060

Informations de publication

Date de publication:
08 05 2023
Historique:
medline: 9 5 2023
pubmed: 20 4 2023
entrez: 20 04 2023
Statut: ppublish

Résumé

DNA-encoded library (DEL) technology has enabled significant advances in hit identification by enabling efficient testing of combinatorially generated molecular libraries. DEL screens measure protein binding affinity though sequencing reads of molecules tagged with unique DNA barcodes that survive a series of selection experiments. Computational models have been deployed to learn the latent binding affinities that are correlated to the sequenced count data; however, this correlation is often obfuscated by various sources of noise introduced in its complicated data-generation process. In order to denoise DEL count data and screen for molecules with good binding affinity, computational models require the correct assumptions in their modeling structure to capture the correct signals underlying the data. Recent advances in DEL models have focused on probabilistic formulations of count data, but existing approaches have thus far been limited to only utilizing 2-D molecule-level representations. We introduce a new paradigm, DEL-Dock, that combines ligand-based descriptors with 3-D spatial information from docked protein-ligand complexes. 3-D spatial information allows our model to learn over the actual binding modality rather than using only structure-based information of the ligand. We show that our model is capable of effectively denoising DEL count data to predict molecule enrichment scores that are better correlated with experimental binding affinity measurements compared to prior works. Moreover, by learning over a collection of docked poses we demonstrate that our model, trained only on DEL data, implicitly learns to perform good docking pose selection without requiring external supervision from expensive-to-source protein crystal structures.

Identifiants

pubmed: 37079427
doi: 10.1021/acs.jcim.2c01608
doi:

Substances chimiques

Ligands 0
Proteins 0
DNA 9007-49-2

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

2719-2727

Auteurs

Kirill Shmilovich (K)

Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States.

Benson Chen (B)

Insitro, South San Francisco, California 94080, United States.

Theofanis Karaletsos (T)

Insitro, South San Francisco, California 94080, United States.

Mohammad M Sultan (MM)

Insitro, South San Francisco, California 94080, United States.

Articles similaires

Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Animals Hemiptera Insect Proteins Phylogeny Insecticides
alpha-Synuclein Humans Animals Mice Lewy Body Disease

Conservation of the cooling agent binding pocket within the TRPM subfamily.

Kate Huffer, Matthew C S Denley, Elisabeth V Oskoui et al.
1.00
TRPM Cation Channels Animals Binding Sites Mice Pyrimidinones

Classifications MeSH