Generic Context-Aware Group Contributions.


Journal

IEEE/ACM transactions on computational biology and bioinformatics
ISSN: 1557-9964
Abbreviated title: IEEE/ACM Trans Comput Biol Bioinform
Country: United States
NLM ID: 101196755

Publication information

Publication date:
History:
pubmed: 6 8 2020
medline: 19 2 2022
entrez: 6 8 2020
Status: ppublish

Abstract

Many properties of molecules vary systematically with changes in the structural formula and can thus be estimated from regression models defined on small structural building blocks, usually functional groups. Typically, such approaches are limited to a particular class of compounds and require hand-curated lists of chemically plausible groups. This limits their use, in particular in generative approaches that explore large chemical spaces. Here we overcome this limitation by proposing a generic group contribution method that iteratively identifies significant regressors of increasing size. To this end, LASSO regression is used, and the context-dependent contributions are "anchored" around a reference edge to reduce ambiguities and prevent overcounting due to multiple embeddings. We benchmark our approach, which is available as "Context AwaRe Group cOntribution" (CARGO), on artificial data and on typical applications from chemical thermodynamics. The method yields stable results with accuracies comparable to other regression techniques. As a by-product, we obtain interpretable additive contributions for individual chemical bonds and correction terms depending on local contexts.
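
The core idea of the abstract, sparse regression on counts of structural building blocks, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the feature names, the toy count matrix, the penalty value, and the function names are hypothetical and are not taken from CARGO. The sketch uses a plain coordinate-descent LASSO and does not implement the iterative growth of regressors or the anchoring around a reference edge.

```python
# Illustrative sketch only: all names and data are hypothetical, not CARGO's.

def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the LASSO proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_coordinate_descent(rows, targets, penalty, sweeps=500):
    """Minimise (1/2n) * ||y - Xw||^2 + penalty * ||w||_1 by cyclic updates."""
    n_samples, n_features = len(rows), len(rows[0])
    weights = [0.0] * n_features
    for _ in range(sweeps):
        for j in range(n_features):
            rho = 0.0   # feature j times the residual that ignores feature j
            norm = 0.0  # squared length of column j
            for row, y in zip(rows, targets):
                partial = sum(weights[k] * row[k]
                              for k in range(n_features) if k != j)
                rho += row[j] * (y - partial)
                norm += row[j] ** 2
            if norm == 0.0:
                weights[j] = 0.0
            else:
                weights[j] = (soft_threshold(rho / n_samples, penalty)
                              / (norm / n_samples))
    return weights


# Hypothetical features: counts of two bond types plus one context term.
feature_names = ["C-C bond", "C-O bond", "context term"]

# Artificial count matrix: one row per toy molecule (not real compounds).
counts = [
    [1, 0, 0],
    [0, 1, 0],
    [0, 0, 1],
    [1, 1, 0],
    [1, 0, 1],
    [0, 1, 1],
    [2, 1, 0],
    [1, 2, 1],
]

# Artificial property values built from assumed additive contributions;
# the context term contributes nothing, so LASSO should discard it.
true_contributions = [2.0, -3.0, 0.0]
targets = [sum(c * x for c, x in zip(true_contributions, row))
           for row in counts]

weights = lasso_coordinate_descent(counts, targets, penalty=0.01)

for name, weight in zip(feature_names, weights):
    print(f"{name}: {weight:.3f}")
```

On this noise-free toy data the L1 penalty drives the irrelevant context term to zero and shrinks the two bond contributions slightly toward zero, which is the behaviour that makes the fitted contributions sparse and interpretable.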

Identifiers

pubmed: 32750852
doi: 10.1109/TCBB.2020.2998948

Publication types

Journal Article; Research Support, Non-U.S. Gov't

Languages

eng

Citation subsets

IM

Pagination

429-442

Authors

MeSH classifications