Data-Driven Imputation of Miscibility of Aqueous Solutions via Graph-Regularized Logistic Matrix Factorization.


Journal

The journal of physical chemistry. B
ISSN: 1520-5207
Titre abrégé: J Phys Chem B
Pays: United States
ID NLM: 101157530

Informations de publication

Date de publication:
21 Sep 2023
Historique:
medline: 8 9 2023
pubmed: 8 9 2023
entrez: 8 9 2023
Statut: ppublish

Résumé

Aqueous, two-phase systems (ATPSs) may form upon mixing two solutions of independently water-soluble compounds. Many separation, purification, and extraction processes rely on ATPSs. Predicting the miscibility of solutions can accelerate and reduce the cost of the discovery of new ATPSs for these applications. Whereas previous machine learning approaches to ATPS prediction used physicochemical properties of each solute as a descriptor, in this work, we show how to impute missing miscibility outcomes directly from an incomplete collection of pairwise miscibility experiments. We use graph-regularized logistic matrix factorization (GR-LMF) to learn a latent vector of each solution from (i) the observed entries in the pairwise miscibility matrix and (ii) a graph where each node is a solution and edges are relationships indicating the general category of the solute (i.e., polymer, surfactant, salt, protein). For an experimental data set of the pairwise miscibility of 68 solutions from Peacock et al. [

Identifiants

pubmed: 37682958
doi: 10.1021/acs.jpcb.3c03789
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

7964-7973

Auteurs

Diba Behnoudfar (D)

School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon 97331, United States.

Cory M Simon (CM)

School of Chemical, Biological, and Environmental Engineering, Oregon State University, Corvallis, Oregon 97331, United States.

Joshua Schrier (J)

Department of Chemistry, Fordham University, The Bronx, New York 10458, United States.

Classifications MeSH