Data-Driven Imputation of Miscibility of Aqueous Solutions via Graph-Regularized Logistic Matrix Factorization.
Journal
The journal of physical chemistry. B
ISSN: 1520-5207
Titre abrégé: J Phys Chem B
Pays: United States
ID NLM: 101157530
Informations de publication
Date de publication:
21 Sep 2023
21 Sep 2023
Historique:
medline:
8
9
2023
pubmed:
8
9
2023
entrez:
8
9
2023
Statut:
ppublish
Résumé
Aqueous, two-phase systems (ATPSs) may form upon mixing two solutions of independently water-soluble compounds. Many separation, purification, and extraction processes rely on ATPSs. Predicting the miscibility of solutions can accelerate and reduce the cost of the discovery of new ATPSs for these applications. Whereas previous machine learning approaches to ATPS prediction used physicochemical properties of each solute as a descriptor, in this work, we show how to impute missing miscibility outcomes directly from an incomplete collection of pairwise miscibility experiments. We use graph-regularized logistic matrix factorization (GR-LMF) to learn a latent vector of each solution from (i) the observed entries in the pairwise miscibility matrix and (ii) a graph where each node is a solution and edges are relationships indicating the general category of the solute (i.e., polymer, surfactant, salt, protein). For an experimental data set of the pairwise miscibility of 68 solutions from Peacock et al. [
Identifiants
pubmed: 37682958
doi: 10.1021/acs.jpcb.3c03789
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM