scikit-matter : A Suite of Generalisable Machine Learning Methods Born out of Chemistry and Materials Science.

KPCovR PCovR Python directional convex hull feature reconstruction feature selection sample selection

Journal

Open research Europe
ISSN: 2732-5121
Titre abrégé: Open Res Eur
Pays: Belgium
ID NLM: 9918230081006676

Informations de publication

Date de publication:
2023
Historique:
accepted: 01 09 2023
medline: 18 1 2024
pubmed: 18 1 2024
entrez: 18 1 2024
Statut: epublish

Résumé

Easy-to-use libraries such as scikit-learn have accelerated the adoption and application of machine learning (ML) workflows and data-driven methods. While many of the algorithms implemented in these libraries originated in specific scientific fields, they have gained in popularity in part because of their generalisability across multiple domains. Over the past two decades, researchers in the chemical and materials science community have put forward general-purpose machine learning methods. The deployment of these methods into workflows of other domains, however, is often burdensome due to the entanglement with domainspecific functionalities. We present the python library scikit-matter that targets domain-agnostic implementations of methods developed in the computational chemical and materials science community, following the scikit-learn API and coding guidelines to promote usability and interoperability with existing workflows.

Identifiants

pubmed: 38234865
doi: 10.12688/openreseurope.15789.2
pmc: PMC10792272
doi:

Types de publication

Journal Article

Langues

eng

Pagination

81

Informations de copyright

Copyright: © 2023 Goscinski A et al.

Déclaration de conflit d'intérêts

No competing interests were disclosed.

Auteurs

Alexander Goscinski (A)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Victor Paul Principe (VP)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Guillaume Fraux (G)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Sergei Kliavinek (S)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Benjamin Aaron Helfrecht (BA)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.
Pacific Northwest National Laboratory, Richland, WA, 99352, USA.

Philip Loche (P)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Michele Ceriotti (M)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.

Rose Kathleen Cersonsky (RK)

Laboratory of Computational Science and Modeling (COSMO), Institute of Materials, Ecole Polytechnique Federale de Lausanne, Lausanne, Vaud, 1015, Switzerland.
Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, Wisconsin, 53706, USA.

Classifications MeSH