MELLODDY: Cross-pharma Federated Learning at Unprecedented Scale Unlocks Benefits in QSAR without Compromising Proprietary Information.


Journal

Journal of chemical information and modeling
ISSN: 1549-960X
Abbreviated title: J Chem Inf Model
Country: United States
NLM ID: 101230060

Publication information

Publication date:
29 Aug 2023
History:
pubmed: 29 Aug 2023
medline: 29 Aug 2023
entrez: 29 Aug 2023
Status: aheadofprint

Abstract

Federated multipartner machine learning has been touted as an appealing and efficient method to increase the effective training data volume and thereby the predictivity of models, particularly when the generation of training data is resource-intensive. In the landmark MELLODDY project, indeed, each of ten pharmaceutical companies realized aggregated improvements on its own classification or regression models through federated learning. To this end, they leveraged a novel implementation extending multitask learning across partners, on a platform audited for privacy and security. The experiments involved an unprecedented cross-pharma data set of 2.6+ billion confidential experimental activity data points, documenting 21+ million physical small molecules and 40+ thousand assays in on-target and secondary pharmacodynamics and pharmacokinetics. Appropriate complementary metrics were developed to evaluate the predictive performance in the federated setting. In addition to predictive performance increases in labeled space, the results point toward an extended applicability domain in federated learning. Increases in collective training data volume, including by means of auxiliary data resulting from single concentration high-throughput and imaging assays, continued to boost predictive performance, albeit with a saturating return. Markedly higher improvements were observed for the pharmacokinetics and safety panel assay-based task subsets.

Identifiers

pubmed: 37642660
doi: 10.1021/acs.jcim.3c00799

Publication types

Journal Article

Languages

eng

Citation subsets

IM

Authors

Wouter Heyndrickx (W)

Janssen Pharmaceutica NV, Turnhoutseweg 30, Beerse 2340, Belgium.

Lewis Mervin (L)

AstraZeneca R&D, Biomedical Campus, 1 Francis Crick Ave, Cambridge CB2 0SL, U.K.

Tobias Morawietz (T)

Bayer Pharma AG, Global Drug Discovery, Chemical Research, Computational Chemistry, Aprather Weg 18 a, Wuppertal 42096, Germany.

Noé Sturm (N)

Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland.

Lukas Friedrich (L)

Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany.

Adam Zalewski (A)

Amgen Research (Munich) GmbH, Staffelseestraße 2, Munich 81477, Germany.

Anastasia Pentina (A)

Bayer AG, Machine Learning Research, Research & Development, Pharmaceuticals, Berlin 10117, Germany.

Lina Humbeck (L)

BI Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, Biberach an der Riss 88397, Germany.

Martijn Oldenhof (M)

KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium.

Ritsuya Niwayama (R)

Institut de recherches Servier, 125 chemin de ronde Croissy-sur-Seine, Île-de-France 78290, France.

Peter Schmidtke (P)

Discngine, Avenue Ledru Rollin 79, Paris 75012, France.

Nikolas Fechner (N)

Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland.

Jaak Simm (J)

KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium.

Adam Arany (A)

KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium.

Nicolas Drizard (N)

Iktos, 65 rue de Prony, Paris 75017, France.

Rama Jabal (R)

Iktos, 65 rue de Prony, Paris 75017, France.

Arina Afanasyeva (A)

Modality Informatics Group, Digital Research Solutions, Advanced Informatics & Analytics, Astellas Pharma Inc., 21 Miyukigaoka, Tsukuba-shi, Ibaraki 305-8585, Japan.

Regis Loeb (R)

KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium.

Shlok Verma (S)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Simon Harnqvist (S)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Matthew Holmes (M)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Balazs Pejo (B)

Budapest University of Technology and Economics, Department of Networked Systems and Services, Műegyetem rkp. 3, Budapest 1111, Hungary.

Maria Telenczuk (M)

Owkin, 12 Rue Martel, Paris 75010, France.

Nicholas Holway (N)

Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland.

Arne Dieckmann (A)

Bayer AG, API Production, Product Supply, Pharmaceuticals, Ernst-Schering-Straße 14, Bergkamen 59192, Germany.

Nicola Rieke (N)

NVIDIA GmbH, Floessergasse 2, Munich 81369, Germany.

Friederike Zumsande (F)

Amgen Research (Munich) GmbH, Staffelseestraße 2, Munich 81477, Germany.

Djork-Arné Clevert (DA)

Bayer AG, Machine Learning Research, Research & Development, Pharmaceuticals, Berlin 10117, Germany.

Michael Krug (M)

Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany.

Christopher Luscombe (C)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Darren Green (D)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Peter Ertl (P)

Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland.

Peter Antal (P)

Budapest University of Technology and Economics, Department of Measurement and Information Systems, Műegyetem rkp. 3, Budapest 1111, Hungary.

David Marcus (D)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Nicolas Do Huu (N)

Iktos, 65 rue de Prony, Paris 75017, France.

Hideyoshi Fuji (H)

Modality Informatics Group, Digital Research Solutions, Advanced Informatics & Analytics, Astellas Pharma Inc., 21 Miyukigaoka, Tsukuba-shi, Ibaraki 305-8585, Japan.

Stephen Pickett (S)

GlaxoSmithKline, Computational Sciences, Gunnels Wood Road Stevenage, Herts SG1 2NY, U.K.

Gergely Acs (G)

Budapest University of Technology and Economics, Department of Networked Systems and Services, Műegyetem rkp. 3, Budapest 1111, Hungary.

Eric Boniface (E)

Substra Foundation - Labelia Labs, 4 rue Voltaire, Nantes 44000, France.

Bernd Beck (B)

BI Medicinal Chemistry Department, Boehringer Ingelheim Pharma GmbH & Co. KG, Birkendorfer Str. 65, Biberach an der Riss 88397, Germany.

Yax Sun (Y)

Amgen Research, 1 Amgen Center Drive, Thousand Oaks, California 92130, United States.

Arnaud Gohier (A)

Institut de recherches Servier, 125 chemin de ronde Croissy-sur-Seine, Île-de-France 78290, France.

Friedrich Rippmann (F)

Merck KGaA, Global Research & Development, Frankfurter Strasse 250, Darmstadt 64293, Germany.

Ola Engkvist (O)

AstraZeneca, Molecular AI, Discovery Sciences, R&D, Pepparedsleden 1, Mölndal 431 50, Sweden.

Andreas H Göller (AH)

Bayer Pharma AG, Global Drug Discovery, Chemical Research, Computational Chemistry, Aprather Weg 18 a, Wuppertal 42096, Germany.

Yves Moreau (Y)

KU Leuven, ESAT-STADIUS, Kasteelpark Arenberg 10, Heverlee 3001, Belgium.

Mathieu N Galtier (MN)

Owkin, 4 Rue Voltaire, Nantes 44000, France.

Ansgar Schuffenhauer (A)

Novartis Institutes for BioMedical Research, Novartis Campus, Basel 4002, Switzerland.

Hugo Ceulemans (H)

Janssen Pharmaceutica NV, Turnhoutseweg 30, Beerse 2340, Belgium.

MeSH classifications