Decoding phenotypic screening: A comparative analysis of image representations.

Activity prediction Deep Learning High Content Screening Image representation Self-supervised learning

Journal

Computational and structural biotechnology journal
ISSN: 2001-0370
Titre abrégé: Comput Struct Biotechnol J
Pays: Netherlands
ID NLM: 101585369

Informations de publication

Date de publication:
Dec 2024
Historique:
received: 13 11 2023
revised: 26 02 2024
accepted: 26 02 2024
medline: 21 3 2024
pubmed: 21 3 2024
entrez: 21 3 2024
Statut: epublish

Résumé

Biomedical imaging techniques such as high content screening (HCS) are valuable for drug discovery, but high costs limit their use to pharmaceutical companies. To address this issue, The JUMP-CP consortium released a massive open image dataset of chemical and genetic perturbations, providing a valuable resource for deep learning research. In this work, we aim to utilize the JUMP-CP dataset to develop a universal representation model for HCS data, mainly data generated using U2OS cells and CellPainting protocol, using supervised and self-supervised learning approaches. We propose an evaluation protocol that assesses their performance on mode of action and property prediction tasks using a popular phenotypic screening dataset. Results show that the self-supervised approach that uses data from multiple consortium partners provides representation that is more robust to batch effects whilst simultaneously achieving performance on par with standard approaches. Together with other conclusions, it provides recommendations on the training strategy of a representation model for HCS images.

Identifiants

pubmed: 38510976
doi: 10.1016/j.csbj.2024.02.022
pii: S2001-0370(24)00046-1
pmc: PMC10951426
doi:

Types de publication

Journal Article

Langues

eng

Pagination

1181-1188

Informations de copyright

© 2024 The Authors.

Déclaration de conflit d'intérêts

The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Jacek Tabor reports financial support was provided by National Science Centre Poland. Dawid Rymarczyk reports financial support was provided by National Science Centre Poland. Lukasz Struski reports financial support was provided by National Science Centre Poland. Bartosz Zielinski reports financial support was provided by National Science Centre Poland. If there are other authors, they declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Adriana Borowa (A)

Jagiellonian University, Faculty of Mathematics and Computer Science, Kraków, Poland.
Jagiellonian University, Doctoral School of Exact and Natural Sciences, Kraków, Poland.
Ardigen SA, Kraków, Poland.

Dawid Rymarczyk (D)

Jagiellonian University, Faculty of Mathematics and Computer Science, Kraków, Poland.
Ardigen SA, Kraków, Poland.

Marek Żyła (M)

Ardigen SA, Kraków, Poland.

Maciej Kańdula (M)

Janssen Pharmaceutica NV, Beerse, Belgium.

Ana Sánchez-Fernández (A)

Janssen Pharmaceutica NV, Beerse, Belgium.

Krzysztof Rataj (K)

Ardigen SA, Kraków, Poland.

Łukasz Struski (Ł)

Jagiellonian University, Faculty of Mathematics and Computer Science, Kraków, Poland.

Jacek Tabor (J)

Jagiellonian University, Faculty of Mathematics and Computer Science, Kraków, Poland.

Bartosz Zieliński (B)

Jagiellonian University, Faculty of Mathematics and Computer Science, Kraków, Poland.
Ardigen SA, Kraków, Poland.

Classifications MeSH