Three million images and morphological profiles of cells treated with matched chemical and genetic perturbations.


Journal

Nature methods
ISSN: 1548-7105
Titre abrégé: Nat Methods
Pays: United States
ID NLM: 101215604

Informations de publication

Date de publication:
09 Apr 2024
Historique:
received: 23 05 2023
accepted: 11 03 2024
medline: 10 4 2024
pubmed: 10 4 2024
entrez: 9 4 2024
Statut: aheadofprint

Résumé

The identification of genetic and chemical perturbations with similar impacts on cell morphology can elucidate compounds' mechanisms of action or novel regulators of genetic pathways. Research on methods for identifying such similarities has lagged due to a lack of carefully designed and well-annotated image sets of cells treated with chemical and genetic perturbations. Here we create such a Resource dataset, CPJUMP1, in which each perturbed gene's product is a known target of at least two chemical compounds in the dataset. We systematically explore the directionality of correlations among perturbations that target the same protein encoded by a given gene, and we find that identifying matches between chemical and genetic perturbations is a challenging task. Our dataset and baseline analyses provide a benchmark for evaluating methods that measure perturbation similarities and impact, and more generally, learn effective representations of cellular state from microscopy images. Such advancements would accelerate the applications of image-based profiling of cellular states, such as uncovering drug mode of action or probing functional genomics.

Identifiants

pubmed: 38594452
doi: 10.1038/s41592-024-02241-6
pii: 10.1038/s41592-024-02241-6
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : U.S. Department of Health & Human Services | National Institutes of Health (NIH)
ID : NIH MIRA R35 GM122547

Informations de copyright

© 2024. The Author(s).

Références

Chandrasekaran, S. N., Ceulemans, H., Boyd, J. D. & Carpenter, A. E. Image-based profiling for drug discovery: due for a machine-learning upgrade? Nat. Rev. Drug Discov. 20, 145–159 (2021).
doi: 10.1038/s41573-020-00117-w pubmed: 33353986
Caicedo, J. C. et al. Data-analysis strategies for image-based cell profiling. Nat. Methods 14, 849–863 (2017).
doi: 10.1038/nmeth.4397 pubmed: 28858338 pmcid: 6871000
Pratapa, A., Doron, M. & Caicedo, J. C. Image-based cell phenotyping with deep learning. Curr. Opin. Chem. Biol. 65, 9–17 (2021).
doi: 10.1016/j.cbpa.2021.04.001 pubmed: 34023800
Moshkov, N. et al. Learning representations for image-based profiling of perturbations. https://doi.org/10.1038/s41467-024-45999-1 (2022).
Canham, S. M. et al. Systematic chemogenetic library assembly. Cell Chem. Biol. 27, 1124–1129 (2020).
doi: 10.1016/j.chembiol.2020.07.004 pubmed: 32707038
Bray, M.-A. et al. Cell Painting, a high-content image-based assay for morphological profiling using multiplexed fluorescent dyes. Nat. Protoc. 11, 1757–1774 (2016).
doi: 10.1038/nprot.2016.105 pubmed: 27560178 pmcid: 5223290
Cimini, B. A. et al. Optimizing the Cell Painting assay for image-based profiling. Nat. Protoc. 18, 1981–2013 (2023).
doi: 10.1038/s41596-023-00840-9 pubmed: 37344608
Corsello, S. M. et al. The Drug Repurposing Hub: a next-generation drug library and information resource. Nat. Med. 23, 405–408 (2017).
doi: 10.1038/nm.4306 pubmed: 28388612 pmcid: 5568558
Perlman, Z. E. et al. Multidimensional drug profiling by automated microscopy. Science 306, 1194–1198 (2004).
doi: 10.1126/science.1100709 pubmed: 15539606
Rohban, M. H. et al. Virtual screening for small-molecule pathway regulators by image-profile matching. Cell Syst. 13, 724–736 (2022).
doi: 10.1016/j.cels.2022.08.003 pubmed: 36057257 pmcid: 9509476
Lill, J. R., Mathews, W. R., Rose, C. M. & Schirle, M. Proteomics in the pharmaceutical and biotechnology industry: a look to the next decade. Expert Rev. Proteomics 18, 503–526 (2021).
doi: 10.1080/14789450.2021.1962300 pubmed: 34320887
Rastelli, G. & Pinzi, L. Computational polypharmacology comes of age. Front. Pharmacol. 6, 157 (2015).
doi: 10.3389/fphar.2015.00157 pubmed: 26283966 pmcid: 4516879
Proschak, E., Stark, H. & Merk, D. Polypharmacology by design: a medicinal chemist’s perspective on multitargeting compounds. J. Med. Chem. 62, 420–444 (2019).
doi: 10.1021/acs.jmedchem.8b00760 pubmed: 30035545
Chandrasekaran, S. N. et al. JUMP Cell Painting dataset: morphological impact of 136,000 chemical and genetic perturbations. Preprint at https://doi.org/10.1101/2023.03.23.534023 (2023).
Arrowsmith, C. H. et al. The promise and peril of chemical probes. Nat. Chem. Biol. 11, 536–541 (2015).
doi: 10.1038/nchembio.1867 pubmed: 26196764 pmcid: 4706458
Rohban, M. H. et al. Systematic morphological profiling of human gene and allele function via Cell Painting. Elife 6, e24060 (2017).
doi: 10.7554/eLife.24060 pubmed: 28315521 pmcid: 5386591
Bray, M.-A. et al. A dataset of images and morphological profiles of 30 000 small-molecule treatments using the Cell Painting assay. Gigascience 6, 1–5 (2017).
doi: 10.1093/gigascience/giw014 pubmed: 28327978 pmcid: 5721342
Doench, J. G. et al. Optimized sgRNA design to maximize activity and minimize off-target effects of CRISPR-Cas9. Nat. Biotechnol. 34, 184–191 (2016).
doi: 10.1038/nbt.3437 pubmed: 26780180 pmcid: 4744125
Lundholt, B. K., Scudder, K. M. & Pagliaro, L. A simple technique for reducing edge effect in cell-based assays. J. Biomol. Screen. 8, 566–570 (2003).
doi: 10.1177/1087057103256465 pubmed: 14567784
Schindelin, J. et al. Fiji: an open-source platform for biological-image analysis. Nat. Methods 9, 676–682 (2012).
doi: 10.1038/nmeth.2019 pubmed: 22743772
Stirling, D. R. et al. CellProfiler 4: improvements in speed, utility and usability. BMC Bioinformatics 22, 433 (2021).
doi: 10.1186/s12859-021-04344-9 pubmed: 34507520 pmcid: 8431850
Singh, S., Bray, M.-A., Jones, T. R. & Carpenter, A. E. Pipeline for illumination correction of images for high-throughput microscopy. J. Microsc. 256, 231–236 (2014).
doi: 10.1111/jmi.12178 pubmed: 25228240 pmcid: 4359755
Methods for image-based cell profiling [R package cytominer version 0.2.2]. (2020).
Cimini, B. A. et al. Broad Institute Imaging Platform Profiling Handbook. https://github.com/cytomining/profiling-handbook
Way, G. et al. Pycytominer: Data processing functions for profiling perturbations.
Chandrasekaran, S. N., Weisbart, E., Way, G., Carpenter, A. & Singh, S. Broad Institute Imaging Platform Profiling Recipe.
Hirling, D. et al. Segmentation metric misinterpretations in bioimage analysis. Nat. Methods 21, 213–216 (2024).
doi: 10.1038/s41592-023-01942-8 pubmed: 37500758
Loizides, F. & Schmidt, B. (eds) Positioning and Power in Academic Publishing: Players, Agents and Agendas: Proceedings of the 20th International Conference on Electronic Publishing (IOS Press, 2016).
Van Rossum, G. & Drake, F. L. The Python Language Reference Manual (Network Theory Ltd., 2011).
Harris, C. R. et al. Array programming with NumPy. Nature 585, 357–362 (2020).
doi: 10.1038/s41586-020-2649-2 pubmed: 32939066 pmcid: 7759461
Virtanen, P. et al. SciPy 1.0: fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
doi: 10.1038/s41592-019-0686-2 pubmed: 32015543 pmcid: 7056644
McKinney, W., Van Der Walt, S. & Millman, J. Proceedings of the 9th Python in Science Conference. Preprint at https://conference.scipy.org/proceedings/scipy2010/pdfs/mckinney.pdf (2010).
Pedregosa, F. et al. Scikit-learn: machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
Hunter, J. D. Matplotlib: a 2D graphics environment. Comput. Sci. Eng. 9, 90–95 (2007).
doi: 10.1109/MCSE.2007.55
Waskom, M. seaborn: statistical data visualization. J. Open Source Softw. 6(60), 3021 (2021).
doi: 10.21105/joss.03021
Plotly Technologies Inc. Collaborative Data Science (Plotly Technologies Inc., 2015).
McInnes, L., Healy, J. & Melville, J. UMAP: uniform manifold approximation and projection for dimension reduction. Preprint at https://doi.org/10.48550/arxiv.1802.03426 (2018).

Auteurs

Srinivas Niranj Chandrasekaran (SN)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Beth A Cimini (BA)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Amy Goodale (A)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Lisa Miller (L)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Maria Kost-Alimova (M)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Nasim Jamali (N)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

John G Doench (JG)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Briana Fritchman (B)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Adam Skepner (A)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Michelle Melanson (M)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Alexandr A Kalinin (AA)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

John Arevalo (J)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Marzieh Haghighi (M)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Juan C Caicedo (JC)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Daniel Kuhn (D)

Merck Healthcare KGaA, Darmstadt, Germany.

Desiree Hernandez (D)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

James Berstler (J)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Hamdah Shafqat-Abbasi (H)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

David E Root (DE)

Broad Institute of MIT and Harvard, Cambridge, MA, USA.

Susanne E Swalley (SE)

Biogen Inc., Cambridge, MA, USA.

Sakshi Garg (S)

Merck Healthcare KGaA, Darmstadt, Germany.

Shantanu Singh (S)

Broad Institute of MIT and Harvard, Cambridge, MA, USA. shantanu@broadinstitute.org.

Anne E Carpenter (AE)

Broad Institute of MIT and Harvard, Cambridge, MA, USA. anne@broadinstitute.org.

Classifications MeSH