A mini-review on perturbation modelling across single-cell omic modalities.

Deep learning Machine learning Perturbation ScRNAseq Single-cell RNA sequencing

Journal

Computational and structural biotechnology journal
ISSN: 2001-0370
Titre abrégé: Comput Struct Biotechnol J
Pays: Netherlands
ID NLM: 101585369

Informations de publication

Date de publication:
Dec 2024
Historique:
received: 01 02 2024
revised: 23 04 2024
accepted: 23 04 2024
medline: 9 5 2024
pubmed: 9 5 2024
entrez: 9 5 2024
Statut: epublish

Résumé

Recent advances in single-cell omics technology have transformed the landscape of cellular and molecular research, enriching the scope and intricacy of cellular characterisation. Perturbation modelling seeks to comprehensively grasp the effects of external influences like disease onset or molecular knock-outs or external stimulants on cellular physiology, specifically on transcription factors, signal transducers, biological pathways, and dynamic cell states. Machine and deep learning tools transform complex perturbational phenomena in algorithmically tractable tasks to formulate predictions based on various types of single-cell datasets. However, the recent surge in tools and datasets makes it challenging for experimental biologists and computational scientists to keep track of the recent advances in this rapidly expanding filed of single-cell modelling. Here, we recapitulate the main objectives of perturbation modelling and summarise novel single-cell perturbation technologies based on genetic manipulation like CRISPR or compounds, spanning across omic modalities. We then concisely review a burgeoning group of computational methods extending from classical statistical inference methodologies to various machine and deep learning architectures like shallow models or autoencoders, to biologically informed approaches based on gene regulatory networks, and to combinatorial efforts reminiscent of ensemble learning. We also discuss the rising trend of large foundational models in single-cell perturbation modelling inspired by large language models. Lastly, we critically assess the challenges that underline single-cell perturbation modelling while pointing towards relevant future perspectives like perturbation atlases, multi-omics and spatial datasets, causal machine learning for interpretability, multi-task learning for performance and explainability as well as prospects for solving interoperability and benchmarking pitfalls.

Identifiants

pubmed: 38721585
doi: 10.1016/j.csbj.2024.04.058
pii: S2001-0370(24)00141-7
pmc: PMC11076269
doi:

Types de publication

Journal Article Review

Langues

eng

Pagination

1886-1896

Informations de copyright

© 2024 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology.

Déclaration de conflit d'intérêts

The authors declare no conflict of interest whatsoever.

Auteurs

George I Gavriilidis (GI)

Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.

Vasileios Vasileiou (V)

Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.
Department of Molecular Biology and Genetics, Democritus University of Thrace, Alexandroupolis, Greece.

Aspasia Orfanou (A)

Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.

Naveed Ishaque (N)

Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Center of Digital Health, Berlin, Germany.

Fotis Psomopoulos (F)

Institute of Applied Biosciences, Centre for Research and Technology Hellas, Thessaloniki, Greece.

Classifications MeSH