Scalable Integration of Multiomic Single Cell Data Using Generative Adversarial Networks.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
02 May 2024
Historique:
received: 05 01 2024
revised: 22 03 2024
accepted: 30 04 2024
medline: 2 5 2024
pubmed: 2 5 2024
entrez: 2 5 2024
Statut: aheadofprint

Résumé

Single cell profiling has become a common practice to investigate the complexity of tissues, organs and organisms. Recent technological advances are expanding our capabilities to profile various molecular layers beyond the transcriptome such as, but not limited to, the genome, the epigenome and the proteome. Depending on the experimental procedure, these data can be obtained from separate assays or from the very same cells. Despite development of computational methods for data integration is an active research field, most of the available strategies have been devised for the joint analysis of two modalities and cannot accommodate a high number of them. We here propose a multiomic data integration framework based on Wasserstein Generative Adversarial Networks (MOWGAN) suitable for the analysis of paired or unpaired data with high number of modalities (>2). At the core of our strategy is a single network trained on all modalities together, limiting the computational burden when many molecular layers are evaluated. Source code of our framework is available at https://github.com/vgiansanti/MOWGAN. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 38696763
pii: 7663468
doi: 10.1093/bioinformatics/btae300
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Auteurs

Valentina Giansanti (V)

Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy.
Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.

Francesca Giannese (F)

Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.

Oronza A Botrugno (OA)

Functional Genomics of Cancer Unit, IRCCS San Raffaele Institute, Milan, Italy.
Vita-Salute San Raffaele University, Milan, Italy.

Giorgia Gandolfi (G)

Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.

Chiara Balestrieri (C)

Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.
Experimental Hematology Unit, IRCCS San Raffaele Scientific Institute, Milan, Italy.

Marco Antoniotti (M)

Department of Informatics, Systems and Communication, Università degli Studi di Milano-Bicocca, Milan, Italy.
Bicocca Bioinformatics Biostatistics and Bioimaging Centre-B4, Università degli Studi di Milano-Bicocca, Milan, Italy.
Istituto di Bioimmagini e Fisiologia Molecolare, Consiglio Nazionale delle Ricerche (CNR), Milan, Italy.

Giovanni Tonon (G)

Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.
Functional Genomics of Cancer Unit, IRCCS San Raffaele Institute, Milan, Italy.
Vita-Salute San Raffaele University, Milan, Italy.

Davide Cittaro (D)

Center for Omics Sciences, IRCCS San Raffaele Institute, Milan, Italy.

Classifications MeSH