COSGAP: COntainerized Statistical Genetics Analysis Pipelines.


Journal

Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676

Informations de publication

Date de publication:
2024
Historique:
received: 29 11 2023
revised: 30 03 2024
accepted: 08 05 2024
medline: 29 5 2024
pubmed: 29 5 2024
entrez: 29 5 2024
Statut: epublish

Résumé

The collection and analysis of sensitive data in large-scale consortia for statistical genetics is hampered by multiple challenges, due to their non-shareable nature. Time-consuming issues in installing software frequently arise due to different operating systems, software dependencies, and limited internet access. For federated analysis across sites, it can be challenging to resolve different problems, including format requirements, data wrangling, setting up analysis on high-performance computing (HPC) facilities, etc. Easier, more standardized, automated protocols and pipelines can be solutions to overcome these issues. We have developed one such solution for statistical genetic data analysis using software container technologies. This solution, named COSGAP: "COntainerized Statistical Genetics Analysis Pipelines," consists of already established software tools placed into Singularity containers, alongside corresponding code and instructions on how to perform statistical genetic analyses, such as genome-wide association studies, polygenic scoring, LD score regression, Gaussian Mixture Models, and gene-set analysis. Using provided helper scripts written in Python, users can obtain auto-generated scripts to conduct the desired analysis either on HPC facilities or on a personal computer. COSGAP is actively being applied by users from different countries and projects to conduct genetic data analyses without spending much effort on software installation, converting data formats, and other technical requirements. COSGAP is freely available on GitHub (https://github.com/comorment/containers) under the GPLv3 license.

Identifiants

pubmed: 38808072
doi: 10.1093/bioadv/vbae067
pii: vbae067
pmc: PMC11132817
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbae067

Informations de copyright

© The Author(s) 2024. Published by Oxford University Press.

Déclaration de conflit d'intérêts

Dr. Andreassen has received speaker fees from Lundbeck, Janssen, Otsuka, and Sunovion and is a consultant to Cortechs.ai. and Precision Health. Dr. Frei is a consultant to Precision Health.

Auteurs

Bayram Cevdet Akdeniz (BC)

Department of Informatics, Centre for Bioinformatics, University of Oslo, Oslo 0373, Norway.
Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.

Oleksandr Frei (O)

Department of Informatics, Centre for Bioinformatics, University of Oslo, Oslo 0373, Norway.
Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.

Espen Hagen (E)

Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.

Tahir Tekin Filiz (TT)

Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.

Sandeep Karthikeyan (S)

Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.

Joëlle Pasman (J)

Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm 17177, Sweden.

Andreas Jangmo (A)

Department of Mental Health and Suicide, Norwegian Institute of Public Health, Oslo 0213, Norway.

Jacob Bergstedt (J)

Unit of Integrative Epidemiology, Institute of Environmental Medicine, Karolinska Institutet, Stockholm 17177, Sweden.

John R Shorter (JR)

Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Mental Health Services Copenhagen, Roskilde 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH), Copenhagen 8210, Denmark.

Richard Zetterberg (R)

Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Mental Health Services Copenhagen, Roskilde 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH), Copenhagen 8210, Denmark.

Joeri Meijsen (J)

Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Mental Health Services Copenhagen, Roskilde 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH), Copenhagen 8210, Denmark.

Ida Elken Sønderby (IE)

Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.
Department of Medical Genetics, Oslo University Hospital, Oslo 0424, Norway.

Alfonso Buil (A)

Institute of Biological Psychiatry, Mental Health Center Sct. Hans, Mental Health Services Copenhagen, Roskilde 4000, Denmark.
The Lundbeck Foundation Initiative for Integrative Psychiatric Research (iPSYCH), Copenhagen 8210, Denmark.

Martin Tesli (M)

Department of Mental Health and Suicide, Norwegian Institute of Public Health, Oslo 0213, Norway.

Yi Lu (Y)

Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm 17177, Sweden.

Patrick Sullivan (P)

Department of Medical Epidemiology and Biostatistics, Karolinska Institutet, Stockholm 17177, Sweden.

Ole A Andreassen (OA)

Centre for Precision Psychiatry, Institute of Clinical Medicine, University of Oslo, Oslo 0450, Norway.
KG Jebsen Centre for Neurodevelopmental Disorders, University of Oslo and Oslo University Hospital, Oslo 4956, Norway.

Eivind Hovig (E)

Department of Informatics, Centre for Bioinformatics, University of Oslo, Oslo 0373, Norway.
Department of Tumor Biology, Institute for Cancer Research, Oslo University Hospital, Oslo 0424, Norway.

Classifications MeSH