The Collaborative Research Center FONDA.

Big data processing Data science Distributed systems Research software engineering Scientific workflows

Journal

Datenbank-Spektrum : Zeitschrift fur Datenbanktechnologie : Organ der Fachgruppe Datenbanken der Gesellschaft fur Informatik e.V
ISSN: 1610-1995
Titre abrégé: Datenbank Spektrum
Pays: Germany
ID NLM: 101718074

Informations de publication

Date de publication:
2021
Historique:
pubmed: 18 11 2021
medline: 18 11 2021
entrez: 17 11 2021
Statut: ppublish

Résumé

Today's scientific data analysis very often requires complex Data Analysis Workflows (DAWs) executed over distributed computational infrastructures, e.g., clusters. Much research effort is devoted to the tuning and performance optimization of specific workflows for specific clusters. However, an arguably even more important problem for accelerating research is the reduction of development, adaptation, and maintenance times of DAWs. We describe the design and setup of the Collaborative Research Center (CRC) 1404 "FONDA -- Foundations of Workflows for Large-Scale Scientific Data Analysis", in which roughly 50 researchers jointly investigate new technologies, algorithms, and models to increase the portability, adaptability, and dependability of DAWs executed over distributed infrastructures. We describe the motivation behind our project, explain its underlying core concepts, introduce FONDA's internal structure, and sketch our vision for the future of workflow-based scientific data analysis. We also describe some lessons learned during the "making of" a CRC in Computer Science with strong interdisciplinary components, with the aim to foster similar endeavors.

Identifiants

pubmed: 34786019
doi: 10.1007/s13222-021-00397-5
pii: 397
pmc: PMC8587492
doi:

Types de publication

News

Langues

eng

Pagination

255-260

Informations de copyright

© Gesellschaft für Informatik e.V. and Springer-Verlag GmbH Germany, part of Springer Nature 2021.

Auteurs

Ulf Leser (U)

Humboldt-Universität zu Berlin, Berlin, Germany.

Marcus Hilbrich (M)

Humboldt-Universität zu Berlin, Berlin, Germany.

Claudia Draxl (C)

Humboldt-Universität zu Berlin, Berlin, Germany.

Peter Eisert (P)

Humboldt-Universität zu Berlin, Berlin, Germany.
Fraunhofer Heinrich-Hertz Institut, Berlin, Germany.

Lars Grunske (L)

Humboldt-Universität zu Berlin, Berlin, Germany.

Patrick Hostert (P)

Humboldt-Universität zu Berlin, Berlin, Germany.

Dagmar Kainmüller (D)

Max-Delbrück Center for Molecular Medicine, Regensburg, Germany.

Odej Kao (O)

Technische Universität Berlin, Berlin, Germany.

Birte Kehr (B)

Universität Regensburg, Regensburg, Germany.

Timo Kehrer (T)

Humboldt-Universität zu Berlin, Berlin, Germany.

Christoph Koch (C)

Humboldt-Universität zu Berlin, Berlin, Germany.

Volker Markl (V)

Technische Universität Berlin, Berlin, Germany.

Henning Meyerhenke (H)

Humboldt-Universität zu Berlin, Berlin, Germany.

Tilmann Rabl (T)

Hasso Plattner Institut, University of Potsdam, Potsdam, Germany.

Alexander Reinefeld (A)

Humboldt-Universität zu Berlin, Berlin, Germany.
Zuse-Institut Berlin, Berlin, Germany.

Knut Reinert (K)

Freie Universität Berlin, Berlin, Germany.

Kerstin Ritter (K)

Charité - Universitätsmedizin Berlin, Berlin, Germany.

Björn Scheuermann (B)

Humboldt-Universität zu Berlin, Berlin, Germany.

Florian Schintke (F)

Zuse-Institut Berlin, Berlin, Germany.

Nicole Schweikardt (N)

Humboldt-Universität zu Berlin, Berlin, Germany.

Matthias Weidlich (M)

Humboldt-Universität zu Berlin, Berlin, Germany.

Classifications MeSH