Reproducible biomedical benchmarking in the cloud: lessons from crowd-sourced data challenges.


Journal

Genome Biology
ISSN: 1474-760X
Abbreviated title: Genome Biol
Country: England
NLM ID: 100960660

Publication information

Publication date:
2019-09-10
History:
received: 2019-04-25
accepted: 2019-08-13
entrez: 2019-09-12
pubmed: 2019-09-12
medline: 2019-11-19
Status: epublish

Abstract

Challenges are achieving broad acceptance for addressing many biomedical questions and enabling tool assessment. But ensuring that the methods evaluated are reproducible and reusable is complicated by the diversity of software architectures, input and output file formats, and computing environments. To mitigate these problems, some challenges have leveraged new virtualization and compute methods, requiring participants to submit cloud-ready software packages. We review recent data challenges with innovative approaches to model reproducibility and data sharing, and outline key lessons for improving quantitative biomedical data analysis through crowd-sourced benchmarking challenges.

Identifiers

pubmed: 31506093
doi: 10.1186/s13059-019-1794-0
pii: 10.1186/s13059-019-1794-0
pmc: PMC6737594

Databases

figshare
10.6084/m9.figshare.3115156.v2

Publication types

Letter; Research Support, N.I.H., Extramural

Languages

eng

Citation subsets

IM

Pagination

195

Grants

Agency: NCI NIH HHS
ID: 5U24CA209923
Country: United States
Agency: NIGMS NIH HHS
ID: R01 GM109031
Country: United States
Agency: NCI NIH HHS
ID: P30CA016042
Country: United States
Agency: NCI NIH HHS
ID: R01CA180778
Country: United States
Agency: NCI NIH HHS
ID: U24 CA210990
Country: United States

References

Nature. 2016 May 11;533(7602):S62-4
pubmed: 27167394
Leukemia. 2012 Nov;26(11):2406-13
pubmed: 22722715
Blood. 2017 Jul 27;130(4):453-459
pubmed: 28600341
Nucleic Acids Res. 2016 Jul 8;44(W1):W3-W10
pubmed: 27137889
JAMA Oncol. 2017 Nov 1;3(11):1463-1464
pubmed: 28472204
Genome Biol. 2019 Sep 10;20(1):195
pubmed: 31506093
Blood. 2007 Mar 15;109(6):2276-84
pubmed: 17105813
Mol Syst Biol. 2011 Oct 11;7:537
pubmed: 21988833
Nat Biotechnol. 2018 May 9;36(5):391-392
pubmed: 29734317
Nat Rev Genet. 2016 Jul 15;17(8):470-86
pubmed: 27418159
Radiology. 2017 Apr;283(1):59-69
pubmed: 28244803

Authors

Kyle Ellrott (K)

Biomedical Engineering, Oregon Health and Science University, Portland, OR, 97239, USA.

Alex Buchanan (A)

Biomedical Engineering, Oregon Health and Science University, Portland, OR, 97239, USA.

Allison Creason (A)

Biomedical Engineering, Oregon Health and Science University, Portland, OR, 97239, USA.

Michael Mason (M)

Sage Bionetworks, Seattle, WA, USA.

Thomas Schaffter (T)

IBM Research, Yorktown Heights, NY, USA.

Bruce Hoff (B)

Sage Bionetworks, Seattle, WA, USA.

James Eddy (J)

Sage Bionetworks, Seattle, WA, USA.

John M Chilton (JM)

Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, State College, PA, USA.

Thomas Yu (T)

Sage Bionetworks, Seattle, WA, USA.

Joshua M Stuart (JM)

University of California, Santa Cruz, Santa Cruz, CA, USA.

Julio Saez-Rodriguez (J)

Institute for Computational Biomedicine, Heidelberg University, Faculty of Medicine and Heidelberg University Hospital, Bioquant, Heidelberg, Germany.
Joint Research Center for Computational Biomedicine, RWTH Aachen University, Faculty of Medicine, Aachen, Germany.

Gustavo Stolovitzky (G)

IBM Research, Yorktown Heights, NY, USA.

Paul C Boutros (PC)

Ontario Institute for Cancer Research, Toronto, Canada.
Departments of Medical Biophysics and Pharmacology & Toxicology, University of Toronto, Toronto, Canada.
Departments of Human Genetics and Urology, University of California, Los Angeles, CA, USA.
Jonsson Comprehensive Cancer Centre, University of California, Los Angeles, CA, USA.
Institute for Precision Health, University of California, Los Angeles, CA, USA.

Justin Guinney (J)

Sage Bionetworks, Seattle, WA, USA. justin.guinney@sagebase.org.
Biomedical Informatics and Medical Education, University of Washington, Seattle, WA, 98195, USA. justin.guinney@sagebase.org.
