Enriching Data Science and Health Care Education: Application and Impact of Synthetic Data Sets Through the Health Gym Project.

HIV accessibility antiretroviral therapy (ART) data privacy data science data sets educational purposes generative adversarial networks generative model health care AI human immunodeficiency virus (HIV) hypotension medical education privacy science education sepsis

Journal

JMIR medical education
ISSN: 2369-3762
Titre abrégé: JMIR Med Educ
Pays: Canada
ID NLM: 101684518

Informations de publication

Date de publication:
16 Jan 2024
Historique:
received: 30 07 2023
accepted: 08 11 2023
revised: 20 10 2023
medline: 16 1 2024
pubmed: 16 1 2024
entrez: 16 1 2024
Statut: epublish

Résumé

Large-scale medical data sets are vital for hands-on education in health data science but are often inaccessible due to privacy concerns. Addressing this gap, we developed the Health Gym project, a free and open-source platform designed to generate synthetic health data sets applicable to various areas of data science education, including machine learning, data visualization, and traditional statistical models. Initially, we generated 3 synthetic data sets for sepsis, acute hypotension, and antiretroviral therapy for HIV infection. This paper discusses the educational applications of Health Gym's synthetic data sets. We illustrate this through their use in postgraduate health data science courses delivered by the University of New South Wales, Australia, and a Datathon event, involving academics, students, clinicians, and local health district professionals. We also include adaptable worked examples using our synthetic data sets, designed to enrich hands-on tutorial and workshop experiences. Although we highlight the potential of these data sets in advancing data science education and health care artificial intelligence, we also emphasize the need for continued research into the inherent limitations of synthetic data.

Identifiants

pubmed: 38227356
pii: v10i1e51388
doi: 10.2196/51388
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e51388

Informations de copyright

©Nicholas I-Hsien Kuo, Oscar Perez-Concha, Mark Hanly, Emmanuel Mnatzaganian, Brandon Hao, Marcus Di Sipio, Guolin Yu, Jash Vanjara, Ivy Cerelia Valerie, Juliana de Oliveira Costa, Timothy Churches, Sanja Lujic, Jo Hegarty, Louisa Jorm, Sebastiano Barbieri. Originally published in JMIR Medical Education (https://mededu.jmir.org), 16.01.2024.

Auteurs

Nicholas I-Hsien Kuo (NI)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Oscar Perez-Concha (O)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Mark Hanly (M)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Emmanuel Mnatzaganian (E)

The University of New South Wales, Sydney, Australia.

Brandon Hao (B)

The University of New South Wales, Sydney, Australia.

Marcus Di Sipio (M)

The University of New South Wales, Sydney, Australia.

Guolin Yu (G)

The University of New South Wales, Sydney, Australia.

Jash Vanjara (J)

The University of New South Wales, Sydney, Australia.

Ivy Cerelia Valerie (IC)

The University of New South Wales, Sydney, Australia.

Juliana de Oliveira Costa (J)

Medicines Intelligence Research Program, School of Population Health, The University of New South Wales, Sydney, Australia.

Timothy Churches (T)

School of Clinical Medicine, University of New South Wales, Sydney, Australia.
Ingham Institute of Applied Medical Research, Liverpool, Sydney, Australia.

Sanja Lujic (S)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Jo Hegarty (J)

Sydney Local Health District, Sydney, Australia.

Louisa Jorm (L)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Sebastiano Barbieri (S)

Centre for Big Data Research in Health, The University of New South Wales, Sydney, Australia.

Classifications MeSH