Data Management for Health Data Reuse: Proposal of a Standard Workflow and a R Tutorial with Jupyter Notebook.
Data Science
Data management
Data reuse
Education
Programming
Journal
Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582
Informations de publication
Date de publication:
31 Aug 2022
31 Aug 2022
Historique:
entrez:
8
9
2022
pubmed:
9
9
2022
medline:
11
9
2022
Statut:
ppublish
Résumé
The data collected in the clinical registries or by data reuse require some modifications in order to suit the research needs. Several common operations are frequently applied to select relevant patients across the cohort, combine data from multiple sources, add new variables if needed and create unique tables depending on the research purpose. We carried out a qualitative survey by conducting semi-structured interviews with 7 experts in data reuse and proposed a standard workflow for health data management. We implemented a R tutorial based on a synthetic data set using Jupyter Notebook for a better understanding of the data management workflow.
Identifiants
pubmed: 36073461
pii: SHTI220912
doi: 10.3233/SHTI220912
doi:
Types de publication
Journal Article
Langues
eng