Privacy Risk Assessment for Synthetic Longitudinal Health Data.

Data sharing Epidemiological study Privacy risk assessment Synthetic data

Journal

Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582

Informations de publication

Date de publication:
30 Aug 2024
Historique:
medline: 5 9 2024
pubmed: 5 9 2024
entrez: 5 9 2024
Statut: ppublish

Résumé

A modern approach to ensuring privacy when sharing datasets is the use of synthetic data generation methods, which often claim to outperform classic anonymization techniques in the trade-off between data utility and privacy. Recently, it was demonstrated that various deep learning-based approaches are able to generate useful synthesized datasets, often based on domain-specific analyses. However, evaluating the privacy implications of releasing synthetic data remains a challenging problem, especially when the goal is to conform with data protection guidelines. Therefore, the recent privacy risk quantification framework Anonymeter has been built for evaluating multiple possible vulnerabilities, which are specifically based on privacy risks that are considered by the European Data Protection Board, i.e. singling out, linkability, and attribute inference. This framework was applied to a synthetic data generation study from the epidemiological domain, where the synthesization replicates time and age trends previously found in data collected during the DONALD cohort study (1312 participants, 16 time points). The conducted privacy analyses are presented, which place a focus on the vulnerability of outliers. The resulting privacy scores are discussed, which vary greatly between the different types of attacks. Challenges encountered during their implementation and during the interpretation of their results are highlighted, and it is concluded that privacy risk assessment for synthetic data remains an open problem.

Identifiants

pubmed: 39234731
pii: SHTI240867
doi: 10.3233/SHTI240867
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

270-279

Auteurs

Julian Schneider (J)

Knowledge Management, ZB MED - Information Centre for Life Sciences, Cologne, Germany.

Marvin Walter (M)

Knowledge Management, ZB MED - Information Centre for Life Sciences, Cologne, Germany.

Karen Otte (K)

Medical Informatics Group, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.

Thierry Meurers (T)

Medical Informatics Group, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.

Ines Perrar (I)

Institute of Nutritional and Food Sciences - Nutritional Epidemiology, University of Bonn, Bonn, Germany.

Ute Nöthlings (U)

Institute of Nutritional and Food Sciences - Nutritional Epidemiology, University of Bonn, Bonn, Germany.

Tim Adams (T)

Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing SCAI, Sankt Augustin, Germany.

Holger Fröhlich (H)

Department of Bioinformatics, Fraunhofer Institute for Algorithms and Scientific Computing SCAI, Sankt Augustin, Germany.

Fabian Prasser (F)

Medical Informatics Group, Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Berlin, Germany.

Juliane Fluck (J)

Knowledge Management, ZB MED - Information Centre for Life Sciences, Cologne, Germany.
The Agricultural Faculty, University of Bonn, Bonn, Germany.

Lisa Kühnel (L)

Knowledge Management, ZB MED - Information Centre for Life Sciences, Cologne, Germany.
Graduate School DILS, Bielefeld Institute for Bioinformatics Infrastructure (BIBI), Faculty of Technology, Bielefeld University, Bielefeld, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH