The Impact of Medical Big Data Anonymization on Early Acute Kidney Injury Risk Prediction.
Acute Kidney Injury
Data Anonymization
Data utility
Medical Big Data
Re-identification risk
Journal
AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science
ISSN: 2153-4063
Abbreviated title: AMIA Jt Summits Transl Sci Proc
Country: United States
NLM ID: 101539486
Publication information
Publication date:
2020
History:
entrez: 2 6 2020
pubmed: 2 6 2020
medline: 2 6 2020
Status: epublish
Abstract
Artificial intelligence-enabled medical big data analysis has the potential to revolutionize medical practice, from diagnosis and prediction of complex diseases to making recommendations and resource allocation decisions in an evidence-based manner. However, big data comes with big disclosure risks. To preserve privacy, excessive data anonymization is often necessary, leading to significant loss of data utility. In this paper, we develop a systematic data scrubbing procedure for large datasets in which the key variables for re-identification risk assessment are uncertain. We then assess the trade-off between anonymizing electronic health record data for sharing in support of open science and the performance of machine learning models that use the data for early acute kidney injury risk prediction. Results demonstrate that our proposed data scrubbing procedure can maintain good feature diversity and moderate data utility, but it raises concerns regarding its impact on knowledge discovery capability.
Publication types
Journal Article
Languages
eng
Pagination
617-625
Copyright information
©2020 AMIA - All rights reserved.