Experiences of Transforming a Complex Nephrologic Care and Research Database into i2b2 Using the IDRT Tools.
Biomedical Research
/ statistics & numerical data
Computational Biology
Data Management
Databases, Factual
/ statistics & numerical data
Electronic Health Records
/ statistics & numerical data
Germany
Humans
Information Storage and Retrieval
Kidney Transplantation
/ statistics & numerical data
Nephrology
/ statistics & numerical data
Software
Journal
Journal of healthcare engineering
ISSN: 2040-2295
Titre abrégé: J Healthc Eng
Pays: England
ID NLM: 101528166
Informations de publication
Date de publication:
2019
2019
Historique:
received:
07
05
2018
revised:
18
09
2018
accepted:
05
12
2018
entrez:
26
2
2019
pubmed:
26
2
2019
medline:
18
8
2020
Statut:
epublish
Résumé
The secondary use of data from electronic medical records has become an important factor to determine and to identify various causes of disease. For this reason, applications like informatics for integrating biology and the bedside (i2b2) offer a GUI-based front end to select patient cohorts. To make use of those tools, however, clinical data need to be extracted from the Electronic Health Record (EHR) system and integrated into the data schema of i2b2. We used TBase, a documentation system for nephrologic transplantations, as a source system and applied the Integrated Data Repository Toolkit (IDRT) for the Extract, Transform, and Load (ETL) process to load the data into i2b2. Since i2b2 uses an entity-attribute-value (EAV) schema, which is a fundamentally different way of modeling data in comparison to a standard relational schema in TBase, we evaluated if (a) the data relationship of the source system entities can still be represented in the i2b2 schema and if (b) the IDRT is a suitable solution for loading the data of a comprehensive data schema like TBase into i2b2. For that reason, we identified entities in the TBase data schema which were relevant for answering questions on cohort identification. By doing so, we found out that the entities had different structures that needed to be handled differently for the ETL process. Furthermore, the use of IDRT revealed shortcomings with regard to large input data and specific data structures that are part of most modern EHR systems. However, this project also showed that our way of modeling the TBase data in i2b2 has been proven to be successful in terms of answering the most common questions of clinicians on cohort identification.
Identifiants
pubmed: 30800257
doi: 10.1155/2019/5640685
pmc: PMC6360056
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
5640685Références
Appl Clin Inform. 2011 Mar 30;2(1):116-27
pubmed: 23616864
Int J Med Inform. 2017 Jun;102:138-149
pubmed: 28495342
Stud Health Technol Inform. 2011;169:502-6
pubmed: 21893800
Methods Inf Med. 2009;48(1):38-44
pubmed: 19151882
J Biomed Inform. 2016 Oct;63:277-294
pubmed: 27507090
Int J Med Inform. 2007 Nov-Dec;76(11-12):769-79
pubmed: 17098467
PLoS One. 2015 Jan 14;10(1):e0116656
pubmed: 25588043
Methods Inf Med. 2016;55(2):125-35
pubmed: 26534843
BMC Bioinformatics. 2012 Mar 28;13 Suppl 4:S5
pubmed: 22536972
BMC Bioinformatics. 2012;13 Suppl 14:S1
pubmed: 23095472
AMIA Jt Summits Transl Sci Proc. 2013 Mar 18;2013:203-7
pubmed: 24303265
J Am Med Inform Assoc. 2010 Mar-Apr;17(2):124-30
pubmed: 20190053