Healthcare Research and Analytics Data Infrastructure Solution: A Data Warehouse for Health Services Research.
data integration
data warehousing
health services research
iterative process model
systems analysis and design
Journal
Journal of medical Internet research
ISSN: 1438-8871
Abbreviated title: J Med Internet Res
Country: Canada
NLM ID: 100959882
Publication information
Publication date: 2020-06-04
History:
received: 2020-03-05
accepted: 2020-04-16
revised: 2020-04-08
entrez: 2020-06-05
pubmed: 2020-06-05
medline: 2020-12-15
Status:
epublish
Abstract
BACKGROUND
Health services researchers spend a substantial amount of time integrating, cleansing, interpreting, and aggregating raw data from multiple public or private data sources. Often, each researcher (or someone on their team) duplicates this effort for their own project, facing the same challenges and the same pitfalls that others discovered before them.
OBJECTIVE
This paper describes a design process for creating a data warehouse that includes the most frequently used databases in health services research.
METHODS
The design is based on a conceptual iterative process model framework that uses the sociotechnical systems theory approach and includes the capacity for subsequent updates of existing data sources and the addition of new ones. We introduce the theory and the framework and then explain how they inform the methodology of this study.
RESULTS
We describe the application of the iterative process model to the design research process of problem identification and solution design for the Healthcare Research and Analytics Data Infrastructure Solution (HRADIS). Each phase of the iterative model produced end products to inform the implementation of HRADIS. The analysis phase produced the problem statement and requirements documents. The projection phase produced a list of tasks and goals for the ideal system. Finally, the synthesis phase provided the process for a plan to implement HRADIS. HRADIS structures and integrates the data dictionaries provided by the data sources, allowing the creation of dimensions and measures for a multidimensional business intelligence system. We discuss how HRADIS is complemented by a set of data mining, analytics, and visualization tools that enable researchers to apply multiple methods to a given research project more efficiently. HRADIS also includes a built-in security and account management framework for data governance, which provides customized authorization based on user roles and the parts of the data each role is authorized to access.
CONCLUSIONS
To address existing inefficiencies in the obtaining, extracting, preprocessing, cleansing, and filtering stages of data processing in health services research, we envision HRADIS as a full-service data warehouse integrating frequently used data sources, processes, and methods along with a variety of data analytics and visualization tools. This paper presents the application of the iterative process model to build such a solution. It also discusses several prominent issues, lessons learned, reflections and recommendations, and future considerations that arose as the model was applied.
Identifiers
pubmed: 32496199
pii: v22i6e18579
doi: 10.2196/18579
pmc: PMC7303827
Publication types
Journal Article
Languages
eng
Citation subsets
IM
Pagination
e18579
Copyright information
©Bunyamin Ozaydin, Ferhat Zengul, Nurettin Oner, Sue S Feldman. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 04.06.2020.