Measuring follow-up time in routinely-collected health datasets: Challenges and solutions.
Algorithms
Continuity of Patient Care
/ standards
Data Collection
/ methods
Databases, Factual
Datasets as Topic
/ standards
Diagnostic Tests, Routine
/ standards
Electronic Health Records
/ organization & administration
Female
Follow-Up Studies
Humans
Incidence
Longitudinal Studies
Male
Medical Record Linkage
/ standards
Practice Patterns, Physicians'
/ statistics & numerical data
Primary Health Care
/ organization & administration
Research Design
Stroke
/ drug therapy
Time Factors
Wales
/ epidemiology
Warfarin
/ therapeutic use
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2020
2020
Historique:
received:
02
09
2019
accepted:
18
01
2020
entrez:
12
2
2020
pubmed:
12
2
2020
medline:
7
5
2020
Statut:
epublish
Résumé
A key requirement for longitudinal studies using routinely-collected health data is to be able to measure what individuals are present in the datasets used, and over what time period. Individuals can enter and leave the covered population of administrative datasets for a variety of reasons, including both life events and characteristics of the datasets themselves. An automated, customizable method of determining individuals' presence was developed for the primary care dataset in Swansea University's SAIL Databank. The primary care dataset covers only a portion of Wales, with 76% of practices participating. The start and end date of the data varies by practice. Additionally, individuals can change practices or leave Wales. To address these issues, a two step process was developed. First, the period for which each practice had data available was calculated by measuring changes in the rate of events recorded over time. Second, the registration records for each individual were simplified. Anomalies such as short gaps and overlaps were resolved by applying a set of rules. The result of these two analyses was a cleaned set of records indicating start and end dates of available primary care data for each individual. Analysis of GP records showed that 91.0% of events occurred within periods calculated as having available data by the algorithm. 98.4% of those events were observed at the same practice of registration as that computed by the algorithm. A standardized method for solving this common problem has enabled faster development of studies using this data set. Using a rigorous, tested, standardized method of verifying presence in the study population will also positively influence the quality of research.
Identifiants
pubmed: 32045428
doi: 10.1371/journal.pone.0228545
pii: PONE-D-19-24676
pmc: PMC7012444
doi:
Substances chimiques
Warfarin
5Q7ZVV76EI
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0228545Subventions
Organisme : Department of Health
ID : RP-PG-0407-10314
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/M501633/2
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/S004084/1
Pays : United Kingdom
Organisme : Medical Research Council
ID : MR/K006584/1
Pays : United Kingdom
Organisme : Department of Health
ID : 05/40/04
Pays : United Kingdom
Organisme : Medical Research Council
ID : MC_PC_13041
Pays : United Kingdom
Organisme : Medical Research Council
ID : G0902393
Pays : United Kingdom
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
PLoS One. 2015 Oct 15;10(10):e0140817
pubmed: 26469346
J Med Internet Res. 2018 May 29;20(5):e185
pubmed: 29844010
BMC Med Inform Decis Mak. 2009 Jan 16;9:3
pubmed: 19149883
Int J Epidemiol. 2015 Jun;44(3):827-36
pubmed: 26050254
J Biomed Inform. 2013 Oct;46(5):830-6
pubmed: 23820016
BMC Health Serv Res. 2009 Sep 04;9:157
pubmed: 19732426
PLoS Med. 2007 Feb;4(2):e28
pubmed: 17326704
Am J Epidemiol. 2006 May 1;163(9):783-9
pubmed: 16510544