Validation Of Cancer Diagnoses In Electronic Health Records: Results From The Information System For Research In Primary Care (SIDIAP) In Northeast Spain.
cancer
electronic health records
population-based cancer registries
primary health care
validation studies
Journal
Clinical epidemiology
ISSN: 1179-1349
Titre abrégé: Clin Epidemiol
Pays: New Zealand
ID NLM: 101531700
Informations de publication
Date de publication:
2019
2019
Historique:
received:
31
07
2019
accepted:
30
10
2019
entrez:
11
12
2019
pubmed:
11
12
2019
medline:
11
12
2019
Statut:
epublish
Résumé
Electronic health records are becoming an increasingly valuable resource for epidemiology but their data quality needs to be quantified. We aimed to validate twenty-five types of incident cancer cases in the Information System for Research in Primary Care (SIDIAP) in Catalonia with the population-based cancer registries of Girona and Tarragona as the gold-standard. We calculated the sensitivity, positive predictive values (PPV), and the time-difference between the date of diagnosis entered into the SIDIAP and into the registries. We added hospital discharge cancer diagnoses to the SIDIAP to assess sensitivity changes. We identified 27,046 incident cancer diagnoses in the SIDIAP from 2009-2015 among the 949,841 residents of Girona and Tarragona. The cancer types with the highest sensitivity were breast (89%, 95% CI: 88-90%), colorectal (81%, 95% CI: 80-82%), and prostate (81%, 95% CI: 80-83%). Trachea, bronchus and lung cancers had the highest PPV (76%, 95% CI: 74%-78%) followed by stomach (72%, 95% CI: 68-75%) and pancreas (71%, 95% CI: 67-75%). Most cancer diagnoses were reported with less than three months of difference between the SIDIAP and the registries. More cases were registered first in the registries than in the SIDIAP. By adding cancer diagnoses based on hospital discharge data, sensitivity increased for all cancers, especially for gallbladder and biliary tract for which the sensitivity increased by 21%. The SIDIAP includes 76% of the cancer diagnoses in the cancer registries but includes a considerable number of cases that are not in the registries. The SIDIAP reports most of the cancer diagnoses within a three-month period difference from the date of diagnosis in the cancer registries. Our results support the use of the SIDIAP cancer diagnoses for epidemiological research when cancer is the outcome of interest. We recommend adding hospital discharge data to the SIDIAP to increase data quality, particularly for less frequent cancer types.
Sections du résumé
BACKGROUND
BACKGROUND
Electronic health records are becoming an increasingly valuable resource for epidemiology but their data quality needs to be quantified. We aimed to validate twenty-five types of incident cancer cases in the Information System for Research in Primary Care (SIDIAP) in Catalonia with the population-based cancer registries of Girona and Tarragona as the gold-standard.
METHODS
METHODS
We calculated the sensitivity, positive predictive values (PPV), and the time-difference between the date of diagnosis entered into the SIDIAP and into the registries. We added hospital discharge cancer diagnoses to the SIDIAP to assess sensitivity changes.
RESULTS
RESULTS
We identified 27,046 incident cancer diagnoses in the SIDIAP from 2009-2015 among the 949,841 residents of Girona and Tarragona. The cancer types with the highest sensitivity were breast (89%, 95% CI: 88-90%), colorectal (81%, 95% CI: 80-82%), and prostate (81%, 95% CI: 80-83%). Trachea, bronchus and lung cancers had the highest PPV (76%, 95% CI: 74%-78%) followed by stomach (72%, 95% CI: 68-75%) and pancreas (71%, 95% CI: 67-75%). Most cancer diagnoses were reported with less than three months of difference between the SIDIAP and the registries. More cases were registered first in the registries than in the SIDIAP. By adding cancer diagnoses based on hospital discharge data, sensitivity increased for all cancers, especially for gallbladder and biliary tract for which the sensitivity increased by 21%.
CONCLUSION
CONCLUSIONS
The SIDIAP includes 76% of the cancer diagnoses in the cancer registries but includes a considerable number of cases that are not in the registries. The SIDIAP reports most of the cancer diagnoses within a three-month period difference from the date of diagnosis in the cancer registries. Our results support the use of the SIDIAP cancer diagnoses for epidemiological research when cancer is the outcome of interest. We recommend adding hospital discharge data to the SIDIAP to increase data quality, particularly for less frequent cancer types.
Identifiants
pubmed: 31819655
doi: 10.2147/CLEP.S225568
pii: 225568
pmc: PMC6899079
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1015-1024Informations de copyright
© 2019 Recalde et al.
Déclaration de conflit d'intérêts
The authors report no conflicts of interest in this work.
Références
Gac Sanit. 2008 May-Jun;22(3):179-87
pubmed: 18579042
J Clin Epidemiol. 2012 Mar;65(3):343-349.e2
pubmed: 22197520
Cancer Epidemiol. 2012 Oct;36(5):425-9
pubmed: 22727737
Pharmacoepidemiol Drug Saf. 2009 Aug;18(8):730-6
pubmed: 19479713
Med Clin (Barc). 2012 May 19;138(14):617-21
pubmed: 22444996
Breathe (Sheff). 2019 Mar;15(1):64-68
pubmed: 30838062
Ann Oncol. 2010 May;21 Suppl 3:iii3-13
pubmed: 20427357
PLoS One. 2014 Oct 20;9(10):e109706
pubmed: 25329578
Clin Transl Oncol. 2017 Jul;19(7):799-825
pubmed: 28093701
Inform Prim Care. 2011;19(3):135-45
pubmed: 22688222
Pharmacoepidemiol Drug Saf. 2013 Feb;22(2):168-75
pubmed: 23239282
Rev Esp Cardiol (Engl Ed). 2012 Jan;65(1):29-37
pubmed: 22036238
Epidemiology. 2018 Mar;29(2):308-313
pubmed: 29135571
Br J Clin Pharmacol. 2010 Jan;69(1):4-14
pubmed: 20078607
Br J Surg. 2015 Jan;102(2):e93-e101
pubmed: 25627139