Web Application for the Automated Extraction of Diagnosis and Site From Pathology Reports for Keratinocyte Cancers.


Journal

JCO clinical cancer informatics
ISSN: 2473-4276
Titre abrégé: JCO Clin Cancer Inform
Pays: United States
ID NLM: 101708809

Informations de publication

Date de publication:
08 2020
Historique:
entrez: 7 8 2020
pubmed: 7 8 2020
medline: 1 9 2021
Statut: ppublish

Résumé

Keratinocyte cancers are exceedingly common in high-risk populations, but accurate measures of incidence are seldom derived because the burden of manually reviewing pathology reports to extract relevant diagnostic information is excessive. Thus, we sought to develop supervised learning algorithms for classifying basal and squamous cell carcinomas and other diagnoses, as well as disease site, and incorporate these into a Web application capable of processing large numbers of pathology reports. Participants in the QSkin study were recruited in 2011 and comprised men and women age 40-69 years at baseline (N = 43,794) who were randomly selected from a population register in Queensland, Australia. Histologic data were manually extracted from free-text pathology reports for participants with histologically confirmed keratinocyte cancers for whom a pathology report was available (n = 25,786 reports). This provided a training data set for the development of algorithms capable of deriving diagnosis and site from free-text pathology reports. We calculated agreement statistics between algorithm-derived classifications and 3 independent validation data sets of manually abstracted pathology reports. The agreement for classifications of basal cell carcinoma (κ = 0.97 and κ = 0.96) and squamous cell carcinoma (κ = 0.93 for both) was almost perfect in 2 validation data sets but was slightly lower for a third (κ = 0.82 and κ = 0.90, respectively). Agreement for total counts of specific diagnoses was also high (κ > 0.8). Similar levels of agreement between algorithm-derived and manually extracted data were observed for classifications of keratoacanthoma and intraepidermal carcinoma. Supervised learning methods were used to develop a Web application capable of accurately and rapidly classifying large numbers of pathology reports for keratinocyte cancers and related diagnoses. Such tools may provide the means to accurately measure subtype-specific skin cancer incidence.

Identifiants

pubmed: 32755460
doi: 10.1200/CCI.19.00152
pmc: PMC7469600
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

711-723

Références

JCO Clin Cancer Inform. 2018 Dec;2:1-8
pubmed: 30652586
AMIA Annu Symp Proc. 2015 Nov 05;2015:953-62
pubmed: 26958232
Aust N Z J Public Health. 2016 Apr;40(2):154-8
pubmed: 26558736
J Pathol Inform. 2012;3:23
pubmed: 22934236
J Invest Dermatol. 2016 Jul;136(7):1382-1386
pubmed: 26968258
Methods Inf Med. 2012;51(3):242-51
pubmed: 21792466
J Invest Dermatol. 2012 Aug;132(8):2005-9
pubmed: 22475754
Int J Med Inform. 2014 Sep;83(9):605-23
pubmed: 25008281
CA Cancer J Clin. 2018 Nov;68(6):394-424
pubmed: 30207593
Med J Aust. 2012 Nov 19;197(10):565-8
pubmed: 23163687
AMIA Annu Symp Proc. 2006;:899
pubmed: 17238518
Med J Aust. 2006 Jan 2;184(1):6-10
pubmed: 16398622
BMC Med Res Methodol. 2019 Mar 19;19(1):64
pubmed: 30890124
JAMA Dermatol. 2015 Oct;151(10):1081-6
pubmed: 25928283
Int J Epidemiol. 2012 Aug;41(4):929-929i
pubmed: 22933644
J Am Coll Surg. 2007 Nov;205(5):690-7
pubmed: 17964445

Auteurs

Bridie S Thompson (BS)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.

Sam Hardy (S)

Otso, Brisbane, Queensland, Australia.

Nirmala Pandeya (N)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
School of Public Health, University of Queensland, Brisbane, Queensland, Australia.

Jean Claude Dusingize (JC)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.

Adele C Green (AC)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
Leeds Institute of Medical Research at St James's, University of Leeds, Leeds, United Kingdom.

Athon Millane (A)

School of Public Health, University of Queensland, Brisbane, Queensland, Australia.

Daniel Bourke (D)

Max Kelsen, Brisbane, Queensland, Australia.

Ronald Grande (R)

Max Kelsen, Brisbane, Queensland, Australia.

Cameron D Bean (CD)

Max Kelsen, Brisbane, Queensland, Australia.

Catherine M Olsen (CM)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
Faculty of Medicine, University of Queensland, Brisbane, Queensland, Australia.

David C Whiteman (DC)

Department of Population Health, QIMR Berghofer Medical Research Institute, Brisbane Queensland, Australia.
Faculty of Medicine, University of Queensland, Brisbane, Queensland, Australia.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH