Development and Validation of Coding Algorithms to Identify Patients with Incident Non-Small Cell Lung Cancer in United States Healthcare Claims Data.

algorithm machine learning medicare claims non-small cell lung cancer positive predictive value sensitivity validation

Journal

Clinical epidemiology
ISSN: 1179-1349
Titre abrégé: Clin Epidemiol
Pays: New Zealand
ID NLM: 101531700

Informations de publication

Date de publication:
2023
Historique:
received: 14 09 2022
accepted: 23 12 2022
entrez: 20 1 2023
pubmed: 21 1 2023
medline: 21 1 2023
Statut: epublish

Résumé

We sought to develop and validate an incident non-small cell lung cancer (NSCLC) algorithm for United States (US) healthcare claims data. Diagnoses and procedures, but not medications, were incorporated to support longer-term relevance and reliability. Patients with newly diagnosed NSCLC per Surveillance, Epidemiology, and End Results (SEER) served as cases. Controls included newly diagnosed small-cell lung cancer and other lung cancers, and two 5% random samples for other cancer and without cancer. Algorithms derived from logistic regression and machine learning methods used the entire sample (Approach A) or started with a previous algorithm for those with lung cancer (Approach B). Sensitivity, specificity, positive predictive values (PPV), negative predictive values, and F-scores (compared for 1000 bootstrap samples) were calculated. Misclassification was evaluated by calculating the odds of selection by the algorithm among true positives and true negatives. The best performing algorithm utilized neural networks (Approach B). A 10-variable point-score algorithm was derived from logistic regression (Approach B); sensitivity was 77.69% and PPV = 67.61% (F-score = 72.30%). This algorithm was less sensitive for patients ≥80 years old, with Medicare follow-up time <3 months, or missing SEER data on stage, laterality, or site and less specific for patients with SEER primary site of main bronchus, SEER summary stage 2000 regional by direct extension only, or pre-index chronic pulmonary disease. Our study developed and validated a practical, 10-variable, point-based algorithm for identifying incident NSCLC cases in a US claims database based on a previously validated incident lung cancer algorithm.

Identifiants

pubmed: 36659903
doi: 10.2147/CLEP.S389824
pii: 389824
pmc: PMC9842515
doi:

Types de publication

Journal Article

Langues

eng

Pagination

73-89

Informations de copyright

© 2023 Beyrer et al.

Déclaration de conflit d'intérêts

JB, DRN, KMS, and YJH are employees and shareholders of Eli Lilly and Company. YKL was an employee of Eli Lilly and Company during the conduct of the study. ALH is an employee of the University of Cincinnati and reports grants from Eli Lilly during the conduct of the study. The authors report no other conflicts of interest in this work.

Références

JCO Clin Cancer Inform. 2018 Dec;2:1-10
pubmed: 30652573
Value Health. 2013 Jun;16(4):655-69
pubmed: 23796301
Front Pharmacol. 2017 Nov 30;8:883
pubmed: 29249970
Front Oncol. 2016 Feb 01;6:18
pubmed: 26870695
Lung Cancer. 2016 Dec;102:108-117
pubmed: 27987578
Clin Pharmacol Ther. 2021 May;109(5):1189-1196
pubmed: 32911562
Health Serv Res. 2004 Dec;39(6 Pt 1):1733-49
pubmed: 15533184
Pharmacoepidemiol Drug Saf. 2020 Nov;29(11):1465-1479
pubmed: 33012044
Cancer Epidemiol. 2015 Dec;39(6):1136-44
pubmed: 26138902
Proc Mach Learn Res. 2017 Aug;68:25-38
pubmed: 30542673
J Comp Eff Res. 2022 May;11(7):499-511
pubmed: 35296149
BMJ Open. 2018 Jul 23;8(7):e019264
pubmed: 30037859
JAMA Dermatol. 2016 Oct 1;152(10):1122-1127
pubmed: 27533718
J Clin Epidemiol. 2012 Feb;65(2):126-31
pubmed: 22075111
Ann Transl Med. 2017 Nov;5(21):436
pubmed: 29201888
Drug Saf. 2023 Jan;46(1):87-97
pubmed: 36396894
Stat Med. 2004 May 30;23(10):1631-60
pubmed: 15122742

Auteurs

Julie Beyrer (J)

Eli Lilly and Company, Indianapolis, IN, USA.

David R Nelson (DR)

Eli Lilly and Company, Indianapolis, IN, USA.

Kristin M Sheffield (KM)

Eli Lilly and Company, Indianapolis, IN, USA.

Yu-Jing Huang (YJ)

Eli Lilly and Company, Indianapolis, IN, USA.

Yiu-Keung Lau (YK)

Eli Lilly and Company, Indianapolis, IN, USA.

Ana L Hincapie (AL)

University of Cincinnati James L. Winkle College of Pharmacy, Cincinnati, OH, USA.

Classifications MeSH