Data Integration to Improve Real-world Health Outcomes Research for Non-Small Cell Lung Cancer in the United States: Descriptive and Qualitative Exploration.
administrative claims data
cancer
data aggregation
electronic health record
health services research
medical records
non–small cell lung cancer
population health
real-world data
retrospective study
Journal
JMIR cancer
ISSN: 2369-1999
Titre abrégé: JMIR Cancer
Pays: Canada
ID NLM: 101666844
Informations de publication
Date de publication:
12 Apr 2021
12 Apr 2021
Historique:
received:
04
08
2020
accepted:
01
02
2021
revised:
29
01
2021
entrez:
12
4
2021
pubmed:
13
4
2021
medline:
13
4
2021
Statut:
epublish
Résumé
The integration of data from disparate sources could help alleviate data insufficiency in real-world studies and compensate for the inadequacies of single data sources and short-duration, small sample size studies while improving the utility of data for research. This study aims to describe and evaluate a process of integrating data from several complementary sources to conduct health outcomes research in patients with non-small cell lung cancer (NSCLC). The integrated data set is also used to describe patient demographics, clinical characteristics, treatment patterns, and mortality rates. This retrospective cohort study integrated data from 4 sources: administrative claims from the HealthCore Integrated Research Database, clinical data from a Cancer Care Quality Program (CCQP), clinical data from abstracted medical records (MRs), and mortality data from the US Social Security Administration. Patients with lung cancer who initiated second-line (2L) therapy between November 01, 2015, and April 13, 2018, were identified in the claims and CCQP data. Eligible patients were 18 years or older and received atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab in the 2L setting. The main analysis cohort included patients with claims data and data from at least one additional data source (CCQP or MR). Patients without integrated data (claims only) were reported separately. Descriptive and univariate statistics were reported. Data integration resulted in a main analysis cohort of 2195 patients with NSCLC; 2106 patients had CCQP and 407 patients had MR data. The claims-only cohort included 931 eligible patients. For the main analysis cohort, the mean age was 62.1 (SD 9.27) years, 48.56% (1066/2195) were female, the median length of follow-up was 6.8 months, and for 37.77% (829/2195), death was observed. For the claims-only cohort, the mean age was 66.6 (SD 12.69) years, 52.1% (485/931) were female, the median length of follow-up was 8.6 months, and for 29.3% (273/931), death was observed. The most frequent 2L treatment was immunotherapy (1094/2195, 49.84%), followed by platinum-based regimens (472/2195, 21.50%) and single-agent chemotherapy (441/2195, 20.09%); mean duration of 2L therapy was 5.6 (SD 4.9, median 4) months. We describe challenges and learnings from the data integration process, and the benefits of the integrated data set, which includes a richer set of clinical and outcome data to supplement the utilization metrics available in administrative claims. The management of patients with NSCLC requires care from a multidisciplinary team, leading to a lack of a single aggregated data source in real-world settings. The availability of integrated clinical data from MRs, health plan claims, and other sources of clinical care may improve the ability to assess emerging treatments.
Sections du résumé
BACKGROUND
BACKGROUND
The integration of data from disparate sources could help alleviate data insufficiency in real-world studies and compensate for the inadequacies of single data sources and short-duration, small sample size studies while improving the utility of data for research.
OBJECTIVE
OBJECTIVE
This study aims to describe and evaluate a process of integrating data from several complementary sources to conduct health outcomes research in patients with non-small cell lung cancer (NSCLC). The integrated data set is also used to describe patient demographics, clinical characteristics, treatment patterns, and mortality rates.
METHODS
METHODS
This retrospective cohort study integrated data from 4 sources: administrative claims from the HealthCore Integrated Research Database, clinical data from a Cancer Care Quality Program (CCQP), clinical data from abstracted medical records (MRs), and mortality data from the US Social Security Administration. Patients with lung cancer who initiated second-line (2L) therapy between November 01, 2015, and April 13, 2018, were identified in the claims and CCQP data. Eligible patients were 18 years or older and received atezolizumab, docetaxel, erlotinib, nivolumab, pembrolizumab, pemetrexed, or ramucirumab in the 2L setting. The main analysis cohort included patients with claims data and data from at least one additional data source (CCQP or MR). Patients without integrated data (claims only) were reported separately. Descriptive and univariate statistics were reported.
RESULTS
RESULTS
Data integration resulted in a main analysis cohort of 2195 patients with NSCLC; 2106 patients had CCQP and 407 patients had MR data. The claims-only cohort included 931 eligible patients. For the main analysis cohort, the mean age was 62.1 (SD 9.27) years, 48.56% (1066/2195) were female, the median length of follow-up was 6.8 months, and for 37.77% (829/2195), death was observed. For the claims-only cohort, the mean age was 66.6 (SD 12.69) years, 52.1% (485/931) were female, the median length of follow-up was 8.6 months, and for 29.3% (273/931), death was observed. The most frequent 2L treatment was immunotherapy (1094/2195, 49.84%), followed by platinum-based regimens (472/2195, 21.50%) and single-agent chemotherapy (441/2195, 20.09%); mean duration of 2L therapy was 5.6 (SD 4.9, median 4) months. We describe challenges and learnings from the data integration process, and the benefits of the integrated data set, which includes a richer set of clinical and outcome data to supplement the utilization metrics available in administrative claims.
CONCLUSIONS
CONCLUSIONS
The management of patients with NSCLC requires care from a multidisciplinary team, leading to a lack of a single aggregated data source in real-world settings. The availability of integrated clinical data from MRs, health plan claims, and other sources of clinical care may improve the ability to assess emerging treatments.
Identifiants
pubmed: 33843600
pii: v7i2e23161
doi: 10.2196/23161
pmc: PMC8076987
doi:
Types de publication
Journal Article
Langues
eng
Pagination
e23161Informations de copyright
©Michael Grabner, Cliff Molife, Liya Wang, Katherine B Winfree, Zhanglin Lin Cui, Gebra Cuyun Carter, Lisa M Hess. Originally published in JMIR Cancer (http://cancer.jmir.org), 12.04.2021.
Références
Pharmacoepidemiol Drug Saf. 2019 Feb;28(2):171-178
pubmed: 30411431
J Am Med Inform Assoc. 2019 Jul 1;26(7):594-602
pubmed: 30938759
Comput Methods Programs Biomed. 2018 Jun;159:87-101
pubmed: 29650322
Value Health. 2007 Sep-Oct;10(5):326-35
pubmed: 17888097
J Multidiscip Healthc. 2018 Jul 02;11:295-304
pubmed: 29997436
Front Pharmacol. 2017 Nov 30;8:883
pubmed: 29249970
Clin Lung Cancer. 2018 Sep;19(5):e629-e645
pubmed: 29885945
Med Care. 2017 Dec;55(12):e144-e149
pubmed: 29135778
JCO Clin Cancer Inform. 2018 Dec;2:1-12
pubmed: 30652555
N Engl J Med. 2015 Jul 9;373(2):123-35
pubmed: 26028407
J Natl Compr Canc Netw. 2018 Oct;16(10):1171-1182
pubmed: 30323087
Curr Med Res Opin. 2016 Sep;32(9):1589-97
pubmed: 27207562
JCO Clin Cancer Inform. 2018 Dec;2:1-10
pubmed: 30652573
Lung Cancer. 2014 Oct;86(1):22-8
pubmed: 25172266
Radiat Oncol. 2019 Jan 28;14(1):18
pubmed: 30691492
J Am Med Inform Assoc. 2005 Jan-Feb;12(1):3-7
pubmed: 15492029
J Oncol Pract. 2014 May;10(3):e150-9
pubmed: 24520142
EGEMS (Wash DC). 2016 Apr 13;4(1):1209
pubmed: 27195308
Med Care. 2018 Dec;56(12):e83-e89
pubmed: 29334524
Lancet. 2016 Apr 9;387(10027):1540-1550
pubmed: 26712084
Eur J Clin Pharmacol. 2012 Feb;68(2):123-9
pubmed: 21808989
Transl Lung Cancer Res. 2019 Sep;8(Suppl 2):S172-S183
pubmed: 31673522
JMIR Res Protoc. 2017 Oct 11;6(10):e195
pubmed: 29021129
Am J Public Health. 2020 May;110(5):e2
pubmed: 32267743
CA Cancer J Clin. 2020 Jan;70(1):7-30
pubmed: 31912902
Int J Appl Basic Med Res. 2015 May-Aug;5(2):82
pubmed: 26097811
JAMA. 2007 Jan 3;297(1):71-6
pubmed: 17200477
PLoS One. 2019 Jan 4;14(1):e0209709
pubmed: 30608948
Am J Epidemiol. 2011 Mar 15;173(6):676-82
pubmed: 21330339
Clin Trials. 2019 Aug;16(4):419-430
pubmed: 31081367
BMC Med. 2018 Feb 16;16(1):24
pubmed: 29448944
J Thorac Dis. 2015 Nov;7(11):E541-7
pubmed: 26716052
N Engl J Med. 2015 Oct 22;373(17):1627-39
pubmed: 26412456
J Manag Care Spec Pharm. 2017 Jun;23(6):643-652
pubmed: 28530522
Future Oncol. 2019 Oct;15(30):3491-3502
pubmed: 31497994
JCO Clin Cancer Inform. 2019 May;3:1-19
pubmed: 31070985
Lancet. 2014 Aug 23;384(9944):665-73
pubmed: 24933332
PLoS One. 2017 Apr 14;12(4):e0175679
pubmed: 28410405
Future Oncol. 2019 Sep;15(25):2943-2953
pubmed: 31037966
PLoS One. 2017 Jun 23;12(6):e0178420
pubmed: 28644837
Clin Lung Cancer. 2018 Jul;19(4):360-370
pubmed: 29576407
Lung Cancer. 2019 Feb;128:74-90
pubmed: 30642457
J Comp Eff Res. 2018 Dec;7(12):1133-1143
pubmed: 30411972
Epidemiology. 2019 May;30(3):466-471
pubmed: 30829831