A machine learning approach to identify distinct subgroups of veterans at risk for hospitalization or death using administrative and electronic health record data.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2021
Historique:
received: 12 10 2020
accepted: 02 02 2021
entrez: 19 2 2021
pubmed: 20 2 2021
medline: 24 8 2021
Statut: epublish

Résumé

Identifying individuals at risk for future hospitalization or death has been a major priority of population health management strategies. High-risk individuals are a heterogeneous group, and existing studies describing heterogeneity in high-risk individuals have been limited by data focused on clinical comorbidities and not socioeconomic or behavioral factors. We used machine learning clustering methods and linked comorbidity-based, sociodemographic, and psychobehavioral data to identify subgroups of high-risk Veterans and study long-term outcomes, hypothesizing that factors other than comorbidities would characterize several subgroups. In this cross-sectional study, we used data from the VA Corporate Data Warehouse, a national repository of VA administrative claims and electronic health data. To identify high-risk Veterans, we used the Care Assessment Needs (CAN) score, a routinely-used VA model that predicts a patient's percentile risk of hospitalization or death at one year. Our study population consisted of 110,000 Veterans who were randomly sampled from 1,920,436 Veterans with a CAN score≥75th percentile in 2014. We categorized patient-level data into 119 independent variables based on demographics, comorbidities, pharmacy, vital signs, laboratories, and prior utilization. We used a previously validated density-based clustering algorithm to identify 30 subgroups of high-risk Veterans ranging in size from 50 to 2,446 patients. Mean CAN score ranged from 72.4 to 90.3 among subgroups. Two-year mortality ranged from 0.9% to 45.6% and was highest in the home-based care and metastatic cancer subgroups. Mean inpatient days ranged from 1.4 to 30.5 and were highest in the post-surgery and blood loss anemia subgroups. Mean emergency room visits ranged from 1.0 to 4.3 and were highest in the chronic sedative use and polysubstance use with amphetamine predominance subgroups. Five subgroups were distinguished by psychobehavioral factors and four subgroups were distinguished by sociodemographic factors. High-risk Veterans are a heterogeneous population consisting of multiple distinct subgroups-many of which are not defined by clinical comorbidities-with distinct utilization and outcome patterns. To our knowledge, this represents the largest application of ML clustering methods to subgroup a high-risk population. Further study is needed to determine whether distinct subgroups may benefit from individualized interventions.

Sections du résumé

BACKGROUND
Identifying individuals at risk for future hospitalization or death has been a major priority of population health management strategies. High-risk individuals are a heterogeneous group, and existing studies describing heterogeneity in high-risk individuals have been limited by data focused on clinical comorbidities and not socioeconomic or behavioral factors. We used machine learning clustering methods and linked comorbidity-based, sociodemographic, and psychobehavioral data to identify subgroups of high-risk Veterans and study long-term outcomes, hypothesizing that factors other than comorbidities would characterize several subgroups.
METHODS AND FINDINGS
In this cross-sectional study, we used data from the VA Corporate Data Warehouse, a national repository of VA administrative claims and electronic health data. To identify high-risk Veterans, we used the Care Assessment Needs (CAN) score, a routinely-used VA model that predicts a patient's percentile risk of hospitalization or death at one year. Our study population consisted of 110,000 Veterans who were randomly sampled from 1,920,436 Veterans with a CAN score≥75th percentile in 2014. We categorized patient-level data into 119 independent variables based on demographics, comorbidities, pharmacy, vital signs, laboratories, and prior utilization. We used a previously validated density-based clustering algorithm to identify 30 subgroups of high-risk Veterans ranging in size from 50 to 2,446 patients. Mean CAN score ranged from 72.4 to 90.3 among subgroups. Two-year mortality ranged from 0.9% to 45.6% and was highest in the home-based care and metastatic cancer subgroups. Mean inpatient days ranged from 1.4 to 30.5 and were highest in the post-surgery and blood loss anemia subgroups. Mean emergency room visits ranged from 1.0 to 4.3 and were highest in the chronic sedative use and polysubstance use with amphetamine predominance subgroups. Five subgroups were distinguished by psychobehavioral factors and four subgroups were distinguished by sociodemographic factors.
CONCLUSIONS
High-risk Veterans are a heterogeneous population consisting of multiple distinct subgroups-many of which are not defined by clinical comorbidities-with distinct utilization and outcome patterns. To our knowledge, this represents the largest application of ML clustering methods to subgroup a high-risk population. Further study is needed to determine whether distinct subgroups may benefit from individualized interventions.

Identifiants

pubmed: 33606819
doi: 10.1371/journal.pone.0247203
pii: PONE-D-20-31997
pmc: PMC7894856
doi:

Types de publication

Journal Article Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0247203

Subventions

Organisme : HSRD VA
ID : I21 HX002564
Pays : United States
Organisme : HSRD VA
ID : IK6 HX003157
Pays : United States

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

J Am Acad Orthop Surg. 2020 Nov 15;28(22):923-929
pubmed: 32004175
Am J Manag Care. 2020 Jan;26(1):40-44
pubmed: 31951358
Med Care. 2013 Apr;51(4):368-73
pubmed: 23269113
JAMA Intern Med. 2013 Sep 9;173(16):1549-52
pubmed: 23835865
JAMA. 2016 Feb 16;315(7):651-2
pubmed: 26881365
Med Care. 2014 Mar;52 Suppl 3:S31-6
pubmed: 24561756
N Engl J Med. 2016 Dec 8;375(23):2218-2220
pubmed: 27959672
Health Aff (Millwood). 2010 Apr;29(4):629-38
pubmed: 20368592
J Gen Intern Med. 2019 Feb;34(2):218-225
pubmed: 30511290
Healthc (Amst). 2016 Sep;4(3):160-5
pubmed: 27637821
J Gen Intern Med. 2019 Feb;34(2):211-217
pubmed: 30543022
Issue Brief (Commonw Fund). 2016 Aug;26:1-14
pubmed: 27571599
Health Aff (Millwood). 2014 Jul;33(7):1203-11
pubmed: 25006147
JAMA Netw Open. 2020 Sep 1;3(9):e2018318
pubmed: 32986109
Issue Brief (Commonw Fund). 2014 Aug;19:1-19
pubmed: 25115035
JAMA. 2016 Oct 25;316(16):1657-1658
pubmed: 27669168
Healthc (Amst). 2017 Mar;5(1-2):62-67
pubmed: 27914968
J Gen Intern Med. 2018 Dec;33(12):2120-2126
pubmed: 30225769
Health Aff (Millwood). 2016 Nov 1;35(11):2109-2115
pubmed: 27834253
N Engl J Med. 2017 Apr 27;376(17):1609-1612
pubmed: 28445664
JAMA. 2009 Feb 11;301(6):603-18
pubmed: 19211468
Am J Manag Care. 2009 Jul;15(7):457-64
pubmed: 19589013
Issue Brief (Commonw Fund). 2019 Jan 01;2019:1-17
pubmed: 30645057
J Rehabil Res Dev. 2010;47(8):739-50
pubmed: 21141302
BMJ Open. 2015 Apr 16;5(4):e007771
pubmed: 25882486

Auteurs

Ravi B Parikh (RB)

Corporal Michael J. Crescenz Veterans Affairs Medical Center, Philadelphia, Pennsylvania, United States of America.
VA Center for Health Equity Research and Promotion, Pittsburgh, Pennsylvania, United States of America.
Department of Medical Ethics and Health Policy, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Kristin A Linn (KA)

Department of Biostatistics, Epidemiology and Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Jiali Yan (J)

Department of Medical Ethics and Health Policy, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Matthew L Maciejewski (ML)

Durham Center of Innovation to Accelerate Discovery and Practice Transformation, Durham Veterans Affairs Health Care System, Durham, North Carolina, United States of America.

Ann-Marie Rosland (AM)

VA Center for Health Equity Research and Promotion, Pittsburgh, Pennsylvania, United States of America.

Kevin G Volpp (KG)

Corporal Michael J. Crescenz Veterans Affairs Medical Center, Philadelphia, Pennsylvania, United States of America.
VA Center for Health Equity Research and Promotion, Pittsburgh, Pennsylvania, United States of America.
Department of Medical Ethics and Health Policy, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Peter W Groeneveld (PW)

VA Center for Health Equity Research and Promotion, Pittsburgh, Pennsylvania, United States of America.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Amol S Navathe (AS)

Corporal Michael J. Crescenz Veterans Affairs Medical Center, Philadelphia, Pennsylvania, United States of America.
VA Center for Health Equity Research and Promotion, Pittsburgh, Pennsylvania, United States of America.
Department of Medical Ethics and Health Policy, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
Leonard Davis Institute of Health Economics, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH