Replicating prediction algorithms for hospitalization and corticosteroid use in patients with inflammatory bowel disease.
Journal
PloS one
ISSN: 1932-6203
Abbreviated title: PLoS One
Country: United States
NLM ID: 101285081
Publication information
Publication date: 2021
History:
received: 2021-06-23
accepted: 2021-09-04
entrez: 2021-09-20
pubmed: 2021-09-21
medline: 2021-11-24
Status: epublish
Abstract
Previous work showed that machine learning models can predict inflammatory bowel disease (IBD)-related hospitalizations and outpatient corticosteroid use from patient demographic and laboratory data in a cohort of United States Veterans. This study aimed to replicate this modeling framework in a nationally representative cohort. A retrospective cohort design using Optum Electronic Health Records (EHR) was used to identify IBD patients with at least 12 months of follow-up between 2007 and 2018. IBD flare was defined as an inpatient/emergency visit with a diagnosis of IBD or an outpatient corticosteroid prescription for IBD. Predictors included demographic and laboratory data. Logistic regression and random forest (RF) models were used to predict IBD flare within 6 months of each visit. A 70% training and 30% validation approach was used. A total of 95,878 patients across 780,559 visits were identified. Of these, 22,245 (23.2%) patients had at least one IBD flare. Patients were predominantly White (87.7%) and female (57.1%), with a mean age of 48.0 years. The logistic regression model had an area under the receiver operating characteristic curve (AuROC) of 0.66 (95% CI: 0.65-0.66), sensitivity of 0.69 (95% CI: 0.68-0.70), and specificity of 0.74 (95% CI: 0.73-0.74) in the validation cohort. The RF model had an AuROC of 0.80 (95% CI: 0.80-0.81), sensitivity of 0.74 (95% CI: 0.73-0.74), and specificity of 0.72 (95% CI: 0.72-0.72) in the validation cohort. Important predictors of IBD flare in the RF model were the number of previous flares, age, potassium, and white blood cell count. The machine learning modeling framework was replicated, and the results showed similar predictive accuracy in a nationally representative cohort of IBD patients. This modeling framework could be embedded in routine practice as a tool to identify patients at high risk of disease activity.
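The abstract reports sensitivity, specificity, and AuROC for each model. As a minimal sketch of how those three metrics are conventionally computed from per-visit labels and predicted risk scores, the following uses only the Python standard library. The function names, the 0.5 threshold, and the toy values are all hypothetical illustrations; they are not the authors' code or data, and the study's own pipeline may differ.

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int], y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, where 1 = flare.

    Assumes both classes are present, otherwise a division by zero occurs.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # flares correctly flagged
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # flares missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # non-flares correctly cleared
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # non-flares wrongly flagged
    return tp / (tp + fn), tn / (tn + fp)


def auroc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AuROC as the probability that a randomly chosen flare visit receives a
    higher score than a randomly chosen non-flare visit (ties count one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy, made-up values for illustration only (not data from the study).
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.1, 0.8]
threshold = 0.5  # hypothetical cut-off for turning scores into flare / no-flare
y_pred = [1 if s >= threshold else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
area = auroc(y_true, scores)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AuROC={area:.4f}")
```

Sensitivity and specificity depend on the chosen threshold, whereas AuROC summarizes ranking quality across all thresholds, which is one reason both kinds of metric are typically reported together.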
Identifiers
pubmed: 34543353
doi: 10.1371/journal.pone.0257520
pii: PONE-D-21-20616
pmc: PMC8452029
Chemical substances
Adrenal Cortex Hormones
0
Publication types
Journal Article
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
e0257520
Conflict of interest statement
Ryan Gan and Diana Sun are full-time employees of Genentech, Inc., a member of the Roche group, and own shares of Roche stock. Amanda Tatro is a full-time employee of F. Hoffmann La Roche AG and owns shares of Roche stock. This does not alter our adherence to PLOS ONE policies on sharing data and materials. No other authors have competing interests.