Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.

MeSH terms: Adult; Data Analysis; Electronic Health Records; Female; Humans; Intensive Care Units / organization & administration; Machine Learning / standards; Male

Keywords: Artificial intelligence; Intensive care unit; Machine learning; Routinely collected data

Journal

Critical care (London, England)

ISSN: 1466-609X

Abbreviated title: Crit Care

Country: England

NLM ID: 9801902

Publication information

Publication date: 22 Aug 2019

History:

received: 12 Jun 2019

accepted: 09 Aug 2019

entrez: 24 Aug 2019

pubmed: 24 Aug 2019

medline: 25 Feb 2020

Status: epublish

Abstract

BACKGROUND

Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients' journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians.

METHODS

Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted.

RESULTS

Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108-4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000-10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]).

CONCLUSIONS

The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice.


Identifiers

DOI: 10.1186/s13054-019-2564-9

PMID: 31439010

PMC: PMC6704673

pii: 10.1186/s13054-019-2564-9

Publication types

Journal Article; Systematic Review

Languages

English

Pagination

284

Grants

Agency: National Institute for Health Research

ID: BRC at the University of Bristol and UH Bristol NHS FT

Authors

Duncan Shillan

Jonathan A C Sterne

Alan Champneys

Ben Gibbison
