Use of machine learning to analyse routinely collected intensive care unit data: a systematic review.
Artificial intelligence
Intensive care unit
Machine learning
Routinely collected data
Journal
Critical care (London, England)
ISSN: 1466-609X
Titre abrégé: Crit Care
Pays: England
ID NLM: 9801902
Informations de publication
Date de publication:
22 Aug 2019
22 Aug 2019
Historique:
received:
12
06
2019
accepted:
09
08
2019
entrez:
24
8
2019
pubmed:
24
8
2019
medline:
25
2
2020
Statut:
epublish
Résumé
Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients' journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108-4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000-10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice.
Sections du résumé
BACKGROUND
BACKGROUND
Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients' journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians.
METHODS
METHODS
Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted.
RESULTS
RESULTS
Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108-4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000-10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]).
CONCLUSIONS
CONCLUSIONS
The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice.
Identifiants
pubmed: 31439010
doi: 10.1186/s13054-019-2564-9
pii: 10.1186/s13054-019-2564-9
pmc: PMC6704673
doi:
Types de publication
Journal Article
Systematic Review
Langues
eng
Sous-ensembles de citation
IM
Pagination
284Subventions
Organisme : National Institute for Health Research
ID : BRC at University of Bristol and UH Bristol NHSFT
Organisme : National Institute for Health Research
ID : BRC at the University of Bristol and UH Bristol NHS FT
Organisme : National Institute for Health Research
ID : BRC at The University of Bristol and UH Bristol NHS FT
Références
Crit Care Med. 2001 Feb;29(2):427-35
pubmed: 11269246
JAMA. 2001 Apr 18;285(15):1992-5
pubmed: 11308436
Neural Netw. 2002 Jan;15(1):11-39
pubmed: 11958484
J Clin Epidemiol. 2007 Mar;60(3):241-9
pubmed: 17292017
Artif Intell Med. 2007 Jul;40(3):211-21
pubmed: 17580112
BMC Med Inform Decis Mak. 2007 Nov 22;7:35
pubmed: 18034872
PLoS Med. 2009 Jul 21;6(7):e1000097
pubmed: 19621072
Stud Health Technol Inform. 2009;150:590-4
pubmed: 19745380
BMC Med. 2010 Mar 24;8:18
pubmed: 20334633
BMC Med Inform Decis Mak. 2011 Oct 25;11:64
pubmed: 22027016
Sensors (Basel). 2013 Nov 15;13(11):15613-32
pubmed: 24248278
Health Care Manag Sci. 2015 Mar;18(1):58-66
pubmed: 24777832
BMC Med Res Methodol. 2014 Dec 22;14:137
pubmed: 25532820
Ann Intern Med. 2015 Jan 6;162(1):55-63
pubmed: 25560714
Artif Intell Med. 2015 Mar;63(3):191-207
pubmed: 25579436
Sci Data. 2016 May 24;3:160035
pubmed: 27219127
Proc IEEE Inst Electr Electron Eng. 2016 Feb;104(2):444-466
pubmed: 27765959
Comput Math Methods Med. 2016;2016:7087053
pubmed: 27818706
AMIA Annu Symp Proc. 2017 Feb 10;2016:954-963
pubmed: 28269892
J Am Med Inform Assoc. 2017 Nov 1;24(6):1052-1061
pubmed: 28379439
BMJ. 2017 May 18;357:j2439
pubmed: 28522583
Health Serv Manage Res. 2017 May;30(2):105-120
pubmed: 28539083
Intensive Care Med. 2018 Sep;44(9):1524-1527
pubmed: 29279970
BMJ Open. 2018 Jan 26;8(1):e017833
pubmed: 29374661
Crit Care Med. 2018 Jun;46(6):e481-e488
pubmed: 29419557
JAMA. 2018 Apr 3;319(13):1317-1318
pubmed: 29532063
Crit Care Med. 2018 Jul;46(7):1070-1077
pubmed: 29596073
Ann Intern Med. 2019 Jan 1;170(1):51-58
pubmed: 30596875
N Engl J Med. 2019 Apr 4;380(14):1347-1358
pubmed: 30943338
Comput Biomed Res. 1993 Jun;26(3):220-9
pubmed: 8325002
JAMA. 1996 Aug 28;276(8):637-9
pubmed: 8773637