Consensus modeling: Safer transfer learning for small health systems.

Hospital acquired infection Machine learning Predictive modeling Transfer learning

Journal

Artificial intelligence in medicine
ISSN: 1873-2860
Titre abrégé: Artif Intell Med
Pays: Netherlands
ID NLM: 8915031

Informations de publication

Date de publication:
24 May 2024
Historique:
received: 15 04 2021
revised: 25 03 2022
accepted: 21 05 2024
medline: 7 6 2024
pubmed: 7 6 2024
entrez: 6 6 2024
Statut: aheadofprint

Résumé

Predictive modeling is becoming an essential tool for clinical decision support, but health systems with smaller sample sizes may construct suboptimal or overly specific models. Models become over-specific when beside true physiological effects, they also incorporate potentially volatile site-specific artifacts. These artifacts can change suddenly and can render the model unsafe. To obtain safer models, health systems with inadequate sample sizes may adopt one of the following options. First, they can use a generic model, such as one purchased from a vendor, but often such a model is not sufficiently specific to the patient population and is thus suboptimal. Second, they can participate in a research network. Paradoxically though, sites with smaller datasets contribute correspondingly less to the joint model, again rendering the final model suboptimal. Lastly, they can use transfer learning, starting from a model trained on a large data set and updating this model to the local population. This strategy can also result in a model that is over-specific. In this paper we present the consensus modeling paradigm, which uses the help of a large site (source) to reach a consensus model at the small site (target). We evaluate the approach on predicting postoperative complications at two health systems with 9,044 and 38,045 patients (rare outcomes at about 1% positive rate), and conduct a simulation study to understand the performance of consensus modeling relative to the other three approaches as a function of the available training sample size at the target site. We found that consensus modeling exhibited the least over-specificity at either the source or target site and achieved the highest combined predictive performance.

Identifiants

pubmed: 38843692
pii: S0933-3657(24)00141-6
doi: 10.1016/j.artmed.2024.102899
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

102899

Informations de copyright

Copyright © 2024. Published by Elsevier B.V.

Déclaration de conflit d'intérêts

Declaration of competing interest None of the authors have conflict of interest to report.

Auteurs

Roshan Tourani (R)

Institute for Health Informatics, University of Minnesota, Twin Cities, MN, United States of America. Electronic address: roshan@umn.edu.

Dennis H Murphree (DH)

Department of Health Sciences Research, Mayo Clinic, MN, United States of America. Electronic address: Murphree.Dennis@mayo.edu.

Adam Sheka (A)

Department of Surgery, University of Minnesota, Twin Cities, MN, United States of America. Electronic address: sheka015@umn.edu.

Genevieve B Melton (GB)

Institute for Health Informatics, University of Minnesota, Twin Cities, MN, United States of America; Department of Surgery, University of Minnesota, Twin Cities, MN, United States of America. Electronic address: gmelton@umn.edu.

Daryl J Kor (DJ)

Department of Anesthesia, Mayo Clinic, MN, United States of America. Electronic address: Kor.Daryl@mayo.edu.

Gyorgy J Simon (GJ)

Institute for Health Informatics, University of Minnesota, Twin Cities, MN, United States of America; Department of Medicine, University of Minnesota, MN, United States of America. Electronic address: simo0342@umn.edu.

Classifications MeSH