Consensus modeling: Safer transfer learning for small health systems.
Hospital acquired infection
Machine learning
Predictive modeling
Transfer learning
Journal
Artificial intelligence in medicine
ISSN: 1873-2860
Titre abrégé: Artif Intell Med
Pays: Netherlands
ID NLM: 8915031
Informations de publication
Date de publication:
24 May 2024
24 May 2024
Historique:
received:
15
04
2021
revised:
25
03
2022
accepted:
21
05
2024
medline:
7
6
2024
pubmed:
7
6
2024
entrez:
6
6
2024
Statut:
aheadofprint
Résumé
Predictive modeling is becoming an essential tool for clinical decision support, but health systems with smaller sample sizes may construct suboptimal or overly specific models. Models become over-specific when beside true physiological effects, they also incorporate potentially volatile site-specific artifacts. These artifacts can change suddenly and can render the model unsafe. To obtain safer models, health systems with inadequate sample sizes may adopt one of the following options. First, they can use a generic model, such as one purchased from a vendor, but often such a model is not sufficiently specific to the patient population and is thus suboptimal. Second, they can participate in a research network. Paradoxically though, sites with smaller datasets contribute correspondingly less to the joint model, again rendering the final model suboptimal. Lastly, they can use transfer learning, starting from a model trained on a large data set and updating this model to the local population. This strategy can also result in a model that is over-specific. In this paper we present the consensus modeling paradigm, which uses the help of a large site (source) to reach a consensus model at the small site (target). We evaluate the approach on predicting postoperative complications at two health systems with 9,044 and 38,045 patients (rare outcomes at about 1% positive rate), and conduct a simulation study to understand the performance of consensus modeling relative to the other three approaches as a function of the available training sample size at the target site. We found that consensus modeling exhibited the least over-specificity at either the source or target site and achieved the highest combined predictive performance.
Identifiants
pubmed: 38843692
pii: S0933-3657(24)00141-6
doi: 10.1016/j.artmed.2024.102899
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
102899Informations de copyright
Copyright © 2024. Published by Elsevier B.V.
Déclaration de conflit d'intérêts
Declaration of competing interest None of the authors have conflict of interest to report.