Multiple retrieval case-based reasoning for incomplete datasets.

Case-based reasoning Incomplete data Medical decision support Missingness types Multiple imputation

Journal

Journal of biomedical informatics
ISSN: 1532-0480
Titre abrégé: J Biomed Inform
Pays: United States
ID NLM: 100970413

Informations de publication

Date de publication:
04 2019
Historique:
received: 07 12 2018
revised: 01 02 2019
accepted: 02 02 2019
pubmed: 17 2 2019
medline: 26 6 2020
entrez: 17 2 2019
Statut: ppublish

Résumé

The performance of case-based reasoning (CBR) depends on an accurate ranking of similar cases in the retrieval phase that affects all subsequent phases and profits from the potential of large databases. Unfortunately, growing databases come along with a rising amount of missing data that reduces the stability of the ranking since incomplete cases cannot be ranked as reliable as complete ones. In context of CBR hardly any work was done so far to rigorously analyze the impact of missing data and solutions to tackle this issue. In particular, a generalized solution which is able to process data under different missingness conditions for different variable types is missing. In this paper we present a multiple retrieval case-based reasoning (MRCBR) framework for incomplete databases that provides a statistically accurate ranking for similar cases. It unifies the advantages of multiple imputation and CBR while it preserves both the data distribution and database structure. Built as generalized CBR system, MRCBR was optimized and tested for medical decision support but can be extended to any CBR requirement as well. It is suitable for numerical and categorical variables and all sorts of missingness conditions. The approach was compared to eight competing methods applicable to handle incomplete databases in context of CBR. The comparison to the true ranking was based on two various error measures. In the evaluation we tested four representative scenarios that considered different conditions for missing data analysis. The outcome for every method in each scenario resulted in 200 miscellaneous setups. MRCBR outperforms all compared CBR methods in presence of missing data and shows reliable and stable results in every scenario. Especially with larger databases and rising number of incomplete variables it enlarges its lead to all other methods. Our study demonstrates that missing data must not be ignored when a correct CBR outcome is required.

Identifiants

pubmed: 30771484
pii: S1532-0464(19)30045-0
doi: 10.1016/j.jbi.2019.103127
pii:
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

103127

Informations de copyright

Copyright © 2019 Elsevier Inc. All rights reserved.

Auteurs

Nikolas Löw (N)

Experimental Radiation Oncology, Department of Radiation Oncology, University Medical Center Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany. Electronic address: nikolas.loew@medma.uni-heidelberg.de.

Jürgen Hesser (J)

Experimental Radiation Oncology, Department of Radiation Oncology, University Medical Center Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany. Electronic address: juergen.hesser@medma.uni-heidelberg.de.

Manuel Blessing (M)

Experimental Radiation Oncology, Department of Radiation Oncology, University Medical Center Mannheim, Heidelberg University, Theodor-Kutzer-Ufer 1-3, 68167 Mannheim, Germany. Electronic address: manuel.blessing@medma.uni-heidelberg.de.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH