Measuring calibration of likelihood-ratio systems: A comparison of four metrics, including a new metric devPAV.
Cllr
Empirical cross-entropy
Misleading evidence
PAV
Reliability
Validation
Journal
Forensic science international
ISSN: 1872-6283
Titre abrégé: Forensic Sci Int
Pays: Ireland
ID NLM: 7902034
Informations de publication
Date de publication:
Apr 2021
Apr 2021
Historique:
received:
05
11
2020
revised:
05
02
2021
accepted:
09
02
2021
pubmed:
9
3
2021
medline:
9
3
2021
entrez:
8
3
2021
Statut:
ppublish
Résumé
Numerical likelihood-ratio (LR) systems aim to calculate evidential strength for forensic evidence evaluation. Calibration of such LR-systems is essential: one does not want to over- or understate the strength of the evidence. Metrics that measure calibration differ in sensitivity to errors in calibration of such systems. In this paper we compare four calibration metrics by a simulation study based on Gaussian Log LR-distributions. Three calibration metrics are taken from the literature (Good, 1985; Royall, 1997; Ramos and Gonzalez-Rodriguez, 2013) [1-3], and a fourth metric is proposed by us. We evaluated these metrics by two performance criteria: differentiation (between well- and ill-calibrated LR-systems) and stability (of the value of the metric for a variety of well-calibrated LR-systems). Two metrics from the literature (the expected values of LR and of 1/LR, and the rate of misleading evidence stronger than 2) do not behave as desired in many simulated conditions. The third one (C
Identifiants
pubmed: 33684845
pii: S0379-0738(21)00042-6
doi: 10.1016/j.forsciint.2021.110722
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
110722Informations de copyright
Copyright © 2021 Elsevier B.V. All rights reserved.