Literature DB >> 33684845

Measuring calibration of likelihood-ratio systems: A comparison of four metrics, including a new metric devPAV.

Peter Vergeer1, Yara van Schaik2, Marjan Sjerps3.   

Abstract

Numerical likelihood-ratio (LR) systems aim to calculate evidential strength for forensic evidence evaluation. Calibration of such LR-systems is essential: one does not want to over- or understate the strength of the evidence. Metrics that measure calibration differ in sensitivity to errors in calibration of such systems. In this paper we compare four calibration metrics by a simulation study based on Gaussian Log LR-distributions. Three calibration metrics are taken from the literature (Good, 1985; Royall, 1997; Ramos and Gonzalez-Rodriguez, 2013) [1-3], and a fourth metric is proposed by us. We evaluated these metrics by two performance criteria: differentiation (between well- and ill-calibrated LR-systems) and stability (of the value of the metric for a variety of well-calibrated LR-systems). Two metrics from the literature (the expected values of LR and of 1/LR, and the rate of misleading evidence stronger than 2) do not behave as desired in many simulated conditions. The third one (Cllrcal) performs better, but our newly proposed method (which we coin devPAV) is shown to behave equally well to clearly better under almost all simulated conditions. On the basis of this work, we recommend to use both devPAV and Cllrcal to measure calibration of LR-systems, where the current results indicate that devPAV is the preferred metric. In the future external validity of this comparison study can be extended by simulating non-Gaussian LR-distributions.
Copyright © 2021 Elsevier B.V. All rights reserved.

Keywords:  Cllr; Empirical cross-entropy; Misleading evidence; PAV; Reliability; Validation

Year:  2021        PMID: 33684845     DOI: 10.1016/j.forsciint.2021.110722

Source DB:  PubMed          Journal:  Forensic Sci Int        ISSN: 0379-0738            Impact factor:   2.395


  2 in total

1.  Utilization of Machine Learning for the Differentiation of Positional NPS Isomers with Direct Analysis in Real Time Mass Spectrometry.

Authors:  Jennifer L Bonetti; Saer Samanipour; Arian C van Asten
Journal:  Anal Chem       Date:  2022-03-17       Impact factor: 6.986

2.  In the context of forensic casework, are there meaningful metrics of the degree of calibration?

Authors:  Geoffrey Stewart Morrison
Journal:  Forensic Sci Int Synerg       Date:  2021-06-12
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.