|
Recall (sensitivity)
| 22 | Proportion of correctly identified positives amongst all real positives |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\mathrm{TP}}{\mathrm{TP}+\mathrm{F}\mathrm{N}} $$\end{document}TPTP+FN
|
|
Precision
| 18 | Proportion of correctly identified positives amongst all positives. |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{TP}{TP+FP} $$\end{document}TPTP+FP
|
|
F measure
| 10 | Combines precision and recall. Values of β < 1.0 indicate precision is more important than recall, whilst values of β > 1.0 indicate recall is more important than precision |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ {F}_{\beta, k}\kern0.5em =\kern0.5em \frac{\left({\beta}^2+1\right){\mathrm{TP}}_k}{\left({\beta}^2+1\right){\mathrm{TP}}_k+{\mathrm{FP}}_k+{\beta}^2{\mathrm{FN}}_k} $$\end{document}Fβ,k=β2+1TPkβ2+1TPk+FPk+β2FNk Where β is a value that specifies the relative importance of recall and precision. |
|
ROC (AUC)
| 10 | Area under the curve traced out by graphing the true positive rate against the false positive rate. 1.0 is a perfect score and 0.50 is equivalent to a random ordering |
|
Accuracy
| 8 | Proportion of agreements to total number of documents. |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\mathrm{TP}+\mathrm{T}\mathrm{N}}{\mathrm{TP}+\mathrm{F}\mathrm{P}+\mathrm{F}\mathrm{N}+\mathrm{T}\mathrm{N}} $$\end{document}TP+TNTP+FP+FN+TN
|
|
Work saved over sampling
| 8 | The percentage of papers that the reviewers do not have to read because they have been screened out by the classifier |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \mathrm{W}\mathrm{S}\mathrm{S}\ \mathrm{at}\ 95\%\ \mathrm{recall} = \kern0.5em \frac{\mathrm{TN}+\mathrm{F}\mathrm{N}}{N-0.05} $$\end{document}WSSat95%recall=TN+FNN−0.05
|
|
Time
| 7 | Time taken to screen (usually in minutes) | |
|
Burden
| 4 | The fraction of the total number of items that a human must screen (active learning) |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ Burden=\frac{t{p}^T+t{n}^T+f{p}^T+t{p}^U+f{p}^U}{N} $$\end{document}Burden=tpT+tnT+fpT+tpU+fpUN
|
|
Yield
| 3 | The fraction of items that are identified by a given screening approach (active learning) |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \mathrm{Yield}\kern0.5em =\kern0.5em \frac{{\mathrm{tp}}^T+{\mathrm{tp}}^U}{{\mathrm{tp}}^T+{\mathrm{tp}}^U+{\mathrm{fn}}^U} $$\end{document}Yield=tpT+tpUtpT+tpU+fnU
|
|
Utility
| 5 | Relative measure of burden and yield that takes into account reviewer preferences for weighting these two concepts (active learning) |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\beta \cdot \mathrm{yield}+\left(1\kern0.5em -\kern0.5em \mathrm{burden}\right)}{\beta +1} $$\end{document}β⋅yield+1−burdenβ+1 Where β is the user-defined weight |
|
Baseline inclusion rate
| 2 | The proportion of includes in a random sample of items before prioritisation or classification takes place. The number to be screened is determined using a power calculation |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{n_i}{n_t} $$\end{document}nint Where n
i = number of items included in the random sample; n
t = total number of items in the random sample |
|
Performance (efficiency)
a
| 2 | Number of relevant items selected divided by the time spent screening, where relevant items were those marked as included by two or more people |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\mathrm{Selected},\kern0.5em \mathrm{relevant}\kern0.5em \mathrm{items}}{\mathrm{Time}} $$\end{document}Selected,relevantitemsTime
|
|
Specificity
| 2 | The proportion of correctly identified negatives (excludes) out of the total number of negatives |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\mathrm{TN}}{\mathrm{TN}+\mathrm{F}\mathrm{P}} $$\end{document}TNTN+FP
|
|
True positives
| 2 | The number of correctly identified positives (includes) | TP |
|
False negatives
| 1 | The number of incorrectly identified negatives (excludes) | FN |
|
Coverage
| 1 | The ratio of positives in the data pool that are annotated during active learning |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{{\mathrm{TP}}^L}{{\mathrm{TP}}^L+{\mathrm{FN}}^L+{\mathrm{TP}}^U+{\mathrm{FN}}^U} $$\end{document}TPLTPL+FNL+TPU+FNU Where L refers to labelled items and U refers to unlabelled items |
|
Unit cost
| 1 | Expected time to label an item multiplied by the unit cost of the labeler (salary per unit of time), as calculated from their (known or estimated) salary | timeexpected × costunit
|
|
Classification error
| 1 | Proportion of disagreements to total number of documents | 100 % − accuracy % |
|
Error
| 1 | Total number of falsely classified items divided by the total number of items |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{\sum \left(\mathrm{F}\mathrm{P}+\mathrm{F}\mathrm{N}\right)}{\sum \left(\mathrm{T}\mathrm{P}+\mathrm{F}\mathrm{P}+\mathrm{F}\mathrm{N}+\mathrm{T}\mathrm{N}\right)} $$\end{document}∑FP+FN∑TP+FP+FN+TN
|
|
Absolute screening reduction
| 1 | Number of items excluded by the classifier that do not need to be manually screened | TN + FN |
|
Prioritised inclusion rate
| 1 | The proportion of includes out of the total number screened, after prioritisation or classification takes place |
\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$ \frac{n_{\mathrm{ip}}}{n_{\mathrm{tp}}} $$\end{document}nipntp Where nip = number of items included in prioritised sample; ntp = total number of items in the prioritised sample |