| Literature DB >> 35292671 |
Benjamin P Raysmith1,2,3, Toomas Timpka4, Jenny Jacobsson4,5, Michael K Drew6,7,8, Örjan Dahlström4,9.
Abstract
In applied program settings, such as in natural environment control and education, performance evaluation is usually conducted by evaluators considering both self-comparison and comparison with peers. We have developed the performance outcome scoring template (POS-T) for assessments with high face-validity in these settings. POS-T puts achievements of individuals or groups in context, i.e. the resulting performance outcome score (POS) reflects a meaningful measure of performance magnitude with regards to internal and external comparisons. Development of a POS is performed in four steps supported by a statistical framework. Software is supplied for creation of scoring applications in different performance evaluation settings. We demonstrate the POS-T by evaluation of CO2 emissions reduction amongst 36 OECD member countries.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35292671 PMCID: PMC8924175 DOI: 10.1038/s41598-022-08368-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Figure 1POS-T signifying four cardinal and one optional (weighting) data handling levels.
Component parameters and how they are quantified for an entity ‘i’.
| Component | Component output | Metric quantification description |
|---|---|---|
| Achievement | Achievement output ( | Outcome of interest crude measure |
| Achievement reference ( | Achievement output from previous time-point | |
| OR | ||
| Mean achievement output over a time-period | ||
| OR | ||
| A population standard | ||
| OR | ||
| Other comparator of interest measured by the same metric and continuous scale | ||
| Achievement deviation ( | Deviation of: | |
| Individual achievement outputs over a time-period | ||
| OR | ||
| Population achievement outputs from peers in cohort of interest | ||
| OR | ||
| Population deviation for metric of interest | ||
| Lower and upper reference limiters | Practical lower and upper reference limits. To establish a cohort range for consistent future comparative evaluations | |
| Rank | Final rank ( | Entity rank in order of crude measures at evaluation time-point |
| Initial rank ( | Entity rank at time-point of comparison prior to evaluation event or period | |
| Transformed rank-scores | Ranks transformed to a continuous value. Reflecting non-equidistance between entity ranks generated from crude measure | |
| Rank score output | Magnitude difference of final rank position relative to peers. Aggregation of transformed rank-scores from entities with a final rank behind entity | |
| Rank score reference | Median of simulated measures | |
| Rank score deviation | Deviations of simulated rank score outputs. Absolute value of the difference between the median and either of |
Figure 2(A) Entity rankings, (B) Simulated rank scores (example data). (A) Initial rank-order (ranked by average annual CO2 emissions per capita 1996–2005) (ρi) versus average CO2 emission per capita 1996–2005 (tonnes) and transformed rank-score f(ρ). (B) Simulation (example with 100 iterations shown for illustrative purposes) of potential rank score outputs (Φ) for each entity based on underlying distribution of achievement output. Box plot showing 16th and 84th percentiles.
Contextual features framing the evaluation.
| Context | Description |
|---|---|
| Entities to evaluate | Four countries (Sweden, Mexico, Norway, Luxembourg) |
| Comparison cohort | 36 OECD countries |
| Comparator | CO2 emissions per capita over the period 2006–2015 |
| Quantifiable domains (components) | Internal comparison—achievement |
| External comparison—rank |
Components, parameters, and metrics used to populate the component outputs.
| Component | Parameter | Metric |
|---|---|---|
| Achievement | Output measure | Tonnes of CO2 equivalent 2015 |
| Standardising parameter | per capita | |
| Reference | 10-year annual output average 1996–2005 | |
| Distribution | Individual entity annual variations 1996–2005 | |
| Rank | Output measure | OECD ranking table 2015 |
| Reference | Rank of median achievement references |
Component outputs for the four stakeholders derived from Step 2 of the POS-T. (Units of achievement = tonnes of CO2 equivalent per capita. Rank score output = proportional score gained from ranking ahead of other countries. Rank score reference = median score from simulation based on achievement descriptive statistics. Rank score deviation = simulation outputs based on 16th and 84th percentiles. LUX. = Luxembourg).
| Component | Component output | Sweden | Mexico | Norway | Lux |
|---|---|---|---|---|---|
| Achievement | Achievement output ( | 5.47 | 5.74 | 10.47 | 18.17 |
| Achievement reference ( | 7.90 | 5.48 | 12.23 | 24.06 | |
| Achievement deviation ( | 0.34 | 0.10 | 0.14 | 2.78 | |
| Lower and upper reference limiters | Lower reference limiter 4 Upper reference limiter 26 | ||||
| Rank | Final rank ( | 1 | 3 | 24 | 33 |
| Initial rank ( | 9 | 4 | 21 | 34 | |
| Transformed rank-score out of 100 | 82.44 | 93.36 | 62.97 | 9.72 | |
| Rank score output | 26.44 | 21.32 | 7.14 | 2.23 | |
| Rank score reference | 17.65 | 20.14 | 8.61 | 0.86 | |
| Rank score deviation | 1.01 (16.63–18.66) | 1.28 (18.86–21.42) | 0.05 (8.56–8.66) | 1.38 (− 0.53 to 2.27) | |
Component indicators (normalised component outputs) derived from Step 3 of the POS-T.
| Indicator | Sweden | Mexico | Norway | Luxembourg |
|---|---|---|---|---|
| z-achievement | 6.85 | − 2.56 | 12.92 | 2.12 |
| z-rank | 8.69 | 0.92 | − 27.73 | 1.00 |
OECD countries ordered highest to lowest by the POS including component crude output variation and component magnitude of change over the observation period. (Crude CO2 emissions = tonnes of CO2 equivalent per capita).
Significant values are in bold and italics.
Figure 3Composite score development model with five development steps and three example phenomena.