| Literature DB >> 35588112 |
David E Huber1, Andrew L Cohen1, Adrian Staub1.
Abstract
We introduce the statistical concept of 'compensatory selection', which arises when selecting a subset of applicants based on multiple predictors, such as when standardized test scores are used in combination with other predictors required in a school application (e.g., previous grades, references letters, and personal statements). Post-hoc analyses often fail to find a positive correlation between test scores and subsequent success, and this failure is sometimes taken as evidence against the predictive validity of the standardized test. The present analysis reveals that the failure to find a negative correlation indicates that the standardized test is in fact a valid predictor of success. This is due to compensation between predictors during selection: Some students are admitted despite a low test score because their application is exceptional in other respects, while other students are admitted primarily based on a high test score despite weakness in the rest of their application. This compensatory selection process introduces a negative correlation between test scores and other predictors among those admitted (a 'collider bias' or 'Berkson's paradox' effect). If test scores are valid predictors of success, this negative correlation between the predictors counteracts the positive correlation between test scores and success that would have been observed if all applicants were admitted. If test scores are not predictive of success, but were nevertheless used in a compensatory selection process, there would be a spurious negative correlation between test scores and success (i.e., an admitted student with a weak application except for a high test score would be unlikely to succeed). The selection effect that is described here is fundamentally different from the well-known 'restricted range' problem and can powerfully alter results even in situations that accept most applicants.Entities:
Mesh:
Year: 2022 PMID: 35588112 PMCID: PMC9119552 DOI: 10.1371/journal.pone.0265459
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Fig 1Different ground truths. The displayed simulation results compare three different admissions policies (columns) applied to four different “ground truth” situations (rows). Each graph plots both the probability of acceptance and the probability of degree completion for accepted students at each standardized test percentile. For the middle column, the admissions policy added up all predictors, including test scores, accepting the top 10% of applicants based on the summed score. The right column does the same, but without test scores. The percent numbers in the lower right-hand corner of each graph show the probability of degree completion averaged across all accepted students. For each simulation, the correlation (r-value) between test percentile and degree completion probability is reported and double stars (**) indicate significance values less than .001 (all other r-values had p-values greater than .05).
Factor-loading matrix for ground truth #4.
|
|
|
|
| |
|---|---|---|---|---|
|
| 1 | 0 | 0 | 0 |
|
| .58 | .71 | .41 | 0 |
|
| 0 | .41 | .71 | .58 |
|
| 0 | 0 | 0 | 1 |
Factor-loading weights applied to latent characteristics (columns) created observed predictors (rows).
Fig 2Test score regression slope as function of acceptance percentage.
This graph shows the role of selective acceptance, ranging from 100% (accept everyone) to 5% (accept top 5%) in 5% steps for the same four ground truths shown in Fig 1. The graph shows the regression slope between Test scores and degree completion on the y-axis in units of percent completion change with each change of Test percentile. For instance, a value of .2 indicates that a change of 10% in terms of Test percentiles corresponds to a 2% increase in the probability of degree completion. The vertical line at 10% indicates the level of selectivity shown in Fig 1 and the results at this level are identical to the results in Fig 1.