| Literature DB >> 36026494 |
Stephen A Gallo1, Karen B Schmaling2.
Abstract
Peer review, commonly used in grant funding decisions, relies on scientists' ability to evaluate research proposals' quality. Such judgments are sometimes beyond reviewers' discriminatory power and could lead to a reliance on subjective biases, including preferences for lower risk, incremental projects. However, peer reviewers' risk tolerance has not been well studied. We conducted a cross-sectional experiment of peer reviewers' evaluations of mock primary reviewers' comments in which the level and sources of risks and weaknesses were manipulated. Here we show that proposal risks more strongly predicted reviewers' scores than proposal strengths based on mock proposal evaluations. Risk tolerance was not predictive of scores but reviewer scoring leniency was predictive of overall and criteria scores. The evaluation of risks dominates reviewers' evaluation of research proposals and is a source of inter-reviewer variability. These results suggest that reviewer scoring variability may be attributed to the interpretation of proposal risks, and could benefit from intervention to improve the reliability of reviews. Additionally, the valuation of risk drives proposal evaluations and may reduce the chances that risky, but highly impactful science, is supported.Entities:
Mesh:
Year: 2022 PMID: 36026494 PMCID: PMC9417194 DOI: 10.1371/journal.pone.0273813
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.752
Demographic summary of sample with no missing OIS data.
Total N is 605, PI risk group (199), Approach risk group (204), PI and Approach risk group (202).
| Factor | Total Proportion (N) | Proportion PI Risk (N) | Proportion Approach Risk (N) | Proportion PI and Approach Risk (N) | |
|---|---|---|---|---|---|
| Gender | Male | 65% (361) | 65% (129) | 65% (132) | 62% (125) |
| Female | 35% (197) | 35% (70) | 35% (72) | 38% (76) | |
| Non-Binary | <1% (1) | 0% (0) | 0% (0) | <1% (1) | |
| English as Your First Language | Yes | 71% (397) | 68% (136) | 70% (143) | 71% (144) |
| No | 29% (162) | 32% (63) | 30% (61) | 29% (58) | |
| Race/Ethnicity | Asian | 18% (102) | 20% (39) | 14% (29) | 21% (43) |
| Black/African American | 1% (7) | 2% (3) | 1% (3) | <1% (1) | |
| Latinx | 5% (27) | 4% (7) | 7% (14) | 4% (9) | |
| Native American/Indigenous | 1% (4) | 0% (0) | 1% (2) | <1% (1) | |
| White | 75% (421) | 80% (159) | 77% (157) | 74% (150) | |
| Degree Type | PhD | 83% (464) | 80% (159) | 87% (178) | 85% (171) |
| MD | 28% (158) | 31% (62) | 23% (47) | 29% (59) | |
| Neither | 1% (3) | <1% (1) | <1% (1) | <1% (1) | |
| Years Since Degree | Up to 20 years | 20% (111) | 21% (41) | 22% (44) | 15% (31) |
| 20 to 40 years | 66% (371) | 63% (125) | 64% (131) | 68% (138) | |
| 40 to 60 years | 14% (77) | 15% (30) | 11% (23) | 14% (29) | |
| Total Review Panels in last 3yrs | Up to 10 | 41% (228) | 39% (78) | 37% (76) | 43% (86) |
| 10 to 20 | 37% (207) | 40% (80) | 42% (85) | 32% (64) | |
| 20 to 30 | 14% (80) | 14% (27) | 14% (28) | 15% (31) | |
| 30 and over | 8% (44) | 7% (14) | 8% (16) | 10% (20) | |
| Review Panels–Funding Agency | NIH | 95% (532) | 95% (190) | 96% (196) | 93% (188) |
| NSF | 19% (104) | 20% (39) | 18% (36) | 19% (39) | |
| AHRQ | 4% (24) | 5% (10) | 4% (9) | 4% (8) | |
| DoD | 38% (214) | 39% (77) | 41% (84) | 34% (68) | |
| Other | 68% (380) | 70% (139) | 62% (126) | 70% (142) |
Overall score–multi-level ordinal regression models made with the reduced data set for direct comparison (n = 559).
| Model | Variance Across Participants | Changes in 2LL | Nagelkerke R2 |
|---|---|---|---|
| Baseline Across Participants | 7E-09 | 0 | --- |
| Risk (R) | 1.283 | 571.6 | 0.42 |
| R + Demographic Variable Block (DV) | 1.246 | 8.3 | 0.42 |
| R + DV + Research Similarity (RS) | 1.249 | 2.2 | 0.42 |
| R + DV + RS + Pre-disposition (PD) | 1.249 | 0 | 0.42 |
| R + DV + RS + PD + Risk Tolerance (NEO) | 1.247 | 0.3 | 0.42 |
| R + DV + RS + PD + NEO + Criteria Scores (CS) | 0.4992 | 896.1 | 0.77 |
| R + DV + RS + PD + NEO + CS + R:CS Interactions | 0.5619 | 67.9 | 0.78 |
* p< 0.05
** p<0.01; 95% CI in parentheses; each successive model is compared to previous via -2LL (a fixed intercept model was used as baseline); Nagelkerke R2 was calculated comparing to baseline model.
Cumulative link mixed model of overall score fitted with the Laplace approximation from the total data set (605 participants).
| Term | Odds Ratio | 95% CI | p-value |
|---|---|---|---|
| Risk | |||
| PI risk | 14.98 | 3.40, 66.00 | 0.0003 |
| Approach risk | 21.69 | 5.50, 85.43 | <0.0001 |
| PI-Approach risk | 36.17 | 8.47, 154.43 | <0.0001 |
| Demographic Block | |||
| Gender (Male) | 1.03 | 0.76, 1.41 | 0.8411 |
| Gender (Non-Binary) | 1.05 | 0.05, 22.39 | 0.9743 |
| Race Ethnicity (Non-White) | 0.94 | 0.64, 1.40 | 0.7736 |
| English as a First Language (Yes) | 1.27 | 0.89, 1.81 | 0.1960 |
| PhD (Yes) | 0.96 | 0.56, 1.63 | 0.8711 |
| MD (Yes) | 0.91 | 0.59, 1.41 | 0.6627 |
| Year Since Last Degree | 1.00 | 0.98, 1.01 | 0.6303 |
| Total Review Panels in the last 3 years | 1.02 | 1.01, 1.03 | 0.0033** |
| Research Similarity | 1.01 | 0.92, 1.09 | 0.9016 |
| Evaluative Predisposition | 0.92 | 0.82, 1.03 | 0.1536 |
| NEO Openness Scale | 1.02 | 0.87, 1.18 | 0.8370 |
| Review Criteria | |||
| Significance Score | 3.53 | 2.60, 4.79 | <0.0001 |
| Innovation Score | 2.24 | 1.69, 2.97 | <0.0001 |
| Investigator Score | 0.91 | 0.63, 1.31 | 0.6213 |
| Approach Score | 3.31 | 2.46, 4.45 | <0.0001 |
| Environment Score | 1.15 | 0.80, 1.66 | 0.4417 |
| Interactions | |||
| Risk PI: Significance Score | 0.84 | 0.49, 1.45 | 0.5332 |
| Risk Approach: Significance Score | 0.90 | 0.58, 1.39 | 0.6194 |
| Risk PI-Approach: Significance Score | 0.52 | 0.33, 0.83 | 0.0057 |
| Risk PI: Innovation Score | 0.55 | 0.34, 0.89 | 0.0158 |
| Risk Approach: Innovation Score | 0.38 | 0.24, 0.61 | 0.0001 |
| Risk PI-Approach: Innovation Score | 0.52 | 0.34, 0.80 | 0.0026 |
| Risk PI: Investigator Score | 2.23 | 1.38, 3.6 | 0.0011 |
| Risk Approach: Investigator Score | 1.53 | 0.84, 2.79 | 0.1664 |
| Risk PI-Approach: Investigator Score | 1.63 | 1.02, 2.61 | 0.0409 |
| Risk PI: Approach Score | 0.57 | 0.34, 0.94 | 0.0283 |
| Risk Approach: Approach Score | 1.31 | 0.91, 1.90 | 0.1516 |
| Risk PI-Approach: Approach Score | 1.36 | 0.94, 1.98 | 0.1044 |
| Risk PI: Environment Score | 1.03 | 0.64, 1.66 | 0.8990 |
| Risk Approach: Environment Score | 0.75 | 0.44, 1.27 | 0.2833 |
| Risk PI-Approach: Environment Score | 0.70 | 0.45, 1.09 | 0.1113 |
| Threshold Coefficients | |||
| 1|2 | 4.89 | 3.71, 6.07 | <0.0001 |
| 2|3 | 9.53 | 8.1, 10.95 | <0.0001 |
| 3|4 | 12.25 | 10.67, 13.83 | <0.0001 |
| 4|5 | 14.58 | 12.85, 16.31 | <0.0001 |
| 5|6 | 17.75 | 15.78, 19.72 | <0.0001 |
| 6|7 | 19.90 | 17.71, 22.09 | <0.0001 |
| 7|8 | 23.31 | 20.28, 26.34 | <0.0001 |
* p< 0.05
** p<0.01
*** p<0.001