| Literature DB >> 35694383 |
Daniel Garcia-Costa1, Flaminio Squazzoni2, Bahar Mehmani3, Francisco Grimaldo1.
Abstract
Reviewers do not only help editors to screen manuscripts for publication in academic journals; they also serve to increase the rigor and value of manuscripts by constructive feedback. However, measuring this developmental function of peer review is difficult as it requires fine-grained data on reports and journals without any optimal benchmark. To fill this gap, we adapted a recently proposed quality assessment tool and tested it on a sample of 1.3 million reports submitted to 740 Elsevier journals in 2018-2020. Results showed that the developmental standards of peer review are shared across areas of research, yet with remarkable differences. Reports submitted to social science and economics journals show the highest developmental standards. Reports from junior reviewers, women and reviewers from Western Europe are generally more developmental than those from senior, men and reviewers working in academic institutions outside Western regions. Our findings suggest that increasing the standards of peer review at journals requires effort to assess interventions and measure practices with context-specific and multi-dimensional frameworks. ©2022 Garcia-Costa et al.Entities:
Keywords: Academic journals; Natural language processing; Peer review; Reviewers; Standards
Year: 2022 PMID: 35694383 PMCID: PMC9186327 DOI: 10.7717/peerj.13539
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 3.061
Number of journals per quartile of impact factor and area of research.
| PS | SSE | HMS | LS | Total | |
|---|---|---|---|---|---|
| Journals | 333 | 99 | 174 | 134 | 740 |
| Journals Q1 | 161 | 45 | 40 | 38 | 283 |
| Journals Q2 | 110 | 20 | 40 | 49 | 219 |
| Journals Q3 | 29 | 17 | 32 | 27 | 105 |
| Journals Q4 | 8 | 3 | 7 | 3 | 21 |
| Journals NI | 25 | 14 | 55 | 18 | 112 |
Number of reviews per journal quartile and area of research.
| PS | SSE | HMS | LS | Total | |
|---|---|---|---|---|---|
| Reviews | 825.247 | 171.070 | 150.296 | 185.328 | 1.331.941 |
| Reviews Q1 | 602.763 | 146.088 | 51.860 | 88.089 | 888.800 |
| Reviews Q2 | 165.506 | 18.422 | 40.733 | 61.104 | 285.765 |
| Reviews Q3 | 29.743 | 5.147 | 46.596 | 26.375 | 107.861 |
| Reviews Q4 | 2.104 | 468 | 3.236 | 978 | 6.786 |
| Reviews NI | 25.131 | 945 | 7.871 | 8.782 | 42.729 |
Number of reviews per reviewers’ geographical location and area of research.
(Note: Countries are classified according to ISO 3166 country codes, while their aggregation complies with the United Nation M49 standard).
| PS | SSE | HMS | LS | Total | |
|---|---|---|---|---|---|
| Northern America | 120392 | 64254 | 52027 | 52763 | 289436 (21.73%) |
| Western Europe | 64603 | 16539 | 14798 | 17923 | 113863 ( 8.55%) |
| Eastern Asia | 290125 | 32140 | 20583 | 37496 | 380344 (28.56%) |
| Southern Asia | 57994 | 3880 | 6450 | 7124 | 75448 ( 5.66%) |
| Northern Europe | 46505 | 16048 | 13235 | 12387 | 88175 ( 6.62%) |
| Eastern Europe | 37165 | 1722 | 3622 | 5935 | 48444 ( 3.64%) |
| Latin America and the Caribbean | 34886 | 2791 | 6329 | 11713 | 55719 ( 4.18%) |
| Southern Europe | 85495 | 11726 | 15388 | 21733 | 134342 (10.09%) |
| South-East Asia | 19079 | 2158 | 1995 | 3378 | 26610 ( 2.00%) |
| Western Asia (Middle East) | 27071 | 5211 | 5653 | 5121 | 43056 ( 3.23%) |
| Australia and New Zealand | 24925 | 12463 | 5716 | 5756 | 48860 ( 3.67%) |
| Northern Africa | 8006 | 317 | 2300 | 1383 | 12006 ( 0.90%) |
| Central Asia | 306 | 16 | 13 | 35 | 370 ( 0.03%) |
| Sub-Saharan Africa | 5254 | 750 | 955 | 1201 | 8160 ( 0.61%) |
| Micronesia | 134 | 24 | 74 | 52 | 284 ( 0.02%) |
| Melanesia | 70 | 10 | 7 | 17 | 104 ( 0.01%) |
| Polynesia | 23 | 3 | 2 | 5 | 33 ( 0.00%) |
| Missing | 3214 | 1018 | 1149 | 1306 | 6687 ( 0.50%) |
Number of reviews per gender, seniority and area of research.
| PS | SSE | HMS | LS | Total (%) | |
|---|---|---|---|---|---|
| Women | 148807 | 44927 | 41754 | 59562 | 295050 (22.15%) |
| Men | 645547 | 120529 | 106096 | 121488 | 993660 (74.60%) |
| Missing gender | 30893 | 5614 | 2446 | 4278 | 43231 (3.25%) |
| <5 years | 21365 | 9463 | 3867 | 4070 | 38765 (2.91%) |
| 5 to 18 years | 435892 | 101008 | 67912 | 85074 | 689886 (51.80%) |
| >18 years | 335270 | 51557 | 69659 | 86483 | 542969 (40.77%) |
| Missing seniority | 32720 | 9042 | 8858 | 9701 | 60321 (4.53%) |
Figure 1Steps of the dictionary building process.
Initial seed terms for each developmental dimension.
| Developmental dimensions | Initial seed terms |
|---|---|
| Impact | relevant, impact, novel, original, innovator paper, |
| interest paper, disappointing paper, important topic, | |
| relevant paper, research community | |
| Relevant Literature | cite, consider reference, require reference, reference paper, |
| related work, literature, bibliography, similar work, | |
| previous work, existing work | |
| Study Methods | methodology, approach, experiment, techniques, analysis, |
| procedures, provide justification, provide comparison, | |
| exploratory, meticulous | |
| Statistical Methods | statistics, null hypothesis, regression, coefficient, |
| significance, correlation, deviation, Bayesian, | |
| response variable, effect size | |
| Study Conclusions | result, discussion, conclusion, findings, research question, |
| unjustified, evidence, inconsistency, unsolved problem, | |
| explanation | |
| Limitations | limitations, weakness, robustness, future work, |
| lack acknowledg, acknowledg limit, expertise, | |
| under-investigated, flaws, bottleneck | |
| Applicability | work applicability, application domain, reproducible, |
| generalizable results, generalizable study, scalable, | |
| transferable, irreproducible, reusable, universal method | |
| Presentation | table, figure, row, column, image, axis, caption, |
| legend, graph, footer | |
| Data Availability | database, data available, accessible data, experiment data, |
| publish data, repository, source code, opaque, secrecy, | |
| available resources | |
| Organization and Writing | rewrite, well written, poor written, reorganize, move, |
| spelling, page, line, sentence, paragraph |
Number of terms for each dimension of the developmental score.
| Item | Num of terms | Item | Num of terms |
|---|---|---|---|
| Impact | 175 | Literature | 235 |
| Methods | 240 | Statistics | 122 |
| Conclusions | 283 | Limitations | 71 |
| Applicability | 139 | Presentation | 72 |
| Data | 128 | Writing | 168 |
Figure 2Distribution of the developmental score per research area.
The density curves in this violin plot show the distribution of the score for all research areas and, separately, for PS, SEE, HMS or LS.
Explained variance by each principal component.
| % of variance | Cumulative % of variance | |
|---|---|---|
| PC1 | 39.38 | 39.38 |
| PC2 | 9.88 | 49.26 |
| PC3 | 8.76 | 58.02 |
| PC4 | 7.18 | 65.20 |
| PC5 | 6.99 | 72.19 |
| PC6 | 6.63 | 78.82 |
| PC7 | 5.99 | 84.81 |
| PC8 | 5.54 | 90.36 |
| PC9 | 5.24 | 95.59 |
| PC10 | 4.41 | 100.00 |
Cronbach alphas and Item-total correlations.
| Dimension | Item-total correlation | |
|---|---|---|
| Impact | 0.81 | 0.53 |
| Relevant literature | 0.82 | 0.47 |
| Study Methods | 0.81 | 0.60 |
| Statistical Methods | 0.80 | 0.63 |
| Study Conclusions | 0.79 | 0.72 |
| Limitations | 0.81 | 0.55 |
| Applicability | 0.81 | 0.58 |
| Presentation | 0.82 | 0.43 |
| Data availability | 0.82 | 0.49 |
| Organization and writing | 0.80 | 0.65 |
CFA factor loadings for each developmental item.
| Indicator | Estimate | Std.Err | P( >| |
|---|---|---|---|
| Impact | 1.00 | 0.00 | 0.00 |
| Relevant literature | 1.01 | 0.00 | 0.00 |
| Study methods | 1.10 | 0.00 | 0.00 |
| Statistical methods | 1.19 | 0.00 | 0.00 |
| Study conclusions | 1.28 | 0.00 | 0.00 |
| Limitations | 1.19 | 0.00 | 0.00 |
| Applicability | 1.20 | 0.00 | 0.00 |
| Presentation | 0.88 | 0.00 | 0.00 |
| Data availability | 1.06 | 0.00 | 0.00 |
| Organization and writing | 1.13 | 0.00 | 0.00 |
Mean and standard deviation (in brackets) for each developmental score dimension per research area.
| PS | SSE | HMS | LS | |
|---|---|---|---|---|
| Impact | 0.473 (0.318) | 0.662 (0.317) | 0.516 (0.329) | 0.521 (0.325) |
| Literature | 0.377 (0.371) | 0.509 (0.384) | 0.328 (0.362) | 0.358 (0.371) |
| Methods | 0.527 (0.316) | 0.585 (0.31) | 0.43 (0.314) | 0.479 (0.313) |
| Statistics | 0.442 (0.329) | 0.645 (0.329) | 0.499 (0.338) | 0.484 (0.335) |
| Conclusions | 0.487 (0.302) | 0.608 (0.303) | 0.521 (0.306) | 0.559 (0.307) |
| Limitations | 0.369 (0.361) | 0.608 (0.364) | 0.437 (0.372) | 0.405 (0.375) |
| Applicability | 0.441 (0.361) | 0.532 (0.359) | 0.423 (0.366) | 0.447 (0.369) |
| Presentation | 0.42 (0.36) | 0.314 (0.331) | 0.322 (0.342) | 0.407 (0.365) |
| Data | 0.315 (0.364) | 0.512 (0.39) | 0.38 (0.377) | 0.388 (0.376) |
| Writing | 0.507 (0.31) | 0.543 (0.305) | 0.492 (0.314) | 0.556 (0.313) |
Effect of research area and journal impact factor on the developmental score using a Gamma Generalized Linear Model with developmental score as response variable.
|
| |
|---|---|
| Developmental score | |
| AreaHMS | −0.071 |
| AreaPS | −0.105 |
| AreaLS | −0.084 |
| IFQuartileQ2 | 0.033 |
| IFQuartileQ3 | 0.058 |
| IFQuartileQ4+NI | −0.025 |
| AreaHMS:IFQuartileQ2 | −0.047 |
| AreaPS:IFQuartileQ2 | −0.052 |
| AreaLS:IFQuartileQ2 | −0.035 |
| AreaHMS:IFQuartileQ3 | −0.158 |
| AreaPS:IFQuartileQ3 | −0.069 |
| AreaLS:IFQuartileQ3 | −0.052 |
| AreaHMS:IFQuartileQ4+NI | −0.060 |
| AreaPS:IFQuartileQ4+NI | −0.037 |
| AreaLS:IFQuartileQ4+NI | −0.027 |
| Constant | 0.547 |
| Observations | 1,331,247 |
| Log Likelihood | 196,834.500 |
| Akaike Inf. Crit. | −393,636.900 |
Notes.
p < 0.1.
p < 0.05.
p < 0.01.
Reference categories were: reports submitted to SSE journals listed in the first quartile of impact factor.
Figure 3Interaction between journal prestige and research area.
Note that due to the restricted number of cases in the sample and for the sake of readability, we included fourth quartile and not-indexed journals in the same category.
Effect of report delivery time on the developmental score per research area using a Gamma Generalized Linear Model with developmental score as response variable.
|
| ||||
|---|---|---|---|---|
| Developmental score | ||||
| PS | SSE | HMS | LS | |
| Report delivery time | 0.002 | 0.001 | 0.002 | 0.003 |
| (0.00002) | (0.00003) | (0.00005) | (0.00004) | |
| Constant | 0.401 | 0.525 | 0.400 | 0.414 |
| (0.0004) | (0.001) | (0.001) | (0.001) | |
| Observations | 824,954 | 171,055 | 149,978 | 185,256 |
| Log Likelihood | 145,774.700 | 21,203.550 | 18,616.660 | 21,472.280 |
| Akaike Inf. Crit. | −291,545.500 | −42,403.110 | −37,229.320 | −42,940.560 |
Notes.
p < 0.1.
p < 0.05.
p < 0.01.
Effect of gender and seniority on the developmental score per area of research using a Gamma Generalized Linear Model with developmental score as response variable.
|
| ||||
|---|---|---|---|---|
| Developmental score | ||||
| PS | SSE | HMS | LS | |
| Seniority 5 to 18 years | −0.050 | −0.064 | −0.039 | −0.033 |
| (0.004) | (0.005) | (0.007) | (0.006) | |
| Seniority > 18 years | −0.065 | −0.090 | −0.058 | −0.058 |
| (0.004) | (0.005) | (0.007) | (0.006) | |
| Gender Man | −0.018 | −0.087 | −0.089 | −0.054 |
| (0.004) | (0.005) | (0.008) | (0.008) | |
| Seniority 5 to 18 years: Gender Man | 0.003 | 0.074 | 0.026 | 0.0001 |
| (0.004) | (0.005) | (0.009) | (0.008) | |
| Seniority > 18 years: Gender Man | 0.005 | 0.099 | 0.020 | 0.015 |
| (0.004) | (0.006) | (0.009) | (0.008) | |
| Constant | 0.504 | 0.630 | 0.532 | 0.538 |
| (0.003) | (0.005) | (0.007) | (0.006) | |
| Observations | 762,864 | 156,575 | 138,933 | 171,641 |
| Log Likelihood | 129,470.100 | 19,139.370 | 17,814.220 | 19,073.790 |
| Akaike Inf. Crit. | −258,928.200 | −38,266.750 | −35,616.450 | −38,135.570 |
Notes.
p < 0.1.
p < 0.05.
p < 0.01.
Reference categories were: women reviewers with < 5 years of seniority. Note that seniority was estimated by looking at the first publication of each reviewer indexed in Scopus.
The effect of the geographical location of reviewers on the developmental score per area of research.
|
| ||||
|---|---|---|---|---|
| Developmental score | ||||
| PS | SSE | HMS | LS | |
| Southern Asia | −0.133 | −0.081 | −0.105 | −0.170 |
| (0.001) | (0.004) | (0.003) | (0.003) | |
| Northern Europe | −0.018 | −0.026 | 0.014 | 0.005 |
| (0.001) | (0.002) | (0.003) | (0.003) | |
| Southern Europe | −0.036 | −0.032 | −0.036 | −0.071 |
| (0.001) | (0.003) | (0.003) | (0.002) | |
| Northern Africa | −0.131 | −0.158 | −0.113 | −0.148 |
| (0.002) | (0.009) | (0.004) | (0.005) | |
| Sub-Saharan Africa | −0.052 | −0.167 | −0.021 | −0.055 |
| (0.003) | (0.006) | (0.007) | (0.006) | |
| Latin America and the Caribbean | −0.057 | −0.083 | −0.038 | −0.080 |
| (0.001) | (0.004) | (0.003) | (0.003) | |
| Western Asia (Middle East) | −0.115 | −0.059 | −0.118 | −0.135 |
| (0.001) | (0.003) | (0.003) | (0.003) | |
| Australia and New Zealand | −0.041 | −0.028 | 0.038 | 0.011 |
| (0.002) | (0.003) | (0.004) | (0.004) | |
| Eastern Europe | −0.082 | −0.083 | −0.064 | −0.084 |
| (0.001) | (0.005) | (0.004) | (0.003) | |
| Northern America | −0.031 | −0.069 | 0.001 | −0.013 |
| (0.001) | (0.002) | (0.002) | (0.002) | |
| South-East Asia | −0.095 | −0.072 | −0.055 | −0.103 |
| (0.002) | (0.005) | (0.005) | (0.004) | |
| East Asia | −0.145 | −0.131 | −0.125 | −0.169 |
| (0.001) | (0.002) | (0.002) | (0.002) | |
| Constant | 0.521 | 0.617 | 0.467 | 0.527 |
| (0.001) | (0.002) | (0.002) | (0.002) | |
| Observations | 821,213 | 169,984 | 148,745 | 183,842 |
| Log Likelihood | 167,148.900 | 23,405.270 | 21,644.070 | 28,294.330 |
| Akaike Inf. Crit. | −334,271.800 | −46,784.550 | −43,262.150 | −56,562.670 |
Notes.
Countries are classified according to ISO 3166 country codes, while their aggregation complies with the United Nation M49 standard. In case of Sub-Saharan Africa, more than the 50% of our observations included reviewers located in South Africa). We used a Gamma Generalized Linear Model with developmental score as response variable.
p < 0.1.
p < 0.05.
p < 0.01.
The reference category were Western European reviewers.
Figure 4Median values of each dimension of the developmental score (i.e., cumulative distribution functions F in Materials and Methods) per geographical region.