| Literature DB >> 30161249 |
Anand Krishna1, Sebastian M Peter2.
Abstract
Although questionable research practices (QRPs) and p-hacking have received attention in recent years, little research has focused on their prevalence and acceptance in students. Students are the researchers of the future and will represent the field in the future. Therefore, they should not be learning to use and accept QRPs, which would reduce their ability to produce and evaluate meaningful research. 207 psychology students and fresh graduates provided self-report data on the prevalence and predictors of QRPs. Attitudes towards QRPs, belief that significant results constitute better science or lead to better grades, motivation, and stress levels were predictors. Furthermore, we assessed perceived supervisor attitudes towards QRPs as an important predictive factor. The results were in line with estimates of QRP prevalence from academia. The best predictor of QRP use was students' QRP attitudes. Perceived supervisor attitudes exerted both a direct and indirect effect via student attitudes. Motivation to write a good thesis was a protective factor, whereas stress had no effect. Students in this sample did not subscribe to beliefs that significant results were better for science or their grades. Such beliefs further did not impact QRP attitudes or use in this sample. Finally, students engaged in more QRPs pertaining to reporting and analysis than those pertaining to study design. We conclude that supervisors have an important function in shaping students' attitudes towards QRPs and can improve their research practices by motivating them well. Furthermore, this research provides some impetus towards identifying predictors of QRP use in academia.Entities:
Mesh:
Year: 2018 PMID: 30161249 PMCID: PMC6117074 DOI: 10.1371/journal.pone.0203470
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
| Stress | I felt stressed while working on my final thesis. | .82 | |
| Motivation | I am motivated to work on my final thesis. | .62 | |
| Good science leads to significant results | If someone does good scientific work, they produce significant results. | .65 | |
| Supervisor rewards significant results | My grade depends on whether my results are significant. | .81 | |
Note. All items are translated. “(rev.)” indicates an item that was reverse-scored for the scale. The items shown were adapted into the past tense where applicable for participants who indicated their thesis was already complete. N = 207 for all scales.
| Failing to report all dependent measures that are relevant for a finding | Reporting and analysis QRP (Fiedler & Schwarz) |
| Failing to report all conditions that are relevant for a finding | Reporting and analysis QRP (Fiedler & Schwarz) |
| Rounding off p | Reporting and analysis QRP (Fiedler & Schwarz) |
| Selectively reporting studies regarding a specific finding that “worked” | Reporting and analysis QRP (Fiedler & Schwarz) |
| Deciding whether to exclude data after looking at the impact of doing so regarding a specific finding | Reporting and analysis QRP (Fiedler & Schwarz) |
| Claiming to have predicted an unexpected result | Reporting and analysis QRP (Fiedler & Schwarz) |
| Claiming that results are unaffected by demographic variables (e.g. gender) although one is actually unsure (or knows that they do) | Reporting and analysis QRP (Fiedler & Schwarz) |
| Falsifying data | Reporting and analysis QRP (Fiedler & Schwarz) |
| Changing or formulating new hypotheses after analyzing the data | Reporting and analysis QRP (self-generated) |
| Collecting more data after seeing whether results were significant in order to render non-significant results significant | Study design QRP |
| Stopping data collection after achieving the desired result concerning a specific finding | Study design QRP |
| Conducting a power analysis | Positive or neutral practice |
| Reporting effect sizes | Positive or neutral practice |
| Utilizing a sequential analysis for planned early data collection stopping | Positive or neutral practice |
| Using Bayesian analysis | Positive or neutral practice |
Note. All items are translated.
| Selectively reporting studies | 159 (75.8%) | 28.3% | 13.0% |
| Deciding whether to exclude data after looking at the results | 161 (77.8%) | 15.5% | 12.1% |
| Changing or formulating new hypotheses after analyzing the data | 171 (82.6%) | 15.0% | 7.2% |
| Rounding off p | 163 (78.7%) | 10.4% | 11.6% |
| Claiming to have predicted an unexpected result | 165 (79.7%) | 10.3% | 10.1% |
| Failing to report all relevant conditions | 166 (80.2%) | 7.7% | 9.7% |
| Failing to report all relevant dependent measures | 157 (75.8%) | 5.8% | 14.0% |
| Falsifying data | 175 (84.5%) | 2.9% | 5.3% |
| Falsely claiming that results are unaffected by demographics | 155 (74.9%) | 2.6% | 15.0% |
| Collecting more data in order to achieve significance | 169 (81.6%) | 2.4% | 8.2% |
| Stopping data collection after achieving the desired result | 173 (83.6%) | 1.9% | 6.3% |
| Reporting effect sizes | 162 (78.3%) | 69.1% | 11.6% |
| Conducting a power analysis | 150 (72.5%) | 35.3% | 16.9% |
| Using Bayesian analysis | 119 (57.5%) | 3.9% | 32.4% |
| Utilizing sequential analysis | 158 (76.3%) | 1.0% | 7.2% |
Note. The first column shows the number (and percentage) of participants who responded either “yes” or “no”, the second shows the relative frequency of yes responses among such participants. The third column is the percentage of all 207 participants who responded “don’t know/can’t judge”; the shortfall to 100% is due to item nonresponses.
| 0 | 80 (44.4%) | 105 (58.3%) |
| 1 | 60 (33.3%) | 44 (24.4%) |
| 2 | 20 (11.1%) | 16 (8.9%) |
| 3 | 9 (5.0%) | 6 (3.3%) |
| 4 | 6 (3.3%) | 4 (2.2%) |
| 5 | 1 (0.6%) | 3 (1.7%) |
| 6 | 2 (1.1%) | 2 (1.1%) |
| 7 | 2 (1.1%) | - |
Note. The subsample is those participants who responded to at least one QRP item with “yes” or “no” (n = 180).
| Selectively reporting studies | 199 | 2.18 (.98) | 121 | 2.03 (1.22) |
| Deciding whether to exclude data after looking at the results | 189 | 1.96 (.94) | 119 | 2.13 (1.25) |
| Changing or formulating new hypotheses after analyzing the data | 201 | 1.92 (1.00) | 140 | 1.97 (1.21) |
| Rounding off p | 196 | 2.02 (.97) | 120 | 1.87 (1.19) |
| Claiming to have predicted an unexpected result | 199 | 1.89 (.87) | 134 | 1.78 (1.06) |
| Failing to report all relevant conditions | 201 | 1.65 (.73) | 129 | 1.66 (1.01) |
| Failing to report all relevant dependent measures | 190 | 1.79 (.79) | 127 | 1.75 (.97) |
| Falsifying data | 202 | 1.08 (.47) | 163 | 1.07 (.39) |
| Falsely claiming that results are unaffected by demographics | 188 | 1.57 (.74) | 126 | 1.52 (.86) |
| Collecting more data in order to achieve significance | 199 | 2.21 (1.09) | 123 | 2.34 (1.32) |
| Stopping data collection after achieving the desired result | 199 | 2.04 (.96) | 115 | 1.70 (.98) |
| Reporting effect sizes | 192 | 4.61 (.74) | 137 | 4.64 (.79) |
| Conducting a power analysis | 160 | 4.47 (.65) | 99 | 4.37 (.89) |
| Using Bayesian analysis | 67 | 3.57 (1.02) | 37 | 3.30a (1.35) |
| Utilizing sequential analysis | 100 | 2.23 (1.06) | 59 | 1.97 (1.25) |
Note. All attitudes are coded such that greater values indicate greater endorsement. All attitude values except those marked a differ significantly from the scale midpoint after Bonferroni correction (all t ≥ 4.55, all p ≤ .0001, all d ≥ .55).
| Prevalence for reporting and analysis QRPs | 179 | .105 (.17) |
| Prevalence for study design QRPs | 174 | .026 (.12) |
| Participant attitude towards reporting and analysis QRPs | 202 | 1.73 (.48) |
| Participant attitude towards study design QRPs | 201 | 2.12 (.83) |
| Perceived supervisor attitude towards reporting and analysis QRPs | 168 | 1.71 (.82) |
| Perceived supervisor attitude towards study design QRPs | 136 | 2.09 (1.10) |
| Stress | 207 | 3.22 (.76) |
| Motivation | 207 | 3.73 (1.01) |
| Good science leads to significant results | 207 | 1.45 (.59) |
| Supervisor rewards significant results | 207 | 1.62 (.69) |
Fig 1Regression weights for models predicting reporting and analysis (R&A) and study design (S) QRP use.
Standard errors are given in parentheses. Asterisks denote statistical significance: *: p < .05, **: p < .01, ***: p < .001.
Fig 2Regression weights for mediation models predicting self-reported reporting and analysis (R&A) and study design (S) QRP use.
Standard errors are given in parentheses. Asterisks denote statistical significance: *: p < .05, **: p < .01, ***: p < .001.