| Literature DB >> 14516474 |
Noori Akhtar-Danesh1, Mahshid Dehghan-Kooshkghazi.
Abstract
BACKGROUND: Misconduct in medical research has been the subject of many papers in recent years. Among different types of misconduct, data fabrication might be considered as one of the most severe cases. There have been some arguments that correlation coefficients in fabricated data-sets are usually greater than that found in real data-sets. We aim to study the differences between real and fabricated data-sets in term of the association between two variables.Entities:
Mesh:
Year: 2003 PMID: 14516474 PMCID: PMC212490 DOI: 10.1186/1471-2288-3-18
Source DB: PubMed Journal: BMC Med Res Methodol ISSN: 1471-2288 Impact factor: 4.615
Summary statistics of height and weight for 65 students
| 145 | 175 | 159.5 | 7.2 | ||
| 39 | 84 | 54.5 | 9.2 | ( |
Figure 1Scatter-plot of weight and height for data-sets made up by 34 individuals
Comparison between the significance levels of the correlations for the made up data-sets and 2500 random samples produced based on the specifications of Table 1*
| 0.05 < | ||||
| 24 (70.6%) | 3 (8.8%) | 7 (20.6%) | 34 | |
| 137 (5.5%) | 125 (5.0%) | 2238 (89.5%) | 2500 |
* data-sets with p-value ≤ 0.05 were compared with p-value > 0.05, (Fisher's exact test, p < 0.0001)
Figure 2Scatter-plot of birth weight by gestational age for 637 newborn boys
Summary statistics of gestational age (GA) and birth weight for 637 newborn boys
| 38.0 | 44.0 | 40.1 | 1.0 | ||
| 1750 | 5000 | 3277 | 443 | ( |
Figure 3Scatter-plot of birth weight and gestational age for 34 made up data-sets
Comparison between the made up data-sets and 2500 real random samples from 637 newborn boys of the p-values of the correlation between GA and birth weight+
| 0.05 < | ||||
| 22 (64.7%) | 1 (2.9%) | 11 (32.4%) | 34 | |
| 109 (4.4%) | 113 (4.5%) | 2278 (91.1%) | 2500 |
+ data-sets with p-value ≤ 0.05 were compared with p-value > 0.05, (Fisher's exact test, p < 0.0001)
Summary statistics of communication skill in two groups of students
| 12 | 24 | 19.1 | 2.8 | ||
| 8 | 22 | 16.6 | 3.4 |
Figure 4Box-plots of 17 made up and the real data-sets for the two different curriculum
Comparison between made up and simulated data-sets about communication skill++
| 0.05 < | ||||
| 9 (52.9%) | 2 (11.8%) | 6 (35.3%) | 17 | |
| 2151 (86.0%) | 180 (7.2%) | 169 (6.8%) | 2500 |
++ data-sets with p-value ≤ 0.05 were compared with p-value > 0.05, (Fisher's exact test, p = 0.0011)