Rasha M Allam, Maissa K Noaman, Manar M Moneer, Inas A Elattar.
Abstract
Purpose: To identify statistical errors and pitfalls in dissertations performed as part of the requirements for the Medical Doctorate (MD) degree at the National Cancer Institute (NCI), Cairo University (CU) to improve the quality of medical research.
Keywords: MD theses; National Cancer Institute; study design; statistical methodology
Year: 2017 PMID: 28240524 PMCID: PMC5563106 DOI: 10.22034/APJCP.2017.18.1.231
Source DB: PubMed Journal: Asian Pac J Cancer Prev ISSN: 1513-7368
Application and Misapplication of Different Statistical Tests Used - NCI, 2009-2013
| Test statistic | Number of misapplications | Cause of misapplication | Correction |
|---|---|---|---|
| Mann-Whitney U test (n=20) | 3 | | |
| | 1 | Comparing medians for censored data (TTP) | Log-rank test |
| | 1 | ND | Student t-test |
| | 1 | Comparing categorical variables | Chi-square or Fisher’s exact test, as appropriate |
| Student t-test (n=16) | 21 | | |
| | 14 | Not ND | Mann-Whitney U test |
| | 5 | Small sample size not tested for normality | Test for normality, then select the appropriate test |
| | 1 | Testing relation between categorical variables | Chi-square or Fisher’s exact test, as appropriate |
| | 1 | Paired data and not ND | Wilcoxon signed-rank test |
| Chi-square (n=50) | 44 | | |
| | 15 | Expected count < 1 | Use Fisher’s exact test for a 2 by 2 table; otherwise combine categories and then use Fisher’s exact test |
| | 18 | Expected counts < 5 in more than 25% of cells | Combine categories, then use the chi-square test |
| | 3 | More than one p value in the same table | Report only one p value |
| | 5 | Calculation of direct estimates with censored data | Survival analysis |
| | 2 | Only one proportion tested | Summary statistics with CI |
| | 1 | Required but not used | To be used |
| Log-rank (n=36) | 11 | | |
| | 1 | No post hoc tests for more than 2 groups | Post hoc test |
| | 1 | Comparison between time to relapse and DFS | Omit |
| | 9 | Response to treatment in relation to survival in log-rank test | Do not include response as a prognostic factor, as it is related to outcome |

ND, normally distributed; TTP, time to progression; CI, confidence interval; DFS, disease-free survival.
More than 1 misapplication can occur in one thesis
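
The expected-count rules in the chi-square rows above can be checked mechanically before a test is chosen. The following is a minimal Python sketch for a 2 by 2 table that uses only the standard library. The function names (`expected_counts`, `chi_square_p`, `fisher_exact_two_sided`, `choose_test_2x2`) and the example counts are illustrative assumptions and do not come from the dissertations. It also assumes the common convention that Fisher’s exact test is used for a 2 by 2 table whenever any expected count is below 5.

```python
from math import comb, erfc, sqrt


def expected_counts(table):
    """Expected cell counts under independence for a 2x2 table [[a, b], [c, d]]."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    return [[r * k / n for k in col_totals] for r in row_totals]


def chi_square_p(table):
    """Pearson chi-square p-value with 1 df and no continuity correction."""
    exp = expected_counts(table)
    stat = sum((table[i][j] - exp[i][j]) ** 2 / exp[i][j]
               for i in range(2) for j in range(2))
    # For 1 degree of freedom the chi-square upper tail is erfc(sqrt(stat / 2)).
    return erfc(sqrt(stat / 2))


def fisher_exact_two_sided(table):
    """Two-sided Fisher exact p-value: total probability of all tables
    (with the same margins) that are no more likely than the observed one."""
    (a, b), (c, d) = table
    r1, r2, c1 = a + b, c + d, a + c
    total = comb(r1 + r2, c1)
    lo, hi = max(0, c1 - r2), min(r1, c1)
    probs = {x: comb(r1, x) * comb(r2, c1 - x) / total for x in range(lo, hi + 1)}
    cutoff = probs[a] * (1 + 1e-9)  # small tolerance for floating-point ties
    return min(1.0, sum(p for p in probs.values() if p <= cutoff))


def choose_test_2x2(table):
    """Assumed rule: Fisher's exact test if any expected count is below 5."""
    smallest = min(min(row) for row in expected_counts(table))
    if smallest < 5:
        return "fisher", fisher_exact_two_sided(table)
    return "chi-square", chi_square_p(table)


# Hypothetical counts: every expected count is 2, so Fisher's exact test is chosen.
print(choose_test_2x2([[3, 1], [1, 3]]))
```

For tables larger than 2 by 2 the table’s own correction (combining categories first) would apply; that step depends on which categories can sensibly be merged and is not sketched here.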
Characteristics of Results Section and Use of Statistical Analysis of MD Dissertations - NCI, 2009-2010 vs. 2011-2013
| Characteristic | Year of defense 2009-2010 (n=29) | Year of defense 2011-2013 (n=33) | p value |
|---|---|---|---|
| Results support aims | 8 (27.6) | 16 (48.5) | 0.092 |
| Comparable groups in relevant measures | 4 (33.3) | 8 (57.1) | 0.225 |
| Complementary text with data in tables and illustrations | 21 (72.4) | 30 (90.9) | 0.057 |
| Missing data for each variable stated | 7 (24.1) | 10 (30.3) | 0.587 |
| Misinterpretation of results | 24 (82.8) | 27 (81.8) | 0.923 |
| Misapplication of statistical words | 24 (82.8) | 24 (72.7) | 0.346 |
| Type of analysis | |||
| Multivariate | 5 (17.2) | 8 (24.2) | 0.499 |
| Univariate | 24 (82.8) | 25 (75.8) | |
| Proper type of analysis | 22 (75.9) | 26 (78.8) | 0.783 |
| Accurate title and labels of tables and graphs | 6 (20.7) | 11 (33.3) | 0.265 |
| Good organization of tables and graphs | 8 (27.6) | 10 (30.3) | 0.814 |
| Satisfactory presentation of statistical output | 8 (27.6) | 15 (45.5) | 0.146 |
| Discrepancies between text and tables | 10 (34.5) | 16 (48.5) | 0.265 |
| Misuse of statistical tests | 14 (48.3) | 19 (57.6) | 0.464 |
| Statistical tests fulfilling assumptions | 14 (48.3) | 17 (51.5) | 0.799 |
Data are presented as n (%).
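
The source does not name the test behind the p values in this table. As a plausibility check, the sketch below assumes an uncorrected Pearson chi-square test on the 2 by 2 table formed by each row’s counts. `two_proportion_chi_square_p` is a hypothetical helper written for this illustration, using only the Python standard library.

```python
from math import erfc, sqrt


def two_proportion_chi_square_p(x1, n1, x2, n2):
    """Pearson chi-square p-value (1 df, no continuity correction)
    for comparing the proportions x1/n1 and x2/n2."""
    observed = [[x1, n1 - x1], [x2, n2 - x2]]
    total = n1 + n2
    row_totals = [n1, n2]
    col_totals = [x1 + x2, total - x1 - x2]
    stat = 0.0
    for i in range(2):
        for j in range(2):
            expected = row_totals[i] * col_totals[j] / total
            stat += (observed[i][j] - expected) ** 2 / expected
    # For 1 degree of freedom the chi-square upper tail is erfc(sqrt(stat / 2)).
    return erfc(sqrt(stat / 2))


# "Results support aims": 8 of 29 (2009-2010) vs. 16 of 33 (2011-2013)
print(round(two_proportion_chi_square_p(8, 29, 16, 33), 3))
# "Misinterpretation of results": 24 of 29 vs. 27 of 33
print(round(two_proportion_chi_square_p(24, 29, 27, 33), 3))
```

Under this assumption the two rows give 0.092 and 0.923, which agree with the tabulated p values. This suggests, but does not confirm, that an uncorrected chi-square test was used for these comparisons.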
Comparison between Strasak et al., (2007b) Review and Present Study Regarding Design of Study, Data Analysis, Documentation and Presentation
| Category | WKW, n=15 (%) | WMW, n=7 (%) | Present study, n=62 (%) |
|---|---|---|---|
| Design of study | |||
| No sample size/power calculation (overall) | 73.3 | 57.1 | 93.5 |
| Prospective study design | 26.7 | 28.6 | 43.5 |
| Retrospective study design | 26.7 | 28.6 | 19.4 |
| Study design not classifiable | 20.0 | 0.0 | 45.2 |
| Data analysis | |||
| Use of a wrong statistical test | 20.0 | 42.9 | 53.2 |
| Failure to include a multiple-comparison correction/α level correction | 20.0 | 14.3 | 0.0 |
| Special errors with χ2-tests | |||
| No Yates correction if small numbers | 13.3 | 0.0 | 0.0 |
| Use of χ2 when expected numbers in a cell < 5 | 6.7 | 28.6 | 36.0 |
| Documentation | |||
| Failure to state number of tails | 80.0 | 85.7 | 93.5 |
| Failure to specify which test was performed on a given set of data | 26.7 | 14.3 | 100.0 |
| Presentation | |||
| Giving standard error (SEM) instead of SD for statistical description | 6.7 | 0.0 | 6.0 |
| p = NS, p < 0.05, p > 0.05 etc. instead of reporting exact p-values | 46.7 | 71.4 | 8.1 |
WKW, Wiener Klinische Wochenschrift; WMW, Wiener Medizinische Wochenschrift
Comparison between Strasak et al., (2007b) Review and Present Study Regarding Type and Frequencies of Statistical Tests Used
| Types and frequencies | WKW, n=35 (%) | WMW, n=16 (%) | Present study, n=62 (%) |
|---|---|---|---|
| No statistical methods | 2.9 | 25 | 0 |
| Descriptive statistics only | 22.9 | 31.3 | 1.6 |
| Inferential methods | 74.3 | 43.8 | 98.4 |
| t-tests | 28.6 | 12.5 | 61.3 |
| Contingency table analysis (χ2, Fisher’s exact test) | 19.4 | 31.3 | 95.2 |
| Non-parametric tests | 28.6 | 25 | 59.7 |
| One-way ANOVA | 5.7 | 0 | 6.5 |
| Correlation coefficients | 22.9 | 12.5 | 20.1 |
| Regression | 25.7 | 6.3 | 9.7 |
| Survival analysis | 11.4 | 0 | 58.1 |
| Confidence intervals | 14.4 | 12.5 | 58.3 |
WKW, Wiener Klinische Wochenschrift; WMW, Wiener Medizinische Wochenschrift
Comparison between Hanif and Ajmal Study and Present Study Regarding Statistical Methodology, Design and Statistical Errors
| Statistical methodology | Hanif and Ajmal study (2011), n=80 (%) | Present study, n=62 (%) |
|---|---|---|
| Design of study not given | 52.5 | 85.5 |
| No sample size calculation/power calculation (overall) | 92.5 | 93.5 |
| Sampling selection criteria not given | 75.0 | 79.0 |
| No statistical methods | 26.3 | 14.5 |
| Data analysis technique defined | 48.7 | 85.5 |
| No statistical package defined with version | 70.0 | 32.3 |
| Descriptive statistics only | 28.8 | 1.6 |
| Inferential methods with descriptive | 41.3 | 98.3 |
| Contingency table analysis | 30.0 | 95.2 |
| t-tests | 13.8 | 61.3 |
| Basic Chi-square, Fisher’s Test | 30.0 | 95.2 |
| Non-Parametric tests | 3.8 | 59.7 |
| Analysis of Variance | 7.5 | 6.5 |
| Correlation coefficient | 7.5 | 21.0 |
| Logistic Regression | 8.7 | 9.7 |
| Survival Analysis | 0.0 | 58.1 |
| Confidence interval | 15.0 | 58.1 |
| Use of wrong statistical analysis | 28.7 | 21.0 |
| Incompatibility of statistical test with type of data examined | 20.0 | 53.2 |
| Overall inappropriate interpretation | 13.7 | 82.3 |
Comparison between Leucuța et al. Study and Present Study
| Category | Leucuța et al. study (2013), n=170 (%) | Present study, n=62 (%) |
|---|---|---|
| Summarize each variable with descriptive statistics | 97.1 | 100.0 |
| Verify that data conformed to the assumptions | 12.4 | 46.8 |
| Indicate whether and how any allowance or adjustments were made for multiple comparisons | 44.0 | 0.0 |
| Report how any outlying data were treated in the analysis | 4.9 | 0.0 |
| Say whether tests were one- or two-tailed | 7.8 | 6.5 |
| Report the alpha level (e.g. 0.05) | 75.5 | 6.5 |
| Name the statistical package or program used | 32.8 | 79.2 |
| Report total or group sample size for analyses | 80 | 100 |
| 95% confidence coefficient to indicate the precision of an estimate | 11.1 | 58.3 |
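
Several rows in the tables above concern reporting a 95% confidence interval alongside an estimate. To illustrate what that looks like for one of the proportions reported here, the sketch below computes a Wilson score interval using only the Python standard library. The choice of the Wilson method and the helper name `wilson_interval` are assumptions made for this example, and the count of 33 of 62 is inferred from the 53.2% figure for use of a wrong statistical test in the present study; none of these come from the source.

```python
from math import sqrt


def wilson_interval(successes, n, z=1.959964):
    """Wilson score interval for a binomial proportion.
    The default z is the standard normal quantile for a two-sided 95% interval."""
    p_hat = successes / n
    denom = 1 + z * z / n
    centre = (p_hat + z * z / (2 * n)) / denom
    half_width = z * sqrt(p_hat * (1 - p_hat) / n + z * z / (4 * n * n)) / denom
    return centre - half_width, centre + half_width


# Inferred count: 33 of 62 theses (53.2%) used a wrong statistical test.
low, high = wilson_interval(33, 62)
print(f"{33 / 62:.1%} (95% CI {low:.1%} to {high:.1%})")
```

With 62 theses the interval is wide, roughly 41% to 65%, which shows why a percentage from a sample of this size is more informative when its interval is reported with it.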