| Literature DB >> 34750673 |
Laura C Guglielmetti1, Fabio Faber-Castell2, Lukas Fink3, Raphael N Vuille-Dit-Bille4.
Abstract
BACKGROUND: Statistic scripts are often made by mathematicians and cryptic for clinicians or non-mathematician scientists. Nevertheless, almost all research projects necessitate the application of some statistical tests or at least an understanding thereof. The present review aims on giving an overview of the most common statistical terms and concepts. It further ensures good statistical practice by providing a five-step approach guiding the reader to the correct statistical test. METHODS ANDEntities:
Keywords: Five-step approach; Good statistical practice; Smartphone application; Statistical analysis; Statistical test; Statistics
Mesh:
Year: 2021 PMID: 34750673 PMCID: PMC8933355 DOI: 10.1007/s00423-021-02360-0
Source DB: PubMed Journal: Langenbecks Arch Surg ISSN: 1435-2443 Impact factor: 2.895
Fig. 1Box plot and columns displaying the height in a cohort of n = 9 people. Boxplot representation (a): The line within the box represents the median, while the upper and lower part of the box display the 75th and the 25th percentile. The meaning of the whiskers has to be defined by the authors and may represent maximum and minimum values (example here), 5th and 95th percentile, or other values. The same data is represented as columns showing mean and SD (b). c Age according to body height in a scatterplot
Fig. 2Frequency distribution of the heights of n = 60 patients. The black overlying curve represents the standard normal distribution and helps us to visually assess for normality. In our case, we see how difficult the visual assessment can be
Fig. 3Sample size calculation: Significance level alpha is set at 0.05. Power is set at 0.8. The effect size (calculated from expected difference between groups and standard deviation) is given on the x-axis, and the calculated sample size is given on the y-axis. Decreasing the effect size drastically increases the sample size needed for the experiment
Types of bias
| Type of bias | Description | Example | Prevention |
|---|---|---|---|
| Selection bias | Some participants are more likely to be selected for a study. Included participants are not representative of the population | Unemployed people more likely to participate in a time-consuming study | Allocation concealment, sequence generation |
| Detection bias | A certain condition is more likely to be detected in a subgroup of participants due to systematic differences in how outcomes are determined | Detection of appendicitis by ultrasound in thin versus obese patients | Blinding of outcome assessment |
| Reporting bias | Positive results and correlations are more likely to be reported | Non-finding or negative finding is not published | Preemptive determination of outcomes of interest |
| Exclusion bias | A certain population is more likely to be excluded from a study | Pregnancy, vulnerable patients such as small children or elderly are not included | Preemptive definition of exclusion criteria and consideration of these during discussion of the results found |
| Attrition bias | Loss of follow-up of a certain subgroup of participants | Elderly people not reachable via email | Reporting of incomplete outcome data, intention to treat analysis |
| Performance bias | Systematic differences between the groups regarding the exposure or care other than the intervention | Group receiving a drug gets more frequent blood examinations | Double blinding |
Adapted from [38]
Example of a crosstable (2 × 2)
| N (%) | Group A | Group B |
|---|---|---|
| Males | 2 (20%) | 4 (40%) |
| Females | 8 (80%) | 6 (60%) |
| Total | 10 (100%) | 10 (100%) |
The proportion of males and females within two groups of a sample population are compared
Fig. 4Decision-making when a statistical test is applied
Fig. 5SPSS output for the comparison of the mean age between two groups. Framed in green, we see the Levene test for equality of variances, which SPSS automatically applies when we compare means with a t-test. A significant p Value for the Levene test means that homogeneity of variance cannot be assumed. Therefore, the second line from the SPSS output (framed in dark blue) “equal variances not assumed” contains the p value from the correct t-test (Welch’s t-test). In Fig. 5, the p value for the Levene Test is > 0.05, which means that homogeneity of the variance can be assumed: The upper line, framed in orange, contains the correct t-test (Student’s t-test); the p value we are looking for is p = 0.129