| Literature DB >> 31145670 |
Daniel A Pollard1, Thomas D Pollard2, Katherine S Pollard3.
Abstract
We provide guidelines for using statistical methods to analyze the types of experiments reported in cellular and molecular biology journals such as Molecular Biology of the Cell. Our aim is to help experimentalists use these methods skillfully, avoid mistakes, and extract the maximum amount of information from their laboratory work. We focus on comparing the average values of control and experimental samples. A Supplemental Tutorial provides examples of how to analyze experimental data using R software.Entities:
Mesh:
Year: 2019 PMID: 31145670 PMCID: PMC6724699 DOI: 10.1091/mbc.E15-02-0076
Source DB: PubMed Journal: Mol Biol Cell ISSN: 1059-1524 Impact factor: 4.138
FIGURE 1:Examples of distributions of measurements. (A) Normal distribution with vertical lines showing the mean = median = mode (dotted) and ±1, 2, and 3 standard deviations (SD or σ). The fractions of the distribution are ∼0.67 within ±1 SD and ∼0.95 within ±2 SD. (B) Histogram of approximately normally distributed data. (C) Histogram of a skewed distribution of data. (D) Histogram of the natural log transformation of the skewed data in C. (D) Histogram of exponentially distributed data. (F) Histogram of a bimodal distribution of data.
Matching types of data appropriately with commonly used statistical tests (Crawley, 2013; Whitlock and Schluter, 2014).
| Response type | Treatment type | Statistical tests | Typical null hypothesis | Assumptions | Functions in Excel and R | Examples |
|---|---|---|---|---|---|---|
| Continuous numerical (e.g., reaction rate) | Binary (e.g., wild type vs. mutant) | Means equal for two treatments | Randomized treatment; normally distributed response; equal variance of response for two treatment groups | Excel: TTEST(controlrate, genotype, tails = 2, type = 2) R: t.test(rate∼genotype, var.equal = TRUE) | ||
| Continuous numerical (e.g., reaction rate) | Categorial (e.g. wild type vs mutant 1 vs mutant 2) | ANOVA followed by Tukey-Kramer post-hoc test | Means equal across treatments | Randomized treatment; normally distributed response; equal variance of response between treatment groups | Excel: Analysis ToolPak add-in for single factor ANOVA required if treatment has more than two levels R: aov(rate∼genotype) TukeyHSD() | |
| Continuous numerical (e.g., reaction rate) | Continuous numerical (e.g., drug concentrations) | Linear regression followed by | Coefficients equal to zero; correlation coefficient equal to zero | Randomized treatment; linear relationship between treatment and response; treatment and response bivariate normally distributed; normally distributed residuals | Excel: LINEST(rate, drug, const = TRUE, stats = TRUE) R: lm(rate∼drug) summary() | |
| Continuous numerical (e.g., reaction rate) | More than one treatment: Continuous numerical (e.g., drug concentrations) plus categorical (e.g., wild type vs. mutant 1 vs. mutant 2) | Analysis of covariance (ANCOVA) | Coefficients equal to zero | Randomized treatment; numerical treatment and response bivariate normally distributed; equal variance of response between treatment groups; linear relationship between numerical treatment and response; normally distributed residuals | Excel: LINEST(rate, treatments, const = TRUE, stats = TRUE) R: lm(rate∼drug + genotype) summary() | |
| Categorical (e.g., cell cycle stage) | Categorical (e.g., wild type vs. mutant 1 genotype vs. mutant 2 genotype) | Chi-square or G contingency test; binomial test; Fisher’s exact test | Proportions between response categories are equal between treatments | Randomized treatment; all expected counts are one or more and no more than 20% of expected counts less than five; binomial: response or treatment sample sizes fixed; Fisher’s: response and treatment sample sizes fixed | Excel: CHISQ.TEST(actual_range,expected_range) R: chisq.test(stage, genotype) | |
| Binary categorical (e.g., alive or dead) | Continuous numerical (e.g., drug concentrations) | Logistic regression; generalized linear model | Slope or intercept equal to zero | Randomized treatment; linear relationship between treatment and log odds of one response | Excel: Not available R: glm(alive∼drug, family = binomial) summary() | |
| Binary categorical (e.g., alive or dead) | More than one treatment: Categorical (e.g., wild type vs. mutant 1 vs. mutant 2) plus categorical (e.g., day 1 vs. day 2 vs. day 3) | Logistic regression; generalized linear model | Slope or intercept equal to zero | Randomized treatment | Excel: Not available R: glm(alive∼genotype + day, family = binomial) summary() |
FIGURE 2:Decision tree to select an appropriate statistical test for association between a response and one or more treatments. Multiple treatments or a treatment and potential confounders can be tested using linear models (also known as ANCOVA) or generalized linear models (e.g., logistic regression for binary responses). Multiple treatments with repeated measurements on the same specimens, such as time courses, can be tested using mixed model regression. Questions in squares; answers on solid arrows; actions in ovals; tests in diamonds.
FIGURE 3:Comparison of data presentation for three experiments on the constriction of cytokinetic contractile rings with several perturbations. (A) Rate of ring constriction in Caenorhabditis elegans embryos from Zhuravlev . Error bars represent SD; p values were obtained by an unpaired, two-tailed Student’s t test; n.s., p ≥ 0.05; *, p < 0.05; **, p < 0.01; ****, p < 0.0001. Sample sizes 10–12. (B) Time to complete ring constriction in Schizosaccharomyces pombe from Li . Error bars, SD; n ≥ 10 cells. *, p < 0.05 obtained with one-tailed t tests for two samples with unequal variance. (C) Kaplan-Meier outcomes plots comparing the times (relative to spindle pole body separation) of the onset of contractile ring constriction in populations of (○) wild-type and (⬤) blt1∆ fission yeast cells from Goss . A log-rank test determined that the curves differed with p < 0.0001.
FIGURE 4:Three graphs show factors affecting the statistical power, the probability of rejecting a truly false null hypothesis in a two-sample t test. The statistical power depends on three factors: (A) increases with the number of measurements (n); (B) decreases with the size of the SD (sd); and (C) increases with effect size (Δ), the difference between the control and the test samples on both sides of minimum at zero effect size. Two variables are held constant in each example.
FIGURE 5:Comparison of two t distributions with degrees of freedom of 3 (sample size 4) and 10 (sample size 11) with a normal distribution with a mean value of 0 and SD = 1. The vertical dashed lines are 2.5th and 97.5th quantiles of the corresponding (same color) t distribution. The area below the left dashed line and above the right dashed line totals 5% of the total area under the curve. The t distribution is the theoretical probability of obtaining a given t statistic with many random samples from a population where the null hypothesis is true. The shape of the distribution depends on the sample size. The distribution is symmetric, centered on 0. The tails are thicker than a standard normal distribution, reflecting the higher chance of values away from the mean when both the mean and the variance are being estimated from a sample. The t distribution is a probability density function so the total area under the curve is equal to 1. The area under the curve between two x-axis (t statistic) values can be calculated using integration. With large sample sizes the accuracy of estimates of the true variance in an experiment increase and the t distribution converges on a standard normal distribution. To determine the probability of the observed statistic if the null hypothesis were true, one compares the t statistic from an experiment with the theoretical t distribution. For a one-sided test in the greater-than direction, the area above the observed t statistic is the p value. The 97.5th quantile has p = 0.025. For a one-sided test in the less-than direction, the area below the observed t statistic is the p value. The 2.5th quantile has p = 0.025 in this case. For a two-sided test, the p value is the sum of the area beyond the observed statistic and the area beyond the negative of the observed statistic. If this probability value (p value) is low, the data are not likely under the null hypothesis.