| Literature DB >> 22412947 |
Leah Armon1, S Roy Caplan, Michael Eisenbach, Benjamin M Friedrich.
Abstract
Biased motion of motile cells in a concentration gradient of a chemoattractant is frequently studied on the population level. This approach has been particularly employed in human sperm chemotactic assays, where the fraction of responsive cells is low and detection of biased motion depends on subtle differences. In these assays, statistical measures such as population odds ratios of swimming directions can be employed to infer chemotactic performance. Here, we report on an improved method to assess statistical significance of experimentally determined odds ratios and discuss the strong impact of data correlations that arise from the directional persistence of sperm swimming.Entities:
Mesh:
Year: 2012 PMID: 22412947 PMCID: PMC3297605 DOI: 10.1371/journal.pone.0032909
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Directional persistence of sperm swimming paths prompts adapted statistical test for motion bias.
A. One out of 30,000 control human sperm tracks (blue) and the corresponding averaged swimming path (purple; computed using a second-order Savitzky-Golay filter). The fast wiggling of the sperm head center is clearly visible. For later odds ratio calculations, angles ψ between a preferred direction and the frame-to-frame displacement vectors were binned according to the color wheel shown; the color-coded track illustrates the binning. B. Orientational correlation function C(t) of the swimming direction angle ψ for the sperm track from panel A (solid blue). This correlation function shows fast oscillations resulting from periodic head wiggling as well as slow decay on a time-scale of several seconds, which reflects directional persistence of sperm swimming. Also shown is a sample average of this autocorrelation (dotted blue) computed by averaging individual angle autocorrelation functions from n = 4,000 long sperm tracks (duration >10 sec). We can further define an analogous angle autocorrelation function for the direction angle of the averaged path (solid purple: for the averaged path from panel A; dotted purple: sample average). C. Empirical significance thresholds for the odds ratio of swimming direction angles for a human sperm population assay: An odds ratio O.R. = (N +/N −)/(N + 0/N − 0) greater than 1+Δ95%(N) with sample size N = min(N ++N −,N + 0+N − 0) should be statistically significant for positive chemotaxis at a 5%-confidence level. The test for negative chemotaxis reads O.R.<1−Δ5%. Significance thresholds were determined by block bootstrapping based on a large control data set of swimming direction angles of 30,000 sperm tracks. For various sample sizes N, we sampled a distribution of odds ratios by computing odds values for suitable random subsamples of size about N. Each subsample comprises the full angle data corresponding to a random selection of tracks. Upper inset: Distribution of odds ratios for N = 106 by bootstrapping. The 5% and 95% percentiles of this distribution represent the significance thresholds 1−Δ5% and 1+Δ95%, respectively. Lower inset: Significance thresholds Δ*5% and Δ*95% for a simulated control data set devoid of correlations as a function of test sample size N* (continuous lines, green Δ*5%, red Δ*95%). We obtain almost identical “significance thresholds”, if we employ simple bootstrapping drawing subsamples from pooled experimental angle data (not shown). The significance thresholds determined by block bootstrapping (open symbols, green Δ5%, red Δ95%) superpose with those for the simulated control data if we renormalize sample size as N* = 0.029N, i.e. Δ*5%(N*) ≈Δ5%(N) and Δ*95%(N*)≈Δ95%(N). N* can be regarded as an effective number of independent data points in an experimental sample of size N. D. Odds ratios characterizing biased motion of human sperm cells in a concentration gradient of the chemoattractant progesterone for various initial concentrations (black dots). Errorbars denote symmetric 90%-confidence intervals that were determined using bootstrapping based on the data from this particular experiment. Using bootstrapping on a separate, very large control data set, we can assign accurate significance levels p to each odds ratio. These significance levels represent the likelihood that the odds ratios in this particular experiment were drawn from the control distribution.
Figure 2Effect of tunable directional persistence on tests of motion bias for a simple mathematical model of stochastic swimming paths.
A. Orientational correlation function C(t) of the (frame-to-frame) swimming direction angle of simulated persistent random walks for two values of the persistence time τ = Δt (red) and τ = 10Δt (blue). Also shown are two example tracks. B. 95% percentile of the odds ratio distribution of simulated persistent random walks as a function of sample size N for different values of the persistence time τ (green: τ = 0.1Δt, red: τ = Δt, blue: τ = 10Δt). (For each value of τ, we used simulated control data sets each comprising 104 persistent random walks of duration 100Δt.) Lower inset: Significance threshold Δ*95% for a simulated control data set devoid of correlations as a function of test sample size N* (magenta line). The significance thresholds determined by block bootstrapping of simulated persistent random walks superpose with those for the correlation-free control data if we renormalize sample size as N* = ρN, i.e. Δ*95%(N*) ≈Δ95%(N). N* can be regarded as an effective number of independent data points in a sample of size N. C. The “effective number of independent data points” N* in a sample of size N of angles of simulated persistent random walks is a function of the persistence time τ. The ratio N*/N was determined from a superposition of percentiles-curves of the odds ratio distribution for different values of τ by rescaling N→N* (“data collapse”), see inset of Figure 2B. D. We can test simulated biased persistent random walks for the presence of biased motion by comparing the corresponding odds ratio to “empirical” significance thresholds as computed in Figure 2B. This involves the risk of type-II errors (“false negatives”), in which biased motion is falsely classified as unbiased (the null hypothesis is falsely accepted). The rate of such a type-II errors decreases with sample size N, but increases with the persistence time τ of the biased persistent random walks (green: τ = 0.1Δt, red: τ = Δt, blue: τ = 10Δt). Parameter: β = 0.1. (Note that curves do not superpose as a function of the effective number N* of independent data points as the mean of the odds ratio distribution