| Literature DB >> 32408713 |
Alessandro Romano1, Piergiorgio Stevanato2.
Abstract
Germination data are analyzed by several methods, which can be mainly classified as germination indexes and traditional regression techniques to fit non-linear parametric functions to the temporal sequence of cumulative germination. However, due to the nature of germination data, often different from other biological data, the abovementioned methods may present some limits, especially when ungerminated seeds are present at the end of an experiment. A class of methods that could allow addressing these issues is represented by the so-called "time-to-event analysis", better known in other scientific fields as "survival analysis" or "reliability analysis". There is relatively little literature about the application of these methods to germination data, and some reviews dealt only with parts of the possible approaches such as either non-parametric and semi-parametric or parametric ones. The present study aims to give a contribution to the knowledge about the reliability of these methods by assessing all the main approaches to the same germination data provided by sugar beet (Beta vulgaris L.) seeds cohorts. The results obtained confirmed that although the different approaches present advantages and disadvantages, they could generally represent a valuable tool to analyze germination data providing parameters whose usefulness depends on the purpose of the research.Entities:
Keywords: Cox’s proportional hazard model; Kaplan Meier estimator; accelerated failure time model; germination; sugar beet; survival analysis; water stress
Year: 2020 PMID: 32408713 PMCID: PMC7285257 DOI: 10.3390/plants9050617
Source DB: PubMed Journal: Plants (Basel) ISSN: 2223-7747
Figure 1Kaplan-Meier estimates of survival functions (step curves) and AFT log-normal (A,B) and log-logistic (C,D) model curves (continuous curves) for genotypes Sh and Hy. Dotted lines indicate the stress condition (ψ = −0.6 MPa), continuous lines indicate the control (ψ =0 MPa).
Results of the comparison tests for survival functions between stress treatments within each genotype (left side of the table) and between genotypes for each treatment (right side of the table). The asterisk on p-values indicates a statistical significance (α < 0.05). z-values are standardized. G stands for genotype, T for treatment.
|
|
|
|
| |||||||
|
| Test | χ2 | df |
| χ2 | df | ||||
| M-H Log-rank | 50.012 | 1 | <0.0001 | ±7.072 | 4.432 | 1 | 0.0353 * | ±2.105 | ||
| Gehan-Wilcoxon | 38.361 | 1 | <0.0001 | ±6.194 | 2.588 | 1 | 0.1077 | ±1.609 | ||
| Tarone-Ware | 43.982 | 1 | <0.0001 | ±6.632 | 3.359 | 1 | 0.0668 | ±1.833 | ||
| Peto-Peto | 38.428 | 1 | <0.0001 | ±6.199 | 2.354 | 1 | 0.1249 | ±1.534 | ||
| Modified Peto-Peto | 38.380 | 1 | <0.0001 | ±6.195 | 2.348 | 1 | 0.1254 | ±1.532 | ||
| Fleming-Harrington | 48.561 | 1 | <0.0001 | ±6.969 | 5.602 | 1 | 0.0179 * | ±2.367 | ||
|
| M-H Log-rank | 43.580 | 1 | <0.0001 | ±6.602 |
| 2.972 | 1 | 0.0847 | ±1.724 |
| Gehan-Wilcoxon | 36.546 | 1 | <0.0001 | ±6.045 | 2.588 | 1 | 0.1077 | ±1.609 | ||
| Tarone-Ware | 40.225 | 1 | <0.0001 | ±6.342 | 2.799 | 1 | 0.0943 | ±1.673 | ||
| Peto-Peto | 36.321 | 1 | <0.0001 | ±6.027 | 2.452 | 1 | 0.1174 | ±1.566 | ||
| Modified Peto-Peto | 36.292 | 1 | <0.0001 | ±6.024 | 2.451 | 1 | 0.1175 | ±1.565 | ||
| Fleming-Harrington | 39.032 | 1 | <0.0001 | ±6.248 | 2.741 | 1 | 0.0978 | ±1.656 | ||
Figure 2Kaplan-Meier estimates of survival functions of control (A; ψ = 0 MPa) and stressed (B; ψ = −0.6 MPa) samples for genotypes Sh (continuous curve) and Hy (dotted curve).
Summary table of the Cox’s PH model for germination data. The z-value of the Wald test tests the hypothesis that βi = 0 against the alternative βi ≠ 0. p-value represents the probability of obtaining a z-value larger in absolute value than the one obtained. The asterisk on p-values indicates a statistical significance (α < 0.05).
| Explanatory Variable |
| SE of | Exp ( | Wald | Exp ( | |
|---|---|---|---|---|---|---|
| Genotype (Hy) | −0.2344 | 0.087 | 0.791 | −2.6875 | 0.0072 * | 0.6667–0.9385 |
| Treatment (stress) | −0.8685 | 0.090 | 0.419 | −9.6221 | <0.00001 | 0.3515–0.5008 |
Figure 3Log-normal distribution probability plots for Sh and Hy genotypes. On the horizontal axis, the expected quantile of the theoretical distribution, Φ−1(1 − e−H(t)), where H (t) is the cumulative hazard rate and Φ is the standard normal distribution function; on the vertical axis, the natural logarithm of the time value. For tied data, only one point is shown for each set of ties.
Shape (s) and scale (m) parameters of the log-normal and log-logistic distributions as estimated by the Maximum Likelihood Estimates (MLE). SE represents the standard error.
|
| ||||||
|
|
|
|
|
|
|
|
|
| 0 | 1.311 | 0.038 | 0.538 | 0.029 | 747.08 |
| −0.6 | 1.550 | 0.092 | 1.109 | 0.084 | 618.34 | |
|
| 0 | 1.425 | 0.044 | 0.601 | 0.036 | 740.10 |
| −0.6 | 1.835 | 0.114 | 1.242 | 0.105 | 565.32 | |
|
| ||||||
|
| 0 | 1.315 | 0.037 | 0.304 | 0.019 | 742.94 |
| −0.6 | 1.497 | 0.095 | 0.709 | 0.058 | 620.46 | |
|
| 0 | 1.394 | 0.043 | 0.355 | 0.024 | 738.74 |
| −0.6 | 1.792 | 0.117 | 0.800 | 0.074 | 567.14 | |
Summary table of the AFT model (log-normal and log-logistic distribution). z-values test the hypothesis that the parameter value is zero. p-values test the significance of the corresponding parameter. The asterisks on p-values indicate a statistical significance (α < 0.05).
|
| |||||||
|
|
|
|
|
|
|
|
|
| Intercept | 1.3185 (α0) | 0.0403 | 32.68 | <0.0001 | 1.24–1.40 | 3.7381 | 1 |
| Treatment | 0.4520 (α1) | 0.0482 | 9.36 | <0.0001 | 0.35–0.54 | 5.8747 | 1.5715 |
| Genotype | 0.1151 (α2) | 0.0481 | 2.39 | 0.0167 * | 0.02–0.20 | 4.1944 | 1.1220 |
|
| |||||||
|
|
|
|
|
|
|
|
|
| Intercept | 1.3086 (α0) | 0.0397 | 32.92 | <0.0001 | 1.23–1.38 | 3.7013 | 1 |
| Treatment | 0.4554 (α1) | 0.0487 | 9.34 | <0.0001 | 0.36–0.55 | 5.8364 | 1.5768 |
| Genotype | 0.1184 (α2) | 0.0493 | 2.40 | 0.0163* | 0.02–0.21 | 4.1666 | 1.1257 |