| Literature DB >> 30002990 |
Youhua Chen1, Yongbin Wu2, Tsung-Jen Shen3.
Abstract
Rao's quadratic diversity index is one of the most widely applied diversity indices in functional and phylogenetic ecology. The standard way of computing Rao's quadratic diversity index for an ecological assemblage with a group of species with varying abundances is to sum the functional or phylogenetic distances between a pair of species in the assemblage, weighted by their relative abundances. Here, using both theoretically derived and observed empirical datasets, we show that this standard calculation routine in practical applications will statistically underestimate the true value, and the bias magnitude is derived accordingly. The underestimation will become worse when the studied ecological community contains more species or the pairwise species distance is large. For species abundance data measured using the number of individuals, we suggest calculating the unbiased Rao's quadratic diversity index.Entities:
Keywords: Biodiversity measure; Biometrics; Estimation accuracy; Forest ecology; Functional traits; Phylogenetic ecology
Year: 2018 PMID: 30002990 PMCID: PMC6037161 DOI: 10.7717/peerj.5211
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Measurements of bias, precision and accuracy in evaluating the performance of the bias-corrected Rao’s quadratic diversity index.
The intersection of the vertical and horizontal lines represents the true value, while blue solid dots represent estimated values. Shaded circles in the middle of the targets represent high-accuracy zones when estimated values fall within them.
Given a hypothetical assemblage of three species with relative abundances A = 1/6, B = 1/3 and C = 1/2, four abundance patterns along with the corresponding probabilities are demonstrated when four individuals were randomly sampled from the assemblage.
| Abundance pattern | Probability | Estimator | |||
|---|---|---|---|---|---|
| Empirical bias | Empirical bias | ||||
| (4, 0, 0) | 0.076 | 0 | −0.611 | 0 | −0.611 |
| (3, 1, 0) | 0.364 | 0.5 | −0.111 | 0.375 | −0.236 |
| (2, 2, 0) | 0.227 | 0.667 | 0.056 | 0.5 | −0.111 |
| (2, 1, 1) | 0.333 | 0.833 | 0.222 | 0.625 | 0.014 |
Notes:
For each abundance pattern, the bias magnitude of both estimators was calculated in detail for comparison. The overall statistical bias of the unbiased index is calculated as 0.076 * (−0.611) + 0.364 * (−0.111) + 0.227 * 0.056 + 0.333 * 0.222 = 0, and that of the biased index is computed as 0.076 * (−0.611) + 0.364 * (−0.236) + 0.227 * (−0.111) + 0.333 * 0.014 = −0.153. Moreover, the mean square error (MSE) of the unbiased and the biased indices are computed as 0.076 * (−0.611)2 + 0.364 * (−0.111)2 + 0.227 * 0.0562 + 0.333 * 0.2222 = 0.0499 and 0.076 * (−0.611)2 + 0.364 * (−0.236)2 + 0.227 * (−0.111)2 + 0.333 * 0.0142 = 0.0514. Though we used the root MSE (RMSE) for the two empirical cases, the difference between the RMSE and MSE is that the former is preserved to have the same unit as the estimator while the later is in a square scale of the RMSE.
Figure 2Comparison on the performance of the inbiased index against the biased index with the sample numerical example.
The MSE (or equivalent to the RMSE) was illustrated as an accuracy measure by simultaneously taking the statistical bias and variance (in terms of precision) into consideration. For the biased index, Bias2 = (−0.153)2 = 0.0234; for the unbiased index, Bias2 = (0)2 = 0. Accordingly, the variance (reciprocal of precision) is the difference between MSE and Bias2.
Comparison of estimates of the true Rao’s quadratic diversity using biased and unbiased estimators on the two empirical datasets.
| True | MLE: | Unbiased: | |||||
|---|---|---|---|---|---|---|---|
| Avg | BIAS | RMSE | Avg | BIAS | RMSE | ||
| Plant community in Italy | |||||||
| 30 | 1.4480 | 1.3993 | −0.0487 | 0.0727 | 1.4476 | −0.0004 | 0.0558 |
| 50 | 1.4182 | −0.0298 | 0.0480 | 1.4472 | −0.0008 | 0.0384 | |
| 100 | 1.4342 | −0.0139 | 0.0301 | 1.4487 | 0.0006 | 0.0270 | |
| 200 | 1.4408 | −0.0072 | 0.0198 | 1.4481 | 0.0001 | 0.0185 | |
| 1,000 | 1.4463 | −0.0017 | 0.0078 | 1.4478 | −0.0002 | 0.0077 | |
| 3,000 | 1.4477 | −0.0003 | 0.0046 | 1.4482 | 0.0001 | 0.0046 | |
| 5,000 | 1.4478 | −0.0002 | 0.0035 | 1.4481 | 0.0001 | 0.0035 | |
| BCI plot | |||||||
| 30 | 237.88 | 229.65 | −8.23 | 16.74 | 237.57 | −0.31 | 15.08 |
| 50 | 233.16 | −4.71 | 12.20 | 237.92 | 0.05 | 11.48 | |
| 100 | 235.86 | −2.01 | 8.17 | 238.24 | 0.37 | 8.01 | |
| 200 | 236.58 | −1.29 | 5.76 | 237.77 | −0.10 | 5.64 | |
| 1,000 | 237.56 | −0.32 | 2.55 | 237.80 | −0.08 | 2.53 | |
| 3,000 | 237.78 | −0.09 | 1.51 | 237.86 | −0.01 | 1.51 | |
| 5,000 | 237.82 | −0.05 | 1.11 | 237.87 | −0.00 | 1.11 | |
Notes:
Routine calculation method of the index and the bias-corrected method were computed using Eqs. (1) and (6), respectively. Avg denotes the average of estimates using 2,000 replicates, BIAS represents the magnitude of the bias, and the root mean squared error (RMSE) is used to reflect the estimate accuracy for each considered estimator.