| Literature DB >> 20657734 |
Pedro P Olea1, Patricia Mateo-Tomás, Angel de Frutos.
Abstract
BACKGROUND: Hierarchical partitioning (HP) is an analytical method of multiple regression that identifies the most likely causal factors while alleviating multicollinearity problems. Its use is increasing in ecology and conservation by its usefulness for complementing multiple regression analysis. A public-domain software "hier.part package" has been developed for running HP in R software. Its authors highlight a "minor rounding error" for hierarchies constructed from >9 variables, however potential bias by using this module has not yet been examined. Knowing this bias is pivotal because, for example, the ranking obtained in HP is being used as a criterion for establishing priorities of conservation. METHODOLOGY/PRINCIPALEntities:
Mesh:
Year: 2010 PMID: 20657734 PMCID: PMC2908144 DOI: 10.1371/journal.pone.0011698
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Number of studies using the HP package of Walsh and Mac Nally [ for the R software over time.
The subject categories with most number of studies are shown. Filled circles: number of total studies using HP package (n = 128). Filled squares: “Ecology” subject category. Opened circles: “Biodiversity conservation” subject category. Filled triangles: “Environmental sciences” subject category. Filled diamonds: “Geography physical” subject category. Red filled circles and dashed line: Number of studies using the HP package with more than 9 variables (n = 26). Note that a same study can pertain to more than 1 subject category and thus the sum of the number of studies from all the categories is higher than the number of total of studies.
Percentage of times that a variable changes its position within the ranking.
| Position with other variable orders | ||||||||||||
| Ranking order | 1° | 2° | 3° | 4° | 5° | 6° | 7° | 8° | 9° | 10° | 11° | 12° |
| Data set-1 to Data set-4 | ||||||||||||
| First |
| 6.2 | 1.2 | |||||||||
| Second | 6.0 |
| 7.8 | 4.3 | 1.8 | 3.3 | 0.1 | |||||
| Third | 1.7 | 8.7 |
| 25.9 | 6.0 | 1.1 | 0.6 | 0.1 | 0.2 | 0.1 | 0.3 | |
| Fourth | 3.8 | 24.7 |
| 22.9 | 4.8 | 1.8 | 0.8 | 0.2 | 0.3 | 0.8 | ||
| Fifth | 1.2 | 5.7 | 14.2 |
| 24.5 | 7.9 | 2.3 | 0.4 | 0.6 | 1.4 | 0.3 | |
| Lesser Kestrel Data set | ||||||||||||
| AUTOCOV4 |
| |||||||||||
| FARMLAND |
| 2 | 1 | 2 | ||||||||
| DROOST | 2 |
| 21 | 5 | 2 | |||||||
| FOREST | 21 |
| 29 | 1 | ||||||||
| DCOLONY10 | 3 | 9 | 30 |
| 4 | 1 | 1 | |||||
| Egyptian Vulture Data set | ||||||||||||
| ELEVATION |
| 16 | 3 | |||||||||
| SHRUB | 21 |
| 3 | |||||||||
| PATCH | 6 |
| ||||||||||
| ROAD |
| 10 | 1 | 1 | 1 | |||||||
| LENGTH | 7 |
| 19 | 10 | 13 | 2 | 1 | |||||
Ranking order: Position by amount of independent explanatory power and analysed in the reference order (i.e. alphabetic order). Only the first five variables are shown.
Data set-1 to 4: numerical simulations, N = 1,200.
Lesser Kestrel Data set: N = 100, only eleven variables.
Egyptian Vulture Data set: N = 100.
Ranking of all the models explaining probability of a variable changing its position (%).
| Models | AIC | AICc | ΔAICc | ωm | Ranking |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| DIFPREVIOUS+DIFALL | 100.5 | 100.94 | 5.80 | 0.05 | 3 |
| DIFPREVIOUS+VARIABLES+DIFALL | 106.3 | 106.92 | 11.78 | 0.00 | 4 |
| DIFALL | 112.8 | 113.09 | 17.95 | 0.00 | 5 |
| VARIABLES+DIFALL | 119.3 | 119.74 | 24.60 | 0.00 | 6 |
| Null model | 129.5 | 129.67 | 34.53 | 0.00 | 7 |
| VARIABLES | 133.4 | 133.69 | 38.55 | 0.00 | 8 |
AICc: Value used to rank the models.
ωm: Akaike weight of each model.
The best models (Σωm = 0.95) are shown in bold.
Figure 2Probability of a variable changing ranking position obtained from the averaged general linear mixed model.
Solid lines: Effect of the difference in independent variance explained (IVE) between a particular variable and the previous one in the ranking (established for explaining the response variable) on the probability of changing the position for data sets formed by 10, 11 and 12 explanatory variables in analysis of hierarchical partitioning. Note that no change is expected when the difference in explained variance between a variable and the previous one in the ranking is >17.1. Dotted lines: upper and lower limits according to the standard error of the averaged mixed-model coefficients.
Percentage of independent, joint and total explained variance for each considered variable.
| Data set-1 | Lesser Kestrel Data set | Egyptian Vulture Data set | |||||||||
| Variables | Independent | Joint | Total | Variables | Independent | Joint | Total | Variables | Independent | Joint | Total |
| XA | 4.10 | −3.79 | 0.30 | AUTOCOV4 | 10.63 | 20.85 | 31.48 | COWS | 2.49 | 0.48 | 2.97 |
| XB | 4.35 | −3.70 | 0.64 | BUILDUP | 1.75 | −0.66 | 1.10 | ELEVATION | 8.94 | 18.51 | 27.45 |
| XC | 4.03 | −3.50 | 0.53 | DCOLONY10 | 3.71 | 7.16 | 10.87 | HEIGHT | 1.45 | −1.42 | 0.03 |
| XD | 4.05 | −3.93 | 0.12 | DROOST | 4.82 | 4.66 | 9.48 | LENGTH | 2.74 | 2.03 | 4.77 |
| XE | 5.91 | 0.59 | 6.50 | EDGE | 1.36 | −1.01 | 0.36 | LIVESTOCK | 1.67 | −1.67 | 0.01 |
| XF | 5.15 | −0.48 | 4.67 | EFFORT | 2.37 | 1.65 | 4.02 | NEIGHBOUR | 2.44 | 4.19 | 6.63 |
| XG | 4.96 | 0.40 | 5.37 | FARMLAND | 5.98 | 12.74 | 18.72 | PASTURE | 2.74 | 2.25 | 4.99 |
| XH | 4.68 | −1.12 | 3.56 | FOREST | 4.59 | 9.91 | 14.50 | PATCH | 5.34 | 13.02 | 18.36 |
| XI | 8.63 | 9.26 | 17.88 | GRASSLAND | 1.30 | −0.96 | 0.34 | ROAD | 3.67 | 4.36 | 8.03 |
| XJ | 5.78 | 8.36 | 14.14 | SHANDIVER | 2.68 | 4.76 | 7.44 | SHEEP | 0.37 | −0.30 | 0.07 |
| XK | 9.84 | 9.73 | 19.58 | WIRE | 2.13 | −0.43 | 1.70 | SHRUB | 6.32 | 20.36 | 26.68 |
| XL | 10.35 | 16.36 | 26.71 | SLOPE | 2.09 | −2.07 | 0.02 | ||||
Total: Total explained variance, correlation (in “R2”) of each of the variables with the response variable (for example, XA, XB, XC and XD have a correlation coefficient with the response variable Y, r<0.10, i.e. each of them explain <1% of the variance of the response variable, r2<1%).
Joint: Negative joint variance indicates that the other variables act as suppressors of the particular variable.
Variables: The variable order shown in the tables is the order used (i.e. alphabetic order) for this particular analysis of hierarchical partitioning.
Variables used for explaining the probability of a variable changing its ranking position.
| Variable | Definition |
| DIFNEXT | Difference in independent variance explained between a variable and the next one |
| DIFPREVIOUS | Difference in independent variance explained between a variable and the previous one |
| DIFSECOND | Difference in independent variance explained between a variable and the second-further one |
| DIFTHIRD | Difference in independent variance explained between a variable and the third-further one |
| DIFNEXTPREVIOUS | Sum of the differences in independent variance explained between a variable and the next and previous ones |
| DIFALL | Sum of the differences in independent variance explained between a variable and all the rest |
| VARIABLES | Number of variables used to perform the HP analysis |