| Literature DB >> 24984933 |
Agustín Estrada-Peña1, Adrián Estrada-Sánchez, José de la Fuente.
Abstract
BACKGROUND: Correlative modelling combines observations of species occurrence with environmental variables to capture the niche of organisms. It has been argued for the use of predictors that are ecologically relevant to the target species, instead of the automatic selection of variables. Without such biological background, the forced inclusion of numerous variables can produce models that are highly inflated and biologically irrelevant. The tendency in correlative modelling is to use environmental variables that are interpolated from climate stations, or monthly estimates of remotely sensed features.Entities:
Mesh:
Year: 2014 PMID: 24984933 PMCID: PMC4089935 DOI: 10.1186/1756-3305-7-302
Source DB: PubMed Journal: Parasit Vectors ISSN: 1756-3305 Impact factor: 3.876
Figure 1The background of harmonic regression. Panels A, B, and C show how changes in the seven coefficients of a harmonic regression (namely A1 to A7) can be used to reconstruct the mean values of a variable and the peak moment of the year can be modelled. In A, the pattern is obtained leaving A1 = 20, A3 = −15, A4 = 2.357, A5 = −0.12, A6 = −0.094, and A7 = −0.237. The value of A2 was varied between −10 and 10 at constant intervals to produce the pattern observed in the series 1–8. In B, values were left constant for A1 (20) A3 (−10) and A4 to A7 (−0.12), while the value of A3 was varied between −15 and −1, at constant intervals to produce the pattern reproduced. It is observed that changes in A2 and A3 account for the seasonality of the complete year, showing the peak of a variable in both its value and moment of the year. In C, A4 was varied between −15 and 15 at constant intervals leaving the other coefficients with fixed values, namely A1 = 20, A2 = −10, A3 = −15, A5 to A7 = −0.12. Charts in A to C show simulated temperature values. Actual data for temperature were obtained from five sites in either the northern or southern hemisphere (D) and then subjected to a harmonic regression (E), which was fitted with the parameters and the equation included in E. Capital letters in the equation refer to the rows in the table for each of the five sites simulated.
Figure 2The reported distribution of 9,534 records of ticks of the subgenus . Only records with a pair of coordinates were included in the map and considered for further computations. Records from Asia lack such reliable georeferencing and were not included.
Collinearity among the coefficients of the harmonic regression of T, NDVI, and LAI
| T2 | 1.03 | | | | | | | | | | | | | | | | | | | |
| T3 | 1.85 | 1.20 | | | | | | | | | | | | | | | | | | |
| T4 | 1.00 | 1.03 | 1.00 | | | | | | | | | | | | | | | | | |
| T5 | 1.37 | 1.02 | 1.01 | 1.07 | | | | | | | | | | | | | | | | |
| T6 | 1.02 | 1.00 | 1.00 | 1.00 | 1.02 | | | | | | | | | | | | | | | |
| T7 | 1.37 | 1.02 | 1.26 | 1.02 | 1.07 | 1.00 | | | | | | | | | | | | | | |
| NDVI1 | 1.01 | 1.00 | 1.17 | 1.00 | 1.00 | 1.00 | 1.02 | | | | | | | | | | | | | |
| NDVI2 | 1.05 | 1.29 | 1.02 | 1.04 | 1.00 | 1.00 | 1.01 | 1.00 | | | | | | | | | | | | |
| NDVI3 | 3.00 | 1.15 | 1.85 | 1.04 | 1.08 | 1.02 | 1.33 | 1.01 | 1.16 | | | | | | | | | | | |
| NDVI4 | 1.00 | 1.00 | 1.01 | 1.03 | 1.04 | 1.07 | 1.01 | 1.06 | 1.04 | 1.01 | | | | | | | | | | |
| NDVI5 | 1.59 | 1.01 | 1.20 | 1.01 | 1.32 | 1.01 | 1.28 | 1.11 | 1.02 | 1.48 | 1.19 | | | | | | | | | |
| NDVI6 | 1.28 | 1.01 | 1.11 | 1.01 | 1.04 | 1.01 | 1.22 | 1.00 | 1.03 | 1.40 | 1.02 | 1.09 | | | | | | | | |
| NDVI7 | 1.05 | 1.00 | 1.02 | 1.00 | 1.13 | 1.00 | 1.00 | 1.07 | 1.00 | 1.01 | 1.22 | 1.27 | 1.04 | | | | | | | |
| LAI1 | 1.00 | 1.00 | 1.07 | 1.01 | 1.23 | 1.00 | 1.01 | 3.18 | 1.00 | 1.02 | 1.00 | 1.01 | 1.00 | 1.01 | | | | | | |
| LAI2 | 1.19 | 1.07 | 1.16 | 1.00 | 1.00 | 1.00 | 1.17 | 1.01 | 1.45 | 1.08 | 1.06 | 1.01 | 1.10 | 1.01 | 1.03 | | | | | |
| LAI3 | 1.43 | 1.22 | 1.22 | 1.01 | 1.02 | 1.01 | 1.00 | 1.01 | 1.19 | 2.28 | 1.07 | 1.03 | 1.07 | 1.00 | 1.00 | 1.00 | | | | |
| LAI4 | 1.40 | 1.01 | 1.16 | 1.05 | 1.02 | 1.00 | 1.17 | 1.00 | 1.01 | 1.26 | 1.18 | 1.01 | 1.30 | 1.03 | 1.00 | 1.61 | 1.11 | | | |
| LAI5 | 1.21 | 1.02 | 1.10 | 1.00 | 1.06 | 1.00 | 1.02 | 1.00 | 1.03 | 1.29 | 1.00 | 1.36 | 1.03 | 1.02 | 1.01 | 1.00 | 1.45 | 1.01 | | |
| LAI6 | 1.11 | 1.01 | 1.02 | 1.01 | 1.01 | 1.01 | 1.05 | 1.00 | 1.00 | 1.07 | 1.00 | 1.03 | 1.40 | 1.00 | 1.00 | 1.05 | 1.07 | 1.36 | 1.06 | |
| LAI7 | 1.00 | 1.00 | 1.00 | 1.00 | 1.00 | 1.01 | 1.02 | 1.00 | 1.00 | 1.00 | 1.01 | 1.04 | 1.00 | 1.13 | 1.00 | 1.02 | 1.00 | 1.03 | 1.08 | 1.00 |
Collinearity was calculated as the variance inflation factor. Values lower than 10 are indicative of low collinearity and could be used together in models of the environmental niche. The number after the letters of the variables indicates the ordinal coefficient in the harmonic regression of the variable.
Collinearity among the monthly values of temperature
| Feb | 58.43 | | | | | | | | | | |
| Mar | 9.59 | 20.80 | | | | | | | | | |
| Apr | 3.77 | 5.51 | 18.89 | | | | | | | | |
| May | 2.18 | 2.75 | 5.32 | 19.04 | | | | | | | |
| Jun | 1.59 | 1.85 | 2.80 | 5.58 | 20.08 | | | | | | |
| Jul | 1.63 | 1.89 | 2.82 | 5.42 | 16.42 | 142.31 | | | | | |
| Aug | 1.66 | 1.93 | 2.90 | 5.60 | 16.29 | 69.95 | 229.85 | | | | |
| Sep | 2.55 | 3.21 | 6.14 | 17.01 | 30.26 | 12.84 | 14.47 | 18.06 | | | |
| Oct | 4.58 | 6.62 | 18.71 | 32.47 | 10.29 | 4.50 | 4.67 | 5.10 | 19.31 | | |
| Nov | 16.60 | 31.02 | 33.66 | 8.83 | 3.73 | 2.28 | 2.34 | 2.43 | 4.82 | 14.85 | |
| Dec | 213.04 | 56.23 | 10.83 | 4.13 | 2.32 | 1.66 | 1.70 | 1.74 | 2.75 | 5.25 | 23.96 |
Eight-day-interval MODIS-derived series of values were converted to monthly composites and collinearity calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity.
Collinearity among the monthly values of the normalised difference vegetation index
| Feb | 35.82 | | | | | | | | | | |
| Mar | 15.40 | 31.65 | | | | | | | | | |
| Apr | 5.09 | 6.05 | 10.50 | | | | | | | | |
| May | 1.74 | 2.04 | 2.31 | 4.27 | | | | | | | |
| Jun | 1.13 | 1.23 | 1.27 | 1.53 | 3.75 | | | | | | |
| Jul | 1.06 | 1.10 | 1.13 | 1.25 | 2.09 | 11.21 | | | | | |
| Aug | 1.11 | 1.16 | 1.18 | 1.32 | 2.21 | 7.43 | 24.85 | | | | |
| Sep | 1.59 | 1.75 | 1.81 | 2.33 | 4.36 | 3.75 | 3.06 | 4.55 | | | |
| Oct | 2.54 | 2.66 | 2.80 | 4.26 | 4.67 | 1.97 | 1.57 | 1.88 | 7.74 | | |
| Nov | 6.11 | 4.97 | 5.50 | 7.58 | 2.87 | 1.37 | 1.20 | 1.33 | 2.87 | 8.89 | |
| Dec | 31.42 | 12.00 | 9.80 | 4.95 | 1.79 | 1.22 | 1.14 | 1.20 | 1.75 | 2.79 | 9.14 |
Eight-day-interval MODIS-derived series of values were converted to monthly composites and collinearity calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity.
Collinearity among the monthly values of the leaf area index
| Feb | 3.21 | | | | | | | | | | |
| Mar | 2.14 | 7.88 | | | | | | | | | |
| Apr | 2.23 | 5.26 | 9.48 | | | | | | | | |
| May | 1.89 | 3.23 | 4.38 | 6.35 | | | | | | | |
| Jun | 1.50 | 2.86 | 3.68 | 4.91 | 18.29 | | | | | | |
| Jul | 1.21 | 2.84 | 3.64 | 4.82 | 16.73 | 54.59 | | | | | |
| Aug | 1.56 | 2.89 | 3.72 | 4.94 | 17.06 | 43.16 | 62.82 | | | | |
| Sep | 1.98 | 3.00 | 3.91 | 5.29 | 21.09 | 37.14 | 38.85 | 50.07 | | | |
| Oct | 2.01 | 5.54 | 7.93 | 9.71 | 6.13 | 4.85 | 4.77 | 4.91 | 5.27 | | |
| Nov | 2.92 | 6.72 | 6.36 | 4.78 | 3.08 | 2.75 | 2.73 | 2.78 | 2.88 | 5.08 | |
| Dec | 3.05 | 4.17 | 3.21 | 2.71 | 2.14 | 2.00 | 1.99 | 2.01 | 2.06 | 2.74 | 4.76 |
Eight-day-interval MODIS-derived series of values were converted to monthly composites and collinearity calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity.
Collinearity among the monthly values of temperature obtained by interpolated data (Worldclim)
| Feb | 184.56 | | | | | | | | | | |
| Mar | 24.31 | 50.92 | | | | | | | | | |
| Apr | 7.77 | 10.89 | 32.00 | | | | | | | | |
| May | 3.61 | 4.34 | 7.17 | 21.47 | | | | | | | |
| Jun | 1.91 | 2.11 | 2.75 | 4.52 | 12.87 | | | | | | |
| Jul | 1.50 | 1.61 | 1.96 | 2.77 | 5.24 | 27.31 | | | | | |
| Aug | 1.88 | 2.06 | 2.64 | 4.07 | 8.91 | 37.03 | 38.90 | | | | |
| Sep | 3.63 | 4.27 | 6.59 | 13.49 | 27.88 | 10.75 | 5.80 | 12.66 | | | |
| Oct | 10.07 | 13.78 | 29.90 | 40.82 | 12.22 | 3.91 | 2.61 | 3.97 | 15.91 | | |
| Nov | 41.60 | 58.81 | 45.65 | 14.48 | 5.40 | 2.42 | 1.81 | 2.43 | 5.69 | 27.95 | |
| Dec | 329.62 | 152.40 | 27.89 | 8.74 | 3.89 | 2.00 | 1.56 | 1.98 | 3.99 | 12.30 | 73.92 |
Collinearity was calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity.
Collinearity among the monthly values of rainfall obtained by interpolated data (Worldclim)
| Feb | 24.41 | | | | | | | | | | |
| Mar | 6.82 | 12.22 | | | | | | | | | |
| Apr | 2.12 | 2.54 | 4.86 | | | | | | | | |
| May | 1.23 | 1.29 | 1.57 | 3.33 | | | | | | | |
| Jun | 1.03 | 1.04 | 1.10 | 1.40 | 3.58 | | | | | | |
| Jul | 1.00 | 1.00 | 1.02 | 1.14 | 1.76 | 5.42 | | | | | |
| Aug | 1.00 | 1.00 | 1.02 | 1.13 | 1.72 | 4.14 | 13.64 | | | | |
| Sep | 1.05 | 1.05 | 1.11 | 1.35 | 2.23 | 3.55 | 3.34 | 5.06 | | | |
| Oct | 1.38 | 1.37 | 1.55 | 2.04 | 2.33 | 1.73 | 1.40 | 1.53 | 3.09 | | |
| Nov | 2.70 | 2.40 | 2.65 | 2.47 | 1.59 | 1.16 | 1.06 | 1.08 | 1.34 | 3.55 | |
| Dec | 10.33 | 6.13 | 4.56 | 2.24 | 1.30 | 1.05 | 1.01 | 1.01 | 1.10 | 1.73 | 5.77 |
Collinearity was calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity.
Collinearity among the “bioclim” variables derived from interpolated data
| Bio2 | 1.34 | | | | | | | | | | | | | | | | | |
| Bio3 | 3.40 | 1.16 | | | | | | | | | | | | | | | | |
| Bio4 | 3.29 | 1.04 | 4.79 | | | | | | | | | | | | | | | |
| Bio5 | 5.05 | 1.96 | 1.62 | 1.36 | | | | | | | | | | | | | | |
| Bio6 | 15.65 | 1.13 | 4.72 | 8.17 | 2.40 | | | | | | | | | | | | | |
| Bio7 | 2.15 | 1.00 | 3.10 | 17.21 | 1.15 | 4.31 | | | | | | | | | | | | |
| Bio8 | 2.98 | 1.38 | 1.71 | 1.35 | 3.54 | 2.01 | 1.18 | | | | | | | | | | | |
| Bio9 | 8.46 | 1.22 | 2.90 | 3.91 | 2.68 | 10.14 | 2.58 | 1.61 | | | | | | | | | | |
| Bio10 | 8.08 | 1.59 | 1.80 | 1.53 | 41.52 | 3.15 | 1.27 | 4.04 | 3.27 | | | | | | | | | |
| Bio11 | 25.25 | 1.20 | 4.84 | 7.05 | 2.72 | 123.00 | 3.53 | 2.18 | 10.45 | 3.60 | | | | | | | | |
| Bio12 | 1.18 | 1.07 | 1.48 | 1.46 | 1.02 | 1.33 | 1.62 | 1.08 | 1.18 | 1.05 | 1.28 | | | | | | | |
| Bio13 | 1.28 | 1.01 | 1.52 | 1.48 | 1.06 | 1.40 | 1.54 | 1.17 | 1.22 | 1.11 | 1.37 | 5.14 | | | | | | |
| Bio14 | 1.01 | 1.16 | 1.06 | 1.07 | 1.02 | 1.04 | 1.15 | 1.00 | 1.02 | 1.00 | 1.02 | 2.06 | 1.20 | | | | | |
| Bio15 | 1.15 | 1.35 | 1.08 | 1.03 | 1.22 | 1.07 | 1.00 | 1.25 | 1.07 | 1.20 | 1.11 | 1.03 | 1.02 | 1.36 | | | | |
| Bio16 | 1.26 | 1.02 | 1.52 | 1.49 | 1.05 | 1.39 | 1.56 | 1.15 | 1.21 | 1.10 | 1.36 | 6.72 | 69.34 | 1.24 | 1.01 | | | |
| Bio17 | 1.01 | 1.16 | 1.08 | 1.09 | 1.01 | 1.05 | 1.18 | 1.00 | 1.02 | 1.00 | 1.03 | 2.29 | 1.24 | 84.01 | 1.36 | 1.29 | | |
| Bio18 | 1.06 | 1.04 | 1.16 | 1.14 | 1.00 | 1.10 | 1.19 | 1.08 | 1.03 | 1.01 | 1.09 | 2.80 | 2.24 | 1.48 | 1.01 | 2.39 | 1.54 | |
| Bo19 | 1.07 | 1.06 | 1.24 | 1.21 | 1.00 | 1.16 | 1.30 | 1.01 | 1.11 | 1.02 | 1.12 | 2.36 | 1.54 | 1.85 | 1.08 | 1.61 | 1.98 | 1.18 |
Collinearity was calculated as the variance inflation factor. Values higher than 10 are indicative of high collinearity. The names Bio1 to Bio19 are the names defined by the Worldclim dataset, namely annual mean temp., mean diurnal range, isothermality, temp. seasonality, max temp. of warmest month, min. temp. of coldest month, temp. annual range, mean temp. of wettest quarter, mean temp. of driest quarter, mean temp. of warmest quarter, mean temp. of coldest quarter, annual precipitation, precipitation of wettest month, precipitation of driest month, precipitation seasonality, precipitation of wettest quarter, precipitation of driest quarter, precipitation of warmest quarter, precipitation of coldest quarter.
Percent of correctly discriminated species of the subgenus , using the sets of descriptive covariates
| 1. Discriminant analysis with 7 coefficients of LST, 7 coefficients of NDVI, and 5 coefficients of LAI. Correct determinations: 82.4% | ||||||
| 0.955 | 69.94 | 0.00 | 2.76 | 25.15 | 2.15 | |
| 0.977 | 0.00 | 93.39 | 1.17 | 0.00 | 5.43 | |
| 0.905 | 2.21 | 0.18 | 79.85 | 9.26 | 8.50 | |
| 0.986 | 1.41 | 0.00 | 1.41 | 97.18 | 0.00 | |
| 0.924 | 2.53 | 1.86 | 16.39 | 1.31 | 77.91 | |
| 2. Discriminant analysis with 7 coefficients of LST, and 7 coefficients of NDVI. Correct determinations: 72.9% | ||||||
| 0.959 | 46.79 | 0.00 | 3.21 | 48.08 | 1.92 | |
| 0.995 | 0.02 | 94.45 | 1.44 | 0.00 | 4.08 | |
| 0.922 | 4.94 | 0.09 | 73.84 | 8.94 | 12.19 | |
| 0.989 | 3.47 | 0.00 | 1.39 | 93.75 | 1.39 | |
| 0.946 | 5.36 | 1.33 | 12.06 | 0.73 | 80.52 | |
| 3. Discriminant analysis with 12 months of remotely sensed LST and NDVI. Correct determinations: 62.3% | ||||||
| 0.931 | 32.69 | 1.28 | 4.49 | 57.05 | 4.49 | |
| 0.991 | 0.20 | 96.83 | 1.22 | 0.00 | 1.75 | |
| 0.889 | 4.27 | 1.91 | 69.93 | 10.77 | 13.12 | |
| 0.979 | 6.25 | 0.69 | 0.00 | 92.36 | 0.69 | |
| 0.919 | 4.68 | 5.93 | 16.69 | 2.62 | 70.08 | |
| 4. Discriminant analysis with monthly remotely sensed LST and NDVI, after removal of months with high collinearity. Only values for January, March, May and October were included for LST. Data for February, March and July were removed from NDVI. Correct determinations: 56.7% | ||||||
| 0.912 | 34.62 | 1.28 | 0.64 | 57.05 | 6.41 | |
| 0.947 | 0.49 | 90.83 | 3.20 | 0.00 | 5.48 | |
| 0.761 | 5.12 | 12.06 | 52.89 | 14.10 | 15.84 | |
| 0.971 | 9.72 | 1.39 | 0.69 | 88.19 | 0.00 | |
| 0.826 | 8.23 | 15.12 | 18.87 | 2.46 | 55.32 | |
| 5. Discriminant analysis with 12 months of gridded interpolated temperature and rainfall (Worldclim dataset). Correct determinations: 69.7% | ||||||
| 0.872 | 42.31 | 1.92 | 3.85 | 44.23 | 7.69 | |
| 0.996 | 0.00 | 99.67 | 0.13 | 0.00 | 0.20 | |
| 0.923 | 4.23 | 1.33 | 78.29 | 7.96 | 8.19 | |
| 0.985 | 5.56 | 0.69 | 2.78 | 85.42 | 5.56 | |
| 0.949 | 2.66 | 7.78 | 14.88 | 0.81 | 73.87 | |
| 6. Discriminant analysis with monthly gridded interpolated temperature and rainfall (Worldclim dataset) after removal of the months with high collinearity. Only data of temperatures of January, April, June and September were included. Data of rainfall of February, August and December were removed. Correct determinations: 65.1% | ||||||
| 0.889 | 44.23 | 4.49 | 0.64 | 45.51 | 5.13 | |
| 0.982 | 0.00 | 97.63 | 2.35 | 0.00 | 0.02 | |
| 0.851 | 8.41 | 3.65 | 72.15 | 10.63 | 5.16 | |
| 0.965 | 15.97 | 0.00 | 0.00 | 76.39 | 7.64 | |
| 0.879 | 6.45 | 16.57 | 14.88 | 3.06 | 59.03 | |
| 7. Discriminant analysis with “bioclim variables” derived from monthly gridded interpolated temperature and rainfall (Worldclim dataset). Correct determinations: 57.9% | ||||||
| 0.941 | 28.21 | 1.92 | 8.97 | 46.15 | 14.74 | |
| 0.990 | 0.00 | 97.45 | 0.00 | 0.00 | 2.55 | |
| 0.876 | 3.91 | 2.58 | 74.64 | 9.34 | 9.52 | |
| 0.968 | 1.39 | 2.08 | 3.47 | 83.33 | 9.72 | |
| 0.901 | 1.49 | 11.09 | 19.60 | 1.98 | 65.85 | |
| 8. Discriminant analysis with “bioclim variables” derived from monthly gridded interpolated temperature and rainfall (Worldclim dataset) after removal of variables with high collinearity. Correct determinations: 57.4% | ||||||
| 0.930 | 29.49 | 1.92 | 7.69 | 53.21 | 7.69 | |
| 0.991 | 0.00 | 95.12 | 0.00 | 0.09 | 4.79 | |
| 0.890 | 3.51 | 2.49 | 73.84 | 10.85 | 9.30 | |
| 0.979 | 1.39 | 0.69 | 1.39 | 86.81 | 9.72 | |
| 0.920 | 2.30 | 15.52 | 19.68 | 5.85 | 56.65 | |
For some of these datasets of descriptive covariates, the analysis was repeated with every variable included (e.g., the 12 months of average temperature) and after the highly correlated variables were removed. A discriminant analysis was conducted, and its reliability evaluated by the percent of records correctly predicted and the area under the curve (AUC). The AUC is a general measure of model performance and does not consider individual results of true positives for each species. Therefore, some models may perform better for a particular species while having a general low AUC. The percent of correctly determined records of each species is also included.