| Literature DB >> 28320440 |
Falk Huettmann1, Emily Elizabeth Magnuson2, Karsten Hueffer3.
Abstract
BACKGROUND: Rabies is a disease of global significance including in the circumpolar Arctic. In Alaska enzootic rabies persist in northern and western coastal areas. Only sporadic cases have occurred in areas outside of the regions considered enzootic for the virus, such as the interior of the state and urbanized regions.Entities:
Keywords: Alaska; Data mining; Ecologic niche; Predictions; Rabies
Mesh:
Year: 2017 PMID: 28320440 PMCID: PMC5359834 DOI: 10.1186/s13028-017-0285-0
Source DB: PubMed Journal: Acta Vet Scand ISSN: 0044-605X Impact factor: 1.695
Fig. 1Diagnosed rabies cases over time in animals and people. Blue bars represents reported cases according to the annual infectious disease reports (1973–2014) published by the Section of Epidemiology for the State of Alaska. The red bar represents cases used to train our models (Additional file 2) and the green bars represents the cases included in testing our models (Additional file 3)
Settings and explanations of the TreeNet model run
| Metric | Setting | Effect | Justification |
|---|---|---|---|
| Learnrate | AUTO | A detailed but slow model run | Known to provide best results for the algorithm ‘learning’ data |
| Subsample fraction | 50% | Internal testing while model is grown | Standard approach for balanced tree models |
| Logistic residual trim fraction | 0.10 | Fine-tuning | Allows for better fits |
| Huber-M fraction of error squared | 0.90 | Accuracy level | A statistical standard threshold for certainty |
| Optimal logistic model selection | Cross entropy | How to find the optimal model | Usually the best setting for tree-based models |
| Number of trees to build | 1000 | Number of trees tried out for the best solution | This number should widely overshot the known optimum |
| Maximum number of nodes | 6 | Determines the node depth of trees used | This number determines whether a ‘stump’ or a fully fit tree is run |
| Terminal node minimum training cases | 10 | For most data cases it provides a robust tree | Number of cases for each tree branch split |
| Maximum number of most-optimal models to save summary results | 1 | Just 1 most-optimal model is saved | |
| Regression loss criterion | Huber-M (Blend LS and LAD) | A statistical metric to express gain vs cost of a new rule | Standard approach in trees |
Predictors of rabies in Alaska and for assembling the ecological niche
| Predictor | Source | Comment |
|---|---|---|
| Euclidean distance to Alaska coastline | Alaska GAP data | Obtained with ArcGIS tools |
| Euclidean distance to Alaska infrastructure | Alaska GAP data | Obtained with ArcGIS tools |
| Elevation | Alaska GAP data | |
| Monthly mean temperature | Alaska GAP data (taken from SNAP) | |
| Monthly mean precipitation | Alaska (taken from SNAP) |
For public data sources see [43, 44]
Fig. 2Alaska map and location of diagnosed rabies cases used to build models. Cases classified as enzootic is indicated in black and epizootic cases in purple. Settlements and road infrastructure is shown in grey
Fig. 3Alaska map and location of diagnosed rabies cases data to assess model performance. Seventy three locations were used, representing 127 diagnosed cases to assess the models
Fig. 4a Best TreeNet model (pooled data) prediction of rabies in Alaska. Colors show relative index of occurrence (RIO), where red is high RIO, yellow is mid range RIO and green is low RIO; rabies used to build the model are overlaid for overview. Letter indicate regions of special interest in the model output: A Brooks Range, B Eastern Yukon Basin, C Lower Yukon/Yukon Delta, D Middle Yukon. b The same RIO map classified into a presence/absence scheme. Rabies cases used to build the model are indicated in black and purple; (see Fig. 1a) and assessment data in blue) are overlaid for overview
TreeNet variable importance of parameters utilized in best performing model (148 Alaska rabies data locations pooled regardless of outbreak or enzootic locations)
| Variable | Score |
|---|---|
| Distance to infrastructure | 100.00 |
| Elevation | 56.09 |
| Distance to coast | 31.95 |
| Precipitation June | 30.63 |
| Precipitation February | 21.78 |
| Precipitation October | 20.28 |
| Temperature October | 20.27 |
| Precipitation March | 19.22 |
| Precipitation May | 18.64 |
| Temperature April | 18.49 |
| Precipitation August | 18.29 |
| Temperature December | 17.68 |
| Temperature February | 17.56 |
| Precipitation April | 16.89 |
| Precipitation July | 16.77 |
| Precipitation September | 16.47 |
| Precipitation December | 14.99 |
| Precipitation January | 13.40 |
| Temperature August | 13.13 |
| Temperature November | 12.93 |
| Temperature January | 12.50 |
| Temperature May | 11.64 |
| Temperature March | 10.25 |
| Precipitation November | 9.44 |
| Temperature September | 8.89 |
| Temperature June | 8.64 |
| Temperature July | 5.62 |
The variables are listed by importance together with their relative score in informing the model on the likelihood of rabies occurrence
Fig. 5Climate niche predictions of rabies using Treenet. The top panel shows the rabies prediction using the climate niche from 2010 [A1B1 obtained from scenarios network for Alaska + Arctic planning (SNAP)]. The bottom panel depicts the rabies prediction using the climate niche from 2050 (A1B1 obtained from SNAP)