| Literature DB >> 26487352 |
Abstract
Corylus, Alnus, and Betula trees are among the most important sources of allergic pollen in the temperate zone of the Northern Hemisphere and have a large impact on the quality of life and productivity of allergy sufferers. Therefore, it is important to predict high pollen concentrations, both in time and space. The aim of this study was to create and evaluate spatiotemporal models for predicting high Corylus, Alnus, and Betula pollen concentration levels, based on gridded meteorological data. Aerobiological monitoring was carried out in 11 cities in Poland and gathered, depending on the site, between 2 and 16 years of measurements. According to the first allergy symptoms during exposure, a high pollen count level was established for each taxon. An optimizing probability threshold technique was used for mitigation of the problem of imbalance in the pollen concentration levels. For each taxon, the model was built using a random forest method. The study revealed the possibility of moderately reliable prediction of Corylus and highly reliable prediction of Alnus and Betula high pollen concentration levels, using preprocessed gridded meteorological data. Cumulative growing degree days and potential evaporation proved to be two of the most important predictor variables in the models. The final models predicted not only for single locations but also for continuous areas. Furthermore, the proposed modeling framework could be used to predict high pollen concentrations of Corylus, Alnus, Betula, and other taxa, and in other countries.Entities:
Keywords: Allergenic pollen; Betulaceae; Machine learning; Predictive modeling; Random forest; Spatiotemporal models
Mesh:
Substances:
Year: 2015 PMID: 26487352 PMCID: PMC4879172 DOI: 10.1007/s00484-015-1077-8
Source DB: PubMed Journal: Int J Biometeorol ISSN: 0020-7128 Impact factor: 3.787
Fig. 1A flowchart of the processes for the predictive mapping of the pollen concentration levels
Population and area of the cities with the aerobiological monitoring sites; longitude, latitude, altitude, and the studied years of the aerobiological monitoring sites
| Site | Population (in thousands) | Area ( |
|
| Altitude (a.s.l.) | Studied years |
|---|---|---|---|---|---|---|
| Bydgoszcz | 361 | 176 | 18.13 | 53.14 | 51 | 2009–2011 |
| Gdańsk | 460 | 262 | 18.61 | 54.39 | 12 | 1998–2005, 2009–2011 |
| Kraków | 758 | 327 | 19.96 | 50.06 | 207 | 1998–2005, 2009–2011 |
| Łódź | 719 | 293 | 19.47 | 51.77 | 216 | 2003–2005, 2009–2011 |
| Lublin | 348 | 147 | 22.54 | 51.24 | 194 | 2001–2005, 2009–2011 |
| Olsztyn | 175 | 88 | 20.49 | 53.78 | 132 | 2009–2011 |
| Poznań | 551 | 262 | 16.92 | 52.47 | 93 | 1996–2011 |
| Rzeszów | 182 | 116 | 22.02 | 50.03 | 201 | 1997–2005, 2009–2011 |
| Siedlce | 76 | 32 | 22.31 | 52.18 | 147 | 2010–2011 |
| Sosnowiec | 214 | 91 | 19.14 | 50.30 | 253 | 2001–2011 |
| Szczecin | 409 | 301 | 14.55 | 53.44 | 28 | 2002–2011 |
Fig. 2Pollen count of Corylus, Alnus, and Betula for all of the analyzed sites and all years, on a logarithmic scale. Vertical lines indicate the temporal scope of analysis for each taxon. Horizontal lines separate the two pollen concentration levels of low and high: 35 grains/m 3 for Corylus, 45 grains/m 3 for Alnus, and 20 grains/m 3 for Betula
Explanation of the predictor variable abbreviations used in spatiotemporal modeling of Corylus, Alnus, and Betula pollen concentration levels
| Abbreviation | Predictor variable name | Unit |
|---|---|---|
| TAVG_JANUARY_PREVYEAR | Average monthly temperature for January in the preceding year | ∘C |
| TAVG_FEBRUARY_PREVYEAR | Average monthly temperature for February in the preceding year | ∘C |
| TAVG_MARCH_PREVYEAR | Average monthly temperature for March in the preceding year | ∘C |
| TAVG_APRIL_PREVYEAR | Average monthly temperature for April in the preceding year | ∘C |
| TAVG_MAY_PREVYEAR | Average monthly temperature for May in the preceding year | ∘C |
| TAVG_JUNE_PREVYEAR | Average monthly temperature for June in the preceding year | ∘C |
| TAVG_JULY_PREVYEAR | Average monthly temperature for July in the preceding year | ∘C |
| TAVG_AUGUST_PREVYEAR | Average monthly temperature for August in the preceding year | ∘C |
| TAVG_SEPTEMBER_PREVYEAR | Average monthly temperature for September in the preceding year | ∘C |
| TAVG_OCTOBER_PREVYEAR | Average monthly temperature for October in the preceding year | ∘C |
| TAVG_NOVEMBER_PREVYEAR | Average monthly temperature for November in the preceding year | ∘C |
| TAVG_DECEMBER_PREVYEAR | Average monthly temperature for December in the preceding year | ∘C |
| TMAX_4DAYS_AVG_1DAYLAG | Average maximum temperature in preceding 4 days | ∘C |
| TMAX_16DAYS_AVG_1DAYLAG | Average maximum temperature in preceding 16 days | ∘C |
| TMIN_4DAYS_AVG_1DAYLAG | Average minimum temperature in preceding 4 days | ∘C |
| TMIN_16DAYS_AVG_1DAYLAG | Average minimum temperature in preceding 16 days | ∘C |
| VAPORPRESSURE_4DAYS_AVG_1DAYLAG | Average vapor pressure in preceding 4 days | hPa |
| VAPORPRESSURE_16DAYS_AVG_1DAYLAG | Average vapor pressure in preceding 16 days | hPa |
| WINDSPEED_4DAYS_AVG_1DAYLAG | Average wind speed in preceding 4 days | m/s |
| WINDSPEED_16DAYS_AVG_1DAYLAG | Average wind speed in preceding 16 days | m/s |
| PRECIPITATION_4DAYS_AVG_1DAYLAG | Average daily precipitation in the preceiding 4 days | mm |
| PRECIPITATION_16DAYS_AVG_1DAYLAG | Average daily precipitation in the preceiding 16 days | mm |
| EVAPORATION_4DAYS_AVG_1DAYLAG | Average potential evaporation in the preceding 4 days | mm/day |
| EVAPORATION_16DAYS_AVG_1DAYLAG | Average potential evaporation in the preceding 16 days | mm/day |
| RADIATION_4DAYS_AVG_1DAYLAG | Average total global radiation in the preceding 4 days | KJ/m 2/day |
| RADIATION_16DAYS_AVG_1DAYLAG | Average total global radiation in the preceding 16 days | KJ/m 2/day |
| GDD_1DAYLAG | Cummulated growing degree days (GDD) lagged by one day | GDD |
| LONGITUDE | Grid cell longitude | degrees |
| LATITUDE | Grid cell latitude | degrees |
| ALTITUDE | Average altitude of grid cell | m a.s.l. |
Fig. 3Resampled values of sensitivity, specificity, positive predictive value, negative predictive value, and the numerical distance between those values for each taxon model
Fig. 4Variable importance of each input variable for Corylus, Alnus, and Betula models. The variables are showed by the mean value of variable importance for all of the taxa in descending order
Fig. 5Relationship between probability of high pollen concentration level and the values of the four most important variables on the training set of the Corylus, Alnus, and Betula models
Fig. 6Relation between observed and predicted days with low and high concentration levels for Corylus, Alnus, and Betula pollen and prediction errors for the first and the second test set
Fig. 7Examples of Corylus, Alnus, and Betula models’ prediction for nine regularly distributed days in the year 2011, based on the data for Poland and the area within 200 km of the Polish border