| Literature DB >> 32647115 |
Jane Hall1, Fiona Lo2, Shubhayu Saha3, Ambarish Vaidyanathan4,5, Jeremy Hess6,7,8.
Abstract
Tracking concentrations of regional airborne pollen is valuable for a variety of fields including plant and animal ecology as well as human health. However, current methods for directly measuring regional pollen concentrations are labor-intensive, requiring special equipment and manual counting by professionals leading to sparse data availability in select locations. Here, we use publicly available Google Trends data to evaluate whether searches for the term "pollen" can be used to approximate local observed early-season pollen concentrations as reported by the National Allergy Bureau across 25 U.S. regions from 2012-2017, in the context of site-specific characteristics. Our findings reveal that two major factors impact the ability of internet search data to approximate observed pollen: (1) volume/availability of internet search data, which is tied to local population size and media use; and (2) signal intensity of the seasonal peak in searches. Notably, in regions and years where internet search data was abundant, we found strong correlations between local search patterns and observed pollen, thus revealing a potential source of daily pollen data across the U.S. where observational pollen data are not reliably available.Entities:
Year: 2020 PMID: 32647115 PMCID: PMC7347639 DOI: 10.1038/s41598-020-68095-y
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Correlation between Google Trends searches and National Allergy Bureau pollen concentration data with respect to data quality and pattern. (a) Correlation by quartiles of annual percent of missing Google Trends data. (b) Signal to noise ratio (size of peak relative to smoothing function).
Figure 2Overlay of lightly smoothed, normalized Google Trends search data (blue) and NAB pollen concentration data (orange) for representative station-years. Examples of (a) excellent, (b,c) good to moderate, and (d) poor correlation between GT and NAB data.
Figure 3Difference in days between Google Trends- and NAB-calculated season start dates. Differences in start dates are shown for untransformed, smoothed, and log-transformed smoothed data. As a reference, differences between NAB-calculated start dates those calculated from NAB data for the previous year dates are displayed as well, for (a) All available station-years, (b) additional inclusion criteria of NAB data collection beginning within first month of the year applied, (c) additional inclusion criteria of < 20% missing GT data applied.
Figure 4Summary of findings: covariates related to strength of correlation between regional internet searches and observed pollen, with representative examples. Line graphs show lightly smoothed normalized values for both Google Trends search volumes and daily observed pollen concentrations.