Eric Kontowicz, Grant Brown, James Torner, Margaret Carrel, Kelly K Baker, Christine A Petersen.
Abstract
Lyme disease is the most widely reported vector-borne disease in the United States. Ninety-five percent of confirmed human cases are reported in the Northeast and upper Midwest (25,778 of 27,203 total US confirmed cases). Human cases typically occur in the spring and summer months, when an infected nymph Ixodid tick takes a blood meal. Current federal surveillance strategies report data annually, which produces a lag of nearly a year in national data reporting. These lags make it difficult for public health agencies to assess and plan for the current burden of Lyme disease. A nowcasting model, which uses historical data to predict current trends, gives public health agencies a means to evaluate current Lyme disease burden and make timely, priority-based budgeting decisions. The objective of this study was to develop and compare the performance of nowcasting models using free data from Google Trends and Centers for Disease Control and Prevention surveillance reports. We developed two sets of elastic net models for five regions of the United States: (1) using only monthly proportional hit data from the 21 disease symptom and tick-related terms, and (2) using monthly proportional hit data from terms identified via Google Correlate together with the disease symptom and vector terms. Elastic net models using the full-term list were highly accurate (Root Mean Square Error: 0.74, Mean Absolute Error: 0.52, R²: 0.97) for four of the five regions of the United States and improved accuracy 1.33-fold while reducing error 0.5-fold compared to predictions from models using disease symptom and vector terms alone. Many of the terms that were included and found to be important for model performance were environmentally related. These models can be implemented to help local and state public health agencies accurately monitor Lyme disease burden during periods of reporting lag from federal public health reporting agencies.
Year: 2022 PMID: 35271589 PMCID: PMC8912246 DOI: 10.1371/journal.pone.0251165
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
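The abstract describes elastic net models fitted to monthly search-term data. As a rough illustration of that technique only, the sketch below fits an elastic net by cyclic coordinate descent in plain Python. The function names, the glmnet-style parameterisation (`alpha` mixes the L1 and L2 penalties, `lam` sets overall penalty strength), and the toy numbers are all assumptions for illustration; the record does not say which software or tuning values the authors used.

```python
def soft_threshold(z, gamma):
    """Soft-thresholding operator that produces exact zeros (the lasso part)."""
    if z > gamma:
        return z - gamma
    if z < -gamma:
        return z + gamma
    return 0.0


def fit_elastic_net(X, y, alpha=0.5, lam=0.1, n_iter=200):
    """Minimise (1/2n)*||y - b0 - Xb||^2 + lam*(alpha*|b|_1 + (1-alpha)/2*|b|_2^2)
    by cyclic coordinate descent. X is a list of rows. Columns are centred
    internally so the intercept can be recovered at the end."""
    n, p = len(X), len(X[0])
    col_means = [sum(row[j] for row in X) / n for j in range(p)]
    y_mean = sum(y) / n
    Xc = [[row[j] - col_means[j] for j in range(p)] for row in X]
    resid = [v - y_mean for v in y]  # residuals when every coefficient is 0
    beta = [0.0] * p
    for _ in range(n_iter):
        for j in range(p):
            col = [Xc[i][j] for i in range(n)]
            sq = sum(c * c for c in col) / n
            if sq == 0.0:
                continue  # a constant column carries no information
            # Covariance of column j with the partial residual (beta_j removed).
            rho = sum(c * (r + c * beta[j]) for c, r in zip(col, resid)) / n
            new_bj = soft_threshold(rho, lam * alpha) / (sq + lam * (1.0 - alpha))
            delta = new_bj - beta[j]
            if delta != 0.0:
                resid = [r - c * delta for c, r in zip(col, resid)]
                beta[j] = new_bj
    intercept = y_mean - sum(b * m for b, m in zip(beta, col_means))
    return intercept, beta


def predict(intercept, beta, X):
    """Predicted rate for each row of search-term values."""
    return [intercept + sum(b * x for b, x in zip(beta, row)) for row in X]


# Hypothetical toy data: two search-term series (columns) over six months,
# and a made-up monthly rate that depends linearly on them.
toy_terms = [[1, 0], [0, 1], [2, 1], [1, 3], [3, 2], [0, 2]]
toy_rates = [1 + 2 * a + 0.5 * b for a, b in toy_terms]

b0, coefs = fit_elastic_net(toy_terms, toy_rates, alpha=0.5, lam=0.5)
print("intercept:", b0, "coefficients:", coefs)
```

With `lam=0` the fit reduces to ordinary least squares, and a large `lam` with `alpha=1` shrinks every coefficient to exactly zero, which is how an elastic net can drop uninformative search terms.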
Candidate search terms identified via Google Correlate™ by region with symptom/vector terms.
| Term set | Candidate search terms |
|---|---|
| Northeast | free concerts, july calendar, necbl, little league all stars, alive at five, movies under the stars, prospect park bandshell, summer recipe, harwich mariners, freezer jam |
| Midwest | festivals milwaukee, beaches in michigan, kings island discount, easy summer recipes, lake beaches, motel wisconsin dells, movies in the park, summer desserts, dorm bedding, drive in ohio |
| Southeast | intex, cloudy pool, summer things, alabama water park, blue bayou in baton rouge, cloudy pool water, baking soda pool, summer things to do, green pool, springtails |
| Southwest | loans for, how to make string bracelets, pigeon forge hotels, recipes on the grill, sandstone amphitheater, cheap bmx bikes, cataratas del niagara, world rv, cave of the winds colorado springs, produce stand |
| West | concert in the park, berry picking, movies in park, concert in park, blueberry picking, outdoor movies, soak city, lake water park, blueberry farm, broomfield bay |
| Symptom/vector terms | tick, black tick, lyme, lyme disease, rash, bullseye rash, bell’s palsy, facial paralysis, side of face paralyzed, knee pain, swollen knees, swollen joint, swollen joints, joint pain, fever, tired, deer tick, black-legged tick, black legged tick, black leg tick |
Number of search terms that had monthly proportional hit data available from Gtrends™.
| Region | Terms Into Gtrends™ | Terms From Gtrends™ |
|---|---|---|
| Northeast | 120 | 87 |
| Midwest | 120 | 86 |
| Southeast | 120 | 80 |
| Southwest | 120 | 42 |
| West | 120 | 83 |
Summary of bivariate correlations between full-term list search terms and regional Lyme disease rates in the model training data.
| Region | Range | Mean Correlation | Median Correlation |
|---|---|---|---|
| Northeast | -0.279, 0.893 | 0.560 | 0.663 |
| Midwest | -0.245, 0.898 | 0.602 | 0.691 |
| Southeast | -0.137, 0.840 | 0.524 | 0.590 |
| Southwest | -0.065, 0.612 | 0.229 | 0.231 |
| West | -0.165, 0.836 | 0.421 | 0.416 |
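The summary table above reports the range, mean, and median of per-term correlations with regional Lyme disease rates. A minimal sketch of how such a summary could be computed is shown here, assuming Pearson correlation (the record does not name the correlation measure). The helper names and all series values are hypothetical and made up for illustration.

```python
from statistics import mean, median


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def summarise_correlations(term_series, rates):
    """Correlate each term's monthly series with the rates, then summarise."""
    corrs = {term: pearson(series, rates) for term, series in term_series.items()}
    values = list(corrs.values())
    summary = {
        "range": (min(values), max(values)),
        "mean": mean(values),
        "median": median(values),
    }
    return summary, corrs


# Made-up monthly rates and proportional hit values for three terms.
toy_rates = [0.2, 0.5, 1.8, 3.1, 2.4, 0.6]
toy_series = {
    "tick": [10, 25, 90, 155, 120, 30],        # rises and falls with the rates
    "snow boots": [90, 75, 10, -55, -20, 70],  # mirror image of "tick"
    "loans for": [50, 60, 40, 55, 45, 52],     # no obvious seasonal pattern
}

summary, per_term = summarise_correlations(toy_series, toy_rates)
print(summary)
```

A term whose series is constant over the training period has zero variance, so its correlation is undefined; this sketch would raise a division error in that case, and such terms would need to be dropped first.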
Ten most correlated regional search terms for training period (2004–2012).
| Northeast term | Corr. | Midwest term | Corr. | Southeast term | Corr. | Southwest term | Corr. | West term | Corr. |
|---|---|---|---|---|---|---|---|---|---|
| july calendar | 0.89 | kings island discount | 0.90 | intex | 0.84 | loans for | 0.61 | movies in park | 0.84 |
| free concerts | 0.88 | beaches in michigan | 0.90 | cloudy pool | 0.84 | hotels ca | 0.55 | movies in the park | 0.83 |
| movies under the stars | 0.87 | festivals milwaukee | 0.89 | summer things | 0.81 | ca water | 0.55 | movie in park | 0.82 |
| lyme | 0.85 | easy summer recipes | 0.88 | baking soda pool | 0.80 | deer tick | 0.45 | concert in the park | 0.80 |
| summer recipe | 0.85 | lake beaches | 0.88 | green pool | 0.80 | moon bay ca | 0.44 | berry picking | 0.80 |
| lyme disease | 0.85 | motel wisconsin dells | 0.87 | alabama water park | 0.79 | half moon bay ca | 0.40 | blueberry farm | 0.79 |
| little league all stars | 0.85 | blueberry farm | 0.85 | cloudy pool water | 0.79 | make string bracelets | 0.40 | concert in park | 0.79 |
| necbl | 0.84 | summer desserts | 0.85 | summer things to do | 0.79 | rash | 0.39 | blueberry picking | 0.78 |
| berry picking | 0.83 | movies in the park | 0.85 | blue bayou in baton rouge | 0.77 | tick | 0.38 | outdoor movies | 0.77 |
| alive at five | 0.83 | watermelon recipe | 0.84 | springtails | 0.75 | how to make string bracelets | 0.38 | lake water park | 0.76 |
** p << 0.05
* p < 0.05.
Models using only symptom and vector terms produce accurate predictions with low error.
| | Northeast | Midwest | Southeast | Southwest | West |
|---|---|---|---|---|---|
| | 0.47, 0.60 | 0.33, 0.20 | 0.29, 0.07 | 0.11, 0.01 | 0.1, 0.01 |
| | | | | | |
| RMSE | 1.32 | 0.36 | 0.11 | 0.01 | 0.01 |
| MAE | 0.89 | 0.21 | 0.07 | 0.01 | 0.01 |
| R² | 0.77 | 0.65 | 0.67 | 0.32 | 0.50 |
| | | | | | |
| RMSE | 1.50 | 0.38 | 0.11 | 0.01 | 0.01 |
| MAE | 1.01 | 0.25 | 0.07 | 0.01 | 0.01 |
| R² | 0.71 | 0.59 | 0.69 | 0.38 | 0.29 |
| | | | | | |
| RMSE | 1.65 | 0.43 | 0.14 | 0.01 | 0.01 |
| MAE | 1.38 | 0.34 | 0.10 | 0.01 | 0.01 |
| R² | 0.79 | 0.76 | 0.82 | 0.37 | 0.63 |
Models using the full-term list produce highly accurate predictions with low error.
| | Northeast | Midwest | Southeast | Southwest | West |
|---|---|---|---|---|---|
| | 0.1, 0.85 | 0.93, 0.00 | 0.1, 0.07 | 0.1, 0.01 | 0.1, 0.00 |
| | | | | | |
| RMSE | 0.66 | 0.12 | 0.06 | 0.01 | 0.01 |
| MAE | 0.46 | 0.09 | 0.04 | 0.01 | 0.00 |
| R² | 0.94 | 0.95 | 0.91 | 0.56 | 0.84 |
| | | | | | |
| RMSE | 0.99 | 0.23 | 0.08 | 0.01 | 0.01 |
| MAE | 0.62 | 0.14 | 0.05 | 0.01 | 0.01 |
| R² | 0.87 | 0.85 | 0.84 | 0.44 | 0.70 |
| | | | | | |
| RMSE | 0.74 | 0.29 | 0.14 | 0.01 | 0.01 |
| MAE | 0.52 | 0.17 | 0.09 | 0.01 | 0.01 |
| R² | 0.97 | 0.94 | 0.91 | 0.45 | 0.82 |
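The accuracy tables report Root Mean Square Error, Mean Absolute Error, and R² for each region. For readers unfamiliar with these metrics, the sketch below computes them on a small set of made-up observed and predicted values. One assumption to flag: R² is computed here as the coefficient of determination (1 minus residual over total sum of squares); some modelling packages instead report the squared correlation between observed and predicted values, and the record does not say which definition the authors used.

```python
from math import sqrt


def rmse(observed, predicted):
    """Root mean square error: penalises large misses more heavily."""
    n = len(observed)
    return sqrt(sum((o - p) ** 2 for o, p in zip(observed, predicted)) / n)


def mae(observed, predicted):
    """Mean absolute error: average size of a miss, in the units of the rate."""
    n = len(observed)
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / n


def r_squared(observed, predicted):
    """Coefficient of determination: 1 - SS_residual / SS_total."""
    mean_obs = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean_obs) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot


# Hypothetical monthly rates and model predictions (not from the study).
toy_observed = [1.0, 2.0, 3.0, 4.0]
toy_predicted = [1.5, 2.0, 2.5, 4.0]

print("RMSE:", rmse(toy_observed, toy_predicted))
print("MAE:", mae(toy_observed, toy_predicted))
print("R2:", r_squared(toy_observed, toy_predicted))
```

MAE can never exceed RMSE on the same predictions, which is consistent with every RMSE and MAE pair in the tables.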
The three most important terms for each model are often environmentally themed.
| Region | Term | Importance | Term | Importance |
|---|---|---|---|---|
| Northeast | July Calendar | 100.00 | July Calendar | 100.00 |
| Northeast | Fresh Cherry Pie | 82.12 | Fresh Cherry Pie | 83.29 |
| Northeast | Bullseye Rash | 75.51 | Bullseye Rash | 75.47 |
| Midwest | Festivals Milwaukee | 100.00 | Festivals Milwaukee | 100.00 |
| Midwest | Lake Beaches | 97.35 | Kings Island Discount | 99.16 |
| Midwest | Kings Island Discount | 96.35 | Lake Beaches | 97.40 |
| Southeast | Intex Pool Cover | 100.00 | Intex Pool Cover | 100.00 |
| Southeast | Rash | 87.07 | Rash | 88.06 |
| Southeast | Swampdogs | 85.64 | Swampdogs | 85.45 |
| Southwest | Loans for | 100.00 | Loans for | 100.00 |
| Southwest | CA Water | 67.20 | CA Water | 66.82 |
| Southwest | Hotels CA | 61.00 | Hotels CA | 60.14 |
| West | Movies in the Park | 100.00 | Movies in the Park | 100.00 |
| West | Concert in the Park | 69.18 | Concert in the Park | 69.65 |
| West | Waterworld Denver | 62.13 | Waterworld Denver | 62.44 |