| Literature DB >> 30001360 |
Sean D Young1, Qingpeng Zhang2.
Abstract
BACKGROUND: A large and growing body of "big data" is generated by internet search engines, such as Google. Because people often search for information about public health and medical issues, researchers may be able to use search engine data to monitor and predict public health problems, such as HIV. We sought to assess the feasibility of using Google search data to analyze and predict new HIV diagnoses cases in the United States. METHODS ANDEntities:
Mesh:
Year: 2018 PMID: 30001360 PMCID: PMC6042696 DOI: 10.1371/journal.pone.0199527
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Results of using Google Trends-based model to predict new HIV case diagnoses from 2011–2014.
| 2011 | 2012 | 2013 | 2014 | RMSE(Avg.) | |
|---|---|---|---|---|---|
| 109.85 | 109.674 | 62.394 | 193.1 | 118.75 | |
| 0.997 | 0.998 | 0.999 | 0.995 | ||
| 0.995 | 0.997 | 0.997 | 0.991 |
RMSE = Root-mean-square error.
Average coefficient of each variable (excluding control variables) and the proportion of states in which the variable is significant.
| Variable | Prediction for 2011 | Prediction for 2012 | Prediction for 2013 | Prediction for 2014 | Significant proportion |
|---|---|---|---|---|---|
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| -0.05 | 0 | 0 | 0 | 25% | |
| 0 | 0 | 0 | 0 | 0% | |
| -0.04 | -0.03 | 0 | 0 | 50% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0.02 | 0.01 | 0 | 0 | 50% | |
| 0.02 | 0.01 | 0 | 0 | 50% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0.01 | 0 | 25% | |
| 0 | 0 | 0.01 | 0.01 | 50% | |
| -0.17 | -0.11 | -0.07 | -0.05 | 100% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0 | 0 | 0 | 0 | 0% | |
| -0.03 | 0 | 0 | 0 | 25% | |
| 0 | 0 | 0 | 0 | 0% | |
| 0.01 | 0 | 0 | 0 | 25% | |
| 0.45 | 0.56 | 0 | 0.36 | 75% | |
| 0.95 | 0.97 | 0.98 | 0.98 | 100% |
Fig 1The average percentage of difference (forecast error) for each state (2011 to 2014).
Fig 2Map of social media using the keyword “HIV” in the United States.
Image from UCIPT’s HIV ChatterMap tool.