| Literature DB >> 25895907 |
Steven Gittelman1, Victor Lange, Carol A Gotway Crawford, Catherine A Okoro, Eugene Lieb, Satvinder S Dhingra, Elaine Trimarchi.
Abstract
BACKGROUND: Investigation into personal health has become focused on conditions at an increasingly local level, while response rates have declined and complicated the process of collecting data at an individual level. Simultaneously, social media data have exploded in availability and have been shown to correlate with the prevalence of certain health conditions.Entities:
Keywords: big data; chronic illness; social networks; surveillance
Mesh:
Year: 2015 PMID: 25895907 PMCID: PMC4419195 DOI: 10.2196/jmir.3970
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Ordinary least squares regression coefficients (β) for life expectancy (all independent variables are standardized).
|
| Facebook only | SES only | Facebook and SES | ||||
|
| β |
| β |
| β |
| |
|
|
|
|
|
|
|
| |
|
| 1 | –0.14 | <.001 | — | — | 0.20 | <.001 |
|
| 2 | 0.79 | <.001 | — | — | 0.43 | <.001 |
|
| 3 | –0.96 | <.001 | — | — | –0.30 | <.001 |
|
| 4 | 0.60 | <.001 | — | — | 0.42 | <.001 |
|
| 5 | 0.69 | <.001 | — | — | 0.41 | <.001 |
|
| 6 | 0.21 | <.001 | — | — | –0.04 | .05 |
|
| 7 | –0.08 | <.001 | — | — | –0.04 | .04 |
|
| 8 | –0.61 | <.001 | — | — | –0.49 | <.001 |
|
| 9 | 0.12 | <.001 | — | — | 0.10 | .70 |
| Age | — | — | 0.16 | <.001 | 0.01 | .87 | |
| Income | — | — | 0.62 | <.001 | 0.59 | <.001 | |
| Education | — | — | 0.88 | <.001 | 0.61 | <.001 | |
| Unemployment | — | — | –0.05 | 0.07 | 0.01 | .70 | |
| Nonwhite population | — | — | –0.85 | <.001 | –0.47 | <.001 | |
| Constant | 77.08 | <.001 | 77.06 | <.001 | 77.06 | <.001 | |
| Adjusted | .69 |
| .64 |
| .81 |
| |
| RMSE | 1.28 |
| 1.29 |
| 1.01 |
| |
Facebook likes impact on model fit for 214 counties.
| Dependent variable | Sourcea | Facebook, | SES, | SES + Facebook, | Improvement with Facebook, % |
| Life expectancy | NVSS | .69 | .64 | .81 | 27% |
| Mortality | NVSS | .57 | .49 | .60 | 22% |
| Low birthweight | NVSS | .53 | .17 | .57 | 235% |
| Obesity | BRFSS | .46 | .56 | .60 | 7% |
| Diabetes | BRFSS | .36 | .39 | .55 | 41% |
| Heart attack | BRFSS | .32 | .46 | .46 | 0% |
| Stroke | BRFSS | .27 | .30 | .41 | 46% |
| Exercise | BRFSS | .57 | .51 | .76 | 49% |
| Insured | BRFSS | .48 | .37 | .65 | 76% |
| Self-Reported health | BRFSS | .51 | .20 | .55 | 175% |
| Smoker | BRFSS | .40 | .42 | .54 | 29% |
| Last checkup | BRFSS | .69 | .30 | .72 | 140% |
| Declined treatment | BRFSS | .39 | .35 | .49 | 40% |
a BRFSS: Behavioral Risk Factor Surveillance System; NVSS: National Vital Statistics System.
Ordinary least squares regression (β) results for prediction of obesity.
| Header | Facebook only | SES only | Facebook and SES | ||||
| β |
| β |
| β |
| ||
|
|
|
|
|
|
|
| |
|
| 1 | 0.04 | .05 | — | — | –0.03 | <.001 |
|
| 2 | –0.02 | .06 | — | — | –0.01 | .14 |
|
| 3 | 0.03 | <.001 | — | — | –0.01 | .07 |
|
| 4 | –0.02 | .06 | — | — | –0.01 | .74 |
|
| 5 | –0.02 | .04 | — | — | 0.03 | .01 |
|
| 6 | –0.02 | .07 | — | — | –0.02 | .13 |
|
| 7 | –0.05 | .30 | — | — | 0.02 | .04 |
|
| 8 | 0.01 | .34 | — | — | 0.01 | .90 |
|
| 9 | 0.02 | .36 | — | — | –0.01 | .17 |
| Age | — | — | –0.01 | .01 | –0.01 | .01 | |
| Income | — | — | –0.01 | .37 | –0.01 | .59 | |
| Education | — | — | –0.03 | <.001 | 0.01 | .35 | |
| Unemployment | — | — | –0.01 | .04 | 0.01 | .58 | |
| Nonwhite population |
|
| 0.02 | .04 |
|
| |
| Constant | 0.29 | <.001 | 0.30 | <.001 | 0.30 | <.001 | |
| Adjusted | .77 | .72 | .8 | ||||
| RMSE | 0.03 |
| 0.03 |
| 0.03 |
| |
Figure 1Actual statistics compared with predicted values for obesity, 2010 BRFSS. Darker colors represent higher prevalence. Light gray indicates missing data.