Sanmitra Bhattacharya, Padmini Srinivasan, Philip Polgreen.
Abstract
BACKGROUND: Individuals and organizations increasingly use social media platforms such as Facebook for a wide variety of purposes, including disseminating, discussing and seeking health-related information. U.S. federal health agencies use these platforms to engage social media users to read, spread, promote and encourage health-related discussions. However, different agencies and their communications receive varying levels of engagement. In this study we use statistical models to identify factors associated with engagement.
Keywords: Data mining; Engagement analysis; Facebook; Hurdle model; Proportional hazards model; Social media mining; Statistical modeling
Year: 2017 PMID: 28431582 PMCID: PMC5401385 DOI: 10.1186/s12911-017-0447-z
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Facebook features examined
| Features | Description |
|---|---|
| Page likes | # of Facebook users liking a page (log-transformed) |
| Post type | Classification of the post into six categories such as link, photo, video, etc. |
| Sentiment | Two scores: one for positivity and the other for negativity |
| Content (Semantic Groups) | Classification of each post into 15 semantic groups using MTI followed by post-processing. Multiple classes per post allowed. |
Agencies and accounts on Facebook
| Agency | Name | # accounts | Examples of accounts |
|---|---|---|---|
| ACF | Administration for Children & Families | 1 | Child_Welfare_Information_Gateway |
| AoA | Administration on Aging | 2 | Administration_on_Aging, etc. |
| CDC | Centers for Disease Control & Prevention | 10 | CDC_Tobacco_Free, etc. |
| FDA | U.S. Food & Drug Administration | 1 | U.S._Food_and_Drug_Administration |
| HRSA | Health Resources & Services Administration | 2 | Health_Resources_and_Service_Administration_(HRSA), etc. |
| NIH | National Institutes of Health | 8 | Fogarty_International_Center, etc. |
| NIH/NCCAM | National Center for Complementary & Alternative Medicine | 1 | National_Center_for_Complementary_and_Alternative_Medicine |
| NIH/NCI | National Cancer Institute | 3 | National_Cancer_Institute, etc. |
| NIH/NEI | National Eye Institute | 1 | National_Eye_Health_Education_Program_(NEHEP) |
| NIH/NHGRI | National Human Genome Research Institute | 1 | National_DNA_Day |
| NIH/NHLBI | National Heart, Lung, & Blood Institute | 4 | National_Heart,_Lung,_and_Blood_Institute_(NHLBI), etc. |
| NIH/NIAID | National Institute of Allergy & Infectious Diseases | 1 | National_Institute_of_Allergy_and_Infectious_Diseases_(NIAID) |
| NIH/NIAMS | National Institute of Arthritis & Musculoskeletal & Skin Diseases | 2 | National_Institute_of_Arthritis_and_Musculoskeletal_and_Skin_Diseases_Labs, etc. |
| NIH/NICHD | National Institute of Child Health and Human Development | 1 | Eunice_Kennedy_Shriver_National_Institute_of_Child_Health_and_Human_Development |
| NIH/NIDA | National Institute on Drug Abuse | 2 | Drug_Facts, etc. |
| NIH/NIDDK | National Institute of Diabetes and Digestive and Kidney Diseases | 3 | National_Diabetes_Education_Program_(NDEP), etc. |
| NIH/NIEHS | National Institute of Environmental Health Sciences | 1 | National_Institute_of_Environmental_Health_Sciences |
| NIH/NIGMS | National Institute of General Medical Sciences | 1 | National_Institute_of_General_Medical_Sciences |
| NIH/NIMH | National Institute of Mental Health | 1 | National_Institute_of_Mental_Health |
| NIH/NINDS | National Institute of Neurological Disorders and Stroke | 1 | Know_Stroke |
| NIH/NLM | National Library of Medicine | 6 | Women’s_Health_Resources, etc. |
| NIH/OBSSR | NIH Office of Behavioral and Social Sciences Research | 1 | The_Office_of_Behavioral_and_Social_Sciences_Research_(OBSSR) |
| OS | Office of the Secretary | 16 | Best_Bones_Forever!, etc. |
| SAMHSA | Substance Abuse & Mental Health Services Administration | 2 | Disaster_Distress_Helpline, etc. |
| Grand Total | | 72 | |
Posts and activities per agency on Facebook
| Agency | # posts | # posts with zero activity | # posts with at least one activity | # likes | # shares | # comments | # total activity | # activity per post | # activity per non-zero activity post |
|---|---|---|---|---|---|---|---|---|---|
| ACF | 372 | 21 (5.65%) | 351 (94.35%) | 2235 | 647 | 265 | 3147 | 8.46 | 8.97 |
| AoA | 1878 | 320 (17.04%) | 1558 (82.96%) | 5138 | 3381 | 363 | 8882 | 4.73 | 5.70 |
| CDC | 7313 | 1149 (15.71%) | 6164 (84.29%) | 253,607 | 118,644 | 35,659 | 407,910 | 55.78 | 66.18 |
| FDA | 538 | 119 (22.12%) | 419 (77.88%) | 12,008 | 6321 | 6085 | 24,414 | 45.38 | 58.27 |
| HRSA | 2456 | 609 (24.8%) | 1847 (75.2%) | 8203 | 1306 | 2092 | 11,601 | 4.72 | 6.28 |
| NIH | 2831 | 738 (26.07%) | 2093 (73.93%) | 27,391 | 10,012 | 1985 | 39,388 | 13.91 | 18.82 |
| NIH/NCCAM | 659 | 79 (11.99%) | 580 (88.01%) | 5803 | 2338 | 510 | 8651 | 13.13 | 14.92 |
| NIH/NCI | 3455 | 585 (16.93%) | 2870 (83.07%) | 27,685 | 4429 | 5475 | 37,589 | 10.88 | 13.10 |
| NIH/NEI | 447 | 87 (19.46%) | 360 (80.54%) | 1799 | 1860 | 86 | 3745 | 8.38 | 10.40 |
| NIH/NHGRI | 417 | 25 (6%) | 392 (94%) | 5226 | 1613 | 409 | 7248 | 17.38 | 18.49 |
| NIH/NHLBI | 3510 | 524 (14.93%) | 2986 (85.07%) | 82,420 | 26,606 | 6078 | 115,104 | 32.79 | 38.55 |
| NIH/NIAID | 632 | 114 (18.04%) | 518 (81.96%) | 2811 | 383 | 181 | 3375 | 5.34 | 6.52 |
| NIH/NIAMS | 414 | 44 (10.63%) | 370 (89.37%) | 1165 | 128 | 63 | 1356 | 3.28 | 3.66 |
| NIH/NICHD | 332 | 40 (12.05%) | 292 (87.95%) | 762 | 192 | 48 | 1002 | 3.02 | 3.43 |
| NIH/NIDA | 1657 | 177 (10.68%) | 1480 (89.32%) | 13,772 | 11,423 | 1232 | 26,427 | 15.95 | 17.86 |
| NIH/NIDDK | 1720 | 451 (26.22%) | 1269 (73.78%) | 4702 | 1239 | 785 | 6726 | 3.91 | 5.30 |
| NIH/NIEHS | 148 | 47 (31.76%) | 101 (68.24%) | 287 | 90 | 41 | 418 | 2.82 | 4.14 |
| NIH/NIGMS | 236 | 53 (22.46%) | 183 (77.54%) | 1191 | 222 | 166 | 1579 | 6.69 | 8.63 |
| NIH/NIMH | 427 | 23 (5.39%) | 404 (94.61%) | 13,130 | 6574 | 1752 | 21,456 | 50.25 | 53.11 |
| NIH/NINDS | 83 | 17 (20.48%) | 66 (79.52%) | 427 | 121 | 86 | 634 | 7.64 | 9.61 |
| NIH/NLM | 4076 | 1695 (41.58%) | 2381 (58.42%) | 24,280 | 5861 | 1903 | 32,044 | 7.86 | 13.46 |
| NIH/OBSSR | 188 | 75 (39.89%) | 113 (60.11%) | 212 | 55 | 26 | 293 | 1.56 | 2.59 |
| OS | 9158 | 1233 (13.46%) | 7925 (86.54%) | 172,550 | 57,372 | 28,281 | 258,203 | 28.19 | 32.58 |
| SAMHSA | 2915 | 761 (26.11%) | 2154 (73.89%) | 25,657 | 11,059 | 3089 | 39,805 | 13.66 | 18.48 |
| Total | 45,862 | 8986 (19.59%) | 36,876 (80.41%) | 692,461 | 271,876 | 96,660 | 1,060,997 | 23.13 | 28.77 |
| Median | 645.5 | 116.5 | 549 | 5514.5 | 2099 | 647.5 | 8766.5 | 8.42 | 11.75 |
| Mean (SD) | 1910.92 (2327.30) | 374.42 (459.42) | 1536.50 (1947.75) | 28852.54 (60566.59) | 11328.17 (25970.74) | 4027.50 (8879.44) | 44208.21 (94905.29) | 15.24 (15.67) | 18.29 (18.17) |
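
The derived columns in the table above follow from simple arithmetic: total activity is the sum of likes, shares and comments, and the two per-post rates divide that total by all posts or by posts with at least one activity. The following is a minimal sketch that recomputes these columns for three rows copied from the table; the function name `activity_rates` and the dictionary layout are illustrative assumptions, not part of the study.

```python
# Counts copied from the table above, in the order:
# (posts, posts with at least one activity, likes, shares, comments)
rows = {
    "ACF": (372, 351, 2235, 647, 265),
    "CDC": (7313, 6164, 253607, 118644, 35659),
    "Total": (45862, 36876, 692461, 271876, 96660),
}


def activity_rates(posts, active_posts, likes, shares, comments):
    """Return (total activity, activity per post, activity per non-zero post)."""
    total = likes + shares + comments
    per_post = round(total / posts, 2)
    per_active_post = round(total / active_posts, 2)
    return total, per_post, per_active_post


for agency, counts in rows.items():
    total, per_post, per_active_post = activity_rates(*counts)
    print(f"{agency}: total={total}, per post={per_post}, "
          f"per non-zero post={per_active_post}")
```

For these three rows the recomputed values agree with the reported ones, which suggests the rates are plain ratios of the raw counts.
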
Top 10 accounts with most activity per Facebook post
| Account (Agency) | # posts | # posts with non-zero activity | # posts with zero activity | # likes | # shares | # comments | # total activity | # activities per non-zero activity post |
|---|---|---|---|---|---|---|---|---|
| Let’s_Move (OS) | 457 | 446 (97.59%) | 11 (2.41%) | 73,144 | 23,535 | 13,117 | 109,796 | 246.18 |
| StopBullying.Gov (OS) | 173 | 168 (97.11%) | 5 (2.89%) | 21,882 | 9583 | 4788 | 36,253 | 215.79 |
| Million_Hearts (CDC) | 488 | 432 (88.52%) | 56 (11.48%) | 36,041 | 13,515 | 2204 | 51,760 | 119.81 |
| CDC_Tobacco_Free (CDC) | 457 | 317 (69.37%) | 140 (30.63%) | 15,315 | 17,355 | 1803 | 34,473 | 108.75 |
| CDC (CDC) | 2867 | 2667 (93.02%) | 200 (6.98%) | 177,302 | 78,890 | 29,155 | 285,347 | 106.99 |
| The_Heart_Truth (NIH/NHLBI) | 1056 | 879 (83.24%) | 177 (16.76%) | 61,843 | 21,387 | 3733 | 86,963 | 98.93 |
| National_Institutes_of_Health_(NIH) | 427 | 408 (95.55%) | 19 (4.45%) | 17,522 | 8885 | 947 | 27,354 | 67.04 |
| U.S._Food_and_Drug_Administration (FDA) | 538 | 419 (77.88%) | 119 (22.12%) | 12,008 | 6321 | 6085 | 24,414 | 58.27 |
| National_Institute_of_Mental_Health (NIH/NIMH) | 427 | 404 (94.61%) | 23 (5.39%) | 13,130 | 6574 | 1752 | 21,456 | 53.11 |
| NCBI_-_National_Center_for_Biotechnology_Information (NIH/NLM) | 298 | 260 (87.25%) | 38 (12.75%) | 9658 | 1930 | 619 | 12,207 | 46.95 |
Facebook page likes
| Account | # page likes |
|---|---|
| CDC | 241,342 |
| Let’s_Move | 115,940 |
| Million_Hearts | 53,728 |
| StopBullying.Gov | 49,721 |
| U.S._Food_and_Drug_Administration | 43,240 |
| NCBI_-_National_Center_for_Biotechnology_Information | 43,201 |
| National_Institutes_of_Health_(NIH) | 35,054 |
| The_Heart_Truth | 34,012 |
| National_Institute_of_Mental_Health | 32,484 |
| CDC_en_Español | 20,923 |
Count of various post types
| Post type | # posts |
|---|---|
| link | 28,830 (62.8%) |
| status | 9121 (19.8%) |
| photo | 6428 (14.1%) |
| video/swf | 1333 (2.9%) |
| music | 76 (0.2%) |
| question | 74 (0.2%) |
Distribution of positive and negative sentiments for Facebook posts on a 5-point scale
| Sentiment-level | # of positive posts | # of negative posts |
|---|---|---|
| neutral | 17,477 (38.11%) | 24,281 (52.94%) |
| moderate-medium | 22,846 (49.81%) | 10,426 (22.73%) |
| medium | 4625 (10.08%) | 5267 (11.48%) |
| medium-extreme | 905 (1.97%) | 5673 (12.37%) |
| extreme | 9 (0.02%) | 215 (0.47%) |
| Total | 45,862 | 45,862 |
Semantic groups and their prevalence in the Facebook dataset
| Semantic Groups | # posts |
|---|---|
| Concepts & Ideas | 24,922 (54.34%) |
| Living Beings | 22,733 (49.56%) |
| Geographic Areas | 19,891 (43.37%) |
| Disorders | 19,826 (43.22%) |
| Organizations | 19,299 (42.08%) |
| Activities & Behaviors | 15,072 (32.86%) |
| Physiology | 14,158 (30.87%) |
| Chemicals & Drugs | 9549 (20.82%) |
| Procedures | 9223 (20.11%) |
| Objects | 9034 (19.7%) |
| Phenomena | 6784 (14.79%) |
| Occupations | 4367 (9.52%) |
| Anatomy | 3731 (8.13%) |
| Genes & Molecular Sequences | 406 (0.89%) |
| Devices | 364 (0.79%) |
Results of hurdle negative binomial model for Facebook data. The estimate/coefficient (SE), exponentiated coefficient (OR for the zero portion, IRR for the count portion), z and p-values (*p < 0.05, **p < 0.01, ***p < 0.001) are shown
| Variable | Zero portion: Estimate (SE) | Zero portion: OR | Zero portion: z | Zero portion: p | Count portion: Estimate (SE) | Count portion: IRR | Count portion: z | Count portion: p |
|---|---|---|---|---|---|---|---|---|
| (Intercept) | −2.71 (0.47) | 0.067 | −5.763 | *** | −5.631 (0.169) | 0.004 | −33.356 | *** |
| Log-transformed page likes | 1.102 (0.025) | 3.010 | 43.931 | *** | 1.797 (0.01) | 6.033 | 174.673 | *** |
| link | −0.817 (0.462) | 0.442 | −1.77 | | 0.554 (0.162) | 1.741 | 3.421 | *** |
| music | −0.48 (0.57) | 0.619 | −0.843 | | 0.06 (0.223) | 1.062 | 0.271 | |
| photo | −0.22 (0.464) | 0.802 | −0.475 | | 1.833 (0.163) | 6.253 | 11.267 | *** |
| question | −5.62 (0.659) | 0.004 | −8.528 | *** | −0.456 (0.54) | 0.634 | −0.844 | |
| status | −2.499 (0.462) | 0.082 | −5.408 | *** | 0.861 (0.163) | 2.365 | 5.28 | *** |
| video | −0.388 (0.473) | 0.679 | −0.82 | | 1.041 (0.165) | 2.833 | 6.302 | *** |
| Positive Sentiment | 0.16 (0.023) | 1.174 | 7.051 | *** | 0.118 (0.009) | 1.126 | 12.986 | *** |
| Negative Sentiment | −0.121 (0.015) | 0.886 | −7.857 | *** | −0.068 (0.006) | 0.934 | −10.692 | *** |
| Activities & Behaviors | 0.644 (0.031) | 1.903 | 20.605 | *** | 0.06 (0.013) | 1.061 | 4.741 | *** |
| Anatomy | 0.088 (0.051) | 1.092 | 1.743 | | 0.048 (0.022) | 1.049 | 2.191 | * |
| Chemicals & Drugs | 0.112 (0.035) | 1.118 | 3.237 | ** | 0.07 (0.015) | 1.073 | 4.771 | *** |
| Concepts & Ideas | 0.366 (0.027) | 1.441 | 13.361 | *** | −0.013 (0.012) | 0.987 | −1.041 | |
| Devices | 0.321 (0.161) | 1.378 | 1.998 | * | −0.021 (0.066) | 0.980 | −0.312 | |
| Disorders | 0.329 (0.032) | 1.390 | 10.369 | *** | −0.035 (0.014) | 0.965 | −2.514 | * |
| Genes & Molecular Sequences | 0.567 (0.199) | 1.763 | 2.85 | ** | −0.084 (0.06) | 0.920 | −1.402 | |
| Geographic Areas | 0.091 (0.041) | 1.095 | 2.232 | * | −0.187 (0.017) | 0.830 | −10.776 | *** |
| Living Beings | 0.242 (0.028) | 1.274 | 8.675 | *** | 0.01 (0.012) | 1.010 | 0.787 | |
| Objects | 0.212 (0.036) | 1.236 | 5.9 | *** | −0.117 (0.015) | 0.889 | −7.769 | *** |
| Occupations | 0.055 (0.05) | 1.057 | 1.108 | | −0.232 (0.02) | 0.793 | −11.472 | *** |
| Organizations | −0.35 (0.041) | 0.705 | −8.468 | *** | −0.078 (0.018) | 0.925 | −4.425 | *** |
| Phenomena | 0.257 (0.041) | 1.293 | 6.25 | *** | 0.144 (0.017) | 1.155 | 8.44 | *** |
| Physiology | 0.284 (0.031) | 1.328 | 9.13 | *** | 0.034 (0.013) | 1.035 | 2.614 | ** |
| Procedures | 0.2 (0.036) | 1.222 | 5.597 | *** | −0.034 (0.015) | 0.966 | −2.277 | * |
| Log(theta) | −0.172 (0.011) | 0.842 | −15.005 | *** | ||||
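
The OR and IRR columns above are the exponentiated coefficients of the two model portions. The sketch below recomputes a few of them from coefficients copied from the table. It assumes the usual hurdle parameterization, in which the zero portion is a binary model of whether a post receives any activity and the count portion models the number of activities among posts that do; the helper name `exponentiate` is hypothetical. Small differences from the reported values are expected because the published coefficients are rounded.

```python
import math

# (coefficient, reported exponentiated value) copied from the table above
zero_portion = {
    "Log-transformed page likes": (1.102, 3.010),
    "status": (-2.499, 0.082),
    "Positive Sentiment": (0.16, 1.174),
}
count_portion = {
    "Log-transformed page likes": (1.797, 6.033),
    "photo": (1.833, 6.253),
    "Negative Sentiment": (-0.068, 0.934),
}


def exponentiate(portion):
    """Map each variable to exp(coefficient): an OR or IRR depending on the portion."""
    return {name: math.exp(coef) for name, (coef, _reported) in portion.items()}


odds_ratios = exponentiate(zero_portion)
rate_ratios = exponentiate(count_portion)

for name, (_coef, reported) in zero_portion.items():
    print(f"OR  {name}: computed {odds_ratios[name]:.3f}, reported {reported}")
for name, (_coef, reported) in count_portion.items():
    print(f"IRR {name}: computed {rate_ratios[name]:.3f}, reported {reported}")
```

Under that assumption, an OR above 1 means higher odds that a post receives at least one activity, and an IRR above 1 means a higher expected activity count among posts with any activity.
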
Results of Cox proportional hazards model for interval between a Facebook post and its last activity. The Coefficient (SE), hazard ratio (HR), z and p-values (*p < 0.05, **p < 0.01, ***p < 0.001) for various independent variables are shown
| Variable | Coefficient (SE) | HR | z | p |
|---|---|---|---|---|
| Log-transformed page likes | −0.424 (0.008) | 0.654 | −54.583 | *** |
| Link | −0.142 (0.128) | 0.868 | −1.103 | |
| Music | −0.211 (0.172) | 0.810 | −1.228 | |
| Photo | −0.435 (0.129) | 0.647 | −3.377 | *** |
| Question | 0.221 (0.173) | 1.248 | 1.28 | |
| Status | −0.105 (0.129) | 0.900 | −0.816 | |
| Video | −0.291 (0.131) | 0.748 | −2.214 | * |
| Positive Sentiment | −0.022 (0.007) | 0.979 | −2.989 | ** |
| Negative Sentiment | 0.007 (0.005) | 1.007 | 1.437 | |
| Activities & Behaviors | −0.03 (0.01) | 0.971 | −2.925 | ** |
| Anatomy | −0.004 (0.017) | 0.996 | −0.207 | |
| Chemicals & Drugs | −0.011 (0.012) | 0.989 | −0.935 | |
| Concepts & Ideas | −0.023 (0.01) | 0.977 | −2.376 | * |
| Devices | 0.137 (0.053) | 1.147 | 2.593 | ** |
| Disorders | 0.012 (0.011) | 1.012 | 1.101 | |
| Genes & Molecular Sequences | −0.146 (0.051) | 0.864 | −2.876 | ** |
| Geographic Areas | −0.004 (0.014) | 0.996 | −0.295 | |
| Living Beings | 0 (0.01) | 1.000 | 0.02 | |
| Objects | 0.02 (0.012) | 1.020 | 1.66 | . |
| Occupations | 0.042 (0.016) | 1.043 | 2.578 | ** |
| Organizations | 0.054 (0.014) | 1.056 | 3.846 | *** |
| Phenomena | −0.068 (0.014) | 0.935 | −4.988 | *** |
| Physiology | −0.005 (0.011) | 0.995 | −0.434 | |
| Procedures | −0.028 (0.012) | 0.973 | −2.296 | * |
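
The hazard ratios above are likewise the exponentiated Cox coefficients. The sketch below recomputes three of them and expresses each as a percent change in hazard. It assumes the modeled event is a post's last activity, so that a hazard ratio below 1 corresponds to activity ending later, that is, a longer interval between the post and its last activity; the helper name `hazard_summary` is hypothetical.

```python
import math

# (coefficient, reported hazard ratio) copied from the table above
cox_rows = {
    "Log-transformed page likes": (-0.424, 0.654),
    "Photo": (-0.435, 0.647),
    "Devices": (0.137, 1.147),
}


def hazard_summary(coef):
    """Return (hazard ratio, percent change in hazard) for one coefficient."""
    hr = math.exp(coef)
    return hr, (hr - 1.0) * 100.0


for name, (coef, reported) in cox_rows.items():
    hr, pct = hazard_summary(coef)
    print(f"{name}: HR computed {hr:.3f} (reported {reported}), "
          f"hazard change {pct:+.1f}%")
```
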