| Literature DB >> 28438725 |
Xi Wang1, Kang Zhao1,2, Nick Street1,2.
Abstract
BACKGROUND: Online health communities (OHCs) have become a major source of social support for people with health problems. Members of OHCs interact online with similar peers to seek, receive, and provide different types of social support, such as informational support, emotional support, and companionship. As active participations in an OHC are beneficial to both the OHC and its users, it is important to understand factors related to users' participations and predict user churn for user retention efforts.Entities:
Keywords: community networks; machine learning; patient engagement; prediction; social support
Mesh:
Year: 2017 PMID: 28438725 PMCID: PMC5422656 DOI: 10.2196/jmir.6834
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Example posts for different types of social support.
| Social support category | Examples |
| Companionship (COM) | (1) |
| Seeking informational support (SIS) | |
| Seeking emotional support (SES) | |
| Providing informational support (PIS) | |
| Providing emotional support (PES) | |
| Providing informational support (PIS) and providing emotional support (PES) |
Control variables and independent variables in the survival analysis.
| Variables | Descriptions |
| TotalPosta | The total number of posts a user has published (excluded from the model due to strong correlation with |
| InitPosta | The total number of threads a user initiated |
| NumThreada | The number of threads a user contributed to (excluded from the model due to strong correlation with |
| PESb | The number of a user’s posts that provided emotional support |
| PISc | The number of a user’s posts that provided informational support |
| SESd | The number of a user’s posts that sought emotional support |
| SISe | The number of a user’s posts that sought informational support |
| COMf | The number of a user’s posts that were related to companionship |
| RISD | Direct informational support received—the number of informational support posts a user received after initiating a support-seeking thread. |
| RESD | Direct emotional support received—the number of emotional support posts a user received after initiating a support-seeking thread. |
| RISIg | Indirect informational support received—the number of informational support posts a user was exposed to in threads that she or he did not initiate but contributed to. |
| RESIg | Indirect emotional support received—the number of emotional support posts a user was exposed to in threads that she or he did not initiate but contributed to. |
| RCOMg | Companionship received—the number of companionship posts a user was exposed to in threads that she or he did not initiate but contributed to. |
adenotes the three control variables.
bPES: providing emotional support.
cPIS: providing informational support.
dSES: seeking emotional support.
eSIS: seeking informational support.
fCOM: companionship.
gFor RISI, RESI, and RCOM, we assumed that a user read others’ replies that were posted within 7 days before the user’s replies in the same thread.
Figure 1Log-log plot of users’ posting activities in the online health community (OHC).
Figure 2Log-log plot for the time span of users’ online posting activities.
Performance of classification algorithms for 5 categories of social support.
| Social support | Results | Naïve Bayesian | Logistic regression | Support Vector Machine (polynomial kernel) | Random forest | Decision tree | AdaBoost |
| COMa | Accuracy | .696 | .787 | .783 | .771 | .767 | .804f |
| AUC | .839 | .817 | .768 | .848 | .75 | .852f | |
| PESb | Accuracy | .713 | .830 | .840f | .830 | .81 | .817 |
| AUC | .823 | .787 | .681 | .825f | .687 | .817 | |
| PISc | Accuracy | .753 | .813 | .823f | .767 | .779 | .801 |
| AUC | .824 | .83 | .783 | .837 | .717 | .859f | |
| SESd | Accuracy | .893 | .901 | .970f | .967 | .963 | .963 |
| AUC | .749 | .867f | .656 | .851 | .671 | .668 | |
| SISe | Accuracy | .851 | .880 | .943f | .931 | .937 | .914 |
| AUC | .893f | .803 | .745 | .86 | .766 | .869 |
aCOM: companionship.
bPES: providing emotional support.
cPIS: providing informational support.
dSES: seeking emotional support.
eSIS: seeking informational support.
fdenotes the best performer for each row.
Results from the survival analysis experiment.
| Variables | Hazard ratio | |
| InitPost (control) | .995 | .75 |
| PESa | 1.000 | .99 |
| PISb | .948*** | .001 |
| SESc | .972* | .01 |
| SISd | 1.050*** | .000 |
| COMe | .919*** | .000 |
| RISD | 1.047* | .02 |
| RESD | .997 | .79 |
| RISI | 1.053* | .02 |
| RESI | .964 | .11 |
| RCOM | .983 | .41 |
aPES: providing emotional support.
bPIS: providing informational support.
cSES: seeking emotional support.
dSIS: seeking informational support.
eCOM: companionship.
Performance of the unified model on hold-out testing sets in different weeks (precision and recall are for the positive class).
| Measures | Churn in the 1st week | Churn in the 3rd week | Churn in the 5th week | Churn in the 13th week |
| Precision | .950 | .872 | .880 | .838 |
| Recall | .937 | .534 | .511 | .504 |
| F1 score | .943 | .662 | .647 | .629 |
| AUCa | .972 | .901 | .909 | .929 |
aAUC: area under the receiver operating characteristic (ROC) curve.
Figure 3Empirical and predicted hazard curves for user participations.
Top 20 features by information gain for the full unified model.
| Rank | Feature |
| 1 | Stability of the total number of threads a user initiated during the last week of the training period |
| 2 | Stability of the number of threads a user participated during the last week of the training period |
| 3 | Stability of the number of SISa a user posted during the last week of the training period |
| 4 | Stability of the number of SESb a user posted during the last week of the training period |
| 5 | Stability of the total number of posts from a user during the last week of the training period |
| 6 | Stability of the number of PISc a user posted during the last week of the training period |
| 7 | Stability of the number of PESd posts a user received directly during the last week of the training period |
| 8 | Stability of the number of PES a user posted during the last week of the training period |
| 9 | Stability of the number of COMe a user posted during the last week of the training period |
| 10 | Stability of the number of PIS posts a user received directly during the last week of the training period |
| 11 | Stability of the number of COM a user was exposed to during the last week of the training period |
| 12 | Stability of the number of PES posts a user received indirectly during the last week of the training period |
| 13 | Stability of the number of PIS posts a user received indirectly during the last week of the training period |
| 14 | Total number of posts from a user during the last week of the training period |
| 15 | The number of threads a user participated in during the last week of the training period |
| 16 | Stability of the number of threads a user participated in across weeks |
| 17 | Stability of the total number of posts from a user across weeks |
| 18 | Entropy of the total number of posts from a user during the last week of the training period |
| 19 | Stability of the number of PIS posts a user received indirectly across weeks |
| 20 | Stability of the total number of threads a user initiated across weeks |
aSIS: seeking informational support.
bSES: seeking emotional support.
cPIS: providing informational support.
dPES: providing emotional support.
eCOM: companionship.
Total numbers of posts in each category of social support.
| Social support category | Total number of posts |
| Companionship (COM) | 932,538 |
| Seeking informational support (SIS) | 284,027 |
| Seeking emotional support (SES) | 227,188 |
| Providing informational support (PIS) | 1,034,682 |
| Providing emotional support (PES) | 497,096 |