| Literature DB >> 28768609 |
Min-Je Choi1, Sung-Hee Kim2, Sukwon Lee3, Bum Chul Kwon4, Ji Soo Yi5, Jaegul Choo1, Jina Huh6.
Abstract
BACKGROUND: While online health social networks (OHSNs) serve as an effective platform for patients to fulfill their various social support needs, predicting the needs of users and providing tailored information remains a challenge.Entities:
Keywords: gradient boosting trees; machine learning; online health community; online health social network; prediction models; social media
Mesh:
Year: 2017 PMID: 28768609 PMCID: PMC5559652 DOI: 10.2196/jmir.7660
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Comparison of different classifier models for obtaining AUC values.
| Classification algorithms | |||||
| Gradient boosting tree | Support vector machine | Decision tree | Random forest | Logistic regression | |
| Emotional support | 0.87 | 0.89 | 0.78 | 0.85 | 0.77 |
| Experience-based information | 0.86 | 0.80 | 0.76 | 0.83 | 0.74 |
| Unconventional information | 0.80 | 0.75 | 0.69 | 0.75 | 0.66 |
| Medical facts | 0.83 | 0.72 | 0.67 | 0.83 | 0.61 |
Figure 1Individual feature importance on each social support need.
Comparison of different data sources for prediction in OHSNs
| Data source | Survey data | User-generated data | User log data |
| Design questionnaires | Perform text mining | Extract data from server database | |
| Conduct surveys | Apply natural language processing on text | ||
| Slow | Fast | Instantaneous | |
| Need to conduct new survey to get recent data | Hundreds of posts written by users everyday | New generated with every user action (eg, access time, search history) | |
| Very easy to understand | Relatively easy to understand | Difficult to derive meaning from raw data | |
| Questions directly suited to user’s intentions | Requires data processing to extract features from long texts | Requires insight on what features to obtain from given data | |
| Numerical data | Text data | Periodical data | |
| Demographic information (eg, age, sex, region) | Demographic information (eg, user profile information) | ||
| Text data for open-ended questions | Hypertext data (eg, accessed links) | ||
| Text data (eg, keywords typed in for search) | |||
| A user’s (dis)agreement toward a particular characteristic | Words that represent a user’s main interests or concerns | Visiting frequency | |
| Open-ended answers toward a question | Response to a particular article | Reading preference | |
| Search preference | |||