| Literature DB >> 28389418 |
Yingjie Lu1, Yang Wu1, Jingfang Liu2, Jia Li3, Pengzhu Zhang4.
Abstract
BACKGROUND: Health care social media used for health information exchange and emotional communication involves different types of users, including patients, caregivers, and health professionals. However, it is difficult to identify different stakeholders because user identification data are lacking due to privacy protection and proprietary interests. Therefore, identifying the concerns of different stakeholders and how they use health care social media when confronted with huge amounts of health-related messages posted by users is a critical problem.Entities:
Keywords: health care social media; sentiment analysis; stakeholder analysis; topic analysis
Mesh:
Year: 2017 PMID: 28389418 PMCID: PMC5400888 DOI: 10.2196/jmir.7087
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Data collection statistics (January 2007 to October 2016) from 3 disease forums on MedHelp.org.
| Disease type | No. of messages | No. of members | Messages per member |
| Lung cancer | 5317 | 2416 | 2.20 |
| Diabetes | 35,193 | 11,571 | 3.04 |
| Breast cancer | 97,651 | 25,619 | 3.81 |
Figure 1Research framework.
Stakeholder analysis in the 3 disease forums.
| Cluster | Keywords | Authorship |
| 1 | in my chest, on my chest, my lungs, my left lung, my right lung, please help me, of my chest, I was diagnosed, my ct scan, doctor tell me, my xray result, my question, i was wondering, I have cancer, be greatly appreciated | Patients |
| 2 | my husband, thank you, my mother, my dad, my mom, my sister, my question is, my father, do you think, i am worried, on his lung, thanks in advance, thanks so much, father in law | Caregivers |
| 3 | good luck, all the best, hope this helps, stay positive, you should, god bless, your mother, your father, your husband, with your doctor, you need to, let us know, sorry to hear, you could consider, see your doctor, your symptoms, with your physician, best of luck | Specialists |
| 1 | my sugar, help me, I was diagnosed, my sugar levels, my blood, I was wondering, thank you, my body, my question, thanks for your, my blood sugar, I need to, my question is, I have diabetes, be greatly appreciated, type 1 diabetic | Patients |
| 2 | my husband, was diagnosed, he was diagnosed, her blood sugar, my son, my daughter, his blood sugar, low blood sugar, want to know, he has been, she has been, thanks so much, my son is, daughter was diagnosed | Caregivers |
| 3 | your blood, you need, your doctor, you need to, good luck, you should, your glucose, your blood sugar, I would suggest, your blood sugars, with your doctor, let us know, hope this helps, I hope you, your glucose levels, | Specialists |
| 1 | my breast, my nipple, my breasts, thank you, my question, should I, my left breast, in my right, on my left, in my breast, my right breast, found a lump, thanks for your, be greatly appreciated, of my breast | Patients |
| 2 | my mom, she had, my mother, her breast, she was diagnosed, family history, had breast cancer, my question is, worry about, her left breast, her right breast, my sister | Caregivers |
| 3 | I hope, you can, your doctor, you should, best wishes, your breast, good luck, you need to, your oncologist, let us know, with your doctor, hope this helps, all the best, a second opinion, second opinion | Specialists |
Performance measures for distinguishing stakeholders using different textual feature setsa.
| Disease | Feature set | Performance measure | ||
| Rand index | Jaccard similarity coefficient | Fowlkes-Mallows index | ||
| Lung cancer | F1 | 0.712 | 0.261 | 0.395 |
| F1+F2 | 0.731 | 0.321 | 0.441 | |
| F1+F2+F3 | 0.757 | 0.349 | 0.473 | |
| F1+F2+F3+F4 | 0.785 | 0.371 | 0.501 | |
| Diabetes | F1 | 0.717 | 0.273 | 0.401 |
| F1+F2 | 0.742 | 0.335 | 0.456 | |
| F1+F2+F3 | 0.780 | 0.367 | 0.489 | |
| F1+F2+F3+F4 | 0.792 | 0.381 | 0.523 | |
| Breast cancer | F1 | 0.725 | 0.297 | 0.421 |
| F1+F2 | 0.779 | 0.356 | 0.481 | |
| F1+F2+F3 | 0.793 | 0.385 | 0.529 | |
| F1+F2+F3+F4 | 0.802 | 0.393 | 0.537 | |
aFeature set components: style-based features (F1), word n-grams (F2), medical domain-specific terminologies (F3), and kinship terminologies (F4).
Figure 2Distributions of the 3 stakeholder groups.
Clustering results in the stakeholder analysis.
| Disease | Cluster | No. of members | No. of messages | Messages per member | Authorship |
| Lung cancer | 1 | 1202 | 1378 | 1.15 | Patients |
| 2 | 1053 | 1607 | 1.53 | Caregivers | |
| 3 | 161 | 2332 | 14.48 | Specialists | |
| Diabetes | 1 | 5738 | 7691 | 1.34 | Patients |
| 2 | 4836 | 7136 | 1.48 | Caregivers | |
| 3 | 997 | 20,366 | 20.43 | Specialists | |
| Breast cancer | 1 | 17,489 | 27,012 | 1.55 | Patients |
| 2 | 6343 | 9727 | 1.53 | Caregivers | |
| 3 | 1787 | 60,912 | 34.09 | Specialists |
Figure 3Changes in the proportions of stakeholder groups in (A) lung cancer, (B) diabetes, and (C) breast cancer forums, January 2007 to October 2016.
Topic analysis in the 3 disease forums.
| Cluster | Topics | Keywords | UMLSa semantic types |
| 1 | Symptom | cough, pain, breathless, symptoms, chest pain, painful, shortness of breath, wheezing, short of breath, coughing up blood, nausea | sosy |
| 2 | Complication | infection, bronchitis, pneumonia, tuberculosis, asthma, pleural effusion, copd, emphysema, collapsed lung, atelectasis | dsyn, patf |
| 3 | Examination | scans, x-ray, cat scan, mri, biopsy, pet scan, chest x-ray, imaging, biopsy needle, bronchoscopy | diap |
| 4 | Procedure | chemo, operation, surgery, radiation, therapy, chemotherapy, removal, radiation therapy, wedge resection, lobectomy | topp |
| 5 | Drug | silicas, morphine, advil, tarceva, chantix, carboplatin, alimta, dilaudid, taxol, coumadin | phsu |
| 1 | Drug | insulin, lantus, januvia, metformin, glucophage, actos, avandia, amaryl, marijuana, glipizide | phsu |
| 2 | Complication | infection, hypoglycemia, low blood sugar, dka, obesity, pcos, kidney disease, coma, diabetic neuropathy, bgs | dsyn, patf |
| 3 | Symptom | pain, tired, thirsty, nausea, fatigue, frequent urination, hungry, dizzy, itchy, sore, tingling | sosy |
| 4 | Examination | blood test, glucose test, fasting test, fasting blood sugar, cat scan, hemoglobin a1c test, gtts, glucose tolerance test, mri | lbpr, diap |
| 5 | Procedure | infusion, therapy, injection, transplant, dialysis, rx, ect, insulin injection, cde, amputation | topp |
| 1 | Examination | biopsy, mri, ultrasound, mammogram, screening, bi-rads, core biopsy, cat scan, imaging, biopsy needle | diap, lbpr |
| 2 | Procedure | chemo, operation, chemotherapy, radiation, radiotherapy, mastectomy, lumpectomy, implant, removal, surgical | topp |
| 3 | Symptom | sore, pain, painful, breast pain, nipple discharge, itching, tingling, hot flashes, nausea, itchy | sosy |
| 4 | Drug | tamoxifen, arimidex, taxol, femara, taxotere, carboplatin, effexor, docetaxel, valium, raloxifene | phsu |
| 5 | Complication | infection, rash, lymph edema, fibrocystic breast, mastitis, idc, eczema, complex cyst, complex cysts, neuropathy, fibrocystic breast disease, fibrocystic disease | dsyn |
aUMLS: Unified Medical Language System.
Performance measures for distinguishing hot topics using different textual feature setsa.
| Disease | Feature set | Rand | Jaccard | FMb |
| Lung cancer | F1 | 0.703 | 0.242 | 0.382 |
| F1+F2 | 0.761 | 0.352 | 0.478 | |
| Diabetes | F1 | 0.718 | 0.275 | 0.411 |
| F1+F2 | 0.774 | 0.351 | 0.478 | |
| Breast cancer | F1 | 0.722 | 0.285 | 0.417 |
| F1+F2 | 0.783 | 0.369 | 0.495 |
aFeature set components: word n-grams (F1) and medical domain-specific terminologies (F2).
bFM: Fowlkes-Mallows index.
Figure 4Distributions of (A) patients, (B) caregivers, and (C) specialists in the 5 hot topics.
Figure 5Distributions of informative messages and emotional messages by (A) disease and by stakeholder group in (B) lung cancer, (C) diabetes, and (D) breast cancer forums.
Figure 6Sentiment measures of the 3 stakeholder groups.
Figure 7Distributions of positive and negative messages posted by the 3 stakeholder groups in the 5 hot topics.
Figure 8Changes in the sentiment expression of (A) patients, (B) caregivers, and (C) specialists over the first year of each user’s involvement.