| Literature DB >> 25689608 |
Jake Luo1, Guo-Qiang Zhang, Susan Wentz, Licong Cui, Rong Xu.
Abstract
BACKGROUND: There has been a significant increase in the popularity of Web-based question-and-answer (Q&A) services that provide health care information for consumers. Large amounts of Q&As have been archived in these online communities, which form a valuable knowledge base for consumers who seek answers to their health care concerns. However, due to consumers' possible lack of professional knowledge, it is still very challenging for them to find Q&As that are closely relevant to their own health problems. Consumers often repeatedly ask similar questions that have already been answered previously by other users.Entities:
Keywords: Consumer Health Informatics; Consumer Question Retrieval; Health Care Questions; Health Information Delivery; Netwellness.org; Online Health Information Seeking; Search and Query; Similarity Analysis
Mesh:
Year: 2015 PMID: 25689608 PMCID: PMC4376128 DOI: 10.2196/jmir.3388
Source DB: PubMed Journal: J Med Internet Res ISSN: 1438-8871 Impact factor: 5.428
Figure 1Overview of the SimQ framework for analyzing similarity of consumer health question.
Figure 2Parsed syntactic tree and semantic dependency.
Figure 3Dice coefficient (1) and cosine similarity (2) formulas.
Figure 4Overview of the semantic dependency network of the “Diet and Nutrition” topic.
Examples of SimQ calculated similar questions.
| Rank | Similar questions | Similarity score | |
|
|
| ||
| 1 |
| Swollen throat glands are sore? | 0.7368 |
| 2 |
| Sore throat and swollen glands? | 0.6718 |
| 3 |
| Swollen feeling in throat, can`t swallow well? | 0.6545 |
| 4 |
| My throat is sore all the time and also my glands? | 0.5901 |
| 5 |
| Painful swollen uvula, please help? | 0.5611 |
|
|
| ||
| 1 |
| Less platelet count? | 0.8235 |
| 2 |
| What causes low platelet count? | 0.7906 |
| 3 |
| Extremely low platelet count? | 0.7726 |
| 4 |
| Decreased platelet count? | 0.7003 |
| 5 |
| Food for increase in platelet count? | 0.5957 |
Evaluation of different feature representations for consumer Q&A similarity analysis (average of 12 experiments using 12 seed questions).
| Feature | True | False | True | False | Precision % | Recall % |
| |
|
| ||||||||
|
| Baseline (B) | 12.83 | 7.67 | 1969.33 | 7.67 | 62.6% | 62.6% | 62.6% |
|
| Normalized (N) | 12.67 | 6.67 | 1970.33 | 7.83 | 65.5% | 61.8% | 63.6% |
|
| Concept (C) | 14.00 | 8.33 | 1968.67 | 6.50 | 62.7% | 68.3% | 65.4% |
|
| N+POS (P) | 11.67 | 7.67 | 1969.33 | 8.83 | 60.3% | 56.9% | 58.6% |
|
| N+ Concept (NC) | 15.00 | 7.50 | 1969.50 | 5.50 | 66.7% | 73.2% | 69.8% |
|
| N+C+Type (NCT) | 15.33 | 6.67 | 1970.33 | 5.17 | 69.7% | 74.8% | 72.1% |
|
| ||||||||
|
| Baseline (B) | 11.33 | 3.17 | 1973.83 | 9.17 | 78.1% | 55.3% | 64.7% |
|
| Normalized (N) | 15.50 | 10.33 | 1966.67 | 5.00 | 60.0% | 75.6% | 66.9% |
|
| Concept (C) | 15.33 | 8.00 | 1969.00 | 5.17 | 65.7% | 74.8% | 70.0% |
|
| N+POS (P) | 11.67 | 5.67 | 1971.33 | 8.83 | 67.3% | 56.9% | 61.7% |
|
| N+ Concept (NC) | 14.33 | 3.83 | 1973.17 | 6.17 | 78.9% | 69.9% | 74.1% |
|
| N+C+Type (NCT) | 16.00 | 6.17 | 1970.83 | 4.50 | 72.2% | 78.0% | 75.0% |
Figure 5Application of SimQ for NetWellness.