Papis Wongchaisuwat, Diego Klabjan, Siddhartha Reddy Jonnalagadda.
Abstract
BACKGROUND: Community-based question answering (CQA) sites play an important role in addressing health information needs. However, a significant number of posted questions remain unanswered. Automatically answering the posted questions can provide a useful source of information for Web-based health communities.
Keywords: Web-based health communities; consumer health informatics; machine learning; natural language processing; question answering
Year: 2016 PMID: 27485666 PMCID: PMC4987493 DOI: 10.2196/medinform.5490
Source DB: PubMed Journal: JMIR Med Inform
Figure 1. Overall architecture for training the system.
Figure 2. The distance between two sequences.
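Assuming the sequence distance of Figure 2 is the dynamic time warping (DTW) measure that appears as DTW(Qp, Qt) in the feature list, the standard DTW recurrence can be sketched as follows. The names `dtw_distance` and `word_mismatch`, and the 0/1 word-level cost, are assumptions made for this illustration; the record does not state which local cost the authors used.

```python
from typing import Callable, Sequence


def dtw_distance(a: Sequence, b: Sequence, cost: Callable[[object, object], float]) -> float:
    """Classic dynamic time warping distance between two sequences.

    `cost` is the local distance between one element of `a` and one of `b`.
    Returns infinity when exactly one of the sequences is empty.
    """
    n, m = len(a), len(b)
    inf = float("inf")
    # dp[i][j] = minimal cumulative cost of aligning a[:i] with b[:j]
    dp = [[inf] * (m + 1) for _ in range(n + 1)]
    dp[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = cost(a[i - 1], b[j - 1])
            # Extend the cheapest of the three admissible predecessor alignments.
            dp[i][j] = step + min(dp[i - 1][j], dp[i][j - 1], dp[i - 1][j - 1])
    return dp[n][m]


def word_mismatch(x: str, y: str) -> float:
    """Hypothetical local cost: 0 for identical words, 1 otherwise."""
    return 0.0 if x == y else 1.0


if __name__ == "__main__":
    q1 = "can a recovered alcoholic drink again".split()
    q2 = "can fully recovered alcoholics drink again".split()
    print(dtw_distance(q1, q2, word_mismatch))
```

Because the local cost is passed in as a function, the same recurrence works for word sequences, character sequences, or numeric vectors.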
List of features used in the model.
| Features | Value |
| 1. Text length of Qp | 5 |
| 2. Text length of Qt | 12 |
| 3. Number of stop words contained in Qp | 1 |
| 4. Number of stop words contained in Qt | 5 |
| 5. VS(Qp, Qt) | 3.7052 |
| 6. The difference between VS(Qp, At) and VS(Qt, At) | 0.4303 |
| 7. DTW(Qp, Qt) | 29 |
| 8. The difference between DTW(Qp, At) and DTW(Qt, At) | 14.5 |
| 9. Number of overlapping words in SP and ST | 3 |
| 10. Number of overlapping words in SP and SA | 3 |
| 11. Binary variable indicating whether the sets of overlapping words in (SP, ST) and (SP, SA) are different | 0 |
| 12. Cardinality of the set difference of SP and ST | 4 |
| 13. Cardinality of the set difference of SP and SA | 5 |
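A few of the simpler features in the list above (text length, stop word counts, word overlap, and set difference) can be illustrated with a short sketch. Everything here is an assumption for illustration: the tokenizer, the tiny `STOP_WORDS` list, measuring length in tokens, and treating SP and ST as plain word sets, which may differ from how the authors construct them.

```python
import re

# Hypothetical, deliberately small stop word list; a real system would use a fuller one.
STOP_WORDS = frozenset(
    {"a", "an", "the", "is", "are", "can", "for", "of", "and", "to", "in", "my", "i"}
)


def tokenize(text: str) -> list:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def lexical_features(qp: str, qt: str) -> dict:
    """Compute simple pairwise features for two questions Qp and Qt."""
    tokens_p, tokens_t = tokenize(qp), tokenize(qt)
    set_p, set_t = set(tokens_p), set(tokens_t)  # stand-ins for SP and ST (assumption)
    return {
        "len_qp": len(tokens_p),  # text length, measured in tokens here
        "len_qt": len(tokens_t),
        "stop_qp": sum(w in STOP_WORDS for w in tokens_p),
        "stop_qt": sum(w in STOP_WORDS for w in tokens_t),
        "overlap": len(set_p & set_t),   # number of overlapping words
        "set_diff": len(set_p - set_t),  # cardinality of the set difference
    }


if __name__ == "__main__":
    features = lexical_features(
        "Can fully recovered alcoholics drink again",
        "Can a recovered alcoholic drink again?",
    )
    for name, value in features.items():
        print(name, value)
```

The example questions are taken from the corpus annotation table; the resulting numbers depend entirely on the assumed tokenizer and stop word list.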
Corpus annotation examples.
| A target question | A training question | A training answer | Label |
| Can fully recovered alcoholics drink again | Can a recovered alcoholic drink again? | What they say at AA is that there is no such thing as permanent recovery from alcoholism. There are alcoholics who never drink again, but never alcoholics who stop being alcoholics. | valid |
| Can fully recovered alcoholics drink again | If both my parents are recovered alcoholics, will I have a problem with alcohol? | Yes, there is a good chance that you could inherit a tendency towards alcoholism. | invalid |
| Anxiety medication for drug/alcohol addiction? | Is chlordiazepoxide/librium a good medication for alcohol withdrawal and the associated anxiety? | Chlordiazepoxide has been the standard drug used for rapid alcohol detox for decades and has stood the test of time. | valid |
| Anxiety medication for drug/alcohol addiction? | Negative effects of alcohol and ADHD medication? | Drinking in moderation is wise for everyone, but it is imperative for adults with ADHD. | invalid |
Figure 3. Process flow of the testing step.
Figure 4. The Mean Reciprocal Rank (MRR) with a set of test questions Q.
Figure 5. System output.
Figure 6. An example result returned from the algorithm to determine candidate answers.
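The MRR named in Figure 4 is conventionally the average, over all test questions in Q, of the reciprocal of the rank at which the first valid answer appears. A minimal sketch of that standard definition follows; the function name and the convention that a question with no valid answer contributes 0 are assumptions, not details taken from the article.

```python
from typing import Optional, Sequence


def mean_reciprocal_rank(first_valid_ranks: Sequence[Optional[int]]) -> float:
    """Mean reciprocal rank over a set of test questions.

    Each entry is the 1-based rank of the first valid answer returned for one
    question, or None when no valid answer was returned (counted as 0 here).
    """
    if not first_valid_ranks:
        raise ValueError("at least one test question is required")
    total = sum(1.0 / rank if rank is not None else 0.0 for rank in first_valid_ranks)
    return total / len(first_valid_ranks)


if __name__ == "__main__":
    # Three questions whose first valid answers were ranked 1st, 2nd, and 4th.
    print(mean_reciprocal_rank([1, 2, 4]))
```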
Information gain scores of 5 significant features.
| Features | Information gain |
| 1. Number of stop words contained in Qp | 0.0912 |
| 2. Text length of Qp | 0.0804 |
| 3. DTW(Qp, Qt) | 0.0395 |
| 4. Number of overlapping words in SP and ST | 0.0393 |
| 5. VS(Qp, Qt) | 0.0350 |
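Information gain, as reported in the table above, measures how much knowing a feature reduces the entropy of the class label. The sketch below shows the textbook computation for a discrete feature; the function names are illustrative, and it assumes numeric features such as DTW(Qp, Qt) have already been binned, since the record does not describe how the authors discretized them.

```python
import math
from collections import Counter
from typing import Hashable, Sequence


def entropy(labels: Sequence[Hashable]) -> float:
    """Shannon entropy (in bits) of a label sequence; 0 for an empty sequence."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(feature_values: Sequence[Hashable], labels: Sequence[Hashable]) -> float:
    """Entropy of the labels minus their expected entropy given the feature."""
    if len(feature_values) != len(labels):
        raise ValueError("feature_values and labels must have the same length")
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted average of the label entropy within each feature value.
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


if __name__ == "__main__":
    labels = ["valid", "valid", "invalid", "invalid"]
    print(information_gain([0, 0, 1, 1], labels))  # feature separates the classes
    print(information_gain([0, 1, 0, 1], labels))  # feature carries no information
```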
Evaluation metrics.
| Evaluation | Supervised: NNET | Supervised: NNET_L2 | Supervised: SVM (a) | Supervised: LOG (b) | EM, 1 iteration: NNET | EM, 1 iteration: NNET_L2 | EM, 1 iteration: SVM | EM, 1 iteration: LOG | EM, 10 iterations: NNET | EM, 10 iterations: NNET_L2 | EM, 10 iterations: SVM | EM, 10 iterations: LOG |
| | 0.5818 | 0.6993 | 0.6305 | 0.6245 | 0.8623 | 0.7105 | 0.6774 | 0.6473 | 0.8491 | 0.71 | 0.6783 | 0.6478 |
| | 0.4216 | 0.5534 | 0.6224 | 0.6336 | 0.5686 | 0.6339 | 0.631 | 0.6266 | 0.5681 | 0.6332 | 0.6313 | 0.628 |
| | 0.1 | 0.3786 | 0.3045 | 0.3214 | 0.3222 | 0.3996 | 0.3667 | 0.3622 | 0.316 | 0.3977 | 0.3656 | 0.3626 |
| | 0.0746 | 0.3614 | 0.4803 | 0.5073 | 0.2294 | 0.3981 | 0.4493 | 0.4421 | 0.2219 | 0.3942 | 0.4478 | 0.44 |
| | 0.1433 | 0.4 | 0.241 | 0.2659 | 0.6801 | 0.4214 | 0.3239 | 0.3224 | 0.6562 | 0.4209 | 0.3229 | 0.3233 |
(a) SVM: support vector machine.
(b) LOG: logistic regression.
(c) MRR: mean reciprocal rank.
Figure 7. The confusion matrices for 1 iteration of EM trained with NNET, NNET_L2, SVM, and LOG.
Figure 8. Performance between the original and adjusted model to test significance of UMLS-based features (health features).