| Literature DB >> 26306280 |
Karla Caballero1, Ram Akella2.
Abstract
Patients often search for information on the web about treatments and diseases after they are discharged from the hospital. However, searching for medical information on the web poses challenges due to related terms and synonymy for the same disease and treatment. In this paper, we present a method that combines Statistical Topics Models, Language Models and Natural Language Processing to retrieve healthcare related documents. In addition, we test if the incorporation of terms extracted from the patient's discharge summary improves the retrieval performance. We show that the proposed framework outperformed the winner of the retrieval CLEF eHealth 2013 challenge by 68% in the MAP measure (0:5226 vs 0:3108), and by 13% in NDCG (0:5202 vs 0:3637). Compared with standard language models, we obtain an improvement of 92% in MAP (0:2666) and 45% in NDCG. (0:3637).Entities:
Year: 2015 PMID: 26306280 PMCID: PMC4525234
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Document features: Single Words and with noun Phrases extracted
| Feature | Single words | Noun phrases |
|---|---|---|
|
| ||
| Vocabulary size | 98; 734 | 101; 497 |
| Average unique terms per document | 767:553 | 760:522 |
Mean Performance Results of the base model and the variants of the model for the test set
| Model | P@5 | P@10 | MAP | NDCG@10 | Doc. Retrieved |
|---|---|---|---|---|---|
|
| |||||
| 0:4520 | 0:4700 | 0:3043 | 0:4169 | 1651 | |
| 0:4960 | 0:5180 | 0:3108 | 0:4665 | 1673 | |
| 0:4040 | 0:4040 | 0:2666 | 0:3637 | 1646 | |
|
| |||||
| K=75 | |||||
|
| |||||
| TM | 0:5183 | 0:4828 | 0:5179 | 1715 | |
| TM+DS | 0:5224 | 0:4960 | 0:4998 | 0:5103 | 1014 |
| TM+DSTF | 0:5008 | 0:5204 | 0:4958 | 1023 | |
| TM+NPh | 0:4920 | 0:5040 | 0:4746 | 0:5059 | 1697 |
| TM+ NPh +DS | 0:5060 | 0:4959 | 0:5041 | 0:4995 | 1433 |
| TM +NPh +DSTF | 0:5320 | 0:5102 | 0:5049 | 0:5170 | 958 |
|
| |||||
| K=100 | |||||
|
| |||||
| TM | 0:5224 | 0:5122 | 0:4840 | 0:5117 | 1722 |
| TM+DS | 0:4840 | 0:5107 | 0:5017 | 1148 | |
| TM+DSTF | 0:5200 | 0:5000 | 0:5202 | 1013 | |
| TM+NPh | 0:5160 | 0:5166 | 0:4849 | 0:5088 | 1694 |
| TM+ NPh +DS | 0:5160 | 0:4645 | 0:4881 | 0:4776 | 991 |
| TM +NPh +DSTF | 0:4020 | 0:4022 | 0:4523 | 0:4235 | 827 |
|
| |||||
| K=150 | |||||
|
| |||||
| TM | 0:5265 | 0:5224 | 0:4877 | 0:5186 | 1718 |
| TM+DS | 0:5326 | 0:5041 | 0:5011 | 0:5155 | 1508 |
| TM+DSTF | 0:5306 | 0:5163 | 0:5200 | 0:5239 | 1445 |
| TM+NPh | 0:4760 | 0:5020 | 0:5167 | 0:4899 | 1670 |
| TM+ NPh +DS | 0.5340 | 0:4632 | 0:4762 | 0:4846 | 842 |
| TM +NPh +DSTF | 0:4709 | 0:4204 | 0:4464 | 0:4325 | 714 |