| Literature DB >> 29220467 |
Theodore B Wright1, David Ball1, William Hersh1.
Abstract
Database URL: https://biocaddie.org/benchmark-data.Entities:
Mesh:
Year: 2017 PMID: 29220467 PMCID: PMC5737054 DOI: 10.1093/database/bax065
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Software dependencies
| Role | Software |
|---|---|
| Indexing and Query Processor | Elasticsearch 5.0.0 ( |
| Programming Language | Python 3.5 ( |
| Natural Language Processing (NLP) Framework | Natural Language Tool Kit (NLTK) ( |
| Python Application Programming Interface (API) to Search Service | Elasticsearch-py ( |
| Elasticsearch_dsl ( | |
| API to Entrez | BioPython ( |
| Other Dependencies | Oracle Java Runtime Environment 1.8 |
| Operating System | Microsoft Windows 10 64-bit |
Figure 1.Method overview.
Figure 2.Example query processing.
Submission run characteristics
| Run ID | Max Mesh Terms per token | MeSH Term Relative Weight (MeSH:Baseline) |
|---|---|---|
| OHSU-1 | NA | NA |
| OHSU-2 | 5 | 1:01 |
| OHSU-3 | 5 | 1:02 |
| OHSU-4 | 5 | 1:05 |
| OHSU-5 | 20 | 1:02 |
Official OHSU bioCADDIE challenge results
| Run ID | infAP | infNDCG | NDCG@10 | P@10 | P@10 |
|---|---|---|---|---|---|
| (+partial) | (−partial) | ||||
| OHSU-1 | 0.3193 | 0.3965 | 0.3333 | ||
| OHSU-2 | 0.1396 | 0.4024 | 0.3953 | 0.48 | 0.1933 |
| OHSU-3 | 0.1921 | 0.4405 | 0.5345 | 0.6533 | 0.28 |
| OHSU-4 | 0.2862 | 0.3333 | |||
| OHSU-5 | 0.083 | 0.3156 | 0.2531 | 0.34 | 0.1133 |
Bolded scores emphasize high performance runs.
Figure 3.Official bioCADDIE challenge results—all participants, best infNDCG.
Figure 4.Score breakdown by query.
Figure 5.MeSH terms and Weight analysis.
Best Mesh Terms and Weight (Wt- Baseline:MeSH) per query compared with baseline score
| Query | Baseline infNDCG | Best term | Best Wt | Theoretical Best infNDCG | Improvement over baseline |
|---|---|---|---|---|---|
| 0.470 | 10 | 1:6 | 0.673 | 0.20 | |
| 0.382 | 4 | 1:1 | 0.608 | 0.23 | |
| 0.691 | 5 | 1:10 | 0.688 | 0.00 | |
| 0.442 | 4 | 1:4 | 0.449 | 0.01 | |
| 0.306 | 2 | 1:5 | 0.305 | 0.00 | |
| 0.495 | 4 | 1:1 | 0.631 | 0.14 | |
| 0.303 | 5 | 1:1 | 0.884 | 0.58 | |
| 0.181 | 4 | 1:4 | 0.244 | 0.06 | |
| 0.350 | 10 | 1:2 | 0.631 | 0.28 | |
| 0.305 | 10 | 1:5 | 0.375 | 0.07 | |
| 0.369 | 10 | 1:6 | 0.67 | 0.30 | |
| 0.290 | 2 | 1:8 | 0.284 | −0.01 | |
| 0.235 | 10 | 1:1 | 0.243 | 0.01 | |
| 0.435 | 5 | 1:6 | 0.611 | 0.18 | |
| 0.696 | 1 | 1:1 | 0.746 | 0.05 |