| Literature DB >> 26034558 |
Nigel Collier1, Anika Oellrich2, Tudor Groza3.
Abstract
BACKGROUND: Phenotypes form the basis for determining the existence of a disease against the given evidence. Much of this evidence though remains locked away in text - scientific articles, clinical trial reports and electronic patient records (EPR) - where authors use the full expressivity of human language to report their observations.Entities:
Year: 2015 PMID: 26034558 PMCID: PMC4450611 DOI: 10.1186/s13326-015-0019-z
Source DB: PubMed Journal: J Biomed Semantics
ShARE/CLEF e-health training corpus semantic types
|
|
|
|
|
|
|---|---|---|---|---|
| T047 | Disease or syndrome | 1803 | 410 | 1.97 |
| T184 | Sign or symptom | 842 | 163 | 1.56 |
| T046 | Pathologic function | 518 | 133 | 1.65 |
| T037 | Injury or poisoning | 213 | 96 | 2.00 |
| T019 | Congenital abnormality | 184 | 25 | 3.61 |
| T190 | Anatomical abnormality | 103 | 36 | 1.77 |
| T191 | Neoplastic process | 92 | 49 | 1.87 |
| T048 | Mental or behavioral dysfunction | 84 | 32 | 1.76 |
| T033 | Finding | 45 | 15 | 2.90 |
| T020 | Acquired abnormality | 40 | 17 | 1.93 |
Distribution of UMLS semantic types for annotations by frequency and frequency without duplication as well as the average term length in tokens.
ShARE/CLEF e-health test corpus semantic types
|
|
|
|
|
|
|---|---|---|---|---|
| T047 | Disease or syndrome | 1723 | 371 | 1.88 |
| T184 | Sign or symptom | 816 | 149 | 1.51 |
| T046 | Pathologic function | 520 | 113 | 1.59 |
| T037 | Injury or poisoning | 106 | 33 | 1.75 |
| T019 | Congenital abnormality | 96 | 18 | 1.88 |
| T190 | Anatomical abnormality | 125 | 26 | 1.74 |
| T191 | Neoplastic process | 73 | 34 | 2.02 |
| T048 | Mental or behavioral dysfunction | 137 | 32 | 1.67 |
| T033 | Finding | 13 | 6 | 1.11 |
| T020 | Acquired abnormality | 41 | 21 | 1.62 |
Distribution of UMLS semantic types for annotations by frequency and frequency without duplication as well as the average term length in tokens.
Figure 1Example of sentence annotations from the ShARE/CLEF corpus. The example shows concept annotations for ‘headache; (C0018681 | T184), ‘neck stiffness’ (CO151315 | T184) and ‘unable to walk’ (C0560048 | T033). An example decomposition for ‘neck stiffness’ is shown with an illustrative mapping to PATO:0001545 (‘inflexible’) and FMA:Neck.
Feature blocks used to build the ensemble model
|
|
|
|---|---|
| FB1 | A Boolean set of features for the system identifiers (i.e. M1 … M9); |
| FB2 | A Boolean set of features for the semantic types that are predicted by the system to appear and not appear in the sentence (i.e. T047, T184, … etc.); |
| FB3 | A set of integer valued features for the counts of vocabulary terms appearing in UMLS concepts that are predicted by the systems to appear in the sentence; In total the set consisted of 1,008 UMLS CUIs; |
| FB4 | A set of integer valued features for the counts of vocabulary terms appearing in the sentence; The vocabulary consisted of 13,565 terms; |
| FB5 | A set of integer valued features for the ‘45 cluster’ distributed semantic classes which match to FB3. The 45 cluster classes derived by Richard Socher and Christoph Manning from PubMed are available at |
Brief comparative overview on the learn to rank approaches, adapted from [ 34 ]
|
|
| |
|---|---|---|
|
| Ranking by learning on object pairs | Ranking by learning on object lists |
|
| pairwise loss, e.g., hinge loss, exponential loss, logistic loss | listwise loss, e.g., cross entropy loss, cosine loss |
|
| Theoretical aspects are well studied | Considers the relationship among objects to their full extent |
|
| Considers only pairwise orders; May be biased towards lists with more objects | Theoretical aspects are less well studied |
|
| SVMRank [ | ListNet [ |
Comparison of stand-alone systems on training data
|
|
|
|
|
|---|---|---|---|
| M1: NCBO Annotator | 0.0393 | 0.5044 | 0.0729 |
| M2: BeCAS | 0.0146 | 0.0134 | 0.0140 |
| M3: Apache cTAKES | 0.0933 | 0.5675 |
|
| M4: MetaMap -A -negex | 0.0389 | 0.2992 | 0.0689 |
| M5: MetaMap -A -y | 0.0498 | 0.2505 | 0.0831 |
| M6: MetaMap -g | 0.0387 | 0.2905 | 0.0683 |
| M7: MetaMap -i | 0.0392 | 0.2994 | 0.0693 |
| M8: MetaMap | 0.0389 | 0.2992 | 0.0689 |
| M9: MetaMap -A | 0.0389 | 0.2992 | 0.0689 |
Macro precision, recall and F1 of the individual systems on the training data. The highest scoring system F1 is shown in bold.
Comparison of stand-alone systems on training data
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| T047 | M1 | 0.39 | 0.55 | 0.45 | T191 | M1 | 0.24 | 0.30 | 0.26 |
| M2 | 0.03 | 0.01 | 0.02 | M2 | 0.05 | 0.03 | 0.04 | ||
| M3 | 0.44 | 0.63 |
| M3 | 0.29 | 0.64 |
| ||
| M4 | 0.58 | 0.28 | 0.38 | M4 | 0.21 | 0.25 | 0.23 | ||
| M5 |
| 0.22 | 0.34 | M5 |
| 0.23 | 0.28 | ||
| M6 | 0.58 | 0.27 | 0.37 | M6 | 0.21 | 0.25 | 0.23 | ||
| M7 | 0.58 | 0.28 | 0.38 | M7 | 0.22 | 0.25 | 0.23 | ||
| M8 | 0.58 | 0.28 | 0.38 | M8 | 0.21 | 0.25 | 0.23 | ||
| M9 | 0.58 | 0.28 | 0.38 | M9 | 0.21 | 0.25 | 0.23 | ||
| T184 | M1 | 0.35 | 0.61 | 0.45 | T048 | M1 | 0.28 | 0.49 | 0.35 |
| M2 | 0.02 | 0.01 | 0.01 | M2 | 0.04 | 0.03 | 0.03 | ||
| M3 | 0.47 | 0.58 |
| M3 | 0.45 | 0.55 |
| ||
| M4 | 0.62 | 0.41 | 0.49 | M4 | 0.53 | 0.34 | 0.42 | ||
| M5 |
| 0.36 | 0.47 | M5 |
| 0.27 | 0.38 | ||
| M6 | 0.61 | 0.40 | 0.49 | M6 | 0.54 | 0.35 | 0.43 | ||
| M7 | 0.61 | 0.41 | 0.49 | M7 | 0.54 | 0.34 | 0.42 | ||
| M8 | 0.62 | 0.41 | 0.49 | M8 | 0.53 | 0.34 | 0.42 | ||
| M9 | 0.62 | 0.41 | 0.49 | M9 | 0.53 | 0.34 | 0.42 | ||
| T046 | M1 | 0.28 | 0.62 | 0.39 | T033 | M1 | 0.01 | 0.36 | 0.01 |
| M2 | 0.03 | 0.04 | 0.03 | M2 | 0.00 | 0.00 | 0.00 | ||
| M3 | 0.30 | 0.69 |
| M3 | 0.00 | 0.11 | 0.00 | ||
| M4 | 0.50 | 0.34 | 0.40 | M4 | 0.00 | 0.13 | 0.01 | ||
| M5 |
| 0.26 | 0.34 | M5 | 0.00 | 0.13 | 0.01 | ||
| M6 | 0.49 | 0.33 | 0.39 | M6 | 0.00 | 0.13 | 0.01 | ||
| M7 | 0.50 | 0.34 | 0.41 | M7 | 0.00 | 0.13 | 0.01 | ||
| M8 | 0.50 | 0.34 | 0.40 | M8 | 0.00 | 0.13 | 0.01 | ||
| M9 | 0.50 | 0.34 | 0.40 | M9 | 0.00 | 0.13 | 0.01 | ||
| T037 | M1 | 0.19 | 0.24 | 0.21 | T020 | M1 | 0.36 | 0.50 |
|
| M2 | 0.00 | 0.00 | 0.00 | M2 | 0.00 | 0.00 | 0.00 | ||
| M3 | 0.26 | 0.34 |
| M3 | 0.33 | 0.57 |
| ||
| M4 | 0.38 | 0.22 | 0.28 | M4 | 0.36 | 0.36 | 0.36 | ||
| M5 |
| 0.21 | 0.28 | M5 |
| 0.21 | 0.27 | ||
| M6 | 0.36 | 0.20 | 0.25 | M6 | 0.36 | 0.33 | 0.35 | ||
| M7 | 0.37 | 0.21 | 0.27 | M7 | 0.36 | 0.36 | 0.36 | ||
| M8 | 0.38 | 0.22 | 0.28 | M8 | 0.36 | 0.36 | 0.36 | ||
| M9 | 0.38 | 0.22 | 0.28 | M9 | 0.36 | 0.36 | 0.36 | ||
| T190 | M1 | 0.12 | 0.44 | 0.19 | T019 | M1 | 0.40 | 0.11 | 0.18 |
| M2 | 0.01 | 0.01 | 0.01 | M2 | 0.00 | 0.00 | 0.00 | ||
| M3 | 0.12 | 0.55 | 0.19 | M3 | 0.58 | 0.14 |
| ||
| M4 | 0.28 | 0.22 |
| M4 | 0.27 | 0.07 | 0.11 | ||
| M5 |
| 0.19 | 0.24 | M5 | 0.34 | 0.06 | 0.11 | ||
| M6 | 0.28 | 0.22 |
| M6 | 0.25 | 0.07 | 0.11 | ||
| M7 | 0.28 | 0.22 |
| M7 | 0.27 | 0.07 | 0.11 | ||
| M8 | 0.28 | 0.22 |
| M8 | 0.27 | 0.07 | 0.11 | ||
| M9 | 0.28 | 0.22 |
| M9 | 0.27 | 0.07 | 0.11 |
Type-based micro precision, recall and F1 of the individual systems on the training data. The highest scoring system F1 for each semantic type is shown in bold. Note that italics scores indicate the highest level achieved for recall and precision for each semantic type by any system.
Learn to rank on training data
|
|
| |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Top-1 | Union | SVMRank | 0.1513 | 0.5960 |
| 0.25 | 0.04 |
| 0.01 | 0.00 | 0.01 | 0.00 | - | 0.23 |
| ListNet | 0.1153 | 0.5880 | 0.1928 |
| 0.03 | 0.21 | 0.02 | - | 0.00 | - | - | - | ||
| RankNet | 0.0924 | 0.5206 | 0.1570 |
| - | - | - | - | - | - | - | - | ||
| RankBoost | 0.1296 | 0.6125 | 0.2139 | 0.46 | 0.05 |
| - | - | 0.01 | 0.28 | 0.28 | 0.28 | ||
| Oracle | SVMRank | 0.1513 | 0.5960 | 0.2413 | 0.25 | 0.04 |
| 0.01 | 0.00 | 0.01 | 0.00 | - | 0.23 | |
| ListNet | 0.1153 | 0.5880 | 0.1928 |
| 0.03 | 0.21 | 0.02 | 0.00 | - | - | - | |||
| RankNet | 0.0924 | 0.5206 | 0.1570 |
| - | - | - | - | - | - | - | - | ||
| RankBoost | 0.1791 | 0.6113 |
| 0.27 | 0.04 |
| - | - | 0.01 | 0.18 | 0.00 | - | ||
| Top-2 | Union | SVMRank | 0.1122 | 0.6426 |
| 0.58 | 0.07 |
| 0.03 | 0.00 | 0.02 | 0.00 | 0.23 | 0.40 |
| ListNet | 0.0996 | 0.6566 | 0.1730 |
| 0.06 | 0.88 | 0.10 | 0.01 | 0.01 | 0.00 | - | - | ||
| RankNet | 0.0989 | 0.6477 | 0.1716 |
| - |
| - | - | - | - | - | |||
| RankBoost | 0.1084 | 0.6469 | 0.1857 | 0.55 | 0.09 | 0.61 | 0.01 | 0.00 | 0.28 |
|
| - | ||
| Oracle | SVMRank | 0.2340 | 0.6316 | 0.3415 | 0.09 | 0.00 |
| 0.01 | 0.00 | 0.01 | - | - | 0.23 | |
| ListNet | 0.2390 | 0.6439 | 0.3486 | 0.19 | 0.00 |
| 0.05 | 0.01 | 0.00 | - | - | - | ||
| RankNet | 0.2533 | 0.6363 |
| 0.17 | - |
| - | - | - | - | - | - | ||
| RankBoost | 0.2385 | 0.6359 | 0.3469 | 0.11 | 0.00 |
| - | - | 0.02 | 0.29 | 0.00 | - | ||
| Top-3 | Union | SVMRank | 0.1051 | 0.6545 |
| 0.61 | 0.18 |
| 0.07 | 0.02 | 0.03 | 0.24 | 0.40 | 0.77 |
| ListNet | 0.0921 | 0.6761 | 0.1621 |
| 0.60 | 0.96 | 0.40 | 0.03 | 0.03 | 0.01 | - | - | ||
| RankNet | 0.0943 | 0.6486 | 0.1647 |
|
|
| - | - | - | - | - | |||
| RankBoost | 0.1048 | 0.6532 | 0.1806 | 0.56 | 0.11 | 0.63 | 0.29 | 0.23 | 0.75 |
|
| - | ||
| Oracle | SVMRank | 0.2469 | 0.6409 | 0.3565 | 0.07 | 0.00 |
| 0.02 | 0.01 | 0.01 | 0.00 | - | 0.28 | |
| ListNet | 0.2716 | 0.6596 |
| 0.13 | 0.00 |
| 0.14 | 0.01 | 0.00 | - | - | - | ||
| RankNet | 0.2536 | 0.6367 | 0.3627 | 0.17 | 0.00 |
| - | - | - | - | - | - | ||
| RankBoost | 0.2553 | 0.6397 | 0.3650 | 0.10 | 0.00 |
| - | 0.09 | 0.02 | 0.23 | 0.00 | - | ||
Macro precision, recall and F1 at different top K levels. The highest scoring system F1 for each level (both union and oracle strategies) is shown in bold. The table also shows the individual contribution of the systems to the final score where italics scores indicate the highest contributing individual system(s) to each ensemble.
Type-based learn to rank on training data
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| T019 | SVMRank | 0.46 | 0.15 |
| T047 | SVMRank | 0.53 | 0.64 |
|
| ListNet | 0.47 | 0.13 | 0.20 | ListNet | 0.42 | 0.65 | 0.51 | ||
| RankNet | 0.47 | 0.11 | 0.18 | RankNet | 0.43 | 0.57 | 0.49 | ||
| RankBoost | 0.43 | 0.16 |
| RankBoost | 0.46 | 0.68 | 0.55 | ||
| T020 | SVMRank | 0.32 | 0.57 |
| T048 | SVMRank | 0.44 | 0.62 | 0.52 |
| ListNet | 0.29 | 0.50 | 0.36 | ListNet | 0.42 | 0.54 | 0.47 | ||
| RankNet | 0.33 | 0.42 | 0.37 | RankNet | 0.39 | 0.51 | 0.44 | ||
| RankBoost | 0.21 | 0.50 | 0.30 | RankBoost | 0.48 | 0.67 |
| ||
| T033 | SVMRank | 0.01 | 0.32 | 0.02 | T184 | SVMRank | 0.46 | 0.63 |
|
| ListNet | 0.01 | 0.48 | 0.02 | ListNet | 0.40 | 0.66 | 0.50 | ||
| RankNet | 0.01 | 0.48 |
| RankNet | 0.39 | 0.62 | 0.48 | ||
| RankBoost | 0.01 | 0.45 | 0.02 | RankBoost | 0.45 | 0.64 |
| ||
| T037 | SVMRank | 0.34 | 0.33 |
| T190 | SVMRank | 0.20 | 0.61 |
|
| ListNet | 0.26 | 0.24 | 0.25 | ListNet | 0.17 | 0.58 | 0.27 | ||
| RankNet | 0.22 | 0.21 | 0.22 | RankNet | 0.15 | 0.44 | 0.22 | ||
| RankBoost | 0.30 | 0.29 | 0.29 | RankBoost | 0.16 | 0.60 | 0.25 | ||
| T046 | SVMRank | 0.29 | 0.66 |
| T191 | SVMRank | 0.31 | 0.56 | 0.40 |
| ListNet | 0.25 | 0.67 | 0.36 | ListNet | 0.37 | 0.44 | 0.40 | ||
| RankNet | 0.27 | 0.61 | 0.37 | RankNet | 0.35 | 0.36 | 0.36 | ||
| RankBoost | 0.24 | 0.67 | 0.35 | RankBoost | 0.35 | 0.56 |
|
Note that the highest scoring system F1 for each semantic type is shown in bold.
Learn to rank on test data
|
|
| |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| Top-1 | Union | SVMRank | 0.1712 | 0.6426 |
| 0.17 | 0.04 |
| - | - | 0.01 | 0.00 | - | 0.28 |
| ListNet | 0.1271 | 0.6170 | 0.2108 |
| 0.04 | 0.26 | 0.05 | - | 0.00 | - | - | - | ||
| RankNet | 0.0923 | 0.5096 | 0.1562 |
| - | - | - | - | - | - | - | - | ||
| RankBoost | 0.1408 | 0.6524 | 0.2316 | 0.43 | 0.05 |
| - | - | 0.01 | 0.28 | 0.28 | 0.28 | ||
| Oracle | SVMRank | 0.1712 | 0.6426 | 0.2703 | 0.17 | 0.04 |
| - | - | 0.01 | 0.00 | - | 0.28 | |
| ListNet | 0.1271 | 0.6170 | 0.2108 |
| 0.04 | 0.26 | 0.05 | - | 0.00 | - | - | - | ||
| RankNet | 0.0923 | 0.5096 | 0.1562 |
| - | - | - | - | - | - | - | - | ||
| RankBoost | 0.1872 | 0.6504 |
| 0.23 | 0.05 |
| - | - | 0.01 | 0.20 | 0.00 | - | ||
| Top-2 | Union | SVMRank | 0.1244 | 0.6986 |
| 0.51 | 0.07 |
| - | - | 0.01 | 0.00 | 0.28 | 0.50 |
| ListNet | 0.1107 | 0.7109 | 0.1915 | 0.87 | 0.07 |
| 0.13 | 0.03 | 0.02 | 0.00 | - | - | ||
| RankNet | 0.1070 | 0.7028 | 0.1857 |
| - |
| - | - | - | - | - | - | ||
| RankBoost | 0.1188 | 0.7034 | 0.2032 | 0.53 | 0.09 | 0.62 | 0.01 | 0.01 | 0.28 |
|
|
| ||
| Oracle | SVMRank | 0.2350 | 0.6869 | 0.3501 | 0.07 | 0.00 |
| - | - | 0.00 | 0.00 | - | 0.29 | |
| ListNet | 0.2534 | 0.6981 | 0.3718 | 0.14 | 0.00 |
| 0.06 | 0.02 | 0.00 | 0.00 | - | - | ||
| RankNet | 0.2629 | 0.6905 |
| 0.15 | - |
| - | - | - | - | - | - | ||
| RankBoost | 0.2420 | 0.6908 | 0.3584 | 0.10 | 0.00 |
| - | 0.01 | 0.02 | 0.29 | 0.01 | - | ||
| Top-3 | Union | SVMRank | 0.1157 | 0.7081 |
| 0.53 | 0.10 | 0.63 | - | 0.01 | 0.28 | 0.50 |
| |
| ListNet | 0.1019 | 0.7287 | 0.1788 | 0.91 | 0.57 |
| 0.44 | 0.09 | 0.06 | 0.02 | 0.00 | - | ||
| RankNet | 0.1029 | 0.7045 | 0.1796 |
|
|
| - | - | - | - | - | - | ||
| RankBoost | 0.1128 | 0.7109 | 0.1947 | 0.54 | 0.11 | 0.64 | 0.29 | 0.22 | 0.75 |
|
|
| ||
| Oracle | SVMRank | 0.2444 | 0.6933 | 0.3615 | 0.06 | 0.00 |
| - | 0.01 | 0.00 | - | 0.33 | ||
| ListNet | 0.2773 | 0.7126 |
| 0.11 | 0.00 |
| 0.12 | 0.04 | 0.01 | 0.00 | - | - | ||
| RankNet | 0.2643 | 0.6914 | 0.3824 | 0.15 | 0.01 |
| - | - | - | - | - | - | ||
| RankBoost | 0.2593 | 0.6956 | 0.3777 | 0.09 | 0.00 |
| - | 0.10 | 0.02 | 0.22 | 0.00 | - | ||
Macro precision, recall and F1 at different top K levels. The highest scoring system F1 for each level (both union and oracle strategies) is shown in bold. The table also shows the individual contribution of the systems to the final score where italics scores indicate the highest contributing individual system(s) to each ensemble.
Type-based learn to rank on test data
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| T019 | SVMRank | 0.51 | 0.21 |
| T047 | SVMRank | 0.52 | 0.66 |
|
| ListNet | 0.59 | 0.18 | 0.28 | ListNet | 0.41 | 0.65 | 0.50 | ||
| RankNet | 0.54 | 0.16 | 0.24 | RankNet | 0.38 | 0.51 | 0.44 | ||
| RankBoost | 0.47 | 0.21 |
| RankBoost | 0.45 | 0.68 | 0.54 | ||
| T020 | SVMRank | 0.29 | 0.53 | 0.37 | T048 | SVMRank | 0.48 | 0.68 |
|
| ListNet | 0.34 | 0.50 |
| ListNet | 0.40 | 0.49 | 0.44 | ||
| RankNet | 0.34 | 0.48 | 0.40 | RankNet | 0.38 | 0.48 | 0.43 | ||
| RankBoost | 0.29 | 0.55 | 0.38 | RankBoost | 0.44 | 0.68 | 0.54 | ||
| T033 | SVMRank | 0.00 | 0.07 | 0.00 | T184 | SVMRank | 0.52 | 0.62 |
|
| ListNet | 0.00 | 0.27 | 0.00 | ListNet | 0.44 | 0.61 | 0.51 | ||
| RankNet | 0.00 | 0.27 |
| RankNet | 0.40 | 0.58 | 0.47 | ||
| RankBoost | 0.00 | 0.20 | 0.00 | RankBoost | 0.48 | 0.62 | 0.54 | ||
| T037 | SVMRank | 0.37 | 0.50 |
| T190 | SVMRank | 0.28 | 0.69 |
|
| ListNet | 0.35 | 0.46 | 0.40 | ListNet | 0.28 | 0.65 | 0.39 | ||
| RankNet | 0.31 | 0.44 | 0.37 | RankNet | 0.25 | 0.55 | 0.34 | ||
| RankBoost | 0.35 | 0.49 | 0.41 | RankBoost | 0.28 | 0.68 | 0.39 | ||
| T046 | SVMRank | 0.37 | 0.70 |
| T191 | SVMRank | 0.27 | 0.54 |
|
| ListNet | 0.37 | 0.70 |
| ListNet | 0.23 | 0.34 | 0.27 | ||
| RankNet | 0.36 | 0.56 | 0.44 | RankNet | 0.17 | 0.24 | 0.20 | ||
| RankBoost | 0.35 | 0.70 | 0.47 | RankBoost | 0.25 | 0.53 | 0.34 |
Note that the highest scoring system F1 for each semantic type is shown in bold.