| Literature DB >> 22185221 |
Halil Kilicoglu1, Graciela Rosemblat, Marcelo Fiszman, Thomas C Rindflesch.
Abstract
BACKGROUND: Semantic relations increasingly underpin biomedical text mining and knowledge discovery applications. The success of such practical applications crucially depends on the quality of extracted relations, which can be assessed against a gold standard reference. Most such references in biomedical text mining focus on narrow subdomains and adopt different semantic representations, rendering them difficult to use for benchmarking independently developed relation extraction systems. In this article, we present a multi-phase gold standard annotation study, in which we annotated 500 sentences randomly selected from MEDLINE abstracts on a wide range of biomedical topics with 1371 semantic predications. The UMLS Metathesaurus served as the main source for conceptual information and the UMLS Semantic Network for relational information. We measured interannotator agreement and analyzed the annotations closely to identify some of the challenges in annotating biomedical text with relations based on an ontology or a terminology.Entities:
Mesh:
Year: 2011 PMID: 22185221 PMCID: PMC3281188 DOI: 10.1186/1471-2105-12-486
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Sample annotation provided to the annotators. The sample annotation provided to the annotators before the practice phase. The first line of the annotation corresponds to the subject, the second line to the predicate and the third line to the object of the predication. Some fields are not shown for readability.
Overall semantic predication statistics in the practice phase on the set of 50 sentences annotated by all three annotators
| Annotator | # of Predications | Per sentence | Max. per sentence |
|---|---|---|---|
| A | 130 | 2.60 | 12 |
| B | 156 | 3.12 | 9 |
| C | 116 | 2.32 | 11 |
Top ontological predicates and their annotation frequency in the practice phase
| Predicate | Average Count | % | A | B | C |
|---|---|---|---|---|---|
| LOCATION_OF | 19.3 | 14.4 | 26 | 19 | 13 |
| PROCESS_OF | 17.3 | 12.9 | 14 | 28 | 10 |
| INHIBITS | 12.3 | 9.2 | 16 | 7 | 14 |
| INTERACTS_WITH | 11.3 | 8.4 | 13 | 4 | 17 |
| ISA | 10.3 | 7.7 | 12 | 9 | 10 |
| PART_OF | 7.3 | 5.4 | 3 | 14 | 5 |
| TREATS | 7 | 5.2 | 5 | 11 | 5 |
| CAUSES | 7 | 5.2 | 7 | 5 | 9 |
Top indicator types and their annotation frequency in the practice phase
| Predicate Type | Average Count | A | B | C |
|---|---|---|---|---|
| PREP | 37 | 39 | 47 | 25 |
| VERB | 27 | 26 | 25 | 30 |
| NOM | 24 | 19 | 23 | 30 |
| MOD_HEAD | 19.3 | 14 | 32 | 12 |
| PART | 12.7 | 21 | 10 | 7 |
Inter-annotator agreement (IAA) in the practice phase, calculated as F-measure among pairs of annotators
| Pair | A-B | A-C | B-C |
|---|---|---|---|
| IAA ( | 0.415 | 0.475 | 0.378 |
| IAA ( | 0.428 | 0.500 | 0.434 |
Overall semantic predication statistics in the main annotation phase on the set of 500 sentences annotated by two annotators
| Annotator | # of Predications | Per sentence | Max. per sentence |
|---|---|---|---|
| A | 1293 | 2.59 | 24 |
| B | 1344 | 2.69 | 22 |
Interannotator agreement (A-B) in the main annotation phase, calculated as F-measure among the pair of annotators
| Equivalence Criteria | ||||
|---|---|---|---|---|
| 0.500 (0.535) | 0.624 | 0.536 | 0.655 | |
| 0.530 (0.567) | 0.654 | 0.566 | 0.684 | |
| 0.505 (0.539) | 0.628 | 0.542 | 0.659 | |
| 0.659 | 0.573 | 0.688 | ||
PE: predication equivalence, GP: gene/gene product correspondence, CK: conceptual knowledge, RK: relational knowledge, CRK: conceptual and relational knowledge
Top indicator types and their frequencies in the main annotation phase
| Predicate Type | Average Count | A | B |
|---|---|---|---|
| PREP | 462 | 467 | 457 |
| VERB | 243 | 257 | 229 |
| MOD_HEAD | 199 | 179 | 219 |
| NOM | 136 | 125 | 147 |
| SPEC | 101 | 96 | 106 |
Most frequent ontological predicates and interannotator agreement specific to these predicates
| Predicate | Average Count | % | A | B | IAA |
|---|---|---|---|---|---|
| PROCESS_OF | 236 | 17.9 | 226 | 246 | 0.755 |
| LOCATION_OF | 199 | 15.0 | 198 | 200 | 0.578 |
| PART_OF | 164.5 | 12.5 | 150 | 179 | 0.500 |
| TREATS | 118.5 | 9.0 | 124 | 113 | 0.591 |
| AFFECTS | 104 | 7.9 | 88 | 120 | 0.308 |
| ISA | 101 | 7.7 | 96 | 106 | 0.593 |
| CAUSES | 57.5 | 4.4 | 52 | 63 | 0.561 |
| USES | 50.5 | 3.8 | 60 | 41 | 0.495 |
| INTERACTS_WITH | 46.5 | 3.5 | 61 | 32 | 0.366 |
| ADMINISTERED_TO | 35.5 | 2.7 | 26 | 45 | 0.500 |
Highest and lowest agreement rates by ontological predicates, annotated more than 10 times
| Predicate | IAA | Predicate | IAA |
|---|---|---|---|
| PROCESS_OF | 0.755 | INHIBITS | 0.400 |
| PREVENTS | 0.667 | PRODUCES | 0.367 |
| ISA | 0.593 | INTERACTS_WITH | 0.366 |
| TREATS | 0.591 | PRECEDES | 0.320 |
| DIAGNOSES | 0.585 | ASSOCIATED_WITH | 0.320 |
| LOCATION_OF | 0.578 | AFFECTS | 0.308 |
| CAUSES | 0.561 | STIMULATES | 0.238 |
| PART_OF | 0.500 | DISRUPTS | 0.214 |
Highest (column 1-3) and lowest (column 4-6) agreement rates for ontological predications annotated more than 10 times (N > 10)
| Ontological Predication | N | IAA | Ontological Predication | N | IAA |
|---|---|---|---|---|---|
| 15 | 0.947 | 13 | 0.375 | ||
| 16 | 0.909 | 16 | 0.222 | ||
| 28 | 0.866 | 10 | 0 | ||
| 141 | 0.859 | 12 | 0 | ||
| 54 | 0.857 | 11 | 0 | ||
aapp: Amino Acid, Peptide, or Protein; celf: Cell Function; cell: Cell; dsyn: Disease or Syndrome; gngm: Gene or Genome; humn: Human; inpo: Injury or Poisoning; mamm: Mammal; neop: Neoplastic Process; rcpt: Receptor; sosy: Sign or Symptom
Top ontological predicates and their annotation frequency in the gold standard
| Ontological Predicate | Count |
|---|---|
| PROCESS_OF | 239 |
| LOCATION_OF | 216 |
| PART_OF | 179 |
| TREATS | 126 |
| ISA | 111 |
| AFFECTS | 99 |
| CAUSES | 62 |
| INTERACTS_WITH | 51 |
| USES | 41 |
| ADMINISTERED_TO | 34 |
Top indicator types and their frequencies in the gold standard
| Indicator Type | Count |
|---|---|
| PREP | 465 |
| VERB | 249 |
| MOD_HEAD | 217 |
| NOM | 141 |
| SPEC | 111 |