| Literature DB >> 18834500 |
William A Baumgartner1, Zhiyong Lu, Helen L Johnson, J Gregory Caporaso, Jesse Paquette, Anna Lindemann, Elizabeth K White, Olga Medvedeva, K Bretonnel Cohen, Lawrence Hunter.
Abstract
BACKGROUND: Reliable information extraction applications have been a long sought goal of the biomedical text mining community, a goal that if reached would provide valuable tools to benchside biologists in their increasingly difficult task of assimilating the knowledge contained in the biomedical literature. We present an integrated approach to concept recognition in biomedical text. Concept recognition provides key information that has been largely missing from previous biomedical information extraction efforts, namely direct links to well defined knowledge resources that explicitly cement the concept's semantics. The BioCreative II tasks discussed in this special issue have provided a unique opportunity to demonstrate the effectiveness of concept recognition in the field of biomedical language processing.Entities:
Mesh:
Year: 2008 PMID: 18834500 PMCID: PMC2559993 DOI: 10.1186/gb-2008-9-s2-s9
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
GM results: performances of systems and individual components on the test and training data.
| Test Data | Training Data | |||||
| Tagger | Precision | Recall | F-measure | Precision | Recall | F-measure |
| CCP | 77.30 | 77.74 | 77.52 | 83.68 | 83.48 | 83.58 |
| ABNER | 80.38 | 73.26 | 76.65 | 83.85 | 80.86 | 82.33 |
| LingPipe | 72.53 | 80.00 | 76.09 | 88.47 | 92.77 | 90.57 |
| 2/3 Majority | 85.54 (2) | 76.83 (3) | 80.95 (3) | 91.15 | 86.33 | 88.68 |
| Unanimous | 92.78 (1) | 49.12 (4) | 64.24 (4) | 94.56 | 61.41 | 74.46 |
| Overlap | 66.22 (4) | 83.72 (2) | 73.94 (4) | 79.27 | 91.17 | 84.80 |
| Median | 85.08 | 79.05 | 81.32 | |||
Presented are median scores, as supplied by organizers. Quartiles for our runs are shown in parentheses. GM, gene mention.
GN results: performance on the development data with and without conjunction resolution.
| Steps | Precision | Recall | F measure |
| Without conjunction resolution | 0.836 | 0.691 | 0.757 |
| With conjunction resolution | 0.827 | 0.727 | 0.774 |
GN, gene normalization.
GN results: performance with and without the gene name disambiguation procedure when evaluated against the development data.
| Steps | Precision | Recall | F measure |
| Without disambiguation | 0.848 | 0.689 | 0.760 |
| Use abbreviations only | 0.825 | 0.722 | 0.770 |
| Use abbreviations and flanking regions | 0.827 | 0.727 | 0.774 |
GN, gene normalization.
GN results: performance of nine heuristics used to filter false-positive gene mentions or modify gene mentions to improve dictionary matching performance.
| Presence of ... | Example | R | F | Modified | ||
| 0 | 0.770 | 0.673 | 0.718 | 0 | ||
| 1 | Gene chromosome location | 3p11-3p12.1 | 0.772 | 0.673 | 0.719 | 34 |
| 2 | Single, short lowercase word | heme | 0.778 | 0.672 | 0.721 | 112 |
| 3 | Strings of only numbers &/or punct | 9+/-76 | 0.779 | 0.672 | 0.722 | 206 |
| 4 | Extra preceding words | protein SNF to SNF | 0.790 | 0.681 | 0.731 | 225a |
| 5 | Extra trailing words | SNF protein to SNF | 0.812 | 0.723 | 0.765 | 419a |
| 6 | Amino acids | Ser-119 | 0.815 | 0.723 | 0.766 | 460 |
| 7 | Protein families | Bcl-2 family proteins | 0.816 | 0.722 | 0.766 | 701 |
| 8 | Protein domains, motifs, fusion | SNH domain | 0.828 | 0.722 | 0.771 | 883 |
| 9 | Nonhuman keywords | rat IFN gamma | 0.829 | 0.725 | 0.774 | 1,086a |
Results depicted here are from the development dataset. Step 0 indicates performance before application of any rules. At each step, the rules of preceding steps are also applied. Modified refers to the cumulative number of gene mentions removed or altered. aRules 4 and 5 result in modification of gene mentions only. Rule 9 can result in either modification or removal of gene mentions. All other rules result in removal of gene mentions. GN, gene normalization.
GN results: performance on the development data using different online resources for lexicon construction.
| Resources | Genes entries | Precision | Recall | F measure |
| Entrez Gene | 21,206 | 0.827 | 0.727 | 0.774 |
| UniProt | 18,580 | 0.834 | 0.591 | 0.692 |
| Entrez Gene + UniProt | 24,182 | 0.827 | 0.708 | 0.762 |
GN, gene normalization.
GN results: performance impact of the seven heuristics used to normalize gene names on the development data.
| Rule | Example | R | F | ||
| 0 | 0.783 | 0.469 | 0.586 | ||
| 1 | Substitution: Roman letters > arabic numerals | carbonic andydrase XI to carbonic andydrase 11 | 0.778 | 0.492 | 0.603 |
| 2 | Substitution: Greek letters > single letters | AP-2alpha to AP-2a | 0.779 | 0.497 | 0.607 |
| 3 | Normalization of case | CAMK2A to camk2a | 0.787 | 0.619 | 0.693 |
| 4 | Removal: parenthesized materials | sialyltransferase 1 (beta-galactoside alpha-2,6-sialytransferase) to sialyltransferase 1 | 0.782 | 0.623 | 0.694 |
| 5 | Removal: punctuation | VLA-2 to VLA2 | 0.768 | 0.667 | 0.714 |
| 6 | Removal: spaces | calcineurin B to calcineurinB | 0.784 | 0.742 | 0.762 |
| 7 | Removal: strings < 2 characters | P | 0.827 | 0.727 | 0.774 |
Presented are the seven heuristics used to normalize gene names in both lexicon construction and during processing of the gene tagger output, and the performance on the development data after each step was performed. GN, gene normalization.
GN results: performance on the GN test data.
| Run | True positives | False positives | False negatives | Precision | Recall | F measure | Quartile |
| 1 | 576 | 109 | 209 | 0.841 | 0.734 | 0.784 | 1 |
| 2 | 583 | 120 | 202 | 0.829 | 0.743 | 0.784 | 1 |
| 3 | 587 | 129 | 198 | 0.820 | 0.748 | 0.782 | 1 |
GN, gene normalization.
IAS methods: the three classifiers used for the IAS subtask.
| Name | Classifier | IG threshold | |
| Run 1 | SVM | RBF kernel, complexity factor 100, gamma 0.001 | 0.0001 |
| Run 2 | Naïve Bayes | kernel estimation enabled | 0.001 |
| Run 3 | SVM with balanced ± | RBF kernel, complexity factor 100, gamma 0.001 | 0.0001 |
IG threshold is the information gain feature selection threshold. IAS, interaction article subtask; RBF, radial basis function; SVM, support vector machine.
ISS methods: scoring requirements.
| Requires | Scored on | |||||||||
| Location | P | N | G | X | I | P | N | G | X | I |
| Abstract | × | × | × | × | ||||||
| Figure/table caption | × | × | × | × | × | × | ||||
| Section/subsection heading | × | × | × | × | ||||||
| Othera | × | × | × | × | × | × | ||||
Column definitions: P = has positive cue words; N = does not have negative cue words; G = has > 0 gene mentions; X = has experimental methods; I = has interaction key word. aIf a sentence includes a reference to a figure or table, the score for the caption is added to the score for the sentence. ISS, interaction sentence subtask.
ISS results: interaction passages extracted from the ISS task test data.
| Run | Passages | TP | Unique | U_TP | TP/Passages | U_TP/Unique | MRR |
| Run #1 | 372 | 51 | 361 | 51 | 13.71 | 14.13 | 1.0 |
| Run #2 | 372 | 71 | 361 | 70 | 19.09 | 19.39 | 1.0 |
Column headings: Passages = the total number of passages evaluated; TP = the number of evaluated passages that were preselected by human curators; Unique = the number of unique passages evaluated; U_TP, the number of unique passages that were pre-selected; MRR = mean reciprocal rank of the correct passages. ISS, interaction sentence subtask.
Figure 1IPS: steps of the protein-protein interaction extraction system. IPS, interaction pair subtask.
IPS results: comparison of interaction pairs results on the IPS task test data.
| Calculated by interaction | Calculated by article | |||||
| R | F | R | F | |||
| Run 1 | 0.38 | 0.06 | 0.11 | 0.39 | 0.31 | 0.29 |
| Task median | 0.06 | 0.11 | 0.07 | 0.08 | 0.20 | 0.08 |
'Calculated by interaction' indicates each interaction pair extracted was given equal weight. 'Calculated by article' indicates the measure was calculated by averaging over articles. Run1 was tuned to maximize precision. IPS, interaction pair subtask.
IPS results: comparison of normalization results on the IPS task test data.
| Calculated by interactor | Calculated by article | Calculated by article with interactions | |||||||
| R | F | R | F | R | F | ||||
| Run 1 | 0.57 | 0.12 | 0.19 | 0.15 | 0.13 | 0.13 | 0.56 | 0.46 | 0.48 |
| Median | 0.18 | 0.25 | 0.19 | 0.16 | 0.28 | 0.17 | 0.21 | 0.39 | 0.24 |
'Calculated by interactor' indicates each interactor extracted was given equal weight. 'Calculated by article' means the measure was calculated by averaging over articles. 'Calculated by article with interactions' means that only articles in which at least a single prediction submitted by a team was considered in the calculation. IPS, interaction pair subtask.
IPS results: distribution of the predicates of the true positive protein-protein interactions extracted from the IPS task test data.
| Verbs/nominalizations | Predicate | Count |
| Verbs | Interact | 26 |
| Co-localize | 9 | |
| Bind | 7 | |
| Regulate | 7 | |
| Inhibit | 6 | |
| Associate | 5 | |
| Co-immunoprecipitate | 2 | |
| Suppress | 2 | |
| Co-precipitate | 2 | |
| Modulate | 1 | |
| Total | 67 | |
| Nominalizations | Interaction | 68 |
| Association | 29 | |
| Binding | 20 | |
| Co-localization | 2 | |
| Phosphorylation | 1 | |
| Total | 103 |
IPS, interaction pair subtask.