| Literature DB >> 21347163 |
Abstract
BIOMEDICAL TEXTS CAN BE TYPICALLY REPRESENTED BY FOUR RHETORICAL CATEGORIES: introduction, methods, results and discussion (IMRAD). Classifying sentences into these categories can benefit many other text-mining tasks. Although many studies have applied approaches to automatically classify sentences in MEDLINE abstracts into the IMRAD categories, few have explored the classification of sentences that appear in full-text biomedical articles. We explored different approaches to automatically classify a sentence in a full-text biomedical article into the IMRAD categories. Our best system is a support vector machine classifier that achieved 81.30% accuracy, which is significantly higher than baseline systems.Entities:
Year: 2009 PMID: 21347163 PMCID: PMC3041564
Source DB: PubMed Journal: Summit Transl Bioinform ISSN: 2153-6430
Confidence value assigned by the annotators to the set of 391 sentences
| Annotator2 + Annotator3 | |||||
|---|---|---|---|---|---|
| High | Medium | Low | Total | ||
| Annotatorl (SA) | High | 246 | 72 | 5 | 323 |
| Medium | 38 | 18 | 8 | 64 | |
| Low | 4 | 0 | 0 | 4 | |
| Total | 288 | 90 | 13 | 391 | |
Annotator1 vs. Annotator2+3’s agreement on annotating sentences into the IMRAD categories.
| High Confidence Sentences | All Sentences | |||
|---|---|---|---|---|
| Kappa | OA(%) | Kappa | OA(%) | |
| Introduction | 0.688 | 88.2 | 0.514 | 80.1 |
| Methods | 0.862 | 94.3 | 0.704 | 89.0 |
| Results | 0.756 | 90.7 | 0.58 | 85.2 |
| Discussion | 0.532 | 84.6 | 0.358 | 75.7 |
OA: Overall Agreement
Performance (%) with standard-deviation across the 10-folds of all classifiers.
| Words | Words + tense | Words + IMRAD | Words+Tense+IMRAD | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| A | 69.29±3.54 | 55.40±8.80 | 69.03±3.86 | 69.43±3.41 | 58.88±5.95 | 60.08 ±4.36 | 75.83±5.08 | 76.10±4.48 | 81.04±4.82 | 81.30±4.67 |
| I | 69.9±5.76 | 63.4±10.8 | 69.7±5.77 | 69.7±5.77 | 61.4±9.65 | 66.6±4.04 | 80.6±6.31 | 82.2±6.69 | 83.5±4.99 | 84.3±5.13 |
| M | 81.2±6.73 | 59.7±11.3 | 80.8±5.72 | 81.4±5.49 | 70.8±6.21 | 66.2±7.45 | 76.3±7.02 | 76.2±7.79 | 83.9±8.96 | 84.1±8.12 |
| R | 72.2±7.26 | 32.0±8.43 | 71.3±8.46 | 71.9±8.02 | 54.5±11.8 | 54.5±12.4 | 69.7±8.78 | 68.3±7.63 | 77.6±10.2 | 77.2±11.2 |
| D | 46.3±12.3 | 37.5±18.2 | 46.6±13.3 | 46.7±13.2 | 39.4±13.7 | 42.6±12.9 | 59.7±21.8 | 59.4±20.0 | 58.4±24.9 | 61.5±14.8 |
| WA | 70.5 | 51.8 | 70.2 | 70.5 | 59.5 | 60.7 | 74.4 | 74.6 | 79.2 | 79.8 |
A: Accuracy, I: Introduction f-score, M: Methods f-score, R: Results f-score, D: Discussion f-score, WA: Weighted average of f-score.