| Literature DB >> 22182279 |
Emily Seymour1, Rohini Damle, Alessandro Sette, Bjoern Peters.
Abstract
BACKGROUND: The Immune Epitope Database (IEDB) project manually curates information from published journal articles that describe immune epitopes derived from a wide variety of organisms and associated with different diseases. In the past, abstracts of scientific articles were retrieved by broad keyword queries of PubMed, and were classified as relevant (curatable) or irrelevant (not curatable) to the scope of the database by a Naïve Bayes classifier. The curatable abstracts were subsequently manually classified into categories corresponding to different disease domains. Over the past four years, we have examined how to further improve this approach in order to enhance classification performance and to reduce the need for manual intervention.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22182279 PMCID: PMC3314711 DOI: 10.1186/1471-2105-12-482
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1IEDB PubMed abstract triaging process. Abstracts of references retrieved from the PubMed queries that have not been introduced to the IEDB's database and curation pipeline proceed to at least one of four hierarchical levels of classification. At Level 0, an abstract is evaluated for epitope-specific content. Abstracts which contain epitope-specific data are assigned to one of the seven Level 1 categories. References receive increasingly specific category assignments at Levels 2 and 3. High IEDB priority categories are Allergy, Autoimmunity, Infectious Disease, and Transplantation. Low IEDB priority categories are Cancer, HIV, and Other. Transplant and Cancer references are not assigned Level 2 categories. HIV references do not receive Level 2 or 3 category assignments.
Figure 2Comparison of Naïve Bayes and SVM algorithms at training Level 0. The performance of the Naïve Bayes and SVM classifiers was evaluated with 10-fold cross-validation. As is shown in the ROC curve, the SVM classifier outperformed the Naïve Bayes classifier on curatability predictions for the cross-validation dataset of 89,884 abstracts. The AUC value for the SVM classifier was 0.899 and the AUC value for the Naïve Bayes classifier was 0.854. At the 5% false negative rate for the curatability decision, the SVM classifier had a true positive rate of 41.4% and the Naïve Bayes classifier had a true positive rate of 33.5%.
Hierarchical SVM classifier performance on training dataset for Level 1 predictions.
| Level 1 category | Curatable abstracts | AUC individual category (SVM) | Category prediction accuracy (%)(MLP) |
|---|---|---|---|
| Allergy | 1146 | 0.994 | 91.6 |
| Autoimmunity | 4350 | 0.988 | 88.9 |
| Infectious Disease | 7525 | 0.989 | 92.7 |
| Transplantation | 888 | 0.985 | 76.4 |
| HIV | 2369 | 0.989 | 92.6 |
| Cancer | 2650 | 0.988 | 89.8 |
| Other | 3905 | 0.985 | 85.4 |
Performance was evaluated with 10-fold cross-validation. An AUC value was calculated for a given category, where documents assigned by the expert to be of that category are considered "positive" while documents assigned to any other category are negative. To evaluate the performance of the document classification by the Multilayer Perceptron (MLP) into specific categories, we calculated the percent agreement of categories. The AUC and category prediction accuracy values are entered for each Level 1 category in addition to the total prediction accuracy.
Hierarchical SVM classifier performance on Autoimmunity training dataset for Level 2 predictions.
| Level 2 category | Curatable abstracts | AUC individual category (SVM) | Category prediction accuracy (%)(MLP) |
|---|---|---|---|
| Beta-Amyloid | 213 | .998 | 97.2 |
| Diabetes | 443 | .998 | 96.6 |
| General Autoimmune | 1311 | .987 | 87.6 |
| Lupus | 704 | .988 | 94.6 |
| Multiple Sclerosis | 957 | .990 | 97.3 |
| Myasthenia Gravis | 221 | .992 | 95.0 |
| Rheumatoid Arthritis | 501 | .990 | 84.0 |
Performance was evaluated with five-fold cross-validation. The AUC and category prediction accuracy values are entered for each Autoimmunity Level 2 category.
Hierarchical SVM classifier performance on Diabetes training dataset for Level 3 predictions.
| Level 3 category | Curatable abstracts | AUC individual category (SVM) | Category prediction accuracy (%)(MLP) |
|---|---|---|---|
| GAD | 148 | .966 | 87.2 |
| HSP | 19 | .997 | 89.5 |
| IA2 | 26 | .987 | 80.8 |
| IGRP | 16 | .995 | 75.0 |
| INSULIN | 126 | .964 | 87.3 |
| OTH | 79 | .823 | 53.2 |
| VAR | 29 | .695 | 13.8 |
The AUC and category prediction accuracy values are entered for each Diabetes Level 3 category.
Performance comparison of non-hierarchical and hierarchical SVM classifiers.
| Category | Number of curatable abstracts | Non-hierarchical AUC | Hierarchical AUC |
|---|---|---|---|
| Beta-Amyloid | 213 | .994 | .998 |
| Diabetes | 443 | .997 | .998 |
| General Autoimmune | 1311 | .967 | .987 |
| Lupus | 704 | .975 | .988 |
| Multiple Sclerosis | 957 | .982 | .990 |
| Myasthenia Gravis | 221 | .983 | .992 |
| Rheumatoid Arthritis | 501 | .983 | .990 |
AUC values from the non-hierarchical and hierarchical Autoimmunity category SVM classifiers.
Comparison of training Level 1 category predictions with and without cost sensitivity.
| Number of references | No cost | Cost sensitive |
|---|---|---|
| Correct classification | 12515 | 12978 |
| Incorrect, should be... | 1207 | 2042 |
| Other high priority | 407 | 464 |
| Low priority | 800 | 1578 |
| Correct classification | 7799 | 7112 |
| Incorrect, should be... | 1312 | 701 |
| Other low priority | 325 | 234 |
| High priority | 987 | 467 |
The number of references predicted into the Level 1 categories with and without cost sensitivity. In the cost sensitive scenario, there was a decrease in the number of high priority references misclassified into low priority categories.
Comparison of no cost and cost sensitive classification on training Level 3 Diabetes references.
| Level 3 categories | Curatable abstracts | Correct predictions with no cost | No cost category prediction accuracy (%) | Correct predictions with cost sensitivity | Cost sensitivity category prediction accuracy (%) |
|---|---|---|---|---|---|
| GAD | 148 | 129 | 87.2 | 140 | 94.6 |
| HSP | 19 | 17 | 89.5 | 18 | 94.7 |
| IA2 | 26 | 21 | 80.8 | 21 | 80.8 |
| IGRP | 16 | 12 | 75.0 | 13 | 81.3 |
| INSULIN | 126 | 110 | 87.3 | 120 | 95.2 |
| OTH | 79 | 42 | 53.2 | 9 | 11.4 |
| VAR | 29 | 4 | 13.8 | 3 | 10.3 |
The number of correct predictions and category prediction accuracy for no cost and cost sensitive classification of training Level 3 Diabetes references.
Comparison of predicted and actual Level 1 category assignments on independent dataset.
| Classifier | ||||||||
|---|---|---|---|---|---|---|---|---|
| Allergy | 1 | 0 | 0 | 0 | 0 | 0 | ||
| Autoimmunity | 0 | 0 | 0 | 0 | 0 | 1 | ||
| Infectious | 0 | 1 | 0 | 1 | 0 | 2 | ||
| Disease | ||||||||
| Transplantation | 0 | 1 | 0 | 0 | 0 | 0 | ||
| Cancer | 0 | 1 | 0 | 1 | 0 | 2 | ||
| HIV | 0 | 0 | 2 | 0 | 0 | 0 | ||
| Other | 0 | 1 | 1 | 0 | 1 | 0 | ||
Columns represent predictions by the classifier and rows represent the Level 1 category assigned by a human expert. For example, one reference predicted as Transplant was actually Cancer. The Total Incorrect row represents the total number of references that were predicted into Level 1 categories by the classifier that differed from the decision of the human expert. Of the 642 abstracts predicted to be curatable, 355 abstracts were overruled as uncuratable which can be seen in the Uncuratable row. Of the 287 curatable abstracts, 94.4% were assigned to the correct Level 1 category.
Performance of the hierarchical SVM classifiers at Levels 2 and 3.
| Level 1 | Assigned by expert | Level 2: Correct predictions | Level 2: Correct predictions (%) | Level 3: Correct predictions | Level 3: Correct predictions (%) |
|---|---|---|---|---|---|
| Allergy | 12 | 11 | 91.7% | 9 | 75.0% |
| Autoimmunity | 59 | 58 | 98.3% | 48 | 81.4% |
| Infectious Disease | 104 | 98 | 94.2% | 91 | 87.5% |
| Transplantation | 9 | n/a | n/a | 8 | 88.9% |
| Cancer | 45 | n/a | n/a | 32 | 71.1% |
| HIV | 35 | n/a | n/a | n/a | n/a |
| Other | 23 | 19 | 82.6% | 19 | 82.6% |
Transplantation and Cancer do not receive Level 2 category assignments. HIV does not receive Level 2 or 3 category assignments.