| Literature DB >> 26323593 |
Abstract
BACKGROUND: Systematic reviews and their implementation in practice provide high quality evidence for clinical practice but are both time and labor intensive due to the large number of articles. Automatic text classification has proven to be instrumental in identifying relevant articles for systematic reviews. Existing approaches use machine learning model training to generate classification algorithms for the article screening process but have limitations.Entities:
Keywords: automatic document classification; evidence-based medicine; relevance feedback; systematic review
Year: 2015 PMID: 26323593 PMCID: PMC4705019 DOI: 10.2196/medinform.3982
Source DB: PubMed Journal: JMIR Med Inform
Figure 1Illustrated article network.
Total article numbers and rates of inclusion.
| SR report topic | Total | Abstract | Full text |
| ACE inhibitors | 2544 | 183 (7.19) | 41(1.61) |
| ADHD | 851 | 84 (9.87) | 20 (2.35) |
| Antihistamines | 310 | 92 (29.68) | 16 (5.16) |
| Atypical antipsychotics | 1120 | 363 (32.41) | 146 (13.04) |
| Beta blockers | 2072 | 302 (14.58) | 42 (2.03) |
| Calcium channel blockers | 1218 | 279 (22.91) | 100 (8.21) |
| Estrogens | 368 | 80 (21.74) | 80 (21.74) |
| NSAIDS | 393 | 88 (22.39) | 41 (10.43) |
| Opioids | 1915 | 48 (2.51) | 15 (0.78) |
| Oral hypoglycemics | 503 | 139 (27.63) | 136 (27.04) |
| Proton pump inhibitors | 1333 | 238 (17.85) | 51 (3.83) |
| Skeletal muscle relaxants | 1643 | 34 (2.07) | 9 (0.55) |
| Statins | 3465 | 173 (4.99) | 85 (2.45) |
| Triptans | 671 | 218 (32.49) | 24 (3.58) |
| Urinary incontinence | 327 | 78 (23.85) | 40 (12.23) |
Figure 2Calculation of the similarity between articles.
Figure 3Simulated interactive recommendation process.
Figure 4Formulas of precision, recall, and F1.
Figure 5Formulas of WSS and WSS95.
Single element WSS95 performance.
| SR report topic | TI | AB | PT | AU | MH |
| ACE inhibitors | 76.49 | 71.07 | 33.22 | 0 | 47.37 |
| ADHD | 80.26 | 65.10 | 22.56 | 0 | 47.00 |
| Antihistamines | 13.55 | 15.81 | 32.58 | 0 | 2.58 |
| Atypical antipsychotics | 17.23 | 20.54 | 19.64 | 0 | 9.46 |
| Beta blockers | 44.74 | 49.95 | 43.77 | 0 | 28.67 |
| Calcium channel blockers | 19.38 | 16.34 | 18.64 | 0 | 20.94 |
| Estrogens | 29.35 | 29.08 | 17.93 | 0 | 38.59 |
| NSAIDS | 63.36 | 66.67 | 58.27 | 0 | 33.84 |
| Opioids | 8.30 | 9.82 | 37.23 | 0 | 6.48 |
| Oral hypoglycemics | 11.73 | 12.13 | 22.27 | 0 | 7.55 |
| Proton pump inhibitors | 43.74 | 15.60 | 35.48 | 0 | 20.56 |
| Skeletal muscle relaxants | 0 | 36.03 | 74.68 | 0 | 42.85 |
| Statins | 25.52 | 30.17 | 13.31 | 0 | 13.68 |
| Triptans | 45.60 | 42.47 | 28.17 | 0 | 33.23 |
| Urinary incontinence | 30.89 | 18.65 | 43.43 | 0 | 26.91 |
| Average WSS95 | 34.01 | 33.30 | 33.41 | 0 | 25.31 |
Single element F1 performance; percentage of articles screened at F1.
| SR report topic | TI | AB | PT | AU | MH |
| ACE inhibitors | 0.3444 (4) | 0.3121 (4) | 0.2182 (<1) | 0.1872 (6) | 0.2368 (1) |
| ADHD | 0.2885 (10) | 0.3824 (6) | 0.2963 (<1) | 0.0909 (<1) | 0.5556 (4) |
| Antihistamines | 0.2593 (12) | 0.4000 (3) | 0.2759 (<1) | 0.1111 (<1) | 0.3333 (3) |
| Atypical antipsychotics | 0.3447 (26) | 0.4248 (14) | 0.4363 (5, 12a) | 0.0135 (<1) | 0.3113 (40) |
| Beta blockers | 0.1972 (1) | 0.2710 (5) | 0.2105 (<1) | 0.0417 (<1) | 0.0957 (19) |
| Calcium channel | 0.2026 (10) | 0.2672 (11) | 0.2662 (15) | 0.1261 (9) | 0.2579 (2) |
| Estrogens | 0.5140 (36) | 0.5612 (29) | 0.4937 (18) | 0.0244 (<1) | 0.5536 (39) |
| NSAIDS | 0.4368 (34) | 0.5870 (13) | 0.6761 (8) | 0.4853 (24) | 0.3650 (24) |
| Opioids | 0.2727 (<1) | 0.1429 (<1) | 0.2222 (<1) | 0.1111 (<1) | 0.2500 (<1) |
| Oral hypoglycemics | 0.4509 (88) | 0.4603 (76) | 0.5019 (78) | 0.0145 (<1) | 0.4527 (53) |
| Proton pump | 0.3333 (1) | 0.3860 (5) | 0.1299 (42) | 0.0377 (<1) | 0.1775 (25) |
| Skeletal muscle | 0.1429 (<1) | 0.1981 (<1) | 0.2286 (2) | 0.1429 (<1) | 0.2222 (<1) |
| Statins | 0.2278 (6) | 0.2479 (1) | 0.4019 (4) | 0.1484 (12) | 0.1563 (1) |
| Triptans | 0.1739 (10) | 0.360 (4) | 0.2569 (13) | 0.0690 (<1) | 0.2750 (8) |
| Urinary | 0.3697 (24) | 0.5243 (19) | 0.5405 (10) | 0.4444 (13) | 0.4317 (30) |
| Averageb | 0.3039 (18) | 0.3683 (13) | 0.3437 (14) | 0.1365 (5) | 0.3116 (17) |
aBoth 5% and 12% have F1 = 0.4363. The average of 5% and 12% (8.5%) is taken to calculate the average value on the last row of the table.
b<1% is considered as 1% for calculating the average percentage.
WSS95 of the top 6 combinations.
| SR report topic | TI+AB | TI+AB | TI+AB | TI+AB | TI+AB | TI+AB+MH |
| ACE inhibitors | 76.38 | 76.85 | 74.29 | 75.79 | 73.70 | 75.08 |
| ADHD | 80.38 | 79.79 | 67.92 | 80.14 | 67.92 | 56.17 |
| Antihistamines | 16.13 | 10.65 | 24.52 | 16.13 | 24.52 | 18.39 |
| Atypical antipsychotics | 20.89 | 14.20 | 17.95 | 20.63 | 17.77 | 14.38 |
| Beta blockers | 60.14 | 60.09 | 65.01 | 60.96 | 64.72 | 65.21 |
| Calcium channel blockers | 18.23 | 18.64 | 17.32 | 18.39 | 17.49 | 22.82 |
| Estrogens | 33.42 | 36.14 | 22.55 | 33.97 | 22.55 | 29.08 |
| NSAIDS | 72.26 | 75.57 | 77.35 | 70.48 | 76.34 | 77.86 |
| Opioids | 6.01 | 11.75 | 8.98 | 5.95 | 8.98 | 12.17 |
| Oral hypoglycemics | 11.33 | 13.12 | 13.52 | 11.13 | 13.52 | 12.72 |
| Proton pump inhibitors | 19.20 | 21.31 | 19.65 | 19.05 | 19.65 | 20.11 |
| Skeletal muscle relaxants | 41.94 | 46.44 | 58.55 | 41.87 | 58.49 | 60.01 |
| Statins | 29.10 | 27.11 | 27.80 | 30.96 | 27.71 | 26.07 |
| Triptans | 48.29 | 51.71 | 39.64 | 50.52 | 39.79 | 40.98 |
| Urinary incontinence | 12.84 | 11.01 | 20.80 | 12.84 | 20.80 | 14.37 |
| Average | 36.44 | 36.96 | 37.06 | 36.59 | 36.93 | 36.35 |
F1 of the top 6 combinations.
| SR report topic | TI+AB | TI+AB+MH | TI+AB+PT | TI+AB+AU | TI+AB+PT+AU | TI+AB+MH+PT+AU | ||||
| ACE inhibitors | 0.4156 (1) | 0.4000 (2) | 0.4051 (1) | 0.3902 (2) | 0.3971 (4) | 0.3774 (3) | ||||
| ADHD | 0.4000 (3) | 0.4688 (5) | 0.5455 (4) | 0.4286 (6) | 0.5306 (3) | 0.5818 (4) | ||||
| Antihistamines | 0.3226 (5) | 0.3333 (10) | 0.2903 (15) | 0.3226 (5) | 0.2903 (15) | 0.2813 (15) | ||||
| Atypical | 0.4364 (16) | 0.4241 (15) | 0.4887 (15) | 0.4411 (17) | 0.4856 (15) | 0.4606 (15) | ||||
| Beta blockers | 0.2800 (3) | 0.3043 (2) | 0.3590 (2) | 0.2667 (3) | 0.3596 (2) | 0.3333 (3) | ||||
| Calcium channel | 0.2335 (8) | 0.2620 (11) | 0.2804 (9) | 0.2323 (8) | 0.2816 (9) | 0.2995 (9) | ||||
| Estrogens | 0.6000 (30) | 0.6237 (29) | 0.6047 (25) | 0.5979 (31) | 0.6118 (24) | 0.6171 (26) | ||||
| NSAIDS | 0.6667 (16) | 0.6154 (16) | 0.6966 (12) | 0.6471 (16) | 0.6809 (13) | 0.6667 (15) | ||||
| Opioids | 0.3000 (0) | 0.3158 (<1) | 0.3000 (<1) | 0.3000 (<1) | 0.3000 (<1) | 0.3158 (<1) | ||||
| Oral hypoglycemics | 0.4497 (90) | 0.4541 (88) | 0.4553 (86) | 0.4489 (92) | 0.4561 (75) | 0.4635 (82) | ||||
| Proton pump | 0.4384 (7) | 0.4737 (5) | 0.5172 (5) | 0.4552 (7) | 0.5455 (5) | 0.5079 (6) | ||||
| Skeletal muscle | 0.2222 (1) | 0.2353 (<1) | 0.2500 (<1) | 0.2222 (1) | 0.2500 (<1) | 0.2667 (<1) | ||||
| Statins | 0.2994 (2) | 0.3281 (1) | 0.3382 (1) | 0.2959 (2) | 0.3358 (2) | 0.3465 (1) | ||||
| Triptans | 0.3636 (3) | 0.3913 (3) | 0.3556 (3) | 0.3556 (3) | 0.3529 (4) | 0.3913 (3) | ||||
| Urinary incontinence | 0.5063 (12) | 0.5347 (19) | 0.5505 (21) | 0.5263 (11) | 0.5507 (9) | 0.5843 (15) | ||||
| Averagea | 0.3956 (13) | 0.4110 (14) | 0.4291 (14) | 0.3954 (14) | 0.4286 (12) | 0.4329 (13) | ||||
a<1% is considered as 1% for calculating the average percentage.
WSS95 comparison with the Cohen and Matwin systems across 15 SR topics.
| SR report topic | Cohen 2006 [ | Cohen 2008 [ | Matwin 2010 [ | Our study |
| ACE inhibitors | 56.61 | 73.30 | 52.30 | 74.29 |
| ADHD | 67.95 | 52.60 | 62.20 | 67.92 |
| Antihistamines | 0 | 23.60 | 14.90 | 24.52 |
| Atypical antipsychotics | 14.11 | 17.00 | 20.60 | 17.95 |
| Beta blockers | 28.44 | 46.50 | 36.70 | 65.01 |
| Calcium channel blockers | 12.21 | 43.00 | 23.40 | 17.32 |
| Estrogens | 18.34 | 41.40 | 37.50 | 22.55 |
| NSAIDS | 49.67 | 67.20 | 52.80 | 77.35 |
| Opioids | 13.32 | 36.40 | 55.40 | 8.98 |
| Oral hypoglycemics | 8.96 | 13.60 | 8.50 | 13.52 |
| Proton pump inhibitors | 27.68 | 32.80 | 22.90 | 19.65 |
| Skeletal muscle relaxants | 0 | 37.40 | 26.50 | 58.55 |
| Statins | 24.71 | 49.10 | 31.50 | 27.80 |
| Triptans | 3.37 | 34.60 | 27.40 | 39.64 |
| Urinary incontinence | 26.14 | 43.20 | 29.60 | 20.80 |
| Average | 23.43 | 40.80 | 33.50 | 37.06 |
aVP: voting Perceptron-based automated citation classification system
bFCNB/WE: factorized complement naïve Bayes with weight engineering
cSVM: support vector machine
The P values of pairwise comparison of four studies.
|
| Cohen 2006 | Cohen 2008 | Matwin 2010 | Our study (TI+AB+PT) |
| Cohen 2006 | — | 0.0012 | 0.0433 | 0.0475 |
| Cohen 2008 | 0.0012 | — | 0.0649 | 0.4979 |
| Matwin 2010 | 0.0433 | 0.0649 | — | 0.4979 |
| Our study | 0.0475 | 0.4979 | 0.4979 | — |
Figure 6Proton Pump Inhibitors recall performance during the recommendation process using two different element combinations.