| Literature DB >> 25600450 |
Henry Petersen1, Josiah Poon, Simon K Poon, Clement Loy.
Abstract
BACKGROUND: Comprehensive literature searches are conducted over multiple medical databases in order to meet stringent quality standards for systematic reviews. These searches are often very laborious, with authors often manually screening thousands of articles. Information retrieval (IR) techniques have proven increasingly effective in improving the efficiency of this process. IR challenges for systematic reviews involve building classifiers using training data with very high class-imbalance, and meeting the requirement for near perfect recall on relevant studies. Traditionally, most systematic reviews have focused on questions relating to treatment. The last decade has seen a large increase in the number of systematic reviews of diagnostic test accuracy (DTA).Entities:
Keywords: classification and clustering; data mining; information storage and retrieval; meta-analysis; review literature
Year: 2014 PMID: 25600450 PMCID: PMC4288066 DOI: 10.2196/medinform.3037
Source DB: PubMed Journal: JMIR Med Inform
List of expected manifestations (differences between DTA and treatment reviews) for all hypotheses.
| Manifestation | Description | Hypothesis A: | Hypothesis B: | Hypothesis C: |
| FTa | The mean number of full-text articles screened would be higher for DTA reviews than for treatment | Yes | - | - |
| SRb | The mean number of search results would be higher for DTA reviews than for treatment | Yes | - | Yes |
| INCc/ FT | The number of included studies as a percentage of the number of full-text articles screened would be lower for DTA reviews than for treatment | - | Yes | - |
| INC/SR | The number of included studies as a percentage of the total number of search results would be lower for DTA reviews than for treatment | Yes | - | - |
| FT/SR | The number of full-text articles retrieved as a percentage of the total search results would be lower for DTA reviews than for treatment | - | - | Yes |
anumber of references for which full-text papers were screened
bnumber of references retrieved by the original search
cnumber of references included in the final meta-analysis
Summary of the total number of DTA and treatment reviews randomly selected for inclusion in our analysis, ordered by CRG.
| Cochrane review groups | DTA reviews | Treatment reviews |
| Acute respiratory infections | 1 | 15 |
| Airways | 1 | 15 |
| Back | 3 | 45 |
| Bone, joint, and muscle trauma | 1 | 15 |
| Eyes and vision | 1 | 15 |
| Gynecological cancer | 1 | 15 |
| Infectious diseases | 2 | 30 |
| Pregnancy | 1 | 15 |
| Renal | 1 | 15 |
| Stroke | 1 | 15 |
| Total | 13 | 195 |
Table 3. Summary of the sample sizes (number of reviews reporting nonzero values) for evaluating each of the expected manifestations.
|
| DTA | Treatment |
| DATAINC | 13/13 | 186/195 |
| DATAFT | 12/13 | 133/195 |
| DATASR | 13/13 | 101/195 |
| DATAINC / FT | 12/13 | 126/195 |
| DATAINC / SR | 13/13 | 95/195 |
| DATAFT / SR | 12/13 | 92/195 |
Ratio between mean and median for collected treatment reviews.
|
| Mean | Median | Mean / Median |
| DATAINC | 19.56 | 11.0 | 1.78 |
| DATAFT | 71.89 | 33.00 | 2.18 |
| DATASR | 1799.04 | 900.00 | 2.00 |
| DATAINC / FT | 0.394 | 0.357 | 1.11 |
| DATAINC / SR | 0.033 | 0.013 | 2.47 |
| DATAFT / SR | 0.099 | 0.046 | 2.13 |
Summary of mean values for collected statistics.
|
| MeanDTA | MeanTreat | MeanDTA/ MeanTreat |
| DATAFT | 191.92 (n=13,s=233.51) | 71.89 (n=133,s=154.76) | 2.67 |
| DATASR | 5144.23 (n=13,s=4109.78) | 1799.04 (n=101,s=2530.11) | 2.86 |
| DATAINC / FT | 0.191 (n=13,s=0.11) | 0.394 (n=126,s=0.24) | 0.49 |
| DATAINC / SR | 0.021 (n=13,s=0.036) | 0.033 (n=95,s=0.049) | 0.63 |
| DATAFT / SR | 0.087 (n=13,s=0.124) | 0.100 (n=92,s=0.156) | 0.87 |
Summary of ranked data for collected statistics.
|
| MeanDTA | MedianDTA | MeanTreat | MedianTreat |
| DATAFT | 110.67 (n=12,s=27.64) | 113.0 | 68.51 (n=133,s=41.16) | 67.0 |
| DATASR | 85.54 (n=13,s=27.84) | 94.0 | 52.76 (n=101,s=31.62) | 52.0 |
| DATAINC / FT | 35.67 (n=12,s=24.69) | 29.0 | 71.63 (n=126,s=39.60) | 73.5 |
| DATAINC / SR | 40.54 (n=13,s=31.12) | 35.0 | 55.27 (n=95,s=30.76) | 56.0 |
| DATAFT / SR | 47.5 (n=12,s=30.18) | 45.5 | 52.02 (n=92,s=29.97) | 53.5 |
Summary linking each hypothesis, expected manifestation, and literature screening stage.
|
| Hypothesis A: | Hypothesis B: | Hypothesis C: |
| Total articles screened | Increase | - | Increase |
| 5144.2 DTA > 1799.0TR
| 5144.2 DTA > 1799.0TR
| ||
| Full-text articles obtained | Increase | - | Decreased as a % of total articles screened |
| 191.9DTA> 71.9TR
| 0.087DTA< 0.100TR
| ||
| Included Articles | Decrease as a % of total articles screened | Decreased as a % of full-text articles obtained | - |
| 0.021DTA< 0.033TR
| 0.191DTA< 0.394TR
|