Grace Y Chung, Enrico Coiera.
Abstract
BACKGROUND: This paper proposes the use of decision trees as the basis for automatically extracting information from published randomized controlled trial (RCT) reports. An exploratory analysis of RCT abstracts is undertaken to investigate the feasibility of using decision trees as a semantic structure. Quality-of-paper measures are also examined.
Year: 2008 PMID: 18957129 PMCID: PMC2584633 DOI: 10.1186/1472-6947-8-48
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Figure 1. Example tree structure from an RCT (PMID: 12637461). The tree compares tamoxifen treatment with tamoxifen plus aminoglutethimide for postmenopausal breast cancer patients. The primary outcome measure is 5-year disease-free survival (DFS), whose numerical values can be used directly in decision analysis. The corresponding true decision tree is illustrated.
Figure 2. Example intermediate decision tree derived from an RCT (PMID: 16782917). The primary outcome measure is the median Progression Free Survival (PFS) time, computed for each group in each treatment arm. The output decision tree taken directly from the article is an intermediate representation as opposed to a true "decision-ready" tree structure.
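The tree shape described in the two captions (a population, one branch per treatment arm, and an outcome value at each leaf) can be sketched as a small data structure. This is a minimal illustration, not the authors' representation: `Node`, `two_arm_tree`, and `leaves` are hypothetical names, the arm labels are generic, and the outcome values are placeholders rather than results from either trial.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple


@dataclass
class Node:
    """One node of the tree: a population, a treatment arm, or an outcome leaf."""
    label: str
    value: Optional[float] = None  # set only on outcome leaves
    children: List["Node"] = field(default_factory=list)


def two_arm_tree(population: str, outcome: str, arms: Dict[str, float]) -> Node:
    """Build population -> one child per arm -> one outcome leaf per arm."""
    root = Node(label=population)
    for arm_name, outcome_value in arms.items():
        leaf = Node(label=outcome, value=outcome_value)
        root.children.append(Node(label=arm_name, children=[leaf]))
    return root


def leaves(node: Node, path: Tuple[str, ...] = ()) -> List[Tuple[Tuple[str, ...], Optional[float]]]:
    """Return (path, value) for every leaf under `node`."""
    path = path + (node.label,)
    if not node.children:
        return [(path, node.value)]
    found = []
    for child in node.children:
        found.extend(leaves(child, path))
    return found


# Placeholder numbers for illustration only; they are NOT results from either trial.
tree = two_arm_tree("eligible patients", "5-year DFS", {"arm A": 0.60, "arm B": 0.55})

# With a "higher is better" outcome, the preferred arm is the leaf with the largest value.
best_path, best_value = max(leaves(tree), key=lambda item: item[1])
```

An intermediate tree like the one in Figure 2 would add one more level (a group node under each arm) before the outcome leaves; `leaves` already walks trees of any depth.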
Number of abstracts in each subcategory in a randomly selected corpus of 455 RCT abstracts.
| 455 | 336 (73.8%) | 21 (4.6%) | 34 (7.5%) | 5 (1.1%) | 6 (1.3%) | 53 (11.6%) | |
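The row above gives the corpus total followed by six subcategory counts, each with its share of the corpus. As a quick arithmetic check, the counts can be summed and the percentages recomputed; the variable names here are illustrative, and the list is left unlabeled because the row itself carries no column names.

```python
corpus_total = 455
subcategory_counts = [336, 21, 34, 5, 6, 53]

# The six subcategories should partition the whole corpus.
counts_sum = sum(subcategory_counts)

# Each share is the count as a percentage of the corpus, rounded to one decimal place.
shares = [round(100 * count / corpus_total, 1) for count in subcategory_counts]
```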
The number of unstructured and structured abstracts in Group A (the original set of abstracts) and Group R1 (the primary RCT reports from the randomly selected subset).
| | Group A | Group R1 |
| Total abstracts | 7620 (100%) | 336 (100%) |
| Unstructured abstracts | 3179 (42%) | 106 (32%) |
| Structured abstracts | 4441 (58%) | 230 (68%) |
| Structured abstracts with explicit heading for intervention | 283 (3.7%) | 14 (4.2%) |
| Structured abstracts with explicit heading for population | 433 (5.7%) | 17 (5.0%) |
| Structured abstracts with explicit heading for outcome measures | 329 (4.3%) | 18 (5.3%) |
| Structured abstracts with explicit heading for all three subheadings | 162 (2.2%) | 11 (3.3%) |
Examples of equivalence classes of pre-defined sub-headings in structured abstracts.
| Class | Equivalent sub-headings |
| Aim | Goal, Aim of the study, Purpose |
| Setting | Setting, Study setting, Settings and Location |
| Participants | Study population, Type of participants, Patients or participants, Sample |
| Outcome measure | Measurements, Primary outcome measure, Study endpoints, Major outcome measures |
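One way to apply these equivalence classes is a lookup that maps each sub-heading variant to its class. The sketch below uses only the variants listed in the table; the names `EQUIVALENCE_CLASSES` and `canonical_heading` are hypothetical, and treating the class name itself as a variant, ignoring case, and ignoring punctuation are assumptions rather than rules stated in the source.

```python
import re
from typing import Dict, List, Optional

# Variants copied from the table; the real classes contain more entries than these examples.
EQUIVALENCE_CLASSES: Dict[str, List[str]] = {
    "Aim": ["Goal", "Aim of the study", "Purpose"],
    "Setting": ["Setting", "Study setting", "Settings and Location"],
    "Participants": ["Study population", "Type of participants",
                     "Patients or participants", "Sample"],
    "Outcome measure": ["Measurements", "Primary outcome measure",
                        "Study endpoints", "Major outcome measures"],
}


def _key(heading: str) -> str:
    """Lower-case a heading and collapse anything that is not a letter to one space."""
    return re.sub(r"[^a-z]+", " ", heading.lower()).strip()


# Assumption: the class name also counts as a member of its own class.
_LOOKUP: Dict[str, str] = {
    _key(variant): class_name
    for class_name, variants in EQUIVALENCE_CLASSES.items()
    for variant in [class_name, *variants]
}


def canonical_heading(heading: str) -> Optional[str]:
    """Return the equivalence class for a sub-heading, or None if it is not listed."""
    return _LOOKUP.get(_key(heading))
```

For example, `canonical_heading("STUDY ENDPOINTS:")` resolves to the "Outcome measure" class, while a heading that is not listed returns `None` and can be handled separately.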
Examples of the patterns that occur in the section headings of structured RCT abstracts of Group A.
| Background, Method, Result, Conclusion | 16 |
| Aim, Method, Result, Conclusion | 14 |
| Aim, Patient and Method, Result, Conclusion | 8.5 |
| Background, Aim, Method, Result, Conclusion | 7.6 |
| Background, Method and Results, Conclusion | 6.6 |
| Aim, Participants, Design, Measurements, Result, Conclusion | < 1 |
| Context, Design, Setting, Participants, Outcome Measures, Result, Conclusion | < 1 |
| Aim, Design and Setting, Participants, Intervention, Measurements and Main Results, Conclusion | < 1 |
Intervention information for primary RCT abstracts (R1).
| Total abstracts | 213 | 123 |
| Number of treatment arms unknown | 1 | 1 |
| 2 treatment arms | 161 | 96 |
| 3 treatment arms | 33 | 20 |
| 4 or more treatment arms | 5 | 6 |
Distribution of sentences describing interventions and comparisons with respect to the classes of pre-defined subheadings in primary RCT structured abstracts (R1).
| Method | 197 (78%) |
| Intervention | 19 (7.5%) |
| Aim | 14 (5.5%) |
| Method and Results | 12 (4.7%) |
| Design | 8 (3.1%) |
| Results | 2 (0.1%) |
| Background | 1 (<0.1%) |
| Setting | 1 (<0.1%) |
Number of abstracts in each sub-category for population treatment for 336 primary RCT abstracts (R1).
| Number of abstracts | 320 | 11 | 5 |
Overall population information for primary RCT abstracts (R1).
| Abstracts reporting total subjects in study | 280 (84%) |
| Abstracts reporting subjects assigned to each arm | 122 (36%) |
| Abstracts reporting the number of drop outs | 5 (1.5%) |
| No information about population | 20 (6%) |
Distribution of population information with respect to the classes of pre-defined subheadings in structured abstracts.
| Method | 195 |
| Intervention | 3 |
| Aim | 3 |
| Method and Results | 22 |
| Design | 16 |
| Results | 82 |
| Patients | 15 |
Number of abstracts in each sub-category for reporting of outcomes.
| Number of abstracts | 15 | 2 | 4 |