| Literature DB >> 19126221 |
Hong Yu1, Shashank Agarwal, Mark Johnston, Aaron Cohen.
Abstract
BACKGROUND: Biomedical scientists need to access figures to validate research facts and to formulate or to test novel research hypotheses. However, figures are difficult to comprehend without associated text (e.g., figure legend and other reference text). We are developing automated systems to extract the relevant explanatory information along with figures extracted from full text articles. Such systems could be very useful in improving figure retrieval and in reducing the workload of biomedical scientists, who otherwise have to retrieve and read the entire full-text journal article to determine which figures are relevant to their research. As a crucial step, we studied the importance of associated text in biomedical figure comprehension.Entities:
Year: 2009 PMID: 19126221 PMCID: PMC2631451 DOI: 10.1186/1747-5333-4-1
Source DB: PubMed Journal: J Biomed Discov Collab ISSN: 1747-5333
Sample data for rhetorical unit evaluation.
| Figure + Associated text | Subject-Generated Text Structured by Rhetorical Units | ||||
|---|---|---|---|---|---|
| Purpose | Methods | Results | Conclusions | Other Criteria | |
| Fig+L | Effect of pH... | Enzyme kinetics | Hard to say, need detailed fitting data | Don't know | Full text |
| Fig+L+T+A | The goal of... | X-ray crystallography... | Hydrogen bonds... | RNA binding of protein... | Methods are still lacking |
| F+Legend+T+A | To show that... | A is just a chart of... | Affinity of TF... | A model for the formation... | How was the final model arrived at (part c)? |
Space limitations prevent us from including all the text descriptions for the four rhetorical units. Legend (L), title (T), and abstract (A). The original data are available at http://www.bioex.us.com/evaluation_data/JudgeEvaluation.xml.
Score of comprehension and Confidence for each Text Type
| Text Type | Score of comprehension | Confidence |
|---|---|---|
| Fig+L | 5.82 ± 2.90 | 7.63 ± 2.28 |
| Fig+L+T+A | 7.41 ± 2.19 | 8.17 ± 1.70 |
| Full text | 8.17 ± 1.97 | 8.46 ± 1.81 |
The P-value of McNemar test of SCORE and CONFIDENCE
| Text Type 1 | Text Type 2 | Binary SCORE | Binary CONFIDENCE |
|---|---|---|---|
| Fig+L | Fig+L+T+A | 0.0006 | 0.2120 |
| Fig+L | Full text | p ≤ 0.0001 | 0.0055 |
| Fig+L+T+A | Full text | 0.0271 | 0.0995 |
Average number of words and standard deviation in annotator summaries
| Purpose | Methods | Results | Conclusions | |
|---|---|---|---|---|
| Fig+L | 11 ± 8 | 8 ± 11 | 18 ± 18 | 8 ± 7 |
| Fig+L+T+A | 18 ± 11 | 10 ± 13 | 45 ± 40 | 21 ± 15 |
| Full text | 18 ± 11 | 18 ± 13 | 44 ± 35 | 26 ± 19 |
Inter-rater pairwise and kappa agreement
| Purpose Missing | Method Missing | Result Missing | Conclusion Missing | |
|---|---|---|---|---|
| Fig+L | 89.87% (0.79) | 93.67% (0.87) | 79.75% (0.59) | 86.08% (0.66) |
| Fig+L+T+A | 96.15% (0.71) | 93.59% (0.83) | 87.18% (0.65) | 91.03% (0.78) |
| Full text | 98.68% (0.66) | 93.42% (0.58) | 88.16% (0.40) | 88.16% (0.54) |
P-value of McNemar's test for differences between figure – text types (judge 1)
| Text Type 1 | Text Type 2 | Purpose Missing | Method Missing | Result Missing | Conclusion Missing |
|---|---|---|---|---|---|
| Fig+L | Fig+L+T+A | 0.0000 | 0.0154 | 0.0004 | 0.0000 |
| Fig+L | Full Text | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Fig+L+T+A | Full Text | 0.4868 | 0.0043 | 0.0851 | 0.1910 |
P-value of McNemar's test score for differences between figure – text types (judge 2)
| Text Type 1 | Text Type 2 | Purpose Missing | Method Missing | Result Missing | Conclusion Missing |
|---|---|---|---|---|---|
| Fig+L | Fig+L+T+A | 0.0000 | 0.0205 | 0.0000 | 0.0000 |
| Fig+L | Full text | 0.0000 | 0.0000 | 0.0000 | 0.0000 |
| Fig+L+T+A | Full text | 0.2300 | 0.0099 | 0.0533 | 0.0246 |
Comprehension rates (non-missing rhetorical units) associated with figure – text types for four rhetorical units.
| Fig+L | Fig+L+T+A | Full Text | |
|---|---|---|---|
| Purpose | 0.57 | 0.92 | 0.97 |
| Methods | 0.61 | 0.76 | 0.91 |
| Results | 0.49 | 0.76 | 0.89 |
| Conclusions | 0.32 | 0.73 | 0.86 |
ROUGE-L F-scores for different figure – text types.
| Fig+L | Fig+L+T+A | Full text | |
|---|---|---|---|
| Purpose | 0.46 | 0.60 | 0.61 |
| Methods | 0.38 | 0.48 | 0.59 |
| Results | 0.28 | 0.35 | 0.58 |
| Conclusions | 0.22 | 0.28 | 0.56 |
| Full summaries | 0.32 | 0.40 | 0.58 |