Extractive text summarization system to aid data extraction from full text in systematic review development.

Duy Duc An Bui¹, Guilherme Del Fiol², John F Hurdle², Siddhartha Jonnalagadda³.

Abstract

OBJECTIVES: Extracting data from publication reports is a standard process in systematic review (SR) development. However, data extraction still relies heavily on manual effort, which is slow, costly, and subject to human error. In this study, we developed a text summarization system aimed at enhancing productivity and reducing errors in the traditional data extraction process.
METHODS: We developed a computer system that used machine learning and natural language processing approaches to automatically generate summaries of full-text scientific publications. The summaries were evaluated at the sentence and fragment levels on how well they captured common clinical SR data elements such as sample size, group size, and PICO values. We compared the computer-generated summaries with human-written summaries (title and abstract) in terms of the presence of the information needed for data extraction, as presented in the study characteristics tables of Cochrane reviews.
RESULTS: At the sentence level, the computer-generated summaries covered more of the information needed for systematic reviews than the human-written summaries did (recall 91.2% vs. 83.8%, p<0.001). They also had a higher density of relevant sentences (precision 59% vs. 39%, p<0.001). At the fragment level, the ensemble approach combining rule-based, concept mapping, and dictionary-based methods performed better than any individual method alone, achieving an 84.7% F-measure.
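The sentence-level figures above are standard retrieval metrics. As a minimal sketch, assuming a balanced F-measure (F1; the abstract does not state the weighting) and using hypothetical sentence indices that are not taken from the study, they could be computed as follows:

```python
def precision_recall_f1(selected, relevant):
    """Score selected sentence ids against a gold set of relevant sentence ids."""
    selected, relevant = set(selected), set(relevant)
    true_pos = len(selected & relevant)
    # Precision: share of selected sentences that are relevant.
    precision = true_pos / len(selected) if selected else 0.0
    # Recall: share of relevant sentences that were selected.
    recall = true_pos / len(relevant) if relevant else 0.0
    denom = precision + recall
    # Balanced F-measure (harmonic mean); the weighting is an assumption.
    f1 = 2 * precision * recall / denom if denom else 0.0
    return precision, recall, f1


# Hypothetical toy data: indices of sentences in one full-text article.
summary_sentences = [0, 2, 3, 5, 8]   # sentences kept by a summarizer
gold_sentences = [2, 3, 5, 9]         # sentences containing needed data elements

p, r, f = precision_recall_f1(summary_sentences, gold_sentences)
print(f"precision={p:.2f} recall={r:.2f} F1={f:.2f}")
```

In this reading, high recall means few needed sentences are missed, while precision reflects how much irrelevant text a reviewer would still have to read.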
CONCLUSION: Computer-generated summaries are potential alternative information sources for data extraction in systematic review development. Machine learning and natural language processing are promising approaches to the development of such an extractive summarization system.
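To illustrate the idea of a fragment-level ensemble, the following is a rough sketch only, not the authors' implementation. It assumes the ensemble takes the union of each method's output (the abstract does not say how the methods were combined), uses a hypothetical regular expression for sample-size phrases and a hypothetical three-term intervention dictionary, and omits the concept-mapping component because that would need an external terminology resource.

```python
import re

# Hypothetical rule: phrases such as "120 patients" or "n = 45".
SAMPLE_SIZE_RULE = re.compile(
    r"\b(?:n\s*=\s*\d+|\d+\s+(?:patients|participants|subjects))\b",
    re.IGNORECASE,
)

# Hypothetical dictionary; a real system would rely on a curated vocabulary.
INTERVENTION_TERMS = {"placebo", "metformin", "aspirin"}


def rule_based(sentence):
    """Return fragments matched by the hand-written sample-size rule."""
    return {m.group(0) for m in SAMPLE_SIZE_RULE.finditer(sentence)}


def dictionary_based(sentence):
    """Return lowercased tokens that appear in the intervention dictionary."""
    tokens = re.findall(r"[A-Za-z]+", sentence.lower())
    return {t for t in tokens if t in INTERVENTION_TERMS}


def ensemble(sentence, extractors):
    """Combine extractors by set union (an assumed combination strategy)."""
    fragments = set()
    for extract in extractors:
        fragments |= extract(sentence)
    return fragments


sentence = "120 patients were randomized to metformin or placebo (n = 60 each)."
print(sorted(ensemble(sentence, [rule_based, dictionary_based])))
```

A union favors recall over precision; a voting or priority scheme would be an equally plausible way to combine the methods.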

Keywords: Data collection; Machine learning; Review literature as topic; Text classification; Text summarization

Year: 2016 PMID: 27989816 PMCID: PMC5362293 DOI: 10.1016/j.jbi.2016.10.014

Source DB: PubMed Journal: J Biomed Inform ISSN: 1532-0464 Impact factor: 6.317
