Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Categorization of sentence types in medical abstracts.

Literature DB >> 14728211

Categorization of sentence types in medical abstracts.

Abstract

This study evaluated the use of machine learning techniques in the classification of sentence type. 7253 structured abstracts and 204 unstructured abstracts of Randomized Controlled Trials from MedLINE were parsed into sentences and each sentence was labeled as one of four types (Introduction, Method, Result, or Conclusion). Support Vector Machine (SVM) and Linear Classifier models were generated and evaluated on cross-validated data. Treating sentences as a simple "bag of words", the SVM model had an average ROC area of 0.92. Adding a feature of relative sentence location improved performance markedly for some models and overall increasing the average ROC to 0.95. Linear classifier performance was significantly worse than the SVM in all datasets. Using the SVM model trained on structured abstracts to predict unstructured abstracts yielded performance similar to that of models trained with unstructured abstracts in 3 of the 4 types. We conclude that classification of sentence type seems feasible within the domain of RCT's. Identification of sentence types may be helpful for providing context to end users or other text summarization techniques.

Mesh：

Year: 2003 PMID： 14728211 PMCID： PMC1479904

Source DB: PubMed Journal: AMIA Annu Symp Proc ISSN： 1559-4076

2 in total

1. Exploring text mining from MEDLINE.

Authors: Padmini Srinivasan; Thomas Rindflesch
Journal: Proc AMIA Symp Date: 2002

2. Developing optimal search strategies for detecting clinically sound studies in MEDLINE.

Authors: R B Haynes; N Wilczynski; K A McKibbon; C J Walker; J C Sinclair
Journal: J Am Med Inform Assoc Date: 1994 Nov-Dec Impact factor: 4.497

2 in total

25 in total

1. Automatically identifying health outcome information in MEDLINE records.

Authors: Dina Demner-Fushman; Barbara Few; Susan E Hauser; George Thoma
Journal: J Am Med Inform Assoc Date: 2005-10-12 Impact factor: 4.497

2. Combining text classification and Hidden Markov Modeling techniques for categorizing sentences in randomized clinical trial abstracts.

Authors: Rong Xu; Kaustubh Supekar; Yang Huang; Amar Das; Alan Garber
Journal: AMIA Annu Symp Proc Date: 2006

3. Shallow semantic parsing of randomized controlled trial reports.

Authors: Hyung Paek; Yacov Kogan; Prem Thomas; Seymour Codish; Michael Krauthammer
Journal: AMIA Annu Symp Proc Date: 2006

4. Automated information extraction of key trial design elements from clinical trial publications.

Authors: Berry de Bruijn; Simona Carini; Svetlana Kiritchenko; Joel Martin; Ida Sim
Journal: AMIA Annu Symp Proc Date: 2008-11-06

5. Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion.

Authors: Shashank Agarwal; Hong Yu
Journal: Bioinformatics Date: 2009-09-25 Impact factor: 6.937

Categorization of sentence types in medical abstracts.

1. Exploring text mining from MEDLINE.

2. Developing optimal search strategies for detecting clinically sound studies in MEDLINE.

1. Automatically identifying health outcome information in MEDLINE records.

2. Combining text classification and Hidden Markov Modeling techniques for categorizing sentences in randomized clinical trial abstracts.

3. Shallow semantic parsing of randomized controlled trial reports.

4. Automated information extraction of key trial design elements from clinical trial publications.

5. Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion.

6. Building an automated SOAP classifier for emergency department reports.

7. Dynamic categorization of clinical research eligibility criteria by hierarchical clustering.

8. Collection of cancer stage data by classifying free-text medical reports.

9. Combining classifiers for robust PICO element detection.

10. Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension.