Literature DB >> 14728211

Categorization of sentence types in medical abstracts.

Larry McKnight1, Padmini Srinivasan.   

Abstract

This study evaluated the use of machine learning techniques in the classification of sentence type. 7253 structured abstracts and 204 unstructured abstracts of Randomized Controlled Trials from MedLINE were parsed into sentences and each sentence was labeled as one of four types (Introduction, Method, Result, or Conclusion). Support Vector Machine (SVM) and Linear Classifier models were generated and evaluated on cross-validated data. Treating sentences as a simple "bag of words", the SVM model had an average ROC area of 0.92. Adding a feature of relative sentence location improved performance markedly for some models and overall increasing the average ROC to 0.95. Linear classifier performance was significantly worse than the SVM in all datasets. Using the SVM model trained on structured abstracts to predict unstructured abstracts yielded performance similar to that of models trained with unstructured abstracts in 3 of the 4 types. We conclude that classification of sentence type seems feasible within the domain of RCT's. Identification of sentence types may be helpful for providing context to end users or other text summarization techniques.

Mesh:

Year:  2003        PMID: 14728211      PMCID: PMC1479904     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  2 in total

1.  Exploring text mining from MEDLINE.

Authors:  Padmini Srinivasan; Thomas Rindflesch
Journal:  Proc AMIA Symp       Date:  2002

2.  Developing optimal search strategies for detecting clinically sound studies in MEDLINE.

Authors:  R B Haynes; N Wilczynski; K A McKibbon; C J Walker; J C Sinclair
Journal:  J Am Med Inform Assoc       Date:  1994 Nov-Dec       Impact factor: 4.497

  2 in total
  25 in total

1.  Automatically identifying health outcome information in MEDLINE records.

Authors:  Dina Demner-Fushman; Barbara Few; Susan E Hauser; George Thoma
Journal:  J Am Med Inform Assoc       Date:  2005-10-12       Impact factor: 4.497

2.  Combining text classification and Hidden Markov Modeling techniques for categorizing sentences in randomized clinical trial abstracts.

Authors:  Rong Xu; Kaustubh Supekar; Yang Huang; Amar Das; Alan Garber
Journal:  AMIA Annu Symp Proc       Date:  2006

3.  Shallow semantic parsing of randomized controlled trial reports.

Authors:  Hyung Paek; Yacov Kogan; Prem Thomas; Seymour Codish; Michael Krauthammer
Journal:  AMIA Annu Symp Proc       Date:  2006

4.  Automated information extraction of key trial design elements from clinical trial publications.

Authors:  Berry de Bruijn; Simona Carini; Svetlana Kiritchenko; Joel Martin; Ida Sim
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

5.  Automatically classifying sentences in full-text biomedical articles into Introduction, Methods, Results and Discussion.

Authors:  Shashank Agarwal; Hong Yu
Journal:  Bioinformatics       Date:  2009-09-25       Impact factor: 6.937

6.  Building an automated SOAP classifier for emergency department reports.

Authors:  Danielle Mowery; Janyce Wiebe; Shyam Visweswaran; Henk Harkema; Wendy W Chapman
Journal:  J Biomed Inform       Date:  2011-09-09       Impact factor: 6.317

7.  Dynamic categorization of clinical research eligibility criteria by hierarchical clustering.

Authors:  Zhihui Luo; Meliha Yetisgen-Yildiz; Chunhua Weng
Journal:  J Biomed Inform       Date:  2011-06-12       Impact factor: 6.317

8.  Collection of cancer stage data by classifying free-text medical reports.

Authors:  Iain A McCowan; Darren C Moore; Anthony N Nguyen; Rayleen V Bowman; Belinda E Clarke; Edwina E Duhig; Mary-Jane Fry
Journal:  J Am Med Inform Assoc       Date:  2007-08-21       Impact factor: 4.497

9.  Combining classifiers for robust PICO element detection.

Authors:  Florian Boudin; Jian-Yun Nie; Joan C Bartlett; Roland Grad; Pierre Pluye; Martin Dawes
Journal:  BMC Med Inform Decis Mak       Date:  2010-05-15       Impact factor: 2.796

10.  Are figure legends sufficient? Evaluating the contribution of associated text to biomedical figure comprehension.

Authors:  Hong Yu; Shashank Agarwal; Mark Johnston; Aaron Cohen
Journal:  J Biomed Discov Collab       Date:  2009-01-06
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.