Literature DB >> 17238323

An effective general purpose approach for automated biomedical document classification.

Aaron M Cohen1.   

Abstract

Automated document classification can be a valuable tool for biomedical tasks that involve large amounts of text. However, in biomedicine, documents that have the desired properties are often rare, and special methods are usually required to address this issue. We propose and evaluate a method of classifying biomedical text documents, optimizing for utility when misclassification costs are highly asymmetric between the positive and negative classes. The method uses chi-square feature selection and several iterations of cost proportionate rejection sampling followed by application of a support vector machine (SVM), combining the resulting classifier results with voting. It is straightforward, fast, and achieves competitive performance on a set of standardized biomedical text classification evaluation tasks. The method is a good general purpose approach for classifying biomedical text.

Mesh:

Year:  2006        PMID: 17238323      PMCID: PMC1839342     

Source DB:  PubMed          Journal:  AMIA Annu Symp Proc        ISSN: 1559-4076


  3 in total

1.  Protein names precisely peeled off free text.

Authors:  Sven Mika; Burkhard Rost
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

2.  Text categorization models for high-quality article retrieval in internal medicine.

Authors:  Yindalon Aphinyanaphongs; Ioannis Tsamardinos; Alexander Statnikov; Douglas Hardin; Constantin F Aliferis
Journal:  J Am Med Inform Assoc       Date:  2004-11-23       Impact factor: 4.497

3.  Reducing workload in systematic review preparation using automated citation classification.

Authors:  A M Cohen; W R Hersh; K Peterson; Po-Yin Yen
Journal:  J Am Med Inform Assoc       Date:  2005-12-15       Impact factor: 4.497

  3 in total
  14 in total

1.  Five-way smoking status classification using text hot-spot identification and error-correcting output codes.

Authors:  Aaron M Cohen
Journal:  J Am Med Inform Assoc       Date:  2007-10-18       Impact factor: 4.497

2.  A system for classifying disease comorbidity status from medical discharge summaries using automated hotspot and negated concept detection.

Authors:  Kyle H Ambert; Aaron M Cohen
Journal:  J Am Med Inform Assoc       Date:  2009-04-23       Impact factor: 4.497

3.  Cross-topic learning for work prioritization in systematic review creation and update.

Authors:  Aaron M Cohen; Kyle Ambert; Marian McDonagh
Journal:  J Am Med Inform Assoc       Date:  2009-06-30       Impact factor: 4.497

4.  The role of the electronic medical record in the assessment of health related quality of life.

Authors:  Serguei V S Pakhomov; Nilay D Shah; Holly K Van Houten; Penny L Hanson; Steven A Smith
Journal:  AMIA Annu Symp Proc       Date:  2011-10-22

5.  Integrating image caption information into biomedical document classification in support of biocuration.

Authors:  Xiangying Jiang; Pengyuan Li; James Kadin; Judith A Blake; Martin Ringwald; Hagit Shatkay
Journal:  Database (Oxford)       Date:  2020-01-01       Impact factor: 3.451

6.  Automatic quality of life prediction using electronic medical records.

Authors:  Sergeui Pakhomov; Nilay Shah; Penny Hanson; Saranya Balasubramaniam; Steven A Smith; Steven Allan Smith
Journal:  AMIA Annu Symp Proc       Date:  2008-11-06

7.  Automatic classification of foot examination findings using clinical notes and machine learning.

Authors:  Serguei V S Pakhomov; Penny L Hanson; Susan S Bjornsen; Steven A Smith
Journal:  J Am Med Inform Assoc       Date:  2007-12-20       Impact factor: 4.497

8.  The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

Authors:  Martin Krallinger; Miguel Vazquez; Florian Leitner; David Salgado; Andrew Chatr-Aryamontri; Andrew Winter; Livia Perfetto; Leonardo Briganti; Luana Licata; Marta Iannuccelli; Luisa Castagnoli; Gianni Cesareni; Mike Tyers; Gerold Schneider; Fabio Rinaldi; Robert Leaman; Graciela Gonzalez; Sergio Matos; Sun Kim; W John Wilbur; Luis Rocha; Hagit Shatkay; Ashish V Tendulkar; Shashank Agarwal; Feifan Liu; Xinglong Wang; Rafal Rak; Keith Noto; Charles Elkan; Zhiyong Lu; Rezarta Islamaj Dogan; Jean-Fred Fontaine; Miguel A Andrade-Navarro; Alfonso Valencia
Journal:  BMC Bioinformatics       Date:  2011-10-03       Impact factor: 3.169

9.  MScanner: a classifier for retrieving Medline citations.

Authors:  Graham L Poulter; Daniel L Rubin; Russ B Altman; Cathal Seoighe
Journal:  BMC Bioinformatics       Date:  2008-02-19       Impact factor: 3.169

Review 10.  Using text mining for study identification in systematic reviews: a systematic review of current approaches.

Authors:  Alison O'Mara-Eves; James Thomas; John McNaught; Makoto Miwa; Sophia Ananiadou
Journal:  Syst Rev       Date:  2015-01-14
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.