Literature DB >> 26357193

Visual Classifier Training for Text Document Retrieval.

F Heimerl1, S Koch, H Bosch, T Ertl.   

Abstract

Performing exhaustive searches over a large number of text documents can be tedious, since it is very hard to formulate search queries or define filter criteria that capture an analyst's information need adequately. Classification through machine learning has the potential to improve search and filter tasks encompassing either complex or very specific information needs, individually. Unfortunately, analysts who are knowledgeable in their field are typically not machine learning specialists. Most classification methods, however, require a certain expertise regarding their parametrization to achieve good results. Supervised machine learning algorithms, in contrast, rely on labeled data, which can be provided by analysts. However, the effort for labeling can be very high, which shifts the problem from composing complex queries or defining accurate filters to another laborious task, in addition to the need for judging the trained classifier's quality. We therefore compare three approaches for interactive classifier training in a user study. All of the approaches are potential candidates for the integration into a larger retrieval system. They incorporate active learning to various degrees in order to reduce the labeling effort as well as to increase effectiveness. Two of them encompass interactive visualization for letting users explore the status of the classifier in context of the labeled documents, as well as for judging the quality of the classifier in iterative feedback loops. We see our work as a step towards introducing user controlled classification methods in addition to text search and filtering for increasing recall in analytics scenarios involving large corpora.

Entities:  

Mesh:

Year:  2012        PMID: 26357193     DOI: 10.1109/TVCG.2012.277

Source DB:  PubMed          Journal:  IEEE Trans Vis Comput Graph        ISSN: 1077-2626            Impact factor:   4.579


  6 in total

1.  NLPReViz: an interactive tool for natural language processing on clinical text.

Authors:  Gaurav Trivedi; Phuong Pham; Wendy W Chapman; Rebecca Hwa; Janyce Wiebe; Harry Hochheiser
Journal:  J Am Med Inform Assoc       Date:  2018-01-01       Impact factor: 4.497

2.  Interactive NLP in Clinical Care: Identifying Incidental Findings in Radiology Reports.

Authors:  Gaurav Trivedi; Esmaeel R Dadashzadeh; Robert M Handzel; Wendy W Chapman; Shyam Visweswaran; Harry Hochheiser
Journal:  Appl Clin Inform       Date:  2019-09-04       Impact factor: 2.342

3.  Facetto: Combining Unsupervised and Supervised Learning for Hierarchical Phenotype Analysis in Multi-Channel Image Data.

Authors:  Robert Krueger; Johanna Beyer; Won-Dong Jang; Nam Wook Kim; Artem Sokolov; Peter K Sorger; Hanspeter Pfister
Journal:  IEEE Trans Vis Comput Graph       Date:  2019-09-10       Impact factor: 4.579

4.  Emotion computing using Word Mover's Distance features based on Ren_CECps.

Authors:  Fuji Ren; Ning Liu
Journal:  PLoS One       Date:  2018-04-06       Impact factor: 3.240

5.  Interactive Exploration of Longitudinal Cancer Patient Histories Extracted From Clinical Text.

Authors:  Zhou Yuan; Sean Finan; Jeremy Warner; Guergana Savova; Harry Hochheiser
Journal:  JCO Clin Cancer Inform       Date:  2020-05

6.  Projections as visual aids for classification system design.

Authors:  Paulo E Rauber; Alexandre X Falcão; Alexandru C Telea
Journal:  Inf Vis       Date:  2017-06-27       Impact factor: 0.956

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.