Yoshinobu Kano, William A Baumgartner, Luke McCrohon, Sophia Ananiadou, K Bretonnel Cohen, Lawrence Hunter, Jun'ichi Tsujii.
Abstract
SUMMARY: Due to the increasing number of text mining resources (tools and corpora) available to biologists, interoperability issues between these resources are becoming significant obstacles to using them effectively. UIMA, the Unstructured Information Management Architecture, is an open framework designed to aid in the construction of more interoperable tools. U-Compare is built on top of the UIMA framework, and provides both a concrete framework for out-of-the-box text mining and a sophisticated evaluation platform allowing users to run specific tools on any target text, generating both detailed statistics and instance-based visualizations of outputs. U-Compare is a joint project, providing the world's largest, and still growing, collection of UIMA-compatible resources. These resources, originally developed by different groups for a variety of domains, include many famous tools and corpora. U-Compare can be launched straight from the web, without needing to be manually installed. All U-Compare components are provided ready-to-use and can be combined easily via a drag-and-drop interface without any programming. External UIMA components can also simply be mixed with U-Compare components, without distinguishing between locally and remotely deployed resources.
AVAILABILITY: http://u-compare.org/
Year: 2009 PMID: 19414535 PMCID: PMC2712335 DOI: 10.1093/bioinformatics/btp289
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Partial list of currently ready-to-use components in U-Compare
| Component type | Component names |
|---|---|
| Collection readers | AImed, Bio1, BioIE, Texas, Yapex, NLPBA |
| Sentence detectors | GENIA, LingPipe, NaCTeM, OpenNLP, UIMA |
| Tokenizers | GENIA, OpenNLP, UIMA, PennBio |
| POS taggers | GENIA, LingPipe, OpenNLP, Stepp |
| Syntactic parsers | Enju HPSG Parser, OpenNLP Parser, Stanford Parser |
| Relation extractors | Akane++, BioNLP '09 Shared Task Format Reader |
| Named entity recognizers | ABNER, GENIA Tagger, NeMine, MedTNER, MedTNER-M, LingPipe Entity Tagger, OpenNLP |
Fig. 1. Screenshots of (A) U-Compare Statistics Viewer showing comparison between AImed corpus and three NERs; (B) U-Compare Tree and Feature Structure Visualizer showing an HPSG syntactic tree; and (C) U-Compare Graphical Annotation Viewer showing biological event annotations.
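
The kind of comparison shown in panel (A), where the outputs of several named entity recognizers are scored against a gold-standard corpus, can be illustrated with a minimal sketch. This is not U-Compare's implementation and does not use the UIMA API: the `Span` type, the `score` function, the tagger names, and the toy annotations are all hypothetical, and exact-match scoring over character offsets is assumed purely for illustration.

```python
from typing import NamedTuple


class Span(NamedTuple):
    """A hypothetical annotation: character offsets plus an entity label."""
    begin: int
    end: int
    label: str


def score(gold: set[Span], predicted: set[Span]) -> dict[str, float]:
    """Exact-match precision, recall and F1 of predicted spans against gold spans."""
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return {"precision": precision, "recall": recall, "f1": f1}


# Toy data, invented for illustration only (not taken from any real corpus).
gold = {Span(0, 4, "protein"), Span(10, 15, "protein"), Span(20, 26, "protein")}
tagger_outputs = {
    "tagger_a": {Span(0, 4, "protein"), Span(10, 15, "protein")},
    "tagger_b": {Span(0, 4, "protein"), Span(11, 15, "protein"),
                 Span(30, 33, "protein")},
}

# Score every tagger against the same gold standard, side by side.
results = {name: score(gold, spans) for name, spans in tagger_outputs.items()}
for name, metrics in results.items():
    print(name, {key: round(value, 3) for key, value in metrics.items()})
```

Exact matching is the strictest choice: `tagger_b`'s span starting at offset 11 counts as a miss even though it overlaps a gold span. A real evaluation may also report relaxed (overlap-based) matching.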