| Literature DB >> 28466087 |
Shervin Malmasi, Nicolae L Sandor, Naoshi Hosomura, Matt Goldberg, Stephen Skentzos, Alexander Turchin1.
Abstract
Information Extraction methods can help discover critical knowledge buried in the vast repositories of unstructured clinical data. However, these methods are underutilized in clinical research, potentially due to the absence of free software geared towards clinicians with little technical expertise. The skills required for developing/using such software constitute a major barrier for medical researchers wishing to employ these methods. To address this, we have developed Canary, a free and open-source solution designed for users without natural language processing (NLP) or software engineering experience. It was designed to be fast and work out of the box via a user-friendly graphical interface.Entities:
Keywords: Information extraction; clinical informatics; natural language processing
Mesh:
Year: 2017 PMID: 28466087 PMCID: PMC6241741 DOI: 10.4338/ACI-2017-01-IE-0018
Source DB: PubMed Journal: Appl Clin Inform ISSN: 1869-0327 Impact factor: 2.342
Fig. 1An overview of the Canary software. The main Canary user interface is shown, along with examples of preprocessing, vocabulary and phrase structure rules that can be created.
Fig. 2A typical Canary project begins with data collection and annotation. The annotated data are used to develop and evaluate a model, which is then used to process unannotated documents.