| Literature DB >> 30937439 |
Jin-Dong Kim1, Yue Wang1, Toyofumi Fujiwara1, Shujiro Okuda2, Tiffany J Callahan3, K Bretonnel Cohen3,4.
Abstract
MOTIVATION: Most currently available text mining tools share two characteristics that make them less than optimal for use by biomedical researchers: they require extensive specialist skills in natural language processing and they were built on the assumption that they should optimize global performance metrics on representative datasets. This is a problem because most end-users are not natural language processing specialists and because biomedical researchers often care less about global metrics like F-measure or representative datasets than they do about more granular metrics such as precision and recall on their own specialized datasets. Thus, there are fundamental mismatches between the assumptions of much text mining work and the preferences of potential end-users.Entities:
Mesh:
Year: 2019 PMID: 30937439 PMCID: PMC6821251 DOI: 10.1093/bioinformatics/btz227
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Communication models of PubAnnotation: passive communication model, (a) and (b), and active communication model, (c)
Fig. 2.Three TextAE instances which render different annotations: They are parts of the results of the SPARQL search shown in Figure 3. The figure shows the display of identifiers from multiple vocabularies
Fig. 3.A pre-defined SPARQL template for the Preeclampsia project. SPARQL is powerful for searching across datasets and annotation categories, but difficult to learn. PubAnnotation’s support for pre-defined templates allows uses to search for arbitrarily specific entities without learning SPARQL
Fig. 4.The number of locations per epitope, that are stored in database (indicated by black bars), and that are extracted from literature (white bars). Epitopes are sorted by the number of locations extracted from literature. The difference indicates the potential of further curation through mining the annotation
Fig. 5.Some indicators of association between Lewis X and brain which are found through the SPARQL search over the annotated data
Fig. 6.Distribution of articles mentioning PE-associated genes