| Literature DB >> 29218915 |
Mayla Boguslav1, K Bretonnel Cohen, William A Baumgartner, Lawrence E Hunter.
Abstract
Most natural language processing applications exhibit a trade-off between precision and recall. In some use cases for natural language processing, there are reasons to prefer to tilt that trade-off toward high precision. Relying on the Zipfian distribution of false positive results, we describe a strategy for increasing precision, using a variety of both pre-processing and post-processing methods. They draw on both knowledge-based and frequentist approaches to modeling language. Based on an existing high-performance biomedical concept recognition pipeline and a previously published manually annotated corpus, we apply this hybrid rationalist/empiricist strategy to concept normalization for eight different ontologies. Which approaches did and did not improve precision varied widely between the ontologies.Entities:
Mesh:
Year: 2018 PMID: 29218915 PMCID: PMC5730334
Source DB: PubMed Journal: Pac Symp Biocomput ISSN: 2335-6928