Literature DB >> 23276747

Approaches to verb subcategorization for biomedicine.

Thomas Lippincott1, Laura Rimell, Karin Verspoor, Anna Korhonen.   

Abstract

Information about verb subcategorization frames (SCFs) is important to many tasks in natural language processing (NLP) and, in turn, text mining. Biomedicine has a need for high-quality SCF lexicons to support the extraction of information from the biomedical literature, which helps biologists to take advantage of the latest biomedical knowledge despite the overwhelming growth of that literature. Unfortunately, techniques for creating such resources for biomedical text are relatively undeveloped compared to general language. This paper serves as an introduction to subcategorization and existing approaches to acquisition, and provides motivation for developing techniques that address issues particularly important to biomedical NLP. First, we give the traditional linguistic definition of subcategorization, along with several related concepts. Second, we describe approaches to learning SCF lexicons from large data sets for general and biomedical domains. Third, we consider the crucial issue of linguistic variation between biomedical fields (subdomain variation). We demonstrate significant variation among subdomains, and find the variation does not simply follow patterns of general lexical variation. Finally, we note several requirements for future research in biomedical SCF lexicon acquisition: a high-quality gold standard, investigation of different definitions of subcategorization, and minimally-supervised methods that can learn subdomain-specific lexical usage without the need for extensive manual work.
Copyright © 2012 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2012        PMID: 23276747     DOI: 10.1016/j.jbi.2012.12.001

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  2 in total

1.  A generalizable NLP framework for fast development of pattern-based biomedical relation extraction systems.

Authors:  Yifan Peng; Manabu Torii; Cathy H Wu; K Vijay-Shanker
Journal:  BMC Bioinformatics       Date:  2014-08-23       Impact factor: 3.169

2.  BioVerbNet: a large semantic-syntactic classification of verbs in biomedicine.

Authors:  Olga Majewska; Charlotte Collins; Simon Baker; Jari Björne; Susan Windisch Brown; Anna Korhonen; Martha Palmer
Journal:  J Biomed Semantics       Date:  2021-07-15
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.