| Literature DB >> 30658707 |
Billy Chiu1, Olga Majewska2, Sampo Pyysalo2, Laura Wey3, Ulla Stenius4, Anna Korhonen2, Martha Palmer5.
Abstract
BACKGROUND: VerbNet, an extensive computational verb lexicon for English, has proved useful for supporting a wide range of Natural Language Processing tasks requiring information about the behaviour and meaning of verbs. Biomedical text processing and mining could benefit from a similar resource. We take the first step towards the development of BioVerbNet: A VerbNet specifically aimed at describing verbs in the area of biomedicine. Because VerbNet-style classification is extremely time consuming, we start from a small manual classification of biomedical verbs and apply a state-of-the-art neural representation model, specifically developed for class-based optimization, to expand the classification with new verbs, using all the PubMed abstracts and the full articles in the PubMed Central Open Access subset as data.Entities:
Keywords: Representation learning; Verb lexicon
Mesh:
Year: 2019 PMID: 30658707 PMCID: PMC6339329 DOI: 10.1186/s13326-018-0193-x
Source DB: PubMed Journal: J Biomed Semantics
Example gold standard classes and class members from Korhonen et al. (2006) [15]
| Index | Class name | Subclass name | Example members |
|---|---|---|---|
|
| Biochemical events | Biochemical modification | dephosphorylate, phosphorylate |
|
| Experimental procedure | Label | stain, label, immunoblot, probe, fix |
|
| Precipitate | coprecipitate, coimmunoprecipitate, precipitate | |
|
| Report | Examine | assess, evaluate, estimate, examine, explore, analyze |
|
| Establish | establish, test, investigate | |
|
| Presentational | argue, hypothesize, conclude, reason, note, speculate, assume | |
|
| Perform | Quantitate | quantify, quantitate, measure, monitor |
|
| Release | Release | release, detach, excise, dissociate |
|
| Use | Use | utilize, employ, exploit |
|
| Call | Call | name, designate |
|
| Appear | Appear | become, occur, seem |
Performance on the BioSimVerb (in ρ) using representations learned with different context configurations. (Bold: best-performing configuration and its score)
| Baseline | Spearman’s |
|---|---|
| BOW (win=5) | 0.4664 |
| DEP-ALL | 0.4323 |
| Configurations: Verb | |
| POOL-ALL | 0.4724 |
| conj+obj+pcomp+prep+rel+subj | 0.475 |
| conj+obj+prep+rel+subj |
|
| conj+obj+pcomp+prep+subj | 0.4578 |
| conj+obj+pcomp+rel+subj | 0.4478 |
| conj+obj+pcomp+prep+rel | 0.4406 |
| conj+obj+prep+subj | 0.4611 |
| conj+obj+rel+subj | 0.4572 |
| conj+obj+prep+rel | 0.442 |
| comp+obj+pcomp+prep+rel+subj | 0.4376 |
| comp+conj+obj+prep+rel+subj | 0.4762 |
| comp+conj+obj+pcomp+prep+subj | 0.4655 |
| comp+conj+obj+pcomp+rel+subj | 0.4583 |
| comp+conj+obj+pcomp+prep+rel | 0.4413 |
| comp+conj+obj+prep+subj | 0.4635 |
| comp+conj+obj+rel+subj | 0.4592 |
| comp+conj+obj+prep+rel | 0.442 |
| obj+pcomp+prep+rel+subj | 0.4446 |
| obj+prep+rel+subj | 0.441 |
BOW denotes a basic SGNS learned with bag-of-words context with context window size 5. DEP-ALL denotes a configuration where no filtering of context is used. POOL-ALL denotes a configuration where all individual context bags from the verb-related pools are used. “Best” identifies the best-performing configuration found
Example classes validated by experts
| Index | Subclass name | Example members | New candidates |
|---|---|---|---|
|
| Suppress | suppress, repress | downregulate, transactivate |
|
| Collect | harvest, select, collect | decide, pick, cultivate, procure, gather, choose, transfuse, prioritize, obtain |
|
| Encompass | encompass, possess, comprise, bear, span, harbor | overlie, display, hold, exhibit, cover, infest, belong, range |
|
| Call | call, name, designate | qualify, regard, rename, mention, request |
|
| Modify | modify, catalyze | hydroxylate, hydrolyze, methylate, deaminate, esterify, oxidize, detoxify, metabolize |
|
| Label | stain, label, immunoblot, probe, fix | supershift, assay, immunostain, tag, immunolabel, clone, postfix, digest, clamp, counterstain, buffer, electroblot, fluoresce, radiolabel, blot |
|
| Release | release, detach, excise, dissociate | reinsert, retract, disassemble, deacylate, extrude, remove, depolymerize, mobilize, lose, resect, separate |
|
| Conduct | perform, conduct | execute, undertake |
Results of class validation by experts, for seven general scientific (General) and seven biomedical classes (Biomedical), and across the two domains (Total). Bold: the total no of correct/incorrect candidates (in %) as rated by annotators of each sub-group, and the sum of the two
| No. of new candidates | No. of correct candidates | % correct candidates | No. of incorrect candidates | % incorrect candidates | |
|---|---|---|---|---|---|
|
| 9 | 6 | 66.7 | 3 | 33.3 |
|
| 21 | 19 | 90.5 | 2 | 9.5 |
|
| 11 | 10 | 90.9 | 1 | 9.1 |
|
| 2 | 2 | 100 | 0 | 0.0 |
|
| 8 | 6 | 75.0 | 2 | 25.0 |
|
| 5 | 4 | 80.0 | 1 | 20.0 |
|
| 19 | 16 | 84.2 | 3 | 15.8 |
|
| 75 | 63 |
| 12 |
|
|
| 2 | 2 | 100 | 0 | 0.0 |
|
| 15 | 11 | 73.3 | 4 | 26.7 |
|
| 8 | 6 | 75 | 2 | 25 |
|
| 21 | 19 | 90.5 | 2 | 9.5 |
|
| 15 | 11 | 73.3 | 4 | 26.7 |
|
| 19 | 17 | 89.5 | 2 | 10.5 |
|
| 11 | 10 | 90.9 | 1 | 9.1 |
|
| 91 | 76 |
| 15 |
|
|
| 166 | 139 |
| 27 |
|