Zhe He, C Paul Morrey, Yehoshua Perl, Gai Elhanan, Ling Chen, Yan Chen, James Geller.
Abstract
BACKGROUND: The Refined Semantic Network (RSN) for the UMLS was previously introduced to complement the UMLS Semantic Network (SN). The RSN partitions the UMLS Metathesaurus (META) into disjoint groups of concepts, each of which is semantically uniform. However, the RSN was initially an order of magnitude larger than the SN, which is undesirable because a semantic network must be compact to be useful. Most semantic types in the RSN represent combinations of semantic types in the UMLS SN. Such a "combination semantic type" is called an Intersection Semantic Type (IST). Many ISTs are assigned to very few concepts. Moreover, a review of those concepts revealed many inconsistencies in their semantic type assignments. After these inconsistencies were corrected, many ISTs disappeared, among them some that contradicted UMLS rules, which made the RSN smaller.
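The partitioning idea described in the abstract can be illustrated with a minimal Python sketch. It assumes that each concept carries a set of semantic types and that concepts sharing exactly the same set form one group; a group defined by more than one semantic type then plays the role of an IST. The concept identifiers, the function names, and the threshold of 6 for a "small" extent (chosen to mirror the extent sizes 1 to 6 listed in the tables) are illustrative assumptions, not the authors' implementation.

```python
from collections import defaultdict

# Hypothetical toy data: concept identifier -> assigned semantic types.
# The identifiers are placeholders, not real Metathesaurus CUIs.
concept_types = {
    "concept_a": {"Organic Chemical", "Pharmacologic Substance"},
    "concept_b": {"Organic Chemical", "Pharmacologic Substance"},
    "concept_c": {"Organic Chemical"},
    "concept_d": {"Disease or Syndrome"},
    "concept_e": {"Disease or Syndrome", "Finding"},
}


def partition_by_type_set(assignments):
    """Group concepts by their exact set of semantic types.

    Each concept lands in exactly one group, so the groups are disjoint.
    """
    groups = defaultdict(set)
    for concept, types in assignments.items():
        groups[frozenset(types)].add(concept)
    return dict(groups)


def intersection_types(groups):
    """Keep only groups defined by a combination of two or more types."""
    return {types: extent for types, extent in groups.items() if len(types) > 1}


def small_extents(ists, max_size=6):
    """Keep ISTs whose extent (number of concepts) is at most max_size."""
    return {types: extent for types, extent in ists.items() if len(extent) <= max_size}


groups = partition_by_type_set(concept_types)
ists = intersection_types(groups)
for types, extent in ists.items():
    print(sorted(types), "->", sorted(extent))
```

ISTs with small extents are the natural candidates for manual review, since a combination shared by only a handful of concepts may reflect an assignment inconsistency rather than a genuine category.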
Keywords: Abstraction Network; Correction of Inconsistencies; Intersection Semantic Types; Refined Semantic Network; Refined Semantic Types; Semantic Network; UMLS
Year: 2014 PMID: 25422719 PMCID: PMC4235323 DOI: 10.5210/ojphi.v6i2.5412
Source DB: PubMed Journal: Online J Public Health Inform ISSN: 1947-2579
Figure 1. Example of a concept assigned two semantic types.
Figure 2. Example of an IST of two semantic types.
Figure 3. The methodology framework of “sculpting” the RSN for the UMLS.
Progress of RSN over time
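| UMLS release | Concepts in META | Semantic types in SN | ISTs in RSN | | | ISTs with extent 1 | ISTs with extent 2 | ISTs with extent 3 | ISTs with extent 4 | ISTs with extent 5 | ISTs with extent 6 | ISTs with small extents (1 to 6) | Concepts in ISTs with small extents |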
| 1998 | 476K | 132 | 1163 | 8622 | 77 | 422 | n/a | n/a | n/a | n/a | n/a | n/a | n/a |
| 2001 | 800K | 134 | 874 | 12161 | 40 | 322 | 113 | 64 | 35 | 28 | 25 | 587 | 1170 |
| 2006AC | 1.4M | 135 | 559 | 91 | 7 | 124 | 68 | 37 | 32 | 26 | 18 | 305 | 737 |
| 2007AA | 1.4M | 135 | 555 | 598 | 11 | 111 | 65 | 40 | 33 | 23 | 17 | 289 | 710 |
| 2007AC | 1.5M | 135 | 532 | 0 | 0 | 116 | 56 | 35 | 34 | 20 | 15 | 276 | 659 |
| 2008AA | 1.6M | 135 | 464 | 3 | 2 | 105 | 44 | 25 | 25 | 15 | 14 | 228 | 499 |
| 2008AB | 1.9M | 135 | 397 | 0 | 0 | 64 | 30 | 29 | 14 | 17 | 12 | 166 | 424 |
| 2009AA | 2.1M | 135 | 381 | 0 | 0 | 59 | 32 | 24 | 13 | 16 | 11 | 155 | 393 |
| 2009AB | 2.2M | 135 | 385 | 0 | 0 | 61 | 30 | 25 | 15 | 14 | 13 | 158 | 404 |
| 2010AA | 2.2M | 133 | 384 | 0 | 0 | 58 | 32 | 24 | 15 | 16 | 9 | 154 | 388 |
| 2010AB | 2.4M | 133 | 392 | 0 | 0 | 66 | 35 | 19 | 16 | 16 | 8 | 160 | 385 |
| 2011AA | 2.4M | 133 | 409 | 1 | 1 | 75 | 38 | 24 | 16 | 17 | 6 | 176 | 408 |
| 2011AB | 2.6M | 133 | 406 | 0 | 0 | 72 | 34 | 25 | 16 | 19 | 8 | 174 | 422 |
| 2012AA | 2.6M | 133 | 407 | 0 | 0 | 73 | 33 | 26 | 16 | 17 | 7 | 172 | 408 |
| 2012AB | 2.8M | 133 | 402 | 0 | 0 | 61 | 37 | 26 | 14 | 18 | 9 | 165 | 413 |
| 2013AA | 2.9M | 133 | 401 | 0 | 0 | 63 | 33 | 27 | 18 | 16 | 11 | 168 | 428 |
| 2013 Audit | 2.9M | 133 | 336 | 0 | 0 | 48 | 28 | 10 | 3 | 8 | 6 | 103 | 222 |
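The last eight columns of this table can be cross-checked arithmetically. The sketch below assumes that the six columns before the final two count the ISTs whose extent is 1, 2, ..., 6 concepts, that the next column is their sum, and that the last column is the number of concepts those ISTs cover. This reading is inferred from the numbers themselves, since the column headers did not survive; the function name is hypothetical. The 1998 row is omitted because most of its cells are n/a.

```python
# (release, ISTs with extent 1..6, reported IST total, reported concept total)
rows = [
    ("2001",       [322, 113, 64, 35, 28, 25], 587, 1170),
    ("2006AC",     [124, 68, 37, 32, 26, 18],  305, 737),
    ("2007AA",     [111, 65, 40, 33, 23, 17],  289, 710),
    ("2007AC",     [116, 56, 35, 34, 20, 15],  276, 659),
    ("2008AA",     [105, 44, 25, 25, 15, 14],  228, 499),
    ("2008AB",     [64, 30, 29, 14, 17, 12],   166, 424),
    ("2009AA",     [59, 32, 24, 13, 16, 11],   155, 393),
    ("2009AB",     [61, 30, 25, 15, 14, 13],   158, 404),
    ("2010AA",     [58, 32, 24, 15, 16, 9],    154, 388),
    ("2010AB",     [66, 35, 19, 16, 16, 8],    160, 385),
    ("2011AA",     [75, 38, 24, 16, 17, 6],    176, 408),
    ("2011AB",     [72, 34, 25, 16, 19, 8],    174, 422),
    ("2012AA",     [73, 33, 26, 16, 17, 7],    172, 408),
    ("2012AB",     [61, 37, 26, 14, 18, 9],    165, 413),
    ("2013AA",     [63, 33, 27, 18, 16, 11],   168, 428),
    ("2013 Audit", [48, 28, 10, 3, 8, 6],      103, 222),
]


def find_mismatches(table_rows):
    """Return (release, computed IST total, computed concept total) for
    every row whose reported totals differ from the recomputed ones."""
    mismatches = []
    for release, counts, reported_ists, reported_concepts in table_rows:
        ist_total = sum(counts)
        # An IST with extent k contributes k concepts.
        concept_total = sum(size * n for size, n in enumerate(counts, start=1))
        if (ist_total, concept_total) != (reported_ists, reported_concepts):
            mismatches.append((release, ist_total, concept_total))
    return mismatches


print(find_mismatches(rows))  # [('2008AA', 228, 527)]
```

Under this reading every row is internally consistent except 2008AA, where the per-extent counts imply 527 concepts but the table reports 499. The sketch only flags the discrepancy; it cannot tell which cell is in error.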
Figure 4. Progress of the Semantic Network, ISTs and ISTs with small extents. Blue bars show the number of semantic types in the UMLS Semantic Network. Red bars show the number of ISTs in the RSN. Green bars show the number of ISTs with small extents.
Progress of IST removal in the past five releases
| | 2011AA | 2011AB | 2012AA | 2012AB | 2013AA | Total |
| ISTs in the RSN | 409 | 406 | 407 | 402 | 401 | |
| ISTs with small extents | 176 | 174 | 172 | 165 | 168 | |
| New ISTs | 23 | 17 | 13 | 14 | 11 | 78 |
| | 12 | 6 | 4 | 6 | 7 | 35 |
| | 3 | 1 | 3 | 1 | 3 | 11 |
| Removed ISTs | 6 | 20 | 12 | 19 | 12 | 69 |
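The release-to-release bookkeeping in this table can be checked with a short Python sketch. It assumes that the row 23, 17, 13, 14, 11 counts ISTs that are new in each release and that the row 6, 20, 12, 19, 12 counts ISTs removed in each release. This is an inference from the arithmetic, because the row labels were lost. The releases are taken to be 2011AA through 2013AA, and the 2010AB starting count of 392 comes from the "Progress of RSN over time" table. The variable and function names are hypothetical.

```python
# IST totals per release; 2010AB is the baseline before the five releases.
ist_totals = {
    "2010AB": 392,
    "2011AA": 409,
    "2011AB": 406,
    "2012AA": 407,
    "2012AB": 402,
    "2013AA": 401,
}

# Assumed meaning of two table rows (labels not recoverable from the source).
new_ists = {"2011AA": 23, "2011AB": 17, "2012AA": 13, "2012AB": 14, "2013AA": 11}
removed_ists = {"2011AA": 6, "2011AB": 20, "2012AA": 12, "2012AB": 19, "2013AA": 12}


def unbalanced_releases(totals, added, removed):
    """Return releases where previous + new - removed != reported total."""
    releases = list(totals)  # dicts preserve insertion order
    problems = []
    for previous, current in zip(releases, releases[1:]):
        expected = totals[previous] + added[current] - removed[current]
        if expected != totals[current]:
            problems.append((current, expected, totals[current]))
    return problems


print(unbalanced_releases(ist_totals, new_ists, removed_ists))  # []
print(sum(new_ists.values()), sum(removed_ists.values()))       # 78 69
```

With this interpretation every release balances, and the sums match the table's final column (78 and 69). Over the five releases the number of ISTs rose by only 9 (392 to 401), although 78 ISTs appeared and 69 disappeared.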
New ISTs in UMLS release 2013AA.
|
|
|
| |||||||
| 1 | 2012AA | 2011AB | 2011AA | 2010AB | 2010AA | 2009AB | 2008AA | 2007AC | |
| 1 | 2011AA | 2007AC | 2007AB | ||||||
| 1 | 2008AA | 2007AC | 2007AB | 2007AA | |||||
| 1 | 2007AC | 2007AB | 2007AA | ||||||
| 1 | |||||||||
| 4 | 2012AA | 2008AA | 2007AC | 2007AB | 2007AA | ||||
| 1 | |||||||||
| 2 | |||||||||
| 5 | |||||||||
| 2 | 2008AA | ||||||||
| 1 | 2007AA | ||||||||
|
| |||||||||
| IST removed once | |||||||||
| IST removed twice | |||||||||
| IST appeared the first time | |||||||||
| IST appeared the second time | |||||||||
New ISTs in 2012AA
|
|
|
| |||
| 1 | |||||
| 4 | |||||
| 1 | |||||
| 1 | 2008AA | 2007AB | 2007AA | ||
| 1 | 2010AB | ||||
| 1 | |||||
| 1 | |||||
| 1 | |||||
| 1 | |||||
| 2 | |||||
| 2 | 2008AA | 2007AB | 2007AA | ||
| 1 | 2008AA | 2007AB | 2007AA | ||
| 1 | |||||
|
| |||||
| IST removed once | |||||
| IST removed twice | |||||
| IST appeared the first time | |||||
| IST appeared the second time | |||||
Auditing impact on 2013AA non-Chemical ISTs of the sculpted RSN
| Extent size | ISTs in 2013AA | | | | | ISTs remaining after audit | |
| 1 | 7 | 5 | 71.4% | 1 | 14.3% | 3 | 57.1% |
| 2 | 3 | 2 | 66.7% | 0 | 0% | 1 | 66.7% |
| 3 | 5 | 3 | 60% | 1 | 33.3% | 3 | 60% |
| 4 | 6 | 4 | 66.7% | 0 | 0% | 2 | 33.3% |
| 5 | 2 | 1 | 50% | 0 | 0% | 1 | 50% |
| 6 | 6 | 1 | 16.7% | 0 | 0% | 5 | 16.7% |
| Total | 29 | 16 | 55.2% | 2 | 6.9% | 15 | 48.3% |
Auditing impact on 2013AA Chemical ISTs of the sculpted RSN
| Extent size | ISTs in 2013AA | | | | | ISTs remaining after audit | |
| 1 | 56 | 44 | 78.5% | 33 | 58.9% | 45 | 19.6% |
| 2 | 30 | 19 | 63.3% | 16 | 53.3% | 27 | 10% |
| 3 | 22 | 21 | 95.5% | 6 | 27.3% | 7 | 68.2% |
| 4 | 12 | 11 | 91.7% | 0 | 0% | 1 | 91.7% |
| 5 | 14 | 10 | 71.4% | 3 | 21.4% | 7 | 50% |
| 6 | 5 | 4 | 80% | 0 | 0% | 1 | 80% |
| Total | 139 | 109 | 78.4% | 58 | 41.7% | 88 | 36.7% |
Figure 5. An excerpt of the RSN after sculpting. This figure shows all the ISTs with at least one non-chemical ST and their ancestors. All the Chemical STs are marked in red. All the ISTs are shown as yellow boxes.