| Literature DB >> 30066643 |
Hee-Jin Lee1, Yaoyun Zhang1, Min Jiang2, Jun Xu1, Cui Tao1, Hua Xu3.
Abstract
BACKGROUND: Most of the current work on clinical temporal relation identification follows the convention developed in the general domain, aiming to identify a comprehensive set of temporal relations from a document including both explicit and implicit relations. While such a comprehensive set can represent temporal information in a document in a complete manner, some of the temporal relations in the comprehensive set may not be essential depending on the clinical application of interest. Moreover, as the types of evidence that should be used to identify explicit and implicit relations are different, current clinical temporal relation identification systems that target both explicit and implicit relations still show low performances for practical use.Entities:
Keywords: Direct temporal relation; Information extraction; Syntactic structure; TLINK; Temporal relation identification
Mesh:
Year: 2018 PMID: 30066643 PMCID: PMC6069692 DOI: 10.1186/s12911-018-0627-5
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Fig. 1Visualization of a small partial temporal relation network excerpted from the i2b2 2012 corpora [14]. Two sentences contain 8 temporal mentions and 26 temporal relations. For clarity, when there are more than one relations for a pair of temporal mentions, only one of the relations is shown
Fig. 2Examples of direct temporal relations. Five examples with labels, a, b, c, d, e, are illustrated
Fig. 3Examples of non-direct temporal relations. Four examples with labels, a, b, c, d, are illustrated
Examples of type-preserving phrases
| Templatea | Examplesb |
|---|---|
| the first course of [Event-Treatment] | the first course of |
| episodes of [Event-Problem] | episodes of |
| the time of [Time-Date] | the time of |
| period of [Time-Duration] | period of |
aIn the templates, squared parentheses mark the placeholder for a time expression or an event mention
bIn the examples, italicized letters mark a time expression or an event mention
Fig. 4The process of direct temporal relations corpus construction
Rules for transitive closure calculationa
| - If A overlap B, then B overlap A |
aIt is possible that the rules may produce false positive or false negative temporal relations. For instance, A and C may not overlap even when A overlap B and B overlap C. The false positive relations are removed and missing relations are added during the manual annotation process by the domain experts
Fig. 5Structure of the direct temporal relation identification system
Type distribution of direct temporal relations
| Temporal relation type | Training set | Test set | Overall |
|---|---|---|---|
| Before | 387 (17%) | 355 (20%) | 742 (18%) |
| After | 345 (15%) | 299 (16%) | 644 (16%) |
| Overlap | 1518 (68%) | 1173 (64%) | 2690 (66%) |
| total | 2249 (100%) | 1827 (100%) | 4076 (100%) |
Performances of automatic direct temporal relation identification systems
| System | P | R | F1 |
|---|---|---|---|
| SVM-based system | 63.93 | 63.62 |
|
| Original Vanderbilt system | 43.53 |
| 55.61 |
| Re-trained Vanderbilt system | 64.16 | 49.15 | 55.66 |
| Syntactic graph kernel based system |
| 54.27 | 58.92 |
| CRF-based system | 48.51 | 39.52 | 43.56 |
Best scores for precision, recall and F1-score are marked bold
Example outputs of svm-based system
| Sentence | Predicted | Gold standard |
|---|---|---|
| Subsequently [his creatinine]e rose for [three days]t and then stabilized at 10. | overlap | overlap |
| [Cardiac catheterization]e was performed without complication from the right neck and right groin on [the day]t of admission. | overlap | overlap |
| 3. On [the morning of 12–01]t, the patient had some transient episodes of hypotension with [SBP s]e in the 70 s. | overlap | N/A |
| Initially on [Vanc]e and Cipro on [Friday]t but then seen by ID who recommended no abx but a bone biopsy, blood cx. | N/A | overlap |
| Anti-coagulation was started with Warfarin 5 mg with a goal of 2–3 and a plan for [cardioversion]e in [6 weeks]t. | overlap | after |
| [POD# 15/6]t, she resumed [TF]e ‘s and TPN was tapered again. | overlap | after |
In the “sentences”, the target time expression and the target event mention are marked with square brackets and subscripts ‘t’ and ‘e’, respectively. “Predicted” is the type of direct temporal relation predicted by the SVM-based system. “Gold standard” is the gold standard type of direct temporal relation. When the temporal relation is non-direct, it is represented as “N/A”
Performance of svm-based system for each relation type
| Temporal relation type | P | R | F1 |
|---|---|---|---|
| After | 53.44 | 33.78 | 31.39 |
| Before | 56.87 | 42.21 | 48.46 |
| Overlap | 66.74 | 77.66 | 71.79 |