| Literature DB >> 28815135 |
Ramon Maldonado1, Travis R Goodwin1, Sanda M Harabagiu1.
Abstract
The annotation of a large corpus of Electroencephalography (EEG) reports is a crucial step in the development of an EEG-specific patient cohort retrieval system. The annotation of multiple types of EEG-specific medical concepts, along with their polarity and modality, is challenging, especially when automatically performed on Big Data. To address this challenge, we present a novel framework which combines the advantages of active and deep learning while producing annotations that capture a variety of attributes of medical concepts. Results obtained through our novel framework show great promise.Entities:
Year: 2017 PMID: 28815135 PMCID: PMC5543351
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
Figure 1:Architecture of the Multi-Task Active Deep Learning for annotating EEG Reports.
Performance of our model when automatically detecting attributes of EEG activities. Default attribute values are denoted by an asterisk where applicable.
| DISORGANIZATION | 0.979 | 0.887 | 0.788 | 0.834 | 80 |
| GPEDS | 0.999 | 0.000 | 0.000 | 0.000 | 1 |
| POLYSPIKE_AND_WAVE | 0.992 | 0.222 | 0.400 | 0.286 | 5 |
| AMPLITUDE_GRADIENT | 0.999 | 0.833 | 1.000 | 0.909 | 5 |
| SPIKE_AND_SLOW_WAVE | 0.995 | 0.941 | 0.970 | 0.955 | 66 |
| SPIKE | 0.992 | 0.850 | 0.708 | 0.773 | 24 |
| PLEDS | 0.993 | 0.750 | 0.500 | 0.600 | 12 |
| LAMBDA_WAVE | 1.000 | 1.000 | 1.000 | 1.000 | 18 |
| K_COMPLEX | 0.998 | 1.000 | 0.750 | 0.857 | 8 |
| POLYSPIKE | 0.991 | 0.750 | 0.529 | 0.621 | 17 |
| SLOW_WAVE | 0.994 | 0.941 | 0.923 | 0.932 | 52 |
| RHYTHM | 0.924 | 0.813 | 0.919 | 0.862 | 307 |
| BETS | 0.999 | 0.000 | 0.000 | 0.000 | 1 |
| SLEEP_SPINDLE | 0.998 | 1.000 | 0.913 | 0.955 | 23 |
| SHARP_AND_SLOW_WAVE | 0.996 | 0.600 | 0.500 | 0.545 | 6 |
| SUPPRESSION | 0.995 | 0.917 | 0.846 | 0.880 | 26 |
| PHOTIC_DRIVING | 0.998 | 1.000 | 0.947 | 0.973 | 38 |
| TRIPHASIC WAVE | 0.999 | 1.000 | 0.909 | 0.952 | 11 |
| SHARP_WAVE | 0.989 | 0.886 | 0.963 | 0.923 | 81 |
| WICKET | 1.000 | 1.000 | 1.000 | 1.000 | 10 |
| UNSPECIFIED | 0.952 | 0.517 | 0.508 | 0.513 | 59 |
| SPIKE_AND_SHARP_WAVE | 1.000 | 0.000 | 0.000 | 0.000 | 0 |
| EPILEPTIFORM_DISCHARGE | 0.981 | 0.891 | 0.882 | 0.887 | 102 |
| SLOWING | 0.990 | 0.966 | 0.953 | 0.959 | 149 |
| BREACH_RHYTHM | 0.996 | 1.000 | 0.583 | 0.737 | 12 |
| VERTEX_WAVE | 1.000 | 1.000 | 1.000 | 1.000 | 30 |
| *N/A | 0.888 | 0.898 | 0.938 | 0.918 | 791 |
| LEFT | 0.942 | 0.717 | 0.711 | 0.714 | 121 |
| RIGHT | 0.965 | 0.756 | 0.782 | 0.768 | 87 |
| BOTH | 0.901 | 0.730 | 0.584 | 0.649 | 185 |
| HIGH | 0.921 | 0.714 | 0.563 | 0.630 | 142 |
| LOW | 0.937 | 0.817 | 0.618 | 0.704 | 144 |
| *NORMAL | 0.869 | 0.886 | 0.950 | 0.917 | 898 |
| REPEATED | 0.805 | 0.752 | 0.760 | 0.756 | 470 |
| *NONE | 0.787 | 0.750 | 0.773 | 0.761 | 520 |
| CONTINUOUS | 0.899 | 0.717 | 0.639 | 0.676 | 194 |
| LOCALIZED | 0.882 | 0.759 | 0.684 | 0.720 | 263 |
| *N/A | 0.822 | 0.835 | 0.894 | 0.863 | 745 |
| GENERALIZED | 0.910 | 0.732 | 0.619 | 0.671 | 176 |
| GAMMA | 1.000 | 0.000 | 0.000 | 0.000 | 0 |
| *N/A | 0.945 | 0.940 | 0.983 | 0.961 | 811 |
| DELTA | 0.979 | 0.945 | 0.811 | 0.873 | 106 |
| MU | 1.000 | 0.000 | 0.000 | 0.000 | 0 |
| ALPHA | 0.981 | 0.897 | 0.870 | 0.883 | 100 |
| BETA | 0.992 | 0.957 | 0.918 | 0.937 | 73 |
| THETA | 0.975 | 0.910 | 0.755 | 0.826 | 94 |
| PARIETO OCCIPITAL | - | - | - | - | 0 |
| FRONTAL | 0.929 | 0.724 | 0.640 | 0.679 | 139 |
| OCCIPITAL | 0.959 | 0.916 | 0.841 | 0.877 | 208 |
| TEMPORAL | 0.944 | 0.702 | 0.590 | 0.641 | 100 |
| FRONTOTEMPORAL | 0.993 | 0.727 | 0.615 | 0.667 | 13 |
| FRONTOCENTRAL | 0.990 | 0.882 | 0.789 | 0.833 | 38 |
| CENTRAL | 0.980 | 0.619 | 0.448 | 0.520 | 29 |
| PARIETAL | 0.995 | 0.000 | 0.000 | 0.000 | 6 |
| CENTROPARIETAL | - | - | - | - | 0 |
| POSSIBLE | 0.968 | 0.615 | 0.195 | 0.296 | 41 |
| *FACTUAL | 0.963 | 0.967 | 0.996 | 0.981 | 1136 |
| PROPOSED | 0.999 | 0.000 | 0.000 | 0.000 | 1 |
Figure 2:Deep Learning architecture for the identification of (1) the EEG activity anchors and (2) the boundaries of expressions of (a) EEG events, (b) medical problems; (c) medical tests and (d) medical treatments (e.g. medications).
Figure 3:Deep Learning Architectures for Automatic Recognition of (1) attributes of EEG activities; (2) type for all the other medical concepts expressed in EEG reports; and (3) modality and polarity for all concepts.
Performance of our model when automatically detecting anchors and boundaries of medical concepts
| EEG Activity Anchors | Other Medical concept Boundaries | ||||
|---|---|---|---|---|---|
| .8949 | .9591 | .9161 | .9469 | ||
| .8125 | .8228 | .8797 | .8831 | ||
| .8517 | .8857 | .8975 | .9139 | ||
Performance of our model when automatically detecting attributes of EEG events and medical problems, treatments, and tests.
| TEST | 0.983 | 0.982 | 0.958 | 0.970 | 669 |
| PROBLEM | 0.953 | 0.901 | 0.960 | 0.929 | 747 |
| TREATMENT | 0.971 | 0.964 | 0.898 | 0.930 | 500 |
| EEG_EVENT | 0.974 | 0.926 | 0.928 | 0.927 | 419 |
| POSSIBLE | 0.977 | 0.634 | 0.406 | 0.495 | 64 |
| FACTUAL | 0.963 | 0.971 | 0.990 | 0.980 | 2199 |
| PROPOSED | 0.980 | 0.622 | 0.418 | 0.500 | 55 |
Figure 4:Learning curves for all annotations, shown over the first 100 EEG Reports annotated and evaluated with F1 measure.
Rhythm: continuous, rhythmic activity Transient
Single Wave:
V wave Wicket spikes Spike Sharp wave Slow wave Complex: A sequence of two or more waves having a characteristic form or recurring with a fairly consistent form, distinguished from background activity.
K-complex Sleep spindles Spike-and-sharp-wave complex Spike-and-slow-wave complex Sharp-and-slow-wave complex Triphasic wave Polyspike complex Polyspike-and-slow-wave complex Pattern: any characteristic EEG Activity
Suppression Amplitude Gradient Slowing Breach Rhythm Benign Epileptic Transients of Sleep (BETS) Photic driving (response) Periodic Laterilized Epilepitiform Discharges (PLEDs) Generalized periodic epileptiform discharges (GPEDs) Epileptiform discharge (unspecified) Disorganization Positive occipital sharp transients of sleep (POSTS) Unspecified: the default attribute value, used if no morphological information is given at all |
Alpha (8 - 13 Hz) Beta (13 - 32 Hz) Delta ( < 4 Hz) Theta (4 - 8 Hz) Gamma ( > 32 Hz) |
Yes No |
Low: e.g. subtle (spike), small (polyspike discharge) High: e.g. high (voltage burst); high amplitude (spike); excess (theta) Normal: the default value | Continuous: the activity repeats in a continuous, uninterrupted manner Repeated: the activity repeats intermittently None: the activity occurs once |
Localized (focal): limited to a small area of the brain Generalized (diffuse): occurring over a large area of the brain or both sides of the head | Right Left Both |
|
The lemma of the token and the previous/next tokens The PoS of the token and the previous/next tokens The phrase chunk of the token and the previous/next tokens The lemmas of the previous, current, and next tokens The Brown cluster of the token The UMLS Concept Unique Identifier (cui) of UMLS concepts containing the token The title of the section containing the token |
The medical concept mention itself The lemmas of each token in the medical concept mention The PoS of each token in the medical concept mention The lemmas of 3 tokens before/after the medical concept mention The number of “hops” in the syntactic dependency path from the head of the medical concept mention to The number of medical concepts between the medical concept mention and |