| Literature DB >> 27747503 |
Zaigham Faraz Siddiqui1, Georg Krempl1, Myra Spiliopoulou2, Jose M Peña3, Nuria Paul4, Fernando Maestu5.
Abstract
Predicting the evolution of individuals is a rather new mining task with applications in medicine. Medical researchers are interested in the progression of a disease and/or how do patients evolve or recover when they are subjected to some treatment. In this study, we investigate the problem of patients' evolution on the basis of medical tests before and after treatment after brain trauma: we want to understand to what extend a patient can become similar to a healthy participant. We face two challenges. First, we have less information on healthy participants than on the patients. Second, the values of the medical tests for patients, even after treatment started, remain well-separated from those of healthy people; this is typical for neurodegenerative diseases, but also for further brain impairments. Our approach encompasses methods for modelling patient evolution and for predicting the health improvement of different patients' subpopulations, i.e. prediction of label if they recovered or not. We test our approach on a cohort of patients treated after brain trauma and a corresponding cohort of controls.Entities:
Keywords: Clustering; Evolution modelling; Label prediction; Traumatic brain injury
Year: 2015 PMID: 27747503 PMCID: PMC4883158 DOI: 10.1007/s40708-015-0010-6
Source DB: PubMed Journal: Brain Inform ISSN: 2198-4026
Acronyms and description of cognitive tests from the TBI dataset presented in [16]
| Name | Description |
|---|---|
| TMT-B | Train making test-part B: measures cognitive flexibility (frontal lobe function) |
| BTA | Brief test of attention (total score) |
| WCST-NC | Wisconsin card shorting test: percentage total score of conceptual level (number of categories correctly achieved); also measures cognitive flexibility |
| WCST-RP | Wisconsin card shorting test: # preservative responses (represent error) |
| FAS | Phonetic fluency test which uses as cues letters F, A, and S as the initial letters for the patients to start the production of words |
| ICP | Measure a subject’s ability to perform daily activities, and awareness of the disease |
| CIV | Verbal intelligent quotient (VIQ): measures ability to handle verbal material |
| CIM | Performance IQ (PIQ): measures ability to handle visio-spatial / non-verbal material |
| CV | Verbal comprehension index (VCI) |
| MT | Working memory (WM): measures the subject’s ability to maintain information in short-term memory and recall it |
| OP | Perceptual organization (PO) |
| VP | Processing speed index (PSI) |
| IAC | Attention/concentration index (ACI) |
| IMG | General memory index (GMI) |
| IRD | Delayed recall index (DRI) |
Fig. 1Plot of differences in the ICP values from and (x-axis) against ICP values (y-axis). Squares represent the values for , while rhombuses represent the values from . Patients can be separated into 4 classes based on the difference and the ICP values from , i.e. Class_1 = [low diff, low ICP1] (green region), Class_2 = [high diff, low ICP1] (red region), Class_3 = [low diff, high ICP1] (yellow region), and Class_4 = [high diff, high ICP1] (blue region). (Color figure online)
List of used symbols and notations
| Symbol | Description |
|---|---|
|
| Timepoint before the start of the treatment |
|
| Timepoint after the end of the treatment |
|
| Set of individuals. The cardinality of the set is |
|
| Instance of a patient at timepoint |
|
| Label of a patient at timepoint |
|
| Clustering model learned over the instances of individuals from |
|
| Clustering model learned over the instances of individuals from |
|
| A cluster of individuals from the model |
|
| A cluster transition graph learned over clustering |
Fig. 2Clustering, Evolution graph and Soft projection with EvolutionPred: in (a) the nodes are patient instantiations at (yellow) and (aubergine), instantiations of the same individuals are connected with dashed arrows; additionally, we also show the controls (green); (b) clustering is performed at each moment (i.e. and ), showing that not all individuals of a pre-treatment cluster evolve similarly; in (c) the evolution graph is built by linking pre- and post-treatment clusters that share some individuals; the weights of the graph edges are used to compute the soft projection of the instance denoted by the red star. (Color figure online)
Fig. 3Variance plots for patient projections, where is set to predict the (already known) instances at : the solid lines represent the mean values of the true patient instantiations at moment , and of the projected patient instantiations, while the surrounding regions (same colour as the solid line) represent the variance of the instantiations; the two projections overlap almost completely with the true distribution at , both with respect to the line of the mean and to the region of the variance. (Color figure online)
Hard and soft projection of patients from towards , with MASE and Hits per patient: low MASE is better, values larger than 1 are poor; high Hits are better, 1.0 is best; averages over all patients after excluding outlier patient #14
Fig. 4Average assessment values and variance regions for controls and for patients before (Pre) and after treatment start (Post) for 16 variables: despite some overlaps, lines and regions of patients are mostly distinct from those of the controls. (Color figure online)
Fig. 5Average assessment values and variance regions for controls and for patients before (Pre) and after treatment start (Post), and as result of Hard (yellow) and Soft (green) projection: the projected patient assessments are closer to the controls. (Color figure online)
Fig. 6Controls clustered with the patients before treatment (Pre: red), after treatment start (Post: yellow), with the Hard projection (green) and the Soft one (blue dashed): entropy drops as the number of clusters increases, but has higher (better) values for the projected instantiations, indicating that these are closer to the controls
Label prediction accuracies of each patient for EvoLabelPred with GroundTruth based on ICP attribute
| ID | EvoLabelPred | |
|---|---|---|
|
|
| |
| #1 | 0.00 | 0.00 |
| #2 | 0.93 | 0.91 |
| #3 | 0.41 | 0.23 |
| #4 | 1.00 | 0.70 |
| #5 | 0.00 | 0.01 |
| #6 | 0.03 | 0.01 |
| #7 | 0.00 | 0.00 |
| #8 | 0.91 | 0.87 |
| #9 | 0.05 | 0.01 |
| #10 | 0.95 | 0.87 |
| #11 | 1.00 | 1.00 |
| #12 | 0.89 | 1.00 |
| #13 | 0.09 | 0.09 |
| #14 | 0.50 | 0.16 |
| #15 | 0.95 | 0.89 |
Fig. 7The line plots for clusters that were discovered by applying K-Means over (left) and (right) . The outliers were excluded from the data. The depicted clusters are discovered over the complete TBI data rather than bootstraped samples (they are used only for individual runs). The bold lines represent the centroids of the clusters, while thin dotted lines depict the patients. The colours show which clusters from and are related to each other. (Color figure online)
Meta Information on the clustering model from Fig. 7
| Colour | Members | |
|---|---|---|
| @ | @ | |
| Black | #6, #1, #8 | #6, #3, #5 |
| Red | #7, #9, #10, #14, #12, #13 | #7, #9, #10, #14 |
| Green | #2, #4, #3, #5 | #2, #4, #1, #8, #12, #13 |
Label prediction accuracies of each patient for EvoLabelPred with GroundTruth based on ICP ICP variable; PCA was applied over the TBI dataset prior to the learning of the evolutionary model
| ID | PCA with | |
|---|---|---|
| With outliers | Without outliers | |
| #1 | 0.00 | 0.00 |
| #2 | 97.73 | 93.06 |
| #3 | 47.92 | 11.69 |
| #4 | 100.00 | 100.00 |
| #5 | 0.00 | 0.00 |
| #6 | 0.00 | 0.00 |
| #7 | 0.00 | 0.00 |
| #8 | 98.10 | 88.57 |
| #9 | 1.79 | 0.00 |
| #10 | 92.31 | 91.57 |
| #11 | 100.00 | – |
| #12 | 93.81 | 90.41 |
| #13 | 6.93 | 5.00 |
| #14 | 59.78 | 10.71 |
| #15 | 96.55 | – |