| Literature DB >> 24312072 |
Paul E Rapp1, Christopher J Cellucci, David O Keyser, Adele M K Gilpin, David M Darmon.
Abstract
The identification and longitudinal assessment of traumatic brain injury presents several challenges. Because these injuries can have subtle effects, efforts to find quantitative physiological measures that can be used to characterize traumatic brain injury are receiving increased attention. The results of this research must be considered with care. Six reasons for cautious assessment are outlined in this paper. None of the issues raised here are new. They are standard elements in the technical literature that describes the mathematical analysis of clinical data. The purpose of this paper is to draw attention to these issues because they need to be considered when clinicians evaluate the usefulness of this research. In some instances these points are demonstrated by simulation studies of diagnostic processes. We take as an additional objective the explicit presentation of the mathematical methods used to reach these conclusions. This material is in the appendices. The following points are made: (1) A statistically significant separation of a clinical population from a control population does not ensure a successful diagnostic procedure. (2) Adding more variables to a diagnostic discrimination can, in some instances, actually reduce classification accuracy. (3) A high sensitivity and specificity in a TBI versus control population classification does not ensure diagnostic successes when the method is applied in a more general neuropsychiatric population. (4) Evaluation of treatment effectiveness must recognize that high variability is a pronounced characteristic of an injured central nervous system and that results can be confounded by either disease progression or spontaneous recovery. A large pre-treatment versus post-treatment effect size does not, of itself, establish a successful treatment. (5) A procedure for discriminating between treatment responders and non-responders requires, minimally, a two phase investigation. This procedure must include a mechanism to discriminate between treatment responders, placebo responders, and spontaneous recovery. (6) A search for prodromes of neuropsychiatric disorders following traumatic brain injury can be implemented with these procedures.Entities:
Keywords: Mahalanobis distance; neuropsychiatric diagnosis; research design; statistical errors; statistical variability; treatment effects
Year: 2013 PMID: 24312072 PMCID: PMC3832983 DOI: 10.3389/fneur.2013.00177
Source DB: PubMed Journal: Front Neurol ISSN: 1664-2295 Impact factor: 4.003
Figure 1. Two normal distributions: μA = 3.2117, σA = 14.8328 (in blue), μB = − 3.1433, (in red) σB = 14.8255, NA = NB = 500. Given assumptions that the distributions are normal and that an optimal Bayesian classifier is used to classify individual elements, PSAME = 2.1096 × 10−11 and PERROR − FORMULA = 0.4078.
EEG classification error rates.
| Condition | Error rate of random assignment (%) | Error rate minimum Mahalanobis distance within-sample classification (%) | Error rate maximum Bayesian likelihood within-sample classification (%) | Error rate minimum Mahalanobis distance | Error rate maximum Bayesian likelihood |
|---|---|---|---|---|---|
| Eyes open | 50 | 7.7 | 0 | 85 | 69 |
| Eyes closed | 50 | 0 | 0 | 46 | 46 |
The formula determined error rate is 15.7%, a serious underestimate of the true error rate. When the element to be classified is used in the construction of the classifier, this is the within sample error rate, the calculated error rate is again significantly smaller than the error rate determined by a k-fold classification. k-fold classification tests provide a test of classifier performance in actual practice [Modified from Watanabe et al. (9)].
Figure 2Sensitivity of discrimination and backward elimination: The between-group Mahalanobis distance . At each step the least significant variable was removed. From Watanabe et al. (9).
Figure 3Error rate in a . The elimination sequence in the upper trace (denoted by circles) was determined by a backward elimination. The elimination sequence of the lower trace (triangles) was determined in a sequential correlation deletion. From Watanabe et al. (9).
Questions addressed in analysis of treatment effectiveness.
| 1. Is there an adequate pre-treatment separation between the clinical population and the healthy controls? |
| 2. Is the waitlist control group appropriately constructed? |
| 3. Is the waitlist control group stable during the duration of the trial? |
| 4. If there is a change in the waitlist control group, is it the result of continuing deterioration? |
| 5. If there is a change in the waitlist control group, is it the result of spontaneous recovery? |
| 6. Does the treatment group change during the trial? |
| 7. If there is a change in the treatment group, is it due to continuing deterioration? |
| 8. If there is a change in the treatment group is it due to spontaneous recovery? |
| 9. Is there a positive response to treatment? |
Figure 4. PSAME (GA, GB) was calculated as a function of effect size (equivalently the one-dimensional Mahalanobis distance) for different group sizes. In all calculations, the number of members in each group was the same, NA = NB. The populations are NA = NB = 10 (top curve), 20, 50, 100, 200, 500 (bottom curve).
Classification error rate in a univariate simulation..
| < | < | < | ||
|---|---|---|---|---|
| 10 | 0.35659 | 0.37104 | 0.44250 | 0.712 |
| 20 | 0.25619 | 0.39337 | 0.42218 | 0.642 |
| 30 | 0.17765 | 0.39831 | 0.41356 | 0.607 |
| 40 | 0.13235 | 0.40107 | 0.41074 | 0.587 |
| 50 | 0.10093 | 0.40320 | 0.41194 | 0.569 |
| 100 | 0.02239 | 0.40685 | 0.40989 | 0.548 |
| 150 | 0.00501 | 0.40762 | 0.40957 | 0.533 |
| 200 | 0.00121 | 0.40793 | 0.40941 | 0.534 |
| 250 | 0.00023 | 0.40799 | 0.40927 | 0.537 |
| 300 | 0.00005 | 0.40815 | 0.40884 | 0.517 |
| 350 | 0.6 × 10−5 | 0.40801 | 0.40854 | 0.517 |
| 400 | 0.2 × 10−5 | 0.40835 | 0.40876 | 0.515 |
| 450 | 0.4 × 106 | 0.40831 | 0.40853 | 0.513 |
| 500 | 0.1 × 10−6 | 0.40829 | 0.40860 | 0.507 |
NA = NB is the number of members in each group.