| Literature DB >> 32488126 |
Katharina Schultebraucks1,2,3, Meng Qian4, Duna Abu-Amara4, Kelsey Dean5, Eugene Laska6,7, Carole Siegel6,7, Aarti Gautam8, Guia Guffanti9,10, Rasha Hammamieh8, Burook Misganaw5, Synthia H Mellon11, Owen M Wolkowitz12, Esther M Blessing4, Amit Etkin13,14,15, Kerry J Ressler9,10, Francis J Doyle5, Marti Jett8, Charles R Marmar4.
Abstract
Active-duty Army personnel can be exposed to traumatic warzone events and are at increased risk for developing post-traumatic stress disorder (PTSD) compared with the general population. PTSD is associated with high individual and societal costs, but identification of predictive markers to determine deployment readiness and risk mitigation strategies is not well understood. This prospective longitudinal naturalistic cohort study-the Fort Campbell Cohort study-examined the value of using a large multidimensional dataset collected from soldiers prior to deployment to Afghanistan for predicting post-deployment PTSD status. The dataset consisted of polygenic, epigenetic, metabolomic, endocrine, inflammatory and routine clinical lab markers, computerized neurocognitive testing, and symptom self-reports. The analysis was computed on active-duty Army personnel (N = 473) of the 101st Airborne at Fort Campbell, Kentucky. Machine-learning models predicted provisional PTSD diagnosis 90-180 days post deployment (random forest: AUC = 0.78, 95% CI = 0.67-0.89, sensitivity = 0.78, specificity = 0.71; SVM: AUC = 0.88, 95% CI = 0.78-0.98, sensitivity = 0.89, specificity = 0.79) and longitudinal PTSD symptom trajectories identified with latent growth mixture modeling (random forest: AUC = 0.85, 95% CI = 0.75-0.96, sensitivity = 0.88, specificity = 0.69; SVM: AUC = 0.87, 95% CI = 0.79-0.96, sensitivity = 0.80, specificity = 0.85). Among the highest-ranked predictive features were pre-deployment sleep quality, anxiety, depression, sustained attention, and cognitive flexibility. Blood-based biomarkers including metabolites, epigenomic, immune, inflammatory, and liver function markers complemented the most important predictors. The clinical prediction of post-deployment symptom trajectories and provisional PTSD diagnosis based on pre-deployment data achieved high discriminatory power. The predictive models may be used to determine deployment readiness and to determine novel pre-deployment interventions to mitigate the risk for deployment-related PTSD.Entities:
Mesh:
Year: 2020 PMID: 32488126 PMCID: PMC8589682 DOI: 10.1038/s41380-020-0789-2
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Classification task and observed positive and negative events for each outcome. The term “positive events” refers to the outcome-of-interest, i.e., to those participants in the sample who are on an “increasing” PTSD symptom trajectory or who meet the cutoff for provisional PTSD diagnosis (PCL-5 score ≥31). “Negative events” are those participants who are on a “resilient” trajectory or who do not meet the cutoff. Depicted are the sample size for each outcome for the training set, the test set and the total sample.
| Classification task | Training set (75%) | Test set (25%) | Total ( |
|---|---|---|---|
| PTSD symptom trajectories | |||
| “Increasing” | |||
| “Resilient” | |||
| PCL-5 cutoff score | |||
| Provisional PTSD | |||
| No PTSD | |||
Sample characteristics at Phase 1 of those participants included into the analysis.
| Phase 1 | ||||
|---|---|---|---|---|
| “Increasing” trajectory ( | “Resilient” trajectory ( | Provisional PTSD ( | No PTSD ( | |
| Age | 27.16 (5.99) | 25.66 (5.92) | 26.67 (6.15) | 25.73 (5.92) |
| Gender (% females) | 11.6% ( | 5.3% ( | 13.9% ( | 5.3% ( |
| PCL-5 score | 14.12 (16.66) | 2.63 (5.37) | 13.83 (17.17) | 2.84 (5.83) |
| PHQ-8 score | 4.5 (5.13) | 1.31 (2.49) | 4.54 (5.33) | 1.36 (2.55) |
| GAD-7 score | 5.55 (5.42) | 1.65 (2.64) | 5.4 (5.57) | 1.72 (2.75) |
| AUDIT score | 2.43 (3.28) | 2.23 (2.50) | 2.58 (3.57) | 2.22 (2.48) |
| PSQI score | 8.46 (3.79) | 4.88 (2.9) | 8 (3.63) | 4.97 (3.01) |
| DRRI-2 score | 51.7 (22.5) | 35.94 (16.5) | 50.86 (23.73) | 36.52 (16.86) |
| TBI status: improbable | 63.4% ( | 84.6% ( | 61.8% ( | 84.3% ( |
| TBI status: possible | 12.1% ( | 5.7% ( | 14.7% ( | 5.6% ( |
| TBI status: mild | 14.6% ( | 8.1% ( | 11.8% ( | 8.4% ( |
| TBI status: moderate | 9.8% ( | 1% ( | 11.8% ( | 0.9% ( |
| TBI status: severe | 0% ( | 0.7% ( | 0% ( | 0.7% ( |
| CSI current | 15.44 (12.22) | 5.73 (8.92) | 16.15 (12.82) | 6.01 (9.02) |
| CSI lifetime | 27.94 (18.72) | 12.35 (15.2) | 25.69 (15.21) | 13.45 (16.7) |
| Number of previous deployments | 1.09 (1.54) | 0.76 (1.11) | 0.81 (1.31) | 0.79 (1.15) |
PCL-5 PTSD Checklist for DSM-5, PHQ-8 Patient Health Questionnaire, GAD-7 Generalized Anxiety Disorder; AUDIT Alcohol Use Identification Test, PSQI Pittsburgh Sleep Quality Index, DRRI-2 Deployment Risk and Resilience Inventory-2, TBI traumatic brain injury, CSI Concussion Symptoms Inventory (current (past month) and lifetime (month in which symptoms were their “worst”)).
Fig. 1Unconditional model for the latent trajectories of the longitudinal PTSD symptom development based on PCL-5 scores through 90–180 days (Phase 1, 2, and 3).
The term “unconditional” means that there are no covariates included in this LGMM model but only the PCL-5 scores (outcome-of-interest) [25]. A two-class solution with fixed slope and linear weights was identified as the best-fitting model with an entropy of 0.98 (see Supplementary Tables 3, 4). We chose linear rather than quadratic solutions for trajectories because a minimum of four time points is recommended to fit quadratic solutions. Those two trajectories can be qualitatively described as “increasing” trajectory (N = 43, 9.1%) and as “resilient” trajectory (N = 430, 90.9%).
Fig. 2Discriminatory power of RF and SVM using different data types as predictor variables.
Receiver operating characteristic curve (ROC) for the prediction of the provisional PTSD diagnosis post deployment (a) and of PTSD symptom trajectories (b) using genetic, metabolomic, methylation, inflammation, neuropsychological, and clinical data collected prior to deployment. Depicted is the optimal ROC thresholds for sensitivity and specificity as determined by min((1 − sensitivities)2 + (1 − specificities)2), which yields the threshold closest to the top-left corner of the ROC curve [73]. DeLong’s test for two correlated ROC curves [74] shows no significant difference between the RF and SVM models for predicting LGMM trajectories (Z = 0.403, p = 0.3435), but significant differences for provisional PTSD diagnosis (Z = 1.7587, p = 0.03932). The bar plot (c) displays the comparison of the predictive models with different benchmark models. All four models (SVM and RF models predicting provisional PTSD diagnosis and SVM and RF models predicting PTSD symptom trajectories) have significantly higher discriminatory power than a non-informative model that predicts all participants as “PTSD negative,” i.e., is low or subthreshold PTSD symptoms (see Supplementary Table 7). All four models are significantly better than a benchmark model using a subject-specific baseline score as predicted outcome [56], i.e., using the individual pre-deployment PTSD status as indicated by the PCL-5 (see Supplementary Table 14).
Fig. 3Display of the top 15 predictor variables for predicting LGMM trajectories (green bars) and for predicting provisional PTSD diagnosis (blue bars).
In permutation-based ranking [34], the importance of a feature is measured by calculating the increase in the model’s prediction error after reshuffling the distribution of the feature values. The y-axis presents the importance ranking, with the top features being the most important ones. The x-axis denotes the classification error scaled to range 0 to 100. It is not recommended to interpret the absolute importance value, but only the rank order between features [75]. All features shown in Fig. 3 contributed significantly (p < 0.01) to the respective predictive model [34]. Statistical significance is indicated by the bias-correcting PIMP algorithm, which tests the importance of each predictor under the distribution of “null importance” values derived for every variable from 100 permutations of the response variable [34].