| Literature DB >> 31501510 |
Kelsey R Dean1,2, Rasha Hammamieh3, Synthia H Mellon4, Duna Abu-Amara5, Janine D Flory6,7, Guia Guffanti8, Kai Wang9, Bernie J Daigle10, Aarti Gautam3, Inyoul Lee9, Ruoting Yang11, Lynn M Almli12, F Saverio Bersani13,14, Nabarun Chakraborty15, Duncan Donohue15, Kimberly Kerley12, Taek-Kyun Kim9, Eugene Laska5, Min Young Lee9, Daniel Lindqvist13,16, Adriana Lori12, Liangqun Lu10, Burook Misganaw2, Seid Muhie15, Jennifer Newman5, Nathan D Price9, Shizhen Qin9, Victor I Reus13, Carole Siegel5, Pramod R Somvanshi2, Gunjan S Thakur2, Yong Zhou9, Leroy Hood9, Kerry J Ressler8, Owen M Wolkowitz13, Rachel Yehuda6,7, Marti Jett3, Francis J Doyle17, Charles Marmar5.
Abstract
Post-traumatic stress disorder (PTSD) impacts many veterans and active duty soldiers, but diagnosis can be problematic due to biases in self-disclosure of symptoms, stigma within military populations, and limitations identifying those at risk. Prior studies suggest that PTSD may be a systemic illness, affecting not just the brain, but the entire body. Therefore, disease signals likely span multiple biological domains, including genes, proteins, cells, tissues, and organism-level physiological changes. Identification of these signals could aid in diagnostics, treatment decision-making, and risk evaluation. In the search for PTSD diagnostic biomarkers, we ascertained over one million molecular, cellular, physiological, and clinical features from three cohorts of male veterans. In a discovery cohort of 83 warzone-related PTSD cases and 82 warzone-exposed controls, we identified a set of 343 candidate biomarkers. These candidate biomarkers were selected from an integrated approach using (1) data-driven methods, including Support Vector Machine with Recursive Feature Elimination and other standard or published methodologies, and (2) hypothesis-driven approaches, using previous genetic studies for polygenic risk, or other PTSD-related literature. After reassessment of ~30% of these participants, we refined this set of markers from 343 to 28, based on their performance and ability to track changes in phenotype over time. The final diagnostic panel of 28 features was validated in an independent cohort (26 cases, 26 controls) with good performance (AUC = 0.80, 81% accuracy, 85% sensitivity, and 77% specificity). The identification and validation of this diverse diagnostic panel represents a powerful and novel approach to improve accuracy and reduce bias in diagnosing combat-related PTSD.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31501510 PMCID: PMC7714692 DOI: 10.1038/s41380-019-0496-z
Source DB: PubMed Journal: Mol Psychiatry ISSN: 1359-4184 Impact factor: 15.992
Fig. 1Overview of PTSD biomarker identification approach—details of cohort recruitment, and biomarker identification, down-selection, and validation
Fig. 2Overview of molecular datasets and cohort symptom severity. a Flow diagram for participant recruitment and enrollment. Participant eligibility was determined through a phone pre-screen and a baseline diagnostic clinical interview. Eligible participants completed fasting blood draws for multi-omic molecular assays. Participants in the initial discovery cohort were invited to return for follow-up in the recall cohort. Some participants returned with symptom changes, including “subthreshold” PTSD symptoms (below original study inclusion criteria). b Trajectory of PTSD symptoms in recalled participants. CAPS total for current symptoms at baseline (T0) and follow-up (T1) for each participant are connected. Participants who remained in the PTSD + group at both time points are shown in red. Participants who remained in the PTSD- group are shown blue. Participants with PTSD status changes are shown in gray, including participants who became “subthreshold” PTSD cases. c Distribution of molecular data types at three stages of biomarker identification: full exploratory dataset (All Data), reduced set of 343 potential biomarkers (candidate set) and the final panel of 28 biomarker (final set). Methylation and GWAS data represents 99% of initial data screen due to high-throughput arrays. Other molecular data types are well represented in the second and final stages of biomarker identification and selection
Summary of cohort demographics and clinical symptoms
| Cohort 1 (discovery cohort) | Cohort 2 (recalled cohort) | Cohort 3 (validation cohort) | |||||
|---|---|---|---|---|---|---|---|
| PTSD + | PTSD− | PTSD + | Subthreshold PTSD | PTSD− | PTSD + | PTSD− | |
| Age, years [mean (sd)] | 32.8 (7.4) | 32.6 (8.0) | 33.7 (8.2) | 35.6 (8.0) | 36.6 (8.9) | 36.8 (10.2) | 33.0 (8.2) |
| Race/ethnicity [ | |||||||
| Hispanic | 34 (44%) | 24 (32%) | 9 (60%) | 3 (27%) | 12 (41%) | 11 (42%) | 2 (8%) |
| Non-Hispanic Asian | 1 (1%) | 5 (7%) | 0 (0%) | 0 (0%) | 1 (3%) | 3 (12%) | 4 (15%) |
| Non-Hispanic black | 21 (27%) | 16 (22%) | 4 (27%) | 3 (27%) | 5 (17%) | 5 (19%) | 4 (15%) |
| Non-Hispanic white | 18 (23%) | 24 (32%) | 2 (13%) | 4 (36%) | 9 (31%) | 7 (27%) | 16 (62%) |
| Non-Hispanic other | 3 (4%) | 5 (7%) | 0 (0%) | 1 (9%) | 2 (7%) | 0 (0%) | 0 (0%) |
| Education [ | |||||||
| Less than 12th grade | 2 (3%) | 0 (0%) | 1 (7%) | 0 (0%) | 0 (0%) | 1 (4%) | 0 (0%) |
| HS diploma or GED | 27 (35%) | 13 (18%) | 2 (13%) | 1 (9%) | 4 (14%) | 10 (38%) | 6 (23%) |
| 2 years college, AA degree | 23 (30%) | 21 (28%) | 6 (40%) | 4 (36%) | 3 (10%) | 7 (27%) | 6 (23%) |
| 4 years college, BA degree | 22 (29%) | 28 (38%) | 5 (33%) | 4 (36%) | 19 (66%) | 5 (19%) | 7 (27%) |
| Master’s degree | 3 (4%) | 11 (15%) | 1 (7%) | 2 (18%) | 3 (10%) | 3 (12%) | 7 (27%) |
| Doctoral degree | 0 (0%) | 1 (1%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) |
| Body mass index [mean (sd)] | 30.1 (5.1) | 28.1 (4.3) | 32.2 (5.3) | 31.1 (8.0) | 27.2 (6.7) | 29.4 (6.7) | 26.9 (2.5) |
| Cholesterol [mean (sd)] | |||||||
| HDL cholesterol | 47.6 (11.9) | 50.1 (13.3) | 43.0 (8.0) | 45.9 (11.1) | 53.3 (13.9) | 47.3 (12.8) | 53.3 (11.9) |
| LDL cholesterol | 108.2 (32.3) | 100.0 (25.4) | 115.1 (26.4) | 110.3 (37.7) | 105.3 (27.6) | 116.2 (30.0) | 98.1 (31.2) |
| HbA1c [mean (sd)] | 5.4 (0.9) | 5.4 (0.4) | 5.6 (0.8) | 5.6 (0.7) | 5.3 (0.5) | 5.4 (0.3) | 5.3 (0.4) |
| PTSD Severity, Total CAPS [mean (sd)] | 68.0 (16.1) | 3.6 (4.9) | 69.5 (18.5) | 37.6 (9.0) | 5.2 (7.3) | 67.2 (19.4) | 2.7 (4.5) |
| Early trauma exposure, ETISR total [mean (sd)] | 7.7 (5.8) | 5.1 (4.0) | 5.2 (4.6) | 6.9 (4.8) | 4.5 (3.9) | 7.2 (4.8) | 4.9 (3.3) |
| Major depressive disorder [ | 42 (55%) | 0 (0%) | 9 (64%) | 2 (18%) | 0 (0%) | 9 (35%) | 1 (4%) |
| Peritraumatic dissociate experience, Rater version [mean (sd)] | 1.8 (0.4) | 1.2 (0.2) | 1.8 (0.5) | 1.8 (0.5) | 1.3 (0.3) | 1.8 (0.4) | 1.2 (0.2) |
| Peritraumatic distress inventory, Rater version [mean (sd)] | 2.1 (0.8) | 1.1 (0.6) | 2.2 (0.7) | 2.1 (0.9) | 1.0 (0.6) | 2.0 (0.9) | 1.0 (0.7) |
| Sleep quality, PSQI [mean (sd)] | 13.1 (3.3) | 5.9 (3.6) | 11.8 (3.5) | 10.7 (4.2) | 5.6 (3.0) | 11.9 (3.6) | 6.2 (3.3) |
| Number of tours [mean (sd)] | 1.8 (0.9) | 1.7 (0.8) | 1.5 (0.6) | 1.6 (0.7) | 1.9 (1.1) | 1.3 (0.6) | 2.1 (1.5) |
Fig. 3Validation of biomarker panels. a ROC curve for identified biomarker panel (28 markers), illustrating good performance in an independent validation dataset (26 cases, 26 controls). Shaded region indicates 95% confidence interval, determined by 2000 bootstrapping iterations. Operating point closest to (0,1) on ROC curve used for calculating sensitivity, specificity, and accuracy. b Predicted probability of PTSD based on trained random forest model using a biomarker panel of 28 features. In PTSD participants, predicted PTSD probability is correlated with PTSD symptom severity, measured by CAPS (r = 0.59, p < 0.01). c Random forest variable importance of the final 28 biomarkers. Variable importance was determined using biomarker model training data (cohorts 1 and 2). The top 10 biomarkers, based on random forest variable importance, contain multiple data types, including methylation markers (cg01208318, cg20578780, and cg15687973), physiological features (heart rate), miRNAs (miR-133a-1-3p, miR-192-5p, and miR-9-1-5p), clinical lab measurements (insulin and mean platelet volume), and metabolites (gammaglutamyltyrosine). d Correlation between PTSD biomarkers. Pearson correlation coefficients were computed in the combined set of all three cohorts. The final set of identified biomarkers show small clusters of moderately correlated features, primarily grouped by molecular data type (proteins, miRNAs, and methylation markers). e Biomarker panel performance evaluation during panel refinement, across molecular data types, and in nonlinear features. The validation AUC improves after biomarker down-selection and model refinement. The final biomarker panel validates with greater AUC over the initial biomarker candidate pool (343 markers, AUC = 0.74), and stage one refined panel (77 markers, AUC = 0.75). The final multi-omic panel also outperforms each individual molecular data type. Performance metrics for nonlinear feature combinations, Global Arginine Bioavailability Ratio (GABR) and lactate/citrate. Both nonlinear combinations outperform their individual components in AUC (0.60 vs. 0.51 and 0.55 vs. 0.52 in GABR and lactate/citrate, respectively). Error bars indicate 95% confidence interval, determined by 2000 bootstrapping iterations. f Validation performance by ethnicity, and in the presence of major depressive disorder (MDD). Validation performance in Hispanic participants was higher than other ethnicities (non-Hispanic White, non-Hispanic Black, non-Hispanic Asian). PTSD cases with comorbid MDD (n = 9) are easily distinguishable from all combat-exposed controls (n = 26), with AUC = 0.92, while PTSD cases without comorbid MDD (n = 17) are only moderately distinguishable from controls (n = 26), with AUC = 0.73
Overview of biomarker signals in each of the three cohorts
| Cohort 1 (Discovery Cohort) | Cohort 2 (Recalled Cohort) | Cohort 3 (Validation Cohort) | Findings from the literature | ||
|---|---|---|---|---|---|
| 70672835 (SHANK2) | Methylation | ↓ | ↑ | ↓ | Genetic variants of SHANK2 associated with schizophrenia [ |
| 75938326 (C2orf3) | Methylation | ||||
| 75938338 (C2orf3) | Methylation | ||||
| AFM-LPN | Protein | ↑ | ↓ | ↓ | |
| cg01208318 | Methylation | ||||
| cg03405026 (MLH1) | Methylation | ↑ | ↓ | ↓ | |
| cg03433241 | Methylation | ↑ | ↑ | ↓ | |
| cg04112106 (CES2) | Methylation | ↑ | ↑ | ↓ | |
| cg15687973 (PDE9A) | Methylation | ↓ | ↓ | ↑ | PDE9A expression has been associated with monoamine neurotransmitter regulation and depression [ |
| cg17137457 (CPT1B) | Methylation | CPT1B expression has been associated with rodent stress and human PTSD [ | |||
| cg20578780 | Methylation | ||||
| cg26454601 (MDC1) | Methylation | ↑ | ↑ | ↓ | |
| CPN1-IVQ | Protein | ||||
| CPN2-LLN | Protein | ↓ | ↑ | ↑ | |
| CTSS-GID | Protein | ↑ | ↑ | ↓ | |
| F10-NCE | Protein | ↑ | ↓ | ↑ | Decreased coagulation in PTSD [ |
| Global Arginine Bioavailability Ratio (GABR) | Metabolite | ↓ | ↑ | ↑ | GABR is decreased in patients with MDD [ |
| Gammaglutamyltyrosine | Metabolite | Gammaglutamyltyrosine negatively correlated with leukocyte telomere length [ | |||
| heart rate | Physiological | Elevated heart rate following trauma associated with development of PTSD [ | |||
| hsa-miR-133a-3p | miRNA | ||||
| hsa-miR-192-5p | miRNA | ↓ | ↓ | ↑ | Abundant in liver; associated with obesity and diabetes [ |
| hsa-miR-424-3p | miRNA | miR-424-3p has been associated with inflammation [ | |||
| hsa-miR-9-5p | miRNA | Enriched in brain tissue and regulates neurogenesis [ | |||
| Insulin | Clinical Labs | Increased insulin resistance in veterans with PTSD [ | |||
| ITIH2-VQF | Protein | ||||
| Lactate/citrate | Metabolite | Lactate has been considered panic-inducing in both Panic Disorder and PTSD [ | |||
| Mean platelet volume | Clinical Labs | Increased mean platelet volume associated with panic disorder [ | |||
| PTGDS-AQG | Protein | Prostaglandin dysregulation has been associated with both rodent stress models of PTSD and in pathways related to human PTSD [ |
Arrows indicate upregulated and downregulated signals, respectively. Underlined arrows indicate consistent signals across all three cohorts, while non-underlined arrows indicate contradictory signal directions