| Literature DB >> 30041705 |
Abanish Singh1,2,3, Michael A Babyak4,5, Beverly H Brummett4,5, William E Kraus6,7, Ilene C Siegler4,5, Elizabeth R Hauser6,8, Redford B Williams4,5.
Abstract
OBJECTIVES: Among many challenges in cardiovascular disease (CVD) risk prediction are interactions of genes with stress, race, and/or sex and developing robust estimates of these interactions. Improved power with larger sample size contributed by the accumulation of epidemiological data could be helpful, but integration of these datasets is difficult due the absence of standardized phenotypic measures. In this paper, we describe the details of our undertaking to harmonize a dozen datasets and provide a detailed account of a number of decisions made in the process.Entities:
Keywords: CVD-risk; Correlation; Data harmonization; Depressive symptoms; GxE interaction; Mega-analysis; Synthetic psychosocial stress
Mesh:
Year: 2018 PMID: 30041705 PMCID: PMC6057001 DOI: 10.1186/s13104-018-3595-z
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Fig. 1a Distributions of chronic stress z-scores in dbGaP public-access datasets, i.e., MESA, Framingham Offspring, ARIC, CARDIA, WHI and JHS, and b Duke datasets, i.e., CHASE, DFHS, Duke Caregiver, STRRIDE-AT/RT, and STRRIDE-PD. With the exception of MESA and JHS, the stress measure is a synthetic variable for all datasets
Spearman’s correlation of synthetic and self-rated stress measures with CES-D depression measure
| Dataset | Stress measure | Depression measure |
| P-value |
|---|---|---|---|---|
| MESA | Self-rated | Center for Epidemiological Studies-Depression Scale (CES-D) | 0.35 | < 0.0001 |
| Framingham Offspring Cohort | Synthetic | CES-D | 0.23 | < 0.0001 |
| CARDIA | Synthetic | CES-D | 0.07 | < 0.001 |
| ARIC | Synthetic | Maastricht Vital Exhaustion Score | 0.28 | < 0.0001 |
| WHI | Synthetic | Shortened CES-D | 0.2 | < 0.0001 |
| JHS | Self-rated | CES-D | 0.32 | < 0.0001 |
| CHASE | Synthetic | Beck Depression Inventory (BDI) | 0.31 | < 0.001 |
| DFHS | Synthetic | CES-D | 0.25 | < 0.0001 |
| Caregiver | Synthetic | CES-D | 0.42 | < 0.0001 |
| STRRIDE-AT/RT | Synthetic | Self-rated | 0.4 | < 0.0001 |
| STRRIDE-PD | Synthetic | Self-rated | 0.54 | < 0.0001 |
| Combined estimatea | 0.27 | < 0.0001 |
a Combined estimate derived from mega-analytic partial correlation between standardized (z-scores) stress and depression measures whilst controlling for the effect of study variable in combined dataset
Fig. 2a Distributions of harmonized phenotypes in dbGaP public-access datasets. Each notched box plot shows the distribution (i.e., five point summary statistics, outliers, and notches based on the median ± 1.58 * IQR/sqrt(n)) of one variable in the six dbGaP studies, i.e., ARIC, CARDIA, FRAMINGHAM, JACKSON HEART, MESA, and WHI; and b six Duke studies, i.e., CAREGIVER, CHASE, DFHS, STRRIDE-1, STRRIDE-AT/RT, and STRRIDE-PD. The scales for fasting glucose, insulin, HbA1C, and triglyceride were log transformed, and the standardized depression measure was square root transformed