| Literature DB >> 23210427 |
Philip Beineke1, Karen Fitch, Heng Tao, Michael R Elashoff, Steven Rosenberg, William E Kraus, James A Wingrove.
Abstract
BACKGROUND: Smoking is the leading cause of preventable death worldwide and has been shown to increase the risk of multiple diseases including coronary artery disease (CAD). We sought to identify genes whose levels of expression in whole blood correlate with self-reported smoking status.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23210427 PMCID: PMC3538056 DOI: 10.1186/1755-8794-5-58
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Clinical demographics of microarray subjects
| Variable* | (N = 100) | (N = 64) | (N = 4) | (N = 41) | |
| Max QCA‡ | 42.54±37 | 37.71±36 | 47.3±44 | 57.13±37 | |
| Sex (%Male) | 67 (0.67) | 54 (0.84) | 3 (0.75) | 34 (0.83) | 0.063 |
| Age (yrs) | 59±13 | 63±11 | 53±14 | 54±11 | |
| Caucasian (%) | 89 (0.89) | 61 (0.95) | 3 (0.75) | 35 (0.85) | 0.374 |
| BMI | 30±6 | 30±5 | 33±8 | 30±8 | 0.452 |
| Systolic BP | 136±18 | 133±18 | 130±20 | 135±15 | 0.541 |
| Diastolic BP | 83±12 | 79±12 | 78±10 | 80±11 | 0.242 |
| Hypertension (%) | 62 (0.62) | 37 (0.578125) | 2 (0.5) | 20 (0.49) | 0.180 |
| Dyslipidemia (%) | 63 (0.63) | 36 (0.5625) | 2 (0.5) | 23 (0.56) | 0.308 |
| Neutrophil Count | 3.9±1.2 | 3.8±1.3 | 4.8±1 | 4.8±1.7 | < |
| Lymphocyte Count | 1.9±0.5 | 1.9±0.7 | 2±0.2 | 2.1±0.7 | 0.063 |
*Mean values are given, ± SD or % in parenthesis.
† Variables with p values in bold are significantly different between the four categories (< 0.05).
‡Maximum coronary artery stenosis in all major coronary vessels of a subject, as determined by quantitative coronary angiography (QCA).
Figure 1Gene ontology analysis of 4214 array genes associated with smoking. The 4214 smoking-associated genes were analyzed using BINGO to identify significant biological processes. Significant processes (p < 0.001 after FDR correction) are colored with the gradient of p values reflected in the colors as indicated, and the biological process annotated. (A) Cellular component ontological terms (B) Biological Process ontological terms.
Figure 2Hierarchical clustering of 209 subjects and 227 array genes associated with smoking (p < 0.001). The dendogram on top shows correlations between subjects; black bars at bottom denote current smokers; red bars denote recently quit smokers. Dendogram on the left shows correlations between genes; positions of representative cell-specific genes are shown on the right.
Figure 3Expression levels of four most significant genes as assessed by qRT-PCR across 1074 PREDICT subjects grouped by self-reported smoking status. Expression levels are shown in Cp units on the Y axis, self-reported smoking status is shown on the X axis. (A) LRRN3; (B) CLDND1; (C) SASH1; (D) P2RY6.
Performance of GES and cotinine models
| GES – Development Set | 0.93 | 0.79 | 0.95 |
| GES – Validation Set | 0.82 (0.65-0.94) | 0.63 | 0.94 |
| Cotinine – Validation Set | 0.89 (0.81-0.97) | 0.81 | 0.97 |
*95% confidence interval is shown in parentheses. As the AUC for the development set was derived via cross-validation, a confidence interval could not be assigned. However, the standard deviation of the cross-validation runs equaled 0.03; the standard error equaled 0.001.
Figure 4Comparison of gene expression score to cotinine levels in validation set. The y-axis shows the log10 value of cotinine levels in the 180 subject validation set; the horizontal dashed line (−−-) denotes the 10ng/ml threshold used in the AUC analysis. The x-axis shows the GES in the 180 subject validation set; the vertical dashed line denotes the 50% probability threshold used in the AUC analysis. Black circles = non-smokers; red circles = former smokers (> 2 months quit); green circles = recently quit smokers (< 2 months quit); blue circles = current smokers. All smoking categories are self-reported.