| Literature DB >> 23227220 |
Oscar Harari1, Jen-Chyong Wang, Kathleen Bucholz, Howard J Edenberg, Andrew Heath, Nicholas G Martin, Michele L Pergadia, Grant Montgomery, Andrew Schrage, Laura J Bierut, Pamela F Madden, Alison M Goate.
Abstract
Cigarette smoking is a common addiction that increases the risk for many diseases, including lung cancer and chronic obstructive pulmonary disease. Genome-wide association studies (GWAS) have successfully identified and validated several susceptibility loci for nicotine consumption and dependence. However, the trait variance explained by these genes is only a small fraction of the estimated genetic risk. Pathway analysis complements single marker methods by including biological knowledge into the evaluation of GWAS, under the assumption that causal variants lie in functionally related genes, enabling the evaluation of a broad range of signals. Our approach to the identification of pathways enriched for multiple genes associated with smoking quantity includes the analysis of two studies and the replication of common findings in a third dataset. This study identified pathways for the cholinergic receptors, which included SNPs known to be genome-wide significant; as well as novel pathways, such as genes involved in the sensory perception of smell, that do not contain any single SNP that achieves that stringent threshold.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23227220 PMCID: PMC3515482 DOI: 10.1371/journal.pone.0050913
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the studies and subjects analyzed.
| OZALC-NAG | SAGE | ARIC | ||
| # Subjects | 4038 | 2014 | 5198 | |
| % Men/%Women | 49%/51% | 44%/56% | 56%/44% | |
| Age, mean ± S.D. | 45.16±10.43 | 37.97±9.09 | 54.30±5.7 | |
| Cigarettes per day | ||||
|
| 1159 | 1001 | 1047 | |
|
| 876 | 518 | 2155 | |
|
| 958 | 223 | 941 | |
|
| 1045 | 272 | 1052 | |
CPD: Cigarettes per day.
Number of categories of genes identified by ALIGATOR for OZALC-NAG and SAGE studies for smoking quantity.
| Category of Gene enrichment threshold | |||||||||
| 0.005 | 0.01 | 0.05 | |||||||
| SNP Threshold | Study | # SNPs | # Genes | #cat. |
| #cat. |
| #cat. |
|
| 0.001 | OZALC-NAG | 270 | 264 | 24 | 1.40E−03 | 36 | 3.80E−03 | 102 | 1.32E−02 |
| SAGE | 206 | 225 | 15 | 7.60E−03 | 35 | 2.40E−03 | 122 | 1.60E−03 | |
| Common | 4 | <2.00E−4 | 5 | <2.00E−4 | 7 | 8.20E−03 | |||
| 0.005 | OZALC-NAG | 1158 | 1078 | 18 | 3.46E−02 | 27 | 1.04E−01 | 147 | 7.94E−02 |
| SAGE | 1094 | 1127 | 11 | 1.14E−01 | 35 | 5.40E−02 | 181 | 2.92E−02 | |
| Common | 2 | <2.00E−4 | 2 | <2.00E−4 | 14 | 1.20E−03 | |||
| 0.01 | OZALC-NAG | 2242 | 1962 | 19 | 6.18E−02 | 37 | 1.06E−01 | 175 | 1.41E−01 |
| SAGE | 2150 | 2021 | 21 | 4.42E−02 | 41 | 7.72E−02 | 197 | 8.16E−02 | |
| Common | 2 | <2.00E−4 | 4 | <2.00E−4 | 12 | 1.60E−03 | |||
| 0.05 | OZALC-NAG | 10374 | 6832 | 39 | 2.62E−02 | 72 | 4.32E−02 | 281 | 1.70E−01 |
| SAGE | 9906 | 6901 | 46 | 1.06E−02 | 85 | 1.88E−02 | 326 | 6.70E−02 | |
| Common | 9 | <2.00E−4 | 12 | <2.00E−4 | 55 | <2.00E−4 | |||
SNPs: number of SNPs that achieved a significant for each threshold for OZALC-NAG and SAGE studies;
Genes: number of genes mapped;
cat and the excess of categories of genes (GO terms or KEGG pathways) for the thresholds applied to the category-specific p-values in the OZALC-NAG and the SAGE studies. “Common” rows show the number of overlapping categories of genes significantly enriched for both studies and the corresponding statistical significance.
ALIGATOR identified common OZALC-NAG and SAGE GO terms and KEGG pathways for the analysis of SNPs p-value<0.05 and ARIC replication results.
| OZALC-NAG | SAGE | ARIC | Combined | ||||||||||
| acc | name | # Genes Cat. | # Genes |
| Expected | # Genes |
| Expected | # Genes |
| Expected |
| |
| Cholinergic nicotinic receptors | |||||||||||||
| GO:0042166 | acetylcholine binding | 22 | 11 | 5.80E−04 | 3.30 | 10 | 2.32E−03 | 3.3348 | 8 | 7.10E−03 | 2.69 | 2.57E−06 | |
| GO:0015464 | acetylcholine receptor activity | 18 | 10 | 9.00E−04 | 2.86 | 8 | 9.98E−03 | 2.8908 | 7 | 1.12E−02 | 2.34 | 1.17E−05 | |
| GO:0004889 | nicotinic acetylcholine-activated cation-selective channel activity | 17 | 9 | 4.40E−03 | 3.05 | 10 | 1.20E−03 | 3.0838 | 7 | 1.50E−02 | 2.49 | 2.40E−05 | |
| GO:0005892 | nicotinic acetylcholine-gatedreceptor-channel complex | 16 | 9 | 4.40E−03 | 3.05 | 10 | 1.20E−03 | 3.0838 | 7 | 1.50E−02 | 2.49 | 1.24E−04 | |
| Sensory perception | |||||||||||||
| GO:0004984 | olfactory receptor activity | 369 | 82 | 4.00E−05 | 48.48 | 78 | 1.00E−04 | 48.9594 | 61 | 1.08E−03 | 39.61 | 8.11E−09 | |
| GO:0007608 | sensory perception of smell | 386 | 91 | 9.40E−04 | 64.19 | 84 | 9.16E−03 | 64.8288 | 66 | 3.71E−02 | 52.42 | 5.44E−05 | |
| GO:0007606 | sensory perception of chemical stimulus | 430 | 99 | 8.00E−04 | 70.76 | 91 | 9.06E−03 | 71.4626 | 70 |
| 57.77 | 7.70E−05 | |
| Ribosome | |||||||||||||
| hsa03010 | Ribosome | 88 | 15 | 3.40E−04 | 5.36 | 19 | 2.00E−04 | 5.4116 | 14 | 1.60E−04 | 4.37 | 1.13E−08 | |
| GO:0005840 | ribosome | 192 | 34 | 5.00E−04 | 17.97 | 40 | 2.00E−04 | 18.1458 | 32 | 8.00E−04 | 14.67 | 5.69E−08 | |
| GO:0022625 | cytosolic large ribosomal subunit | 36 | 8 | 4.82E−03 | 2.53 | 9 | 1.52E−03 | 2.557 | 5 |
| 2.05 | 1.73E−04 | |
| Retinoid | |||||||||||||
| GO:0005501 | retinoid binding | 26 | 9 | 2.36E−03 | 5.54 | 9 | 2.60E−03 | 4.0218 | 6 |
| 4.84 | 5.53E−04 | |
| GO:0001972 | retinoic acid binding | 11 | 3 | 1.82E−03 | 0.68 | 4 | 6.40E−04 | 1.0174 | 1 |
| 3.00 | 2.80E−03 | |
acc: Category of gene (prefix GO indicates Gene ontology and hsa KEGG pathways) and corresponding name;
Genes Cat.: the total number of genes grouped by the category. For each study:
genes significant; corresponding category-specific ; and the Expected number of genes.
Combined p-values were calculated employing the weighted Z-score method.
Figure 1Go terms for cholinergic receptors and significant genes.
The p-value of each gene was assigned based on the most significant SNP in gene sequences and flanking regions (Left panel). SNPs in linkage disequilibrium (r2>0.2) and in a local proximity (1 Mb) were removed. Colored boxes in the right panel reflect the assignment of each gene to the different GO terms.
MAGENTA Significant categories of genes with nominal p-value<0.05 in the OZALC-NAG and the SAGE studies and ARIC corresponding results.
| OZALC-NAG | SAGE | ARIC | Combined | |||
| Threshold | Acc | Name |
|
|
|
|
| 0.005 | GO:0035095 | behavioral response to nicotine | 2.20E−03 | 2.20E−03 | 2.00E−03 | 8.29E−06 |
| GO:0060084 | synaptic transmission involved in micturition | 8.50E−03 | 7.20E−03 | 7.90E−03 | 1.91E−04 | |
| 0.01 | GO:0006942 | regulation of striated muscle contraction | 6.50E−03 | 6.30E−03 | 1.00E+00 | 2.94E−01 |
| 0.05 | GO:0004889 | nicotinic acetylcholine-activated cation-selective channel activity | 8.50E−05 | 4.06E−02 | 6.00E−04 | 7.83E−07 |
| GO:0005892 | nicotinic acetylcholine-gated receptor-channel complex | 8.20E−05 | 4.36E−02 | 1.10E−03 | 1.60E−06 | |
| GO:0015464 | acetylcholine receptor activity | 2.68E−04 | 2.50E−02 | 2.20E−03 | 5.90E−06 | |
| GO:0005230 | extracellular ligand-gated ion channel activity | 3.50E−03 | 1.65E−02 | 1.76E−02 | 3.43E−04 | |
| GO:0007271 | synaptic transmission, cholinergic | 3.30E−03 | 2.63E−02 | 2.60E−02 | 6.41E−04 | |
| GO:0042060 | wound healing | 3.00E−04 | 1.69E−02 | 3.22E−01 | 6.05E−03 | |
| GO:0006940 | regulation of smooth muscle contraction | 2.50E−02 | 2.61E−02 | 1.33E−01 | 1.68E−02 | |
| GO:0005216 | ion channel activity | 2.52E−02 | 1.53E−02 | 1.88E−01 | 2.23E−02 | |
| GO:0042552 | myelination | 1.51E−02 | 1.64E−02 | 4.35E−01 | 5.94E−02 | |
| GO:0007257 | activation of JUN kinase activity | 3.42E−02 | 3.61E−02 | 3.52E−01 | 7.47E−02 | |
| GO:0006548 | histidine catabolic process | 4.71E−02 | 4.27E−02 | 3.04E−01 | 7.52E−02 | |
| GO:0034185 | apolipoprotein binding | 4.20E−02 | 4.12E−02 | 1.00E+00 | 1.45E−01 | |
| GO:0007265 | Ras protein signal transduction | 4.17E−02 | 4.99E−02 | 8.33E−01 | 3.14E−01 |
Threshold of significance satisfied by both NAF and SAGE; acc: identifier for the category of genes and name. For each study, the nominal GSEA p-value is shown (see Table S4 for false discovery rate).
Combined p-values were calculated employing the weighted Z-score method.