| Literature DB >> 28361705 |
Amin Zollanvari1,2, Gil Alterovitz3,4.
Abstract
BACKGROUND: Alcoholism has a strong genetic component. Twin studies have demonstrated the heritability of a large proportion of phenotypic variance of alcoholism ranging from 50-80%. The search for genetic variants associated with this complex behavior has epitomized sequence-based studies for nearly a decade. The limited success of genome-wide association studies (GWAS), possibly precipitated by the polygenic nature of complex traits and behaviors, however, has demonstrated the need for novel, multivariate models capable of quantitatively capturing interactions between a host of genetic variants and their association with non-genetic factors. In this regard, capturing the network of SNP by SNP or SNP by environment interactions has recently gained much interest.Entities:
Keywords: Alcoholism; Environment; GWAS; Interaction; Network; SNP
Mesh:
Year: 2017 PMID: 28361705 PMCID: PMC5374593 DOI: 10.1186/s12918-017-0403-7
Source DB: PubMed Journal: BMC Syst Biol ISSN: 1752-0509
Fig. 1Manhattan plot of raw p-values from genome-wide association analysis (CMH test). Markers above the horizontal black line (p<0.0005) have been used in the iterative network construction. For the actual p-values and ranking of these 652 SNPs, see Additional file 2: Table S1
Fig. 2The SNP×SNP×E network of Alcoholism. The network contains 397 SNPs and 15 demographic variables. The nodes represent variables and an edge between two nodes represents their dependency quantified by conditional probabilities. For the node labels and the complete list of interactions see Additional file 3: Table S2. To enhance the quality of representation, we have removed the “alcoholism” node and the edges from this node to all other nodes
Fig. 3A subgraph of the SNP×SNP×E network in Fig. 2. All demographic factors are included, as well as the SNPs of several genes that have multiple SNPs in the network. Each blue box is labeled with the gene on which all of the SNPs within the box are found. The grey box contains all of the demographic factors
Demographic variables used in the SNP×SNP×E network
| Variable | Level/Range |
|---|---|
| Income/yr $ | 0–10 K |
| Location of childhood home | large city/suburbs/small city/village/rural area |
| Level of education | less than H.S. grad/H.S. grad/some college/college grad or more |
| Gender | M/F |
| Age at interview | 18–77 |
| Race | AA/EA |
| Sexually abused as a child | yes/no |
| Otherwise physically abused as a child | yes/no |
| Neglected as a child | yes/no |
| Experienced sexual trauma | yes/no |
| Otherwise experienced physical trauma | yes/no |
| Experienced non-physical trauma | yes/no |
| Weight | 85–435 lb |
| Frequency with which attends religious services | 0–500 |
| Height | 49–80 in |
(a) The 14 most significant genes (p < 0.01) in the SNP×SNP×E network, including the intergenic set. 221 total genes were considered; (b) The four significant demographic factors (p < 0.05) in SNP×SNP×E network. 15 total demographic factors were considered; (c) The four significant interactions (p < 0.05) in the demographic-genetic model. 427 total interactions were considered
| Table 2a | |
| Intronic/Intergenic SNPs |
|
| Intergenic | 0.001 |
| BLNK | 0.002 |
| BMPER | 0.002 |
| SERINC2 | 0.003 |
| LGALS2 | 0.004 |
| CPE | 0.006 |
| PDLIM5 | 0.006 |
| PKNOX2 | 0.008 |
| VEPH1 | 0.008 |
| NPAS3 | 0.009 |
| AMPD3 | 0.01 |
| CADM3 | 0.01 |
| DAB1 | 0.01 |
| GLT25D2 | 0.01 |
| Table 2b | |
| Demographic Factor |
|
| Race | 0.001 |
| Sex | 0.001 |
| Education Level | 0.002 |
| Income | 0.002 |
| Table 2c | |
| Factor-Factor Interaction |
|
| Race-Income | 0.011 |
| Sex-rs5933820 | 0.016 |
| Race-rs8225 | 0.04 |
| Income-Education Level | 0.041 |
Distribution of rs8225 in the two race groups among cases and controls. The link between this variant and “race” group is determined to be statistically significant (see Table 2c)
| Controls | |||
|---|---|---|---|
| Race | 2 C (wild type) | Heterozygous | 2 T (variant) |
| African American | 9.20% | 41.80% | 49.00% |
| European American | 76.30% | 22.20% | 1.60% |
| Cases | |||
| Race | 2 C (wild type) | Heterozygous | 2 T (variant) |
| African American | 6.00% | 37.5 | 56.50% |
| European American | 70.40% | 26.5 | 3.10% |
The 14 significant biological pathways (p < 0.05) in the demographic-genetic model. 186 total pathways were considered
| KEGG Pathways |
|
|---|---|
| Calcium Signaling Pathway | 0.001 |
| Focal Adhesion | 0.002 |
| ECM Receptor Interaction | 0.007 |
| Arrhythmogenic Right Ventricular Cardiomyopathy (ARVC) | 0.012 |
| Hypertrophic Cardiomyopathy | 0.012 |
| Dilated Cardiomyopathy | 0.012 |
| Regulation of Actin Cytoskeleton | 0.014 |
| Oocyte Meiosis | 0.014 |
| Fc-Gamma Receptor-Mediated Phagocytosis | 0.021 |
| Long-term Depression | 0.036 |
| Adherens Junction | 0.04 |
| MAPK Signaling Pathway | 0.04 |
| Endocytosis | 0.04 |
| Neuroactive Ligand Receptor Interaction | 0.047 |