| Literature DB >> 22371699 |
Jingyu Liu1, Mohammad M Ghassemi, Andrew M Michael, David Boutte, William Wells, Nora Perrone-Bizzozero, Fabio Macciardi, Daniel H Mathalon, Judith M Ford, Steven G Potkin, Jessica A Turner, Vince D Calhoun.
Abstract
To address the statistical challenges associated with genome-wide association studies, we present an independent component analysis (ICA) with reference approach to target a specific genetic variation and associated brain networks. First, a small set of single nucleotide polymorphisms (SNPs) are empirically chosen to reflect a feature of interest and these SNPs are used as a reference when applying ICA to a full genomic SNP array. After extracting the genetic component maximally representing the characteristics of the reference, we test its association with brain networks in functional magnetic resonance imaging (fMRI) data. The method was evaluated on both real and simulated datasets. Simulation demonstrates that ICA with reference can extract a specific genetic factor, even when the variance accounted for by such a factor is so small that a regular ICA fails. Our real data application from 48 schizophrenia patients (SZs) and 40 healthy controls (HCs) include 300K SNPs and fMRI images in an auditory oddball task. Using SNPs with allelic frequency difference in two groups as a reference, we extracted a genetic component that maximally differentiates patients from controls (p < 4 × 10(-17)), and discovered a brain functional network that was significantly associated with this genetic component (p < 1 × 10(-4)). The regions in the functional network mainly locate in the thalamus, anterior and posterior cingulate gyri. The contributing SNPs in the genetic factor mainly fall into two clusters centered at chromosome 7q21 and chromosome 5q35. The findings from the schizophrenia application are in concordance with previous knowledge about brain regions and gene function. All together, the results suggest that the ICA with reference can be particularly useful to explore the whole genome to find a specific factor of interest and further study its effect on brain.Entities:
Keywords: brain network; functional magnetic resonance imaging; genome-wide association study; independent component analysis with reference; schizophrenia; single nucleotide polymorphisms
Year: 2012 PMID: 22371699 PMCID: PMC3284145 DOI: 10.3389/fnhum.2012.00021
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Figure 1The flowchart of the proposed method.
Subject demographic information and IQ measurements.
| Sex (male/female) | 29/19 | 20/20 | − |
| Race (Caucasian/African American/Asian/Mixed) | 38/8/2/0 | 34/2/2/2 | − |
| Handedness (R/L) | 44/4 | 38/2 | − |
| Age (in years) | 38.06 (10.98) | 35.29 (10.99) | 1.185/0.23 |
| Maternal education (in years) | 13.26 (2.99) | 14.44 (2.41) | −1.95/0.054 |
| Paternal education | 14.07 (3.47) | 14.89 (3.29) | −1.08/0.28 |
| Subject education | 13.70 (1.912) | 15.67 (1.81) | −4.8/7.18E-6 |
| Verbal IQ | 103.98 (10.21) | 110.79 (6.88) | 3.54/7.16E-4 |
| Performance IQ | 107.88 (5.07) | 110.95 (3.25) | 3.27/1.7E-3 |
| Full scale IQ | 102.13 (8.96) | 112.1 (6.03) | 3.54/7.16E-4 |
Note: IQ data was missing seven subjects from the SZ group.
Simulation results using 200 subjects and 10,000 SNPs.
| 0.1 | 1 | 0.94 | 0.30 | 0.99 | 0.28 |
| 0.2 | 1 | 0.97 | 0.53 | 0.99 | 0.47 |
| 0.4 | 1 | 0.99 | 0.71 | 0.99 | 0.62 |
| 0.5 | 1 | 0.99 | 0.73 | 0.99 | 0.64 |
| 0.1 | 0.75 | 0.93 | 0.24 | 0.99 | 0.25 |
| 0.2 | 0.75 | 0.91 | 0.37 | 0.99 | 0.31 |
| 0.4 | 0.75 | 0.98 | 0.63 | 0.99 | 0.52 |
| 0.5 | 0.75 | 0.98 | 0.63 | 0.99 | 0.51 |
| 0.1 | 0.5 | 0.82 | 0.13 | 0.99 | 0.11 |
| 0.2 | 0.5 | 0.85 | 0.23 | 0.99 | 0.17 |
| 0.4 | 0.5 | 0.93 | 0.44 | 0.99 | 0.32 |
| 0.5 | 0.5 | 0.94 | 0.45 | 0.99 | 0.33 |
| 0.1 | 0.25 | 0.69 | 0.06 | 0.98 | 0.04 |
| 0.2 | 0.25 | 0.72 | 0.07 | 0.98 | 0.04 |
| 0.4 | 0.25 | 0.80 | 0.12 | 0.98 | 0.09 |
| 0.5 | 0.25 | 0.82 | 0.16 | 0.98 | 0.09 |
Note:
Indicates that a non-disease-related factor was extracted with an effect size same as a random locus.
Top component SNPs by weights and associated genes.
| rs10953026 | 7.11 | 0.22 | 0.54 | 7q21.13 | CDK14 |
| rs2286696 | 6.7 | 0.29 | 0.61 | 7q21.13 | CDK14 |
| rs4105175 | 6.6 | 0.29 | 0.55 | 5q35.3 | ZNF879 |
| rs2173096 | 6.39 | 0.28 | 0.55 | 5q35.3 | Between genes GRM6 and ZNF879 |
| rs758706 | −6.12 | 0.56 | 0.31 | 7q21.13 | CDK14 |
| rs2067011 | −5.79 | 0.58 | 0.30 | 5q35.3 | GRM6 |
| rs1889574 | −5.65 | 0.46 | 0.26 | 13q12.11 | LOC 100506971 |
| rs7669821 | 5.53 | 0.36 | 0.54 | 4p14 | UGDH |
| rs2279834 | 5.31 | 0.10 | 0.39 | 12q23.1 | SLC5A8 |
| rs1406002 | −5.11 | 0.48 | 0.33 | 2p15 | Intergenic |
| rs1468708 | 5.10 | 0.21 | 0.51 | 6q24.1 | Intergenic |
| rs1284108 | 4.99 | 0.24 | 0.51 | 11q21 | Near MED17 (<20 kbp) |
| rs1017528 | 4.89 | 0.21 | 0.41 | 17q22 | Near CUEDC (< 30 kbp) |
| rs1445846 | 4.88 | 0.23 | 0.44 | 5q35.3 | ZNF354C |
| rs1445845 | 4.88 | 0.23 | 0.44 | 5q35.3 | ZNF354C |
| rs1445844 | 4.88 | 0.23 | 0.44 | 5q35.3 | ZNF354C |
| rs11750568 | 4.88 | 0.23 | 0.44 | 5q35.3 | ADAMTS2 |
| rs1419005 | 4.88 | 0.23 | 0.51 | 20p11.21 | NXT1 (<1 kbp) |
| rs887780 | 4.87 | 0.24 | 0.41 | 6q24.1 | Intergenic |
| rs3824039 | −4.86 | 0.54 | 0.33 | 7q21.13 | CDK14 |
| rs2188240 | −4.85 | 0.54 | 0.33 | 7q21.13 | CDK14 |
| rs1110457 | −4.85 | 0.54 | 0.33 | 7q21.13 | CDK14 |
| rs2839081 | 4.80 | 0.36 | 0.56 | 21q22.3 | Near COL6A1 (<20 kbp) |
| rs6991271 | −4.80 | 0.59 | 0.34 | 8p12 | KIF13B |
| rs2381387 | 4.70 | 0.30 | 0.51 | 4p14 | Near UGDH |
| rs1487074 | −4.68 | 0.40 | 0.23 | 2p15 | Near EHBP1 (<10 kbp) |
Table is sorted by SNP weights.
Indicates the SNP overlapping with the reference SNPs. MAF (minor allele frequency) was calculated based on our dataset. Two clusters are indicated by superscript
and
.
Figure 2Cross-correlation among the top 26 SNP loci. All correlations stronger than 0.35 were plotted (p < 0.001). Two big clusters were identified; one is centered at chromosome 7q21, including rs10953026 and rs2188240. The other is centered at chromosome 5q35 including rs4105175 and rs1445846. See text for the detail list of SNPs.
Figure 3The fMRI component linked to the SNP component. (A) Activation map with |z| value > 6 while responding to target vs. standard stimuli in the auditory oddball task. Red to yellow color indicates positive activation regions, while blue indicates deactivation regions. (B) A list of regions and volumes with |z| value > 6. The activation regions mainly include thalamus, anterior cingulate and posterior cingulate gyri and deactivation regions are mainly in left pre- and post-central gyri. Note that the pre- and post-central gyri are mainly positively activated during the task, and the deactivation in this component is only fraction of activation that covaries with positive activation in thalamus, cingulate gyrus, etc.
Figure 4The loading coefficients of the linked SNP and fMRI components. The X and Y axis are the loadings for SNP and fMRI components, respectively. The positive trend shows the positive correlation (r = 0.39, R2 = 0.15) between these linked components. Healthy controls demonstrates strong positive presence of the SNP component and higher activation in the fMRI component, while schizophrenia patients carry negative presence of the SNP component (opposite combination of minor alleles in the SNPs), and lower activation in the fMRI component.