BACKGROUND: Schizophrenia is a highly heritable disease with a polygenic mode of inheritance. Many studies have contributed to our understanding of the genetic underpinnings of schizophrenia, but little is known about how interactions among genes affect the risk of schizophrenia. This study aimed to assess the associations and interactions among genes that confer vulnerability to schizophrenia and to examine the moderating effect of neuropsychological impairment. METHODS: We analyzed 99 SNPs from 10 candidate genes in 1,512 subject samples. The permutation-based single-locus, multi-locus association tests, and a gene-based multifactorial dimension reduction procedure were used to examine genetic associations and interactions to schizophrenia. RESULTS: We found that no single SNP was significantly associated with schizophrenia. However, a risk haplotype, namely A-T-C of the SNP triplet rsDAO7-rsDAO8-rsDAO13 of the DAO gene, was strongly associated with schizophrenia. Interaction analyses identified multiple between-gene and within-gene interactions. Between-gene interactions including DAO*DISC1 , DAO*NRG1 and DAO*RASD2 and a within-gene interaction for CACNG2 were found among schizophrenia subjects with severe sustained attention deficits, suggesting a modifying effect of impaired neuropsychological functioning. Other interactions such as the within-gene interaction of DAO and the between-gene interaction of DAO and PTK2B were consistently identified regardless of stratification by neuropsychological dysfunction. Importantly, except for the within-gene interaction of CACNG2, all of the identified risk haplotypes and interactions involved SNPs from DAO. CONCLUSIONS: These results suggest that DAO, which is involved in the N-methyl-d-aspartate receptor regulation, signaling and glutamate metabolism, is the master gene of the genetic associations and interactions underlying schizophrenia. Besides, the interaction between DAO and RASD2 has provided an insight in integrating the glutamate and dopamine hypotheses of schizophrenia.
BACKGROUND:Schizophrenia is a highly heritable disease with a polygenic mode of inheritance. Many studies have contributed to our understanding of the genetic underpinnings of schizophrenia, but little is known about how interactions among genes affect the risk of schizophrenia. This study aimed to assess the associations and interactions among genes that confer vulnerability to schizophrenia and to examine the moderating effect of neuropsychological impairment. METHODS: We analyzed 99 SNPs from 10 candidate genes in 1,512 subject samples. The permutation-based single-locus, multi-locus association tests, and a gene-based multifactorial dimension reduction procedure were used to examine genetic associations and interactions to schizophrenia. RESULTS: We found that no single SNP was significantly associated with schizophrenia. However, a risk haplotype, namely A-T-C of the SNP triplet rsDAO7-rsDAO8-rsDAO13 of the DAO gene, was strongly associated with schizophrenia. Interaction analyses identified multiple between-gene and within-gene interactions. Between-gene interactions including DAO*DISC1 , DAO*NRG1 and DAO*RASD2 and a within-gene interaction for CACNG2 were found among schizophrenia subjects with severe sustained attention deficits, suggesting a modifying effect of impaired neuropsychological functioning. Other interactions such as the within-gene interaction of DAO and the between-gene interaction of DAO and PTK2B were consistently identified regardless of stratification by neuropsychological dysfunction. Importantly, except for the within-gene interaction of CACNG2, all of the identified risk haplotypes and interactions involved SNPs from DAO. CONCLUSIONS: These results suggest that DAO, which is involved in the N-methyl-d-aspartate receptor regulation, signaling and glutamate metabolism, is the master gene of the genetic associations and interactions underlying schizophrenia. Besides, the interaction between DAO and RASD2 has provided an insight in integrating the glutamate and dopamine hypotheses of schizophrenia.
Schizophrenia is a common, highly heritable, chronic mental disorder characterized by neuropsychological abnormalities and neurophysiological impairments [1], [2], [3]. The vulnerability to schizophrenia is influenced by a polygenic component, environmental factors, and their interactions [4]. Genes that confer vulnerability to schizophrenia have been identified in genetic studies [5], [6], [7], [8], but the genetic interactions among these genes and their interplay with neurobiological abnormalities and clinical subtypes of schizophrenia remains obscure.There were relatively large number of genes located in various chromosome regions identified as schizophrenia candidate vulnerability genes from linkage studies [9], [10], [11], [12], positional cloning or candidate gene association studies [13], [14], [15], [16], and genome-wide association studies [17], [18], [19], [20]. Those schizophrenia vulnerability genes were collected in public databases such as SzGene [21] and SZGR [22]. However, these identified genes ran a replication and non-replication pattern. For example, association between schizophrenia and the Disrupted in Schizophrenia 1 (DISC1) gene was found and replicated in some genetic studies [23], [24], [25] but failed to replicate by some of the others [26], [27]. The inconsistency in gene findings came from various reasons such as population substructure/stratification, genetic heterogeneity, and problem of statistical power [28], [29]. In Taiwan, we have carried out a series of genetic studies using the single-ethnicity samples of Taiwan Chinese families of schizophrenia to search for the genomic regions in which potentially harbor vulnerability genes of schizophrenia using linkage analyses. We have further finely mapped the vulnerability genes of schizophrenia using positional cloning. Besides, we also carried out association studies to replicate the reported vulnerability genes in the literature from the other research groups.The linkage studies for schizophrenia in Taiwan revealed suggestive linkage evidence of chromosome 1q42.1 [30], 6p24 [31], 8p21-12 [32], and 15q14 [33] using 52 families with at least two siblings affected with schizophrenia. We also found suggestive linkage evidence of 22q12 in an international collaborative study [34], where 557 families with at least two siblings affected with schizophrenia were recruited from the whole Taiwan. Genome-wide linkage analysis found the maximum nonparametric linkage z score was 2.88 for D10S2327 (100.92 cM) at 10q22.3 [35].Using candidate-gene positional cloning approach we have finely mapped genes vulnerable to schizophrenia in Taiwan, including DISC1 at chromosome 1q42 [36], RASD2 and CACNG2 at chromosome 22q12 [37], and DPYSL2, TRIM35 and PTK2B at chromosome 8p21 (in preparation). Some genes are specifically associated with some subtypes of schizophrenia.Besides, we have also performed the association studies in Taiwanese samples on NRG1 (8p12), DAO (12q24), and DAOA (12q33.2) as these three genes were reported to have strong association with schizophrenia in the literature [38], [39], [40]. We did find significant association of NRG1
[41] and DAO genes [42] with schizophrenia. However, we cannot find significant association of DAOA (G72) with schizophrenia [43]. In addition, the LMBRD1 knockout mouse model revealed a neurological impairment and manifested a prominent social withdrawal behavior, representing the negative symptom state of schizophrenia. Moreover, we also found a significant association between LMBRD1 gene and schizophrenia in our Taiwanese family samples of schizophrenia [44].These 10 candidate genes found in our Taiwanese samples of schizophrenia did have active neurobiological functions in the central nervous system especially in neurotransmission and neurodevelopment. The DAO gene product is an enzyme that degrades the amino acid, d-Serine (d-Ser), which acts as a co-agonist at the glycine site of the N-methyl-d-aspartic acid (NMDA) receptor [45]. The DAOA gene product activates the DAO enzyme [46]. The biological functions of DAO and DAOA are involved in the hypothesized hypofunction of NMDA receptor complex as the potential pathogenesis of schizophrenia [47]. The NRG1 gene product is a glycoprotein that interacts with the ERBB receptor tyrosine kinase. Moreover, NRG1 mediates cell-to-cell interactions and plays a critical role in the development of the central nervous system [48]. The DISC1 gene product is involved in neurite outgrowth and cortical development [49], [50]. The LMBRD1 gene product is a lysosomal membrane protein involved in the transport and metabolism of cobalamin (vitamin B12), which plays a key role in maintaining neuronal function [51], [52]. The DPYSL2 gene product, also known as collapsin response mediator protein-2 (CRMP-2), is a novel calmodulin-binding protein [53] that is differentially expressed in the anterior cingulate cortex of schizophrenicpatients [54]. The PTK2B gene product, also known as proline-rich tyrosine kinase 2 (Pyk2), is involved in regulating calcium-induced ion channels and in cellular signaling by mitogen-activated protein kinase [55]. The RASD2 gene product is a GTPase enriched in the striatum; the levels of this GTPase are reduced in dopamine super-sensitization [56]. The CACNG2 gene product, also known as stargazin, is a subunit of the L-type calcium channel, and it modulates the calcium channel [57] and regulates the AMPA receptors [58]. The TRIM35 gene product, also known as hemopoietic lineage switch 5 (HLS5), has been rarely studied with the exception of a report involving in cancer research [59].Based on the specific protein functions of these candidate genes, we hypothesized that these vulnerability genes may involve in significant neurobiological functional pathways in neurotransmission and/or neurodevelopment, which may be responsible for the pathogenesis of schizophrenia. In order to examine the potential interactive effect of multiple genes found in our single-ethnicity schizophrenia samples, we carried out a direct sequencing study on these candidate genes to identify the potential functional single nucleotide polymorphisms (SNPs) in our Taiwan Chinese samples. These potential functional SNPs were then under this gene-gene interaction analyses.The concept of intermediate phenotypes was introduced to refine the phenotypic characterization of schizophrenia and other psychiatric illnesses [60]. This strategy was also successfully applied in the exploration of the interactive effect of multiple genes using progressive brain change in schizophrenia [61]. We tested one potential intermediate phenotype: sustained attention as measured using a Continuous Performance Test (CPT) [62]. Deficits of sustained attention manifest not only in patients with schizophrenia but also in subjects with schizotypal personality disorder and in the nonpsychotic relatives of schizophrenicpatients [63], [64], [65], [66], [67]. The normalized z score for d’, the sensitivity measure of sustained attention as assessed by the CPT, has been frequently used as a schizophrenia endophenotype. Schizophrenicpatients with a z score below –2.5 are categorized as having a deficit in sustained attention. The recurrence risk ratio for schizophrenia among parents or siblings of this subgroup was higher than that for other schizophrenia probands [68], [69]. Thus, using performance on the CPT to define endophenotypes of impaired sustained attention among schizophrenics might help resolve the genetic heterogeneity observed in association studies of schizophrenia.This study examined three hypotheses. First, that the within-gene and between-gene interactions are responsible for the complex genetics of schizophrenia. Second, that these interactions are moderated by degree of impairment of neuropsychological parameters, e.g., CPT deficits [62]. Third, that there exist comprehensible and testable neurobiological pathways which could be formulated based on the interactions of the functional expressions of these corresponding genes. Here we proposed a multi-stage procedure for identifying within-gene and between-gene interactions, and discussed the meaning of these interactions based on the glutamate and dopamine hypotheses of schizophrenia.
Methods and Materials
Ethics Statement
All potential patients who declined to participate or otherwise did not participate were eligible for treatment and were not disadvantaged in any other way by not participating in the study. In all cases, written informed consent was obtained after the study procedures were fully explained to subjects. The sample information and the data were de-identified before statistical data analysis. This study was approved by the Institutional Review Board of National Taiwan University Hospital.
Study Subjects
Genomic DNA samples were collected from 912 schizophrenicpatients (564 males and 348 females) and 600 normal controls (313 males and 287 females) in Taiwan’s Han Chinese population as part of four independent research programs. Of the 912 schizophrenic subjects, 702 were from multiplex families with at least two affected siblings, and these subjects were recruited as part of two research programs: the Multidimensional Psychopathology Study of Schizophrenia (MPSS) from 1993 to 2001 [70] and the Taiwan Schizophrenia Linkage Study (TSLS) from 1998 to 2002 [11], [35]. To ensure the independence of multiplex subjects, we randomly selected one affected sibling from each multiplex family for inclusion in the study. The other 210 schizophrenic subjects were from simplex families with only one affected sibling, and these subjects were recruited as part of another independent project. The 600 normal controls were selected from a representative supernormal genomic sample from Taiwan as part of a fourth independent project [71], with inclusion criteria of ≥60 years of age and a Short Portable Mental Status Questionnaire score of ≥14. Ninety-three MPSS families were interviewed by research psychiatrists using the Psychiatrist Diagnostic Assessment [72]. Six hundred and nine TSLS families were interviewed by well-trained assistants using the Mandarin Chinese version of the Diagnostic Interview for Genetic Studies (DIGS) [73]. The 210 patients from simplex families were interviewed using DIGS. For all subjects, the final diagnostic assessment was formulated by integrating either the Psychiatrist Diagnostic Assessment or the DIGS data with clinical information from medical records using the Specialist Diagnostic Assessment Sheet, based on the criteria of the Diagnostic and Statistical Manual of Mental Disorders, 4th edition.
Genomic Sequencing for Identifying New SNPs
A total of 50 normal controls, 50 multiplex patients and 50 simplex patients were randomly selected for sequencing. DNA sequences of promoters, exons, highly conserved introns, spliced variants, and isoforms of the ten schizophrenia candidate genes were amplified by polymerase chain reaction (PCR). PCR products were purified by treatment with exonuclease I and shrimp alkaline phosphatase (USB Corporation, Cleveland, OH, USA) and sequenced from both ends. DNA sequencing reactions were performed with BigDye Terminator Cycle Sequencing Version 3.1 (Applied Biosystems, Foster City, CA, USA) followed by analysis on an ABI 3730xl DNA Analyzer (Applied Biosystems). The resulting sequences were compared and aligned using the Polyphred Sequence Alignment Editor (http://droog.mbt.washington.edu/PolyPhred.htm/). Reference sequences were obtained from National Center for Biotechnology Information (NCBI) website (http://www.ncbi.nlm.nih.gov/).SNPs satisfying one of the following conditions were selected for further genotyping: (1) SNPs in exons, (2) SNPs with a minor allele frequency (MAF) between 0 to 10% and an MAF difference between case and control groups of >0.02, (3) SNPs with MAF >10% and an MAF difference between case and control groups of >0.04, and (4) SNPs with a functional risk >3 of medium level, as defined in FastSNP [74]. We identified 99 SNPs for genotyping experiment: 17 SNPs in DISC1 (1q42.1), 11 SNPs in LMBRD1 (6q13), 14 SNPs in DPYSL2 (8p22), 5 SNPs in TRIM35 (8p22), 21 SNPs in PTK2B (8p22), 10 SNPs in NRG1 (8p21), 5 SNPs in DAO (12q22), 6 SNPs in G72 (13q32), 4 SNPs in RASD2 (22q12), and 6 SNPs in CACNG2 (22q12). The SNP annotation information, including rs number, region, function, and physical position, followed NCBI Build 36.3 (Table S1). Eighteen novel SNPs discovered from our direct DNA sequencing experiment were deposited in GenBank (http://www.ncbi.nlm.nih.gov/genbank/) (Table S2).
SNP Genotyping
SNP genotyping was performed using the method of matrix-assisted laser desorption/ionization-time of flight mass spectrometry. PCR primers and genotyping probes that flanked the SNPs were designed using SpectroDESIGNER software (Sequenom, San Diego, CA, USA). A DNA fragment (100–300 bp) encompassing the SNP site was amplified using a PCR-ABI 9700 thermocycler (Applied Biosystems) according to the manufacturer’s instructions. After removing any deoxynucleotide triphosphate (dNTP) that was not incorporated and inactivating the shrimp alkaline phosphatase from the PCR product, primer extension was performed by adding the probe, Thermo Sequenase (Amersham Pharmacia, Piscataway, NJ, USA), and the appropriate dideoxynucleotide triphosphate (ddNTP)/dNTP mixture, followed by 55 cycles of denaturing at 94°C for 5 s, annealing at 52°C for 5 s, and extension at 72°C for 5 s. Sequenom’s SpectroPOINT matrix array was used to transfer the PCR products from the microplate to a 384-well SpectroCHIP. The mass spectrum from time-resolved spectra was retrieved using a MassARRAY mass spectrometer (Sequenom), and each spectrum was then analyzed using SpectroTYPER and SpectroREADER software (Sequenom) for genotype calling.
Neuropsychological Assessment of Sustained Attention Using the CPT
Sustained attention was assessed using the unmasked and masked CPTs (Zd’ and Zmd’) [68]. The CPT indices of the normal controls were not measured. A CPT machine from Sunrise System, v. 2.20 (Pembroke, MA, USA) was used to conduct the tests for schizophrenic subjects [62]. Briefly, numerals from 0 to 9 were randomly presented on a screen for 50 ms each, at a rate of one per second. Subjects were asked to respond each time the numeral “9” was preceded by the numeral “1” on the screen. For the 25% masked session, a pattern of snow appeared on the screen to visually distort the images. Each test session began with 2 min of practice, which could be repeated if required. A total of 331 trials, with 34 (10%) target stimuli, were presented over a 5-min test session. A signal-detection index of performance on the test, sensitivity (d’), was derived from the hit rate (defined as the probability of a correct response to the target trials) and false-alarm rate (defined as the probability of a response to the non-target trials). Sensitivity measures an individual’s ability to discriminate target stimuli from non-target stimuli. In a 1-week test-retest reliability study, the intra-class correlation coefficients of reliability for d’ were 0.83 and 0.82 for the unmasked and the 25% masked tasks, respectively [68]. The effect of age, education, and sex on performance of the CPT was adjusted based on a community-based sample of 345 controls. The d’ was calculated as the Z-scores that were adjusted for these demographic characteristics [68]. For the unmasked CPT in this study, there were 455 (50%) patients with Zd’ ≥ –2.5, 298 (33%) patients with Zd’<–2.5, and 159 (17%) patients without Zd’ measurements. For the masked CPT, there were 357 (39%) patients with Zmd’ ≥ –2.5, 374 (41%) patients with Zmd’<–2.5, and 181 (20%) patients without Zmd’ measurements.
Statistical Analyses
Quality control of samples and SNPs
The quality of the study samples and SNP markers was evaluated. Poor samples and SNPs were removed using a sequentially exclusive procedure. First, samples with a genotyping call rate (GCR) of <0.9 were removed. Next, SNPs with a GCR <0.9 in a case or control group were removed, and then SNPs with an MAF <0.01 were removed. Finally, SNPs with pFDRHWE <0.05 were removed, where pFDRHWE is an adjusted p-value that controls for the false-discovery rate (FDR) [75] in an exact Hardy-Weinberg equilibrium test [76] with 10,000 permutations in the control group. Finally, missing genotypes were imputed using BIMBAM software [77]. An unstratified analysis of overall samples and a stratified analysis of four CPT strata, Zd’<−2.5, Zd’ ≥ −2.5, Zmd’<−2.5, and Zmd’ ≥ −2.5, for genetic associations and interactions were conducted using the clean and imputed data.
Association tests
The permutation-based single-locus and multi-locus association tests were performed to examine genetic associations to schizophrenia. Here, 10,000 permutations were considered. For the single-locus association tests, genotype-based, allele-based, and trend-based association tests were performed using the CASECONTROL procedure in SAS/GENETICS software [78]. The HAPLOVIEW software [79] was used to measure coefficient of LD (D’) [80] of SNPs within candidate genes, and haplotype blocks were identified using Gabriel’s method [81]. Within each haplotype block, the overall haplotype likelihood-ratio association test and the individual haplotype likelihood-ratio association test [82] were conducted using the HAPLOTYPE procedure in SAS/GENETICS software [78]. An FDR [75] was calculated for corrections of multiple testing in each CPT stratum. A significant association was defined as a value of <0.05 for the adjusted p-values pFDRSG, pFDRSA, and pFDRST, for genotype-based, allele-based, and trend-based single-locus association tests, respectively, and pFDRH for the haplotype association test.
Gene-gene interaction analyses
A gene-based, multifactorial dimension reduction procedure was utilized to identify the within-gene and between-gene interactions with the highest testing accuracy based on a 10-fold cross validation procedure [83]. For identification of within-gene interactions, MDR software [84], [85], [86] was used to analyze all SNPs within a gene of study. The testing accuracy was calculated for all possible order-1 (i.e., a single SNP), order-2 (i.e., a pair of SNPs), order-3 (i.e., a triplet of SNPs), and order-4 (i.e., a quartet of SNPs) SNP combinations. The average testing accuracy of 10-fold cross validation samples was calculated. The model with the highest average testing accuracy was considered to be the best model. No statistical tests were performed at this stage. If the best model was an order-1 model, then we concluded that a within-gene interaction did not exist. Only the within-gene interactions with a cross-validation consistency ≥0.8 were regarded as candidates. Definitions of testing accuracy and cross-validation consistency can refer to the MDR User’s Guide.A similar gene-based, multifactorial dimension reduction procedure was applied to each pair of genes to identify between-gene interactions. MDR software was used to analyze all SNPs within a pair of genes to determine the model with the highest average testing accuracy among all possible order-1, order-2, order-3, and order-4 SNP combinations. No statistical tests were performed at this stage. If the best model contained at least one SNP in the first gene and at least one SNP in the second gene, a between-gene interaction was found. Only the between-gene interactions with a cross-validation consistency ≥0.8 were regarded as candidates.A permutation test was performed to validate the identified within-gene and between-gene interaction candidates using MDR-PT software [86]. Because we only needed to evaluate statistical significance of the interaction candidate (the best model of all possible order-1, order-2, order-3, and order-4 SNP combinations), in contrast to conventional interaction testing procedures, the number of tests was dramatically reduced in the gene-based MDR procedure. A 10-fold cross validation procedure was used, and 1,000 permutations were carried out to randomly exchange the case and control status of the samples. An empirical p-value of the testing accuracy for each of the identified interaction candidates was calculated. In unstratified analysis and CPT-stratified analysis, only the within-gene and between-gene interaction candidates having an FDR-adjusted empirical p-value (i.e., pFDRWG for within-gene interaction and pFDRBG for between-gene interaction) of <0.05 were regarded as confirmed and reliable interactions. In this paper, notation pFDR denotes an FDR-adjusted p-value for corrections of multiple testing.
Results
After the direct sequencing analysis, ninety-nine SNPs were qualified for the genotyping study in these ten candidate genes. The quality control procedures first removed 65 samples with a GCR <0.9. Next, 11 SNPs with a GCR <0.9 in the case or control group were removed, and 4 SNPs with an MAF <0.01 were removed; no SNPs had a pFDRHWE <0.05. After the quality control procedures, 84 SNPs remained including 15 SNPs in DISC1, 7 SNPs in LMBRD1, 10 SNPs in DPYSL2, 4 SNPs in TRIM35, 21 SNPs in PTK2B, 9 SNPs in NRG1, 5 SNPs in DAO, 4 SNPs in G72, 3 SNPs in RASD2, and 6 SNPs in CACNG2, from 1,447 samples from 893 cases and 554 controls. Finally, missing genotypes (∼0.57% genotypes of all data) were imputed.First, unstratified analyses were performed. Genotype-, allele-, and trend-based single-locus association tests consistently identified rsCACNG2_3 and rsDAO_13 as significantly associated with schizophrenia, but the significance did not survive a correction for multiple testing (Table S1). Haplotype association tests were conducted for the following genes (the number of LD blocks is indicated in parentheses): DISC1 (1), LMBRD1 (1), DPYSL2 (1), TRIM35 (1), PTK2B (2), NRG1 (2), DAO (1), G72 (1), RASD2 (1), and CACNG2 (2) (Figure 1). Only the risk haplotype A-T-C of the SNP triplet rsDAO7-rsDAO8-rsDAO13 within the LD block of DAO was significantly associated with schizophrenia (pFDRH = 0.0090; Table S3). No controls carried this risk haplotype, but 1.24% of schizophrenia subjects carried this haplotype. We found a within-gene interaction of DAO, rsDAO_6*rsDAO_8 (pFDRWG = 0.0028; testing accuracy = 0.5811), and a between-gene interaction of DAO and PTK2B, rsDAO_6*rsDAO_8*rsPTK2B_2 (pFDRBG = 0.0028, testing accuracy = 0.5940; Figure 2). The cross-validation accuracy of the two identified interactions was 100%.
Figure 1
The LD structures of 84 SNPs in 10 candidate genes for vulnerability to schizophrenia.
For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].
Figure 2
Within-gene and between-gene interactions in schizophrenia for the unstratified and CPT-stratified interaction analyses.
The unstratified interaction plot is at the center, and the CPT-stratified plots with the stratum name are located in the four corners. Stratum name and sample size of cases and controls are provided in the title for each panel. Gene name, the number of SNPs in the gene, and the located chromosome are provided around each circle. If an interaction was identified, the SNP id along with its location in the gene (in parentheses) are provided. Abbreviations for SNP locations are: 3′, 3′ untranslated region; 5′, 5′ untranslated region; I, intron; E, exon; s, synonymous SNP; n, near-gene SNP; m, missense SNP. Interaction candidates with a pFDR <0.05 and ≥0.05 are connected by a solid line and dashed line, respectively, and the level of testing accuracy is represented by the line color from light green (less accurate) to dark green (highly accurate).
The LD structures of 84 SNPs in 10 candidate genes for vulnerability to schizophrenia.
For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].
Within-gene and between-gene interactions in schizophrenia for the unstratified and CPT-stratified interaction analyses.
The unstratified interaction plot is at the center, and the CPT-stratified plots with the stratum name are located in the four corners. Stratum name and sample size of cases and controls are provided in the title for each panel. Gene name, the number of SNPs in the gene, and the located chromosome are provided around each circle. If an interaction was identified, the SNP id along with its location in the gene (in parentheses) are provided. Abbreviations for SNP locations are: 3′, 3′ untranslated region; 5′, 5′ untranslated region; I, intron; E, exon; s, synonymous SNP; n, near-gene SNP; m, missense SNP. Interaction candidates with a pFDR <0.05 and ≥0.05 are connected by a solid line and dashed line, respectively, and the level of testing accuracy is represented by the line color from light green (less accurate) to dark green (highly accurate).Second, CPT-stratified analyses were performed. Single-locus association tests only found a borderline significance of rsCACNG2_3 in the subgroup “zmd’<−2.5” (pFDRST = 0.042) after a multiple test correction (Table S1). The frequencies of the risk allele T were 0.3595 and 0.2825, and their standard errors were 0.0179 and 0.0136, in the case and control groups, respectively. The LD structures of the four CPT strata differed only slightly for DPSYL2 and PTK2B when compared with the results of the non-stratified analysis (Figures S1, S2, S3, S4 and Figure 1). All haplotype association tests in the four CPT strata identified a risk haplotype A-T-C of the SNP triplet rsDAO7-rsDAO8-rsDAO13 within the LD block of DAO (pFDRH = 0.0034 for zd’ ≥ −2.5; pFDRH = 0.0090 for zd’<−2.5; pFDRH = 0.0047 for zmd’ ≥ −2.5; pFDRH = 0.0044 for zmd’<−2.5; Table S3). Approximately 1−2% of schizophrenia subjects carried this haplotype, whereas none of the control subjects did.In the zd’<−2.5 stratum we found several between-gene interactions: rsDAO_6*rsDAO_8*rsPTK2B_18 (pFDRBG = 0.0312; testing accuracy = 0.5800), rsDAO6*rsDAO8*rsNRG1_6 (pFDRBG = 0.0043; testing accuracy = 0.6222), and rsDAO_6*rsDAO_8*rsDAO_13*rsRASD2_8 (pFDRBG = 0.0043; testing accuracy = 0.5911). The analysis of the zd’ ≥ −2.5 stratum identified the within-gene interaction rsDAO_6*rsDAO_7*rsDAO_8*rsDAO_13 (pFDRWG = 0.0150; testing accuracy = 0.5842). The analysis of the zmd’<−2.5 stratum identified the within-gene interactions rsDAO_6*rsDAO_8*rsDAO_13 (pFDRWG = 0.0018; testing accuracy = 0.5882) and rsCACNG2_3*rsCACNG2_15 (pFDRWG = 0.0297; testing accuracy = 0.5597), and the between-gene interactions rsDAO_6*rsDAO_8*rsDISC1_2 (pFDRBG = 0.0069; testing accuracy = 0.5917), rsDAO_6*rsDAO_8*rsDAO_13*rsPTK2B_16 (pFDRBG = 0.0018; testing accuracy = 0.6020), and rsDAO_6*rsDAO_8*rsNRG1_6 (pFDRBG = 0.0018; testing accuracy = 0.5935). The analysis of the zmd’ ≥ −2.5 stratum identified the within-gene interaction rsDAO_6*rsDAO_7*rsDAO_8*rsDAO_13 (pFDRWG = 0.0045; testing accuracy = 0.6015) and the between-gene interaction rsDAO_6*rsDAO_8*rsPTK2B_2 (pFDRBG = 0.0158; testing accuracy = 0.5842; Figure 2). All of the identified interactions had a cross-validation accuracy of ≥0.8.
Discussion
Results from the single-locus association tests suggest that single SNPs have little impact on schizophrenia. Only borderline significance was found for a single SNP, rsCACNG2_3. Still, a future study with a larger sample size might indeed identify minor-effect SNPs associated with schizophrenia. By incorporating LD information, our haplotype analysis identified a risk haplotype A-T-C of the SNP triplet rsDAO7-rsDAO8-rsDAO13 on DAO. The low frequency of this risk haplotype suggests that a common disease can be caused by rare variants with a large impact (common disease, rare variant) [87]. The finding is worthy of a further investigation by a recruitment of more samples and by an independent replication using another cohort in the future.Multiple within-gene and between-gene interactions were identified. Genetic interaction analysis found that DAO is a master gene in the genetic interaction network underlying schizophrenia. Among the confirmed and reliable genetic interactions, all the within-gene interactions were found in DAO, except for a within-gene interaction in CACNG2, and all the between-gene interaction pairs involved DAO. Two master SNP nodes, rsDAO_6 and rsDAO_8, play key roles in the within- and between-gene interactions. Moreover, DAO and PTK2B were identified in most of the unstratified and stratified interaction analyses as an important between-gene interaction pair. These results support our hypothesis that within-gene and between-gene interactions may be responsible for the complex genetics of schizophrenia.Our analysis shows that DAO plays an important role in the genetic associations and interactions for schizophrenia. The DAO gene product is involved in the signal transduction pathway of the NMDA receptor [88], [89]. Substrates of the DAO-encoded enzyme, especially d-Ser, may bind to the glycine site of the NMDA receptor and function as a co-agonist [45], [90]; thus, the DAO gene product may regulate the NMDA receptor by the level of d-Ser, thereby opening the calcium channel of the NMDA receptor. d-Ser was found to inhibit the AMPA receptor–mediated current in rat hippocampal neurons [91]; thus, the DAO gene product was implicated in the pathogenesis of schizophrenia [89].The CPT-stratified analyses indicated that the genetic interactions of risk SNPs for schizophrenia were moderated by the degree of sustained attention deficit among schizophrenics. The within-gene interaction of DAO and the between-gene interaction of DAO and PTK2B were found for almost all unstratified and CPT-stratified analyses; however, some stratum-specific genetic interactions were found in the severe sustained attention deficit groups (zd’<−2.5 and zmd’<−2.5). We identified the between-gene interaction pair DAO*PTK2B, DAO*NRG1 and DAO*RASD2 in stratum zd’<−2.5, and the between-gene interaction pair DAO*NRG1, DAO*DISC1, and DAO*PTK2B in stratum zmd’<−2.5. These results support our hypothesis that the degree of sustained attention deficit moderates interactions of risk SNPs for schizophrenia.The glutamate receptor, such as NMDA receptor, can be triggered by d-Ser, which is a co-agonist of NMDA receptor. Activating this receptor will induce the calcium influx, which may cause clustering and autophosphorylation of Pyk2 (Proline-rich tyrosine kinase 2, which is also known as Ptk2b) in the postsynaptic neuron [92]. This cluster can be disrupted by the action of striatal-enriched protein-tyrosine phosphatase (STEP), which in turn down-regulate the long term potentiation (LTP) [93]. Affecting the long-term potentiation early in development in the hippocampus CA1 region may contribute to cognitive deficits observed later in schizophrenia [94]. Therefore, decreasing the concentration of d-Ser may lead to Schizophrenia.As shown in Figure 3, the concentration of d-Ser is regulated by two factors. First of all, serine racemase (SR) catalyzes the conversion of L-Ser to d-Ser, which increases the d-Ser concentration in the synapse area. On the other hand, DAO gene encodes an enzyme that degrades d-Ser. The inactivation of SR or the activation of DAO may decrease the d-Ser concentration and thus decrease calcium influx. This argument is consistent with the observations that polymorphisms of SR and DAO genes were implicated as risk factors for schizophrenia [95], [96], respectively).
Figure 3
A schematic diagram to show the relation among genes that involved in the observed gene-gene interactions.
The top, bottom, and left side components represent the pre-synaptic neuron, post-synaptic neuron, and glial cells, respectively. Three types of receptors (NMDA receptor, ERBB4 receptor, and AMPA receptor) are drawn on the post-synaptic neuron. All the genes involved in the gene-gene interactions are shown in boldface. NRG1 is the ligand for ERBB4 receptor, which may trigger the long term potentiation by way of the PTK2B protein activation. DISC1 may stabilize serine racemase (SR), which will convert L-Serine to d-Serine (d-Ser). The black triangle on the side of NMDA receptor represents d-Ser, which is a co-agonist of this receptor. Both NMDA and AMPA receptors are calcium channels, which may increase calcium influx upon activation. Two molecules of CACNG2 (shown by the grey oval) were found on each AMPA receptor. Even though CACNG2 is a subunit of AMPA receptor, this protein is explicitly drawn in order to show its role in gene-gene interaction.
A schematic diagram to show the relation among genes that involved in the observed gene-gene interactions.
The top, bottom, and left side components represent the pre-synaptic neuron, post-synaptic neuron, and glial cells, respectively. Three types of receptors (NMDA receptor, ERBB4 receptor, and AMPA receptor) are drawn on the post-synaptic neuron. All the genes involved in the gene-gene interactions are shown in boldface. NRG1 is the ligand for ERBB4 receptor, which may trigger the long term potentiation by way of the PTK2B protein activation. DISC1 may stabilize serine racemase (SR), which will convert L-Serine to d-Serine (d-Ser). The black triangle on the side of NMDA receptor represents d-Ser, which is a co-agonist of this receptor. Both NMDA and AMPA receptors are calcium channels, which may increase calcium influx upon activation. Two molecules of CACNG2 (shown by the grey oval) were found on each AMPA receptor. Even though CACNG2 is a subunit of AMPA receptor, this protein is explicitly drawn in order to show its role in gene-gene interaction.It has been shown that a mutant Disc1 failed to bind and stabilize SR. As a result, the concentration of d-Ser is lowered in mouse model [97]. The effect of this d-Ser deficit can be further enhanced by having a polymorphism, which may have a higher DAO activity. In other words, the cognitive deficits are expected to be even more severe when the DISC1 and DAO polymorphisms co-existed in the same cell.Because the active D-amino acid oxidase is a dimer [98], the hetero-dimer of a mutant and a wild type protein may have the mutant phenotype. On the basis of the generalized model of gene dosage and dominant negative effects in macromolecular complexes [99], about 25% of the dimers are wild type dimers. If, however, there are two polymorphisms within the DAO gene co-exist in trans, none of the dimers are wild type DAO proteins. It appeared that one polymorphism enhances the effect of the other polymorphism. In other words, these two polymorphisms are interacting with each other within the DAO gene. The association of the polymorphism of CACNG2 and schizophrenia [37] may be related to the calcium influx controlled by the AMPA receptor. CACNG2 protein regulates synaptic targeting of AMPA receptors (AMPAR) [100], because CACNG2 (also known as stargazin) is a subunit of AMPAR. Since there are two CACNG2 proteins on an active AMPAR (see Figure 3), the within gene interaction of CACNG2 can also be explained by using the generalized model [99].As described in the introduction, the polymorphisms of NRG1 and PTK2B are associated with schizophrenia, respectively. It is interesting to note that neuregulin 1 (Nrg1) activates Fyn and Pyk2 (Ptk2b) kinases and then modulates channel properties of NMDA receptor [101] in mouse model. If NMDA receptor is regulated by both NRG1 and DAO proteins, the NRG1 may enhance the down-regulation effect of DAO protein on calcium influx. Thus, a variation that attenuates the activity of either NRG1 or PTK2B (human orthologs of Nrg1 and Pyk2) is likely to enhance the cognitive deficits caused by DAO protein. This synergistic effect is similar to the synthetic lethal phenomenon [102]. Because PTK2B protein is in the downstream of NRG1 signaling pathway (see Figure 3), the effect PTK2B on cognitive deficits might be more direct than that of NRG1. This may explain why the interaction of DAO and PTK2B was observed regardless of stratification by neuropsychological dysfunction.In addition to the glutamate hypothesis, the dopamine hypothesis of has also been used to explain the possible cause of schizophrenia. In fact, the striatal dopamine abnormalities are now clearly demonstrated in patients with schizophrenia and at risk population (reviewed in [103]). Interestingly, the protein product of RASD2 (also known as Rhes, Ras homolog enriched in striatum) is a risk factor for schizophrenia [37]. This protein has been implicated in modulating dopamine signaling in striatal medium spiny neurons [104]. Furthermore, dopamine receptors D2 might associate with schizophrenia via Akt signaling [105]. It has been shown that RASD2 may regulate the Akt pathway by interacting with the regulatory subunit of PI3K
[106]. This observation provides a possible molecular mechanism for the association of RASD2 polymorphism and schizophrenia. The striatum is an area where glutamatergic input from cerebral cortex and a dopaminergic input from the substantia nigra come together. Thus, the interaction between DAO and RASD2 genes suggests a possible link between dopamine hypothesis [107] and glutamate hypothesis [108].Social-demographic-matched case-control design was favorable but not considered before. The main reason was that the available schizophrenicpatients and normal controls in this study were not affordable for a complete sample matching by all important social-demographic variables such as gender, age, and education level. In order to avoid false-positive findings, we therefore examined if gender, age and education level were confounders. The post-analyses of the unstratified data of overall samples and the stratified data of CPT strata showed that all the SNPs didn’t have significantly different genotypic distributions in gender, age and education level groups. The least FDR-adjusted p-value for the association between SNPs and gender, age and education level groups was 0.252, 0.202 and 0.145, respectively. The results suggest that gender, age and education level were not confounders to the outcomes of genetic association and interaction in this study.We also performed logistic regression analysis to examine the interactions reported by our gene-based MDR procedure. Logistic regression, a parametric model, has advantages in exploring genotype-phenotype relationship but suffering from the curse of dimensionality [84]. Most of our analyses either met a convergence failure while solving maximum likelihood estimates of regression coefficients or the fitted models failed to pass a lack-of-fit test [109] especially for the models involving more variables and higher order interactions. Among the genetic interactions reported by the gene-based MDR, logistic regression can fit well in two cases. The first case was about the within-gene interaction of the DAO gene in an unstratified analysis. The raw p-values from a type III analysis of rsDAO_8, rsDAO_6 and their interaction rsDAO_8*rsDAO_6 were 0.9994, 0.9995, and 0.0000 for nominal genotype coding and 0.0002, 0.0289, and 0.0000 for continuous genotype coding. In this case, both the logistic regression and gene-based MDR analysis identified the within-gene interaction of the DAO gene.The second case was about the within-gene interaction of the CACNG2 gene in the zmd’<-2.5 stratum. The raw p-values from a type III analysis for rsCACNG2_3, rsCACNG2_15 and their interaction rsCACNG2_3*rsCACNG2_15 were 0.0023, 0.8214, and 0.3651 for nominal genotype coding and 0.0178, 0.5757, and 0.8600 for continuous genotype coding. SNP rsCACNG2_3 was the only SNP showing a marginal significance of genetic association in this study. However, logistic regression analysis did not identify the interaction rsCACNG2_3*rsCACNG2_15 reported by the gene-based MDR. The discrepancy between the results of the gene-based MDR and logistic regression analyses may cause by a complex pattern of genetic interaction of rsCACNG2_3 and rsCACNG2_15. Compared with the low-risk genotype rsCACNG2_3 = GG, genotypes rsCACNG2_3 = TG and TT are high-risk except that the genotype combinations rsCACNG2_3 = TG and rsCACNG2_15 = GA are low-risk. Therefore, in our gene-based MDR analysis, the interaction model of rsCACNG2_3 and rsCACNG2_15 showed a higher testing accuracy than the main effect model of rsCACNG2_3. Whatever a 4-degree-of-freedom type III interaction test or four 1-degree-of-freedom interaction tests using dummy variables, logistic regression analysis requires more samples in order to detect such a complex pattern of genetic interaction.This study is limited, however, by the lack of CPT data for the normal controls. The results of the CPT analysis rely on the assumption that the distribution of CPT scores is relatively homogeneous in normal controls–an assumption that is held generally for the Taiwan Han Chinese population [68]. Besides, this study is limited by the adoption of only ten candidate vulnerability genes found in the Taiwanese population, and not including all other candidate genes reported in literature. However, we did also take the strength of single-ethnicity samples in this study to reduce false-positive and/or false-negative results due to genetic heterogeneity. Finally, this study is also limited by no replications for the reported genetic association and interaction. The findings reported in this study will be replicated in our ongoing genome-wide association study of schizophrenia in the Taiwan Han Chinese population.This study identified statistically significant within-gene and between-gene interactions in schizophrenia, and it thereby provides a new paradigm for exploring genetic pathways underlying this disease. Furthermore, this study also revealed multiple neurobiological pathways responsible for the multiple pathogenesis mechanisms of schizophrenia potentially. Further studies are necessary to explore the molecular mechanisms underlying these gene-gene interactions by bridging clinical phenotypes of schizophrenia, neurobiological abnormalities, genetic expressions, and the associated/interactive genes. Identification of interactions involving SNPs on more than two genes requires a large sample size and will be conducted after our sample size has been further increased.The LD structures of 84 SNPs in 10 candidate genes for the Zd’ ≥ −2.5 stratum. For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].(TIF)Click here for additional data file.The LD structures of 84 SNPs in 10 candidate genes for the Zd’<−2.5
stratum. For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].(TIF)Click here for additional data file.The LD structures of 84 SNPs in 10 candidate genes for the Zmd’ ≥ −2.5
stratum. For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].(TIF)Click here for additional data file.The LD structures of 84 SNPs in 10 candidate genes for the Zmd’<−2.5
stratum. For each gene, the id of each SNP in the gene is listed, and the locations reflect the relative physical positions of the SNPs (in units of base pairs). The LD coefficient, D’, is provided unless D’ = 1. The color scheme for D’ presentation is as follows: white depicts the case of D’ <1 and LOD <2; blue depicts the case of D’ = 1 and LOD <2; pink or light red depicts the case of D’ <1 and LOD ≥2; bright red depicts the case of D’ = 1 and LOD ≥2. The LD block(s) within each gene is marked by an inverted triangle based on Gabriel’s method [81].(TIF)Click here for additional data file.Results of single-locus association tests. Chromosome, gene name, SNP ID, rs number, region, function, and physical position are shown. The raw p-values and FDR-adjusted p-values are shown for genotype-based (X_G), allele-based (X_A), and trend-based (X_T) association tests in an unstratified analysis of overall samples and a stratified analysis of four CPT strata, Zd’ ≥ −2.5, Zd’<−2.5, Zmd’ ≥ −2.5, and Zmd’<−2.5.(PDF)Click here for additional data file.Novel SNPs discovered from DNA sequencing. Observation, chromosome, gene name, SNP ID, rs number, and ss number are shown. Eighteen novel SNPs discovered from our direct DNA sequencing experiment were deposited in GenBank. Their ss numbers are listed in the last column of this table.(XLSX)Click here for additional data file.Results of haplotype-based association tests. The results of haplotype-based association tests in an unstratified analysis of overall samples and a stratified analysis of four CPT strata, Zd’ ≥ −2.5, Zd’<−2.5, Zmd’ ≥ −2.5, and Zmd’<−2.5, are shown separately. In each sub-table, gene, LD block, haplotype, haplotype frequencies in case, control and combined groups, and the exact p-values and FDR-adjusted p-values of haplotype-based association tests are provided.(PDF)Click here for additional data file.
Authors: B J Mowry; P A Holmans; A E Pulver; P V Gejman; B Riley; N M Williams; C Laurent; S G Schwab; D B Wildenauer; S Bauché; M J Owen; B Wormley; A R Sanders; G Nestadt; K Y Liang; J Duan; R Ribble; N Norton; S Soubigou; W Maier; K R Ewen-White; N DeMarchi; B Carpenter; D Walsh; H Williams; M Jay; M Albus; D A Nertney; G Papadimitriou; A O'Neill; M C O'Donovan; J-F Deleuze; F B Lerer; D Dikeos; K S Kendler; J Mallet; J M Silverman; R R Crowe; D F Levinson Journal: Mol Psychiatry Date: 2004-08 Impact factor: 15.992
Authors: Nicole C Allen; Sachin Bagade; Matthew B McQueen; John P A Ioannidis; Fotini K Kavvoura; Muin J Khoury; Rudolph E Tanzi; Lars Bertram Journal: Nat Genet Date: 2008-07 Impact factor: 38.330
Authors: P A Thomson; N R Wray; J K Millar; K L Evans; S Le Hellard; A Condie; W J Muir; D H R Blackwood; D J Porteous Journal: Mol Psychiatry Date: 2005-07 Impact factor: 15.992
Authors: N C Andreasen; M A Wilcox; B-C Ho; E Epping; S Ziebell; E Zeien; B Weiss; T Wassink Journal: Mol Psychiatry Date: 2011-08-30 Impact factor: 15.992
Authors: Fernando J Martinez; Gabriel A Pratt; Eric L Van Nostrand; Ranjan Batra; Stephanie C Huelga; Katannya Kapeli; Peter Freese; Seung J Chun; Karen Ling; Chelsea Gelboin-Burkhart; Layla Fijany; Harrison C Wang; Julia K Nussbacher; Sara M Broski; Hong Joo Kim; Rea Lardelli; Balaji Sundararaman; John P Donohue; Ashkan Javaherian; Jens Lykke-Andersen; Steven Finkbeiner; C Frank Bennett; Manuel Ares; Christopher B Burge; J Paul Taylor; Frank Rigo; Gene W Yeo Journal: Neuron Date: 2016-10-20 Impact factor: 17.173