Literature DB >> 28533516

ASD restricted and repetitive behaviors associated at 17q21.33: genes prioritized by expression in fetal brains.

R M Cantor1,2, L Navarro1, H Won3, R L Walker3, J K Lowe3, D H Geschwind1,2,3.   

Abstract

Autism spectrum disorder (ASD) is a behaviorally defined condition that manifests in infancy or early childhood as deficits in communication skills and social interactions. Often, restricted and repetitive behaviors (RRBs) accompany this disorder. ASD is polygenic and genetically complex, so we hypothesized that focusing analyses on intermediate core component phenotypes, such as RRBs, can reduce genetic heterogeneity and improve statistical power. Applying this approach, we mined Caucasian genome-wide association studies (GWAS) data from two of the largest ASD family cohorts, the Autism Genetics Resource Exchange and Autism Genome Project (AGP). Of the 12 RRBs measured by the Autism Diagnostic Interview-Revised, seven were found to be significantly familial and substantially variable, and hence, were tested for genome-wide association in 3104 ASD-affected children from 2045 families. Using a stringent significance threshold (P<7.1 × 10-9), GWAS in the AGP revealed an association between 'the degree of the repetitive use of objects or interest in parts of objects' and rs2898883 (P<6.8 × 10-9), which resides within the sixth intron of PHB. To identify the candidate target genes of the associated single-nucleotide polymorphisms at that locus, we applied chromosome conformation studies in developing human brains and implicated three additional genes: SLC35B1, CALCOCO2 and DLX3. Gene expression, brain imaging and fetal brain expression quantitative trait locus studies prioritize SLC35B1 and PHB. These analyses indicate that GWAS of single heritable features of genetically complex disorders followed by chromosome conformation studies in relevant tissues can be successful in revealing novel risk genes for single core features of ASD.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28533516      PMCID: PMC5700871          DOI: 10.1038/mp.2017.114

Source DB:  PubMed          Journal:  Mol Psychiatry        ISSN: 1359-4184            Impact factor:   15.992


Introduction

Autism Spectrum Disorders (ASD), first described by Kanner (1) in 1943, are lifelong neurobehavioral syndromes identified by the key features of marked deficits in communication skills and social interactions that are accompanied by restricted interests and repetitive behaviors (RRBs), with an onset prior to the age of three (2). Diagnoses are made by an expert's opinion of the child's behavior, using instruments such as the Autism Diagnostic Interview–Revised (ADI-R) (3). Currently, the estimate of ASD incidence is 1 in 68, with a ratio of affected males to females of 4:1 (4), and an incidence in males of 1 in 42, making it compelling to identify the etiologies of this disorder. Although genetics is currently central to investigations of ASD etiologies, its importance was not immediately clear. Heritability analyses to establish a genetic component began in the 1970s with the availability of appropriate ASD study samples to estimate and compare the concordance rates in monozygotic (MZ) versus dizygotic (DZ) twin pairs. Although previous heritability estimates were as high as 90% (5), implicating the etiological impact of genes almost exclusively, current heritability estimates of approximately 50%- 60% (6) indicate that both genes and environmental factors, and possibly their interactions, are important. Not surprisingly, given its complexity as a phenotype and our emerging knowledge of its genetic heterogeneity, the specific patterns of genetic and environmental risk factors that lead to ASDs have not been easy to identify (7). A salient outcome of a decade of intensive genetic analysis of ASD has been a realization that the genetics of this disorder is extremely complex, and that as many as 1 000 genes, in various combinations with each other and with environmental factors, are likely to be involved (8). One powerful approach to reducing genetic complexity is to focus on identifying the genes that predispose to a single phenotype related to ASD. Such phenotypes may have less complex genetic etiologies than the diagnosis of ASD itself (9). Successful studies will reveal genes predisposing to the individual phenotypes, and possibly provide insight into the etiologies of ASD. This approach has been successful in studying the genetics of a number of ASD related phenotypes. In early studies of the Autism Genetics Resource Exchange (AGRE) sample (10), linkage analyses of a single component of ASD, language delay, identified a locus on chromosome 7q as harboring one or more risk genes for that feature (11). In follow-up association analyses, Contactin Associated Protein 2 (CNTNAP2) was identified (12) as a risk gene in that 7q region. In another study of the AGRE sample, linkage to deficits in nonverbal communication was found in four chromosome regions (13), and follow-up association analyses at 1p13-q12 revealed Nerve Growth Factor (NGF) as a risk gene for these deficits (14). More recently, using the expanded AGRE sample, a quantitative assessment of deficits in social behavior in those with ASD (the Social Responsiveness Scale (SRS)) revealed two linked loci on chromosome 8 (8p21.3 and 8q24.22) (15). A very recent study shows, as predicted, (16) that genome-wide risk loci contributing to variation in core ASD features of social cognition and adaptive functioning also contribute to variation in the same traits in the general population (17). This supports the notion that common variation contributes to individual core features of ASD. The analyses reported here use GWAS of common variants to find risk genes for RRBs. Investigations into the genetics of RRBs have been less aggressive than genetic studies of ASD. However, similar to the study designs used to investigate the broader ASD diagnosis, early genetic studies of RRBs examined familiality (18, 19) genome-wide linkage (18), and the association of candidate genes (19). The aggregate of these genetic studies of RRBs, along with studies in model organisms, implicate dopaminergic, glutamatergic and serotonergic genes, as reviewed by Lewis and Kim (20). Factor (21) and principal components (22) analyses of the ADI-R items assessing RRBs in those with ASD support the hypothesis that RRBs do not reflect a single underlying trait. These analyses of the correlation structure of the ADI-R RRB items consistently reveal that there are likely to be two major independent factors contributing to RRBs. The first represents repetitive sensory motor actions and the second represents insistence on sameness. Previous linkage analysis of the two RRB factors in large pedigrees implicate chromosome regions 2q37 and 15q13-14 (23). Recently, a study was conducted in the AGRE (10) sample (24) where a GWAS of the two RRB principal components, were analyzed to identify risk loci. The Genome-wide Efficient Mixed Model Analysis (GEMMA) software (25) that conducts a multivariate GWAS was applied to the bivariate principal components. A standard p<5×10-8 level of significance resulted in no significant associations; however, the authors reported 8p21.2-8p21.1 as harboring the top signal in the AGRE sample. The Simons Simplex Collection (SSC) (26) was used for replication, but this was not achieved. One might expect this, as the SSC is designed primarily to address the role of de novo variants in ASD, because the families used to ascertain the parent/affected child trios do not have multiple affected individuals. That is, the ascertainment scheme of the SSC depletes the supply of inherited risk alleles, making this sample statistically underpowered to identify familial inherited contributions. Here, we report GWAS analyses to identify RRB risk genes using existing ASD GWAS data. We made five choices in the study design in order to increase statistical power. 1)We studied RRBs, hypothesizing that because they exhibit less phenotypic complexity than ASD, they are more likely to have reduced genetic complexity and greater locus specific heritabilities. 2) We screened the RRBs for those with significant familiality and substantial variability. 3) To find common inherited risk alleles, we analyzed the AGRE and AGP study samples. 4) To reduce allelic and locus heterogeneity, we selected Caucasian AGRE and AGP sub-samples because it was the most frequent ethnicity in both samples. 5) For the GWAS analysis, we used a variance components analytic approach that corrects for the non-independence of siblings and exhibits superior statistical power over family-based approaches. Some of our study design choices may not appear straightforward; our reasoning follows. Studying RRBs in those with ASD promises to reduce phenotypic heterogeneity, because, although the composite diagnosis of ASD has a high heritability, we hypothesize that the individual RRBs are less genetically complex. We studied the individual RRBs rather than the two factors that capture their phenotypic correlations because they are likely to be less complex and because significant phenotypic correlations do not necessarily reflect the presence of genotypic correlations. We began by screening the RRBs for those most likely to be heritable and have adequate variability. Although ASD heritability is 50-60%, RRB heritabilities are not limited by this value. Significant heritability is an essential feature of a genetic trait, and we make the most precise estimates by comparing the trait correlations or concordance rates of MZ and DZ twin pairs. Since the AGRE sample does not contain an adequate number of MZ and DZ pairs, we use the approximately 1500 phenotyped sibling pairs in this sample to estimate RRB familialities with their Spearman correlations. Because the sample is composed of siblings, these estimates reflect both genetic and environmental contributions, making significant familiality necessary, but not sufficient, for significant heritability. An upper bound of heritability is twice the correlation. We also screened the RRBs for sufficient variability in order to achieve adequate statistical power for the GWAS. We dichotomized the response for each item using the criterion that each of the two groups contain between 40 and 60% of the individuals. We reasoned that the RRBs are continuous traits that are coded as ordinal in the ADI-R and could be re-coded as binary without compromising their genetic interpretation. Ethnicities were restricted to only those individuals who are Caucasian, because those of Asian and African descent currently comprise only 15 and 10 percent of the AGRE and AGP samples, respectively. Samples from different ethnicities are very likely to introduce allelic and locus heterogeneity, a concern not addressed adequately using standard GWAS approaches. We reasoned that the potential increase in power achieved by including small samples from other ethnic groups would be compromised by the likely introduction of locus and allelic heterogeneity. We chose a statistically powerful variance components approach to test RRB association in the children with ASD. Although the AGRE and AGP samples include the parents of the ASD affected children, transmission disequilibrium tests that assess allelic transmission to affected offspring, are much less powerful, as reported by Eu-Ahsunthornwattana and colleagues (27). Their power analyses indicate that, in pedigrees, the FBAT family based association test has 2% percent of the power consistently exhibited by the variance component approaches, using a standard p-value of 5×10-8. We analyzed seven familial and variable RRBs that and conducted the GWAS in the AGRE and AGP samples separately. Although this design required us to apply a very stringent level of significance to account for multiple testing, possibly decreasing our statistical power, we hypothesized that the selected study samples and filtered traits would compensate for this stringent criterion. Given the recent development of molecular technologies and bioinformatics databases, we reasoned that it is possible, straightforward and compelling to use these resources to interpret any SNP/trait associations resulting from a GWAS. We were fortunate to find significant SNP associations, and extended these to identify the genes most likely to be interacting with them. This was achieved using an analytic approach to detect the most ‘credible’ SNPs in the associated chromosome region, followed by the perusal of a database designed to detecting SNP/gene interactions using chromosome interaction studies of fetal brains. Prioritization of the target genes derived from studies of 1) gene expression in RRB relevant brain regions during development, 2) SNP associations with RRB relevant brain regions, and 3) expression quantitative trait locus (eQTL) studies in fetal brains.

Methods

Overview of the Study

This study was designed to identify genes predisposing to RRBs in individuals with ASD by analyzing GWAS from the well-characterized AGRE (10) and AGP (28) samples. We conducted GWAS in the uncombined study samples of Caucasian AGRE and AGP families testing the individual familial and variable ADI-R RRBs for association with genotyped and imputed common SNPs. We present descriptions of the AGRE and AGP samples, SNP genotyping, SNP imputation and selection of Caucasian families in Supplementary Methods. SNPs in the associated chromosome region was tested to identify the ‘credible SNPs’ and these were queried in our database of Hi-C fetal brain SNP-gene interactions (29) to identify candidate risk genes. These genes were then prioritized using bioinformatics databases reporting gene expression in developing brains, brain region volume associations with SNPs, and eQTL studies of fetal brains.

Identifying Familial and Variable RRBs

The ADI-R is an interview interrogating caregivers of potential ASD cases regarding the defining features of ASD and providing the information used to make an ASD diagnosis in the AGRE and AGP samples. Data from these interviews are stored in the AGRE and AGP databases, and items 67-78 measure the degrees of severity of the RRBs. Responses are divided into three ordinal categories, ranging from not exhibiting the RRB to the RRB being extremely disruptive. It also asks whether the individual currently has the trait, or has ever exhibited the trait, and since we wished to capture any manifestations of each RRB, we scored the RRBs according to the degree of severity in the category of ‘ever’. We estimated RRB familiality with Spearman correlations in the affected siblings. We tested the twelve ADI-R RRBs for non-zero Spearman sibling correlations with a 0.05/12 significance threshold, which is approximately p < 0.004, and omitted the RRBs with non-significant correlations. Seven RRBs queried by ADI-R questions 68 - 72, 76 and 78, met these criteria. To screen for adequate variability, the dichotomy was set between 0 and the more severe categories of 1,2 and 3, however ror item 71, those receiving 0 and 1 were in the first category, and for item 69, those receiving a score of 0,1, or 2 were in the first category. We summarize these significantly familial and adequately variable RRBs in Table 1.
Table 1

Familial and Variable RRBs

RRB (ADI-R Item Number)Number of Sibling PairsSibling Correlation, Maximum Heritability, (p-value)% of Individuals in Category 1
Circumscribed interests (68)12870.29, .58 (<.0001)54
Repetitive use of objects or interest in parts of objects (69)13820.13, .36 (<.0001)40
Compulsions/rituals (70)14120.15, .30 (<.0001)45
Unusual sensory interests (71)14140.10, .20 (0.0001)49
Undue general sensitivity to noise (72)13610.14, .28 (<.0001)56
Unusual attachments to objects (76)11960.15, .30 (<.0001)40
Other complex mannerisms or stereotyped body movements (78)14130.16, .32 (<.0001)58

GWAS to Identify SNPs Associated with RRBs

The GWAS were conducted separately in the AGRE (1 996 ASD affected children from 1 053 families) and AGP (1 108 ASD affected children from 1 092 families) samples for the seven RRBs. To increase genome-wide SNP coverage, imputation was conducted, providing 5 755 879 AGRE and 3 972 813 AGP SNPs. We conducted the association analyses using a linear mixed model approach that captures structure within the data that violates the assumption of independence. That is, the assumption of standard case/control GWAS using Fisher's Exact Test or a logistic regression analysis is that all individuals in the study are unrelated and therefore independent. Data from multiplex families analyzed here introduce a clear violation of this assumption. We selected the Efficient Mixed-Model Association eXpedited (EMMAX) (30) software for association testing. Similar to other linear mixed model software, EMMAX accounts for the structure within the study sample by modeling that structure using the variance components term in the linear mixed model. EMMAX estimates the relatedness matrix empirically using the SNP data. By assuming that the phenotype under analysis is polygenic, with a large number of alleles each having a small effect, the computational algorithm avoids time-consuming repetitive variance components estimation procedures. EMMAX association tests the SNP genotypes with either quantitative or binary phenotypes. Type 1 statistical errors are as expected, and power is consistent with other linear mixed model approaches. We set a conservative level of significance of 5×10-8/7, or 7.14×10-9, to account for the seven familial traits tested for association in the two independent samples.

Interpreting the Associated SNPs

We analyzed SNPs in the associated chromosome region to find interacting genes regulated by that locus and evaluated these SNPs and genes using data on gene expression in relevant brain structures at relevant periods of development, SNP associations with relevant brain structures volumes, and SNP/gene eQTL in fetal brains. First, the CAusal Variants Identification in Associated Regions (CAVIAR) software (31, 32) was applied to select a minimal set of putative causal SNPs (credible SNPs) in the associated chromosome region. By analyzing the association p-values and the linkage disequilibrium estimates among the tested SNPs at the associated locus and jointly modeling and conditioning on these p-values, the software generated a set of variants that includes all of those classified as causal with 95% probability. CAVIAR identified five credible SNPs for RRBs. These SNPs were classified as nonsense, missense, or splice site variants or residing in the gene promoter, and assigned to their target genes. For the remaining un-annotated SNPs, chromatin contact matrices from mid-gestation human developing brains were leveraged to find genes that physically interact with them and are predicted regulatory targets (29). We describe this SNP/gene interaction database in Supplementary Methods. We prioritized the ‘credible’ SNPs and their target genes using data from three sources. They are: 1) the Enhancing NeuroImaging Genetics Through Meta-Analysis (ENIGMA) database (30, 33) that associates SNPs and brain volumes, 2) the Allen Brain Atlas (36-38), which assesses gene expression levels in brain regions at key stages during fetal and childhood development, and 3) a database of expression quantitative trait loci (eQTL) in fetal brain tissue that is currently under development. The latter is based on gene expression and genotyped and imputed SNP data from 200 abortuses.

Results

We filtered 12 ADI-R RRBs for those with significant familiality and adequate variability. Table 1 presents the sibling correlation, upper bound of heritability and the degree of variability for the seven RRBs passing this filter. To illustrate, in the first line of Table 1, 1 287 sibling pairs have data assessing circumscribed interests, as queried by item 68 on the ADI-R. The significant sibling correlation of 0.29 provides a maximum heritability of 0.58, and the measure of variability where 54% of individuals fall in to the first category of the dichotomy. As reported in Table 2, these familial RRBs exhibit only moderate pairwise correlations, thus providing support for conducting GWAS separately on the individual RRBs. We conducted GWAS using a stringent Bonferroni correction for multiple testing and identified one locus on 17q21.33. ADI-R item 69, querying the degree of ‘repetitive use of objects or interest in parts of objects,’ is significantly associated with two SNPs in this region in the AGP sample. One genotyped SNP, rs2898883, meets the criterion for association (p < 6.79×10-9 with an empirical p < 4×10-9), and a second genotyped SNP in the region, rs7502499, provides additional support (p < 5.73×10-8 with an empirical p < 3.1×10-8). Between them, an imputed SNP, rs2233667, meets the criterion for significant association (p < 8.02×10-9 with an empirical p < 4×10-9). These SNPs are in moderate pairwise linkage disequilibrium and are located within the sixth intron of the Prohibitin gene (PHB).
Table 2

Correlations Among Familial and Variable RRB ADI-R Questions in the AGRE Sample

68697071727678
681.000.070.160.020.140.120.02
691.000.190.430.160.140.27
701.000.150.140.160.12
711.000.130.130.27
721.000.090.15
761.000.09
781.00
In the AGRE sample, there are no SNP/trait association tests meeting the 7.14×10-9 criterion for significance. The p-values for the three SNPs that have a significant association with ‘repetitive use of objects or interest in parts of objects’ in the AGP sample range between 0.49 and 0.51 for that RRB in AGRE. Meta-analyses result in p-values of 2.0 × 10-8 for rs2898883, 2.0 × 10-8 for rs2233667, and 6.7 × 10-7 for rs7502499, as reported in Supplementary Table 1. In early studies of the AGRE families, the 17q21 chromosome region was linked to ASD in a much smaller AGRE sample where all affected children were male, (34). That finding was replicated in an independent set of ‘male only’ AGRE families (35). Association analyses of the three significant SNPs at 17q21.33 and ADI-R item 69 in the families where all affected children are male in this current expanded AGRE sample results in p-values ranging between .78 and .92 As with almost all complex traits, none of the significantly associated SNPs is located in a protein-coding region of the genome. Identifying genes interacting with such SNPs has been a challenge, however, the availability of software and a database allowed us to address this. The CAVIAR software identified a set of ‘credible’ SNPs that has a greater than 95% likelihood of containing the causal SNPs. This analysis added two SNPs (chr17_47494782_D, p < 1.06×10-7 and rs71379361, p < 1.06×10-7) to the three identified by the GWAS. These five credible SNPs have minor allele frequencies ranging between 0.16 and 0.33 in Caucasians, indicating they are very common. Among the five, one (rs7502499) is in the PHB promoter. PHB encodes a mitochondrial scaffolding protein (36), and its role as a key regulator of neuronal survival is emerging (37, 38). The other four credible SNPs are in non-annotated non-coding regions. We therefore used the SNP/gene fetal brain interaction database to identify them. Hi-C interactions in the germinal zone, GZ, indicate that a genomic region that is 10kb in size containing two ‘credible’ SNPs (rs2898883 and rs2233667) interacts with Solute Carrier Family 35 Member B1 (SLC35B1), a gene that encodes a nucleotide sugar transporter whose function in neurodevelopment is unknown. Figure 2a, where rs2898883 shows a significant interaction with the 10kb bin containing SLC35B1, at a false discovery rate (FDR) < .01), illustrates this interaction. The bin that containing the other two credible SNPs (chr17_47494782_D and rs71379361) interacts with Calcium Binding and Coiled-Coil Domain 2 (CALCOCO2) in CP and Distal-Less Homeobox 3 (DLX3) in GZ, as illustrated in Figure 2b. CALCOCO2 encodes an autophagy receptor and is involved in the clearance of phosphorylated tau in Alzheimer's disease (39), while DLX3 encodes a homeodomain transcription factor. Notably, although the function of DLX3 in brain development remains to be studied, Distal-Less Homeobox 5 (DLX5) and Distal-Less Homeobox 6 (DLX6) are implicated in the development of GABAergic interneurons (40). Loss of function mutations in SLC35B1 and CALCOCO2 have been identified in ASD probands (41), while there are no published reports of DLX3 and PHB being associated with ASD or RRBs.
Figure 2

Representative interaction map for the ‘credible’ SNPs rs2898883 and rs2233667 (top) and chr17: 47494782: D and rs71379361 (bottom). Chromosome ideogram and genomic axis are across the top. Gene Model and possible target genes are in red. Horizontal axis is credible SNPs and their genomic coordinates. Genomic coordinates for credible SNPs and linkage disequilibrium (LD) region for the index SNP are indicated. Vertical axis is -log10(P-value) for the significance of the interaction between credible SNPs and each 10kb bin. Grey dotted line is for FDR=0.01. TAD is the topologically associating domain borders in CP and GZ.

Since ASD is a neurodevelopmental disorder often diagnosed before the age of three, and previous studies have demonstrated that ASD risk genes are expressed primarily during fetal development, genes with a prenatal-to-early-postnatal expression trajectory are excellent candidates, particularly within brain regions that are consistent with the development of RRBs. Thus, we examined gene expression trajectories over periods of development in different brain regions using the Allen Brain Atlas (42). Expression trajectories for the genes we identified are illustrated in Figure 3. SLC35B1 shows a very distinct regional expression pattern in the mediodorsal thalamic nucleus across development. CALCOCO2 displays a developmental stage-specific expression trajectory at mid-childhood, between ages 6 and 12 years. PHB shows a global expression pattern throughout cortical and subcortical regions except amygdala and cerebellum in childhood through adolescence (between ages 6 and 20 years) and a developmental expression trajectory in the mediodorsal thalamic nucleus. DLX3 expression was not detected in the brain (42).
Figure 3

Spatiotemporal maps depicting the expression trajectories of candidate genes. Expression is given across multiple brain regions (vertical axis) and developmental stages (horizontal axis) spanning from the prenatal to postnatal adult stage. Cortical regions include: DFC, dorsolateral prefrontal cortex; MFC, medial prefrontal cortex; OFC, orbital prefrontal cortex; VFC, ventrolateral prefrontal cortex; M1C, primary motor cortex; S1C, primary somatosensory cortex; V1C, primary visual cortex; A1C, primary auditory cortex; IPC, posterior inferior parietal cortex; ITC, inferior temporal cortex; STC, superior temporal cortex. Subcortical regions include CBC, cerebellum; AMY, amygdala; HIP, hippocampus; MD, mediodorsal nucleus of the thalamus; STR, striatum.

Most notably, both SLC35B1 and PHB are highly expressed in the thalamus, which receives the input from the basal ganglia and its frontal cortical circuitry, brain regions implicated in motor activity and repetitive behaviors (20, 43). Since alterations in thalamic volume have been observed in ASD patients (48-50), we evaluated whether the credible SNPs that interact with SLC35B1 (rs2898883 and rs2233667) or are located in the PHB promoter (rs7502499) are associated with the thalamic volume (33, 44) using the ENIGMA database. Notably, rs2898883 (p < 0.008) and rs7502499 (p < 0.01) are significantly associated with thalamic volume. Moreover, another SNP in strong linkage disequilibrium with rs2898883 (r2=0.75), rs4987082, is also associated with thalamic volume (p < 0.0005) and also shows a suggestive association with RRBs (p < 1.14×10-6), providing additional support for the credible SNPs and their interacting genes in the etiology of RRBs in those with ASD. We also examined our developing database of eQTL in fetal brains and were very encouraged to find that credible SNPs rs2898883 (p < 0.009) and rs2233667 (p < 0.01) are significant eQTL for their target gene SLC35B1. Table 3 presents the ‘credible’ associated SNPs, their interacting genes, and the evidence supporting these SNPs and genes from the Allen Brain Atlas, ENIGMA database and our prenatal eQTL database. We order the genes in this table according to their degree of prioritization, which is derived from known facts and supportive evidence. Clearly, SLC35B1, which exhibits an appropriate expression pattern in thalamus, and SNPs rs2898883 an rs233667, because both are its eQTL in fetal brains, with the latter also associated with thalamus volume, is the strongest.
Table 3

Credible SNPs for RRBs at 17q21, Candidate Target Genes and Spatiotemporal Expression pattern, Association with Subcortical Volume

Credible SNP (Candidate Gene eQTL)Candidate Gene (Function)Spatiotemporal Expression PatternSNP Association with Subcortical Volume
rs2898883 (P<0.009)SLC35B1 (Hi-C)Thalamus, prenatal to postnatal stagesThalamus, P=0.008
rs2233667 (P<0.01)*

rs7502499 (P>.05)PHB (Promoter)Thalamus, prenatal to postnatal stages Pan brain, childhood to adolescence (6-20 years)Thalamus, P=0.01

rs71379361 (P>.05)CALCOCO2 (Hi-C)Pan brain, childhood (6-12 years)Thalamus, P=0.01
chr17_47494782_D (*)DLX3 (Hi-C)**

data not available,

Discussion

ASD is a genetically complex disorder with etiologies that continue to be poorly understood. A recent report on its genetic architecture shows that common variation explains a substantial proportion of the risk (45); however, a decade of analyses reveals there may be as many as 1 000 predisposing genes. Our overarching hypothesis is that analyzing a single feature of ASD that reduces phenotypic heterogeneity will also reduce genetic heterogeneity, and our aim is to identify genome-wide significant RRB loci through genetic mapping, assigning their potential genes by leveraging chromatin interaction maps derived from the developing human brain. In mice, RRBs derive from three etiologies: 1) targeted insults to the CNS, 2) pharmacological agents, and 3) restricted environments and experience (46). Although these studies do not implicate any genes directly, they suggest that those genes involved in developing and maintaining cortical-basal ganglia circuitry in the fetal and childhood brain may be critical to the development of RRBs. Thus, genes that are highly expressed in brain regions in the development and maintenance of cortical-basal ganglia circuitry may be central to the development of RRBs in ASD. Support for these brain regions is also derived from brain morphology studies of RRBs in children with ASD. These studies provide evidence that RRBs may be associated with decreased volumes of the basal ganglia and thalamus. Phenotypic heterogeneity can be reduced in two ways. The first is to stratify individuals in the ASD study sample based on a selected single feature and conduct genetic analyses only in those exhibiting that feature. This approach is designed to capture ASD genes, as the ‘control’ sample will not have ASD. Although this approach reduces heterogeneity, it also can decrease the sample size, and is likely to reduce the statistical power to detect associated SNPs. However, sufficiently large genetic effects can compensate for the reduced sample size. The second approach, taken here, is to retain the full study sample and analyze an ASD related trait. Because the ‘cases’ and ‘controls’ both have ASD, analyses will capture variation of a single feature of ASD with possibly greater genetic homogeneity and larger effect sizes to identify genes contributing to that trait in those with ASD. The relative effect sizes of both approaches will depend upon the specific disorder and the trait under analysis, indicating that neither approach is better in all situations. In previous studies, we have used the second approach successfully to identify chromosome loci and genes for language delay, deficits in nonverbal communication, and deficits in social responsiveness in those with ASD. Here we examined the genetics of an additional core feature of ASD: RRBs. Applying this approach, we conducted individual GWAS of seven familial and variable RRBs. Using a conservative Bonferroni-corrected p-value, we detected association in the AGP sample at 17q21.33, with the lead SNP located in the sixth intron of PHB for the RRB described in the ADI-R as ‘repetitive use of objects or interest in parts of objects.’ The association is not replicated in the AGRE sample, and a meta-analysis of the lead SNP and the RRB ‘repetitive use of objects or interest in parts of objects’ in the AGRE and AGP samples results in a p-value of 2.0 × 10-8, which does not meet the 8 × 10-9 stringent p-value that corrects for the seven GWAS conducted initially. However, it does meet the 5 × 10-8 criterion set for a single GWAS. Interpreting the discrepancy between these two samples is not straightforward, and it illustrates a quandary faced by other investigators who analyze these ASD cohorts. Because these samples are small when compared to those ascertained for other complex disorders such as Schizophrenia and Coronary Artery Disease, replication, while welcomed, has not yet become an expectation. Small samples are more vulnerable to the effects of stochastic differences in their composition, and combined with genotypic and phenotypic heterogeneity of the traits and the small effect sizes with polygenetic disorders, we interpret this lack of replication as a challenge for future studies in larger samples. We also interpret our functional associations, as summarized below, as providing additional credibility to this association. Since the lead SNP in this analysis is not necessarily the causal variant, we obtained a set of five ‘credible’ causal variants, all of which reside in the non-coding region of the genome. Non-coding variants have been previously assigned to their nearest genes; however, interrogation of chromatin structure and eQTL studies have shown that this approach is often not correct. Here, we leveraged Hi-C data, which interrogates actual physical 3D chromatin contacts from the developing brain to functionally annotate common variants with their physically interacting genes, and detected three additional genes, each within 1 Mb of the ‘credible’ SNPs. The expression trajectories and imaging GWAS show that PHB and SLC35B1 are expressed in developing thalamus and influence thalamic volume in adults, prioritizing them among the four. SLC35B1 is further prioritized by our eQTL studies in fetal brains. In conclusion, these analyses 1) identify four novel genes predisposing those with ASD to exhibit a behavior involving a ‘repetitive use of objects or interest in parts of objects,’ 2) prioritize two of these genes, PHB and SLC35B1, and 3) illustrate that a reduction in phenotypic heterogeneity is a powerful approach to reduce genetic heterogeneity. Supplementary Table 1: Meta-Analysis of AGP Associated SNPs
  44 in total

1.  High density SNP association study of a major autism linkage region on chromosome 17.

Authors:  Jennifer L Stone; Barry Merriman; Rita M Cantor; Daniel H Geschwind; Stanley F Nelson
Journal:  Hum Mol Genet       Date:  2007-03-21       Impact factor: 6.150

Review 2.  Neurobiology of obsessive-compulsive disorder: insights into neural circuitry dysfunction through mouse genetics.

Authors:  Jonathan T Ting; Guoping Feng
Journal:  Curr Opin Neurobiol       Date:  2011-05-24       Impact factor: 6.627

3.  Genetic heritability and shared environmental factors among twin pairs with autism.

Authors:  Joachim Hallmayer; Sue Cleveland; Andrea Torres; Jennifer Phillips; Brianne Cohen; Tiffany Torigoe; Janet Miller; Angie Fedele; Jack Collins; Karen Smith; Linda Lotspeich; Lisa A Croen; Sally Ozonoff; Clara Lajonchere; Judith K Grether; Neil Risch
Journal:  Arch Gen Psychiatry       Date:  2011-07-04

4.  Prohibitin reduces mitochondrial free radical production and protects brain cells from different injury modalities.

Authors:  Ping Zhou; Liping Qian; Marilena D'Aurelio; Sunghee Cho; Gang Wang; Giovanni Manfredi; Virginia Pickel; Costantino Iadecola
Journal:  J Neurosci       Date:  2012-01-11       Impact factor: 6.167

5.  Subcategories of restricted and repetitive behaviors in children with autism spectrum disorders.

Authors:  Somer L Bishop; Vanessa Hus; Amie Duncan; Marisela Huerta; Katherine Gotham; Andrew Pickles; Abba Kreiger; Andreas Buja; Sabata Lund; Catherine Lord
Journal:  J Autism Dev Disord       Date:  2013-06

6.  EnigmaVis: online interactive visualization of genome-wide association studies of the Enhancing NeuroImaging Genetics through Meta-Analysis (ENIGMA) consortium.

Authors:  Nic M Novak; Jason L Stein; Sarah E Medland; Derrek P Hibar; Paul M Thompson; Arthur W Toga
Journal:  Twin Res Hum Genet       Date:  2012-06       Impact factor: 1.587

7.  A polygenic burden of rare disruptive mutations in schizophrenia.

Authors:  Shaun M Purcell; Jennifer L Moran; Menachem Fromer; Douglas Ruderfer; Nadia Solovieff; Panos Roussos; Colm O'Dushlaine; Kimberly Chambert; Sarah E Bergen; Anna Kähler; Laramie Duncan; Eli Stahl; Giulio Genovese; Esperanza Fernández; Mark O Collins; Noboru H Komiyama; Jyoti S Choudhary; Patrik K E Magnusson; Eric Banks; Khalid Shakir; Kiran Garimella; Tim Fennell; Mark DePristo; Seth G N Grant; Stephen J Haggarty; Stacey Gabriel; Edward M Scolnick; Eric S Lander; Christina M Hultman; Patrick F Sullivan; Steven A McCarroll; Pamela Sklar
Journal:  Nature       Date:  2014-01-22       Impact factor: 49.962

8.  Genome-wide efficient mixed-model analysis for association studies.

Authors:  Xiang Zhou; Matthew Stephens
Journal:  Nat Genet       Date:  2012-06-17       Impact factor: 38.330

9.  Loss of prohibitin membrane scaffolds impairs mitochondrial architecture and leads to tau hyperphosphorylation and neurodegeneration.

Authors:  Carsten Merkwirth; Paola Martinelli; Anne Korwitz; Michela Morbin; Hella S Brönneke; Sabine D Jordan; Elena I Rugarli; Thomas Langer
Journal:  PLoS Genet       Date:  2012-11-08       Impact factor: 5.917

10.  Synaptic, transcriptional and chromatin genes disrupted in autism.

Authors:  Silvia De Rubeis; Xin He; Arthur P Goldberg; Christopher S Poultney; Kaitlin Samocha; A Erucment Cicek; Yan Kou; Li Liu; Menachem Fromer; Susan Walker; Tarinder Singh; Lambertus Klei; Jack Kosmicki; Fu Shih-Chen; Branko Aleksic; Monica Biscaldi; Patrick F Bolton; Jessica M Brownfeld; Jinlu Cai; Nicholas G Campbell; Angel Carracedo; Maria H Chahrour; Andreas G Chiocchetti; Hilary Coon; Emily L Crawford; Sarah R Curran; Geraldine Dawson; Eftichia Duketis; Bridget A Fernandez; Louise Gallagher; Evan Geller; Stephen J Guter; R Sean Hill; Juliana Ionita-Laza; Patricia Jimenz Gonzalez; Helena Kilpinen; Sabine M Klauck; Alexander Kolevzon; Irene Lee; Irene Lei; Jing Lei; Terho Lehtimäki; Chiao-Feng Lin; Avi Ma'ayan; Christian R Marshall; Alison L McInnes; Benjamin Neale; Michael J Owen; Noriio Ozaki; Mara Parellada; Jeremy R Parr; Shaun Purcell; Kaija Puura; Deepthi Rajagopalan; Karola Rehnström; Abraham Reichenberg; Aniko Sabo; Michael Sachse; Stephan J Sanders; Chad Schafer; Martin Schulte-Rüther; David Skuse; Christine Stevens; Peter Szatmari; Kristiina Tammimies; Otto Valladares; Annette Voran; Wang Li-San; Lauren A Weiss; A Jeremy Willsey; Timothy W Yu; Ryan K C Yuen; Edwin H Cook; Christine M Freitag; Michael Gill; Christina M Hultman; Thomas Lehner; Aaarno Palotie; Gerard D Schellenberg; Pamela Sklar; Matthew W State; James S Sutcliffe; Christiopher A Walsh; Stephen W Scherer; Michael E Zwick; Jeffrey C Barett; David J Cutler; Kathryn Roeder; Bernie Devlin; Mark J Daly; Joseph D Buxbaum
Journal:  Nature       Date:  2014-10-29       Impact factor: 49.962

View more
  4 in total

1.  Target Genes of Autism Risk Loci in Brain Frontal Cortex.

Authors:  Yan Sun; Xueming Yao; Michael E March; Xinyi Meng; Junyi Li; Zhi Wei; Patrick M A Sleiman; Hakon Hakonarson; Qianghua Xia; Jin Li
Journal:  Front Genet       Date:  2019-08-09       Impact factor: 4.599

2.  Reliability and validity of the repetitive behavior scale-revised for young Chinese children with autism spectrum disorder in Jiangxi Province.

Authors:  Xiu Luo; Yaoyao Xiong; Mei Gu; Liyun Huang; Zhonghui Lu; Xia Zhong; Shipu Zou
Journal:  Front Pediatr       Date:  2022-09-08       Impact factor: 3.569

3.  Social and non-social autism symptoms and trait domains are genetically dissociable.

Authors:  David A Hinds; Thomas Bourgeron; Simon Baron-Cohen; Varun Warrier; Roberto Toro; Hyejung Won; Claire S Leblond; Freddy Cliquet; Richard Delorme; Ward De Witte; Janita Bralten; Bhismadev Chakrabarti; Anders D Børglum; Jakob Grove; Geert Poelmans
Journal:  Commun Biol       Date:  2019-09-03

Review 4.  Genetic contributions to autism spectrum disorder.

Authors:  A Havdahl; M Niarchou; A Starnawska; M Uddin; C van der Merwe; V Warrier
Journal:  Psychol Med       Date:  2021-02-26       Impact factor: 7.723

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.