| Literature DB >> 36153334 |
In-Hee Lee1, Ekaterina Koelliker2, Sek Won Kong3,4.
Abstract
Autism spectrum disorder (ASD) represents a heterogeneous group of neurodevelopmental disorders and is largely attributable to genetic risk factors. Phenotypic and genetic heterogeneity of ASD have been well-recognized; however, genetic substrates for endophenotypes that constitute phenotypic heterogeneity are not yet known. In the present study, we compiled data from the Autism Genetic Resource Exchange, which contains the demographic and detailed phenotype information of 11,961 individuals. Notably, the whole-genome sequencing data available from MSSNG and iHART for 3833 individuals in this dataset was used to perform an endophenotype-wide association study. Using a linear mixed model, genome-wide association analyses were performed for 29 endophenotype scores and 0.58 million common variants with variant allele frequency ≥ 5%. We discovered significant associations between 9 genetic variants and 6 endophenotype scores comprising neurocognitive development and severity scores for core symptoms of ASD at a significance threshold of p < 5 × 10-7. Of note, the Stereotyped Behaviors and Restricted Interests total score in Autism Diagnostic Observation Schedule Module 3 was significantly associated with multiple variants in the VPS13B gene, a causal gene for Cohen syndrome and a candidate gene for syndromic ASD. Our findings yielded loci with small effect sizes due to the moderate sample size and, thus, require validation in another cohort. Nonetheless, our endophenotype-wide association analysis extends previous candidate gene discovery in the context of genotype and endophenotype association. As a result, these candidate genes may be responsible for specific traits that constitute core symptoms and neurocognitive function of ASD rather than the disorder itself.Entities:
Mesh:
Year: 2022 PMID: 36153334 PMCID: PMC9509384 DOI: 10.1038/s41398-022-02179-3
Source DB: PubMed Journal: Transl Psychiatry ISSN: 2158-3188 Impact factor: 7.989
Fig. 1Correlation structure among 29 endophenotype scores and exploratory factor analysis.
A Two-dimensional representation of 29 endophenotype measurements with multidimensional scaling using 93 individuals with all scores. B A pairwise correlation map between endophenotype scores and demographic variables. The color and size of each circle represent the Spearman correlation coefficient between variable pairs. C The four factors derived from exploratory factor analysis of endophenotype scores, age, and head circumference. The numbers on the arrow represent the correlation between latent variables or between latent variable and endophenotype scores.
Common genetic variants associated with severity of core symptoms of ASD and neurocognitive measurements.
| Phenotype Measures1 | Name | Locus2 | EA/OA3 | Genes | All individuals | European individuals | ||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Beta (s.e.) | FDR | Beta (s.e.) | FDR | |||||||||
| ADI-R | Social | 12:123737933 | T/G | 1.93 (0.374) | 2.30 × 10−7 | 0.061 | 1881 | 2.11 (0.408) | 2.40 × 10−7 | 0.13 | 1239 | |
| 12:123852510 | A/G | 1.89 (0.366) | 2.40 × 10−7 | 0.061 | NS | |||||||
| ADOS (Module 1) | Play | 3:194058802 | CATTTT/C | −0.82 (0.160 | 2.60 × 10−7 | 0.11 | 522 | NS | 311 | |||
| ADOS (Module 3) | Communication | 7:73420195 | T/C | 0.91 (0.176) | 2.50 × 10−7 | 0.12 | 600 | 0.96 (0.190) | 4.50 × 10−7 | 0.25 | 440 | |
| Stereotyped Behaviors and Restricted Interests | 8:100650223 | A/G | 0.52 (0.102) | 3.30 × 10−7 | 0.070 | 600 | 0.66 (0.121) | 4.50 × 10−8 | 0.017 | 440 | ||
| 8:100717925 | G/T | 0.57 (0.109) | 1.80 × 10−7 | 0.070 | 0.71 (0.130) | 4.50 × 10−8 | 0.017 | |||||
| 9:74893024 | A/G | −0.64 (0.126) | 2.90 × 10−7 | 0.070 | NS | |||||||
| Total score | 3:118637716 | A/C | 2.76 (0.546) | 4.30 × 10−7 | 0.20 | 598 | NS | 438 | ||||
| RPCM | Total score | 4:97178781 | A/G | 1.8 (0.350) | 2.50 × 10−7 | 0.24 | 1051 | NS | 697 | |||
1ADI-R Autism Diagnostic Interview-Revised, ADOS Autism Diagnostic Observation Schedule, RBS Repetitive Behavior Scale-Revised, RPCM Raven’s Progressive Colored Matrices.
2All genomic positions are given in Genome Reference Consortium Human Build 37 (GRCh37).
3EA Effect allele, OA Other allele, s.e.: standard error, FDR false discovery rate estimated using Benjamini-Hochberg procedure, NS: not significant at p < 5 × 10−7.
Fig. 2Overview of genomic loci associated with endophenotypic scores.
Genome-wide association analysis with each of endophenotype scores highlights significant loci and genes. Horizontal axis indicates genomic position from chromosome 1 to chromosome 22 and each row in vertical axis is organized by test instruments and phenotypic scores. The phenotype scores with significantly associated loci are (top to bottom): Social score from ADI-R, Play score from ADOS Module 1, 3 scores from ADOS Module 3 (Communication, Stereotyped Behaviors and Restricted Interests, and Total scores) and RPCM Total score. Circles indicate genomic loci with p < 5 × 10−7, where the bigger the size the smaller the nominal p-value as indicated in the legend below x-axis. The genes that overlap with or in 250 kbps flanking region of each significant genomic loci are displayed next to the corresponding circles and significant loci in intergenic regions are displayed without an associated gene symbol.
Fig. 3Association between ADOS Module 3 stereotyped behaviors and restricted interests total score and variants in the VPS13B gene.
A The Manhattan plot of the quantitative locus analysis (QTL) results. The top-significant variants with p < 5 × 10−7 in each chromosome are labeled by their genomic loci as chromosome:base-position. The horizontal line represents the threshold of p = 5 × 10−7. The y-axis represents p-value in -log10 scale for genome-wide association test with SBRIs score based on linear mixed model using BOLT-LMM software. B A quantile-quantile plot and the genomic inflation factor (λgc) estimated for genome-wide association analysis (GWAS) with ADOS Module 3 Stereotyped Behaviors and Restricted Interests score. No significant inflation is found for the association test statistics (λgc = 1.0114). C Regional plot showing a significant quantitative trait locus around the VPS13B gene (located in chromosome 8:100025494–100889814). The horizontal dotted line represents the genome-wide significance level (p = 5 × 10−8). Each dot represents variant tested for association with its size proportional with its -log10(p-value). The yellow dots show results using all available individuals (N = 600), while the green dots with European individuals only (N = 440). The line graph shows the recombination rates (cM/Mb). D Distribution of SBRIs scores by the status of lead variant (8:100717925 T > G, or rs2510202) in the VPS13B gene. Individuals carrying variant allele (G-allele) show higher SBRIs score in ADOS Module 3. Also, individuals with rare and high impact variants in the VPS13B gene (brown filled dots) do not present extreme SBRIs scores than non-carriers (green circle).