| Literature DB >> 34493297 |
Yiliang Zhang1, Qiongshi Lu2,3,4, Yixuan Ye5, Kunling Huang3, Wei Liu5, Yuchang Wu2, Xiaoyuan Zhong2, Boyang Li1, Zhaolong Yu5, Brittany G Travers6,7, Donna M Werling7,8, James J Li7,9, Hongyu Zhao10,11,12.
Abstract
Local genetic correlation quantifies the genetic similarity of complex traits in specific genomic regions. However, accurate estimation of local genetic correlation remains challenging, due to linkage disequilibrium in local genomic regions and sample overlap across studies. We introduce SUPERGNOVA, a statistical framework to estimate local genetic correlations using summary statistics from genome-wide association studies. We demonstrate that SUPERGNOVA outperforms existing methods through simulations and analyses of 30 complex traits. In particular, we show that the positive yet paradoxical genetic correlation between autism spectrum disorder and cognitive performance could be explained by two etiologically distinct genetic signatures with bidirectional local genetic correlations.Entities:
Keywords: Autism spectrum disorder; Chromatin modifiers; Eigen decomposition; GWAS; Local genetic covariance
Mesh:
Year: 2021 PMID: 34493297 PMCID: PMC8422619 DOI: 10.1186/s13059-021-02478-w
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 17.906
Fig. 1SUPERGNOVA workflow. Details on the statistical framework are described in the “Methods” section. w denotes the diagonal elements of Σ, which are also the eigenvalues of each local LD matrix. Notation, ∘ in the last step indicates the element-wise product
Fig. 2Simulation results. A–C compare the type-I error and statistical power of SUPERGNOVA, GNOVA, and LDSC in global genetic covariance estimation. We use the proportion of p values that are less than 0.05 to estimate type I error or statistical power when true parameters are zero or nonzero, respectively. A Two GWASs were simulated on two non-overlapping datasets (set 1 and set 2). B GWASs were simulated on two datasets with a 50% sample overlap (set 1 and set 3). C Two GWASs were both simulated on the same dataset (set 1) with a 100% sample overlap. D–F compare SUPERGNOVA and ρ-HESS on local genetic covariance estimation using GWASs D without sample overlap (set 1 and set 2), E with a partial sample overlap (set 1 and set 3), and F with a complete sample overlap (set 1 only)
Fig. 3Global and local genetic correlations among 30 complex traits. A Estimates of global genetic correlations (upper triangle) and estimated proportions of correlated regions among 435 trait pairs (lower triangle). Asterisks in the upper triangle highlight significant genetic correlations after Bonferroni correction for 435 pairs. Asterisks in the lower triangle indicate at least one significantly correlated region between the traits after Bonferroni correction for all 1,006,072 regions in 435 trait pairs. We grouped traits with hierarchical clustering applied to global genetic correlations. We summarized detailed information about each trait, including abbreviations, in Additional file 3: Supplementary Table 1. B Global genetic covariance estimates were highly concordant with the sums of local genetic covariance. Each point represents a trait pair. Color and shape of each data point denote the significance status in global and local correlation analyses. C Volcano plot comparing the global genetic correlation and proportion of correlated local regions. Each point represents a trait pair. Color of each data point represents the significance and direction of global correlation. Trait pairs discussed in the main text are labeled in the plot
Fig. 4Bidirectional local genetic covariance between ASD and CP. A Regions with significant local genetic covariance among ADHD, ASD, and CP (FDR < 0.1). This plot uses bars to break down the Venn diagram of overlapped regions in different categories. The five categories shown in the lower panel are correlated regions of ADHD and CP (positive and negative), ASD and CP (positive and negative), and ASD and ADHD (positive only). We use different colors (red, blue, and gray) to annotate region categories of positive, negative, and mixed covariance directions. B LocusZoom plots for ASD and CP GWAS associations at the KMT2E locus. ASD and CP are negatively correlated in the highlighted region. C LocusZoom plots for ASD and CP at the POU3F2 locus. ASD and CP are positively correlated in the highlighted region. POU3F2 is 700 kb downstream of the GWAS association peak
Fig. 5Enrichment for gene sets in correlated regions between ASD and CP. Regions with opposite correlations between ASD and CP were enriched for different mechanistic pathways. Fold enrichment values are labeled next to each bar. The red dashed lines mark the p value cutoff of 0.05, and the black dashed lines denote the p value thresholds after Bonferroni correction (p = 2.8e−3)
Fig. 6Phenotypic heterogeneity of ASD probands with high PRS+ and PRS−. A Average IQ is computed for different groups defined by PRS. Each interval indicates standard error of the estimated mean. B PRS percentiles and IQ of probands above the 99% percentile of PRS−. PRS− was calculated using six negatively correlated regions between ASD and CP. The blue heatmap indicates the percentile of ASD PRS in each contributed region for each proband. The percentiles of PRS+ values are shown in the red boxes. IQ is shown as green bars. C PRS percentiles and IQ of probands above the 99% percentile of PRS+. PRS+ was calculated using 18 positively correlated regions between ASD and CP and the per-locus percentiles are shown in red. The percentiles of PRS+ are shown in blue and the green bars denote IQ