| Literature DB >> 33232274 |
Afsaneh Mohammadnejad1, Marianne Nygaard1, Shuxia Li1, Dongfeng Zhang2, Chunsheng Xu3, Weilong Li1, Jesper Lund1, Lene Christiansen1,4, Jan Baumbach5,6, Kaare Christensen1,7, Jacob V B Hjelmborg1, Qihua Tan1,7.
Abstract
Despite a strong genetic background in cognitive function only a limited number of single nucleotide polymorphisms (SNPs) have been found in genome-wide association studies (GWASs). We hypothesize that this is partially due to mis-specified modeling concerning phenotype distribution as well as the relationship between SNP dosage and the level of the phenotype. To overcome these issues, we introduced an assumption-free method based on generalized correlation coefficient (GCC) in a GWAS of cognitive function in Danish and Chinese twins to compare its performance with traditional linear models. The GCC-based GWAS identified two significant SNPs in Danish samples (rs71419535, p = 1.47e-08; rs905838, p = 1.69e-08) and two significant SNPs in Chinese samples (rs2292999, p = 9.27e-10; rs17019635, p = 2.50e-09). In contrast, linear models failed to detect any genome-wide significant SNPs. The number of top significant genes overlapping between the two samples in the GCC-based GWAS was higher than when applying linear models. The GCC model identified significant genetic variants missed by conventional linear models, with more replicated genes and biological pathways related to cognitive function. Moreover, the GCC-based GWAS was robust in handling correlated samples like twin pairs. GCC is a useful statistical method for GWAS that complements traditional linear models for capturing genetic effects beyond the additive assumption.Entities:
Keywords: GWAS; cognition; generalized correlation coefficient; twins
Mesh:
Year: 2020 PMID: 33232274 PMCID: PMC7746382 DOI: 10.18632/aging.104198
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Descriptive statistics of the 900 dizygotic twins and 272 single monozygotic twins included in the Danish GWAS and the 278 dizygotic twins included in the Chinese GWAS.
| Danish sample | 611 | 561 | 1172 |
| Mean of age± sd (min, max) | 66.51±6.05(56.4,79.88) | 66.12±5.93(55.94,80.21) | 66.32±6.0(55.94,80.21) |
| Mean of cognitive score± sd (min, max) | 45.03±9.83(11.68,84.93) | 46.74±9.94(21.16,83.69) | 45.86±9.91(11.68, 84.93) |
| Chinese sample | 141 | 137 | 278 |
| Mean of age± sd (min, max) | 51.04±7.04(41,70) | 51.18±7.04(40,70) | 51.11±7.03(40,70) |
| Mean of cognitive score± sd (min, max) | 20.59±4.65(5,30) | 21.44±4.77(5,29) | 21.01±4.72(5,30) |
Figure 1Circos plots indicating genes from genome-wide significant SNPs on chromosomes 2 (A), 5 (B) in Danish sample and 3 (C), 4 (D) in Chinese sample based on GCC model. The blue region shows the genomic risk region. Genes mapped by chromatin interaction, eQTL and both are displayed in orange, green and red respectively. The most outer layer shows a Manhattan plot only for SNPs with p < 0.05 and SNPs are colored in red based on linkage disequilibrium (LD) patterns with the lead SNPs.
Figure 2The genotype-specific density distribution for top 3 genotyped SNPs in Danish (A) and Chinese (B) samples from GCC model. The x-axis shows the SNP genotypes 0, 1 and 2 and y-axis shows cognitive function phenotype.
Figure 3QQ plot comparing the performance of GCC, kinship and LME models in Danish (A) and Chinese (B) GWAS data. The left QQ plot is from Danish sample and the right QQ plot is from Chinese sample. In each plot, x-axis is the expected p-value and y-axis is the observed p-value from the GWAS.
Figure 4Scatter plot comparing the performance of SNPs in linear model to the GCC model in both Danish (A) and Chinese (B) samples. Th x-axis and y-axis show -log10(p-value) from Kinship and GCC models respectively.
Figure 5The Venn diagrams showing the number of overlapped genes with p < 0.05 among GCC and Kinship models from both samples in left plot (A) and GCC and LME models in the right plot (B). The total number of genes included are GCC-DA (GCC in Danish data): 1115, GCC-CH (GCC in Chinese data): 1024, Kinship-DA (Kinship in Danish data): 1046, Kinship-CH (Kinship in Chinese data): 1069, LME-DA (LME in Danish data): 1034 and LME-CH (LME in Chinese data): 1014.
Significant KEGG pathways (FDR< 0.05) overlapping between GCC, Kinship model and LME model GWAS results from both Danish and Chinese samples.
| GCC-DA-CH* | KEGG_PATHWAYS_IN_CANCER | Pathways in cancer | 1.08e-12 | 2.01e-10 |
| GCC-DA-CH | KEGG_AXON_GUIDANCE | Axon guidance | 1.16e-10 | 1.08e-08 |
| GCC-DA-CH | KEGG_ARRHYTHMOGENIC_RIGHT_VENTRICULAR_CARDIOMYOPATHY_ARVC | Arrhythmogenic right ventricular cardiomyopathy (ARVC) | 2.27e-09 | 1.41e-07 |
| GCC-DA-CH | KEGG_VASCULAR_SMOOTH_MUSCLE_CONTRACTION | Vascular smooth muscle contraction | 4.67e-09 | 2.17e-07 |
| GCC-DA-CH | KEGG_CALCIUM_SIGNALING_PATHWAY | Calcium signaling pathway | 8.07e-09 | 3e-07 |
| GCC-DA-CH | KEGG_FOCAL_ADHESION | Focal adhesion | 1.51e-08 | 4.67e-07 |
| GCC-DA-CH | KEGG_MAPK_SIGNALING_PATHWAY | MAPK signaling pathway | 2.35e-07 | 6.26e-06 |
| GCC-DA-CH | KEGG_HYPERTROPHIC_CARDIOMYOPATHY_HCM | Hypertrophic cardiomyopathy (HCM) | 6.33e-07 | 1.47e-05 |
| GCC-DA-CH | KEGG_DILATED_CARDIOMYOPATHY | Dilated cardiomyopathy | 1.64e-06 | 3.04e-05 |
| GCC-DA-CH | KEGG_LONG_TERM_DEPRESSION | Long-term depression | 4.43e-06 | 7.5e-05 |
| GCC-DA-CH | KEGG_GNRH_SIGNALING_PATHWAY | GnRH signaling pathway | 6.09e-06 | 9.44e-05 |
| GCC-DA-CH | KEGG_ADHERENS_JUNCTION | Adherens junction | 6.75e-06 | 9.65e-05 |
| GCC-DA-CH | KEGG_BASAL_CELL_CARCINOMA | Basal cell carcinoma | 2.34e-05 | 0.00029 |
| GCC-DA-CH | KEGG_HEDGEHOG_SIGNALING_PATHWAY | Hedgehog signaling pathway | 2.72e-05 | 0.000294 |
| GCC-DA-CH | KEGG_REGULATION_OF_ACTIN_CYTOSKELETON | Regulation of actin cytoskeleton | 4.39e-05 | 0.000409 |
| GCC-DA-CH | KEGG_GAP_JUNCTION | Gap junction | 5.08e-05 | 0.00045 |
| GCC-DA-CH | KEGG_PHOSPHATIDYLINOSITOL_SIGNALING_SYSTEM | Phosphatidylinositol signaling system | 5.87e-05 | 0.000496 |
| GCC-DA-CH | KEGG_CELL_ADHESION_MOLECULES_CAMS | Cell adhesion molecules (CAMs) | 0.000116 | 0.000896 |
| GCC-DA-CH | KEGG_ECM_RECEPTOR_INTERACTION | ECM-receptor interaction | 0.000138 | 0.000989 |
| GCC-DA-CH | KEGG_TYPE_II_DIABETES_MELLITUS | Type II diabetes mellitus | 0.000347 | 0.00239 |
| GCC-DA-CH | KEGG_MELANOGENESIS | Melanogenesis | 0.000624 | 0.00387 |
| Kin-LME-DA-CH | KEGG_FOCAL_ADHESION | Focal adhesion | 0.000777 | 0.0361 |
| Kin-LME-DA-CH** | KEGG_WNT_SIGNALING_PATHWAY | Wnt signaling pathway | 0.001980 | 0.0462 |
| Kin-LME-DA-CH | KEGG_T_CELL_RECEPTOR_SIGNALING_PATHWAY | T cell receptor signaling pathway | 0.002090 | 0.0462 |
*GCC-DA-CH: Pathways overlapping in GCC GWAS results from both Danish (DA) and Chinese (CH) samples.
**Kin-LME-DA-CH: Pathways overlapping in Kinship model and LME model GWAS results from both Danish and Chinese samples.