Literature DB >> 35907915

Multivariate genome-wide association study of depression, cognition, and memory phenotypes and validation analysis identify 12 cross-ethnic variants.

Jing Sun1,2, Weijing Wang1, Ronghui Zhang1, Haiping Duan3, Xiaocao Tian3, Chunsheng Xu3, Xue Li4, Dongfeng Zhang5.   

Abstract

To date, little is known about the pleiotropic genetic variants among depression, cognition, and memory. The current research aimed to identify the potential pleiotropic single nucleotide polymorphisms (SNPs), genes, and pathways of the three phenotypes by conducting a multivariate genome-wide association study and an additional pleiotropy analysis among Chinese individuals and further validate the top variants in the UK Biobank (UKB). In the discovery phase, the participants were 139 pairs of dizygotic twins from the Qingdao Twins Registry. The genome-wide efficient mixed-model analysis identified 164 SNPs reaching suggestive significance (P < 1 × 10-5). Among them, rs3967317 (P = 1.21 × 10-8) exceeded the genome-wide significance level (P < 5 × 10-8) and was also demonstrated to be associated with depression and memory in pleiotropy analysis, followed by rs9863698, rs3967316, and rs9261381 (P = 7.80 × 10-8-5.68 × 10-7), which were associated with all three phenotypes. After imputation, a total of 457 SNPs reached suggestive significance. The top SNP chr6:24597173 was located in the KIAA0319 gene, which had biased expression in brain tissues. Genes and pathways related to metabolism, immunity, and neuronal systems demonstrated nominal significance (P < 0.05) in gene-based and pathway enrichment analyses. In the validation phase, 12 of the abovementioned SNPs reached the nominal significance level (P < 0.05) in the UKB. Among them, three SNPs were located in the KIAA0319 gene, and four SNPs were identified as significant expression quantitative trait loci in brain tissues. These findings may provide evidence for pleiotropic variants among depression, cognition, and memory and clues for further exploring the shared genetic pathogenesis of depression with Alzheimer's disease.
© 2022. The Author(s).

Entities:  

Mesh:

Year:  2022        PMID: 35907915      PMCID: PMC9338946          DOI: 10.1038/s41398-022-02074-x

Source DB:  PubMed          Journal:  Transl Psychiatry        ISSN: 2158-3188            Impact factor:   7.989


Introduction

Depression and Alzheimer’s disease (AD) are two common mental disorders that pose a serious threat to the physical and mental health of the public, especially elderly persons [1, 2], resulting in a heavy burden of disease worldwide [3, 4]. An increasing number of studies support that depression and AD are comorbidities [5, 6]. Evidence has shown some common pathogenic mechanisms and pathological changes between depression and AD, such as chronic inflammation [7, 8], dysregulation of the hypothalamic-pituitary-adrenal axis [9], lower levels of norepinephrine and 5-hydroxytryptamine [10], and a smaller volume of the hippocampus [11], suggesting the possibility and biological rationality of the basis of comorbidity between depression and AD. In addition to environmental factors, genetic factors play an important etiologic role in both depression and AD. According to a meta-analysis of twin studies, the heritability of depression ranges from 31% to 42% [12]. As the most commonly used endophenotypes of AD in genetic studies [13, 14], the heritability of cognition and memory ranges from 47% to 59% and 36% to 47%, respectively, based on a large meta-analysis of twin and family studies [15]. Moreover, some evidence has suggested a shared genetic basis among depression, cognition, and memory. Pertinently, Franz et al. found that shared genetic effects could explain 77% of the correlation of early cognitive function with midlife depression in American male twins [16]. Another study in male twins showed that the genetic correlation of executive function with the genetic effects shared by persons with depression and anxiety was −0.44 [17]. The Colorado Adoption Project’s study revealed that the genetic correlations of memory with several dimensions of cognition ranged from 0.59 to 0.69 [18]. Additionally, our previous study utilizing multivariate twin models found that, among Qingdao twins, the genetic correlations were −0.31 for depression and cognition, −0.28 for depression and memory, and 0.69 for memory and cognition, respectively [19]. However, to date, little is known about the shared genetic variants among depression, cognition, and memory. A depression genome-wide meta-analysis identified 269 genes associated with depression [20]. Interestingly, 70 of the 269 depression-related genes were also found in another genome-wide meta-analysis of cognitive function [21]. In addition, the hNP gene was associated with both depression [22] and memory [23]. The APOE-4 allele gene was linked to lower cognitive ability, faster cognitive decline [24], and poorer memory [25]. Studies reported that DISC1 gene polymorphisms were associated with depression [26, 27], cognition [27], and memory [27, 28]. These overlapping findings from univariate studies provided indirect clues about the potentially shared susceptibility genes for depression, cognition, and memory, but there still has been a lack of direct evidence for the shared genetic variants and genes among the three phenotypes. Multivariate genome-wide association studies (GWASs) can be performed to search for potential pleiotropic genetic variants affecting multiple phenotypes by jointly modeling all phenotypes simultaneously rather than focusing on the simple overlap of genetic variants among different studies. Moreover, multivariate GWASs have higher statistical power and more accurate parameter estimation than univariate GWASs, which is helpful for the discovery of pleiotropic and small effects of genes [29, 30]. Notably, twins are particularly valuable for genetic studies owing to their sharing of rearing and intrauterine environments, as well as genetic similarity and discrepancy [31]. The combination of twin-based design with GWAS is excellent in controlling population stratification and ‘passive gene-environment correlation (rGE)’ and can distinguish direct genetic effects from indirect genetic effects [32]. Additionally, owing to the severe European bias of GWASs, it is unknown how many genetic risk loci can be translated across ethnicities [33]. Human genomics research among under-represented populations and cross-ethnic studies are urgently needed, given that cross-ethnicity generalizability is vital for improving genetic risk prediction and the applicability of therapeutic targets, alleviating bias and unfairness against specific subpopulations [34]. Thus, we performed a multivariate GWAS among Qingdao twins in China to explore the potential pleiotropic single nucleotide polymorphisms (SNPs), genes, and pathways among depression, cognition, and memory. An additional pleiotropy analysis was also performed for interpreting the possibility of pleiotropy. Then, to determine if these findings in Chinese can be generalized to different ethnic groups (cross-ethnicity generalizability), we further validated the top variants in an independent UK Biobank (UKB) population to identify cross-ethnic associations.

Materials and methods

Study population

In the discovery phase, the participants were adult twins from the Qingdao Twins Registry, China, and the details have been described in previous literature [19]. Blood samples were collected from participants after they fasted overnight, and identification of zygosity was carried out by sex, blood type, and microsatellite DNA gene scanning and typing. Participants who were monozygotic twins, were lactating or pregnant, had serious diseases or lacked biological sample information were excluded. Finally, the current multivariate GWAS sample included 139 dizygotic twin pairs.

Phenotypes

Depression was assessed using the 30-item Geriatric Depression Scale (GDS-30, Chinese version), which consists of 30 questions, with an overall score of 0-30 points, and a higher score indicated more severe depressive symptoms. The GDS-30 is especially suitable for the assessment of depression in middle-aged and elderly individuals and is also highly valid in the Chinese population [35, 36]. Cognition was measured using the Montreal Cognitive Assessment (MoCA, Chinese version) with high reliability and acceptance in Chinese adults [37, 38]. This assessment involved attention, naming, delayed recall, language, visuospatial/executive ability, orientation, and abstraction, with a total score of 30 points. To correct for the effect of education on cognitive performance, education-adjusted scores were used [39], where the scores of participants with ≤12 years of education were given one additional point, but with a total score of no more than 30. A lower cognition score indicated worse cognitive ability. Memory was assessed by the backward and forward digit span tasks of the Wechsler Adult Intelligence (WAIS, Chinese version). The total score ranging from 0-17 was obtained by summing the scores of backward and forward digit span, and a lower score indicated worse memory. Digit span tasks have widely been used to reflect short-term memory, and the backward digit span also reflects working memory [40].

Genotyping, quality control, and imputation

Infinium Omni2.5Exome-8v1.2 BeadChip from Illumina was used for genotyping in dizygotic twins. After quality control, 1,338,905 SNPs with calling rate >0.98, locus missing <0.05, the significance of Hardy-Weinberg equilibrium (HWE) significance >1 × 10−4, and minor allele frequency (MAF) > 0.05 were included in this multivariate GWAS. On the basis of the linkage disequilibrium (LD) principle, IMPUTE2 software [41] was utilized to impute untyped SNPs with reference to the data collected during the third phase of the 1000 Genomes Project (ASIAN) [42]. After filtering by HWE > 1 × 10−4, MAF > 0.05, and R2 > 0.6, a total of 7,399,084 SNPs were finally used in the post-imputation multivariate GWAS.

Multivariate GWAS

SNP-based analysis

The genome-wide efficient mixed-model association (GEMMA) [43] was used to evaluate the association of SNP genotypes with depression-cognition-memory phenotypic pairs after depression, cognition, and memory scores were transformed by rank transformation based on Blom’s formula [44] to normalize their skewed distributions, adjusting for sex, age, and the first five genetic principal components (PCs). The GEMMA fitted a multivariate linear mixed model (mvLMM) for testing marker associations with multiple phenotypes (depression, cognition, and memory) simultaneously while controlling for relatedness (here intra-pair correlation of twins) and population structure. The significance level was defined as a P value threshold of <5 × 10−8 (a conventional Bonferroni-corrected threshold) [45], and the suggestive level was defined as a P value threshold of <1 × 10−5 (a commonly utilized threshold in GWAS) [46]. Quantile-quantile (Q-Q) and Manhattan plots were used to visualize the results. Furthermore, enhancer enrichment analysis was performed by submitting the list of the top 100 SNPs (ranked by P values) associated with depression-cognition-memory to HaploReg v4.1 [47], and the cell-type enhancers with a P value of <0.05 were reported. All genomic coordinates were based on human genome Build 37 (NCBI GRCh37). For interpreting the possibility of pleiotropy, we further performed a pleiotropy analysis by using the R package “pleio” [48] to test which phenotypes were associated with the potential pleiotropic genetic variants with suggestive significance in the current multivariate GWAS. Specifically, a new likelihood-ratio test with an extended sequential approach was used to test pleiotropy, which provides a testing framework to identify the number of phenotypes associated with a genetic variant, accounting for correlations among the phenotypes [48]. First, the sequential tests of pleiotropy started at the null hypothesis that all coefficients were equal to zero (test 0). If this test 0 was rejected, then test 1 was performed, which allowed one coefficient to be non-zero to test whether the remaining coefficients were equal to zero. If the test 1 was rejected, we then performed the test 2, which allowed two non-zero coefficients, considering all possible combinations of two non-zero coefficients and testing whether the remaining coefficients were equal to zero. Whenever a P value greater than 0.05 was derived, the sequential testing stopped. If the P value of test 2 remained <0.05, it implied that all three phenotypes were associated with this variant.

Gene-based analysis

VEGAS2 [49] was applied to carry out gene-based analysis by integrating the SNPs within a gene, and “1000 G East Asian Population” was used. A P value of <2.61 × 10−6 (0.05/19,152) was regarded as a significant threshold by Bonferroni correction due to the 19,152 genes tested, and the nominal significance level was defined as a P value of <0.05 [50].

Pathway enrichment analysis

PASCAL [51] was utilized to evaluate pathway scores. SNPs were mapped to genes, and then the joint score of all genes involved in a pathway was calculated. The chi-squared and empirical scores were utilized to assess pathway enrichment of high-scoring genes. The Reactome, KEGG, and BioCarta databases were used to obtain pathway information. An emp-P value of <4.64 × 10−5 (0.05/1,077) was regarded as a significant threshold by Bonferroni correction due to the 1,077 pathways tested, and the nominal significance level was defined as an emp-P value of <0.05.

Validation analysis

To identify the cross-ethnicity generalizability and cross-ethnic associations, we validated the top variants in an independent UKB population, which is a population-based cohort of 488,377 individuals with genotypic data across the United Kingdom; more details of genotyping, quality control, and imputation have been described elsewhere [52]. The phenotypic and genotypic data utilized in the current study were obtained from the third version of UKB data under an approved data application (application number: 66354). Depression was assessed using the two-item Patient Health Questionnaire (PHQ-2), and a total score of three and more indicated possible depression [53]. Cognition was measured through 13 numerical and verbal reasoning questions reflecting reasoning ability, and correct scores ranged from 0–13. Memory was assessed by the digit span task, and the maximum digits remembered correctly ranged from 2–12. A total of 46,102 individuals participated in and completed the depression, cognition, and memory tests. Cases (n = 355) were defined as participants with depression scores ≥3 and cognition and memory performance scores lower than the 25th percentile of their score distributions. Controls (n = 1775) were selected by matching individuals’ age and sex (ratio = 1:5) with those of the participants (n = 30,470) with depression scores <3 and cognition and memory performance ≥25th percentile of their score distributions. Finally, 2130 individuals (355 cases and 1775 controls) with a median age (interquartile range) of 55 (15) years were included in the validated sample. The top SNPs were validated by logistic regression analysis of the additive effect model, adjusting for the first 10 genetic PCs. A total of 469 of 481 SNPs (the union number before and after imputation) with P values lower than 1 × 10−5 in the discovery set were typed in the UKB data set and selected for validation. Thus, a P value < 1.07 × 10−4 (0.05/469) was regarded as a significant threshold by Bonferroni correction, and the nominal significance level was defined as a P value <0.05. Statistical analyses were performed utilizing R version 4.1.0.

Expression quantitative trait loci (eQTL) analysis

For SNPs with nominal significance in the validation set, we further checked their functional consequences by eQTL analysis across tissues using data from the GTEx portal (version 8) [54]. A P value lower than 0.05 was regarded as significant in the single-tissue eQTL analysis. The posterior probability m-value that the eQTL effect existed in each tissue of a cross-tissue meta-analysis higher than 0.9 indicated that the tissue had an eQTL effect [55]. An outline of the overall study design and analysis steps is shown in Fig. 1.
Fig. 1

Flowchart of the overall study design and analysis steps.

Flowchart of the overall study design and analysis steps.

Results

Basic characteristics

There were 139 pairs of dizygotic twins in the final discovery sample. The median (interquartile range) age was 49 (11) years, and the median scores (interquartile ranges) for depression, cognition, and memory for participants were 7 (7), 22 (5), and 12 (3) points, respectively (Supplementary Table 1). In the SNP-based study, the Q-Q plot (Fig. 2a) suggested no evidence of population stratification. The Manhattan plot (Fig. 3a) demonstrated that a total of 164 SNPs reached the level of suggestive significance (P < 1 × 10−5) (Supplementary Table 2); among them, rs3967317 (P = 1.21 × 10−8) on the CNTN4 gene on chromosome 3 exceeded the genome-wide significance level (P < 5 × 10−8). In addition, rs9863698 (P = 7.80 × 10−8) and rs3967316 (P = 1.33 × 10−7) on the CNTN4 gene, rs9261381 (P = 5.68 × 10−7) on the TRIM31 gene, rs11577464 (P = 5.82 × 10−7) on the LINC02567 gene, and rs73198369 (P = 7.00 × 10−7) on the RNU6-1325P gene reached suggestive significance. The top 20 SNPs ranked by P values are shown in Table 1. The additional pleiotropy analysis identified that 144 of the 164 SNPs (87.8%) were associated with two or three phenotypes (P < 0.05), with rs3967317 associated with depression and memory, rs9863698, rs3967316, and rs9261381 associated with all three phenotypes (Supplementary Table 2).
Fig. 2

Quantile-quantile plot for multivariate genome-wide association study of depression-cognition-memory.

a The quantile-quantile plot based on data before imputation. b The quantile-quantile plot based on data after imputation. The horizontal axis represents the expected −log10 (P), while the vertical axis represents the observed −log10 (P). The red line represents the expectation of the null hypothesis of no association, and the gray shaded area represents 95% confidence intervals of the null hypothesis. The black dots represent the observed data, and λ indicates genomic inflation.

Fig. 3

Manhattan plot for multivariate genome-wide association study of depression-cognition-memory.

a The Manhattan plot based on data before imputation. b The Manhattan plot based on data after imputation. The horizontal axis represents autosomes and the X chromosome, while the vertical axis represents the P values of SNPs. The red line represents the genome-wide significance threshold (5 × 10−8), and the lower horizontal dashed line represents the suggestive significance level (1 × 10−5).

Table 1

The top 20 SNPs from multivariate GWAS of depression-cognition-memory.

SNPChrBandBPP valueGene or nearest geneOfficial full name
rs396731733p26.3-p26.230587071.21E-08CNTN4contactin 4
rs986369833p26.3-p26.230593737.80E-08CNTN4contactin 4
rs396731633p26.3-p26.230625781.33E-07CNTN4contactin 4
rs926138166p22.1300600025.68E-07TRIM31tripartite motif containing 31
rs1157746411p31.1772591075.82E-07LINC02567long intergenic non-protein coding RNA 2567
rs7319836944q13.1608659687.00E-07RNU6-1325PRNA, U6 small nuclear 1325, pseudogene
rs80363891515q26.2974924307.59E-07RN7SKP181RN7SK pseudogene 181
rs583501641010q26.111205684199.01E-07CACUL1CDK2 associated cullin domain 1
rs95894701313q31.3929178561.19E-06GPC5glypican 5
rs705693823Xq21.31876883601.45E-06CPXCR1CPX chromosome region candidate 1
rs1149418401919p12224435581.76E-06LOC107985324uncharacterized LOC107985324
rs926093666p22.1299585021.77E-06DDX39BP2DEAD-box helicase 39B pseudogene 2
rs926091866p22.1299487512.10E-06MICDMHC class I polypeptide-related sequence D (pseudogene)
rs926093166p22.1299575082.15E-06DDX39BP2DEAD-box helicase 39B pseudogene 2
rs926067266p22.1299249962.23E-06HLA-Wmajor histocompatibility complex, class I, W (pseudogene)
rs926073366p22.1299324332.31E-06HLA-Wmajor histocompatibility complex, class I, W (pseudogene)
rs49856941717p11.2168613322.36E-06TNFRSF13BTNF receptor superfamily member 13B
rs728359881717p11.2168686762.36E-06TNFRSF13BTNF receptor superfamily member 13B
rs382338266p22.1299452102.45E-06HCG9HLA complex group 9
rs13318481313q12.3315087242.46E-06TEX26; TEX26-AS1testis expressed 26; TEX26 antisense RNA 1

SNP nucleotide polymorphism, Chr chromosome, BP base pair. The content discussed in detail or SNPs with nominal significance in UK Biobank were in bold.

Quantile-quantile plot for multivariate genome-wide association study of depression-cognition-memory.

a The quantile-quantile plot based on data before imputation. b The quantile-quantile plot based on data after imputation. The horizontal axis represents the expected −log10 (P), while the vertical axis represents the observed −log10 (P). The red line represents the expectation of the null hypothesis of no association, and the gray shaded area represents 95% confidence intervals of the null hypothesis. The black dots represent the observed data, and λ indicates genomic inflation.

Manhattan plot for multivariate genome-wide association study of depression-cognition-memory.

a The Manhattan plot based on data before imputation. b The Manhattan plot based on data after imputation. The horizontal axis represents autosomes and the X chromosome, while the vertical axis represents the P values of SNPs. The red line represents the genome-wide significance threshold (5 × 10−8), and the lower horizontal dashed line represents the suggestive significance level (1 × 10−5). The top 20 SNPs from multivariate GWAS of depression-cognition-memory. SNP nucleotide polymorphism, Chr chromosome, BP base pair. The content discussed in detail or SNPs with nominal significance in UK Biobank were in bold. Enhancer enrichment analysis found that the top 100 depression-cognition-memory-related genetic variants were significantly enriched in six tissues and cells, including pancreatic islets, stomach mucosa, fetal intestine large, primary natural killer cells, and T regulatory cells from peripheral blood, and liver (P < 0.05) (Supplementary Table 3). After using the data of the third phase of the 1000 Genomes Project as a reference to impute untyped SNPs, there was still no evidence of population stratification (Fig. 2b), and the amounts of SNPs with suggestive significance increased, with a total of 457 SNPs reaching the level of suggestive significance (Fig. 3b, Supplementary Table 4). The first three top SNPs, chr6:24597173, rs12210323, and rs12213116 (P = 1.71 × 10−7−1.72 × 10−7), were located in the KIAA0319 gene, which had biased expression in multiple brain tissues (Supplementary Fig 1), followed by rs61783213 (P = 2.77 × 10−7) in the LINC02567 gene. The top 20 SNPs after imputation are shown in Supplementary Table 5. A total of 377 of the 457 SNPs (82.5%) were associated with two or three phenotypes (P < 0.05) (Supplementary Table 4). In the gene-based study, no statistically significant gene was found (P < 2.61 × 10−6), but 1107 genes reached the nominal significance level (P < 0.05). Most of these genes are known to be involved in metabolism, immunity, and neuronal systems, and the top 20 genes are shown in Table 2.
Table 2

The top 20 genes associated with depression-cognition-memory from gene-based analysis.

GeneChrSNP (N)StartStopP valueTop-SNPTop-SNP P value
TMEM2572321449089271449113701.00E-05rs59199092.59E-06
TEX26132131506833315491532.10E-05rs13318482.46E-06
MEDAG131431480311314997094.80E-05rs107425.35E-05
LSM101536859030368634936.80E-05rs46531852.07E-05
ZMAT4820040388110407553439.10E-05rs286032513.45E-05
KIAA0232431678445868858999.50E-05rs38931853.69E-06
VDAC38542249278422634551.47E-04rs597916942.17E-04
CTD-2297D10.2517513219751401672.63E-04rs47022101.53E-05
OR6K6121587246051587256373.21E-04rs168410016.69E-05
SLC26A786292221721924103823.49E-04rs101010128.61E-04
TEX26-AS1133731456971315067453.84E-04rs107425.35E-05
C31938667784567206624.03E-04rs22506567.69E-04
TET1105670320116704542394.39E-04rs79129847.30E-06
LOC102724316101929698462297767854.47E-04rs64815963.22E-05
OR6N1151587355331587364725.41E-04rs8578263.86E-04
IQCJ3741587870401589840965.76E-04rs104704731.91E-04
TBC1D161711377906141780096575.84E-04rs126029513.47E-04
WDPCP28663348534638158676.02E-04rs65459887.39E-05
USP12-AS1131427699669277432726.08E-04rs95125568.12E-04
CATSPERG191738826442388615896.66E-04rs22700952.03E-04

Chr chromosome, SNP nucleotide polymorphism. The content discussed in detail were in bold.

The top 20 genes associated with depression-cognition-memory from gene-based analysis. Chr chromosome, SNP nucleotide polymorphism. The content discussed in detail were in bold. In the pathway-based analysis, no statistically significant pathway was found (P < 4.64 × 10−5), but 587 pathways were found to be nominally associated with depression-cognition-memory (emp-P < 0.05), and most of these pathways were involved in the metabolism of amino acids, lipids and RNA, the immune system, and the neuronal system. The top 20 pathways are shown in Table 3.
Table 3

The top 20 pathways associated with depression-cognition-memory from pathway enrichment analysis.

Pathwaychisq-Pemp-P−log (chisq-P)−log (emp-P)
REACTOME_METABOLISM_OF_AMINO_ACIDS_AND_DERIVATIVES6.50E-042.42E-043.193.62
REACTOME_SPHINGOLIPID_DE_NOVO_BIOSYNTHESIS1.08E-032.65E-042.973.58
REACTOME_SLBP_DEPENDENT_PROCESSING_OF_REPLICATION_DEPENDENT_HISTONE_PRE_MRNAS6.63E-043.95E-043.183.40
REACTOME_PROCESSING_OF_CAPPED_INTRONLESS_PRE_MRNA6.63E-044.18E-043.183.38
KEGG_SPHINGOLIPID_METABOLISM1.22E-035.30E-042.913.28
REACTOME_N_GLYCAN_ANTENNAE_ELONGATION_IN_THE_MEDIAL_TRANS_GOLGI1.84E-037.70E-042.743.11
REACTOME_N_GLYCAN_ANTENNAE_ELONGATION1.84E-031.03E-032.742.99
REACTOME_IMMUNOREGULATORY_INTERACTIONS_BETWEEN_A_LYMPHOID_AND_A_NON_LYMPHOID_CELL1.46E-031.24E-032.832.91
REACTOME_TRANSCRIPTIONAL_REGULATION_OF_WHITE_ADIPOCYTE_DIFFERENTIATION3.12E-031.41E-032.512.85
REACTOME_NONSENSE_MEDIATED_DECAY_ENHANCED_BY_THE_EXON_JUNCTION_COMPLEX2.42E-031.43E-032.622.84
REACTOME_TRANSPORT_TO_THE_GOLGI_AND_SUBSEQUENT_MODIFICATION1.93E-031.47E-032.712.83
REACTOME_SIGNALING_BY_FGFR1_FUSION_MUTANTS1.55E-031.66E-032.812.78
REACTOME_SIGNAL_REGULATORY_PROTEIN_SIRP_FAMILY_INTERACTIONS1.89E-031.87E-032.722.73
REACTOME_OTHER_SEMAPHORIN_INTERACTIONS1.89E-031.91E-032.722.72
REACTOME_DNA_REPAIR5.00E-032.13E-032.302.67
REACTOME_TRANSFERRIN_ENDOCYTOSIS_AND_RECYCLING3.30E-033.01E-032.482.52
BIOCARTA_NKCELLS_PATHWAY4.69E-033.04E-032.332.52
REACTOME_SIGNALING_BY_FGFR3_MUTANTS3.78E-033.06E-032.422.51
REACTOME_APOPTOTIC_EXECUTION_PHASE5.75E-033.10E-032.242.51
REACTOME_CELL_JUNCTION_ORGANIZATION6.99E-033.15E-032.162.50
The top 20 pathways associated with depression-cognition-memory from pathway enrichment analysis.

Validation analysis

A total of 469 SNPs with P values lower than 1 × 10−5 in the discovery set had genotype data in the UKB validation set and were selected for validation. Although no SNP passed the Bonferroni correction level, 12 SNPs reached the nominal significance level (P < 0.05), three of them (rs13209442, rs13208577, and rs12213116) were located in the KIAA0319 gene, and one (rs9261134) was located in the ZNRD1ASP gene (Supplementary Table 6).

eQTL analysis

The eQTL analysis across tissues found that four (rs2539731, rs17337582, rs62358383, and rs9261134) of 12 SNPs with nominal significance in the validation set were significant eQTLs in multiple tissues, specifically in brain tissues (Supplementary Figs 2–5). Among these SNPs, rs2539731 (Supplementary Fig 2, brain-substantia nigra: P = 1.10 × 10−3, m-value = 0.965; brain-nucleus accumbens: P = 3.60 × 10−5, m-value = 0.991), rs17337582 (Supplementary Fig 3, brain-substantia nigra: P = 1.20 × 10−3, m-value = 0.951; brain-nucleus accumbens: P = 4.10 × 10−5, m-value=0.996), and rs62358383 (Supplementary Fig 4, brain-substantia nigra: P = 1.90 × 10−3, m-value = 0.927; brain-nucleus accumbens: P = 5.70 × 10−5, m-value = 0.987) were significantly associated with the expression of MAP3K1 gene in brain tissues. Rs9261134 located at the ZNRD1ASP gene was significantly associated with the expression of 15 genes (Supplementary Table 6) in multiple brain tissues (brain-nucleus accumbens, brain-frontal cortex, brain-caudate, brain-putamen, brain-hypothalamus, brain-amygdala, brain-cortex, and brain-anterior cingulate cortex) (P < 0.5, m-value > 0.9), and ZNRD1ASP expression with rs9261134 across tissues is shown in Supplementary Fig 5.

Discussion

The current study performed the first multivariate GWAS of depression-cognition-memory and found some pleiotropic SNPs, genes, and pathways among depression, cognition, and memory in Qingdao twins. Moreover, multiple variants were replicated in another independent UKB population. Although few previous achievements have been made involving pleiotropic variants of depression, cognition, and memory across ancestries, some recent literature has focused on both depression and cognitive function or depression related variants across ancestry groups. Thalamuthu et al. performed a genome-wide interaction analysis of major depressive disorder (MDD) with cognitive function among European cohorts. The study revealed that MDD status had a moderating effect on the associations of variants with cognitive function, with some SNPs associated with cognitive domains in the context of MDD [56]. Another study conducted pleiotropy analyses, utilizing MDD and late-onset AD GWAS data based on European ancestry, thereby indicating that the genetic risks associated with AD might influence MDD risk [57]. Cross-ethnic studies, similar to our findings, demonstrated a small shared polygenic basis for depression in European and East Asian populations. Bigdeli et al. found a weak overlap of SNP effects between East Asian and European ancestries by combining MDD GWAS summary statistics of Chinese and European participants [58]. One significant SNP (rs10912903) was replicated in the current multivariate GWAS, with a P value of 7.15 × 10−3. Another study also showed that only 11% of depression risk loci previously identified in the European population reached nominal significance in the East Asian population [59]. In the SNP-based analysis, the strongest association signal was rs3967317 located in the CNTN4 gene on chromosome 3, which exceeded the genome-wide significance level. The following were rs9863698 and rs3967316, which were also located in the CNTN4 gene. The CNTN4-encoded protein belongs to the contactin family and is involved in neuronal network development and plasticity. Pertinently, studies have found that CNTN4 is associated with mental retardation [60] and affects intelligence [61]. Rs9261381 was located in the TRIM31 gene on chromosome 6. TRIM31 encodes a protein that functions as an E3 ubiquitin-protein ligase and can regulate cell growth. Studies have shown that the TRIM31 gene was associated with intelligence only in the background of a psychiatric disorder [62]. Furthermore, the top 100 depression-cognition-memory related genetic variants were significantly enriched in pancreatic islets, stomach mucosa, fetal intestine large, primary natural killer cells and T regulatory cells from peripheral blood, and the liver. Ample evidence has shown that these tissues and cells are closely related to depression, cognition, and memory [63-70], which further supports our findings. After imputing untyped SNPs, more SNPs with suggestive significance were identified. The SNP chr6:24597173 located in the KLAA0319 gene showed the strongest association, although this might be mainly driven by the association of the SNP with cognition. However, the KLAA0319 gene demonstrated a biased expression in multiple brain tissues, and three of 12 SNPs with nominal significance in the validation phase among the UKB population whose cases were abnormal on all three phenotypes and controls were normal were located in the KIAA0319 gene, indicating a potential cross-ethnic association of KIAA0319 with depression-cognition-memory. The protein encoded by KIAA0319 can regulate cell adhesion and neuronal migration processes to influence the growth of the cerebral cortex, and KIAA0319 has been found to be associated with category fluency, recall process, and verbal learning [71]. In addition, four of the other nine SNPs were significant eQTLs in brain tissues that control mood, cognition, and memory; moreover, three SNPs were significantly associated with the expression level of the MAP3K1 gene in the brain-substantia nigra and brain-nucleus accumbens. Pertinently, the protein encoded by MAP3K1 is part of the nuclear factor kappa beta (NF-κB) pathway [72]. NF-κB has been found to be involved in the pathophysiology of depression [73] and is closely related to the pathogenesis of AD [74]. Most genes with nominal significance levels in the gene-based analysis are known to be involved in metabolism, immunity, and neuronal systems. The potential mechanisms of several interesting genes other than those mentioned above were as follows: (1) MEDAG is an adipogenic gene that can promote the formation of adipocytes [75]. The lipid metabolism process involves the pathogenesis of depression and AD [76, 77]; (2) the C3a peptide encoded by C3 can modulate the inflammation process which is closely related to depression and AD [7, 8]; and (3) the protein encoded by TET1 influences gene activation and the process of DNA methylation. Studies have shown that the expression level of TET1 is significantly increased in psychotic participants [78], and TET1 variation is associated with late-onset AD [79]; (4) FGF1 is related to the survival of neurons and is involved in various biological processes. Furthermore, FGF1 has been reported to be associated with AD in Han Chinese individuals [80]. In the pathway enrichment analysis, most pathways that reached nominal significance were related to metabolism, immune, and neuronal system, and several important pathways were revealed as follows: (1) metabolism of amino acids and derivatives, various metabolites of this pathway including glutamate and glycine are involved in the pathophysiology of depression and AD [81, 82]; (2) sphingolipid de novo biosynthesis and (3) sphingolipid metabolism pathways, both of which are involved in the metabolism process of sphingolipid. The intermediate product is ceramide, which is closely related to the pathological mechanisms of depression and AD and used as a therapeutic target [83]. For (4) N-glycan antennae elongation in the medial/trans-Golgi and (5) N-glycan antennae elongation, there are significant differences between depression patients and controls in the serum N-glycan structure levels [84], and N-glycans can influence the development and progression of AD by regulating the key glycoproteins [85]. (6) Immunoregulatory interactions between a lymphoid and a non-lymphoid cell, which is an important pathway in the immune system; relevantly, the immune system is regarded as a major factor in both depression and AD [86, 87]. A previous study also found immune-related pathways in the shared genetic etiology of depression and AD [57]. (7) Other semaphorin interactions pathway containing four types of plexins and eight classes of semaphorins is involved in axon guidance and the development of the nervous system [88]. Semaphorins have been related to major depression risk [89], and plexin-A4 can mediate Aβ-induced tau pathology in the pathogenesis of AD [90]. However, some pathways, such as cortisol/stress responses and cholinergic and serotonergic function [10], have been clearly linked to both depression and AD but were not identified as top pathways in the current study with higher P values (stress pathway: emp-P = 2.60 × 10−2, neurotransmitter release cycle: emp-P = 2.16 × 10−2, acetylcholine neurotransmitter release cycle pathway: emp-P = 2.19 × 10−2, etc.). Except for the limitation of the small sample size, another possible reason was that although several physiological processes were thought to be the common mechanisms of both depression and AD, they might be mainly caused by different genes in depression and AD, rather than pleiotropic genes, and further research is required in the future. The current multivariate GWAS had several strengths. First, this is the first study to identify potential pleiotropic SNPs, genes, and pathways among depression, cognition, and memory phenotypes in Chinese individuals, which may not only provide insight into a common genetic basis of these phenotypes but also make a little contribution in shifting the Eurocentric bias of GWASs. Second, this current GWAS was performed in twin samples. The twin-based GWAS design has been demonstrated excellence in controlling population stratification and passive rGE and can identify direct genetic effects [32], which reduced the concerns about false-positive errors and the confusion of indirect genetic effects. Third, validation analysis was performed in another independent UKB population, which allowed the identification of cross-ethnic generalizability. However, several study limitations should also be considered. First, the sample size of the current study was relatively small owing to the difficulty in twin pair recruitment, even though the algorithm of multivariate GWAS has the natural advantage of power [91], which may restrict the discovery of more significantly associated SNPs, genes, and pathways. In fact, no gene and pathway reached the significance threshold of Bonferroni correction, which might be partly due to the small sample size, and further research in a large East Asian population is needed. Notably, cognitive decline in depressed individuals could be secondary to depression itself, such as psychomotor slowing and withdrawal from engagement in activities that were conducive to cognition. Similarly, cognitive dysfunction might lead to stress and psychological burdens that give rise to depressive symptoms. In these cases, the genetics affecting cognition and memory might largely be separate from those that served as other risk factors for depression, which might also explain the lack of extensive genetic overlap among the three phenotypes in the current study. Second, the assessment methods of depression, cognition, and memory phenotypes were not perfectly consistent between the discovery set and validation set; for example, depression was assessed using the GDS-30 in the current multivariate GWAS and the PHQ-2 in UKB validation analysis. Although both the GDS-30 and PHQ-2 are reliable and valid and have been widely used as screening measures for depression, there may still be phenotypic heterogeneity to some degree due to the not identical questionnaire items. The phenotypic heterogeneity and ethnic difference in the MAF and LD structure might partly account for the fewer significant findings revealed in the validation analysis. Nevertheless, we still found a potential cross-ethnic association of KIAA0319. However, the strongest SNP (rs3967317), which demonstrated genome-wide significance (P < 5 × 10−8) and still showed suggestive significance after imputation (P < 1 × 10−5) in the discovery set, was not replicated in the UKB validation sample, but it may still be worth being further validated in the Asian population as a promising candidate variant. In conclusion, this multivariate GWAS identified some pleiotropic SNPs, genes, and pathways among depression, cognition, and memory, which provided evidence for a common genetic basis of the three phenotypes and clues for further exploring the shared genetic pathogenesis of depression with AD, and it might be helpful in the search for new therapeutic targets for both diseases. Supplementary Table 1 Supplementary Table 2 Supplementary Table 3 Supplementary Table 4 Supplementary Table 5 Supplementary Table 6 Supplementary figure and table legends Supplementary Figure 1 Supplementary Figure 2 Supplementary Figure 3 Supplementary Figure 4 Supplementary Figure 5
  87 in total

1.  The genetic basis for cognitive ability, memory, and depression symptomatology in middle-aged and elderly chinese twins.

Authors:  Chunsheng Xu; Jianping Sun; Fuling Ji; Xiaocao Tian; Haiping Duan; Yaoming Zhai; Shaojie Wang; Zengchang Pang; Dongfeng Zhang; Zhongtang Zhao; Shuxia Li; Jacob V B Hjelmborg; Kaare Christensen; Qihua Tan
Journal:  Twin Res Hum Genet       Date:  2015-01-14       Impact factor: 1.587

2.  Integrated Metabolomics and Proteomics Analysis of Hippocampus in a Rat Model of Depression.

Authors:  Yuqing Zhang; Shuai Yuan; Juncai Pu; Lining Yang; Xinyu Zhou; Lanxiang Liu; Xiaofeng Jiang; Hanping Zhang; Teng Teng; Lu Tian; Peng Xie
Journal:  Neuroscience       Date:  2017-12-10       Impact factor: 3.590

3.  Validating the Geriatric Depression Scale with proxy-based data: A case-control psychological autopsy study in rural China.

Authors:  Lu Niu; Cunxian Jia; Zhenyu Ma; Guojun Wang; Zhenjun Yu; Liang Zhou
Journal:  J Affect Disord       Date:  2018-08-18       Impact factor: 4.839

Review 4.  N-glycan and Alzheimer's disease.

Authors:  Yasuhiko Kizuka; Shinobu Kitazume; Naoyuki Taniguchi
Journal:  Biochim Biophys Acta Gen Subj       Date:  2017-04-29       Impact factor: 3.770

5.  The Genetic Architecture of Depression in Individuals of East Asian Ancestry: A Genome-Wide Association Study.

Authors:  Olga Giannakopoulou; Kuang Lin; Xiangrui Meng; Mei-Hsin Su; Po-Hsiu Kuo; Roseann E Peterson; Swapnil Awasthi; Arden Moscati; Jonathan R I Coleman; Nick Bass; Iona Y Millwood; Yiping Chen; Zhengming Chen; Hsi-Chung Chen; Mong-Liang Lu; Ming-Chyi Huang; Chun-Hsin Chen; Eli A Stahl; Ruth J F Loos; Niamh Mullins; Robert J Ursano; Ronald C Kessler; Murray B Stein; Srijan Sen; Laura J Scott; Margit Burmeister; Yu Fang; Jess Tyrrell; Yunxuan Jiang; Chao Tian; Andrew M McIntosh; Stephan Ripke; Erin C Dunn; Kenneth S Kendler; Robin G Walters; Cathryn M Lewis; Karoline Kuchenbaecker
Journal:  JAMA Psychiatry       Date:  2021-11-01       Impact factor: 25.911

Review 6.  The genetics and biology of DISC1--an emerging role in psychosis and cognition.

Authors:  David J Porteous; Pippa Thomson; Nicholas J Brandon; J Kirsty Millar
Journal:  Biol Psychiatry       Date:  2006-07-15       Impact factor: 13.382

7.  Upregulation of TET1 and downregulation of APOBEC3A and APOBEC3C in the parietal cortex of psychotic patients.

Authors:  E Dong; D P Gavin; Y Chen; J Davis
Journal:  Transl Psychiatry       Date:  2012-09-04       Impact factor: 6.222

8.  Estimation of significance thresholds for genomewide association scans.

Authors:  Frank Dudbridge; Arief Gusnanto
Journal:  Genet Epidemiol       Date:  2008-04       Impact factor: 2.135

9.  708 Common and 2010 rare DISC1 locus variants identified in 1542 subjects: analysis for association with psychiatric disorder and cognitive traits.

Authors:  P A Thomson; J S Parla; A F McRae; M Kramer; K Ramakrishnan; J Yao; D C Soares; S McCarthy; S W Morris; L Cardone; S Cass; E Ghiban; W Hennah; K L Evans; D Rebolini; J K Millar; S E Harris; J M Starr; D J MacIntyre; A M McIntosh; J D Watson; I J Deary; P M Visscher; D H Blackwood; W R McCombie; D J Porteous
Journal:  Mol Psychiatry       Date:  2013-06-04       Impact factor: 15.992

10.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics.

Authors:  David Lamparter; Daniel Marbach; Rico Rueedi; Zoltán Kutalik; Sven Bergmann
Journal:  PLoS Comput Biol       Date:  2016-01-25       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.