| Literature DB >> 35094046 |
Sophie L Farrow1,2, William Schierding1,2, Sreemol Gokuladhas1, Evgeniia Golovina1, Tayaza Fadason1,2, Antony A Cooper3,4, Justin M O'Sullivan1,2,3,5,6.
Abstract
The latest meta-analysis of genome-wide association studies identified 90 independent variants across 78 genomic regions associated with Parkinson's disease, yet the mechanisms by which these variants influence the development of the disease remains largely elusive. To establish the functional gene regulatory networks associated with Parkinson's disease risk variants, we utilized an approach combining spatial (chromosomal conformation capture) and functional (expression quantitative trait loci) data. We identified 518 genes subject to regulation by 76 Parkinson's variants across 49 tissues, whicih encompass 36 peripheral and 13 CNS tissues. Notably, one-third of these genes were regulated via trans-acting mechanisms (distal; risk locus-gene separated by >1 Mb, or on different chromosomes). Of particular interest is the identification of a novel trans-expression quantitative trait loci-gene connection between rs10847864 and SYNJ1 in the adult brain cortex, highlighting a convergence between familial studies and Parkinson's disease genome-wide association studies loci for SYNJ1 (PARK20) for the first time. Furthermore, we identified 16 neurodevelopment-specific expression quantitative trait loci-gene regulatory connections within the foetal cortex, consistent with hypotheses suggesting a neurodevelopmental involvement in the pathogenesis of Parkinson's disease. Through utilizing Louvain clustering we extracted nine significant and highly intraconnected clusters within the entire gene regulatory network. The nine clusters are enriched for specific biological processes and pathways, some of which have not previously been associated with Parkinson's disease. Together, our results not only contribute to an overall understanding of the mechanisms and impact of specific combinations of Parkinson's disease variants, but also highlight the potential impact gene regulatory networks may have when elucidating aetiological subtypes of Parkinson's disease.Entities:
Keywords: Parkinson’s disease; gene regulation; genetics; precision medicine; spatial genomics
Mesh:
Year: 2022 PMID: 35094046 PMCID: PMC9373962 DOI: 10.1093/brain/awac022
Source DB: PubMed Journal: Brain ISSN: 0006-8950 Impact factor: 15.255
Figure 1Methods workflow. Ninety Parkinson-SNPs were obtained from Nalls et al.[5] Spatial interactions between the 90 Parkinson-SNPs and genes were identified from Hi-C libraries (Supplementary Table 2). The resulting spatial SNP–gene pairs were then used to query GTEx v8 to identify significant eQTLs. The significant spatial SNP–gene pairs were then analysed for functional relevance using multiple tools and databases (‘Materials and methods’ section). Figure adapted from Schierding et al.[19]
Summary statistics for the spatial eQT–gene regulatory network for the 90 Parkinson-SNPs
| Parkinson’s disease SNPs | 1000 simulation SNP sets [Mean (min–max)][ | |||
|---|---|---|---|---|
| Brain-specific[ | All tissues[ | GWAS SNPs (all tissues) | All SNPs (all tissues) | |
| No. SNPs | 90 | 90 | 90 | 90 |
| No. eQTL SNPs[ | 55 | 76 | 54 (33–74) | 7 (0–17) |
| No. genes[ | 165 | 518 | 235 (93–478) | 18 (0–135) |
| No. eQTL–gene pairs[ | 167 | 542 | 244 (93–518) | 19 (0–136) |
| No. | 30 | 178 | 37 (13–63) | 2 (0–15) |
SNPs were downloaded from the Nalls et al.[5] GWAS (download date: 18 June 2020). eQTLs (both cis- and trans-acting eQTL) were only called if: (i) a spatial interaction connecting the SNP and gene had been captured; and (ii) if the adjusted P-value for the eQTL association was P < 0.05 following a step-wise Bonferroni Hochberg correction for the number of tests that were performed during the eQTL calling. All adjusted P-values are presented for all eQTLs (cis and trans; Supplementary Tables 3 and 4).
Spatial eQTL interactions identified for two sets of null distributions generated from (i) all called variants in the GTEx v8; and (ii) trait-associated SNPs in the GWAS Catalogue (accessed 18 November 2021). Each set contained 1000 simulations of 90 randomly selected SNPs without replication.
For full list of tissue eQTL–gene interactions see Supplementary Tables 3 and 4.
eQTL SNPs were defined as having significant spatial interactions (FDR ≤ 0.05) with at least one gene.
Genes were those whose expression was shown to be affected by an eQTL SNP.
The total number of SNP-gene pairs reflects interactions with FDR ≤ 0.05 in at least one GTEx tissue.
Proportion of genes subject to cis- and trans-regulation
| Genes subject to | ||
|---|---|---|
| Brain-specifica | All tissuesb | |
| 136 (82.0%) | 364 (67.2%) | |
| 10 (6.0%) | 56 (10.3%) | |
| 20 (12.0%) | 122 (22.5%) | |
The proportion of eQTL-gene pairs that are either cis-, trans-intrachromosomal or trans-interchromosomal in 13 GTEx brain-specific tissuesa and all 49 GTEx tissuesb. Brain-specific indicates the eQTL dataset obtained through analysing Hi-C cell lines only from the brain and eQTLs only from the brain tissues in GTEx. All-tissues indicates the eQTL dataset obtained through analysing all Hi-C cell lines and eQTLs from all tissues in GTEx. There is a significant difference (chi square test P-value < 0.01) between brain tissues and all tissues for the proportions of the cis versus trans eQTLs.
For detailed information on the specific eQTL–gene pairs see Supplementary Tables 3 and 4.
Figure 2Correlation between genotype samples per tissue and number of eQTLs present in the tissue. (A) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of eQTLs (including cis, trans-intrachromosomal and trans-interchromosomal) per tissue, in 13 brain-specific tissues. (B) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of eQTLs (including cis, trans-intrachromosomal and trans-interchromosomal) per tissue, in all 49 tissues. (C) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of cis-eQTLs per tissue, in 13 brain-specific tissues. (D) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of cis-eQTLs per tissue, in all 49 tissues. (E) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of trans-intrachromosomal-eQTLs per tissue, in 13 brain-specific tissues. (F) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of trans-intrachromosomal-eQTLs per tissue, in all 49 tissues. (G) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of trans-interchromosomal-eQTLs per tissue, in 13 brain-specific tissues. (H) Correlation between the number of genotyped samples per tissue (in GTEx) and the number of trans-interchromosomal-eQTLs per tissue, in all 49 tissues. The tissues that fall furthest from the confidence interval are annotated. The grey dots show the correlation for all GTEx tissues. The 13 brain tissues (from GTEx) are indicated by the coloured dots, as shown in the legend. For information on all tissues outside of the 95% confidence interval, see Supplementary Table 12.
Figure 3Genes subjected to Genes that are loss-of-function intolerant, as measured by a continuous LOEUF score, are enriched in trans-regulatory interactions involving Parkinson-SNPs. The LOEUF score is a continuous value that indicates the tolerance of a given gene to inactivation. Low LOEUF scores indicate stronger selection against loss-of-function variation. The distribution is shown as a violin plot with the median (LOEUF) values for each eQTL group (black text). The groups were compared using a Kruskal–Wallis test (**P-value < 0.01); the absence of a significance value indicates the LOEUF values of the two groups were not significantly different. No eQTL = all genes in gnomAD with an assigned probability of being loss-of-function intolerant (pLI) or LOEUF for which an eQTL was not identified in this study (∼18 500 genes). Not all genes had LOEUF scores (Supplementary Fig. 2 and Supplementary Table 7).
Figure 4Gene regulation in the foetal cortex compared to the adult cortex. The leftmost section shows genes that are regulated only in the foetal cortex, with no eQTLs seen in any of the 13 adult brain tissues. The middle section shows genes that are regulated in both the foetal and adult cortex. The black dots show the regulation effect size of the gene in the adult cortex, and the grey dots show the regulation effect size of the gene across the different brain tissues (where an eQTL is seen). The rightmost section shows genes that are regulated in both the foetal cortex and adult non-cortical brain tissue.
Figure 5Louvain clustering analysis highlights nine significant clusters, indicative of biological connectivity. The grey and orange shading of the nodes is indicative of whether the gene is subject to regulation via cis- or trans-mechanisms. The pink- and turquoise-shaded circles indicate genes that are regulated in adult brain tissue and foetal cortex, respectively. The clusters were also analysed in STRING with an increased stringency [only PPIs with a high confidence level (>0.700, as defined in STRING) were used for this analysis, and interactions identified only through text-mining were excluded]. This exclusion led to very few changes, with cluster 6 the only cluster to lose any connectivity within the cluster (WDHD1, NCAPG and PARPBP no longer connect). Experimentally determined = imported from experimental repositories; gene neighbourhood = similar genomic context in different species suggest a similar function of the proteins; gene fusions = fused proteins are recognized by orthology of the fused parts to other, non-fused proteins; gene co-occurrence = indicates the presence of a specific gene pair is in agreement in all species—must be expressed together; co-expression = predicted association between genes/proteins based on RNA and/or protein expression.