| Literature DB >> 32005911 |
Tobias Strunz1, Susette Lauwen2,3, Christina Kiel1, Anneke den Hollander2,3, Bernhard H F Weber4.
Abstract
Genome-wide association studies (GWAS) for late stage age-related macular degeneration (AMD) have identified 52 independent genetic variants with genome-wide significance at 34 genomic loci. Typically, such an approach rarely results in the identification of functional variants implicating a defined gene in the disease process. We now performed a transcriptome-wide association study (TWAS) allowing the prediction of effects of AMD-associated genetic variants on gene expression. The TWAS was based on the genotypes of 16,144 late-stage AMD cases and 17,832 healthy controls, and gene expression was imputed for 27 different human tissues which were obtained from 134 to 421 individuals. A linear regression model including each individuals imputed gene expression data and the respective AMD status identified 106 genes significantly associated to AMD variants in at least one tissue (Q-value < 0.001). Gene enrichment analysis highlighted rather systemic than tissue- or cell-specific processes. Remarkably, 31 of the 106 genes overlapped with significant GWAS signals of other complex traits and diseases, such as neurological or autoimmune conditions. Taken together, our study highlights the fact that expression of genes associated with AMD is not restricted to retinal tissue as could be expected for an eye disease of the posterior pole, but instead is rather ubiquitous suggesting processes underlying AMD pathology to be of systemic nature.Entities:
Mesh:
Year: 2020 PMID: 32005911 PMCID: PMC6994629 DOI: 10.1038/s41598-020-58510-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1TWAS results for 27 tissues. A TWAS was conducted based on the genotypes of 16,144 late-stage AMD cases and 17,832 AMD-free controls. Prediction models of 27 tissues were included in the analysis. The schematic overview demonstrates the number of significant AMD-associated genes (FDR < 1 × 10−3) within the respective tissue. If a gene was found exclusively in a single tissue, it was marked as tissue-specific (TS). Tissue classification was performed manually according to main functions or metabolic assignments. Adipose SU: Adipose Subcutaneous; Adipose VO: Adipose Visceral Omentum; Artery AO: Artery Aorta; Artery TI: Artery Tibial; Brain CE: Brain Cerebellum; Breast MT: Breast Mammary Tissue; Cells TF: Cells Transformed fibroblasts; Colon SI: Colon Sigmoid; Colon TR: Colon Transverse; Esophagus GJ: Esophagus Gastroesophageal Junction; Esophagus MC: Esophagus Mucosa; Esophagus MS: Esophagus Muscularis; Heart AA: Heart Atrial Appendage; Heart LV: Heart Left Ventricle; Muscle SK: Muscle Skeletal; Nerve TI: Nerve Tibial; Skin NSS: Skin Not Sun Exposed Suprapubic; Skin SEL: Skin Sun Exposed Lower leg.
AMD-associated genes located in known AMD loci.
| AMD locusa | Locus namea | Chr | AMD-associated genes in TWAS analysisb (Number of significant tissues (FDR < 0.001) and effect direction within tissues)c |
|---|---|---|---|
| 1 | 1 | ||
| 2 | 2 | ||
| 4 | 3 | ||
| 5 | 4 | ||
| 9 | 6 | ||
| 11 | 7 | ||
| 10 | 7 | ||
| 12 | 8 | ||
| 14 | 9 | ||
| 13 | 9 | ||
| 15 | 9 | ||
| 18 | 10 | ||
| 19 | 12 | ||
| 21 | 13 | ||
| 22 | 14 | ||
| 23 | 15 | ||
| 24 | 16 | ||
| 25 | 16 | ||
| 26 | 17 | ||
| 27 | 17 | ||
| 29 | 19 | ||
| 28 | 19 | ||
| 30 | 19 | ||
| 31 | 20 | ||
| 34 | 22 |
aAMD locus number and name according to Fritsche et al.[2]. bAMD-associated genes in TWAS analysis overlapping with the locus of Fritsche et al. (2016) (ref. [2]) in a +/− 1 Mb window. cPositive effect direction (+) points to predicted gene expression being higher in AMD cases compared to controls and vice versa for negative effect direction (−). Chr = Chromosome.
AMD-associated genes located in novel loci that did not reach genome-wide significance in previous GWAS studies[2].
| Gene | Chr | Gene Expression Prediction | Strongest effect tissuea | Model Information in Strongest Effect Tissue | |||
|---|---|---|---|---|---|---|---|
| Predictable Tissues | AMD associated (FDR < 0.001) | Mean effect size (SD) | Variants in model | AMD associated (GWAS P-value < 1 × 10-04)b | |||
| 1 | 15 | 1 | −0.028 | Liver | 34 | 3 | |
| 1 | 17 | 3 | −0.016 (0.003) | Esophagus Muscularis | 59 | 5 | |
| 1 | 3 | 1 | −0.013 | Muscle Skeletal | 40 | 6 | |
| 2 | 18 | 1 | −0.015 | Esophagus Mucosa | 44 | 0 | |
| 2 | 27 | 16 | 0.037 (0.004) | Adipose Subcutaneous | 37 | 24 | |
| 2 | 6 | 4 | 0.005 (0.001) | Adipose Subcutaneous | 12 | 8 | |
| 5 | 21 | 12 | −0.013 (0.006) | Adipose Subcutaneous | 25 | 8 | |
| 6 | 17 | 1 | 0.019 | Cells Transformed fibroblasts | 22 | 1 | |
| 9 | 10 | 1 | −0.017 | Adipose Subcutaneous | 53 | 0 | |
| 11 | 2 | 1 | 0.007 | Testis | 5 | 2 | |
| 11 | 23 | 3 | −0.02 (0.005) | Skin Not Sun Exposed Suprapubic | 48 | 0 | |
| 11 | 24 | 8 | −0.016 (0.006) | Nerve Tibial | 50 | 0 | |
| 14 | 13 | 1 | 0.018 | Colon Sigmoid | 24 | 2 | |
| 15 | 17 | 1 | −0.01 | Lung | 10 | 4 | |
| 16 | 3 | 1 | 0.014 | Muscle Skeletal | 55 | 1 | |
| 19 | 11 | 1 | −0.009 | Lung | 23 | 0 | |
| 19 | 9 | 1 | 0.007 | Adrenal Gland | 13 | 0 | |
| 19 | 23 | 1 | −0.016 | Colon Sigmoid | 15 | 2 | |
aTissue, which showed the highest absolute effect size. bNumber of variants in prediction model, which were AMD-associated with a p-value smaller than 1 × 10−04 in Fritsche et al. (2016) (ref. [2]). Chr = Chromosome.
Figure 2Enriched GO biological processes in 106 genes identified (adjusted P-value < 0.05). Enrichr was used to assign gene ontology (GO) terms for the 106 AMD-associated genes and to investigate enriched GO biological processes[60]. The eight significantly enriched processes are shown, clustered into the complement cascade and lipid-related processes. Genes are given for each process. Genes colored in green indicate those which were not identified previously in AMD-associated loci.
Figure 3Work-flow for identification of pleiotropic genes. Genome-wide significant independent GWAS signals (P-value ≤ 5 × 10−8) were extracted for 82 complex traits and diseases from the corresponding publications. Additionally, variants in LD for each of the independent GWAS hits were included in the analysis. Linked variants with R2 > 0.5 demarcated the start- and stop-positions for a GWAS signal R2 Locus. Overlapping loci were merged to identify potential pleiotropic genomic regions. The genes overlapping with at least one R2 locus were identified.
Figure 4Analysis of AMD-associated genes and their overlap with pleiotropic loci. (A) Number of genes, which overlapped with R2 loci of trait groups (B) Number of AMD-associated tissues per gene as identified by TWAS and overlap of genes with at least a single R2 locus (Mann-Whitney-U-Test P-value 0.0053) (C) Trait groups shared by AMD-associated genes identified in this study (black) and by all predictable genes (grey). The latter have been scaled from in total 24,388 to 106 genes to enable a better comparability. “Other” include “Aging”, “anthropometric traits”, “Blood cells”, “cardiovascular diseases”, “Complex eye diseases and traits”, “Lifestyle”, and “Immune-related traits”. Significance was assessed through a Fisher exact test. *P-value < 0.05; **P-value < 0.01; ***P-value < 0.001.
AMD-associated genes overlapping pleiotropic loci.
| Gene | Gene Type | AMD associated tissues (FDR < 0.001) | Locus R2 [hg19] | Disease groups in locus | Single phenotypes in locus |
|---|---|---|---|---|---|
| BCAR1 | Protein coding | 2 | chr16:75233867–75516739 | Organ function; Cancer; Neurological diseases; Autoimmune diseases; Metabolic traits; AMD | FEV1FVC; LGF; PanC; MIG; T1D; T2D; AMD |
| CFDP1 | Protein coding | 4 | |||
| TMEM170A | Protein coding | 2 | |||
| BAIAP2L2 | Protein coding | 2 | chr22:38295271–38503972; chr22:38505356–38614129 | AMD; Anthropometric traits; Metabolic traits; Cancer | AMD; BFP; TG; CMM |
| ALDH1A2 | Protein coding | 1 | chr15:58671721–58692118; chr15:58718529–58742418 | Blood cells; Metabolic traits; AMD | HB; RBC; HDL; TC; TG; AMD |
| ULK3 | Protein coding | 1 | chr15:75031521–75449869 | Cardiovascular diseases; Autoimmune diseases; Blood cells | DBP; GBP; HTN; SBP; SLE; MCV; RBC |
| COL4A3 | Protein coding | 2 | chr2:228083236–228100488; chr2:228126494–228231432 | AMD; Complex eye diseases and traits | AMD; CCT |
| PMS2P1 | pseudogene | 14 | chr7:99894971–100111776 | Neurological diseases; AMD | AD; AMD |
| STAG3L5P | pseudogene | 27 | |||
| PILRB | Protein coding | 27 | |||
| PILRA | Protein coding | 26 | |||
| ZCWPW1 | Protein coding | 3 | |||
| TSC22D4 | Protein coding | 8 | |||
| NYAP1 | Protein coding | 3 | |||
| RP11.325F22.2 | lincRNA | 1 | chr7:104581402–105063372 | AMD; Neurological diseases | AMD; SCZ |
| PLEKHA1 | Protein coding | 18 | chr10:124124669–124235355 | Neurological diseases; AMD | MIG; AMD |
| ARMS2 | Protein coding | 14 | |||
| HTRA1 | Protein coding | 7 | |||
| RDH5 | Protein coding | 17 | chr12:56115585–56213297 | AMD; Complex eye diseases and traits | AMD; MYP |
| RIN3 | Protein coding | 1 | chr14:93068516–93118229 | Cancer; Organ function | BRC; BRConly; FEV1; LGF |
| CETP | Protein coding | 4 | chr16:56985514–57006829 | AMD; Metabolic traits | AMD; HDL; LDL; TC; TG |
| FUT2 | Protein coding | 1 | chr19:49158532–49252574 | Metabolic traits; Autoimmune diseases | ALP; LEP; TC; CD; IBD; GGT |
| MAMSTR | Protein coding | 1 |
AD = Alzheimers disease; ALP = LEP - alkaline phosphatase; AMD = Age-related macular degeneration; BCC = Basal cell carcinoma; BFP = Body fat percentage; BRC = Breast cancer - max. beta; BRConly = Breast cancer - only BRC; CCT = Central corneal thickness; CD = Crohns disease; CMM = Cutaneous malignant melanoma; CSCC = Cutaneous squamous cell carcinoma; DBP = Diastolic blood pressure; eGFR = estimated glomerular filtration rate of creatinine; FEV1 = forced expiratory volume in 1 second; FEV1FVC = forced expiratory volume in 1 second/forced vital capacity; GBP = General blood pressure; GGT = LEP - γ -glutamyl transferase; HB = Haemoglobin; HDL = High-density lipoprotein; HGT = Height; HTN = Hypertension; IBD = Inflammatory bowel disease; LDL = Low-density lipoprotein; LEP = Liver enzymes in plasma; LGF = Lung function; MCV = Mean cell volume; MIG = Migraine; MYP = Myopia; PanC = Pancreatic cancer; RBC = Red blood cell phenotypes; SBP = Systolic blood pressure; SCZ = Schizophrenia; SLE = Systemic lupus erythematosis; T1D = Type 1 diabetes; T2D = Type 2 diabetes; TC = Total cholesterol; TG = Triglycerides.