| Literature DB >> 30127857 |
Jing Zhang1, Dan Wu1, Yiqin Dai1, Jianjiang Xu1.
Abstract
BACKGROUND: The genetic architecture underlying central cornea thickness (CCT) is far from understood. Most of the CCT-associated variants are located in the non-coding regions, raising the difficulty of following functional characterizations. Thus, integrative functional analyses on CCT-associated loci might benefit in overcoming these issues by prioritizing the hub genes that are located in the center of CCT genetic network.Entities:
Keywords: Central cornea thickness; GWAS; Integrative analyses; Regulatory variants
Year: 2018 PMID: 30127857 PMCID: PMC6094462 DOI: 10.1186/s13040-018-0179-3
Source DB: PubMed Journal: BioData Min ISSN: 1756-0381 Impact factor: 2.522
Fig. 1Study design of the current integrative analyses
Fig. 2Distribution of (a) CADD, (b) GWAVA, and (c) Eigen scores. Histograms are drawn based on CADD, GWAVA, and Eigen scores of all the variants (both lead GWAS SNPs and LD variants) after removing the missing values
Fig. 3Comparison of functional annotation scores of CCT-associated GWAS SNPs versus LD variants. a CADD scores for GWAS SNPs (n = 34) versus LD variants (n = 587); b GWAVA scores for GWAS SNPs (n = 41) versus LD variants (n = 654); c Eigen scores for GWAS SNPs (n = 43) versus LD variants (n = 645). In the boxplots, centerlines indicated the medians of the values and box limits showed the 25th &75th percentiles (as determined by R package). Whiskers extended to 5th and 95th percentiles and outliers were shown by open circle dots. The notches were defined as ±1.58 × IQR (interquartile range)/square root of n and indicated the 95% confidence interval for each median. P-values were calculated using two-sided Wilcoxon rank sum test
List of prioritized putative regulatory SNPs
| Variant | Chr | Position | LD (r2) | Lead SNP | Ref | Alt | CADD | GWAVA | Eigen-PC | Genes |
|---|---|---|---|---|---|---|---|---|---|---|
| rs2307121 | 5 | 65,329,685 | 1 | rs2307121 | C | T | 15.53 | 0.57 | 1.76 | ADAMTS6 |
| rs2393729 | 10 | 62,077,257 | 0.98 | rs7090871 | C | T | 13.08 | 0.41 | 2.05 | ARID5B |
| rs2721051 | 13 | 40,536,747 | 1 | rs2721051 | C | T | 11.62 | 0.58 | 0.65 | FOXO1 |
| rs4843047 | 15 | 85,316,545 | 0.83 | rs6496932 | G | A | 10.22 | 0.43 | 0.67 | AKAP13 |
| rs57817160 | 2 | 227,314,522 | 0.92 | rs7606754 | G | A | 16.96 | 0.69 | 0.91 | COL4A3 |
| rs59257065 | 2 | 227,314,612 | 0.91 | rs7606754 | C | T | 15.72 | 0.64 | 0.91 | COL4A3 |
| rs6443477 | 3 | 177,590,145 | 0.99 | rs7620503 | A | C | 14.6 | 0.57 | 1.70 | TBL1XR1, KCMB2 |
| rs6693322 | 1 | 36,099,761 | 0.8 | rs96067 | T | C | 10.78 | 0.49 | 1.26 | COL8A2 |
| rs6769466 | 3 | 177,590,226 | 0.99 | rs7620503 | G | A | 15.53 | 0.55 | 1.72 | TBL1XR1, KCMB2 |
| rs6862857 | 5 | 65,334,310 | 1 | rs2307121 | G | C | 13.57 | 0.58 | 1.54 | ADAMTS6 |
Position (hg19). Ref: Reference allele. Alt: Alternative allele. Cutoff: CADD≥10, GWAVA ≥0.4 and Eigen-PC≥ 0
Top 5 terms of the Gene Ontology, Pathway, and Disease enrichment of 38 CCT-associated genes
| Term | Name | FDR B&H | |
|---|---|---|---|
| GO: Molecular Function | |||
| GO:0000988 | transcription factor activity, protein binding | 1.47E-04 | 3.62E-02 |
| GO:0005201 | extracellular matrix structural constituent | 4.54E-04 | 3.88E-02 |
| GO:0033293 | monocarboxylic acid binding | 4.72E-04 | 3.88E-02 |
| GO:0008013 | beta-catenin binding | 6.68E-04 | 4.13E-02 |
| GO:0000989 | transcription factor activity, transcription factor binding | 1.01E-03 | 4.96E-02 |
| GO: Biological Process | |||
| GO:0061448 | connective tissue development | 6.57E-07 | 9.12E-04 |
| GO:0072359 | circulatory system development | 3.05E-06 | 1.41E-03 |
| GO:0072358 | cardiovascular system development | 3.05E-06 | 1.41E-03 |
| GO:0001501 | skeletal system development | 6.12E-06 | 2.12E-03 |
| GO:0048646 | anatomical structure formation involved in morphogenesis | 2.15E-05 | 5.58E-03 |
| GO: Cellular Component | |||
| GO:0005604 | basement membrane | 5.18E-05 | 7.82E-03 |
| GO:0044420 | extracellular matrix component | 1.41E-04 | 1.06E-02 |
| GO:0016327 | apicolateral plasma membrane | 5.81E-04 | 1.84E-02 |
| GO:0043218 | compact myelin | 5.81E-04 | 1.84E-02 |
| GO:0005581 | collagen trimer | 6.08E-04 | 1.84E-02 |
| Pathway | |||
| M3005 | Genes encoding collagen proteins | 1.41E-04 | 3.09E-02 |
| 1,470,926 | Collagen chain trimerization | 1.72E-04 | 3.09E-02 |
| 83,105 | Pathways in cancer | 2.51E-04 | 3.09E-02 |
| M17668 | Rho-Selective Guanine Exchange Factor AKAP13 Mediates Stress Fiber Formation | 3.41E-04 | 3.09E-02 |
| 1,270,247 | Assembly of collagen fibrils and other multimeric structures | 3.56E-04 | 3.09E-02 |
| Disease | |||
| C0022578 | Keratoconus | 6.92E-11 | 8.68E-08 |
| C1096274 | Corneal thinning | 1.21E-05 | 7.56E-03 |
| C0544008 | Chandler syndrom | 6.00E-05 | 1.88E-02 |
| C1857569 | Corneal endothelial dystrophy 2 | 6.00E-05 | 1.88E-02 |
| C3542501 | Acute inflammatpry demyelinating polyneuropathy | 1.12E-04 | 2.78E-02 |
The list of CCT-associated genes was set as the input seed. Enrichment P-values were calculated using probability density function, and the Benjamini-Hochberg procedure was used to correct the P values (FDR) of the enriched GO terms
Summary of the enrichment scores for genes prioritized for keratoconus (KC) using a combination of CADD, GWAVA, and Eigen prediction tool
| Prediction tool | Fold Enrichment | FDR B&H |
|---|---|---|
| CADD | 32.2 | 9.54E-03 |
| GWAVA | 23.9 | 8.17E-03 |
| Eigen-PC | 20.1 | 2.14E-02 |
| CADD+GWAVA+Eigen-PC | 77.9 | 8.68E-08 |
Fold-enrichment was calculated as ((m/n)/(M/N), where m is the number of prioritized genes (with a tool or a combination of tools) belong to KC, n is the total number of genes prioritized using a tool (or a combination of tools), M is the total number of genes associated with KC, and N is the human genome background (around 20,000 genes). Enrichment P-values were calculated using probability density function, and the Benjamini-Hochberg procedure was used for multiple testing corrections
Fig. 4Protein-protein interaction (PPI), co-expression network analyses using GeneMANIA. Genes were shown as nodes and edges indicated different types of interaction among genes. Black circles were the query genes and the colors coding on edges indicated different types of interaction. The eight prioritized genes were labeled with red circles