| Literature DB >> 34899825 |
Hongxiao Jiao1, Miaomiao Zhang2, Yuan Zhang2,3, Yaogang Wang3, Wei-Dong Li2.
Abstract
As a marker for glomerular filtration, plasma cystatin C level is used to evaluate kidney function. To decipher genetic factors that control the plasma cystatin C level, we performed genome-wide association and pathway association studies using United Kingdom Biobank data. One hundred fifteen loci yielded p values less than 1 × 10-100, three genes (clusters) showed the most significant associations, including the CST8-CST9 cluster on chromosome 20, the SH2B3-ATXN2 gene region on chromosome 12, and the SHROOM3-CCDC158 gene region on chromosome 4. In pathway association studies, forty significant pathways had FDR (false discovery rate) and or FWER (family-wise error rate) ≤ 0.001: spermatogenesis, leukocyte trans-endothelial migration, cell adhesion, glycoprotein, membrane lipid, steroid metabolic process, and insulin signaling pathways were among the most significant pathways that associated with the plasma cystatin C levels. We also performed Genome-wide association studies for eGFR, top associated genes were largely overlapped with those for cystatin C.Entities:
Keywords: United Kingdom biobank; cystatin C; genetics; genome wide association; pathway association studies
Year: 2021 PMID: 34899825 PMCID: PMC8656399 DOI: 10.3389/fgene.2021.711155
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Genome-wide association study for plasma cystatin C (P < 1 × 10−150).
| CHR | SNP | Position (bp) | MAF | A1 | A2 | R2 (all) | T (all) |
|
|
| SNP annotation | Gene (in or near)c | Distanced (kb) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 20 | rs1158167 | 23578189 | 0.232 | G | A | 0.0369 | −132.7 | 0a | 0a | 0a | intergenic |
| 4.8 |
| 20 | rs2983640 | 23586360 | 0.408 | G | A | 0.0155 | −84.96 | 0a | 0a | 0a | missense mutation |
| |
| 20 | rs2983641 | 23586977 | 0.178 | T | C | 0.0024 | 33.15 | 1.29E-240 | 0a | 0a | upstream |
| 2.0 |
| 20 | rs77114334 | 23565975 | 0.096 | T | C | 0.0022 | 31.6 | 6.35E-219 | 1.328E-188 | 1.201E-238 | intergenic |
| 17.0 |
| 20 | rs2273378 | 23476389 | 0.115 | G | A | 0.0048 | −46.87 | 0a | 0a | 0a | intronic |
| |
| 20 | rs3004118 | 23487503 | 0.462 | T | C | 0.0029 | 36.47 | 8.99E-291 | 6.102E-238 | 3.119E-288 | non-coding variant |
| |
| 20 | rs73610708 | 23539846 | 0.085 | T | C | 0.0049 | −46.87 | 0a | 0a | 0a | intergenic |
| 5.5 |
| 20 | rs73102389 | 23635691 | 0.178 | A | C | 0.0043 | 44.5 | 0a | 0a | 0a | intergenic |
| 21.3 |
| 20 | rs112308292 | 23667865 | 0.122 | A | C | 0.0149 | −83.18 | 0a | 0a | 0a | intronic |
| |
| 20 | rs13039144 | 23633755 | 0.177 | G | A | 0.0312 | −121.6 | 0a | 0a | 0a | intergenic |
| 19.4 |
| 20 | rs66590796 | 23691153 | 0.111 | T | G | 0.0061 | 52.79 | 0a | 0a | 0a | intergenic |
| 24.8 |
| 20 | rs7266357 | 23700824 | 0.164 | G | T | 0.0029 | 36.51 | 2.42E-291 | 0a | 0a | intergenic |
| 27.3 |
| 20 | rs6114264 | 23746310 | 0.137 | T | C | 0.0092 | −64.74 | 0a | 0a | 0a | intergenic |
| 7.1 |
| 20 | rs67418849 | 23740511 | 0.091 | A | G | 0.0037 | 41.41 | 0a | 0a | 0a | intergenic |
| 12.3 |
| 20 | rs78916169 | 23405904 | 0.054 | T | G | 0.0021 | 30.97 | 1.99E-210 | 3.064E-187 | 2.144E-230 | intergenic |
| 14.4 |
| 20 | rs761725 | 23402735 | 0.110 | A | G | 0.0017 | 28.06 | 4.35E-173 | 7.723E-144 | 1.093E-178 | upstream |
| 2.0 |
| 20 | rs2983608 | 23642839 | 0.205 | C | T | 0.0142 | −81.47 | 0a | 0a | 0a | intronic |
| |
| 20 | rs13040731 | 23642389 | 0.121 | C | T | 0.0018 | −28.57 | 2.32E-179 | 9.755E-204 | 3.049E-242 | intronic |
| |
| 20 | rs6106728 | 23801950 | 0.210 | T | C | 0.0030 | −37.41 | 8.9E-306 | 3.5E-305 | 0a | downstream |
| 0.5 |
| 20 | rs3004096 | 23305768 | 0.334 | T | G | 0.0023 | 31.6 | 5.69E-219 | 3.233E-177 | 6.695E-209 | intronic |
| |
| 20 | rs6048704 | 23296209 | 0.251 | T | C | 0.0021 | −31.18 | 3.44E-213 | 2.037E-207 | 5.996E-249 | intronic |
| |
| 20 | rs6049135 | 23783368 | 0.260 | C | T | 0.0020 | 30.17 | 9.34E-200 | 2.396E-154 | 2.918E-184 | upstream |
| 2.0 |
| 12 | rs653178 | 112007756 | 0.464 | C | T | 0.0024 | 33.05 | 3.49E-239 | 3.497E-169 | 5.568E-192 | intronic |
| |
| 12 | rs3184504 | 111884608 | 0.464 | T | C | 0.0024 | 33.32 | 3.9E-243 | 4.023E-173 | 2.466E-195 | missense mutation |
| |
| 4 | rs17319721 | 77368847 | 0.426 | A | G | 0.0024 | 33.37 | 8.61E-244 | 1.624E-203 | 7.16E-232 | intronic |
| |
| 4 | rs13106227 | 77418681 | 0.364 | G | A | 0.0016 | −26.91 | 2.1E-159 | 3.242E-117 | 5.32E-124 | intronic |
| |
| 4 | rs907446 | 77254804 | 0.424 | T | C | 0.0017 | 27.63 | 6.96E-168 | 4.319E-113 | 9.297E-129 | intronic |
| 4.8 |
Associations with p-value < 1 × 10−150 are shown above. The chromosome (CHR) and base pair position are given with regards to the GRCh37 genome reference sequence. SNP, single nucleotide polymorphism; MAF, minor allele frequency; A1, Allele 1 code (minor allele); A2, Allele 2 code (major allele); R2, regression r-squared; T, Wald test (based on t-distribution). a P-value<1 × 10−308. b p-value for Caucasian adjusted for gender and age. cGenes within 40 kb were based on RefSeq genes. d Distance from nearest genes to which they were annotated.
FIGURE 1Manhattan Plot for genome wide association study of cystatin C. Variants are positioned according to the GRCh37 genome reference sequence. Associations with p-value < 1 × 10−308 were not displayed in normal scale.
FIGURE 2Manhattan Plots for cystatin C, creatinine, and eGFR genome wide association studies. Variants are positioned according to the GRCh37 genome reference sequence.
FIGURE 3Top significant pathways/gene sets that associated with cystatin C levels. Results passing multiple hypothesis tests (false-discovery rate (FDR) and or family-wise error rate (FWER) ≤ 0.001) that identified by pathway association studies were shown in the figure. The NES(Wang et al., 2007) denotes normalized enrichment score (ES) which reflects the overrepresentation of gene sets at the top of the entire ranking list of genes in the genome.
Pathway association studies for plasma cystatin C.
| Pathway ID | Size | ES | NES | Empirical | FDR | FWER | Database | Description |
|---|---|---|---|---|---|---|---|---|
| hsa03010 | 56 | 0.253 | 8.176 | 0.000 | 0.000 | 0.000 | KEGG | Ribosome |
| hsa04670 | 99 | 0.424 | 7.464 | 0.000 | 0.000 | 0.000 | KEGG | Leukocyte transendothelial migration |
| hsa00230 | 132 | 0.331 | 6.561 | 0.000 | 0.000 | 0.000 | KEGG | Purine metabolism |
| hsa04120 | 115 | 0.304 | 6.503 | 0.000 | 0.000 | 0.000 | KEGG | Ubiquitin mediated proteolysis |
| hsa04810 | 182 | 0.334 | 6.176 | 0.000 | 0.000 | 0.001 | KEGG | Regulation of actin cytoskeleton |
| hsa04530 | 120 | 0.386 | 6.121 | 0.000 | 0.000 | 0.001 | KEGG | Tight junction |
| hsa04910 | 124 | 0.305 | 5.986 | 0.000 | 0.000 | 0.001 | KEGG | Insulin signaling pathway |
| hsa00564 | 58 | 0.410 | 5.786 | 0.000 | 0.000 | 0.001 | KEGG | Glycerophospholipid metabolism |
| hsa04370 | 69 | 0.347 | 5.722 | 0.000 | 0.000 | 0.001 | KEGG | VEGF signaling pathway |
| hsa04514 | 121 | 0.437 | 5.534 | 0.000 | 0.000 | 0.001 | KEGG | Cell adhesion molecules (CAMs) |
| hsa04640 | 79 | 0.361 | 5.451 | 0.000 | 0.000 | 0.001 | KEGG | Hematopoietic cell lineage |
| ppara Pathway | 55 | 0.405 | 4.919 | 0.000 | 0.000 | 0.000 | BioCarta | Mechanism of Gene Regulation by Peroxisome Proliferators via PPARa (alpha) |
| par1Pathway | 35 | 0.514 | 4.523 | 0.001 | 0.001 | 0.002 | BioCarta | Thrombin signaling and protease-activated receptors |
| mapk Pathway | 81 | 0.313 | 4.397 | 0.000 | 0.001 | 0.003 | BioCarta | Mapkinase Signaling Pathway |
| nfat Pathway | 53 | 0.360 | 4.180 | 0.000 | 0.001 | 0.003 | BioCarta | NFAT and Hypertrophy of the heart (Transcription in the broken heart) |
| GO0007283 | 185 | 0.315 | 9.989 | 0.000 | 0.000 | 0.000 | GO | Spermatogenesis |
| GO0006414 | 82 | 0.256 | 9.540 | 0.000 | 0.000 | 0.000 | GO | Translational elongation |
| GO0005125 | 195 | 0.315 | 9.211 | 0.000 | 0.000 | 0.000 | GO | Cytokine activity |
| GO0051336 | 186 | 0.313 | 8.028 | 0.000 | 0.000 | 0.000 | GO | Regulation of hydrolase activity |
| GO0007067 | 141 | 0.319 | 8.010 | 0.000 | 0.000 | 0.000 | GO | Mitotic cell cycle |
| GO0012502 | 189 | 0.308 | 7.591 | 0.000 | 0.000 | 0.000 | GO | Induction of programmed cell death |
| GO0016747 | 139 | 0.317 | 7.348 | 0.000 | 0.000 | 0.000 | GO | Transferase activity, transferring acyl groups other than amino-acyl groups |
| GO0006643 | 174 | 0.300 | 7.264 | 0.000 | 0.000 | 0.000 | GO | Membrane lipid metabolic process |
| GO0016298 | 83 | 0.406 | 7.136 | 0.000 | 0.000 | 0.000 | GO | Lipase activity |
| GO0008202 | 126 | 0.340 | 7.033 | 0.000 | 0.000 | 0.000 | GO | Steroid metabolic process |
| GO0016881 | 158 | 0.288 | 6.938 | 0.000 | 0.000 | 0.000 | GO | Acid-amino acid ligase activity |
| GO0055086 | 194 | 0.285 | 6.929 | 0.000 | 0.000 | 0.000 | GO | Nucleobase-containing small molecule metabolic process |
| GO0008236 | 155 | 0.300 | 6.902 | 0.000 | 0.000 | 0.000 | GO | Serine-type peptidase activity |
| GO0051169 | 131 | 0.303 | 6.796 | 0.000 | 0.000 | 0.000 | GO | Nuclear transport |
| GO0032268 | 189 | 0.296 | 6.579 | 0.000 | 0.000 | 0.000 | GO | Regulation of cellular protein metabolic process |
| GO0006520 | 187 | 0.272 | 6.576 | 0.000 | 0.000 | 0.000 | GO | Cellular amino acid metabolic process |
| GO0046982 | 127 | 0.357 | 6.565 | 0.000 | 0.000 | 0.000 | GO | Protein heterodimerization activity |
| GO0051249 | 88 | 0.363 | 6.421 | 0.000 | 0.000 | 0.000 | GO | Regulation of lymphocyte activation |
| GO0009100 | 133 | 0.372 | 6.412 | 0.000 | 0.000 | 0.000 | GO | Glycoprotein metabolic process |
| GO0030097 | 175 | 0.344 | 6.363 | 0.000 | 0.000 | 0.000 | GO | Hemopoiesis |
| GO0032990 | 161 | 0.377 | 6.326 | 0.000 | 0.000 | 0.000 | GO | Cell part morphogenesis |
| GO0005099 | 78 | 0.430 | 6.179 | 0.000 | 0.000 | 0.000 | GO | GTPase activator activity |
| GO0008544 | 117 | 0.333 | 6.078 | 0.000 | 0.000 | 0.000 | GO | Epidermis development |
| GO0015291 | 187 | 0.339 | 6.062 | 0.000 | 0.000 | 0.000 | GO | Secondary active transmembrane transporter activity |
| GO0033043 | 80 | 0.365 | 5.970 | 0.000 | 0.000 | 0.000 | GO | Regulation of organelle organization |
In pathway association studies, top forty (40) significant pathways/gene sets that significantly related to cystatin C levels were enriched from KEGG, GO, and BioCarta. ES, enrichment score; NES, normalized enrichment score; FDR, false-discovery rate; FWER, family-wise error rate.