| Literature DB >> 33092545 |
Vladimir Babenko1,2, Roman Babenko3,4, Yuri Orlov3,4,5.
Abstract
BACKGROUND: Genome-wide association studies have identified the CDC7-TGFBR3 intergenic region on chromosome 1 to be strongly associated with optic disc area size. The mechanism of its function remained unclear until new data on eQTL markers emerged from the Genotype-Tissue Expression project. The target region was found to contain a strong silencer of the distal (800 kb) Transcription Factor (TF) gene GFI1 (Growth Factor Independent Transcription Repressor 1) specifically in neuroendocrine cells (pituitary gland). GFI1 has also been reported to be involved in the development of sensory neurons and hematopoiesis. Therefore, GFI1, being a developmental gene, is likely to affect optic disc area size by altering the expression of the associated genes via long-range interactions.Entities:
Keywords: 1000GP; Alu; CDC7; Enhancers; GFI1; GTEx; GWAS; Glaucoma; Optic disc size; TGFB3
Mesh:
Year: 2020 PMID: 33092545 PMCID: PMC7583307 DOI: 10.1186/s12863-020-00873-z
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Fig. 1Chromosome location depicts evenly distributed 15 target SNPs within sequence span of 4 kb. Green shaded are GWAS SNPs; red shaded is the causal eQTL SNP (Table 1)
15 SNPs minor frequencies for merged GWAS and GTEx SNP sets considered in the study for European (EUR), South Asian (SAS), East Asian (EAS) and African (AFR) supergroups
| 0.173 | 0.3037 | 0.1399 | 0.2103 | G | |
| 0.175 | 0.3037 | 0.1399 | 0.3829 | A | |
| rs4658103 | 0.3867 | 0.3507 | 0.1399 | 0.4276 | G |
| rs11165071 | 0.3867 | 0.3517 | 0.1399 | 0.4276 | G |
| 0.3877 | 0.3517 | 0.1399 | 0.4375 | C | |
| rs11165072 | 0.3877 | 0.3517 | 0.1399 | 0.4375 | T |
| rs12037771 | 0.3946 | 0.3569 | 0.1399 | 0.4375 | T |
| rs12036943 | 0.3877 | 0.3517 | 0.1399 | 0.4375 | A |
| rs4658246 | 0.3877 | 0.3517 | 0.1399 | 0.4375 | T |
| 0.1759 | 0.3047 | 0.1339 | 0.2927 | A | |
| rs981554 | 0.3857 | 0.3282 | 0.1359 | 0.4355 | A |
| rs6671034 | 0.3867 | 0.3272 | 0.1339 | 0.4365 | A |
| rs6604047 | 0.3867 | 0.3262 | 0.1339 | 0.4365 | A |
| rs6604048 | 0.3867 | 0.3272 | 0.1339 | 0.4365 | T |
| rs6604049 | 0.3867 | 0.3272 | 0.1339 | 0.4365 | A |
**GWAs SNPs with minimal P values: (P < 8E-17; 6E-81; 8E-56), respectively
aGTEx causal SNP
Fig. 2Graphic representation of MAF frequencies from Table 1 for European (EUR), South Asian (SAS), East Asian (EAS) and African (AFR) supergroups. GWAS SNPs are plotted with enlarged markers
Distribution of top haplotypes frequencies in 4 major supergroupsa
| GG | 0.562 | 0.86 | 0.605 | 0.642 | |
| AC | 0.209 | 0.132 | 0.172 | 0.28 | |
| GC | 0.043 | 0 | 0.21 | 0.045 | |
| SUM: | 0.814 | 0.992 | 0.987 | 0.967 |
a Supergroup denotations are given in Fig. 2. GWAS SNPs are marked with bold, eQTL causal SNP is marked in bold Cyrillic. The first allele is a major one (f > 0.5 in all supergroups), two others are minor ones. The ‘tag’ haplotype represents convoluted GWAS/eQTL 2-letter haplotype of tag SNPs (pos. 5:rs4658101, pos. 10:rs10874833). Two African–specific low frequency haplotypes AAGGTTATGAAATA (0.099) and AAGGTTATAAAATA (0.07) were omitted from consideration according to selection criterion
Fig. 3PCA plot of 3 major haplotypes (Table 2) in European (EUR), South Asian (SAS), East Asian (EAS) and African (AFR) supergroups. GWAS SNPs (Table 1) are marked in bold. eQTL casual repressive SNP rs1087483 G- > C (pos. 5) is marked in bold italics. Underlined is the high-risk allele according to GWAS reports. The low-risk allele (GWAS haplotype: AGG; on the right) is highly elevated at EAS supergroup, and is present at least at 50% in the others (Table 2)
Fig. 4Distribution of enhancers’ number given the number of tissues it is active in, and its exponential approximation based on GeneHancer elements database
Fig. 5Fifteen target SNPs (top track) spanning the region enriched with enhancer histone marks (yellow boxes manifested in H1-hESC cell line) comprising 3 GWAS SNPs (green shaded in the top track) and 12 GTEx eQTL SNPs (top track; “GFI1 pituitary” labels, middle track), as well as 6Alu sequences (bottom track). Marked are highest ranked SNPs (rs4658101, blue arrow, and rs10874833, red arrow; another one pointing to target Alu5’ region, bottom track). Enhancer GH01J091608 (‘GeneHancer Regulatory Elements’ track, middle positioned; grey shaded box) overlaps the yellow shaded box
Pairwise Pearson correlation rate of three genes based on 100 random samples of EBV-transformed lymphocytes RNA-Seq data (GTEx Consortium; v.8; https://www.gtexportal.org)
| Variables | GFI1 | TP53RK | TP53 | MALAT1 |
|---|---|---|---|---|
| GFI1 | −0.222 | −0.058 | ||
| TP53RK | −0.222 | |||
| TP53 | −0.058 | |||
| MALAT1 |
Values in bold are different from 0 with a significance level alpha = 0.05