| Literature DB >> 29854282 |
Jacqueline Chyr1,2, Dongmin Guo3, Xiaobo Zhou3,2.
Abstract
Lung squamous cell carcinoma (LSCC) is a genomically complex malignancy with no effective treatments. Recent studies have found a large number of DNA alterations such as SOX2 amplification in LSCC patients. As a stem cell transcription factor, SOX2 is important for the maintenance of pluripotent cells and may play a role in cancer. To study the downstream mechanisms of SOX2, we employed expression quantitative trait loci (eQTLs) technology to investigate how the presence of SOX2 affects the expression of target genes. We discovered unique eQTLs, such as rs798827-VDAC3 (FDR p-value = 0.0034), that are only found in SOX2-active patients but not in SOX2-inactive patients. SNP rs798827 is within strong linkage disequilibrium (r2 = 1) to rs58163073, where rs58163073 [T] allele increases the binding affinity of SOX2 and allele [TA] decreases it. In our analysis, SOX2 silencing downregulates VDAC3 in two LSCC cell lines. Chromatin conformation capturing data indicates that this SNP is located within the same Topologically Associating Domain (TAD) of VDAC3, further suggesting SOX2's role in the regulation of VDAC3 through the binding of rs58163073. By first subgrouping patients based on SOX2 activity, we made more relevant eQTL discoveries and our analysis can be applied to other diseases.Entities:
Keywords: SNP; SOX2; eQTL; lung cancer; topologically associating domain
Year: 2018 PMID: 29854282 PMCID: PMC5976468 DOI: 10.18632/oncotarget.24918
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Figure 1Brief overview of workflow
Left: LSCC patients are first clustered into two groups: SOX2-active and SOX2-inactive based on their SOX2 gene expression, copy number variation, and methylation. Matrix eQTL analyses are performed on each group of patients to identify group-specific SNP-gene pairs. Only the eQTL pairs with genes that are downregulated after SOX2 silencing are included in our analysis. Only the SNP-gene pairs that are unique in SOX2 active patients are considered. Right: Further analyses on SOX2 motifs, ChIP-seq binding peaks, and topologically associating domains are conducted to validate a SOX2→ SNP→ Gene relationship.
Figure 2Patient genomic data
Multiple genomic datatypes are used to group SOX2 patients into two groups. The highlighted bars in the histograms indicate patients with high SOX2 gene expression (A), SOX2 copy number amplification (B), and SOX2 promoter-region hypomethylation (C). Patients with SOX2 expression > 10.88, copy number segment means > 0.5, and methylation β-values ≤ 0.4 are considered as SOX2 active patients. A total number of 366 patents has all three datatypes available and were included in our analysis. (D) Of the 366 patients, 159 patients were considered as SOX2-active, all other patients are considered as SOX2-inactive.
Figure 3Gene expression profile of two SOX2-silenced LSCC cell lines
SOX2 was knocked down in two LSCC cell lines: H520 and LK2 and gene expression was profiled using an array. (A) Top 266 downregulated genes are shown in a heat map. Downregulated genes are defined as genes with combined value differences between SOX2 siRNA and control > 1.0. Yellow color represents higher expression and dark blue color represents lower expression. (B) VDAC3's normalized expression values are shown as an example. All samples were normalized to 0.
eQTLs (FDR p value < 0.01) in SOX2-active patients
| SNP | Allele A | Allele B | Gene | FDR | |
|---|---|---|---|---|---|
| rs4654947 | C | T | NBPF3 | 4.60E-11 | 3.45E-07 |
| rs9984519 | C | T | IFNAR1 | 1.38E-07 | 5.59E-04 |
| rs17420195 | C | T | NBPF3 | 4.18E-07 | 1.45E-03 |
| rs2465941 | C | T | ZCCHC12 | 8.23E-07 | 2.60E-03 |
| rs2290163 | C | T | LMCD1 | 8.48E-07 | 2.67E-03 |
| rs4747471 | A | G | MSRB2 | 1.22E-06 | 3.63E-03 |
| rs16913776 | C | G | RAB38 | 1.31E-06 | 3.83E-03 |
| rs7624916 | C | G | ARL6IP5 | 1.48E-06 | 4.27E-03 |
| rs10968209 | A | G | MOBKL2B | 1.49E-06 | 4.27E-03 |
| rs16850158 | A | G | ALCAM | 1.87E-06 | 5.14E-03 |
| rs12057041 | C | T | MOBKL2B | 3.20E-06 | 7.98E-03 |
| rs10968456 | C | T | MOBKL2B | 3.20E-06 | 7.98E-03 |
| rs4790508 | A | C | CRK | 3.37E-06 | 8.33E-03 |
| rs308819 | A | C | RAB38 | 4.00E-06 | 9.57E-03 |
| rs308814 | C | T | RAB38 | 4.00E-06 | 9.57E-03 |
Figure 4SOX2 regulates VDAC3 expression
(A) VDAC3 expression is significantly higher in SOX2-active patients compared to SOX2-inactive patients, t-test p = 0.0031. (B) In SOX2-active patients, the SNP genotype is associated with a significant difference in VDAC3 expression, ANOVA t-test p = 6.91E-08. In SOX2-inactive patients, the difference is not present, ANOVA t-test p = 0.459. (C) SOX2 and H3K27ac ChIP-seq peaks for cell line HCC95 are shown for the promoter region of VDAC3.
Seven LD SNPs are located within and alters a SOX2 binding motif
| SNP | LD SNP | Ref | Alt | D’ | AFR | AMR | ASN | EUR | Motifs | |
|---|---|---|---|---|---|---|---|---|---|---|
| rs798827 | rs58163073 | T | TA | –1 | 1 | 0.7 | 0.83 | 0.85 | 0.97 | Cart1, Dbx1, Foxa2, Foxp1, HDAC2, Ncx2, |
| rs4747471 | rs199772546 | TA | T | 1 | 1 | 0.04 | 0.02 | 0.21 | 0 | Arid3a2, Dbx1, Dbx2, FAC1, Foxa2, Foxa4, Foxj2, Foxk1, Foxo2, Foxp1, HNF1, Hlx1, Hoxa10, Hoxa5, Hoxc6, Hoxd8, Lhx3, Mef2, Msx-1, Nanog, Ncx2, Nkx6-1, PLZF, Pax-6, Pou2f2, Pou3f2, Pou3f4, Pou4f3, Prrx1, Sox13, Sox18, Sox19, |
| rs4747471 | rs200774383 | AAT | A | 1 | 1 | 0.04 | 0.02 | 0.21 | 0 | CDP7, Dbx1, Dbx2, Evi-1, FAC1, Foxa2, Foxa4, Foxj2, Foxk1, Foxo2, Foxp1, HNF1, Hlx1, Hoxa10, Hoxa5, Hoxd8, Lhx3, Lhx3, Mef2, Nanog, Ncx_2, Nkx6-1, Nkx6-2, PLZF, Pax-6, Pou2f2, Pou3f4, Pou6f1, Prrx1, Sox13, Sox18, Sox19, |
| rs4747471 | rs7087230 | C | T | 1 | 1 | 0.3 | 0.06 | 0.28 | 0 | Fox, Foxk1, Foxp1, Hoxa10, Hoxd8, Lhx3, Pou2f2, Sox18, |
| rs4747471 | rs12217320 | A | C | 1 | 1 | 0.03 | 0.02 | 0.25 | 0 | FAC1, Foxa4, Foxd3, Foxk1, Foxo1, Foxo2, Foxp1, HDAC2, Irf, Mef2, Nanog, RREB-1, Sox13, |
| rs4747471 | rs1398027 | G | C | 1 | 1 | 0.002 | 0.02 | 0.25 | 0 | |
| rs12057041 | rs10968463 | C | T | 1 | 1 | 0 | 0.03 | 0.2 | 0.01 | Foxj2, |
| rs10968456 | rs10968463 | C | T | 1 | 1 | 0 | 0.03 | 0.2 | 0.01 | Foxj2, |
The eQTL SNP, LD SNP, and reference and alternative alleles of the LD SNPs are shown. The coefficient of linkage disequilibrium (D`) and the square of correlation coefficient (r2) are shown along with the frequency of the alternative allele in African (AFR), American (AMR), Asian (ASN), and European (EUR) populations. Motifs whose position weight matrix scores are affected by the LD SNPs are listed.
Figure 5SNP rs58163073 alters the PWM of SOX2
The flanking sequences of rs58163073 from chromosome 8 position 42904940 +/– 29 nucleotides are shown for the reference and alternative alleles. The position weight matrix (PWM) score are obtained from HaploReg4.1. The reference [T] allele has a stronger binding affinity for SOX2 than the alternative [TA].
Figure 6VDAC3 and rs58163073 are located within the same TAD in lung cell line
The Hi-C heat map of lung cell line IMR90 is shown for chr8:41000000–44000000, resolution 10 kb. The location of VDAC3 and rs58163073 are indicated with dotted lines. The different TADs are marked with pale yellow and blue bars below the heat map. The locations of other genes are shown.
Figure 7Hi-C heat maps for lung cancer cell line and lung tissues
Hi-C heat maps of lung cancer cell line A549 and two lung tissue samples are show for chr8:41000000–44000000, resolution 40 kb. VDAC3 gene and SNP rs58163073 are marked with a dotted line.
Figure 8LSCC clinical analysis
SNP rs798827 minor allele frequency (MAF) is shown for different races in LSCC patients (A) and in SOX2-active patients (B). Frequency of tumor location (left or right) is shown for patients with the major allele and minor allele (C).
Association of SNP rs798827 and clinical features in LSCC patients
| Major allele | Minor allele | |
|---|---|---|
| 5% | 36% | |
| 93% | 60% | |
| 2% | 5% | |
| 18% | 14% | |
| 30% | 11% | |
| 24% | 32% | |
| 29% | 43% | |
| 39 | 40 | |
| 62 | 63 | |
| 68 | 69 | |
| 67.22 | 67.72 | |
| 73 | 74.75 | |
| 90 | 84 | |
| 48% | 57% | |
| 33% | 25% | |
| 17% | 16% | |
| 1% | 2% |
Figure 9Graphical summary of the regulation of VDAC3 by SOX2
In SOX2-active patients, the genotype of SNP rs798827 is associated to the expression of VDAC3. This SNP is within strong LD to SNP rs58163073 which overlaps with the binding motif of SOX2. This mechanism is only seen in patients with SOX2 activity. Positions of genes and SNPs are not drawn to scale.