Literature DB >> 28400551

Genetic variants of PTPN2 are associated with lung cancer risk: a re-analysis of eight GWASs in the TRICL-ILCCO consortium.

Yun Feng1,2,3, Yanru Wang2,3, Hongliang Liu2,3, Zhensheng Liu2,3, Coleman Mills2,3, Younghun Han4, Rayjean J Hung5, Yonathan Brhane5, John McLaughlin6, Paul Brennan7, Heike Bickeboeller8, Albert Rosenberger8, Richard S Houlston9, Neil E Caporaso10, Maria Teresa Landi10, Irene Brueske11, Angela Risch12, Yuanqing Ye13, Xifeng Wu13, David C Christiani14, Christopher I Amos4, Qingyi Wei15,16.   

Abstract

The T-cell protein tyrosine phosphatase (TCPTP) pathway consists of signaling events mediated by TCPTP. Mutations and genetic variants of some genes in the TCPTP pathway are associated with lung cancer risk and survival. In the present study, we first investigated associations of 5,162 single nucleotide polymorphisms (SNPs) in 43 genes of this TCPTP pathway with lung cancer risk by using summary data of six published genome-wide association studies (GWAS) of 12,160 cases and 16,838 controls. We identified 11 independent SNPs in eight genes after correction for multiple comparisons by a false discovery rate <0.20. Then, we performed in silico functional analyses for these 11 SNPs by eQTL analysis, two of which, PTPN2 SNPs rs2847297 and rs2847282, were chosen as tagSNPs. We further included two additional GWAS datasets of Harvard University (984 cases and 970 controls) and deCODE (1,319 cases and 26,380 controls), and the overall effects of these two SNPs among all eight GWAS studies remained significant (OR = 0.95, 95% CI = 0.92-0.98, and P = 0.004 for rs2847297; OR = 0.95, 95% CI = 0.92-0.99, and P = 0.009 for rs2847282). In conclusion, the PTPN2 rs2847297 and rs2847282 may be potential susceptible loci for lung cancer risk.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28400551      PMCID: PMC5429754          DOI: 10.1038/s41598-017-00850-0

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Lung cancer is one of the most common human malignancies and the leading cause of cancer-related deaths in both men and women[1]. It is estimated that 224,390 new lung cancer cases will be diagnosed in the United States in 2016[2]. Lung cancer risk likely results from joint effects and interactions of environmental and genetic factors. Single nucleotide polymorphisms (SNPs) are the most common genetic variants and have been shown to be associated with lung cancer risk[3]. Genome-wide association studies (GWAS) have identified 30 loci in 13 genomic regions to be associated with lung cancer risk[4-15]. However, most of the SNPs identified to date have not been shown to be functional. Other approaches to GWAS including pathway-based analysis with reduced dimension or multiple testing have been emerged to identify possible functional SNPs associated with lung cancer risk. The T-cell protein tyrosine phosphatase (TCPTP/PTPN2) is an important member of the protein-tyrosine phosphatase (PTP) family. Activating and deactivating mutations in PTP genes often result in enzymes that can either promote or suppress oncogenesis. The TCPTP pathway consists of signaling events mediated by TCPTP through negative regulation of several receptor tyrosine kinases such as epidermal growth factor receptor (EGFR)[16], vascular endothelial growth factor receptor-2 (VEGFR2)[17], platelet-derived growth factor receptor beta (PDGFRβ)[18], signal transducer and activator of transcription subtypes 1 (STAT1)[19], 3 (STAT3)[20], and 6 (STAT6)[21], and the insulin receptor[22]. Studies have shown that mutations and genetic variants of some genes in the TCPTP pathway are associated with lung cancer risk and survival[23, 24]. However, SNPs in many candidate genes in the pathway have not been studied and reported. In the present study, we systematically investigated all potentially functional SNPs in TCPTP pathway genes by assessing their associations of lung cancer risk using eight published lung cancer GWAS datasets.

Results

Analysis of six GWAS datasets

Overall, 5162 SNPs from 43 TCPTP pathway genes in the six GWAS datasets from the Transdisciplinary Research in Cancer of the Lung and The International Lung Cancer Consortium (TRICL-ILCCO) Consortium were identified, and their associations with lung cancer risk are shown in the Manhattan plot (Fig. 1A). After multiple-testing correction, 112 SNPs in eight genes (ATR, EGFR, MET, PIK3R1, PIK3R3, PTPN2, STAT3, and STAT5A) remained significantly associated with lung cancer risk with FDR <0.20. The results of associations with lung cancer risk are summarized in Supplementary Table S2. Based on LD analysis (r2 > 0.30) and online functional prediction analyses by using SNPinfo, RegulomeDB, and HaploReg, we selected to perform additional analyses for 11 SNPs: rs11707731 in ATR; rs845553, rs1140762 and rs17172432 in EGFR; rs34280975 in MET; rs706714 in PIK3R1; rs7538978 in PIK3R3; rs2847297 and rs2847282 in PTPN2; rs3744483 in STAT3; rs1135669 in STAT5A for further study (Supplementary Figure S1 and Supplementary Table S3).
Figure 1

Screening of SNPs in the TCPTP pathway. (A) Manhattan plot of genome-wide association results of 5,162 SNPs in 43 TCPTP pathway genes and lung cancer risk in the TRICL-ILCCO Consortium. SNPs are plotted on the X-axis according to their positions on each chromosome. The association P values with lung cancer risk are shown on the Y-axis (as −log10 (P) values). The horizontal red line represents FDR threshold 0.20. The horizontal blue line represents P value of 0.05; (B) SNPs in PTPN2 with 500 kb up- and downstream of the gene region and (C) LD plots of the SNPs in PTPN2 with FDR <0.20. In B, the left-hand y-axis shows the association P value of each SNP, which is plotted as −log10 (P) against chromosomal base pair position; the right-hand y-axis shows the recombination rate estimated from the hg19/1000 Genomes European population.

Screening of SNPs in the TCPTP pathway. (A) Manhattan plot of genome-wide association results of 5,162 SNPs in 43 TCPTP pathway genes and lung cancer risk in the TRICL-ILCCO Consortium. SNPs are plotted on the X-axis according to their positions on each chromosome. The association P values with lung cancer risk are shown on the Y-axis (as −log10 (P) values). The horizontal red line represents FDR threshold 0.20. The horizontal blue line represents P value of 0.05; (B) SNPs in PTPN2 with 500 kb up- and downstream of the gene region and (C) LD plots of the SNPs in PTPN2 with FDR <0.20. In B, the left-hand y-axis shows the association P value of each SNP, which is plotted as −log10 (P) against chromosomal base pair position; the right-hand y-axis shows the recombination rate estimated from the hg19/1000 Genomes European population.

Functional validation by eQTL analysis 21

We assessed associations between the 11 SNPs and mRNA expression levels by using the genotyping and expression data available from the lymphoblastoid cell lines derived from 373 individuals of European descent (http://www.1000genomes.org/), and we found that only rs2847297 and rs2847282 were associated with expression levels of PTPN2 in additive, dominant and recessive models (Table 1). Regional association plots for rs2847297 and rs2847282 in 500 kb up- and downstream region were shown in Fig. 1B. The SNP rs2847297 was in a low LD with rs2847282 (Fig. 1C). PTPN2 mRNA expression levels were significantly decreased with an increased number of the rs2847297 G allele in additive (P = 0.002) (Fig. 2A), dominant (P = 0.017) (Fig. 2B) and recessive model (P = 0.005) (Fig. 2C ). The eQTL analysis results of rs2847282 were also significant (Fig. 2D,E,F). In addition, we compared mRNA expression levels of PTPN2 in 109 paired target tissue samples from The Cancer Genome Atlas (TCGA) and found that PTPN2 mRNA expression levels were significantly increased in tumor tissues than normal tissues (P = 3.01E-05) (Supplementary Figure S2). The two SNPs rs2847297 and rs2847282 were chosen as tagSNPs, because they were significantly associated with lung cancer risk as assessed in the overall association analysis and had potential functions according to the eQTL analysis.
Table 1

Summary of the functional prediction and eQTL analysis results of the 11 selected SNPs in the TCPTP pathways in silico.

SNPGeneChr.Allelea SNPinfoRegulome DB ScoreHaploReg P b
P (additive model) P (dominant model) P (recessive model)
rs7538978 PIK3R3 1A/G1 fEnhancer histone marks: 9 tissues; DNAse: CRVX; Motifs changed: 6 altered motifs0.7220.2490.175
rs11707731 ATR 3G/T4Promoter histone marks: 4 tissues; Enhancer histone marks: 4 tissues; DNAse: ESC; Motifs changed: 4 altered motifs0.5790.7190.420
rs706714 PIK3R1 5A/CTFBS5Enhancer histone marks: 7 tissues; DNAse: GI; Motifs changed: GATA,Nkx2,Nkx30.2810.1370.714
rs2740762 EGFR 7C/ATFBS5Enhancer histone marks: 13 tissues; DNAse: IPSC,MUS,PANC; Motifs changed: Foxo,NF-AT,Pax-40.0530.0080.338
rs845553 EGFR 7G/A4Promoter histone marks: 4 tissues; Enhancer histone marks: 17 tissues; DNAse: 17 tissues; Proteins bound: 7; Motifs changed: 5 altered motifs0.3960.6830.280
rs17172432 EGFR 7T/C4Promoter histone marks: 7 tissues; Enhancer histone marks: 18 tissues; DNAse: 6 tissues; Motifs changed: 4 altered motifs0.3590.6140.280
rs34280975 MET 7A/G2cEnhancer histone marks: SKIN; DNAse: 4 tissues; Proteins bound: CEBPB; Motifs changed: 7 altered motifs0.4210.1970.538
rs3744483 STAT3 17T/CmiRNA4bound: 7; DNAse: 11 tissues; Motifs changed: Foxa,p3000.9070.8560.467
rs1135669 STAT5A 17C/TSplicing4Enhancer histone marks: BLD, THYM; DNAse: OVRY,BRST; Motifs changed: BATF, Pbx3, STAT0.0620.0790.255
rs2847297 PTPN2 18A/GDNAse: BLD; Motifs changed: Nkx2,Pax-50.0050.0170.005
rs2847282 PTPN2 18T/G5Promoter histone marks: STRM, LIV, BLD; Enhancer histone marks: 9 tissues; DNAse: 4 tissues; Motifs changed: 26 altered motifs0.0290.0010.029

aReference allele/effect allele. b P value of eQTL analysis results

TFBS = transcription factor binding site.

Figure 2

The correlations between identified SNPs and PTPN2 mRNA expression. rs2847297 in PTPN2 (A) additive model, P = 0.002; (B) dominant model, P = 0.017; (C) recessive model, P = 0.005) and rs2847282 in PTPN2 (D), additive model, P = 0.0006; (E) dominant model, P = 0.001; (F) recessive model, P = 0.029).

Summary of the functional prediction and eQTL analysis results of the 11 selected SNPs in the TCPTP pathways in silico. aReference allele/effect allele. b P value of eQTL analysis results TFBS = transcription factor binding site. The correlations between identified SNPs and PTPN2 mRNA expression. rs2847297 in PTPN2 (A) additive model, P = 0.002; (B) dominant model, P = 0.017; (C) recessive model, P = 0.005) and rs2847282 in PTPN2 (D), additive model, P = 0.0006; (E) dominant model, P = 0.001; (F) recessive model, P = 0.029).

Expanded analysis by including additional two GWAS studies

We expanded our analysis by including two additional independent lung cancer GWAS studies, Harvard Lung Cancer Study and Icelandic Lung Cancer Study (deCODE). We performed an overall meta-analysis to evaluate associations between the two PTPN2 SNPs and lung cancer risk. We found that the overall effects among all eight GWAS studies remained significant (OR = 0.95, 95% CI = 0.92–0.98, Phet = 0.476, and P = 0.004 for rs2847297; OR = 0.95, 95% CI = 0.92–0.99, Phet = 0.523, and P = 0.009 for rs2847282) (Table 2 and Fig. 3A,B).
Table 2

Summary of the association results of two SNPs in the eight lung cancer GWAS studies.

Study populationSample sizeImp. Quality PTPN2 rs2847297 A > GImp. Quality PTPN2 rs2847282 T > G
CasesControlsOR (95% CI) P OR (95% CI) P
ICR 1 195252001.000.97 (0.89–1.04)0.3790.880.94 (0.87–1.03)0.180
AD46552001.000.95(0.82–1.09)0.4590.870.92(0.79–1.07)0.281
SQ61152001.000.95 (0.84–1.08)0.4250.870.96 (0.83–1.10)0.521
MDACC 2 115011341.000.85 (0.75–0.97)0.0140.810.85 (0.74–0.99)0.030
AD61911341.000.87 (0.75–1.01)0.0700.810.86 (0.72–1.01)0.073
SQ30611341.000.73 (0.60–0.89)0.0020.810.88 (0.71–1.09)0.246
Ever smoking115011341.000.85 (0.75–0.97)0.0140.810.85 (0.74–0.99)0.030
IARC 3 253337911.000.97 (0.90–1.05)0.4750.770.94 (0.86–1.03)0.188
AD51728241.001.03 (0.90–1.19)0.6410.771.00 (0.85–1.17)0.961
SQ91129681.000.91 (0.81–1.02)0.1040.770.89 (0.78–1.02)0.084
Ever smoking236725081.000.97 (0.89–1.05)0.4460.770.95 (0.86–1.04)0.273
Never smoking15912531.001.06 (0.83–1.36)0.6230.770.95 (0.71–1.27)0.735
NCI 4 571357361.000.94 (0.88–0.99)0.0220.870.95 (0.89–1.01)0.116
AD184157361.000.95 (0.87–1.03)0.2250.870.95 (0.87–1.04)0.257
SQ144757361.000.92 (0.84–1.00)0.0600.880.95 (0.86–1.04)0.258
Ever smoking534243361.000.97(0.91–1.03)0.2970.880.98 (0.92–1.06)0.649
Never smoking35013791.000.91(0.74–1.12)0.3760.880.94 (0.75–1.19)0.622
Toronto 5 3314991.000.86 (0.68–1.07)0.1820.850.84 (0.65–1.09)0.180
AD904991.000.84 (0.59–1.19)0.3260.850.89 (0.60–1.32)0.566
SQ504991.000.96 (0.61–1.52)0.8700.850.96 (0.57–1.62)0.871
Ever smoking2362721.000.95(0.71–1.27)0.7350.870.82 (0.59–1.14)0.231
Never smoking952171.000.69(0.47–1.03)0.0650.870.89 (0.57–1.40)0.611
GLC 6 4814781.001.01 (0.83–1.24)0.8810.801.06 (0.85–1.33)0.584
AD1864781.000.88 (0.67–1.16)0.3680.800.92 (0.68–1.25)0.609
SQ974781.001.20 (0.85–1.70)0.2990.801.17 (0.80–1.70)0.426
Ever smoking4332581.001.01 (0.78–1.32)0.9200.801.06 (0.79–1.41)0.701
Never smoking352201.000.99 (0.54–1.82)0.9780.800.90 (0.47–1.70)0.736
Discovery combined 12160168380.94 (0.91–0.98)0.0020.94 (0.90–0.98)0.003
Harvard 7 9849701.001.01 (0.88–1.17)0.8570.831.02 (0.89–1.17)0.791
AD5979701.001.00 (0.86–1.18)0.9520.831.04 (0.88–1.21)0.673
SQ2169701.001.03 (0.82–1.31)0.7810.831.00 (0.79–1.27)0.967
Ever smoking8928091.001.00 (0.86–1.16)0.9620.830.97 (0.84–1.13)0.687
Never smoking921611.001.15 (0.76–1.76)0.5020.831.51 (0.99–2.29)0.053
deCOD 8 1319263801.000.98 (0.91–1.07)0.6890.890.99 (0.91–1.09)0.911
AD547263801.000.96 (0.85–1.09)0.5240.891.02 (0.88–1.17)0.834
SQ259263801.000.92 (0.77–1.10)0.3730.890.84 (0.69–1.03)0.095
Replication combined 2303273500.99 (0.92–1.06)0.8031.00 (0.93–1.08)0.960
Overall 14463441880.95 (0.92–0.98)0.0040.95 (0.92–0.99)0.009
Overall AD combined 4862432210.95 (0.91–1.00)0.0530.96 (0.91–1.01)0.114
Overall SQ combined 3897433650.92 (0.87–0.97)0.0020.93 (0.88–0.99)0.016
Overall ever smoking combined 1042093170.96 (0.91–1.00)0.0430.96 (0.91–1.00)0.064
Overall never smoking combined 73132300.95 (0.83–1.09)0.4671.00 (0.86–1.16)0.959

AD: adenocarcinoma, SQ: squamous cell carcinoma. The combined OR and P value were estimated using a fixed-effects model.

1ICR: the Institute of Cancer Research Genome-wide Association Study, UK.

2MDACC: the MD Anderson Cancer Center Genome-wide Association Study, US.

3IARC: the International Agency for Research on Cancer Genome-wide Association Study, France.

4NCI: the National Cancer Institute Genome-wide Association Study, US.

5Toronto: the Samuel Lunenfeld Research Institute Genome-wide Association Study, Toronto, Canada.

6GLC: German Lung Cancer Study, Germany.

7Harvard: Harvard Lung Cancer Study, USA.

8deCODE: Icelandic Lung Cancer Study, Iceland.

Figure 3

Forest plots of effect size and direction for tagSNPs from TRICL-ILCCO consortium. PTPN2 rs2847297 P combined = 0.004 in all individuals; P combined = 0.052 in overall adenocarcinoma individuals; P combined = 0.002 in overall squamous cell carcinoma individuals; P combined = 0.042 in overall ever smoking individuals; P combined = 0.465 in overall never smoking individuals (A); PTPN2 rs2847282P combined = 0.009 in all individuals; P combined = 0.114 in overall adenocarcinoma individuals; P combined = 0.016 in overall squamous cell carcinoma individuals; P combined = 0.066 in overall ever smoking individuals; P combined = 0.960 in overall never smoking individuals (B); Each box and horizontal line represent the OR point estimate and 95% CI derived from the additive model. The area of each box is proportional to the statistical weight of the study. Diamonds represent the ORs obtained from the combined analysis with 95% confidence intervals indicated by their widths. The meta-analysis includes eight GWAS studies [the Institute of Cancer Research (ICR) GWAS, the MD Anderson Cancer Center (MDACC) GWAS, the International Agency for Research on Cancer (IARC) GWAS, the National Cancer Institute (NCI) GWAS, the Lunenfeld-Tanenbaum Research Institute (Toronto) GWAS, German Lung Cancer Study (GLC) GWAS, Harvard Lung Cancer Study (Harvard) GWAS, Icelandic Lung Cancer Study (deCODE) GWAS]. NCI GWAS includes four sub-studies: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study (ATBC), the Cancer Prevention Study II Nutrition Cohort (CPS-II), the Environment and Genetics in Lung Cancer Etiology (EAGLE), and the Prostate, Lung, Colon, Ovary Screening Trial (PLCO).

Summary of the association results of two SNPs in the eight lung cancer GWAS studies. AD: adenocarcinoma, SQ: squamous cell carcinoma. The combined OR and P value were estimated using a fixed-effects model. 1ICR: the Institute of Cancer Research Genome-wide Association Study, UK. 2MDACC: the MD Anderson Cancer Center Genome-wide Association Study, US. 3IARC: the International Agency for Research on Cancer Genome-wide Association Study, France. 4NCI: the National Cancer Institute Genome-wide Association Study, US. 5Toronto: the Samuel Lunenfeld Research Institute Genome-wide Association Study, Toronto, Canada. 6GLC: German Lung Cancer Study, Germany. 7Harvard: Harvard Lung Cancer Study, USA. 8deCODE: Icelandic Lung Cancer Study, Iceland. Forest plots of effect size and direction for tagSNPs from TRICL-ILCCO consortium. PTPN2 rs2847297 P combined = 0.004 in all individuals; P combined = 0.052 in overall adenocarcinoma individuals; P combined = 0.002 in overall squamous cell carcinoma individuals; P combined = 0.042 in overall ever smoking individuals; P combined = 0.465 in overall never smoking individuals (A); PTPN2 rs2847282P combined = 0.009 in all individuals; P combined = 0.114 in overall adenocarcinoma individuals; P combined = 0.016 in overall squamous cell carcinoma individuals; P combined = 0.066 in overall ever smoking individuals; P combined = 0.960 in overall never smoking individuals (B); Each box and horizontal line represent the OR point estimate and 95% CI derived from the additive model. The area of each box is proportional to the statistical weight of the study. Diamonds represent the ORs obtained from the combined analysis with 95% confidence intervals indicated by their widths. The meta-analysis includes eight GWAS studies [the Institute of Cancer Research (ICR) GWAS, the MD Anderson Cancer Center (MDACC) GWAS, the International Agency for Research on Cancer (IARC) GWAS, the National Cancer Institute (NCI) GWAS, the Lunenfeld-Tanenbaum Research Institute (Toronto) GWAS, German Lung Cancer Study (GLC) GWAS, Harvard Lung Cancer Study (Harvard) GWAS, Icelandic Lung Cancer Study (deCODE) GWAS]. NCI GWAS includes four sub-studies: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study (ATBC), the Cancer Prevention Study II Nutrition Cohort (CPS-II), the Environment and Genetics in Lung Cancer Etiology (EAGLE), and the Prostate, Lung, Colon, Ovary Screening Trial (PLCO). In subgroup analysis by histology (Table 2, Fig. 3), we found that the rs2847297 G allele was borderline associated with lung adenocarcinoma (AD) risk (OR = 0.95, 95% CI = 0.91–1.00, P = 0.052) and significantly associated with squamous cell lung carcinoma (SQ) risk (OR = 0.92, 95% CI = 0.87–0.97, P = 0.002, Fig. 3A). We also found the rs2847282 G allele was associated with SQ risk (OR = 0.93, 95% CI = 0.88–0.99, P = 0.016), while there was no statistical association with AD risk (OR = 0.96, 95% CI = 0.91–1.01, P = 0.114, Fig. 3B). In subgroup analysis by smoking status, there was a marginal significant decrease in lung cancer risk for the rs2847297 G allele among ever smokers (OR = 0.96, 95% CI = 0.91–1.00, P = 0.042), but not among never smokers (OR = 0.95, 95% CI = 0.83–1.09, P = 0.465, Fig. 3A). However, there was no association with the PTPN2 rs2847282 G allele and lung cancer risk among ever smokers (OR = 0.96, 95% CI = 0.91–1.00, P = 0.066 and never smokers (OR = 1.00, 95% CI = 0.86–1.16, P = 0.960, Fig. 3B).

Discussion

In the present study, we sought to investigate associations between genetic variants in the TCPTP pathway genes and lung cancer risk using eight published GWAS studies of 14,463 cases and 44,188 controls. The principal findings included two novel, potentially functional SNPs, rs2847297 and rs2847282 of PTPN2, that were both associated with a decreased lung cancer risk and a decreased mRNA expression level of PTPN2, particularly in subgroups of ever smokers and squamous cell lung carcinoma. Four articles about pathway-based analysis and lung cancer risk (Centrosome, DNA repair, lncRNA and RNA degradation) have been accepted or published in our laboratory. We found that the loci of two SNPs in PTPN2 were different from previous studies in our lab and GWAS studies. PTPN2 plays a dual role in development and progression of cancer. Proliferation and cell cycle assays demonstrated that overexpression of PTPN2 would decrease serum requirement, increase formation of larger colonies in soft agar, alter morphology, and rapidly progress through G1 and S phases and the rate of cell division[25, 26]. Another study showed that the proliferation rate would reduce in TCPTP (−/−), compared to TCPTP (+/+), lymphocytes[27]. We found that PTPN2 mRNA expression levels in matched lung cancer tissues were increased compared to adjacent normal tissues from the TCGA database, some other studies also demonstrated that PTPN2 expression levels were higher in lung AD[28, 29] and SQ[30, 31] than in normal lung tissues. These findings provided oncogenic evidence of PTPN2 and were consistent with our results that the two susceptibility loci of PTPN2 were associated with a decreased lung cancer risk as a result of a decreased mRNA expression level of the gene. In addition, we found that the eQTL analysis result of rs2847297 in lung tissue was also significant in the GTEx analysis (P = 4.0E10–7) (http://www.gtexportal.org/home/eqtls/bySnp?snpId=rs2847297&tissueName=All). This result is also consistent with the eQTL analysis from the lymphoblastoid cell lines in the present study. However, it has been reported that overexpression of PTPN2 induces apoptosis in the p53 + A549 and MCF-7 cells but not in p53- HeLa cells, also consistent with features of a tumor suppressor[32]. Another study demonstrated that PTPN2 was absent in a large proportion of “triple-negative” primary human breast cancers and PTPN2 overexpression would suppress tumor growth[33]. In subgroup analysis we found that the two SNPs were more likely to be associated with SQ risk, and the risk associated with rs2847297 G allele was more likely to be among ever smoking. Cigarette smoke is the major risk factor for lung cancer, especially for SQ. Study showed that smoking led to an increased expression of Nkx2[34], which is the transcription factor (TF) of PTPN2. Therefore, it is likely that the locus has the possibility of influencing lung cancer risk of ever smokers through changing the expression of PTPN2. Our study has some limitations. First, genes in the TCPTP pathway were identified mainly from the Molecular Signatures Database and Genecards. Although we did search some relative articles to complete the list of genes in the pathway, some newly discovered genes in the pathway might have been missed. Second, although we demonstrated the association of thetwo novel potentially functional loci in PTPN2 with lung cancer risk with functional evidence from eQTL analyses, the exact biochemical and molecular mechanisms are still unclear. Third, our eQTL analyses were limited to publicly available data from lymphoblastoid cell lines but target tissues, which could provide more direct correlation results between the two SNPs and PTPN2 expression. Taken together, the present study revealed two novel, potentially functional susceptibility loci in PTPN2 associated with lung cancer risk in European populations, particularly among ever smokers and squamous carcinoma. Further validation and functional evaluation of these genetic variants are warranted to verify our findings.

Materials and Methods

Study populations

The present study first used genotyping data from the TRICL-ILCCO consortium, which included 12,160 lung cancer cases and 16,838 controls (all Europeans) of six previously published GWAS studies: The University of Texas MD Anderson Cancer Center (MDACC), Institute of Cancer Research (ICR), National Cancer Institute (NCI), International Agency for Research on Cancer (IARC), Toronto study from Samuel Lunenfeld Research Institute study (Toronto), and German Lung Cancer Study (GLC). The expanded analysis included additional two GWAS studies of European ancestry from the Harvard Lung Cancer Study (984 cases and 970 controls)[35] and the Icelandic Lung Cancer Study (deCODE) (1,319 cases and 26,380 controls)[36] of the ILCCO. Details of the study populations are presented in the supplementary file. A written informed consent was obtained by all participating GWAS studies. All methods were performed in accordance with the relevant guidelines and regulations for each of the participating institutions, and the present study followed the study protocols approved by Duke University Health System Institutional Review Board.

Selection of Genes and SNPs from TCPTP pathway

Genotyping in these GWAS studies was performed by one of Illumina HumanHap 317, 317 + 240 S, 370Duo, 550, 610 or 1 M arrays. IMPUTE2 v2.1.1 or MaCH v1.0 software was used for imputation. Genes in the TCPTP pathway were identified from the Molecular Signatures Database (http://www.broadinstitute.org/gsea/index.jsp)[37] and Genecards (http://www.genecards.org/). Overall, 43 genes located on autosomal chromosomes were selected (detailed in Supplementary Table S1). The final meta-analysis contained 5,162 SNPs with the following inclusion criteria: genotyping rate >95%, minor allele frequency (MAF) ≥ 5%, and Hardy-Weinberg Equilibrium (HWE) exact P value ≥ 10−5. The detailed workflow is shown in Fig. 4.
Figure 4

Flowchart of SNP selection among the TCPTP pathway genes.

Flowchart of SNP selection among the TCPTP pathway genes.

In silico functional prediction and validation

We use three in silico tools, SNPinfo (http://snpinfo.niehs.nih.gov/snpinfo/snpfunc.htm)[38], RegulomeDB (http://regulomedb.org/)[39], and HaploReg (http://www.broadinstitute.org/mammals/haploreg/haploreg.php)[40] to predict potential functions. The expression quantitative trait loci (eQTL) analysis was performed in the 1000 Genomes Project[41]. The mRNA expression of lung cancer tissue samples was performed in TCGA[42].

Statistical analysis

Odds ratios (ORs) and their 95% confidence intervals (CIs) were calculated using Stata (v10, State College, Texas, USA) and PLINK (v1.06) software. A meta-analysis with the inverse variance method was employed on the 5,162 SNPs. We used Cochran’s Q statistic to test for heterogeneity and I2 statistic for the proportion of the total variation[43]. The fixed-effects model was used when there was no heterogeneity among GWAS studies (Q-test P > 0.100 and I2 < 25%); otherwise, the random-effects model was used. The false discovery rate (FDR) was performed to control for multiple testing with a threshold <0.20[44]. The genes mRNA expression levels in lung cancer and adjacent tissues from TCGA database were performed by paired t-test. Regional association plots were performed by LocusZoom[45]. Haploview v4.2 was used to generate the Manhattan plot and LD plots[46]. All other analyses were conducted with SAS (Version 9.3; SAS Institute, Cary, NC, USA). Supplementary information
  44 in total

1.  Variation in TP63 is associated with lung adenocarcinoma susceptibility in Japanese and Korean populations.

Authors:  Daiki Miki; Michiaki Kubo; Atsushi Takahashi; Kyong-Ah Yoon; Jeongseon Kim; Geon Kook Lee; Jae Ill Zo; Jin Soo Lee; Naoya Hosono; Takashi Morizono; Tatsuhiko Tsunoda; Naoyuki Kamatani; Kazuaki Chayama; Takashi Takahashi; Johji Inazawa; Yusuke Nakamura; Yataro Daigo
Journal:  Nat Genet       Date:  2010-09-26       Impact factor: 38.330

2.  Genomics: SNPs and human disease.

Authors:  Caitlin Smith
Journal:  Nature       Date:  2005-06-16       Impact factor: 49.962

3.  A genome-wide gene-environment interaction analysis for tobacco smoke and lung cancer susceptibility.

Authors:  Ruyang Zhang; Minjie Chu; Yang Zhao; Chen Wu; Huan Guo; Yongyong Shi; Juncheng Dai; Yongyue Wei; Guangfu Jin; Hongxia Ma; Jing Dong; Honggang Yi; Jianling Bai; Jianhang Gong; Chongqi Sun; Meng Zhu; Tangchun Wu; Zhibin Hu; Dongxin Lin; Hongbing Shen; Feng Chen
Journal:  Carcinogenesis       Date:  2014-03-22       Impact factor: 4.944

4.  Genome-wide association scan of tag SNPs identifies a susceptibility locus for lung cancer at 15q25.1.

Authors:  Christopher I Amos; Xifeng Wu; Peter Broderick; Ivan P Gorlov; Jian Gu; Timothy Eisen; Qiong Dong; Qing Zhang; Xiangjun Gu; Jayaram Vijayakrishnan; Kate Sullivan; Athena Matakidou; Yufei Wang; Gordon Mills; Kimberly Doheny; Ya-Yu Tsai; Wei Vivien Chen; Sanjay Shete; Margaret R Spitz; Richard S Houlston
Journal:  Nat Genet       Date:  2008-04-02       Impact factor: 38.330

5.  Overexpression of a nuclear protein tyrosine phosphatase increases cell proliferation.

Authors:  V Radha; S Nambirajan; G Swarup
Journal:  FEBS Lett       Date:  1997-06-02       Impact factor: 4.124

6.  Identification of a nuclear Stat1 protein tyrosine phosphatase.

Authors:  Johanna ten Hoeve; Maria de Jesus Ibarra-Sanchez; Yubin Fu; Wei Zhu; Michel Tremblay; Michael David; Ke Shuai
Journal:  Mol Cell Biol       Date:  2002-08       Impact factor: 4.272

7.  The Molecular Signatures Database (MSigDB) hallmark gene set collection.

Authors:  Arthur Liberzon; Chet Birger; Helga Thorvaldsdóttir; Mahmoud Ghandi; Jill P Mesirov; Pablo Tamayo
Journal:  Cell Syst       Date:  2015-12-23       Impact factor: 10.304

8.  Common 5p15.33 and 6p21.33 variants influence lung cancer risk.

Authors:  Yufei Wang; Peter Broderick; Emily Webb; Xifeng Wu; Jayaram Vijayakrishnan; Athena Matakidou; Mobshra Qureshi; Qiong Dong; Xiangjun Gu; Wei Vivien Chen; Margaret R Spitz; Timothy Eisen; Christopher I Amos; Richard S Houlston
Journal:  Nat Genet       Date:  2008-11-02       Impact factor: 38.330

9.  HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants.

Authors:  Lucas D Ward; Manolis Kellis
Journal:  Nucleic Acids Res       Date:  2011-11-07       Impact factor: 16.971

10.  Transcriptome and genome sequencing uncovers functional variation in humans.

Authors:  Tuuli Lappalainen; Michael Sammeth; Marc R Friedländer; Peter A C 't Hoen; Jean Monlong; Manuel A Rivas; Mar Gonzàlez-Porta; Natalja Kurbatova; Thasso Griebel; Pedro G Ferreira; Matthias Barann; Thomas Wieland; Liliana Greger; Maarten van Iterson; Jonas Almlöf; Paolo Ribeca; Irina Pulyakhina; Daniela Esser; Thomas Giger; Andrew Tikhonov; Marc Sultan; Gabrielle Bertier; Daniel G MacArthur; Monkol Lek; Esther Lizano; Henk P J Buermans; Ismael Padioleau; Thomas Schwarzmayr; Olof Karlberg; Halit Ongen; Helena Kilpinen; Sergi Beltran; Marta Gut; Katja Kahlem; Vyacheslav Amstislavskiy; Oliver Stegle; Matti Pirinen; Stephen B Montgomery; Peter Donnelly; Mark I McCarthy; Paul Flicek; Tim M Strom; Hans Lehrach; Stefan Schreiber; Ralf Sudbrak; Angel Carracedo; Stylianos E Antonarakis; Robert Häsler; Ann-Christine Syvänen; Gert-Jan van Ommen; Alvis Brazma; Thomas Meitinger; Philip Rosenstiel; Roderic Guigó; Ivo G Gut; Xavier Estivill; Emmanouil T Dermitzakis
Journal:  Nature       Date:  2013-09-15       Impact factor: 49.962

View more
  3 in total

1.  Proteasome regulation by reversible tyrosine phosphorylation at the membrane.

Authors:  Lu Chen; Yanan Zhang; Xin Shu; Qiong Chen; Tiantian Wei; Heman Wang; Xiaorong Wang; Qirou Wu; Xiaomei Zhang; Xiaoyan Liu; Suya Zheng; Lan Huang; Junyu Xiao; Chao Jiang; Bing Yang; Zhiping Wang; Xing Guo
Journal:  Oncogene       Date:  2021-02-18       Impact factor: 9.867

2.  PTPN2 phosphatase deletion in T cells promotes anti-tumour immunity and CAR T-cell efficacy in solid tumours.

Authors:  Florian Wiede; Kun-Hui Lu; Xin Du; Shuwei Liang; Katharina Hochheiser; Garron T Dodd; Pei K Goh; Conor Kearney; Deborah Meyran; Paul A Beavis; Melissa A Henderson; Simone L Park; Jason Waithman; Sheng Zhang; Zhong-Yin Zhang; Jane Oliaro; Thomas Gebhardt; Phillip K Darcy; Tony Tiganis
Journal:  EMBO J       Date:  2019-12-05       Impact factor: 11.598

Review 3.  PTPN2 in the Immunity and Tumor Immunotherapy: A Concise Review.

Authors:  Jiachun Song; Jinxin Lan; Jiaping Tang; Na Luo
Journal:  Int J Mol Sci       Date:  2022-09-02       Impact factor: 6.208

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.