Literature DB >> 30909962

X-chromosome association study reveals genetic susceptibility loci of nasopharyngeal carcinoma.

Xiao-Yu Zuo1, Qi-Sheng Feng1, Jian Sun1, Pan-Pan Wei1, Yoon-Ming Chin2, Yun-Miao Guo1, Yun-Fei Xia1, Bo Li3,4, Xiao-Jun Xia1, Wei-Hua Jia1, Jian-Jun Liu5, Alan Soo-Beng Khoo6, Taisei Mushiroda7, Ching-Ching Ng8, Wen-Hui Su9,10, Yi-Xin Zeng11, Jin-Xin Bei12,13.   

Abstract

BACKGROUND: The male predominance in the incidence of nasopharyngeal carcinoma (NPC) suggests the contribution of the X chromosome to the susceptibility of NPC. However, no X-linked susceptibility loci have been examined by genome-wide association studies (GWASs) for NPC by far.
METHODS: To understand the contribution of the X chromosome in NPC susceptibility, we conducted an X chromosome-wide association analysis on 1615 NPC patients and 1025 healthy controls of Guangdong Chinese, followed by two validation analyses in Taiwan Chinese (n = 562) and Malaysian Chinese (n = 716).
RESULTS: Firstly, the proportion of variance of X-linked loci over phenotypic variance was estimated in the discovery samples, which revealed that the phenotypic variance explained by X chromosome polymorphisms was estimated to be 12.63% (non-dosage compensation model) in males, as compared with 0.0001% in females. This suggested that the contribution of X chromosome to the genetic variance of NPC should not be neglected. Secondly, association analysis revealed that rs5927056 in DMD gene achieved X chromosome-wide association significance in the discovery sample (OR = 0.81, 95% CI 0.73-0.89, P = 1.49 × 10-5). Combined analysis revealed rs5927056 for DMD gene with suggestive significance (P = 9.44 × 10-5). Moreover, the female-specific association of rs5933886 in ARHGAP6 gene (OR = 0.62, 95%CI: 0.47-0.81, P = 4.37 × 10-4) was successfully replicated in Taiwan Chinese (P = 1.64 × 10-2). rs5933886 also showed nominally significant gender × SNP interaction in both Guangdong (P = 6.25 × 10-4) and Taiwan datasets (P = 2.99 × 10-2).
CONCLUSION: Our finding reveals new susceptibility loci at the X chromosome conferring risk of NPC and supports the value of including the X chromosome in large-scale association studies.

Entities:  

Keywords:  Association study; Genetic susceptibility; Male predominance; Nasopharyngeal carcinoma; X chromosome

Mesh:

Year:  2019        PMID: 30909962      PMCID: PMC6434801          DOI: 10.1186/s13293-019-0227-9

Source DB:  PubMed          Journal:  Biol Sex Differ        ISSN: 2042-6410            Impact factor:   5.027


Introduction

Nasopharyngeal carcinoma (NPC) is a human squamous-cell carcinoma that arises in the epithelium of the nasopharynx. The most remarkable feature of NPC is its restricted geographical distribution of incidence rate, which is about 30 per 100,000 in its prevalent regions including Southern China but rare in the western countries [1]. The etiology of NPC is a multifactorial process, involving genetic, viral, and environmental factors [2]. Large-scale case-control association studies and familial linkage studies have confirmed the genetic contribution to NPC predisposition, by revealing multiple susceptibility loci of NPC, such as HLA genes [3-6], TNFRSF19 [5], MECOM [5], GABBR1 [3], XRCC3 [7], ITGA9 [4], TERT-CLPTM1L [8-10], and CIITA [8]. However, these explain only a small fraction of the NPC heritability [11, 12]. The predominant incidence of males is another feature of NPC, with a male/female ratio of two to threefolds in diverse populations [13, 14]. The age-standardized rates of NPC for males:females were 21.8:8.1 per 100,000 person-years in Hong Kong [15] and 28.93:11.26 in Sihui, Guangdong, where both cities are NPC prevalent regions in Southern China [16]. Unequal exposures to environmental risk factors between males and females, such as smoking and diet habit, may partially contribute to the gender difference in the incidence of NPC [15]. It has been suggested that some tumor suppressor genes may contribute to gender bias in cancer development by escaping from the inactivation of the X chromosome in females [17]. Moreover, the involvement of the X chromosome has been implicated in the predisposition to NPC [18, 19]; however, the exact genetic components in the X chromosome have remained largely unexploited in NPC. By contrast, recent X chromosome association studies have identified novel risk loci at the sexual chromosome for follicular lymphoma [20], schizophrenia [21], systemic lupus erythematosus [22], etc., which partially explain the missing heritability of those complex diseases. To explore the genetic contribution of the X chromosome to the gender difference of NPC incidence, we performed an X chromosome-wide association study, with the discovery data of 1615 cases and 1025 controls in Southern China, followed by two validation analyses involving a total of 543 cases and 735 controls from Taiwan Chinese and Malaysian Chinese, respectively.

Methods

Study subjects

The discovery sample included 1615 NPC cases and 1025 controls derived from a previous genome-wide association study (GWAS) in Southern Chinese [5]. All cases were recruited through Sun Yat-sen University Cancer Center (SYSUCC) in Southern China during October 2005 and October 2007, whereas the controls were recruited from several physical examination centers in local communities in Guangdong during the same period. As for the validation stage, two independent case-control cohorts of Chinese descendants were included, with 266 NPC cases and 450 controls recruited from different states of Malaysia [4, 23], and 277 NPC cases and 285 controls recruited from the northern parts of Taiwan, respectively [3]. The diagnosis of NPC was confirmed according to the World Health Organization (WHO) classification at each study site. The study was approved by the Institutional Review Board at SYSUCC. Informed consent was obtained from all participants. The information for all subjects is summarized in Additional file 1: Table S1.

Genotyping and quality control

Individual genotypes had been determined as described previously [5]. In brief, the genomic DNA was extracted from peripheral blood sample using commercial (Qiagen) DNA extraction kit (Southern China and Taiwan studies) or conventional methods (Malaysia study), and the genotyping was conducted by using Illumina BeadChip arrays, according to the manufacturer’s protocols (Illumina, Inc., San Diego, CA, USA). For the discovery stage, quality checks were applied for each sample as described in the previous study [5], using autosomal SNPs and removing those with genotyping rate < 95%, excessive observed level of heterozygosity (departure from 3 standard deviation), cryptic relatedness, error or uncertainty in gender estimation, and population outliers through principal component analysis (PCA). A total of 1590 cases and 994 controls were retained for subsequent analyses. We extracted 18,133 genotyped SNPs at the X chromosome from the dataset and conducted further quality control filtering for the SNPs as suggested previously [24, 25]. As heterozygote of a SNP in a male should be deemed to be a genotyping/calling error, a fact of being haploidy for the X chromosome in a male, thus, we assigned a missing value for such a call in a male sample. The SNPs were removed if they met the following criteria [1]: with genotyping rate < 92% for all samples or call rate difference > 3% or P < 1 × 10−5 between gender [2], with the difference of heterozygosity level > 5% or P < 1 × 10−3 between cases and controls in males [3], with minor allele frequency (MAF) < 1% for both males and females [4], deviation from Hardy-Weinberg equilibrium test P < 1 × 10−6 in females, or [5] with heterozygote genotypes found in 5% of males (due to genotyping error). For SNPs in pseudo-autosomal regions (PAR), we used the same filtering criteria as for the autosomal SNPs in our previous study [5]. Finally, 6536 SNPs were removed, and 11,597 X chromosomal SNPs were used for subsequent analyses. For the validation stage, genotypes were retrieved from the two GWAS studies [3, 4, 23]. Genotypes for unknown SNPs in the GWAS datasets were imputed by using non-PAR SNPs and IMPUTE2 program [26, 27], with 1000 Genomes Phase I integrated variant set (March 2012, Build 37) as the reference. Imputed SNPs with imputation score (INFO in the *.impute2_info output file) < 0.5 were considered low confidence and removed. Then, Gtool program (v0.7.5) was used to convert data from IMPUTE2 to PLINK. Genotypes with the threshold for calling genotypes (−threshold) < 0.9 were considered as low confidence and set as missing. Imputed SNPs were subjected to the same quality control procedure as that of the genotyped SNPs. As a result, 99,710 imputed loci were retained for downstream analysis. For the Malaysian Chinese dataset, 1000 Genomes Phase I interim set (June 2011, Build 37) was used as the reference. The genotypes of validating SNPs were retrieved from the Taiwan Chinese dataset as previously described [3].

Statistical and bioinformatics analysis

Association tests were performed by using PLINK implemented with logistic regression analysis under the generalized linear model [28, 29]. Genotypes of the X chromosome loci were coded as [0, 2] for males and [0, 1, 2] for females, accounting for the random inactivation on females and assuming activation of one of the two X chromosomes in females. Given that a proportion of SNPs can escape from random inactivation, genotype coding of [0, 1] for males and [0, 1, 2] for females was also tested in combined samples to account for the escaping from random inactivation. Gender was treated as a covariate, and the first 10 principal components (PCs) were also adjusted in the association test in the discovery sample. To investigate whether there are sex-specific effects underlying X chromosome loci, we performed association tests for males and females separately and also tests accounting for gender × SNP interaction. For combined analyses of discovery and replication datasets, meta-analyses were conducted in R (version 3.2.3) using the “metafor” package (version 1.9–8). The heterogeneity among datasets was first evaluated by I2 and Q test. I2 > 50% or P < 0.05 were considered as heterogeneous, and the random-effect model was applied; otherwise, the fixed-effect model would be used. We used linear mixed model approach implemented in GCTA (version 1.24.7) [30, 31] to estimate the contribution of the X chromosome on the proportion of the NPC phenotypic variance in the discovery dataset. The equal variance in both sexes (EV), full-dosage compensation (FDC), and non-dosage compensation (NDC) models were applied [32]. Only genotyped SNPs were used in this study to reduce the potential impact of subtle imputation uncertainty. For a comparison, we also estimated the variance explained by each autosome by using common SNPs passing QC and with MAF > 5%. We used the simpleM method to address the effective number of independent association tests [33] since the Bonferroni correction for multiple testing was too conservative for the association test among loci with considerable linkage disequilibrium (LD). The simpleM method adopts a principal component analysis on the SNP correlation matrix to pick up the least “effective number” of tests accounting for ≥ 99.5% variance. In this study, the effective number was estimated as 2308, irrespective of genotyped or imputed loci. Therefore, we adopted the X chromosome-wide significance level as 0.05/2308 = 2.17 × 10−5. In the discovery stage, we chose a relatively relaxed significance threshold (P < 1 × 10−3) to select candidate SNPs for the follow-up replications, allowing inclusion of more SNPs and higher sensitivity of reproducible association signals. We used the R package “twoStageGwasPower” to calculate statistical power in this study, following the methods described previously [34]. The key raw data of this study have been uploaded onto the Research Data Deposit (RDD; http://www.researchdata.org.cn/; Number: RDDB2019000532). Furthermore, noncoding susceptible SNPs identified in this study were subjected to HaploReg (V4.1; https://pubs.broadinstitute.org/mammals/haploreg/haploreg.php) [35] to annotate their functional and regulatory potentials on chromatin state and protein-binding annotations from the Roadmap Epigenomics and ENCODE projects, sequence conservation across mammals, and the effect of SNPs on regulatory motifs. Expression quantitative trait locus (eQTL) effects of the SNPs were estimated by using GTEx portal, where samples were collected from 53 non-diseased tissue sites across nearly 1000 individuals and tissue-specific gene expression and regulation with particularly the correlation between SNP and the expression of nearby genes was analyzed and archived (V7; https://gtexportal.org/home/).

Results

Genetic variance in X chromosome contributes to the risk of NPC

Firstly, the proportion of variance of X-linked loci over phenotypic variance was estimated under several dosage compensation models in the discovery samples for which the individual-level genotype data were available. X chromosome variants showed different contributions to the variance of NPC risk between males and females (Fig. 1). In males, 12.63% of the genetic variance estimated under non-dosage compensation (NDC) model was likely due to X chromosome variations (P = 0.024) and that was 6.74% and 3.49% under equal variance (EV) model (P = 0.024) and full-dosage compensation (FDC) model (P = 0.024), respectively. At whole genome level, the proportion ranked after that of chromosome 6, which contributed 24.14% of the genetic variance largely because of harboring the well-known risk loci of NPC as revealed by many GWAS studies [3-5](Additional file 1: Figure S1). However, as in females, X chromosome variants explained only 0.0001% variance for NPC under all three models, in contrast to the 23.04% of variance attributable to chromosome 6 (Fig. 1). These suggest that the contribution of the X chromosome to the genetic variance of NPC should not be neglected, hence motivating the search for loci associated with NPC in X chromosome.
Fig. 1

Estimates of the gender-specific genetic variance explained by autosomal and sexual chromosomes. The genetic variance of the X chromosome was estimated based on the equal variance (EV) model for females and non-dosage compensation (NDC) model for males. The genetic variance of autosomal chromosomes was estimated based on the EV model for both sexes

Estimates of the gender-specific genetic variance explained by autosomal and sexual chromosomes. The genetic variance of the X chromosome was estimated based on the equal variance (EV) model for females and non-dosage compensation (NDC) model for males. The genetic variance of autosomal chromosomes was estimated based on the EV model for both sexes

X chromosome-wide association analysis

Next, association tests were conducted for a total of 111,307 SNPs of X chromosome, including 11,597 directly genotyped and 99,710 imputed, in 1590 NPC cases and 994 controls. Significant associations surpassing the X chromosome-wide significance level (P = 2.17 × 10−5, Fig. 2a) were observed in two regions spanning DMD and LOC101928201-NLGN4X, where the leading SNPs were rs5927056 (OR = 0.81, 95% CI 0.73–0.89; P = 1.49 × 10−5, Fig. 2b) and rs4495592 (OR = 1.28, 95% CI 1.14–1.44; P = 2.15 × 10−5, Fig. 2c), respectively, together with additional supportive associations (Table 1 and Additional file 1: Table S2). Moreover, suggestive associations were observed in the other three loci, including TENM1 (rs12842370), REPS2 (rs12860876), and MAGEA11 (rs2156978) (Additional file 1: Table S2; P < 1 × 10−4). Conditional analyses revealed that the sentinel SNPs accounted for all the associations observed in each of the five loci (Additional file 1: Figure S2). Given that a small portion of genes may escape from X chromosome inactivation, the non-random inactivation model was also applied in the association test. The association P values were highly consistent with those derived from random inactivation model (r = 0.85, Additional file 1: Figure S3) and an additional suggestive association was observed in ARX-MAGEB18 (rs10127187, OR = 1.19, 95% CI 0.82–0.94; P = 8.04 × 10−5. Additional file 1: Table S3). Therefore, we adopted random inactivation model for the remaining analyses.
Fig. 2

X chromosome-wide association of nasopharyngeal carcinoma. a Manhattan plot of P values for all X-linked SNPs (presented as −log10(P); circle and triangle for genotyped and imputed SNP, respectively). Steel blue line shows the suggestive significance level in the discovery study (P < × 10−4) and red line shows the X chromosome-wide significance (P < 2.17 × 10−5). b and c. Regional plot of P values from associations of rs5927056 and rs44955592, respectively, in the discovery dataset

Table 1

Representative associations of SNPs on X chromosome for combined samples

Discovery (Guangdong)Replication 1 (Malaysia)Replication 2 (Taiwan)Combined analysis
MarkerPositionAnnotationOR95% CI P OR95% CI P OR95% CI P I 2 OR95% CI P
rs592705631969744 DMD 0.810.73–0.891.49 × 10−51.010.83–1.229.47 × 10−10.90.74–1.113.33 × 10−10.000.850.79–0.929.44 × 10−5
rs66411424825493299 kb 3′ of LOC1019282011.281.14–1.432.86 × 10−50.950.76–1.196.58 × 10−11.120.89–1.413.44 × 10−10.001.191.08–1.303.01 × 10−4
rs1286087617037354 REPS2 1.201.09–1.317.20 × 10−50.950.81–1.125.58 × 10−11.010.85–1.218.86 × 10−168.811.070.92–1.233.98 × 10−1
rs2207942145630047261 kb 5′ of CXorf51A1.181.08–1.294.02 × 10−41.140.96–1.361.43 × 10−10.960.79–1.177.04 × 10−10.001.141.06–1.237.30 × 10−4
rs201809412272097 FRMPD4 0.860.78–0.947.69 × 10−40.960.81–1.146.44 × 10−10.960.8–1.166.86 × 10−10.000.890.83–0.961.95 × 10−3
rs5910990119872266108 kb 3′ of C1GALT1C10.860.79–0.933.86 × 10−41.060.90–1.254.89 × 10−11.040.86–1.256.97 × 10−169.110.960.83–1.126.17 × 10−1
rs65280692191563043 kb 5′ of SMS0.850.78–0.932.41 × 10−41.110.94–1.312.07 × 10−11.110.92–1.342.76 × 10−180.291.000.83–1.219.86 × 10−1
rs594969895161542430 kb 5′ of BRDTP11.221.10–1.373.40 × 10−40.860.70–1.051.39 × 10−10.910.73–1.123.74 × 10−180.911.000.79–1.269.90 × 10−1
X chromosome-wide association of nasopharyngeal carcinoma. a Manhattan plot of P values for all X-linked SNPs (presented as −log10(P); circle and triangle for genotyped and imputed SNP, respectively). Steel blue line shows the suggestive significance level in the discovery study (P < × 10−4) and red line shows the X chromosome-wide significance (P < 2.17 × 10−5). b and c. Regional plot of P values from associations of rs5927056 and rs44955592, respectively, in the discovery dataset Representative associations of SNPs on X chromosome for combined samples Moreover, gender-specific association tests were carried out to identify X chromosomal loci that contribute to the sexual dimorphism phenomena in NPC. Suggestive gender-specific associations were found in the two genders, respectively (P < 1 × 10−3; Fig. 3 and Additional file 1: Figure S4). Sentinel signals were observed within the intergenic region of LOC101928201-NLGN4X (rs6641142, OR = 1.31, 95% CI 1.16–1.49, P = 2.50 × 10−5) and intron of TENM1 (rs12842370, OR = 1.48, 95% CI 1.23–1.78, P = 4.51 × 10−5) for males (Fig. 3a) and 47Kb upstream of MAGEB18 gene (rs10127187, OR = 0.42, 95% CI 0.29–0.65, P = 5.02 × 10−5) for females (Fig. 3b). Some female-specific associations showed protective effects, such as the rs5933886 in ARHGAP6 (OR = 0.62, 95%CI: 0.47–0.81, P = 4.37 × 10−4, Table 2). Gender-SNP interaction tests revealed several SNPs with nominal significance (P < 1 × 10−3), including rs2002686 in EFHC2 (P = 8.4 × 10−5), rs139949129 and rs5933886 in ARHGAP6 (P = 4.72 × 10−4 and 6.25 × 10−4, respectively), and some intergenic SNPs such as rs3859959, rs12834592, rs6603446, and rs72620283 (Additional file 1: Figure S5 and Table 2). Notably, after excluding the top SNPs (listed in Additional file 1: Tables S4 and S2 for males and all samples, respectively), the remaining variants explained less the genetic variances, with 1.67% under EV model in all samples, 4.63% under EV model and 8.85% under NDC model in males (Additional file 1: Figure S6).
Fig. 3

Gender-specific association of nasopharyngeal carcinoma. Manhattan plots of association results for a males and b females, respectively

Table 2

Representative gender-specific association on X chromosome in discovery cohort

MarkerPositionMinor alleleAnnotationMAFmalesaORM P M MAFfemalesaORF P F ORI P I
Males
 rs66411424825493C299 kb 3′ of LOC1019282010.22/0.141.312.50 × 10−50.20/0.191.0786.05 × 10−11.212.34 × 10−1
 rs12842370124071272A TENM1 0.47/0.441.224.51 × 10−50.51/0.471.057.02 × 10−11.152.58 × 10−1
 rs662984224276842C427 kb 3′ of ZFX0.49/0.401.211.21 × 10−40.47/0.500.872.17 × 10−11.351.20 × 10−2
 rs592705631969744G DMD 0.22/0.30.811.17 × 10−40.22/0.270.774.66 × 10−21.047.62 × 10−1
 rs14071266895192068T400 kb 5′ of BRDTP10.39/0.311.221.20 × 10−40.35/0.380.851.77 × 10−10.891.20 × 10−2
Females
 rs1012718726108742C47Kb 3′ of MAGEB180.06/0.080.878.65 × 10−10.05/0.110.425.02 × 10−52.022.40 × 10−3
 rs1044236944182366A EFHC2 0.32/0.281.081.40 × 10−10.24/0.330.631.42 × 10−41.661.19 × 10−4
 rs652788519067421A ADGRG2 0.50/0.491.018.01 × 10−10.47/0.421.561.67 × 10−41.087.57 × 10−2
 rs593388611463381T ARHGAP6 0.19/0.271.054.10 × 10−10.20/0.190.624.37 × 10−41.666.25 × 10−4
 rs1284759826558711C177Kb 5′ of VENTXP10.07/0.080.92.58 × 10−10.05/0.110.462.32 × 10−41.953.21 × 10−3
Gender-SNP
 rs200268644182139A EFHC2 0.31/0.271.081.29 × 10−10.23/0.330.621.48 × 10−41.698.35 × 10−5
 rs385995943496280C20 kb 5′ of MAOA0.39/0.450.887.88 × 10−30.47/0.381.422.12 × 10−30.631.22 × 10−4
 rs1283459292782494C143 kb 5′ of NAP1L30.44/0.470.942.00 × 10−10.49/0.391.513.49 × 10−40.631.86 × 10−4
 rs6603446115968398G374 kb 3′ of CT830.45/0.421.071.90 × 10−10.40/0.490.687.25 × 10−41.582.24 × 10−4
 rs7262028321923327A20 kb 3′ of MBTPS20.21/0.210.976.23 × 10−10.24/0.161.686.01 × 10−40.574.71 × 10−4
 rs13994912911449691T ARHGAP6 0.17/0.151.063.99 × 10−10.16/0.230.593.81 × 10−41.744.72 × 10−4

The association tests were conducted by using logistic regression adjusted for the first 10 principal components. OR and P, odds ratio and P value for single point association test in males; OR and P, odds ratio and P value for single point association test in females; OR and P, odds ratio and P value for gender-SNP interaction

aMAF presents as minor allele frequency in cases/minor allele frequency in controls

Gender-specific association of nasopharyngeal carcinoma. Manhattan plots of association results for a males and b females, respectively Representative gender-specific association on X chromosome in discovery cohort The association tests were conducted by using logistic regression adjusted for the first 10 principal components. OR and P, odds ratio and P value for single point association test in males; OR and P, odds ratio and P value for single point association test in females; OR and P, odds ratio and P value for gender-SNP interaction aMAF presents as minor allele frequency in cases/minor allele frequency in controls

Validation and combined analyses

A total of 27 independent candidate SNPs (pairwise r2 < 0.5) passing the suggestive significance level in discovery stage (P < 1 × 10−3) were selected for two independent follow-up validations in Taiwan Chinese and Malaysian Chinese, respectively. Three SNPs (rs12556646, rs12842370, and rs6540340) were excluded due to their absence in either of the validation datasets (Additional file 1: Table S5). Combined analyses suggested that rs5927056 in DMD gene were associated with NPC risk (P = 9.44 × 10−5; Fig. 4a). Moreover, combined analysis showed that rs371000 in F9 (P = 3.13 × 10−4) was associated with NPC risk in males (Fig. 4b and Additional file 1: Table S4). For the female group, the combined analysis revealed that rs5933886 in ARHGAP6 was associated with NPC risk (P = 2.05 × 10−4), and the association was validated in the Taiwan dataset (P = 1.64 × 10−2; Fig. 4c and Additional file 1: Table S6). Furthermore, the interaction of rs5933886 with gender was validated in the Taiwan dataset (P = 2.99 × 10−2). However, the associations within REPS2 locus observed in the discovery sample were not significant in the combined analysis (Table 1).
Fig. 4

Forest plot of meta-analyses for X chromosome SNPs. Forest plot shows association results for a rs5927056 in all samples, b rs371000 in males, and c rs5933886 in females, respectively

Forest plot of meta-analyses for X chromosome SNPs. Forest plot shows association results for a rs5927056 in all samples, b rs371000 in males, and c rs5933886 in females, respectively

Functional annotations of the SNPs

In silico analyses were conducted to explore the functional potentials of the susceptibility SNPs on the X chromosomes. HaploReg revealed that the top significant rs5927056, located in the intron of DMD, might alter six regulatory motifs including AP-3, Evi-1_4, Hoxa10, Hoxb13, Hoxd10, and Pou2f2_known11, suggesting its regulatory potentials (Additional file 1: Table S7). HaploReg also revealed regulatory motifs (HMG-IY_1, Ik-3, NFKB_known5, Pou3f1, and STAT_known5) and DNase peaks at rs5933886, which is an intronic SNP of ARHGAP6 (Additional file 1: Table S8). Moreover, the eQTL analysis revealed a significant cis-eQTL effect of rs5933886 in the aorta artery sample from the GTEx database, which were collected from the ascending aorta or other thoracic regions (nonatherosclerotic; P = 5.88 × 10−3, Additional file 1: Table S8). These results imply that rs5933886 may act as a regulatory SNP and predispose NPC by regulating the expression of ARHGAP6.

Discussion

The human X chromosome consists ~ 155 million base pairs and contains more than 1000 genes that are important for many biological processes [17, 30]. However, none of the previous GWAS of NPC included association analysis of variations at the X chromosome, overlooking potential susceptibility loci that could partially account for the missing heritability of NPC. Here, we provided, to the best of our knowledge, the first large-scale X chromosome-wide association study of NPC in the Chinese population, revealing the contribution of the X chromosome and suggestive susceptibility loci for NPC risk. At the chromosome-wide level, we observed different contribution of the X chromosome to NPC heritability between genders. The genetic variance of NPC could be partially explained by X chromosome for males, with considerable portion next to that by the chromosome 6 harboring the well-known susceptible HLA loci. However, a subtle genetic contribution of the X chromosome to NPC was observed in females. The different genetic effects on males and females might explain the sexual dimorphism phenomena in NPC incidence; however, the mechanisms underlying this phenomenon remain unclear. Random inactivation in the X chromosome is a major characteristic for females, such that homozygotes in females exhibit a similar genetic effect as hemizygotes in males. It has been implicated that X-linked tumor suppressor genes may escape from X-inactivation, whereby the females carrying such two copies of functional genes have a reduced risk to develop tumor [17]. Moreover, sex-specific regulatory variants are likely enriched in the X chromosome due to the sex-specific chromatin accessibility, implying different cumulative effects on gene reregulation between genders and thereby sex-biases in disease prevalence [36]. We identified a novel susceptibility locus at rs5927056 to be associated with NPC risk, surpassing X chromosome-wide significance in the discovery sample collection. Moreover, the combined analysis revealed a consistent association in males. Heterogeneous effect of the SNP was found in females (OR = 0.77, P = 0.04 in discovery data; OR = 0.99 and 1.00 in two replications), which could be partially explained by the inadequate statistical power because of limited female individuals included. Furthermore, the genetic variance of NPC risk to be explained by the X chromosome SNPs decreased with 8.3% after exclusion of rs5927056, suggesting a remarkable genetic effect of this locus on NPC susceptibility. rs5927056 is located within the intron of the DMD gene, a large gene that encodes the component of dystrophin-glycoprotein complex and bridges the inner cytoskeleton and the extracellular matrix. It has been demonstrated that DMD is sensitive to the replication stress on DNA damage and genome instability in tumor cells [37] and may act as a tumor suppressor involved in the development and progression of mesenchymal tumors [38]. Alteration in DMD was shown to be associated with many non-myogenic tumors [39]. Considering that rs5927056 alters some motifs in the region as revealed by HaploReg, we suspect that the haplotype of rs5927056 might regulate DMD expression and thus predispose individuals to NPC. We also observed that rs5933886 in ARHGAP6 was associated with NPC risk, specifically in females. ARHGAP6 encodes Ras homology GTPase activation protein 6 and is involved in the regulation of actin polymerization at the plasma membrane during several cellular processes. ARHGAP6 may act as a tumor suppressor by inhibiting cell proliferation, migration, invasion, and adhesion of cervical carcinoma [40]. Our bioinformatics analyses revealed that rs5933886 was located within a regulatory region and was associated with cis-eQTL effects on ARHGAP6, suggesting that it may confer a protective effect on NPC risk for females through transcriptional regulation of ARHGAP6. Although our study adopted a case-control approach with more than 3900 samples across multiple centers, the sample size is still our limitation. Only one third of the heritability on X chromosome could be explained by the identified association signals in current study. Given the observed minor allele frequency and effect size, the power of association tests in discovery dataset varied from 0.097 (rs371000 in males) to 0.841 (rs6418572 in females; Additional file 1: Table S9) under the present significance level (α = 2.17 × 10−5), suggesting that we had certain power to discover some susceptibility loci underlying the X chromosome and meanwhile other potential susceptibility loci would be overlooked under the current sample size setting. Moreover, as compared to that of the discovery cohort, the relatively smaller sample sizes of the replication cohorts might be insufficient to establish associations with effect size smaller than that in the discovery due to Winner’s curse phenomenon. Therefore, further studies with larger cohorts are necessary to boost the power for replicating our findings as well as detecting missing susceptibility loci on X chromosome, in particular for SNPs with low minor allele frequency and weak effect size. Taken together, we reported for the first time the large-scale X chromosome-wide association study for NPC. Our study illustrates the value of association test by including the X chromosome in finding the missing heritability of complex diseases other than the conventional GWAS, particularly for those diseases showing the remarkable difference in incidence rate between genders. We acknowledge that future association studies are warranted to validate our findings in independent cohorts with a large sample size and to identify more novel genetic variants on sexual chromosomes associated with NPC risk. Moreover, fine-mapping studies incorporating next-generation sequencing technology would be important to pinpoint the causal variants underlying the susceptibility loci.

Conclusion

Our study illustrates that the X chromosome confers different genetic effects on males and females and might explain the sexual dimorphism phenomena in NPC incidence. Our study reveals new susceptibility loci at the X chromosome conferring risk of NPC and supports the value of including the X chromosome in large-scale association studies. Further association studies using independent large cohorts are warranted to validate our findings. Supplemental materials. This file contains supplemental tables (Table S1-S8) and figures (Figure S1-S6). (PDF 2826 kb)
  39 in total

1.  Is nasopharyngeal cancer really a "Cantonese cancer"?

Authors:  Joseph Tien Seng Wee; Tam Cam Ha; Susan Li Er Loong; Chao-Nan Qian
Journal:  Chin J Cancer       Date:  2010-05

2.  Joint analysis is more efficient than replication-based analysis for two-stage genome-wide association studies.

Authors:  Andrew D Skol; Laura J Scott; Gonçalo R Abecasis; Michael Boehnke
Journal:  Nat Genet       Date:  2006-01-15       Impact factor: 38.330

3.  Sex difference in the incidence of nasopharyngeal carcinoma in Hong Kong 1983-2008: suggestion of a potential protective role of oestrogen.

Authors:  Shao-Hua Xie; Ignatius Tak-Sun Yu; Lap-Ah Tse; Oscar Wai-kong Mang; Li Yue
Journal:  Eur J Cancer       Date:  2012-08-11       Impact factor: 9.162

4.  HLA-A SNPs and amino acid variants are associated with nasopharyngeal carcinoma in Malaysian Chinese.

Authors:  Yoon-Ming Chin; Taisei Mushiroda; Atsushi Takahashi; Michiaki Kubo; Gopala Krishnan; Lee-Fah Yap; Soo-Hwang Teo; Paul Vey-Hong Lim; Yoke-Yeow Yap; Kin-Choo Pua; Naoyuki Kamatani; Yusuke Nakamura; Choon-Kook Sam; Alan Soo-Beng Khoo; Ching-Ching Ng
Journal:  Int J Cancer       Date:  2014-06-27       Impact factor: 7.396

5.  HaploReg: a resource for exploring chromatin states, conservation, and regulatory motif alterations within sets of genetically linked variants.

Authors:  Lucas D Ward; Manolis Kellis
Journal:  Nucleic Acids Res       Date:  2011-11-07       Impact factor: 16.971

6.  Incidence trend of nasopharyngeal carcinoma from 1987 to 2011 in Sihui County, Guangdong Province, South China: an age-period-cohort analysis.

Authors:  Li-Fang Zhang; Yan-Hua Li; Shang-Hang Xie; Wei Ling; Sui-Hong Chen; Qing Liu; Qi-Hong Huang; Su-Mei Cao
Journal:  Chin J Cancer       Date:  2015-05-14

7.  Nasopharyngeal carcinoma as a paradigm of cancer genetics.

Authors:  Malcolm J Simons
Journal:  Chin J Cancer       Date:  2011-02

8.  Impact of the X Chromosome and sex on regulatory variation.

Authors:  Kimberly R Kukurba; Princy Parsana; Brunilda Balliu; Kevin S Smith; Zachary Zappala; David A Knowles; Marie-Julie Favé; Joe R Davis; Xin Li; Xiaowei Zhu; James B Potash; Myrna M Weissman; Jianxin Shi; Anshul Kundaje; Douglas F Levinson; Philip Awadalla; Sara Mostafavi; Alexis Battle; Stephen B Montgomery
Journal:  Genome Res       Date:  2016-04-21       Impact factor: 9.043

9.  A flexible and accurate genotype imputation method for the next generation of genome-wide association studies.

Authors:  Bryan N Howie; Peter Donnelly; Jonathan Marchini
Journal:  PLoS Genet       Date:  2009-06-19       Impact factor: 5.917

10.  Testing for association on the X chromosome.

Authors:  David Clayton
Journal:  Biostatistics       Date:  2008-04-25       Impact factor: 5.899

View more
  5 in total

Review 1.  Nasopharyngeal carcinoma among the Bidayuh of Sarawak, Malaysia: History and risk factors.

Authors:  Reagan Entigu Linton; Maelinda Daker; Alan Soo-Beng Khoo; Diana Chung Yiing Choo; Mignon Viljoen; Paul M Neilsen
Journal:  Oncol Lett       Date:  2021-05-06       Impact factor: 2.967

Review 2.  The Duchenne muscular dystrophy gene and cancer.

Authors:  Leanne Jones; Michael Naidoo; Lee R Machado; Karen Anthony
Journal:  Cell Oncol (Dordr)       Date:  2020-11-14       Impact factor: 6.730

3.  MYC-activated RNA N6-methyladenosine reader IGF2BP3 promotes cell proliferation and metastasis in nasopharyngeal carcinoma.

Authors:  Mingyu Du; Yi Peng; Yang Li; Wenyue Sun; Huanfeng Zhu; Jing Wu; Dan Zong; Lirong Wu; Xia He
Journal:  Cell Death Discov       Date:  2022-02-08

Review 4.  Nasopharyngeal Carcinoma and Its Microenvironment: Past, Current, and Future Perspectives.

Authors:  Zhi Yi Su; Pui Yan Siak; Chee-Onn Leong; Shiau-Chuen Cheah
Journal:  Front Oncol       Date:  2022-03-02       Impact factor: 6.244

5.  Identification of Genetic Variants Associated with Sex-Specific Lung-Cancer Risk.

Authors:  Xiaoshun Shi; Sylvia Young; Grant Morahan
Journal:  Cancers (Basel)       Date:  2021-12-20       Impact factor: 6.639

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.