Literature DB >> 29581771

Genome-wide Association Study (GWAS) of Germline Copy Number Variations (CNVs) Reveal Genetic Risks of Prostate Cancer in Chinese population.

Yishuo Wu1,2,3, Haitao Chen4, Guangliang Jiang1,2, Zengnan Mo5, Dingwei Ye6, Meilin Wang7, Jun Qi8, Xiaoling Lin2, S Lilly Zheng3, Ning Zhang1,2, Rong Na2,9, Qiang Ding1,2, Jianfeng Xu2,3, Yinghao Sun10.   

Abstract

Introduction: The associations between Prostate cancer (PCa) and germline copy number variations (CNVs) in genome-wide level based on Chinese population are unknown. The objective of this study was to identify possible PCa-risk associated CNV regions in Chinese population. Materials and
Methods: We performed a genome-wide association study for CNV in 1,417 PCa cases and 1,008 controls in Chinese population.
Results: 7 risk-associated CNVs were identified for PCa after association analyses (P <7.2×10-6). Another 34 CNVs were found to be potentially risk-associated CNVs (P<0.05). Among the total 41 CNVs, 27 CNVs were risk variations and the other 14 were found to be protective of PCa. 25 of the CNVs (19 duplications and 6 deletions) were located in gene regions while 16 CNVs (9 duplications and 7 deletions) were located in intergenic regions. We identified a higher burden of gaining PCa-risk CNVs and a lower frequency of protective CNVs in cases than controls. Bioinformatics analyses suggested that genes related to PCa risk-associated CNVs were significantly enriched in some biological processes, cellular components and molecular functions.
Conclusion: These results provided additional information of genetic risks for PCa. Several CNV regions involved actionable genes that might be potential gene for target therapy. Additional validation and functional studies are warranted for these results.

Entities:  

Keywords:  China.; copy number variation; genome-wide association study; prostate cancer

Year:  2018        PMID: 29581771      PMCID: PMC5868157          DOI: 10.7150/jca.22802

Source DB:  PubMed          Journal:  J Cancer        ISSN: 1837-9664            Impact factor:   4.207


Introduction

Prostate cancer (PCa) represents the second most common cancer and the fifth most common cause of cancer-related death in men, with more than 900,000 new cases diagnosed each year 1. In China, the incidence has increased by nearly six-fold during last 30 years 2. Genetic risk is one of the most important factors causing PCa, which can explain ~42% of the disease carcinogenesis 3. Till now, more than 100 PCa risk-associated single nucleotide polymorphisms (SNPs) have been found through genome-wide association studies (GWAS) 4. However, a study showed these risk-associated SNPs could only explain about 33% of the familial risk for PCa 5. In addition, most of these SNPs are located in intergenic regions 4. Thus, it is hard to understand the possible functional effects of these SNPs on pathogenesis. Copy-number variations (CNVs) are considered another major fact of genetic diversity, which could directly modulate cellular biological functions 6. In addition, studies showed that CNVs cover ~35% of the human genome, while SNPs only account for less than 1% 7. Several studies indicated that germline CNVs were associated with PCa in Caucasian and African-American populations 8-12. However, the association between PCa and CNVs in a genome-wide level has not been reported in Chinese population. In the present study, we performed a genome-wide CNV association study with PCa in a Chinese population by using the published GWAS data from our previous study 13. Our objective was to identify possible germline PCa-risk CNV regions, which might provide additional insight to the inherited risk of PCa.

Materials and Methods

Study population

The study population is a part of Chinese Consortium for Prostate Cancer Genetics (ChinaPCa). The demographic characteristics and clinical features of the study were reported in our previous study 13. All the subjects are male Han Chinese from the southeastern region of China. A total of 1,417 PCa cases and 1,008 controls were recruited. All of the cases were pathologically diagnosed with primary PCa. Controls were recruited from the community population. Written informed consent was obtained from each participant. The study was approved by the institutional review board of each medical center that participated in ChinaPCa.

Genotyping

DNA was extracted from blood sample of each subject. Illumina Human OmniExpress BeadChips (Illumina, San Diego, California) were used to genotype the samples. Finally, a total of 731,458 SNPs were genotyped. Genotyping was performed at State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, China.

CNV Calling

Raw signal intensity files were first generated by the export function provided in Illumina GenomeStudio. Then we used UCSC Genome Browser's liftover tool to map SNPs to the newer reference genome assembly (http://genome.ucsc.edu/cgi-bin/hgLiftOver). By choosing proper population frequency of B allele (PFB) and GCmodel files, we then used PennCNV to identify CNVs and generate a quality control summary for each sample 14.

Quality Controls

During SNP genotyping, samples were removed if they (i) had an overall genotyping rate of <95%; or (ii) were duplicates or showed familial relationships (PI_HAT > 0.025). SNPs were excluded if they had (i) a call rate of <95%; (ii) a minor allele frequency (MAF) of <0.01; or (iii) P < 1 × 10-3 in a Hardy-Weinberg equilibrium test among controls. During individual quality control analysis, samples with highly variable signal intensity was removed, including (i) LRR_SD (standard deviation of Log R Ratio) >0.3; (ii) BAF_drift (measuring departure of the B Allele Frequency from the expected values) > 0.01; (iii) WF (waviness factor, the amount of dispersion in signal intensity) >0.05 or WF<-0.05; (iv) Number of CNVs >50 (number of called-CNVs). After individual quality control, a total of 753 cases and 982 controls with 51788 CNVs remained. During CNV calling, CNVs were removed if (i) number of SNPs spanning less than 10; (ii) length of CNV >50 kb. After call-level quality control, 8352 CNVs remained. Then we merged adjacent CNV calls using a threshold of gap<20% of total length during which 91 paired CNV were merged. We also removed the CNVs calls located in certain regions, including HLA regions, immunoglobulin regions, telomere regions, centromere regions. After quality control analysis, a total of 6932 CNV regions were able to be used for further analyses.

Statistical Analysis

Predicted CNVs from the sample genomic locus can have various start and end points across individuals. To make the analysis simpler, we divided the genome into copy number polymorphic regions (CNPRs). Each individuals is then assigned a copy number (CN) state for each CNPR according to the CNV predicted in that region, with CN = 2 if no CNV is predicted.14 In order to discriminate the CNV with a single duplication of one allele and single deletion of the other (CN=2) from non-CNV status (CN=2), we further encoded CN state with deletion and duplication variables.14 P-link 1.09 was used to perform association analysis. Logistic regression analysis was used to adjust for age. Fisher exact test was used if 25% of cells with expected count less than 5. Two-tail Bonferroni corrected p value of 7.2×10-6 (0.05/6932) was considered statistical significant.

Bioinformatics analyses

The CNVs found in this study were compared with the data based on normal populations in the Datablse of Genomic Variants (DGV, http://dgv.tcag.ca/dgv/app/home). Gene annotation in the CNV regions was performed by using UCSC gene browser. DAVID was used to perform functional annotation of affected genes located within CNVs 15.

Results

After quality control, a total of 753 cases (PCa patients) and 982 controls were available for further analyses. The characteristics of the study population are shown in Table . The mean age of case group was significantly higher than that of control group. The difference of age was adjusted in the further association analyses. Seven risk-associated CNVs were identified for PCa after association analyses (P value in bold<7.2×10-6). Another 34 CNVs were found to be potentially risk-associated CNVs (P<0.05) (Table ). After comparing with DGV database, all CNV regions were previously reported overlapped with common CNVs. Among the total 41 CNVs, 27 CNVs were risk variations and the rest 14 were found to be protective variations of PCa. In addition, 25 of the CNVs (19 duplications and 6 deletions) were significantly associated with PCa and were located in gene regions (all P values<0.05). The remaining 16 CNVs (9 duplications and 7 deletions) were located in intergenic regions. In order to analyze the global burden of risk CNVs in PCa patients, we compared the amount of patients harboring different types of risk CNVs with respect to 982 controls (Table . Risk CNVs were found in 51.5% of the PCa patients compared to 12.5% found in controls (P value=1.1×10-68). When the frequencies of deletions and duplications were analyzed separately, they were still higher in PCa patients than that in controls (9.3% vs. 2.9% in deletions with a P value = 8.2×10-9 and 46.5% vs. 10.3% in duplications with a P value = 1.1×10-68). Similar analysis was also performed in comparing the frequencies of the 14 protective CNVs in PCa patients versus in controls (Table . Lower frequencies of the protective CNVs were observed in PCa cases for deletions, duplications and all CNVs, respectively. Furthermore, we calculated P values for each CNVs and scores resulting from -log10 (P) are displayed in the genome-wide Manhattan plot in Figure . Genes located within or nearby the CNV regions were also used to perform network analysis through cBioportal using 491 TCGA prostate cancer data (Supplementary Figure 1). HLA-DRB1, HLA-DRB5 and GRM5 were clinical actionable genes, among which GRM5 was located at CNV regions and that duplication reached a genome-wide significance in our data. Since the target therapy medications (2-Methyl-6-(phenylethynyl) pyridine, MPEP) toward GRM5 have not been approved by Food and Drug Administration (FDA), our results suggested additional evidence for its potential value as a candidate target therapy for PCa.

Discussion

Studies have found several germline CNVs in Caucasian population (deletion at 2p24.3, 12q21.31, 15q21.3 and 20p13) and in African-American population (14q32.33) associated with predisposition of PCa 8, 11, 16, 17. However, PCa risk-associated CNVs have not been reported in Chinese population. To our knowledge, this is the first genome-wide association study between CNV and PCa in Chinese population by using the published GWAS data from our previous study. We found 41 CNVs were significantly associated with PCa and these CNVs conferring either risk or protection to PCa. More than half of the PCa risk-associated CNVs were located in gene regions. We used DAVID Bioinformatics Resources 6.8 to perform functional annotation and found that the genes were related with multiple biological processes, cellular components and molecular functions (e.g. protein binding, DNA binding, transferase activity, G-protein coupled receptor activity, virus receptor activity, hydrolase activity, nucleotide binding and so on). For instance, in the present study, we observed a significant association between 11q14.3 duplication with PCa. GRM5 located in 11q14.3 known as metabotropic glutamate receptor 5 is a Gq protein-coupled receptor that is widely expressed in the brain and activate PLCβ, resulting in intracellular Ca2+ release and protein kinase C (PKC) activation 18. It was reported that this pathway might relate to the proliferation and migration of tumor cells which included melanoma, oral squamous cell carcinoma, nasopharyngeal carcinoma, breast cancer, colon cancer, sarcoma and various kinds of neurologic tumors. In addition, such biological effect could be inhibited by glutamate antagonists (MPEP) 19-27. However, it had not been reported in prostate cancer. We also looked into the expression of GRM5 in prostate cancer cell lines in Cancer Cell Line Encyclopedia (CCLE) and found the mRNA expression level was 4.14-4.43 (robust multi-array normalized) among 7 different prostate cancer cell lines, while the expression of GRM5 was nearly 0 in normal prostate tissues in Genotype-Tissue Expression (GTEx) database. In the present study, the frequency of the duplication located in GRM5 region was 13.7% in cases while the frequency was much lower in controls (1.1%). Thus we hypothesized that germline CNV of 11q14.3 may influence the expression of GRM5 and lead to carcinogenesis of PCa. However, the function of GRM5 related CNV increasing PCa risk and the differences between germline and somatic variations of GRM5 should be further investigated. Higher frequencies of specific prostate cancer risk-associated deletions and duplications had been reported in previous studies 8, 12. Our results also revealed a relevant contribution of CNVs to PCa-risk with a higher burden of risk CNVs and a lower frequency of protective CNVs in cases compared to controls (all P values <0.05). This provided a genome-wide summary of CNV frequencies in PCa patients and controls which might be helpful to illustrate the importance of CNVs in prostate cancer. This case-control study was the first genome-wide CNV association study in Chinese population evaluating the relationship between CNVs and PCa. In this study, we analyzed CNV information from GWAS array by using a well-established method. Although, we failed to perform the confirmation evaluation of these CNVs at this stage, we were able to compare our findings with databases by using bioinformatics methods, and we agreed that additional validation studies should be conducted in the future. Moreover, functional studies would be necessary to validate the findings of the association between CNVs and PCa. Nevertheless, our study has provided additional information of carcinogenesis and potential treatment targets of prostate cancer.

Conclusion

41 CNVs were found associated or potentially associated with PCa in the Chinese population. These results provided additional information of genetic risks of PCa. Several CNV regions involved clinical actionable genes that might be potentially targeted by medications. Additional validation and functional studies are still warranted for these results. Supplementary figure S1. Click here for additional data file.
Table 1

Characteristics of the study population

VariablesCase (N=753)Control (N=982)P
Age (year, mean±SD)*71.4±8.0461.5±9.5<1.0×10-4
PSA (ng/mL, median, IQR)*26.6(13.2-79.1)0.6(0.3-1.2)<1.0×10-4
Gleason Score (No., %)
<7189(25.1)-
≥7500(66.4)-
Missing64(8.5)-

*Age and PSA at diagnosis for cases or at recruitment for controls.

Table 2

Copy number variations associated with prostate cancer in potential

Chr.StartEndCNV typeDirectionCasesControlORP valueGenes
192007917721077133Delrisk9012.602.39×10-3*CR593334,CR614976,LOC284441,ZNF430,ZNF486
116725820267488532Delrisk3222.086.77×10-3AK091996,BC110365,CR602694
105924237659438056Delrisk906.301.08×10-2intergenic
212031546820449022Delrisk50INF1.27×10-2intergenic
118820473488263638Delrisk40INF3.03×10-2GRM5
1621360842187363Delrisk40INF3.03×10-2CASKIN1,RAB26,SNORD60,TRAF7
16616990718025Delrisk40INF3.03×10-2*AK128777,AL360260,C16orf13,C16orf14,CCDC78
341325034187007Delrisk704.893.98×10-2SUMF1
67903180979086086Duprisk157337.027.86×10-32intergenic
118833631088384073Duprisk1031113.722.39×10-28GRM5
181473795814828909Duprisk45173.749.08×10-7ANKRD30B
63260176832664508Duprisk1938.902.71×10-5AK293020,HLA-DRB1,HLA-DRB5
102224922722376157Duprisk110INF6.63×10-5DNAJC1
151925740419408394Duprisk60INF5.28×10-3A26B1,DQ582260,M84131,POTEB
68028526080386902Duprisk60INF5.28×10-3LCA5
490031889169160Duprisk1135.136.25×10-3DEFB131,LOC650293
5109845447109992561Duprisk719.791.12×10-2FLJ43080
53214313832144121Duprisk32222.041.19×10-2KIAA0300,PDZD2
136313034263185384Duprisk50INF1.26×10-2intergenic
163207304032556470Duprisk50INF1.27×10-2BC041879,BC042588,DQ571479,TP53TG3b,X69637
115856381758634485Duprisk19122.222.93×10-2AX746988,FAM111B
128521984285281435Duprisk40INF3.03×10-2MGAT4C
181206236412122247Duprisk40INF3.03×10-2BX648696,BX649021
37600574576061858Duprisk40INF3.03×10-2intergenic
1067177736783787Duprisk724.893.98×10-2intergenic
1149608041149614444Duprisk724.893.98×10-2SELENBP1
76163160561631605Duprisk45173.749.08×10-7intergenic
193262045332675491Delprotective504210.158.96×10-54intergenic
115043284450586426Delprotective341790.252.54×10-16intergenic
123678544736801139Delprotective7430.223.47×10-5intergenic
192422824424354405Delprotective5260.273.13×10-3intergenic
91194122212175185Delprotective3150.283.20×10-2intergenic
59945596499486169Dupprotective1320.048.23×10-7intergenic
8145079175145131413Dupprotective3290.149.22×10-5PARP10,PLEC1
8114044747114181033Dupprotective17570.418.19×10-4CSMD3
1187833278187886901Dupprotective2170.164.59×10-3intergenic
4162057165162075793Dupprotective27640.581.85×10-2intergenic
122250987222573948Dupprotective080.002.44×10-2KIAA0528
192193698721995054Dupprotective080.002.44×10-2ZFS-3,ZNF208
4145164558145169643Dupprotective2130.213.31×10-2GYPA
75316464353216869Dupprotective070.004.64×10-2intergenic

P values and ORs are from a regional test at each locus. INF, infinite; Del, deletion; Dup, duplication. *Only five genes in that region were listed.

Table 3

Comparisons of the frequencies of risk and protective copy number variations detected in the cases and controls

GroupCases(n=753)Controls(n=982)P value
Risk CNVs, N2714
Deletions, n (%)70(9.3%)28(2.9%)8.24E-09
Duplications, n (%)350(46.5%)101(10.3%)4.38E-65
All CNVs, n (%)385(51.5%)123(12.5%)1.13E-68
Protective CNVs, N1114
Deletions, n (%)70(9.3%)512(52.1%)2.68E-78
Duplications, n (%)51(6.8%)208(21.2%)7.01E-17
All CNVs, n (%)118(15.7%)626(63.7%)1.84E-89

P values were calculated by Chi-test; N, number of CNVs; n, number of patients harboring CNVs.

  27 in total

1.  Genome-wide association study of prostate cancer mortality.

Authors:  Kathryn L Penney; Saumyadipta Pyne; Fredrick R Schumacher; Jennifer A Sinnott; Lorelei A Mucci; Peter L Kraft; Jing Ma; William K Oh; Tobias Kurth; Philip W Kantoff; Edward L Giovannucci; Meir J Stampfer; David J Hunter; Matthew L Freedman
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2010-10-26       Impact factor: 4.254

2.  Regulation of the mGluR5, EAAT1 and GS expression by glucocorticoids in MG-63 osteoblast-like osteosarcoma cells.

Authors:  N Kalariti; P Lembessis; E Papageorgiou; N Pissimissis; M Koutsilieris
Journal:  J Musculoskelet Neuronal Interact       Date:  2007 Apr-Jun       Impact factor: 2.041

3.  Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal.

Authors:  Jianjiong Gao; Bülent Arman Aksoy; Ugur Dogrusoz; Gideon Dresdner; Benjamin Gross; S Onur Sumer; Yichao Sun; Anders Jacobsen; Rileen Sinha; Erik Larsson; Ethan Cerami; Chris Sander; Nikolaus Schultz
Journal:  Sci Signal       Date:  2013-04-02       Impact factor: 8.192

4.  Environmental and heritable factors in the causation of cancer--analyses of cohorts of twins from Sweden, Denmark, and Finland.

Authors:  P Lichtenstein; N V Holm; P K Verkasalo; A Iliadou; J Kaprio; M Koskenvuo; E Pukkala; A Skytthe; K Hemminki
Journal:  N Engl J Med       Date:  2000-07-13       Impact factor: 91.245

5.  Expression of the metabotropic glutamate receptor 5 (mGluR5) induces melanoma in transgenic mice.

Authors:  Kyu Yeong Choi; Kai Chang; James M Pickel; John D Badger; Katherine W Roche
Journal:  Proc Natl Acad Sci U S A       Date:  2011-09-06       Impact factor: 11.205

6.  Characterization of germline copy number variation in high-risk African American families with prostate cancer.

Authors:  Elisa M Ledet; Xiaofeng Hu; Oliver Sartor; Walter Rayford; Marilyn Li; Diptasri Mandal
Journal:  Prostate       Date:  2012-10-11       Impact factor: 4.104

7.  Origins and functional impact of copy number variation in the human genome.

Authors:  Donald F Conrad; Dalila Pinto; Richard Redon; Lars Feuk; Omer Gokcumen; Yujun Zhang; Jan Aerts; T Daniel Andrews; Chris Barnes; Peter Campbell; Tomas Fitzgerald; Min Hu; Chun Hwa Ihm; Kati Kristiansson; Daniel G Macarthur; Jeffrey R Macdonald; Ifejinelo Onyiah; Andy Wing Chun Pang; Sam Robson; Kathy Stirrups; Armand Valsesia; Klaudia Walter; John Wei; Chris Tyler-Smith; Nigel P Carter; Charles Lee; Stephen W Scherer; Matthew E Hurles
Journal:  Nature       Date:  2009-10-07       Impact factor: 49.962

8.  Cancer incidence and mortality worldwide: sources, methods and major patterns in GLOBOCAN 2012.

Authors:  Jacques Ferlay; Isabelle Soerjomataram; Rajesh Dikshit; Sultan Eser; Colin Mathers; Marise Rebelo; Donald Maxwell Parkin; David Forman; Freddie Bray
Journal:  Int J Cancer       Date:  2014-10-09       Impact factor: 7.396

9.  Expression of glutamate receptor subunits in human cancers.

Authors:  Andrzej Stepulak; Hella Luksch; Christine Gebhardt; Ortrud Uckermann; Jenny Marzahn; Marco Sifringer; Wojciech Rzeski; Christian Staufner; Katja S Brocke; Lechoslaw Turski; Chrysanthy Ikonomidou
Journal:  Histochem Cell Biol       Date:  2009-06-14       Impact factor: 4.304

10.  A Genome Wide Study of Copy Number Variation Associated with Nasopharyngeal Carcinoma in Malaysian Chinese Identifies CNVs at 11q14.3 and 6p21.3 as Candidate Loci.

Authors:  Joyce Siew Yong Low; Yoon Ming Chin; Taisei Mushiroda; Michiaki Kubo; Gopala Krishnan Govindasamy; Kin Choo Pua; Yoke Yeow Yap; Lee Fah Yap; Selva Kumar Subramaniam; Cheng Ai Ong; Tee Yong Tan; Alan Soo Beng Khoo; Ching Ching Ng
Journal:  PLoS One       Date:  2016-01-05       Impact factor: 3.240

View more
  1 in total

1.  A Deep Neural Network for Gastric Cancer Prognosis Prediction Based on Biological Information Pathways.

Authors:  Jili Hu; Weiqiang Yu; Yuting Dai; Can Liu; Yongkang Wang; Qingfa Wu
Journal:  J Oncol       Date:  2022-09-09       Impact factor: 4.501

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.