BACKGROUND: This study aimed to identify potential prostate cancer (PC)-related variations in gene expression profiles. METHODS: Microarray data from the GSE21032 dataset that contained the whole-transcript and exon-level expression profile (GSE21034) and Agilent 244K array-comparative genomic hybridization data (GSE21035) were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and copy-number variations (CNVs) were identified between PC and normal tissue samples. Coexpression interactions of DEGs that contained CNVs (CNV-DEGs) were analyzed. Pathway enrichment analysis of CNV-DEGs was performed. Drugs targeting CNV-DEGs were searched using the Drug-Gene Interaction database. RESULTS: In total, 679 DEGs were obtained, including 182 upregulated genes and 497 downregulated genes. A total of 48 amplified CNV regions and 45 deleted regions were determined. The number of CNVs at 8q and 8p was relatively higher in PC tissue. Only 16 DEGs, including 4 upregulated and 12 downregulated genes, showed a positive correlation with CNVs. In the coexpression network, 3 downregulated CNV-DEGs, including FAT4 (FAT atypical cadherin 4), PDE5A (phosphodiesterase 5A, cGMP-specific), and PCP4 (Purkinje cell protein 4), had a higher degree, and were enriched in specific pathways such as the calmodulin signaling pathway. Five of the 16 CNV-DEGs (e.g., PDE5A) were identified as drug targets. CONCLUSION: The identified CNV-DEGs could be implicated in the progression of human PC. The findings could lead to a better understanding of PC pathogenesis.
BACKGROUND: This study aimed to identify potential prostate cancer (PC)-related variations in gene expression profiles. METHODS: Microarray data from the GSE21032 dataset that contained the whole-transcript and exon-level expression profile (GSE21034) and Agilent 244K array-comparative genomic hybridization data (GSE21035) were downloaded from the Gene Expression Omnibus database. Differentially expressed genes (DEGs) and copy-number variations (CNVs) were identified between PC and normal tissue samples. Coexpression interactions of DEGs that contained CNVs (CNV-DEGs) were analyzed. Pathway enrichment analysis of CNV-DEGs was performed. Drugs targeting CNV-DEGs were searched using the Drug-Gene Interaction database. RESULTS: In total, 679 DEGs were obtained, including 182 upregulated genes and 497 downregulated genes. A total of 48 amplified CNV regions and 45 deleted regions were determined. The number of CNVs at 8q and 8p was relatively higher in PC tissue. Only 16 DEGs, including 4 upregulated and 12 downregulated genes, showed a positive correlation with CNVs. In the coexpression network, 3 downregulated CNV-DEGs, including FAT4 (FAT atypical cadherin 4), PDE5A (phosphodiesterase 5A, cGMP-specific), and PCP4 (Purkinje cell protein 4), had a higher degree, and were enriched in specific pathways such as the calmodulin signaling pathway. Five of the 16 CNV-DEGs (e.g., PDE5A) were identified as drug targets. CONCLUSION: The identified CNV-DEGs could be implicated in the progression of human PC. The findings could lead to a better understanding of PC pathogenesis.
Prostate cancer (PC) is the most frequently diagnosed malignancy in males, with 220,800 new cases and 27,540 deaths per year in the United States.[ Despite an improvement in therapeutic schedules, PC-related mortality remains high worldwide.[ In recent years, remarkable advances have been made in the investigation of molecular mechanisms underlying PC, such as the detection of copy-number variations (CNVs).CNV is a kind of gene mutation that causes genetic diversity and is defined as a variable copy number of a segment of deoxyribonucleotides (DNA) > 1 kb in size.[ CNV is universal among humans and is associated with complex diseases like PC by providing phenotypic diversity and conferring variable disease susceptibility.[ The analyses of transcriptomes and CNVs in PC have been reported, and common results include a positive correlation between Transmembrane Protease, Serine 2 (TMPRSS2)-V-Ets avian erythroblastosis virus E26 oncogene homolog (ERG) fusion gene, gene amplifications (e.g., androgen receptor gene, mitogen-activated protein kinase kinase 7 [MAP3K7], and maternal embryonic leucine zipper kinase [MELK]),[ predominant somatic gene mutations (e.g., phosphatase and tensin homolog [PTEN] and olfactomedin 4 gene [OLFM4]), and overexpression of differential display code 3 (DD3).[DD3 is a novel PC-specific gene, which has been renamed PCA3 to reflect its association with PC.[ Furthermore, CNVs at 15q21.3 and 12q21.31 are related to the activator protein 1 (AP1) and α-1,3-mannosyl-glycoprotein 4-β-N-acetylglucosaminyltransferase C (MGAT4C) genes, and 8p loss and 8q gain have been found to be associated with PC risk through altered regulation of cell proliferation and migration of PC cells.[ Nevertheless, the relationship between CNVs and gene expression in PC has not been fully elucidated.Microarrays constructed with genomic clones have been widely used to detect CNVs in vitro.[ Based on array-comparative genomic hybridization (aCGH), whole-transcript expression data, and microRNA expression data, Taylor et al[ identified NCOA2 (nuclear receptor coactivator 2) as an oncogene in approximately 11% of tumors and revealed that androgen-driven TMPRSS2-ERG fusion was associated with a narrow deletion on 3p14 implicated FOXP1, RYBP, and SHQ1 as potential cooperative tumor suppressors. However, the differentially expressed genes (DEGs) between normal samples and PC samples have not been fully characterized, and numerous CNVs in PC have not been described. The present study focused on focal DNA CNVs and DEGs between PC samples and normal tissue samples to identify differences that could contribute to the progression of human PC.In the present study, using aCGH and whole-transcript expression data deposited by Taylor et al,[ DEGs and CNVs between PC tissue samples and normal tissue were determined, and coexpression interactions between DEGs were analyzed. Furthermore, functional analysis of DEGs that contained CNVs was performed. Our bioinformatic approaches may be used to analyze other PC datasets (e.g., whole transcriptome expression data, microRNA expression data, and aCGH) or even datasets of any type of cancer to identify novel CNV genes.
Material and methods
Microarray data
The GSE21032 dataset was downloaded from the Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/geo/),[ which contained GSE21034 (whole-transcript and exon-level expression data), GSE21035 (Agilent 244K aCGH data), and GSE21036 (microRNA expression data). PC tissue and cell-line microarray data were included, but only those of tissue samples in GSE21034 and GSE21035 were used in the study.GSE21035 included 218 samples, which consisted of 181 primary PC samples and 37 metastatic PC samples; the associated platform was an Agilent-014693 Human Genome CGH Microarray 244A (GPL4091; Agilent Technologies, Palo Alto, CA). The preprocessed GSE21034 data, which were based on an Affymetrix Human Exon 1.0 ST Array platform (GPL5188; Affymetrix, Santa Clara, CA), consisted of 179 samples, including 29 normal tissue samples, 131 primary PC samples, and 19 metastatic PC samples. This study was conducted with approval from the Memorial Sloan-Kettering Cancer Center Institutional Review Board, and all patients provided informed consent.
Screening of DEGs
The preprocessed whole-transcript data underwent a log2 transformation. DEGs between the 368 tumor samples and the 29 normal tissue samples were identified using the linear models for microarray data package,[ which has been previously used to identify DEGs based on microarray data, and the generated P value for each gene was adjusted using the Benjamini–Hochberg (BH) method.[ Only genes meeting the selection criteria of |logFC (fold change)| ≥0.5 and adjusted P value <.05 were considered DEGs.
Preprocessing of aCGH microarray data
Raw aCGH data were preprocessed using the package cghMCR in R (version 1.34.0, http://www.bioconductor.org/packages/release/bioc/html/cghMCR.html).[ The method “minimum” in the package was used for background correction, and this method meant that any intensity, which was zero or negative after background subtraction, was set to a value equal to half the minimum of the positively corrected intensities for that array. The approach of locally weighted scatterplot smoothing, a modern modeling method based on classical methods such as linear and nonlinear least squares regression,[ was employed for data normalization.
Analysis of CNVs
The circular binary segmentation[ algorithm was applied for segment analysis of the preprocessed aCGH data to translate noisy intensity measurements into regions of equal copy number. Subsequently, significant CNV regions were predicted using GISTIC 2.0 (genomic identification of significant targets in cancer, http://www.gistic.org/), an online tool to identify genes implicated in somatic copy-number alterations that regulate cancer growth.[ All sequences were mapped to the reference human genome hg18 (NCBI build 36.1, http://genome.ucsc.edu), and only the sequences meeting the threshold for amplification of 0.3 or deletion of −0.3 were considered significant CNV regions.[
Integrated analysis of CNVs and DEGs
Correlations between DEGs were analyzed using the Pearson correlation coefficient.[ The raw P value was adjusted by the BH method. Pairs that satisfied an adjusted P value <.05 and |correlation| ≥0.8 were deemed to be significantly correlated. The coexpression network of these DEGs was then visualized using Cytoscape (version 3.2.0, U.S. National Institute of General Medical Sciences, http://cytoscape.org/), an open source software for integrating biomolecular interaction networks.[ In the network, a “node” represents a gene or protein, and a “line” represents an interaction between 2 nodes. The degree of each node equals the number of nodes that interact with it.Correlations between the genes that had CNVs and DEGs were performed. Copy number is usually understood to positively correlate with gene expression. Thus, DEGs that had a positive correlation with copy number were chosen for further analysis. Coexpression interactions of CNV–DEGs were extracted from the DEG coexpression network to construct a CNV–DEG coexpression network, which was visualized using Cytoscape. Herein, each CNV–DEG and its connected genes were defined as a network module. With a P value <.05 as the cut-off criterion, a Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis of genes in the network modules was performed using the Database for Annotation, Visualization, and Integrated Discovery, https://david.ncifcrf.gov/,[ and the network composed of pathways and CNV–DEGs was visualized using Cytoscape.
Search for drugs targeting CNV–DEGs
Potential drugs targeting CNV–DEGs were searched using the Drug–Gene Interaction database (DGIdb, http://dgidb.genome.wustl.edu/),[ which contains several common drug–gene interaction databases, including DrugBank,[ therapeutic target database,[ and pharmacogenomics knowledge database.[
Results
Identification of DEGs
In total, 679 DEGs were screened from the 368 PC samples compared with the 29 normal samples. Among these, 182 genes were upregulated and 497 were downregulated in PC tumor cells (supplementary Tumor vs Normal, DEGs).
Analysis of CNV regions
In total, 48 amplified CNV regions and 45 deleted regions were obtained (Fig. 1A and B). The amplified regions were located on chromosomes 1 to 12, 14 to 17, and 22. The peak density at 8q13.2 to 8q24.3 was relatively higher, and the peak height at 1q21.3 was clearly larger compared with normal samples (Fig. 1A). Meanwhile, the deleted regions were located on chromosomes 1 to 18, 21, and 22. Compared with the normal tissue samples, the peak density at 5q11.2 to 5q21.3, 6q15 to 6q21, 8p23.1 to 8p21.3, 13q14.2 to 13q22.1, and 16q22.1 to 16q24.2 was relatively higher (Fig. 1B). All identified CNVs were located on euchromosomes, not idiochromosomes. Amplified regions contained 226 genes and deleted regions contained 372 genes (supplementary files-amplified and deleted regions).
Figure 1
Amplified and deleted regions of copy-number variations (CNVs) on chromosomes. (A) Amplified regions of CNVs. (B) Deleted regions of CNVs. The numbers on the left vertical axis represent chromosomes, and the labels on the right represent varied regions of CNVs. The values on the top horizontal axis represent G-scores, and values on the horizontal axis at the bottom represent q values. The larger G-score and smaller q value indicate that the CNVs were more significant.
Amplified and deleted regions of copy-number variations (CNVs) on chromosomes. (A) Amplified regions of CNVs. (B) Deleted regions of CNVs. The numbers on the left vertical axis represent chromosomes, and the labels on the right represent varied regions of CNVs. The values on the top horizontal axis represent G-scores, and values on the horizontal axis at the bottom represent q values. The larger G-score and smaller q value indicate that the CNVs were more significant.
Correlations between DEGs and CNVs
To investigate whether DEGs had CNVs, DEGs were compared with CNV genes. Notably, expression changes of most DEGs showed no positive correlation with CNVs. Only 16 DEGs had a positive correlation with CNVs (CNV–DEGs), including 4 upregulated ones that had amplified CNV regions and 12 downregulated ones that had deleted regions (Table 1).
Table 1
DEGs mapped to CNV regions.
DEGs mapped to CNV regions.
Analysis of CNV–DEG coexpression network and module functions
To reveal interactions between CNVs and DEGs, coexpression interactions were extracted from the DEG coexpression network to construct the CNV–DEG coexpression network. There were 181 interactions and 113 nodes in the CNV–DEG coexpression network (Fig. 2). Eight nodes were CNV–DEGs, including PCP4 (Purkinje cell protein 4), PDE5A (phosphodiesterase [PDE] 5A, cGMP-specific), FAT4 (FAT atypical cadherin 4), RND3 (Rho family GTPase 3), ASXL3 (additional sex combs-like 3, Drosophila), HSPA4L (heat shock 70 kDa protein 4-like), ANXA8L1 (annexin A8-like 1), and ANXA8 (annexin A8); all of these genes were downregulated. Among the 8 CNV–DEGs, PCP4, PDE5A, and FAT4 had a higher degree. PDE5A interacted with both PCP4 and FAT4. Here, each CNV–DEG and its interacting genes were considered as a network module.
Figure 2
Coexpression network of differentially expressed genes (DEGs) with copy-number variations (CNVs) and DEGs without CNVs. Squares represent DEGs without copy-number variations. Rhombuses represent DEGs with copy-number variations. Lines represent the coexpression correlations between genes.
Coexpression network of differentially expressed genes (DEGs) with copy-number variations (CNVs) and DEGs without CNVs. Squares represent DEGs without copy-number variations. Rhombuses represent DEGs with copy-number variations. Lines represent the coexpression correlations between genes.To further explore functions of genes in the network modules, a KEGG pathway enrichment analysis was conducted. Three modules, including FAT4, PDE5A, and PCP4, as well as their interacting genes, were significantly enriched in pathways (Fig. 3). Common pathways included arrhythmogenic right ventricular cardiomyopathy, focal adhesion, hypertrophic cardiomyopathy, dilated cardiomyopathy, and calcium signaling pathway.
Figure 3
Network of differentially expressed genes with copy-number variations and their enriched pathways.
Network of differentially expressed genes with copy-number variations and their enriched pathways.
Drugs targeting CNV–DEGs
Drugs targeting the CNV–DEGs were searched using the DGIdb database. Five gene products of identified CNV–DEGs are drug targets, including MYC (v-myc avian myelocytomatosis viral oncogene homolog), PYCR1 (pyrroline-5-carboxylate reductase 1), ODC1 (ornithine decarboxylase 1), PDE5A, and RND3.
Discussion
In PC, deletions and increases in DNA copy-number contribute to alterations in the expression of tumor-related genes and are involved in PC pathogenesis.[ In the present study, amplified CNV regions containing 226 genes and deleted regions containing 372 genes were identified through genomic analyses of PC samples. The peak density of amplified regions at 8q13.2 to 8q24.3 in PC samples was higher than that in normal samples, which was consistent with the results reported by Sun et al.[ Compared with normal groups, the peak density of deleted regions at 5q11.2 to 5q21.3, 6q15 to 6q21, 8p23.1 to 8p21.3, 13q14.2 to 13q22.1, and 16q22.1 to 16q24.2 was higher in PC samples. In total, 182 upregulated and 497 downregulated genes were screened from PC samples compared with normal samples. Among the identified DEGs, 2 upregulated genes and 14 downregulated genes contained CNVs. In the CNV–DEG coexpression network, there were 8 CNV–DEGs, including FAT4, PDE5A, and PCP4, which had a higher degree and were significantly enriched in several pathways.FAT4 encodes a gene family member belonging to the protocadherin family, and this protein is involved in the regulation of planar cell polarity.[ Loss of Fat4 expression in tumors is related to humanFAT4 promoter methylation.[ To date, there is no evidence to demonstrate CNVs in FAT4 in PC. However, it has been demonstrated that loss of FAT4 expression is present in a large fraction of humanbreast tumor cell lines and primary tumor tissues, and Fat4 has been identified as a potential tumor suppressor gene in breast cancer.[ Moreover, single-nucleotide polymorphisms in FAT4 have been discovered in spontaneous pulmonary adenoma,[ and mutations in FAT4 have been detected in colorectal cancer.[ Therefore, FAT4 could play a role in PC progression through deleted CNVs.PDE5A encodes a cGMP-specific PDE, which belongs to the cyclic nucleotide PDE family. This PDE is correlated with the regulation of intracellular concentrations of cyclic nucleotides.[ PDE5 is mainly observed in glandular structures of the prostate.[ A previous study found that PDE5 inhibition reverses hypoxia-induced shedding of the immune stimulatory molecule MHC class I-related chain A and attenuates the growth of humanprostate tumors.[PDE5A was found to be targeted by a set of drugs in this study. As reported in previous studies, PDE5 inhibitors can mediate smooth muscle tone in the prostate.[ However, there is no evidence in previous studies to demonstrate CNVs in PDE5A in PC. In this study, PDE5A was coexpressed with FAT4 and PCP4. PCP4 encodes Purkinje cell protein 4, also known as PEP19.[ The hemizygous deletion of PCP4 has been discovered in the PC cell line LuCap35,[ and downregulation of PCP4 has also been observed in PC.[ In this study, PCP4 was significantly enriched in the calmodulin signaling pathway, along with FAT4 and PDE5A. The association between PCP4 and calmodulin signaling has been previously reported.[ Ca2+ is a ubiquitous intracellular messenger responsible for regulating multiple biological processes, including mitosis, cell death, gene transcription, contraction, and muscle relaxation.[ Calmodulin kinase II (CaMKII), a major target of the Ca2+/calmodulin second messenger system, promotes PC cell survival.[ Furthermore, Ca2+/calmodulin-dependent protein kinase kinase beta regulates PC cell growth via 5′-adenine monophosphate-activated protein kinase.[ Collectively, PCP4, as well as FAT4 and PDE5A, could exert pivotal functions related to PC progression via deleted CNVs and the calmodulin signaling pathway.In addition, as reported in the article of Taylor et al[ who are the contributors of the GSE21032 dataset used in this study, NCOA2 is amplified in primary and metastatic PC, and androgen-driven TMPRSS2-ERG fusion is correlated with a deletion at 3p14, which is related to FOXP1 (forkhead box P1), RYBP (RING1 and YY1 binding protein), and SHQ1 (H/ACA ribonucleoprotein assembly factor). The amplifications of chromosome 8 (spanning MYC) and chromosome 11 (CCND1) were increased, while the amplifications of 17p13.1 (TP53), 12p13.1 (CDKN1B), and others were reduced, and SPINK1 messenger ribonucleic acid was found to be frequently overexpressed in PC.[ These CNVs and genes were not identified in our study. Instead, we screened out the FAT4, PDE5A, and PCP4 genes that contained CNVs. We speculated that the different findings in our study compared with those in the Taylor et al study could be due to the different analytical methods used to evaluate CNVs. The analytical methods used in this study were applicable to identifying genes in somatic CNVs that regulate cancer growth and help identify genes targeted by CNVs that drive cancer growth. Conversely, Taylor methods were applicable to high-resolution genomic data, which are used to assess the extent and function of copy-number alterations in cancer. Moreover, in the Taylor et al[ article and The Cancer Genome Atlas (TCGA) paper,[ the most frequent molecular abnormalities were chromosomal arm-level copy-number alterations (gains and losses), while our study focused on focal DNA copy-number alterations, which enabled the discovery of 16 CNV–DEGs that could contribute to human PC progression. Most of the samples in the Taylor et al[ study were derived from non-Hispanic white subjects in a clinical T1c stage with a lower Gleason score. However, the majority of samples in the TCGA paper were from Caucasian subjects in a pathological T2c/T3a stage with a higher Gleason score.[ These differences could have contributed to the different results between this study and the TCGA paper.Despite the aforementioned results, there were several limitations in this study. The predicted results should be confirmed by laboratory data. In our future studies, the CNVs and DEGs will be validated by multiple ligation-dependent probe amplification and real-time polymerase chain reaction analyses. Coexpression relationships between PDE5A and PCP4/FAT4 will be confirmed by pull-down assays.In conclusion, amplified CNV regions containing 226 genes and deleted regions containing 372 genes were identified in PC tissue. The number of CNVs at 8q and 8p in PC samples was relatively higher than that in normal tissue samples. Furthermore, a total of 182 upregulated and 497 downregulated genes were identified between PC and normal tissues. Among the 16 CNV–DEGs, differences in FAT4 and PDE5A in PC were not previously reported. These 2 genes and PCP4 were coexpressed with multiple DEGs, and all of these were enriched in pathways, including the calmodulin signaling pathway. These CNV–DEGs could play important roles in the progression of human PC. The findings could lead to a greater understanding of PC pathogenesis.
Authors: Paul Shannon; Andrew Markiel; Owen Ozier; Nitin S Baliga; Jonathan T Wang; Daniel Ramage; Nada Amin; Benno Schwikowski; Trey Ideker Journal: Genome Res Date: 2003-11 Impact factor: 9.043
Authors: Daniela Sia; Yujin Hoshida; Augusto Villanueva; Sasan Roayaie; Joana Ferrer; Barbara Tabak; Judit Peix; Manel Sole; Victoria Tovar; Clara Alsinet; Helena Cornella; Brandy Klotzle; Jian-Bing Fan; Christian Cotsoglou; Swan N Thung; Josep Fuster; Samuel Waxman; Juan Carlos Garcia-Valdecasas; Jordi Bruix; Myron E Schwartz; Rameen Beroukhim; Vincenzo Mazzaferro; Josep M Llovet Journal: Gastroenterology Date: 2013-01-04 Impact factor: 22.682
Authors: Oskar W Rokhlin; Agshin F Taghiyev; K Ulrich Bayer; David Bumcrot; Victor E Koteliansk; Rebecca A Glover; Michael B Cohen Journal: Cancer Biol Ther Date: 2007-02-05 Impact factor: 4.742
Authors: Ashley E Ross; Luigi Marchionni; Milena Vuica-Ross; Chris Cheadle; Jinshui Fan; David M Berman; Edward M Schaeffer Journal: Prostate Date: 2011-02-25 Impact factor: 4.012
Authors: David S Wishart; Craig Knox; An Chi Guo; Savita Shrivastava; Murtaza Hassanali; Paul Stothard; Zhan Chang; Jennifer Woolsey Journal: Nucleic Acids Res Date: 2006-01-01 Impact factor: 16.971
Authors: Jun Yu; William K K Wu; Xiangchun Li; Jun He; Xiao-Xing Li; Simon S M Ng; Chang Yu; Zhibo Gao; Jie Yang; Miao Li; Qiaoxiu Wang; Qiaoyi Liang; Yi Pan; Joanna H Tong; Ka F To; Nathalie Wong; Ning Zhang; Jie Chen; Youyong Lu; Paul B S Lai; Francis K L Chan; Yingrui Li; Hsiang-Fu Kung; Huanming Yang; Jun Wang; Joseph J Y Sung Journal: Gut Date: 2014-06-20 Impact factor: 23.059
Authors: Wellerson R Scarano; Amina Bedrat; Luiz G Alonso-Costa; Ariana M Aquino; Bruno Fantinatti; Luis A Justulin; Luis F Barbisan; Paula P Freire; Jodi A Flaws; Lemos Bernardo Journal: Toxicol Sci Date: 2019-06-14 Impact factor: 4.849
Authors: Mariana R Botton; Xingwu Lu; Geping Zhao; Elena Repnikova; Yoshinori Seki; Andrea Gaedigk; Eric E Schadt; Lisa Edelmann; Stuart A Scott Journal: Hum Mutat Date: 2019-11 Impact factor: 4.878