Literature DB >> 28751665

Multidimensional integrative analysis uncovers driver candidates and biomarkers in penile carcinoma.

Fabio Albuquerque Marchi1, David Correa Martins2, Mateus Camargo Barros-Filho1, Hellen Kuasne1, Ariane Fidelis Busso Lopes3, Helena Brentani4, Jose Carlos Souza Trindade Filho5, Gustavo Cardoso Guimarães1, Eliney F Faria6, Cristovam Scapulatempo-Neto7, Ademar Lopes1, Silvia Regina Rogatto8,9.   

Abstract

Molecular data generation and their combination in penile carcinomas (PeCa), a significant public health problem in poor and underdeveloped countries, remain virtually unexplored. An integrativemethodology combin ing genome-wide copy number alteration, DNA methylation, miRNA and mRNA expression analysis was performed in a set of 20 usual PeCa. The well-ranked 16 driver candidates harboring genomic alterations and regulated by a set of miRNAs, including hsa-miR-31, hsa-miR-34a and hsa-miR-130b, were significantly associated with over-represented pathways in cancer, such as immune-inflammatory system, apoptosis and cell cycle. Modules of co-expressed genes generated from expression matrix were associated with driver candidates and classified according to the over-representation of passengers, thus suggesting an alteration of the pathway dynamics during the carcinogenesis. This association resulted in 10 top driver candidates (AR, BIRC5, DNMT3B, ERBB4, FGFR1, PML, PPARG, RB1, TNFSF10 and STAT1) selected and confirmed as altered in an independent set of 33 PeCa samples. In addition to the potential driver genes herein described, shorter overall survival was associated with BIRC5 and DNMT3B overexpression (log-rank test, P = 0.026 and P = 0.002, respectively) highlighting its potential as novel prognostic marker for penile cancer.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28751665      PMCID: PMC5532302          DOI: 10.1038/s41598-017-06659-1

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Data integration has emerged as a promising mechanism for the association of events affecting biological pathways and tumor development[1]. Due to the high mutational burden of cancer genomes, the distinction between driver and passenger genes is a challenge[2]. Passenger mutations were believed to not affect cell growth and to be accumulated during tumor progression. However, more recently, the accumulation of deleterious passengers has been suggested as being associated with carcinogenesis, leading to an immune response and cellular stress, as well as contributing to therapy-resistance[3, 4]. The identification of these biomarkers is hampered by genome complexity and limited investigation at a molecular level, which does not allow a broad overview of the different mechanisms involved in gene activity[5]. In order to overcome this issue, the combination of different molecular alterations in a comprehensive manner has been explored as a mechanism to reveal potential gene candidates associated with targeted pathways by therapeutic agents[6]. Recent initiatives, such as TCGA (The Cancer Genome Atlas) and ICGC (International Cancer Genome Consortium), rendered novel insights on cancer system biology compared with isolated events[7]. At the same time, the combination of heterogeneous datasets is particularly difficult to analyze. This encouraged initiatives to design a broad-spectrum of integrative analysis[6]. Module-based approaches have emerged as an efficient mechanism to reconstruct modules of co-regulated genes and their regulatory programs[8]. This methodology has been widely used to explore various biological contexts in cancer studies[9, 10]. Although novel targeted-genes for cancer therapy have been described, there is a lack of studies generating and combining molecular data of penile carcinomas. Penile carcinoma (PeCa) is a rare genitourinary malignancy in developed countries, with an incidence of 0.2 per 100,000 men in the United States and Europe[11, 12] and 2.9 to 6.8 cases per 100,000 in the Brazilian population[13]. The risk factors described in PeCa include phimosis with chronic inflammation, poor hygiene, smoking, low socioeconomic status, number of sexual partners, a history of genital warts and/or other sexually transmitted diseases[14]. Approximately 40% of PeCa are HPV positive, however, the impact of high-risk HPV in the prognosis has not been clarified[15]. Recently, in a large international study applied in 25 countries, HPV positivity was described in 33% of PeCa (N = 1010) and 87% of precancerous lesions (N = 85)[16]. Several prognostic factors have been established for PeCa patients, while regional inguinal lymph node involvement remains the most important predictor of an unfavorable prognosis[17]. Patients with locally advanced penile squamous cell carcinoma and lymph node metastasis are submitted to total or partial penile amputation, followed by primary chemotherapy or radiotherapy[18]. In a recent review, Burnett et al.[19] presented different surgical options available for penile-preservation at early stages and the need for patient monitoring. Besides having a curative effect even in the most advanced diseases, these surgical procedures results in a significant burden of social and psychological impact for the patient, highlighting the importance of identifying molecular markers for penile cancer therapy[20]. Previously, we reported an association between genomic alterations involving losses of 3p21.1-p14.3 and gains of 3q25.31-q29 with reduced cancer-specific and disease-free survival[16]. DLC1 and PPARG losses were also associated with worse prognosis. By integrating methylome and gene expression data, we described a panel of 54 genes with inverse correlation (including TWIST1, RSOP2, SOX3, SOX17, PROM1, OTX2, HOXA3 and MEIS1), pointing out driver epigenetic events associated with dysregulated pathways in PeCa, such as stem cells, Wnt/β-catenin signaling and cell cycle[21]. More recently, by assessing 23 PeCa patients we identified a high sensitivity and specificity of PPARG, MMP1 and MMP12 and hsa-miR-31-5p, hsa-miR-223-3p and hsa-miR-224-5p to distinguish penile tumors from normal tissue[22]. Next generation sequencing studies in penile carcinomas revealed the involvement of well-described genes, such as EGFR, PIK3CA, TP53 and CCND1 [23-25] and dysregulated miRNAs[26], all associated with cancer signaling pathways. In this study, we used a module-based integrative methodology to identify and contextualize driver genes in pathways involved in penile carcinogenesis aiming to explore genome-wide copy number alteration (CNA), DNA methylation, miRNA and gene expression (GE) data. To our knowledge, this is the first study with a multidimensional integrative approach using four molecular levels to identify novel driver candidates with potential therapeutic application.

Results

Integrative analysis to uncover candidate genes involved in PeCa development and progression

The first step of the integrative analysis resulted in 389 genes with varying score between 4.11 and 101.56, with expression levels regulated by at least two other molecular mechanisms. A cutoff of 48.72 was considered to separate 47 potential driver candidate genes used in the module-based analysis (Table 1) and 342 passenger candidates (Supplementary Table S1). Seventeen of 47 (36%) genes were mapped in chromosome 3, followed by chromosomes 2 (6/47) and 8 (6/47). The genomic alterations included 34 losses and 13 gains. Although the 47 driver candidates presented significant copy number alterations, with frequency varying from 35% to 90% of cases, 17 (36%) of them presented expression levels regulated by methylation, with a predominance of hypermethylation (16 of 47 genes). Fifty differentially expressed miRNAs were associated with the regulation of 47 driver candidates (Supplementary Table S2). hsa-miR-34a and hsa-miR-130b overexpression were predicted as regulators of the higher number of downexpressed driver candidates (17 and 16, respectively). Interestingly, 17 of 47 driver candidates (including 26 miRNAs) showed expression levels regulated by three molecular mechanisms investigated in this study (i.e. copy number alteration, methylation and miRNA).
Table 1

Forty-seven candidate genes selected in the first step of the integrative analysis.

GeneChromosomeLR CNAFC Gene ExpressionFC MethylationFC miRNAScore
IL17RD 3−0.33−2.711.628.03101.56
BCL2 18−0.42−2.701.457.5396.73
IGFBP5 2−0.35−3.131.606.5092.68
FOXP1 3−0.51−1.841.706.9888.17
MTUS1 8−0.39−1.371.946.8584.39
SOX7 8−0.57−1.141.976.6482.52
AR X−0.35−4.84−1.388.0179.20
MMP9 200.314.25−1.44−3.8779.00
GNG7 19−0.27−2.301.535.6678.00
SORCS3 10−0.54−4.27−1.407.7475.31
POLQ 30.733.010−8.5873.95
PPARG 3−0.36−3.581.403.4970.65
ERBB4 2−0.50−3.2307.9670.11
MMP1 110.575.440−5.5669.39
SVIL 10−0.44−2.051.424.5968.01
TACC1 8−0.43−2.8707.6965.98
SORBS1 10−0.26−5.0605.5865.38
OGG1 3−0.38−1.291.215.0363.29
TFRC 31.402.310−6.8063.07
RFX2 19−0.28−2.451.713.4463.00
CUL3 2−0.50−2.1607.7562.49
ZEB1 10−0.46−1.8108.0161.67
PML 150.391.430−8.3160.78
BIRC5 170.333.740−5.9460.07
ITPR1 3−0.49−2.5706.7258.69
CTDSP1 2−0.49−1.412.003.3457.94
PAX3 2−0.41−3.39−1.525.6856.84
DNMT3B 200.391.3907.3554.77
GSK3B 30.811.261.45−6.9954.38
RB1 130.970.922.12−7.1154.03
ARHGEF12 11−0.63−0.9807.2252.94
CADM1 11−0.78−2.27−1.265.7152.59
PPP2CB 8−0.63−0.861.403.6852.50
STAT1 20.261.941.75−6.5352.39
LRIG1 3−0.55−1.9206.2552.31
FGFR2 10−0.38−1.051.373.7352.24
TNFSF10 30.401.070−7.2352.23
ADAMTS9 3−0.52−1.89−1.376.2752.12
MKRN2 3−0.45−1.1007.1051.96
TGFBR2 3−0.72−1.3106.6251.88
ITGA9 3−0.42−2.5205.7051.80
FGFR1 8−0.60−3.0804.9151.54
RANBP3 19−0.43−0.921.433.6351.34
MYOM2 8−0.37−1.821.482.7451.26
CADM2 3−0.42−1.5506.5451.04
SERP1 30.421.221.63−6.7850.50
FNDC3B 30.631.201.34−6.3749.21

Scores higher than 48.72 (median among the lowest: 4.11 and highest: 101.56 scores) were used to select potential driver genes among the 389 candidates. Transcript levels were regulated by three molecular mechanisms in 32% (15 of 47) of the genes. Significant alterations considered in this step were identified in more than 20% of the patients. LR: Log ratio; FC: Log transformed Fold-Change; CNA: Copy Number Alteration.

Forty-seven candidate genes selected in the first step of the integrative analysis. Scores higher than 48.72 (median among the lowest: 4.11 and highest: 101.56 scores) were used to select potential driver genes among the 389 candidates. Transcript levels were regulated by three molecular mechanisms in 32% (15 of 47) of the genes. Significant alterations considered in this step were identified in more than 20% of the patients. LR: Log ratio; FC: Log transformed Fold-Change; CNA: Copy Number Alteration.

Modules identification and assignment of driver candidates

A matrix with 4,607 differentially expressed genes was submitted to clustering analysis using a Gibbs sampling algorithm[27], which generated 418 modules composed by 3,322 genes. Modules with less than five genes were removed, resulting in 113 modules and 2,846 genes (approximately 25 genes per module). The previously identified 47 driver candidates were assigned as regulators of the 113 selected modules, resulting in 6,561 driver-module associations that were ranked by score. The top 1% high-scoring association was selected for detailed analysis. Modules with less than 10% of passenger genes were filtered out, resulting in 19 modules associated with 16 driver candidates (STAT1, BIRC5, TNFSF10, PML, FGFR1, DNMT3B, ERBB4, RB1, AR, PPARG, SOX7, BCL2, IGFBP5, PAX3, CUL3 and RANBP3) (Table 2). Modules 55 (RB1 and IGFBP5), 49 (FGFR1 and BIRC5) and 97 (PPARG and AR) were predicted to be regulated by two driver candidates. A median of 41 genes, including 12 passengers, was detected in each module. The modules 52 (13/25), 73 (7/14), 92 (6/11), 95 (7/14) and 97 (8/16) presented more than 50% of passenger genes. The highest score was detected in module 38 (Score = 119.19), which is regulated by STAT1 gene (Table 2).
Table 2

Driver candidates identified in the module-based analysis.

RegulatorModuleAssignment ScoreTotal number of genes/modulePassengers/moduleMSigDB categoriesDrug(s)
STAT1 38119.19289 (32%)TF
BIRC5 16109.213814 (37%)
TNFSF10 4868.607017 (24%)CDM, CGF
PML 8567.123110 (32%)O, TF, TCGD
FGFR1 4966.733910 (26%)O, CDM, PK, TCGD
BIRC5 4965.833910 (26%)
DNMT3B 165.247122 (31%)
ERBB4 1064.553312 (36%)PKD
RB1 5564.223711 (30%)TS, TF
AR 5263.892715 (56%)TFD
PPARG 9763.84179 (53%)O, TF, TCGD
BIRC5 10256.8117420 (11%)
BCL2 253.364510 (22%)O, TCGD
BCL2 4052.926810 (15%)O, TCGD
IGFBP5 5549.013711 (30%)
SOX7 644.234318 (42%)
AR 9742.66179 (53%)TFD
PAX3 7342.20158 (53%)O, TF, HP, TCG
STAT1 7239.37178 (47%)TF
BIRC5 3437.532912 (41%)
CUL3 9531.17147 (50%)
RANBP3 1130.114515 (33%)

Sixteen genes were assigned as regulators of 19 modules, with scores ranging from 30.11 to 119.19. Modules with less than 10% of passenger genes were removed. Eight genes were described as drug targets and 62.5% (10/16) were annotated in at least one biological function using MSigDB categories. Modules 49, 55 and 97 were associated with two driver candidates. D: Drug target; TF: Transcription Factor; CDM: Cell Differentiation Markers; CGF: Cytokines and Growth Factor; O: Oncogenes; TCG: Translocated Cancer Genes; PK: Protein Kinase; TS: Tumor Suppressor; HP: Homeodomain Protein.

Driver candidates identified in the module-based analysis. Sixteen genes were assigned as regulators of 19 modules, with scores ranging from 30.11 to 119.19. Modules with less than 10% of passenger genes were removed. Eight genes were described as drug targets and 62.5% (10/16) were annotated in at least one biological function using MSigDB categories. Modules 49, 55 and 97 were associated with two driver candidates. D: Drug target; TF: Transcription Factor; CDM: Cell Differentiation Markers; CGF: Cytokines and Growth Factor; O: Oncogenes; TCG: Translocated Cancer Genes; PK: Protein Kinase; TS: Tumor Suppressor; HP: Homeodomain Protein.

In silico enrichment of biological process and pathways of the driver-module association

Nineteen modules with high scores were submitted to an enrichment analysis (GSEA, P < 0.05), revealing an association with 843 GO categories and 42 pathways (KEGG and Reactome). The majority of these modules was associated with cancer-related pathways. Biological processes associated with immune system, signal transduction, transcription factor activity, carbohydrate metabolism and cytoskeleton were the most significant categories in modules 2, 11, 48 and 102 (P-value varying from 1.14 × 10−8 to 3.31 × 10−14) (Supplementary Table S3). Pathways involved with tumor development, including homeostasis, immune system and apoptosis were predominantly enriched for modules 2 (10 pathways), 48 (5 pathways) and 55 (5 pathways) (Supplementary Table S4). Using the Molecular Signatures Database (MSigDB), 16 driver candidates were categorized in cancer-associated groups, such as cytokines and growth factors, transcription factors, homeodomain proteins, cell differentiation markers, protein kinases, translocated cancer genes, and also oncogenes and tumor suppressors. Ten (TNFSF10, FGFR1, PAX3, PML, PPARG, BCL2, ERBB4, AR, STAT1 and RB1) of 16 genes were annotated in at least one of these biological functions. Moreover, PML, ERBB4, AR, PPARG, BCL2 and FGFR1 were identified as drug-targets (DrugBank database) (Table 2). A protein-protein interaction (PPI) analysis revealed an association among the 16 driver candidates and 19 modules. RB1 and AR genes presented the higher connectivity degree with modules (19 edges), followed by CUL3 and PPARG (18 edges). Passenger genes were over-represented in all modules and the enrichment analysis revealed significant gene ontology categories associated with well-connected modules, such as module 6 (17 passengers and 12 GO categories), 48 (16 passengers and 9 GO categories) and 102 (19 passengers and 10 GO categories) (Fig. 1).
Figure 1

Protein-protein interaction (PPI) network illustrating the connectivity betwen 16 driver candidates and 19 modules. All driver candidates showed association with at least two modules, indicating a possible interconnection among driver genes activity in the regulation of an important biological process related with cancer development. RB1 and AR genes presented the highest connectivity with modules (19 edges), followed by CUL3 and PPARG (18 edges). Passenger genes and significant GO categories associated with modules were illustrated. Modules 48 and 102 presented associations with the largest number of GO categories (12 and 10, respectively). Transcription levels of the driver candidates selected and confirmed by RT-qPCR were highlighted with black outline in the rounded rectangle.

Protein-protein interaction (PPI) network illustrating the connectivity betwen 16 driver candidates and 19 modules. All driver candidates showed association with at least two modules, indicating a possible interconnection among driver genes activity in the regulation of an important biological process related with cancer development. RB1 and AR genes presented the highest connectivity with modules (19 edges), followed by CUL3 and PPARG (18 edges). Passenger genes and significant GO categories associated with modules were illustrated. Modules 48 and 102 presented associations with the largest number of GO categories (12 and 10, respectively). Transcription levels of the driver candidates selected and confirmed by RT-qPCR were highlighted with black outline in the rounded rectangle.

Cross-study validation test to identify PeCa driver candidates in other SCC histological subtypes

Transcriptomic profile of the selected 16 driver candidates identified in the set of PeCa was compared to the expression profile of head and neck (460 T and 44 N), cervical (19 T and 3 N) and lung squamous cell carcinomas (501 T and 51 N) using data retrieved from TCGA. As shown in the online Supplementary Table S5, 15 genes displayed significant differential expression in at least one tumor type (Limma, P < 0.05). Although not significant, RB1 overexpression was found in head and neck carcinomas.

Gene expression pattern of driver candidates by RT-qPCR

The cutoff of 44.54, which is the median value between the lowest (30.11) and highest (119.19) score, was used to rank the 22 associations, including 16 driver candidates and 19 modules. Ten selected transcripts were evaluated by RT-qPCR in the same set of 20 PeCa used in the arrays and in 33 cases selected for data validation. Significant overexpression of BIRC5, DNMT3B, PML, RB1, STAT1 and TNFSF10 genes was confirmed as altered (Fig. 2A; Supplementary Table S6). Downexpression of AR, PPARG, ERBB4 and FGFR1 was previously confirmed in the same set of PeCa samples used in this study[21]. Of note, BIRC5 and DNMT3B overexpression were associated with shorter overall survival (log-rank test, P = 0.026 and P = 0.002, respectively) (Fig. 2B). Although our set of patients includes a limited number of death events(11), the multivariate analysis confirmed DNMT3B as significantly associated with shorter overall survival, revealing its potential as a prognostic marker in PeCa (Cox Regression, P = 0.015 OR = 5.4 CI 1.4-21.2) (Supplementary Table S7).
Figure 2

(A) Boxplot representation of the RT-qPCR data performed in the microarray-independent set of samples, showing expected significant results for all assessed transcripts (Mann Whitney test *P < 0.05; **P < 0.01; ***P < 0.001). (B) Overall survival curves of BIRC5 and DNMT3B, demonstrating a significant short overall survival (log rank test P < 0.05) in patients who exhibit overexpression of these genes. Legend: NG: Normal glans; PeCa: Penile Carcinoma.

(A) Boxplot representation of the RT-qPCR data performed in the microarray-independent set of samples, showing expected significant results for all assessed transcripts (Mann Whitney test *P < 0.05; **P < 0.01; ***P < 0.001). (B) Overall survival curves of BIRC5 and DNMT3B, demonstrating a significant short overall survival (log rank test P < 0.05) in patients who exhibit overexpression of these genes. Legend: NG: Normal glans; PeCa: Penile Carcinoma.

Discussion

Studies implementing and exploring integrative approaches have unveiled therapy candidates in many tumors[26, 28]. Nevertheless, molecular mechanisms underlying penile cancer remain poorly understood. Here, an integrative study was performed with four molecular levels to investigate penile carcinoma. AR, BIRC5, DNMT3B, ERBB4, FGFR1, PML, PPARG, RB1 and STAT1 genes were highlighted as potential driver candidates. In addition, 40 miRNAs, including hsa-miR-130b and hsa-miR-320, were associated with the regulation of these genes. Recently, McDaniel et al.[23] reported somatic variants in 60 PeCa from 43 patients using a panel with 126 potentially actionable genes. The authors reported non-synonymous mutations covering well-described cancer related genes, including CDKN2A, TP53, PIK3CA, MYC and BRAF. In addition to the somatic variants, genomic profile was also investigated. In accordance to our data, RB1 gains and AR, FGFR1 and PPARG losses were previously reported. Ali et al.[24] described genomic variants of AR and RB1 genes using a panel of 236 cancer-related genes in 20 PeCa. We also reported significant low levels of AR expression (P < 0.001) and four overexpressed miRNAs (hsa-miR-31-5p, hsa-miR-34a- 5p, hsa-miR-205-5p and hsa-miR-185-5p) predicted to regulate this gene[22]. In the present study, AR and RB1 genes were identified as potential driver candidates, harboring genomic and epigenetic alterations that are consistent with the transcriptomic profile. Overall, these findings pointed out that multiple genetic events in AR and RB1 genes are involved in penile carcinogenesis. The tumor ability to rapidly acquire new mutations is a major limitation of targeted-gene therapies. The accumulation of alterations in passenger genes may alter the dynamics of cancer development and explain clinical events, including unconstrained tumor growth, spontaneous regression and long periods of dormancy[3]. Based on these evidences, we mapped the modules with passenger candidates and used the accumulation frequency to identify a driver-module association that would be more critical for penile carcinogenesis. Considering the final 22 driver-module association list, a high frequency of passengers was detected in modules enriched for cell cycle and immune-inflammatory response pathways. Increased levels of BIRC5 were associated with the regulation of the majority of these modules (16, 34, 49 and 102). This gene plays an important role in cell proliferation and apoptosis inhibition[29]. Due to the overexpression of BIRC5 during carcinogenesis, treatment targeting this gene has been increasingly recognized as a promising therapy to various cancers[30-32]. In our study, BIRC5 gene copy number gain and downexpression of hsa-miR-135a and hsa-miR-320 were significantly associated with increased expression levels of BIRC5 suggesting that multiple events could be involved with the aberrant activity of this gene in penile cancer. Although poorly investigated in PeCa, aberrant levels of miRNAs were recently reported. In 10 PeCa paired with adjacent non-tumor tissues, Zhang et al.[33] reported 56 miRNAs and their targets associated with the modulation of MAPK, p53, Wnt, TGF-β and PI3K-Akt signaling pathways. A miRNA-based signature including hsa-miR-1, hsa-miR-101 and hsa-miR-204 was significantly associated with lymph node metastasis and unfavorable prognosis in 24 PeCa samples[34]. Recently, by integrating miRNA and gene expression data (23 PeCa and 12 non-neoplastic penile tissues-NPT), we identified 255 mRNAs specifically regulated by 68 miRNAs[22]. In this study, 34 of 40 differentially expressed miRNA were associated with tumor development or progression. A recent study reported hsa-miR-34a as potential therapeutic target in human cancer with an essential role in tumor cell response to chemotherapeutic agents[35]. In addition to hsa-miR-34a regulation involved in the BCL2 activity, we found increased methylation levels of BCL2, suggesting its importance in penile carcinogenesis. Although not selected to validation as a top driver candidate in PeCa, BCL2 was one of the 47 driver candidates herein described. Effective anti-cancer immunotherapy strategies are hindered by the lack of knowledge of key driver mechanisms that contribute to tumor aggressiveness and immune system evasion. The association of multiple deregulated driver-pathways may allow the design of new strategies to target driver genes that promote cancer. A significant association of STAT1 (logFC = 1.9; Score = 119.19; Module 38) and PPARG (logFC = −3.5; Score = 63.84; Module 97) with immune-inflammatory pathways was detected. Furthermore, STAT1 copy number gain and PPARG loss were identified as a regulatory mechanism in combination with 11 differentially expressed miRNAs. An increased level of STAT1 has been reported as conferring cellular resistance to DNA-damaging agents and mediating tumor growth aggressiveness[36]. PPARG was recognized to play an important role in the immune regulation through its ability to inhibit the activity of various transcription factors, including signal transducers and transcription activators (STATs), leading to an anti-inflammatory phenotype[37, 38]. Copy number losses and miRNA regulation in genes associated with PPARG signaling pathway have the potential to contribute to an aberrant activity of the inflammatory process in PeCa. In addition, an association between driver genes and immune-inflammatory pathways may suggest a need for novel strategies to hit druggable genes and find new routes to evade the resistance acquired by tumor cells. Despite current advances in penile carcinomas investigation, effective markers clinically useful to identify lymph node metastasis, which increase morbidity in consequence of unnecessary inguinal lymphadenectomy, are poorly described in literature[39, 40]. In 2008, Kroon et al.[41] reported a 44-probe classifier able to identify patients with lymph node metastases compared with patients with no lymph nodes involvement. However, the validation set of cases was not able to confirm the results. In a previous study focusing on aberrant copy number alteration profile in PeCa, we reported a significant association between PPARG loss and lymph node metastasis in 46 PeCa samples[42]. Recently, we verified that higher MMP1 expression levels revealed to be a better predictor of lymph node metastasis than the clinical-pathological features[22]. Here, MMP1 was one of the 47 driver genes obtained in the integrative analysis, with increased expression levels possibly associated with copy number gains and down-expression of its miRNAs regulators (hsa-let-7b, hsa-let-7c, hsa-miR-342-3 and hsa-miR-134). The combination of different molecular mechanisms involved in the regulation of gene expression pointed out two overexpressed driver candidates, BIRC5 (Score = 109.21) and DNMT3B (Score = 65.24), associated with shorter overall survival (log-rank test, P = 0.026 and P = 0.002, respectively). Despite the small number of death event in our cohort (11 patients), a multivariate analysis confirmed that DNMT3B overexpression was significantly associated with poor overall survival (Supplementary Table S7). Increased expression levels of BIRC5, a member of the inhibitor of apoptosis protein (IAP), was described in a large number of malignancies[43-45]. The protein encoded by BIRC5 was reported to be involved in cell-cycle regulation and apoptosis by inhibiting caspase-3 and −7[46]. Both activities are associated with tumor progression and resistance to therapy, highlighting BIRC5 as a potential therapeutical target[47, 48]. In addition to the association of BIRC5 increased expression levels with unfavorable prognosis in PeCa, we identified copy number gains and downexpression of its miRNAs regulators (hsa-miR-320 and hsa-miR-135a) as alternative events to alter the gene expression levels and to contribute with the penile tumorigenesis. DNMT3B copy number gains and down-expression of its miRNAs regulators (hsa-let-7b, hsa-let-7c and hsa-miR-145) are able to explain the increased expression levels of this gene. DNA methyltransferase 3B participates in de novo DNA methylation and has been reported to be involved in multiples cancer types, including gastric and lung[49, 50]. Increased levels of DNMT3B and hsa-miR-145 downexpression were powerful in predicting shorter survival (P < 0.05) in endometrial carcinomas[51]. An additional evidence to highlight the importance of this gene was the association between DNMT3B overexpression and higher incidence of lymph node metastasis in oral squamous cell carcinomas[52]. In conclusion, novel driver candidates associated with penile carcinogenesis were described. The multidimensional analysis was able to identify high-scored genes, including STAT1 and PPARG, which have potential association with dysfunctional activity of the immune system. Higher connectivity with dysregulated modules was observed for AR gene. The well ranked BIRC5 and DNMT3B were significantly associated with unfavorable prognosis in PeCa patients.

Methods

Patients

Fifty three fresh-frozen usual penile squamous cell carcinomas obtained from untreated patients who underwent tumor resection at A.C.Camargo Cancer Center (São Paulo, Brazil), Barretos Cancer Hospital (Barretos, SP, Brazil) and Medical School, UNESP (Botucatu, SP, Brazil) were included in this study. Twenty-one normal glans were obtained from autopsies. Samples were submitted to cellular macrodissection and histology confirmation. PeCa samples composed of at least 80% of malignant cells were further processed. Written informed consent was obtained from all patients or relatives. This study was approved by The Human Research Ethics Committees of the Institutions (Protocols #1230/09: A.C. Camargo Cancer Center; #363–2010: Barretos Cancer Hospital, and #501.229/2013: Faculty of Medicine, Botucatu, SP, Brazil). Twenty PeCa samples were evaluated for genome-wide copy number alteration, DNA methylation, gene expression and miRNA screening. HPV status was established for all PeCa using the Linear Array HPV Test Genotyping (Roche Molecular Diagnostics). Fifteen of 53 patients were positive for high-risk HPV (16 or 18) infection. Patients were advised of the procedures and provided written informed consent. The Human Research Ethics Committees of A.C.Camargo Cancer Center (#1230/2009), Barretos Cancer Hospital (#363/2010) and Medical School-UNESP (#501.229/2013) approved this study. Clinical data is summarized in Table 3.
Table 3

Clinical and histopathological features of PeCa cases (N = 53). Patients were divided into two groups – dependent (N = 20) and independent (N = 33), according to the microarray analysis.

VariableDependent groupIndependent group
N (%)N (%)
Number 2033
Age (years)
Median (interquartile range)54.5 (46–74)55 (45–71)
Follow-up (months)
Median (interquartile range)8.5 (6–14)12.7 (8–29)
Histological grade
  I-II12 (60%)23 (79.3%)
  III8 (40%)6 (20.7%)
  ND04
HPV infection
  HPV-Positive# 5 (25%)12 (36.4%)
  HPV-Negative15 (75%)21 (63.6%)
Lymph node metastasis
  Presence9 (45.0%)11 (33.3%)
  Absence11 (55.0%)22 (66.7%)
Perineural Invasion
  Presence7 (35%)9 (27.3%)
  Absence13 (65%)24 (72.7%)
Vascular Invasion
  Presence3 (15%)4 (12.1%)
  Absence17 (85%)29 (87.9%)
T Stage
  1–210 (50.0%)24 (72.7%)
  3–410 (50.0%)9 (27.3%)
Clinical and histopathological features of PeCa cases (N = 53). Patients were divided into two groups – dependent (N = 20) and independent (N = 33), according to the microarray analysis.

Data acquisition and processing

The data used for integrative analysis were obtained from previous studies of our group[14, 22, 42]. Genome-wide copy number alteration analysis was performed using Agilent Human 4 × 44 K CGH Microarrays (Agilent Technologies)[42]. Aberrant regions were identified using Fast Adaptive States Segmentation Technique 2 (FASST2) algorithm, considering significance threshold of 1 × 10−6, three consecutive altered probes per segment and the average log2 ratio of +0.15 for copy gains and −0.15 for losses. Alterations detected in at least 20% of the samples were selected for the integrative analysis. Datasets are available in the Gene Expression Omnibus (GEO) database (GSE50134). Global gene expression data were obtained using the Whole Human Genome 4 × 44 K microarray platform (Agilent Technologies) as described by Kuasne et al.[21]. Data processing, quality control filter and normalization were obtained with Agilent Feature Extraction Software (v. 10.1.1.1) and an in-house pipeline. Genes with a mean log2 signal ratio (Cy3/Cy5) of ≥0.6 and ≤−0.6 within a 95% confidence interval (CI) were considered differentially expressed. Datasets are available in Gene Expression Omnibus (GEO) database (GSE57955). Genome-wide methylation was performed using the Agilent 244 K Human DNA Methylation Microarray (Agilent Technologies)[14]. Workbench Standard (Ed. 5.0.14, Agilent Technologies) software and Limma 3.30.6 method[53] algorithm were used for data normalization (Lowess) and statistical analyses, respectively. Significant genes were selected considering P < 0.05. Non-coding RNA (miRNA) analysis were conducted using TaqMan Human MicroRNA Assay System Set v2.0 (Applied Biosystems), as previously described[22]. Pfaffl model was used for data normalization[54], considering MammU6, RNU44 and RNU48 as reference. Statistical analysis considered a two-sample t-test (P < 0.01 and FDR < 0.05) to select differentially miRNA expression. Target transcripts of differentially expressed miRNAs were predicted by at least six algorithms using miRWalk 2.0 software (http://www.umm.uni-heidelberg.de/apps/zmf/mirwalk/). All experiments were performed in accordance to relevant guidelines and following manufacturer’s recommendations. Details of the labeling, hybridization and normalization of the experiments were described in the Supplemental Methods S1.

Integrative Analysis

The integrative analysis was performed in four major steps: (1) cross-platforms combination to select the most representative candidates; (2) module-based analysis, partitioning the expression matrix in significant modules of co-expressed genes; (3) driver-module assignment, to identify regulatory modules and their condition-specific regulator and (4) enrichment analysis, to select top driver-module association. The integrative strategy was illustrated in Supplementary Fig. 1. Differentially expressed genes (GE) were compared with genome-wide copy number alteration (CNA), methylation (Me) and miRNA (Mi) data to identify genes whose expression could be explained by aberrant genomic alterations and/or epigenetic events. The most representative candidates for module-based analysis were selected using the following formula:with α as a bonus to genes identified in at least 20% of the patients and β the bonus for event agreement. For each event concordant with the gene expression profile, an added bonus was assigned (2 for two events agreement, 3 for three events and 4 if gene expression is in accordance with the other three molecular levels). For example, one overexpressed gene mapped in an amplified region, having promoter hypomethylated and regulated by a downexpressed miRNA, has bonus 4. We considered a median value between the lowest and highest scores as cutoff to select potential driver candidates for module-based analysis. Genes with score below the cutoff were defined as potential passenger genes. In order to iteratively infer modules where genes systematically cluster together we used a Gibbs sampling procedure[27]. Modules with less than 5 genes were filtered out. The LeMoNe algorithm[55] was used to infer a set of regulatory programs for all selected modules assigning the set of candidate genes, previously identified as the modules’ potential regulators. Using regression tree, genes were associated to each node, composed by a set of genes having similar mean and standard deviation. A score was computed to each gene-module association and the top 1% high-scoring genes were investigated. The modules associated with the top candidates were mapped with passenger candidates to ensure the identification of modules with accumulation of secondary alterations and possibly involved in penile carcinogenesis. Modules with more than 10% of passenger candidates were selected for an enrichment analysis using Gene Set Enrichment Analysis (GSEA) algorithm considering GO (geneontology.org/), KEGG (http://www.genome.jp/kegg/) and Reactome (http://www.reactome.org/) databases. The statistical significance of module enrichment was defined with P < 0.05. The median value between the highest and lowest score was the cutoff to select the top potential driver candidates for expression levels validation using RT-qPCR.

Cross-validation of top driver candidates and comparison with other squamous cell carcinoma (SCC) available in TCGA

RNA-seq data of 1,423 squamous cell carcinomas samples (1,325 T and 98 NT) were retrieved from TCGA (http://tcga-data.nci.nih.gov/tcga/). A total of 397 samples were excluded for having indeterminate or non-squamous cell histology and Human Papilloma Virus (HPV) positivity. The final set of samples was composed by 1,026 patients (928 SCC HPV- and 98 NT), which included head and neck (415 T and 44 NT), cervical (12 T and 3 NT) and lung squamous cell carcinomas (501 T and 51 NT). The results obtained with the TCGA data were compared with the driver candidates selected in PeCa. Samples were obtained from “level 3”, quantified at the gene levels using RSEM (RNA-Seq by Expectation Maximization), and normalized with upper-quartile.

Gene expression analysis by RT-qPCR

A total of 53 PeCa (33 used in the array assays) and 21 NG (18 array independent) were used for RT-qPCR (following the MIQE guideline recommendations). As previously reported[56] , GUSB was selected as reference. Relative quantification of the expression levels was calculated according to Pfaffl method[54]. Non-parametric Mann-Whitney test was applied to compare tumors with NG samples according to the clinicopathological features.

Human protein-protein interaction and enrichment analysis

The protein-protein interaction was obtained from I2D[57] that contains 71,694 predicted interactions for human identified with high-throughput data analysis. NAViGaTOR software package (ophid.utoronto.ca/navigator) was used for visualizing and analyzing protein-protein interaction networks[58]. Molecular Signatures Database (MSigDB) (software.broadinstitute.org/gsea/msigdb) and DrugBank (http://www.drugbank.ca) were used to identify association among significant modules with specific gene families (cytokines and growth factors, transcription factors, oncogenes, tumor suppressors, homeodomain proteins, cell differentiation markers and protein kinases) and drug-target genes, respectively. Databases were consulted in October 2016.

Statistical analysis

Statistical analysis was performed using GraphPad Prism5 and SPSS version 21.0 software, adopting Two-Tailed Test and P < 0.05 value as significant. Overall survival analysis was performed using Kaplan-Meier and log rank test. High and low transcript levels in the tumor samples were defined as superior and inferior outliers compared with NG expression levels. Cross-validation of top driver candidates and comparison with other squamous cell carcinomas (SCC) available in TCGA were conducted using R 3.3.2 software[59] and Limma 3.30.6 method (two-tailed P < 0.05 and FDR < 0.05)[53].
  58 in total

1.  Prognostic factors in patients with penile carcinoma and inguinal lymph node metastasis.

Authors:  Walter Henriques da Costa; Renato Almeida Rosa de Oliveira; Thiago Borges Santana; Bruno Santos Benigno; Isabela Werneck da Cunha; Stênio de Cássio Zequi; Gustavo Cardoso Guimaraes; Ademar Lopes
Journal:  Int J Urol       Date:  2015-03-31       Impact factor: 3.369

Review 2.  Penile cancer: current therapy and future directions.

Authors:  G Sonpavde; L C Pagliaro; C Buonerba; T B Dorff; R J Lee; G Di Lorenzo
Journal:  Ann Oncol       Date:  2013-01-04       Impact factor: 32.976

3.  EAU guidelines on penile cancer: 2014 update.

Authors:  Oliver W Hakenberg; Eva M Compérat; Suks Minhas; Andrea Necchi; Chris Protzel; Nick Watkin
Journal:  Eur Urol       Date:  2014-11-01       Impact factor: 20.096

4.  Integrative analysis of microRNA and mRNA expression profiles in non-small-cell lung cancer.

Authors:  C Yang; C Sun; X Liang; S Xie; J Huang; D Li
Journal:  Cancer Gene Ther       Date:  2016-03-11       Impact factor: 5.987

5.  Epidemiologic study on penile cancer in Brazil.

Authors:  Luciano A Favorito; Aguinaldo C Nardi; Mario Ronalsa; Stenio C Zequi; Francisco J B Sampaio; Sidney Glina
Journal:  Int Braz J Urol       Date:  2008 Sep-Oct       Impact factor: 1.541

6.  hTERT and BIRC5 gene promoters for cancer gene therapy: A comparative study.

Authors:  Mikhail V Shepelev; Eugene P Kopantzev; Tatiana V Vinogradova; Eugene D Sverdlov; Igor V Korobko
Journal:  Oncol Lett       Date:  2016-06-15       Impact factor: 2.967

7.  Comprehensive Genomic Profiling of Advanced Penile Carcinoma Suggests a High Frequency of Clinically Relevant Genomic Alterations.

Authors:  Siraj M Ali; Sumanta K Pal; Kai Wang; Norma A Palma; Eric Sanford; Mark Bailey; Jie He; Julia A Elvin; Juliann Chmielecki; Rachel Squillace; Edward Dow; Deborah Morosini; Jamie Buell; Roman Yelensky; Doron Lipson; Garrett M Frampton; Peter Howley; Jeffrey S Ross; Philip J Stephens; Vincent A Miller
Journal:  Oncologist       Date:  2015-12-15

8.  RMaNI: Regulatory Module Network Inference framework.

Authors:  Piyush B Madhamshettiwar; Stefan R Maetschke; Melissa J Davis; Mark A Ragan
Journal:  BMC Bioinformatics       Date:  2013-10-22       Impact factor: 3.169

9.  Integrative multi-omics module network inference with Lemon-Tree.

Authors:  Eric Bonnet; Laurence Calzone; Tom Michoel
Journal:  PLoS Comput Biol       Date:  2015-02-13       Impact factor: 4.475

Review 10.  Epigenetic mechanisms in penile carcinoma.

Authors:  Hellen Kuasne; Fabio Albuquerque Marchi; Silvia Regina Rogatto; Ilce Mara de Syllos Cólus
Journal:  Int J Mol Sci       Date:  2013-05-23       Impact factor: 5.923

View more
  16 in total

1.  Nuclear loss and cytoplasmic expression of androgen receptor in penile carcinomas: role as a driver event and as a prognosis factor.

Authors:  Hellen Kuasne; Mateus C Barros-Filho; Fábio A Marchi; Sandra A Drigo; Cristovam Scapulatempo-Neto; Eliney F Faria; Silvia R Rogatto
Journal:  Virchows Arch       Date:  2018-08-11       Impact factor: 4.064

Review 2.  Immune-based therapies in penile cancer.

Authors:  Vidhu B Joshi; Philippe E Spiess; Andrea Necchi; Curtis A Pettaway; Jad Chahoud
Journal:  Nat Rev Urol       Date:  2022-07-18       Impact factor: 16.430

3.  Upregulated miRNAs on the TP53 and RB1 Binding Seedless Regions in High-Risk HPV-Associated Penile Cancer.

Authors:  Jenilson da Silva; Carla Cutrim da Costa; Ingryd de Farias Ramos; Ana Carolina Laus; Luciane Sussuchi; Rui Manuel Reis; André Salim Khayat; Luciane Regina Cavalli; Silma Regina Pereira
Journal:  Front Genet       Date:  2022-06-24       Impact factor: 4.772

4.  Molecular characterization and integrative genomic analysis of a panel of newly established penile cancer cell lines.

Authors:  Qiang-Hua Zhou; Chuang-Zhong Deng; Zai-Shang Li; Jie-Ping Chen; Kai Yao; Kang-Bo Huang; Ting-Yu Liu; Zhuo-Wei Liu; Zi-Ke Qin; Fang-Jian Zhou; Wenlin Huang; Hui Han; Ran-Yi Liu
Journal:  Cell Death Dis       Date:  2018-06-07       Impact factor: 8.469

5.  PMAMCA: prediction of microRNA-disease association utilizing a matrix completion approach.

Authors:  Jihwan Ha; Chihyun Park; Sanghyun Park
Journal:  BMC Syst Biol       Date:  2019-03-20

Review 6.  TGF-β and microRNA Interplay in Genitourinary Cancers.

Authors:  Joanna Boguslawska; Piotr Kryst; Slawomir Poletajew; Agnieszka Piekielko-Witkowska
Journal:  Cells       Date:  2019-12-12       Impact factor: 6.600

7.  Copy Number Analysis Reveal Genetic Risks of Penile Cancer.

Authors:  Yongbo Yu; Chengwen Gao; Yuanbin Chen; Meilan Wang; Jianfeng Zhang; Xiaocheng Ma; Shuaihong Liu; Hang Yuan; Zhiqiang Li; Haitao Niu
Journal:  Front Oncol       Date:  2020-12-14       Impact factor: 6.244

8.  The Biomarker Potential of Caveolin-1 in Penile Cancer.

Authors:  Andrej Panic; Henning Reis; Alina Wittka; Christopher Darr; Boris Hadaschik; Verena Jendrossek; Diana Klein
Journal:  Front Oncol       Date:  2021-03-31       Impact factor: 6.244

9.  Molecular stratification by BCL2A1 and AIM2 provides additional prognostic value in penile squamous cell carcinoma.

Authors:  Xingliang Tan; Dong Chen; Shengjie Guo; Yanjun Wang; Yuantao Zou; Zhiming Wu; Fangjian Zhou; Zike Qin; Zhuowei Liu; Yun Cao; Chunhua Lin; Gangjun Yuan; Kai Yao
Journal:  Theranostics       Date:  2021-01-01       Impact factor: 11.556

Review 10.  Understanding genomics and the immune environment of penile cancer to improve therapy.

Authors:  Ahmet Murat Aydin; Jad Chahoud; Jacob J Adashek; Mounsif Azizi; Anthony Magliocco; Jeffrey S Ross; Andrea Necchi; Philippe E Spiess
Journal:  Nat Rev Urol       Date:  2020-08-18       Impact factor: 14.432

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.