Literature DB >> 35117583

Exome sequencing identified six copy number variations as a prediction model for recurrence of primary prostate cancers with distinctive prognosis.

Jie Liu1,2, Jiajun Yan3, Ruifang Mao2, Guoping Ren4, Xiaoyan Liu4, Yanling Zhang4,5, Jili Wang4, Yan Wang4, Meiling Li6, Qingchong Qiu2, Lin Wang2, Guanfeng Liu2, Shanshan Jin2, Liang Ma2, Yingying Ma2, Na Zhao2, Hongwei Zhang6, Biaoyang Lin1,2,7,8.   

Abstract

BACKGROUND: Prostate cancer (PCa) is a common type of malignancy, which represents one of the leading causes of death among men worldwide. Copy number variations (CNVs) and gene fusions play important roles in PCa and may serve as markers for the prognosis of this condition.
METHODS: We have presently conducted an analysis of CNVs and gene fusions in PCa, using whole exome sequencing (WES) data of primary tumors. For this, a cohort of 74 PCa patients, including 30 recurrent and 44 non-recurrent cases, were assessed during 5 years of follow-up.
RESULTS: We have identified 66 CNVs that were specific to the primary tumor tissues from the recurrent PCa group. Most of duplicated genomic regions were located in 8q2, suggesting that this chromosomal region could be important for the prognosis of PCa. Meanwhile, we have developed a random forest model, using six selected CNVs, with an accuracy near 90% for predicting PCa recurrence according to a 10-fold cross validation. In addition, we have detected 16 recurrent oncogenic gene fusions in PCa. Among these, ALK (ALK receptor tyrosine kinase)-involved fusions were the most common type of gene fusion (n=7). Four of these fusions (i.e., EML4-ALK, STRN-ALK, CLTC-ALK, ETV6-ALK) were previously identified in other cancer types, while the remaining three gene fusions (FRYL-ALK, ABL1-ALK, and BCR-ALK) were here identified.
CONCLUSIONS: Our findings expand the current understanding in regard to prostate carcinogenesis. Current data might be further used for assay development as well as to predict PCa recurrence, using primary tissues. 2020 Translational Cancer Research. All rights reserved.

Entities:  

Keywords:  Cancer recurrence; copy number variation (CNV); gene fusion; random forest; whole exome sequencing (WES)

Year:  2020        PMID: 35117583      PMCID: PMC8798897          DOI: 10.21037/tcr.2020.03.31

Source DB:  PubMed          Journal:  Transl Cancer Res        ISSN: 2218-676X            Impact factor:   1.241


Introduction

Second-generation sequencing is an efficient technique to identify gross genomic changes in cancer, including gene fusion events and copy number variations (CNVs). In this context, whole genome sequencing (WGS) would be the ideal application but, due to the cost and the complexity of data analysis, its applicability in many clinical settings has reduced. Instead, exome sequencing is a more cost effective approach which has been largely used to identify single nucleotide polymorphisms (SNPs) and small insertions and deletions (indels). Moreover, several studies have successfully utilized whole exome sequencing (WES) as a platform for the identification of CNVs (1), and a number of algorithms have been developed for this purpose, including ExomeCNV, VarScan2, exomeCopy, CoNIFER, ExomeDepth and XHMM (2). Lastly, exome sequencing can also be used to identify subsets of genomic rearrangements whose breakpoints are located in or near exon regions (3). CNVs can play important roles in prostate carcinogenesis. Using an array comparative genomic hybridization (aCGH) method, Saramäki and colleagues have confirmed common gains and losses along the genome of prostate cancer (PCa) samples, including gains at 1q, 7, 8q, 16p and 17q, and losses at 2q, 4p/q, 6q, 8p, 13q, 16q, 17p and 18q, in addition to a novel recurrent gain at 9p13-q21 (4). CNVs have been also related to the prognosis of PCa. According to the current literature, a CNV-based model can correctly predict 73% of relapsed cases and 75% of the cases with short PSA doubling time (PSADT, <4 months) (5). In another WGS study, Camacho and colleagues detected 64 recurrent regions of genomic loss and gain, and confirmed that the burden of SCNAs (somatic copy number alteration) was predictive for biochemical recurrence (BCR). At the same time, nine individual regions were also associated with relapse, including two deletions and seven gains (6). Gene fusion events also play important roles in PCas. Indeed, the first gene fusion event occurring in solid tumors was discovered in PCa, which involved the identification of the fused TMPRSS2-ETS gene (7). Another example relates to TMPRSS2:ERG gene fusions, which are present in ~50% of PCa cases. TMPRSS2:ERG is capable of inducing TGF-β signaling and epithelial to mesenchymal transition (EMT) in human PCa cells (8). Moreover, the incidence of TMPRSS2:ERG gene fusions have been also related to PCa prognosis. As reported, Kulda and colleagues showed that a combination between high level of prostate specific antigen (PSA) in preoperative serum and increased expression of TMPRSS2-ERG in tumor tissues (P=0.0001) was the most promising marker of recurrence risk after radical prostatectomy (9). Consistently, Nam and colleagues showed that, in a 5-year period, patients with TMPRSS2:ERG fusion had a significantly higher risk of recurrence than patients without this gene fusion (58.4% versus 8.1%, respectively; P<0.0001) (10). Still, TMPRSS2-ERG fusion was not a frequent genomic alteration among PCa patients from Asian countries such as China and Turkey and, in this case, it appears to have a limited significance in the clinical practice (11,12). In order to predict PCa recurrence after radical prostatectomy (RP), a variety of prediction models and biomarkers have been developed. For instance, Lalonde and colleagues developed a 100-locus genomic classifier (later refined to 31 functional loci) which could identify patients with elevated BCR rates (hazard ratio =2.73, P<0.001) (13). Oh and colleagues selected 16 significant predictive SNPs of BCR, from a large-scale exome wide association study, and indicated that the built GRS (genetic risk score) had additional predictive gain for BCR after RP (14). Furthermore, Campbell and colleagues performed a systematic review of prediction tools and suggested different tools for different purpose (15). For example, they recommended Stephenson nomograms for BCR, the CAPRA nomogram for aggressive BCR and metastasis for preoperative prognosis. For postoperative prognosis, different prediction models were suggested (15). For example, they suggested the CAPRA-Surgery (CAPRA-S), Stephenson, Kattan, Duke prostate cancer (DPC), and the Suardi nomograms for the prediction of BCR, the DPC nomogram for aggressive BCR (15). In the present study, we have conducted the WES of primary PCa samples with known follow-up outcomes (defined as recurrent or non-recurrent) after 5 or more years of clinical follow-up. Specifically, we sequenced the exome of 30 recurrent and 44 non-recurrent PCa patients and then analyzed the presence of CNVs and fusion genes specific to recurrent/non-recurrent PCa or common to both outcomes. These data were then used to develop a prediction model for PCa recurrence. Clinical assays using genes associated with this model might be further utilized for prediction of recurrence of PCa.

Methods

Patients and study design

This study has been conducted in accordance with the ethical standards, stated by the Declaration of Helsinki as well as by National and International guidelines. This work was approved by the authors’ institutional review board. Several thousands of PCa patients who underwent radical prostatectomy, between 2003 and 2010, at the First Affiliated Hospital of Zhejiang University were screened and classified as recurrent or non-recurrent PCa patients. Non-recurrent PCa patients were defined as those with Gleason score lower than 7 at diagnosis, and with no BCR within 5 years of follow-up. Recurrent PCa patients were those with Gleason score lower than 7 at diagnosis but the presence of BCR within 5 years of follow-up. Paraffin-fixed tissues sections were retrieved from the pathological archives. Tumor contents were sectioned and then analyzed by IHC (immunohistochemistry) staining. Tissue blocks with high tumor content were eventually used for DNA extraction. A total of 44 non-recurrent PCa samples and 30 recurrent PCa samples were involved in this study.

DNA library preparation and sequencing

QIAamp DNA FFPE Tissue Kit (Qiagen, USA) was used to extract DNA from FFPE (formalin-fixed, paraffin-embedded) tissue sections. The quality and concentration of the extracted DNA were evaluated using Nanodrop 2000 spectrophotometer and Invitrogen Qubit 2.0 fluorometer. Thereafter, a sequencing system covering about 180,000 exons of the human genome (SureSelect Human All Exon V5 & V5 + UTRs, Agilent) was used to capture the respective exomes, using 2 µg DNA per sample. A 2×76 bp paired-end sequencing was conducted using the Illumina HiSeq2500 machine.

CNV calling and annotation

presents a flowchart for the analysis of CNVs and gene fusion events originated from WES data. The XHMM (eXome-Hidden Markov Model) software suite (16) was used to annotate CNVs in the exome sequencing data. At first, the “best practice” pipeline (i.e., BWA + Picard + GATK) was used to generate processed alignment files. After running the cnv calling pipeline, the result file (xcnv) was analyzed using the bedtools (17), which computed both the depth and breadth of the coverage of CNV events along the whole genome. After that, duplicated (copy number gain) or deleted (copy number loss) regions (>10 kb), specific to the recurrent group, were filtered accordingly. For this, respective duplicated and deleted regions should be solely present in at least 10% samples of the recurrent group (i.e., absent in the non-recurrent group). In addition, these CNVs should be consistently present in all samples from the recurrent group. The Annovar software tool was further used to annotate these CNV events with the refgene and cytoband databases.
Figure 1

A flowchart for the analysis of CNVs and gene fusion events from WES data. CNVs, copy number variations; WES, whole exome sequencing.

A flowchart for the analysis of CNVs and gene fusion events from WES data. CNVs, copy number variations; WES, whole exome sequencing.

Building of random forest model with CNV events

Using the output file (vcf) of XHMM, the genotypes (DIP, DEL, DUP) related to the CNV events in every sample were picked. Thereafter, the frequency of three genotypes per CNV event in recurrent group and/or non-recurrent group were counted. The chi-square test was used to evaluate the correlation between CNV events and PCa recurrence (CNV events with P value <0.1 were selected). After that, CNV events which could not be genotyped in more than 10% of PCa samples were removed. The remaining CNV events were further used to build a random forest model, with missing genotypes replaced by the most common genotype (DIP, Diploidy). Meanwhile, the importance of predictors (CNV events) was assessed. Lastly, the function tool rfcv of the randomForest package was used to show the cross-validated prediction performance of models with sequentially reduced number of predictors (ranked by variable importance). Based on this modelling performance, a final panel of CNV events was established to build the prediction model (random forest) for PCa recurrence.

Gene fusion detection and annotation

To detect gene fusions in respective PCa samples, we used the software FACTERA (Fusion And Chromosomal Enumeration and Recovery Algorithm) and the GeneFuse program by adopting different algorithms. FACTERA directly processed alignment files (BAM) in three phases: identification of discordant read clusters, detection of breakpoints at nucleotide resolution and in silico validation of candidate fusions (18). By contrast, GeneFuse searched for reads that could be mapped into two different genes (left and right sides), but could not be entirely mapped to any position of the whole reference genome (19). These reads were called “supporting reads for potential gene fusions”. GeneFuse started to run with raw sequences (FASTQ) rather than alignment files (BAM). So, compared with FACTERA, GeneFuse could detect more potential gene fusions but required longer computing time. Accordingly, FACTERA aimed at whole genome to search for fusion events involved with all genes, while GeneFuse aimed at all COSMIC (Catalogue of Somatic Mutations in Cancer) curated fusion genes and druggable fusion genes to search for fusion events involved with specific genes. Afterwards, Oncofuse was used to predict the oncogenic potential of all gene fusions with a naive Bayes Network Classifier, trained and tested using the Weka machine learning package (20).

Results

WES of primary PCa, with different prognosis status, after a 5-year clinical follow-up

We sequenced a total of 44 non-recurrent and 30 recurrent samples by WES. The raw data was submitted to the SRA (Sequence Read Archive) database accordingly [BioProject ID PRJNA496568 (SUB4629515) and SRA submission ID SUB4637601]. After alignment of raw sequences to the human genome (hg19), both coverage rate and average depth were calculated. In most of PCa samples, coverage rate exceeded 99% and the mean coverage rate was 99.4%. The average sequence depth ranged from 20× to 100×, and the average depth of the targeted regions for all samples was 44.8. The detailed description of this dataset has been recently reported (21).

Copy number gains in the chromosome 8q2 and centromere protein F (CENPF) locus is detected in the recurrent PCa samples

Using XHMM, a total of 3,484 CNV events have been identified in the PCa samples. Specifically, 764 duplications and 603 deletions were detected in 76.7% (23/30) of recurrent PCa samples, with a median length of 33.29 and 55.94 kb, respectively. A total of 1,302 duplications and 815 deletions were detected in 68.2% (30/44) of the non-recurrent PCa samples, with a median length of 37.44 and 48.41 kb, respectively. To identify the CNVs specifically related to the recurrent group, we applied a filter (see Methods) and identified a total of 66 specific CNVs (2 deleted and 64 duplicated regions) specific to the recurrent group. Strikingly, most of the specific duplicated regions in the recurrent group were located on chromosome 8q2, including 8q21, 8q22, 8q23 and 8q24, which were previously shown to be associated with PCa progression and recurrence (22). Except for the well-known gene MYC (MYC Proto-Oncogene, BHLH Transcription Factor), some PCa-related genes were present in duplicated regions of 8q2, such as PTK2 (Protein Tyrosine Kinase 2), RAD21 (RAD21 Cohesin Complex Component), KIAA0196 (WASH Complex Subunit 5) and EIF3H (Eukaryotic Translation Initiation Factor 3 Subunit H). According to the COSMIC database (release v89, 15th May 2019), RAD21 was duplicated in 158 and deleted in 18 PCa samples. KIAA0196 was duplicated in 166 and deleted in 6 PCa samples, and EIF3H was duplicated in 158 and deleted in 26 PCa samples. No copy number variants involving the PTK2 gene were reported. In addition, we have found copy number gains of CENPF (centromere protein F) on chromosome 1q41 and RFPL4A (Ret Finger Protein Like 4A) on 19q13.42, which was reported to be duplicated in 31 and 16 PCa samples according to the COSMIC database, respectively. Although some of these specific CNVs had been reported in PCa, our findings suggest that these specific duplication or deletion events in recurrent PCa might affect the expression of genes in these chromosomal regions and play roles in the recurrence of PCa. For instance, duplication of 8q24 has been associated with MYC overexpression and characterized as an independent predictor of recurrence after RP (22).

Identification of common CNV events in both recurrent and non-recurrent PCa

In addition to CNV events specific to recurrent and non-recurrent PCa groups, we have also identified 7 duplications and 3 deletions, commonly present in both groups, using the criterion that the specified regions (over 10 kb) should be either duplicated or deleted in, at least, 2 recurrent and 2 non-recurrent PCa samples, with no deletion or duplication in any PCa samples, respectively (). We noted that one particular CNV resulted in CHD1 (Chromodomain Helicase DNA Binding Protein 1) gene deletion at 5q21.1. According to the COSMIC database, CHD1 deletion was detected in 135 recurrent PCa samples. As previously reported, CHD1 deletion might increase cell invasiveness in PCa, indicating a possible novel role of altered chromatin remodeling during prostate tumorigenesis (23).
Table 1

Repeated common CNV events in both the recurrent and the non-recurrent prostate cancers

RegionGeneChrStartEndBandType
Exonic RPTN, TCHH chr11520863331521272411q21.3Gain
Exonic OR4C11, OR4C16 chr11553395515537095311q11Gain
Exonic NOMO2 chr16185701671860403716p12.3Gain
Exonic TMPRSS11E, UGT2B17 chr469344564694163774q13.2Gain
Exonic ZFHX4 chr877763117777767848q21.11Gain
Exonic GLI4, ZNF696 chr81443580351443783718q24.3Gain
Exonic SPANXB1 chrX140084812140097840Xq27.1Gain
Exonic RHD chr125599065256343021p36.11Loss
Exonic KIR2DL1, KIR2DL3, KIR3DL3 chr19552466825528692419q13.42Loss
Exonic CHD1 chr598204186982408225q21.1Loss

Note: gain and loss represent copy number gain (duplication) and copy number loss (deletion) respectively. CNV, copy number variation.

Note: gain and loss represent copy number gain (duplication) and copy number loss (deletion) respectively. CNV, copy number variation.

Random forest modelling to predict PCa recurrence

To build a prediction model for PCa recurrence, the output file (vcf) of XHMM software which contained a total of 2,866 CNV events, derived from the genome of all PCa samples, were utilized. After statistical analysis of respective CNV genotypes in every PCa sample, a chi-square test indicated that 51 CNV events were correlated with PCa recurrence. Among these, 34 CNVs could not be genotyped in more than 10% of the PCa samples, and 2 CNV events were too small (DNA length less than 1 kb). As a result, the remaining 15 CNVs were used to build a random forest model. Based on a 10-fold cross validation, a prediction model was built with the six most important CNV events () and the cross-validated error rate was 11.3% (). In addition, we conducted a receiver operating characteristic (ROC) analysis of the prediction model, based on these same events, resulting in an AUC (area under curve) of 0.857 for predicting PCa recurrence ().
Table 2

The top 6 important CNV events used to build random forest model

IDBandRecurrentNon-recurrent
DIPDELDUPDIPDELDUP
chr1:12907321-12921169p36.2115162251
chr5:150632769-150647161q33.118412401
chr10:18112033-18122813p12.3319132600
chr7:102208555-102235842q22.116142430
chr11:60978529-60998542q12.216412901
chr8:141799474-141931119q24.319132900

Note: DIP, DEL, DUP indicate the genotype of CNVs and the numbers below indicate the frequency of every genotype in recurrent prostate cancers and non-recurrent prostate cancers. CNV, copy number variation; DIP, diploid; DEL, deletion; DUP, duplication.

Figure 2

The performance of the prediction model. (A) The cross-validated prediction performance of models with sequentially reduced number of predictors (ranked by variable importance) through a nested cross-validation procedure. X-axis, the number of the predictors used; Y-axis, the cross validated error rate. (B) ROC analysis showing the AUC of the model in predicting recurrence of prostate cancer. AUC, area under the curve; ROC, receiver operating characteristic.

Note: DIP, DEL, DUP indicate the genotype of CNVs and the numbers below indicate the frequency of every genotype in recurrent prostate cancers and non-recurrent prostate cancers. CNV, copy number variation; DIP, diploid; DEL, deletion; DUP, duplication. The performance of the prediction model. (A) The cross-validated prediction performance of models with sequentially reduced number of predictors (ranked by variable importance) through a nested cross-validation procedure. X-axis, the number of the predictors used; Y-axis, the cross validated error rate. (B) ROC analysis showing the AUC of the model in predicting recurrence of prostate cancer. AUC, area under the curve; ROC, receiver operating characteristic.

Identification of gene fusion events in PCa

We identified a total of 324 gene fusion events by FACTERA. Furthermore, 67 gene fusion events involving COSMIC curated fusion genes and 150 gene fusion events involving druggable fusion genes were identified by GeneFuse (). According to the COSMIC database, many of these gene fusions had been previously identified in different types of cancers, including (I) BCR-ABL1, CLTC-ALK, ETV6-ABL1 and RANBP2-ALK in haematopoietic and lymphoid-related cancers, (II) EML4-ALK in lung and thyroid cancers, (III) STRN-ALK in lung, peritoneum and thyroid cancers, (IV) RANBP2-ALK, TPM3-ALK and ATIC-ALK in soft, haematopoietic and lymphoid-related cancers, and (V) NTRK1-TPM3 in thyroid and colorectal cancers. Among these nine fusion genes, seven involved the ALK (ALK receptor tyrosine kinase) gene.
Table 3

The statistics of gene fusions detected by FACTERA and GeneFuse

TypeTotalPutative oncogenicNewNew + putative oncogenic
FACTERA32403240
GeneFuse_COSMIC67186718
GeneFuse_druggable1505912940

Note: The “New” category covers those that were not included in COSMIC database, and oncogenic gene fusions were predicted by Oncofuse. The statistical figures did not eliminate duplicates, which meant that the same fusion events detected in many samples would be re-counted. So the unique gene fusion events would be less than the statistics shown in the figure.

Note: The “New” category covers those that were not included in COSMIC database, and oncogenic gene fusions were predicted by Oncofuse. The statistical figures did not eliminate duplicates, which meant that the same fusion events detected in many samples would be re-counted. So the unique gene fusion events would be less than the statistics shown in the figure. After Oncofuse analysis, we discovered that 18 gene fusions (i.e., 16 unique events) involving COSMIC curated fusion genes and 59 gene fusions (i.e., 33 unique events) related to druggable fusion genes were potentially oncogenic (corrected P value <0.05) (). Among these potentially oncogenic gene fusions, 16 events were detected in at least 2 PCa samples which were called as “recurrent gene fusions” (). From these, 5 gene fusions (BCR-ABL1, EML4-ALK, STRN-ALK, CLTC-ALK and ETV6-ABL1) were previously reported in the COSMIC database, while ETV6-ALK was identified in epithelioid fibrous histiocytoma (24). Ten recurrent oncogenic gene fusions were novel, among which three new fusion partners of oncogenic ALK were discovered (i.e., FRYL-ALK, ABL1-ALK, and BCR-ALK).
Figure 3

A circos plot of potential oncogenic gene fusion events. A total of 49 unique potential oncogenic gene fusion events were mapped in and shown on this circos plot. The red lines represented intrachromosomal gene fusions and the blue lines represented interchromosomal gene fusions. The shade of the color reflects the frequency of gene fusions. For example, the color of the link between ETV6 and ABL1 was deeper than other links because ETV6-ABL1 fusion was detected in 7 prostate cancer samples.

Table 4

The 16 putative oncogenic gene fusions that were detected in at least 2 prostate cancer samples

TypeIDbcrno_bcrTotal (≥2)P valueCOSMIC
GeneFuse_COSMIC FRYL:ALK 1120.001947N
GeneFuse_druggable BCR:ABL1 0221.3E-05Y
GeneFuse_druggable ETV6:ALK 1125.57E-05N
GeneFuse_druggable EML4:ALK 0220.000138Y
GeneFuse_druggable BCR:ALK 0220.000161N
GeneFuse_druggable STRN:ALK 0220.000747Y
GeneFuse_druggable RANBP2:BCR 0220.00309N
GeneFuse_druggable TPM4:BCR 0220.009371N
GeneFuse_druggable BCR:EZR 1120.037034N
GeneFuse_druggable ETV6:EZR 0220.037034N
GeneFuse_druggable CLTC:ALK 1230.000247Y
GeneFuse_druggable EML4:NTRK3 0338.99E-05N
GeneFuse_druggable GOPC:NTRK3 0330.00241N
GeneFuse_druggable ABL1:ALK 1340.000138N
GeneFuse_druggable ABL1:STRN 2466.07E-05N
GeneFuse_druggable ETV6:ABL1 3471.3E-05Y

Note: “bcr” and “no_bcr” stand for recurrent prostate cancer group and non-recurrent prostate cancer group. The number below stands for the frequency of every gene fusion event in recurrent group and non-recurrent group. “P value” stems from OncoFuse and indicates whether this gene fusion is predicted to be oncogenic. “Y” and “N” in the last column indicate whether this gene fusion had been included in COSMIC database.

A circos plot of potential oncogenic gene fusion events. A total of 49 unique potential oncogenic gene fusion events were mapped in and shown on this circos plot. The red lines represented intrachromosomal gene fusions and the blue lines represented interchromosomal gene fusions. The shade of the color reflects the frequency of gene fusions. For example, the color of the link between ETV6 and ABL1 was deeper than other links because ETV6-ABL1 fusion was detected in 7 prostate cancer samples. Note: “bcr” and “no_bcr” stand for recurrent prostate cancer group and non-recurrent prostate cancer group. The number below stands for the frequency of every gene fusion event in recurrent group and non-recurrent group. “P value” stems from OncoFuse and indicates whether this gene fusion is predicted to be oncogenic. “Y” and “N” in the last column indicate whether this gene fusion had been included in COSMIC database.

Discussion

We presently conducted an analysis of CNVs and gene fusion events in PCa, using WES data from primary tumors of 74 PCa patients. This genomic data was originated from 30 recurrent and 44 non-recurrent PCa cases, after 5 years of clinical follow-up. Finally, we identified 66 specific CNVs (2 deletions and 64 duplications) that were specific to primary tumor tissues from the recurrent PCa group. Most of the 64 duplication regions were located at 8q2, including 8q21, 8q22, 8q23, 8q24, indicating that this chromosomal region is an important duplication site for PCa. As previously reported, 8q24 duplication has been associated with MYC overexpression and PCa progression (22). In addition, this duplication event may serve an independent predictor for PCa recurrence after RP (22). The chromosomal 8q24 region has been also associated with certain aggressive forms of PCa (25). Our findings suggest that duplication regions may have occurred specifically in primary PCa tissues that were recurrent after 5-year follow-up. This observation indicates that the 8q2 duplication might be an early event that predisposes PCa to recurrence, and not solely an outcome of PCa recurrence. These duplication regions could be further used to develop assays to predict PCa recurrence, using primary cancer tissues. Specifically, a high number of cancer-risk SNPs have been detected in the 8q24 region, which also harbors c-MYC (26,27). Another important gene located in the region is the prostate-specific noncoding RNA gene, PCaT-1, whose overexpression promotes proliferation, migration, invasion and inhibits apoptosis in PCa cells (28). In the recurrent PCa group, our data analyses have revealed that copy number gain of 8q2 includes other important genes, such as PTK2 (Protein Tyrosine Kinase 2), RAD21 (RAD21 Cohesin Complex Component) and KIAA0196 (WASHC5, WASH Complex Subunit 5). Based on a large validation cohort, PTK2 was duplicated in 1% of localized PCa and 35% of CRPC (castration resistant PCa). Interestingly, inhibition of PTK2 expression significantly affects cell proliferation and migration of PC3 PCa cells (29), suggesting that duplication of PTK2 might be associated with more aggressive PCa. Two additional genes from chromosomal 8q24—RAD21 and KIAA0196—have shown increased expression in PCa and were also duplicated in 30–40% of PCa xenografts and hormone-refractory tumors (30). Moreover, KIAA0196 expression appears to be significantly higher in tumors with gene duplication than those with no CNV. These data suggest that KIAA0196 and RAD21 are putative effector genes for the common duplication of 8q23-24 in PCa (30). The duplication of 8q2 region in recurrent PCa were consistent with our current understanding of the important roles of 8q2 in PCa. In addition, our findings also suggest a potential mechanism of action (MOA) of specific duplicated regions in PCa recurrence and prognosis, by enhancing the expression of genes in these particular chromosomal sites. We have also identified a duplicated 1q41 region in the recurrent PCa group. This region harbors CENPF (Centromere Protein F). It has been reported that CENPF acts synergistically with FOXM1 to promote PCa growth, and co-expression of FOXM1 and CENPF is a robust prognostic indicator of poor survival and metastasis (31). Overexpression of CENPF has been related to higher Gleason grade, advanced pathological tumor stage, accelerated cell proliferation, and lymph node metastasis, which corroborate its application as a potential biomarker for PCa aggressiveness (32). In addition, up-regulation of CENPF is an independent predictor of poor BCR-free survival (33). Therefore, the copy number gain of CENPF in 1q41 might promote its expression and contribute to prostate recurrence. Upon CNV analysis, we have also detected several common copy number gain/loss in both the recurrent and the non-recurrent PCa, including CHD1 deletion. As previously reported, CHD1 loss sensitizes PCa to DNA damaging therapy by promoting error-prone double-strand break repair (34). Loss of CHD1 causes defects on DNA repair and enhances the responsiveness to PCa therapy. In fact, CHD1 gene loss may serve as a marker for PCa patient stratification for targeted therapies, such as PARP inhibitors, which specifically affect tumors with HR (homologous recombination) defects (35). In summary, these common CNVs potentially shed light on the mechanism(s) of prostate carcinogenesis. Remarkably, we have also discovered a total of 324 gene fusion events by FACTERA. Furthermore, we identified 67 gene fusions related to COSMIC curated fusion genes, and 150 gene fusions related to druggable fusion genes, respectively, by GeneFuse (). We have additionally found fusion events involving the ALK gene, which was the most common type of gene fusion in our data. In this case, a total of 7 recurrent oncogenic ALK fusion events were annotated, where four of these (EML4-ALK, STRN-ALK, CLTC-ALK, ETV6-ALK) have been previously identified in other types of cancers, and the remaining three fusions (FRYL-ALK, ABL1-ALK, and BCR-ALK) are novel events. These seven ALK fusion events were identified, for the first time, in PCa. ALK is a receptor tyrosine kinase (RTK) that was first identified in anaplastic large-cell lymphoma (ALCL), due to a characteristic NPM-ALK fusion event (36,37). Since then, oncogenic ALK fusions have been identified in several types of cancer. Therefore, treatment with ALK inhibitors has been further considered in cancer therapy. ALK gene fusions have also been implicated in the pathogenesis of many kinds of tumors, including anaplastic large cell lymphomas, non-small cell lung cancer (38), melanomas (39), thyroid cancer (40), renal cell carcinoma (41), colorectal adenocarcinoma (42) and others (43). For instance, EML4-ALK translocation can lead to constitutive ALK kinase activity and represents an oncogenic addiction pathway in lung cancer (38). Intriguingly, ALK fusions have not been previously reported in PCa. So far, multiple fusion partners of ALK have been identified in anaplastic large cell lymphomas (ALCLs) and inflammatory myofibroblastic tumors (IMTs), including nucleophosmin (NPM), 5-aminoimidazole-4-carboxamide ribonucleotide formyltransferase/IMP cyclohydrolase (ATIC), tropomyosin 3 (TPM3), tropomyosin 4 (TPM4), clathrin heavy chain (CLTC), myosin heavy chain 9 (MYH), and the TRK-fused gene (TFG) (44). Renal cell carcinoma (RCC) also contains multiple ALK fusion partners, such as vinculin (VCL), tropomyosin 3 (TPM3), echinoderm microtubule associated protein-like 4 (EML4), hook microtubule tethering protein 1 (HOOK1) and striatin (STRN) genes (41). In particular, we have previously reported that the ALK gene is truncated (mostly by 5' deletion) in 18 of 281 (6.4%) Chinese PCa cases (45), however, no ALK fusion partners were identified. In our current study, we have detected three novel ALK fusion events, including FRYL:ALK. FRY-like transcription coactivator (FRYL) is a putative target of miR-1205, which is encoded by the long non-coding PVT1 gene located at 8q24, a PCa susceptibility chromosomal region (46). Indeed, transfection of oligonucleotide mimics of miR-1205 into androgen-independent PC3 cells can lead to a significant decrease in the expression of FRYL, suggesting that FRYL may act as a direct target of miR-1205. Loss of miR-1205 promotes a tumorigenic phenotype in PCa (46). Interestingly, we have observed that the FRYL:ALK fusion event negatively affects the binding site of miR-1205 in 3’UTR of the FRYL gene. Therefore, we concluded that FRYL:ALK gene fusion might protect FRYL expression from miR-1205-dependent inhibition, contributing to PCa tumorigenesis due to elevated FRYL expression.

Conclusions

Here we demonstrated that WES is a cost effective approach to identify CNVs and gene fusion events in PCa. Our findings that 8q2 duplication specifically occurs in primary PCa samples that were recurrent, after 5-year follow-up, suggests that this particular duplication is an early genetic event that predispose PCa to recurrence. Meanwhile, a random forest model with most informative CNV events (n=6) were developed. This model could effectively predict PCa recurrence, with an accuracy close to 90%. Furthermore, we found a high frequency of ALK gene fusion events in our dataset. Interestingly, all seven recurrent oncogenic ALK fusion events were here identified, for the first time, in PCa. Altogether, these findings expand our understanding of prostate carcinogenesis and may lead to the development of more specific assays focused on the prediction of PCa recurrence.
  45 in total

Review 1.  Optimum Tools for Predicting Clinical Outcomes in Prostate Cancer Patients Undergoing Radical Prostatectomy: A Systematic Review of Prognostic Accuracy and Validity.

Authors:  Jared M Campbell; Elspeth Raymond; Michael E O'Callaghan; Andrew D Vincent; Kerri R Beckmann; David Roder; Sue Evans; John McNeil; Jeremy Millar; John Zalcberg; Martin Borg; Kim L Moretti
Journal:  Clin Genitourin Cancer       Date:  2017-06-08       Impact factor: 2.872

2.  Whole Exome Sequencing Identifies Putative Predictors of Recurrent Prostate Cancer with High Accuracy.

Authors:  Jie Liu; Ruifang Mao; Guoping Ren; Xiaoyan Liu; Yanling Zhang; Jili Wang; Yan Wang; Meiling Li; Qingchong Qiu; Lin Wang; Guanfeng Liu; Shanshan Jin; Liang Ma; Yingying Ma; Na Zhao; Jiajun Yan; Hongwei Zhang; Biaoyang Lin
Journal:  OMICS       Date:  2019-06-13

Review 3.  A review of ALK-rearranged renal cell carcinomas with a focus on clinical and pathobiological aspects.

Authors:  Naoto Kuroda; Emiko Sugawara; Hironori Kusano; Yoshiaki Yuba; Kenji Yorita; Kengo Takeuchi
Journal:  Pol J Pathol       Date:  2018       Impact factor: 1.072

4.  Oncogenic ALK Fusion in Rare and Aggressive Subtype of Colorectal Adenocarcinoma as a Potential Therapeutic Target.

Authors:  Evgeny Yakirevich; Murray B Resnick; Shamlal Mangray; Michael Wheeler; Cynthia L Jackson; Kara A Lombardo; Jeeyun Lee; Kyoung-Mee Kim; Anthony J Gill; Kai Wang; Kyle Gowen; James Sun; Vincent A Miller; Philip J Stephens; Siraj M Ali; Jeffrey S Ross; Howard Safran
Journal:  Clin Cancer Res       Date:  2016-03-01       Impact factor: 12.531

5.  ALK Inhibitor Response in Melanomas Expressing EML4-ALK Fusions and Alternate ALK Isoforms.

Authors:  Kasey L Couts; Judson Bemis; Jacqueline A Turner; Stacey M Bagby; Danielle Murphy; Jason Christiansen; Jennifer D Hintzsche; Anh Le; Todd M Pitts; Keith Wells; Allison Applegate; Carol Amato; Pratik Multani; Edna Chow-Maneval; John J Tentler; Yiqun G Shellman; Matthew J Rioth; Aik-Choon Tan; Rene Gonzalez; Theresa Medina; Robert C Doebele; William A Robinson
Journal:  Mol Cancer Ther       Date:  2017-10-20       Impact factor: 6.261

6.  Somatic copy number alterations by whole-exome sequencing implicates YWHAZ and PTK2 in castration-resistant prostate cancer.

Authors:  Roopika Menon; Mario Deng; Kerstin Rüenauver; Angela Queisser; Martin Peifer; Martin Pfeifer; Anne Offermann; Diana Boehm; Wenzel Vogel; Veit Scheble; Falko Fend; Glen Kristiansen; Nicolas Wernert; Nicole Oberbeckmann; Saskia Biskup; Mark A Rubin; David Adler; Sven Perner
Journal:  J Pathol       Date:  2013-12       Impact factor: 7.996

7.  Upregulation of centromere protein F is linked to aggressive prostate cancers.

Authors:  Cosima Göbel; Cansu Özden; Cornelia Schroeder; Claudia Hube-Magg; Martina Kluth; Christina Möller-Koop; Emily Neubauer; Andrea Hinsch; Frank Jacobsen; Ronald Simon; Guido Sauter; Uwe Michl; Dirk Pehrke; Hartwig Huland; Markus Graefen; Thorsten Schlomm; Andreas M Luebke
Journal:  Cancer Manag Res       Date:  2018-11-09       Impact factor: 3.989

8.  Cumulative Evidence for Relationships Between 8q24 Variants and Prostate Cancer.

Authors:  Yu Tong; Tao Yu; Shiping Li; Fengyan Zhao; Junjie Ying; Yi Qu; Dezhi Mu
Journal:  Front Physiol       Date:  2018-07-16       Impact factor: 4.566

9.  GeneFuse: detection and visualization of target gene fusions from DNA sequencing data.

Authors:  Shifu Chen; Ming Liu; Tanxiao Huang; Wenting Liao; Mingyan Xu; Jia Gu
Journal:  Int J Biol Sci       Date:  2018-05-22       Impact factor: 6.580

10.  The positive immunostaining of TMPRSS2-ERG is not associated with unfavourable outcomes and biochemical recurrence after radical prostatectomy in Turkish patients.

Authors:  Gülistan Gümrükcü; Burcu Onamay Celik; Selahattin Çalişkan; Selvinaz Ökzara; Orhan Koca; Cağatay Tosun; Ayse Mine Önenerk; Fügen Aker
Journal:  Cent European J Urol       Date:  2018-08-16
View more
  1 in total

1.  Mutational Signature and Integrative Genomic Analysis of Human Papillomavirus-Associated Penile Squamous Cell Carcinomas from Latin American Patients.

Authors:  Luisa Matos Canto; Jenilson Mota da Silva; Patrícia Valèria Castelo-Branco; Ingrid Monteiro da Silva; Leudivan Nogueira; Carlos Eduardo Fonseca-Alves; André Khayat; Alexander Birbrair; Silma Regina Pereira
Journal:  Cancers (Basel)       Date:  2022-07-20       Impact factor: 6.575

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.