| Literature DB >> 35625354 |
Rebecca Morgan1, Dulcie Keeley1, E Starr Hazard2, Emma H Allott3, Bethany Wolf4, Stephen J Savage5,6, Chanita Hughes Halbert7,8, Sebastiano Gattoni-Celli6,9, Gary Hardiman1,4,10.
Abstract
Prostate cancer is one of the most prevalent cancers worldwide, particularly affecting men living a western lifestyle and of African descent, suggesting risk factors that are genetic, environmental, and socioeconomic in nature. In the USA, African American (AA) men are disproportionately affected, on average suffering from a higher grade of the disease and at a younger age compared to men of European descent (EA). Fusion genes are chimeric products formed by the merging of two separate genes occurring as a result of chromosomal structural changes, for example, inversion or trans/cis-splicing of neighboring genes. They are known drivers of cancer and have been identified in 20% of cancers. Improvements in genomics technologies such as RNA-sequencing coupled with better algorithms for prediction of fusion genes has added to our knowledge of specific gene fusions in cancers. At present AA are underrepresented in genomic studies of prostate cancer. The primary goal of this study was to examine molecular differences in predicted fusion genes in a cohort of AA and EA men in the context of prostate cancer using computational approaches. RNA was purified from prostate tissue specimens obtained at surgery from subjects enrolled in the study. Fusion gene predictions were performed using four different fusion gene detection programs. This identified novel putative gene fusions unique to AA and suggested that the fusion gene burden was higher in AA compared to EA men.Entities:
Keywords: African American; African descent; European American; RNA-sequencing; biomarkers; fusion genes; prostate cancer
Year: 2022 PMID: 35625354 PMCID: PMC9137560 DOI: 10.3390/biology11050625
Source DB: PubMed Journal: Biology (Basel) ISSN: 2079-7737
Patient demographics and clinical characteristics.
| European American ( | African American | |
|---|---|---|
|
| 61.4 (4.78) | 63.4 (5.21) |
|
| ||
| 3 + 3 | 4 (23.5) | 2 (20.0) |
| 3 + 4 | 10 (58.8) | 7 (70.0) |
| 4 + 3 | 1 (5.88) | 0 (0.00) |
| 4 + 3 + 5 | 2 (11.8) | 1 (10.0) |
|
| ||
| Confined | 15 (88.2) | 7 (70.0) |
| Extension | 2 (11.8) | 3 (30.0) |
|
| ||
| N0 | 13 (76.5) | 7 (70.0) |
| NX | 4 (23.5) | 2 (20.0) |
Figure 1Workflow schematic. Fastq files were inputted to FastQC and Cutadapt. Four separate gene fusion tools predicted gene fusions that were compared using jvenn.
Comparison of the input sequencing data, patient samples, and genome build specific to our analyses.
| Program | STAR-Fusion | FusionCatcher | JAFFA | ChimeraScan |
|---|---|---|---|---|
|
| Single and paired end | Single and paired end | Single and paired end | Paired end only |
|
| Y | N | Y | N |
|
| 27 samples | 17 samples | 27 samples | 17 samples |
| 10 AA | 10 AA | 10 AA | 10 AA | |
| 17 EA | 7 EA | 17 EA | 7 EA | |
|
| GRCh38 | GRCh38 | GRCh38 | GRCh37 |
Fusion detection pipelines.
| Program | STAR-Fusion | FusionCatcher | JAFFA | ChimeraScan |
|---|---|---|---|---|
|
| 2017 | 2012 | 2015 | 2011 |
|
| Python script | Java script | C++ script | |
|
| STAR | Bowtie, Bowtie2, Liftover, STAR, Velvet, FaToTwoBit, SAMtools, Seqtk, Numpy, Biopython, Picard, Parallel | Bpipe, Velvet, Oases, SAMtools, | Bowtie |
|
| Transcriptome | Genome and transcriptome | Transcriptome | Genome and transcriptome |
|
| Single end (suggested reads > 100 bp) and paired end | Single end (reads > 130 bp) and paired end | Single end (any length) and paired end | Paired end |
|
| Read mapping | Read mapping | Read mapping | Read mapping |
|
| RNA-Seq | RNA-Seq | RNA-Seq | RNA-Seq |
|
| GRCh38 | GRCh38 | GRCh38 | GRCh37 |
Figure 2Comparison of fusion genes predicted by the different pipelines in African American patients. Total nonredundant fusions = 9203.
Figure 3Comparison of fusion genes predicted by the different pipelines in European American patients. Total nonredundant fusions = 2809.
Figure 4(A) Comparison of fusions detected by FusionCatcher, JAFFA, and ChimeraScan in AA patients. Two shared fusion genes were predicted by these tools in AA patients (NAIP:OCLN and PDE1C:DNAJC6). (B) Comparison of fusions detected by FusionCatcher, JAFFA, and ChimeraScan in EA patients. Four shared fusion genes were predicted by these tools in EA patients (Shared fusions—NAIP:OCLN, FOXP2:CREM, KANSL1:ARL17B, and KANSL1:ARL17A). (C) Comparison of detected fusions in AA and EA patients across all four tools. Low-confidence fusions have been excluded from all the comparisons.
Comparison of fusion genes predicted by the four detection tools. (Low-confidence fusions detected by JAFFA were excluded from this comparison).
| Number of Different Fusion Genes Detected by Each Tool | |||||
|---|---|---|---|---|---|
| Group | STAR-Fusion | FusionCatcher | JAFFA | ChimeraScan | Total |
|
| 26 | 4753 | 3290 | 1134 | 9203 |
|
| 10 | 278 | 1530 | 991 | 2809 |
|
| 36 | 5031 | 4820 | 2125 | |
Novel predicted fusion genes identified by each tool. Race, patient samples, and the tools used to obtain the fusions are summarized.
| Fusion Gene | Race | Patient ID | Tools |
|---|---|---|---|
|
| AA | AA 20, 22, 24 | ChimeraScan, FusionCatcher, JAFFA |
|
| EA | EA 15 | ChimeraScan, FusionCatcher, JAFFA |
|
| AA | AA 24 | ChimeraScan, FusionCatcher, JAFFA |
|
| EA | EA 5, 6, 15 | ChimeraScan, FusionCatcher, JAFFA |
|
| EA | EA 1 | ChimeraScan, FusionCatcher, JAFFA |
|
| EA | EA 15 | STAR-Fusion, FusionCatcher, JAFFA |
Figure 5Chromosome Circos plot of novel fusion genes detected between fusion prediction tools. Red and blue lines indicate intra- and inter-chromosomal fusions. All chromosomes and cytoband information are denoted. The width of each link varies according to number of reads supporting the fusion event.
Figure 6(A) Overview of the NAIP:OCLN fusion gene found in patient AA 20. The plot displays the locations on chromosome 5 of the fusion partner genes. (B) Overview of fusion KANSL1:ARL17B for patient EA 5. The plot displays the locations on chromosome 17 of the fusion partner genes. The plots display the locations on chromosome 5 and 17, respectively, of the fusion partner genes, the number of discordant (split and spanning, respectively, in parentheses) reads supporting the breakpoint indicated by the red line between partner genes, transcript information showing the exons in each partner gene, and, lastly, the RNA expression of partner genes beside their genomic location in Mb.