| Literature DB >> 31835700 |
Osama Hamzeh1, Abedalrhman Alkhateeb1, Julia Zhuoran Zheng1, Srinath Kandalam2, Crystal Leung3, Govindaraja Atikukke4, Dora Cavallo-Medved2, Nallasivam Palanisamy5, Luis Rueda1.
Abstract
(1) Background:One of the most common cancers that affect North American men and men worldwide is prostate cancer. The Gleason score is a pathological grading system to examine the potential aggressiveness of the disease in the prostate tissue. Advancements in computing and next-generation sequencing technology now allow us to study the genomic profiles of patients in association with their different Gleason scores more accurately and effectively. (2)Entities:
Keywords: Gleason score detection; classification; next generation sequencing; prostate cancer; supervised learning; transcriptomics
Year: 2019 PMID: 31835700 PMCID: PMC6963340 DOI: 10.3390/diagnostics9040219
Source DB: PubMed Journal: Diagnostics (Basel) ISSN: 2075-4418
Gleason groups considered in this study.
| Gleason Group | Score |
|---|---|
| 1 | 6 |
| 2 | 3 + 4 = 7 |
| 3 | 4 + 3 = 7 |
| 4 | 8 |
| 5 | 9 and 10 |
Figure 1Gleason groups and their distributions.
Set of resulting transcripts in Gleason group 1.
| Transcript | Gene | Description |
|---|---|---|
| NM_003350 |
| ubiquitin conjugating enzyme E2 V2 ( |
| NM_153051 |
| myotubularin related protein 3 ( |
| NM_207445 |
| chromosome 15 open reading frame 54
( |
Set of resulting transcripts in Gleason group 2.
| Transcript | Gene | Description |
|---|---|---|
| NM_001170880 |
| G protein-coupled receptor 137 ( |
| NM_001198827 |
| chromosome 8 open reading frame 58
( |
| NM_004629 |
| Fanconi anemia complementation group G ( |
| NM_001098268 |
| DNA ligase 4 ( |
| NM_016641 |
| glycerophosphodiester phosphodiesterase 1
( |
| NM_002445 |
| macrophage scavenger receptor 1 ( |
| NM_001126337 |
| tuftelin 1 ( |
| NM_033071 |
| spectrin repeat containing nuclear envelope
protein 1( |
| NM_052906 |
| extracellular leucine rich repeat and
fibronectin typeIII domain containing 2 ( |
| NM_000714 |
| translocator protein ( |
| NM_004374 |
| cytochrome c oxidase subunit 6C ( |
| NM_001007544 |
| chromosome 1 open reading frame 186
( |
| NM_001276438 |
| potassium voltage-gated channel subfamily
J member 15 ( |
| NM_001252021 |
| torsin family 2 member A ( |
| NM_152612 |
| coiled-coil domain containing 116 ( |
Set of resulting transcripts in Gleason group 3.
| Transcript | Gene | Description |
|---|---|---|
| NM_001136224 |
| REST corepressor 3 ( |
| NM_001017967 |
| MARVEL domain containing 3 ( |
| NM_006099 |
| protein inhibitor of activated STAT 3 ( |
| NM_152395 |
| nudix hydrolase 16 ( |
| NM_006473 |
| TATA-box binding protein associated factor 6
like ( |
| NM_001145541 |
| t-complex 11 like 1 ( |
| NM_182501 |
| mitochondrial transcription termination factor
4 ( |
Set of resulting transcripts in Gleason group 4.
| Transcript | Gene | Description |
|---|---|---|
| NM_001258330 |
| erythrocyte membrane protein band 4.1
like 1 ( |
Figure 2Hierarchical tree of classifications of Gleason groups against the rest, along with the corresponding classification accuracies.
Classification performance for each step in the hierarchy.
| Gleason Group | Accuracy | Sensitivity | Specificity | F-Measure | MCC | ROC Area |
|---|---|---|---|---|---|---|
| 3 + 4 = 7 vs. Res | 94 | 95 | 94 | 0.94 | 0.88 | 95 |
| 4 + 3 = 7 vs. Rest | 98 | 100 | 96 | 0.98 | 0.96 | 99 |
| 6 vs. Rest | 100 | 100 | 100 | 1.00 | 1.00 | 100 |
| 8 vs. 9 | 100 | 100 | 100 | 1.00 | 1.00 | 100 |
Figure 3Accuracy obtained by each classifier for classifying one versus the rest for all five Gleason groups.
Figure 4Classification accuracies obtained after applying the model on the second dataset.
Figure 5An interactive figure taken from proteomics database STRING. It shows neighbouring protein binding and pathway interactions for a given gene using STRING and KEGG pathway analysis. Here, the gene of interest is PIAS3, an identified possible biomarker in the 4 + 3 = 7 score. The figure shows the interaction between other proteins and pathways associated with it.
Numbers of samples in different Gleason groups.
| Gleason Score | Number of Samples |
|---|---|
| 6 | 10 |
| 3 + 4 = 7 | 55 |
| 4 + 3 = 7 | 24 |
| 8 | 10 |
| 9 | 4 |
Figure 6Pre-processing steps of the proposed method.
Figure 7Hypothetical example that shows how the synthetic minority oversampling technique (SMOTE) works.
Figure 8Machine learning pipeline used in the proposed method.