| Literature DB >> 29563929 |
Malihe Ram1, Ali Najafi2, Mohammad Taghi Shakeri1.
Abstract
BACKGROUND &Entities:
Keywords: Cancer; Classification; Gene Selection; Microarray; Random Forest
Year: 2017 PMID: 29563929 PMCID: PMC5844678
Source DB: PubMed Journal: Iran J Pathol ISSN: 1735-5303
Figure1A flowchart for random forest algorithm that represents a workflow for classification problems
Figure 2The process steps of feature selection and classification of microarray data that used in this study using Random Forest package
The Number of Genes and Samples Used in the Current Study for the Considered Cancers
| Class | Sample (+/–) | Gene | Data Set |
|---|---|---|---|
| No tumor/tumor | 111(56/55) | 22277 |
|
| Normal/tumor | 30(15/15) | 17881 |
|
| Normal/leukemia | 64(26/38) | 22283 |
|
Evaluation Criteria of Random Forest Classifier Model
| Kappa | Precision | Accuracy | Specificity | Sensitivity | Overall Error | |
|---|---|---|---|---|---|---|
|
| 85.45455 | 87.38739 | 85.45455 | 89.28571 | 12.61261 |
|
|
| 66.66667 | 73.33333 | 66.66667 | 80 | 26.66667 |
|
|
| 100 | 95.2381 | 100 | 88.46154 | 4.761905 |
|
The Smallest Set of Genes Selected From Colon Cancer Data by the Random Forest Method
| Function | Gene. Symbol | Probe ID |
|---|---|---|
| Regulates the p53 pathway to control the expansion growth of digestive organs | DIEXF |
|
| Guanyin, cyclase C binding protein 2A, increasing intracellular cGMP. Endogenous activator of intestinal guanylate cyclase. It stimulates this enzyme through the same receptor binding region as the heat-stable enterotoxins. | GUCA2A |
|
| Carbonic anhydrases are a large family of zinc metalloenzymes that catalyze the reversible hydration of carbon dioxide. They participate in a variety of biological processes, including respiration, calcification, acid-base balance, bone resorption, and the formation of aqueous humor, cerebrospinal fluid, saliva, and gastric acid. | CA7 |
|
| IGHA1 is a Protein Coding gene. Among its related pathways are Vesicle-mediated transport and Regulation of nuclear SMAD2/3 signaling. | IGHA1 |
|
The Smallest Set of Genes Selected From Prostate Cancer Data by the Random Forest Method
| Function | Gene. Symbol | Probe ID | |||
|---|---|---|---|---|---|
| Alpha-synuclein is a member of the synuclein family, which also includes beta- and gamma-synuclein. Among its related pathways are “transport to the Golgi” and subsequent modification and “EGFR1 signaling pathway”. | SNCA |
| |||
| This gene encodes an ubiquitin specific processing protease that was first identified as a substrate of the VHL protein E3 ubiquitin ligase complex. In addition to being ubiquitinated by the VHL-E3 ligase complex, this enzyme deubiquitinates HIF-1 alpha, and thereby, causes increased expression of HIF-1alpha-targeted genes, which play a role in angiogenesis, glucose metabolism, cell proliferation, and metastasis. | USP20 |
| |||
| SNRPA is a protein coding gene. Among its related pathways are gene expression and mRNA splicing-major pathway. GO annotations related to this gene include | SNRPA1 |
| |||
The Smallest Set of Genes Selected From Leukemia Cancer Data by the Random Forest Method
| Function | Gene. Symbol | Probe ID | |
|---|---|---|---|
| The ANKHD1-EIF4EBP3 mRNA is an infrequent, but naturally occurring read through transcript of the neighboring ANKHD1 and EIF4EBP3 genes. This read through transcript encodes a protein composed mostly of the multiple ankyrin repeats, single KH-domain protein, with its C-terminus encoded in a different reading frame from the shared portion of the EIF4EBP3 gene. | ANKHD1-EIF4EBP3 |
| |
| This gene encodes a member of the plexin family. Plexins are transmembrane receptors for semaphorins, a large family of proteins that regulate axon guidance, cell motility and migration, and the immune response. The encoded protein and its ligand regulate melanocyte adhesion, and viral semaphorins may modulate the immune response by binding to this receptor. The encoded protein may be a tumor suppressor for melanoma. | PLXNC1 |
| |
| The protein encoded by this gene is a member of the BAG1-related protein family. BAG1 is an anti-apoptotic protein that functions through interactions with a variety of cell apoptosis and growth related proteins including BCL-2, Raf-protein kinase, steroid hormone receptors, growth factor receptors, and members of the heat shock protein 70-kDa family. This protein was associated with the death domain of TNF-R1 and DR3, and thereby negatively regulates downstream cell death signaling. The regulatory role of this protein in cell death was demonstrated in epithelial cells that undergo apoptosis while integrin mediated matrix contacts are lost. | BAG4 |
| |
| This gene encodes a member of the protocadherin family, and cadherin superfamily of transmembrane proteins containing cadherin domains. These proteins mediate cell adhesion in neural tissue in the presence of calcium. The encoded protein may be involved in signaling at neuronal synaptic junctions. Sharing a characteristic with other protocadherin genes, this gene has a notably large exon that encodes multiple cadherin domains and a transmembrane region. | PCDH9 |
| |