| Literature DB >> 24718460 |
Yaping Xu1, Yue Deng1, Zhenhua Ji1, Haibin Liu1, Yueyang Liu1, Hu Peng1, Jian Wu1, Jingping Fan1.
Abstract
Thyroid cancer is a malignant neoplasm originated from thyroid cells. It can be classified into papillary carcinomas (PTCs) and anaplastic carcinomas (ATCs). Although ATCs are in an very aggressive status and cause more death than PTCs, their difference is poorly understood at molecular level. In this study, we focus on the transcriptome difference among PTCs, ATCs and normal tissue from a published dataset including 45 normal tissues, 49 PTCs and 11 ATCs, by applying a machine learning method, maximum relevance minimum redundancy, and identified 9 genes (BCL2, MRPS31, ID4, RASAL2, DLG2, MY01B, ZBTB5, PRKCQ and PPP6C) and 1 miscRNA (miscellaneous RNA, LOC646736) as important candidates involved in the progression of thyroid cancer. We further identified the protein-protein interaction (PPI) sub network from the shortest paths among the 9 genes in a PPI network constructed based on STRING database. Our results may provide insights to the molecular mechanism of the progression of thyroid cancer.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24718460 PMCID: PMC3981740 DOI: 10.1371/journal.pone.0094022
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1IFS curve of the classification of ATCs, PTCs and normal tissue samples.
The X-axis indicate the number of genes used for classification/prediction, and Y-axis is the prediction accuracies by NNA evaluated using leave-one-out (orange), 10 fold (green) and stratified 10 fold (blue) cross validation.
The 10 Genes selected using mRMR and IFS.
| Gene Name | Entrez Gene ID | mRMR score |
| BCL2 | 596 | 1.09662945 |
| MRPS31 | 10240 | 0.222372096 |
| ID4 | 3400 | 0.32164204 |
| RASAL2 | 9462 | 0.390513354 |
| DLG2 | 1740 | 0.334284222 |
| MY01B | 4430 | 0.354486787 |
| ZBTB5 | 9925 | 0.384452316 |
| LOC646736 | 0.339571667 | |
| PRKCQ | 5588 | 0.359410448 |
| PPP6C | 5537 | 0.340892868 |
Proteins selected on the shortest paths among the mRMR selected proteins.
| Ensembl Gene ID | Ensembl Protein ID | Associated Gene Name | betweenness |
| ENSG00000091831 | ENSP00000206249 | ESR1 | 5 |
| ENSG00000010610 | ENSP00000011653 | CD4 | 4 |
| ENSG00000150991 | ENSP00000344818 | UBC | 4 |
| ENSG00000143933 | ENSP00000272298 | CALM2 | 3 |
| ENSG00000132170 | ENSP00000287820 | PPARG | 3 |
| ENSG00000029363 | ENSP00000031135 | BCLAF1 | 2 |
| ENSG00000050820 | ENSP00000162330 | BCAR1 | 2 |
| ENSG00000100906 | ENSP00000216797 | NFKBIA | 2 |
| ENSG00000106588 | ENSP00000223321 | PSMA2 | 2 |
| ENSG00000112365 | ENSP00000230122 | ZBTB24 | 2 |
| ENSG00000115956 | ENSP00000234313 | PLEK | 2 |
| ENSG00000141510 | ENSP00000269305 | TP53 | 2 |
| ENSG00000204519 | ENSP00000282296 | ZNF551 | 2 |
| ENSG00000154342 | ENSP00000284523 | WNT3A | 2 |
| ENSG00000158092 | ENSP00000288986 | NCK1 | 2 |
| ENSG00000147044 | ENSP00000367408 | CASK | 2 |
| ENSG00000074071 | ENSP00000380531 | MRPS34 | 2 |
Figure 217 shortest paths genes among the 9 genes identified with mRMR methods.
We identified 17 genes located on the shortest paths of STRING PPI network among the 9 mRMR identified genes. Genes in blue are those identified with mRMR methods, and genes in red are located on their shortest paths. The network is constructed based on STRING PPI data.
KEGG pathway enrichment of the 25 genes selected on the shortest paths.
| Term | Gene Count |
| Fold Enrichment |
| T cell receptor signaling pathway | 4 | 0.002282004 | 13.45238095 |
| Neurotrophin signaling pathway | 4 | 0.003385035 | 11.71658986 |
| Pathways in cancer | 5 | 0.007626354 | 5.536803136 |
| Small cell lung cancer | 3 | 0.018690317 | 12.97193878 |
| Apoptosis | 3 | 0.019971101 | 12.52463054 |
| Prostate cancer | 3 | 0.02084525 | 12.24317817 |
| Thyroid cancer | 2 | 0.071736805 | 25.04926108 |