| Literature DB >> 31737124 |
Guo-Liang Zhang1, Le-Lin Pan1, Tao Huang2, Jin-Hai Wang1.
Abstract
The previous cancer studies were difficult to reproduce since the tumor tissues were analyzed directly. But the tumor tissues were actually a mixture of different cancer cells. The transcriptome of single-cell was much robust than the transcriptome of a mixed tissue. The single-cell transcriptome had much smaller variance. In this study, we analyzed the single-cell transcriptome of 272 colorectal cancer (CRC) epithelial cells and 160 normal epithelial cells and identified 342 discriminative transcripts using advanced machine learning methods. The most discriminative transcripts were LGALS4, PHGR1, C15orf48, HEPACAM2, PERP, FABP1, FCGBP, MT1G, TSPAN1 and CKB. We further clustered the 342 transcripts into two categories. The upregulated transcripts in CRC epithelial cells were significantly enriched in Ribosome, Protein processing in endoplasmic reticulum, Antigen processing and presentation and p53 signaling pathway. The downregulated transcripts in CRC epithelial cells were significantly enriched in Mineral absorption, Aldosterone-regulated sodium reabsorption and Oxidative phosphorylation pathways. The biological analysis of the discriminative transcripts revealed the possible mechanism of colorectal cancer. © The author(s).Entities:
Keywords: colorectal cancer; incremental feature selection; minimal redundancy maximal relevance; single-cell sequencing; support vector machine; transcriptome
Year: 2019 PMID: 31737124 PMCID: PMC6843882 DOI: 10.7150/jca.32267
Source DB: PubMed Journal: J Cancer ISSN: 1837-9664 Impact factor: 4.207
The top 20 mRMR transcripts
| Order | Chromosome | Start Position | End Position | Transcript Name | Transcript ID | Score |
|---|---|---|---|---|---|---|
| 1 | chr19 | 39292310 | 39304004 | LGALS4 | ENSG00000171747.4 | 0.138 |
| 2 | chr15 | 40643233 | 40648635 | PHGR1 | ENSG00000233041.4 | 0.085 |
| 3 | chr15 | 45722726 | 45878488 | C15orf48 | ENSG00000166920.6 | 0.067 |
| 4 | chr7 | 92817898 | 92855837 | HEPACAM2 | ENSG00000188175.5 | 0.065 |
| 5 | chr16 | 56659386 | 56661024 | MT1E | ENSG00000169715.10 | 0.066 |
| 6 | chr6 | 138409641 | 138428648 | PERP | ENSG00000112378.11 | 0.061 |
| 7 | chr10 | 85933493 | 85945050 | C10orf99 | ENSG00000188373.4 | 0.063 |
| 8 | chr1 | 45249256 | 45253377 | BEST4 | ENSG00000142959.4 | 0.059 |
| 9 | chr13 | 27825445 | 27830828 | RPL21 | ENSG00000122026.6 | 0.054 |
| 10 | chr2 | 88422509 | 88427635 | FABP1 | ENSG00000163586.5 | 0.055 |
| 11 | chr12 | 39040623 | 39303394 | CPNE8 | ENSG00000139117.9 | 0.054 |
| 12 | chr12 | 56544579 | 56584068 | MYL6 | ENSG00000092841.14 | 0.053 |
| 13 | chr19 | 40353962 | 40440533 | FCGBP | ENSG00000090920.9 | 0.052 |
| 14 | chr20 | 1290618 | 1373806 | SDCBP2 | ENSG00000125775.10 | 0.053 |
| 15 | chr16 | 56700642 | 56701977 | MT1G | ENSG00000125144.9 | 0.053 |
| 16 | chr6 | 31795511 | 31798031 | HSPA1B | ENSG00000204388.5 | 0.052 |
| 17 | chr5 | 179041178 | 179061785 | HNRNPH1 | ENSG00000169045.13 | 0.051 |
| 18 | chr1 | 46505811 | 46651630 | TSPAN1 | ENSG00000117472.5 | 0.051 |
| 19 | chr16 | 56662970 | 56667898 | MT1M | ENSG00000205364.3 | 0.051 |
| 20 | chr14 | 103985995 | 103989448 | CKB | ENSG00000166165.8 | 0.051 |
The significantly enriched KEGG pathways of the up regulated transcripts in CRC epithelial cells
| KEGG pathway | FDR* | P value | Transcripts |
|---|---|---|---|
| hsa03010 Ribosome | 0.00136 | 4.32E-06 | RPS4Y1, RPS18, RPS27A, RPL7, RPL13, RPL18A, RPL21, RPL23, RPL38 |
| hsa04141 Protein processing in endoplasmic reticulum | 0.0456 | 0.000422 | HSPA5, HSPA1B, HSPA6, HSP90AA1, HSP90AB1, HSPH1, PPP1R15A |
| hsa04612 Antigen processing and presentation | 0.0456 | 0.000436 | HSPA1B, HSPA6, HSP90AA1, HSP90AB1, HSPA5 |
| hsa05215 Prostate cancer | 0.0981 | 0.00125 | ETV5, MDM2, HSP90AA1, HSP90AB1, GSTP1 |
| hsa04115 p53 signaling pathway | 0.153 | 0.00244 | ATR, MDM2, PERP, SESN3 |
*: FDR<0.2
The significantly enriched KEGG pathways of the down regulated transcripts in CRC epithelial cells
| KEGG pathway | FDR* | P value | Transcripts |
|---|---|---|---|
| hsa04978 Mineral absorption | 0.000195 | 6.21E-07 | SLC26A3, MT1E, MT1F, MT1G, MT1M, MT1X, MT2A |
| hsa04960 Aldosterone-regulated sodium reabsorption | 0.000236 | 1.50E-06 | HSD11B2, NR3C2, SCNN1A, SCNN1G, SGK1, NEDD4L |
| hsa00190 Oxidative phosphorylation | 0.0295 | 0.000347 | NDUFB1, COX6B1, COX7A2, COX7C, ATP5C1, ATP5G1, ATP5H |
| hsa04972 Pancreatic secretion | 0.029 | 0.000376 | PLA2G10, CLCA1, CLCA4, SLC26A3, SLC4A4, CA2 |
| hsa05012 Parkinson's disease | 0.0311 | 0.000516 | NDUFB1, COX6B1, COX7A2, COX7C, ATP5C1, ATP5G1, ATP5H |
| hsa00910 Nitrogen metabolism | 0.0311 | 0.000595 | CA1, CA7, CA2 |
*: FDR<0.05