| Literature DB >> 35480120 |
Tian-An Guo1,2,3, Hong-Yan Lai1,2,3, Cong Li1,2,3, Yan Li1,2,3, Yu-Chen Li1,2,3, Yu-Tong Jin3,4, Zhao-Zhen Zhang4,5, Hao-Bo Huang6, Sheng-Lin Huang1,2,3, Ye Xu1,2,3.
Abstract
Background: Early detection of colorectal cancer (CRC) is crucial to the treatment and prognosis of patients. Traditional screening methods have disadvantages.Entities:
Keywords: colorectal adenoma; colorectal cancer; early detection; extracellular vesicle; long RNAs
Year: 2022 PMID: 35480120 PMCID: PMC9037372 DOI: 10.3389/fonc.2022.829230
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
Clinicopathological information of 194 participants.
| CRC (N = 72) | CRA (N = 42) | Healthy (N = 80) | |
|---|---|---|---|
| Age | 60.8 ± 10.9 | 56.2 ± 10.7 | 59.9 ± 13.0 |
| Gender | |||
| Male | 48 | 24 | 54 |
| Female | 24 | 18 | 26 |
| Tumor site | |||
| Right colon | 15 | 11 | NA |
| Left colon | 23 | 13 | NA |
| Rectum | 34 | 18 | NA |
| TNM stage | |||
| I | 22 | NA | NA |
| II | 31 | NA | NA |
| III | 3 | NA | NA |
| IV | 16 | NA | NA |
NA, not available.
Figure 1Plasma EVs and exLR-seq. (A) Photograph of EVs using a TEM. (B) Size distribution of EVs. (C) Western blot analysis of EV markers TSG101 and CD63 in PMBC and EVs. (D) Amount of exLRs for each sample among CRC, CRA, and healthy individuals. (E) Unsupervised hierarchical clustering of the exLRs differentially expressed between CRC and control (class I); CRC, healthy, and CRA (class II). (F) KEGG pathway enrichment analysis for differentially expressed exLRs.
Figure 2Establishment of the exLR d-signature. (A) Flowchart of establishment of the d-signature. (B, C) Unsupervised hierarchical clustering of the 17 genes in training cohort (B) and validation cohort (C). (D–F) ROC curve for the exLR d-signature in the training (D), validation (E), and independent (F) cohorts. aSelection of lncRNA or protein-coding genes with (1) expression frequency >0.5; (2) log2(FC) >0.59, adjusted p value < 0.05.
Basic information and expression of the 17 feature genes.
| Gene ID | Gene name | Gene type | Expression frequency | log2(FC) | Mean CRC | Mean control |
|---|---|---|---|---|---|---|
| ENSG00000272196.2 | HIST2H2AA4 | Protein coding | 0.77 | 1.92 | 21.79 | 5.17 |
| ENSG00000234289.5 | H2BFS | Protein coding | 0.67 | 1.65 | 13.91 | 5.45 |
| ENSG00000233954.6 | UQCRHL | Protein coding | 0.97 | 1.49 | 17.41 | 8.23 |
| ENSG00000143185.3 | XCL2 | Protein coding | 0.73 | 1.32 | 32.43 | 18.73 |
| ENSG00000229321.1 | AC008269.1 | lncRNA | 0.53 | 1.01 | 15.96 | 9.4 |
| ENSG00000100206.9 | DMC1 | Protein coding | 0.86 | 0.98 | 17.2 | 9.75 |
| ENSG00000233087.7 | RAB6D | Protein coding | 0.90 | 0.87 | 3.68 | 1.82 |
| ENSG00000185909.14 | KLHDC8B | Protein coding | 0.82 | 0.84 | 15.09 | 9.76 |
| ENSG00000164879.6 | CA3 | Protein coding | 0.63 | 0.82 | 7.54 | 4.44 |
| ENSG00000100336.17 | APOL4 | Protein coding | 0.81 | 0.76 | 7.69 | 4.45 |
| ENSG00000196747.4 | HIST1H2AI | Protein coding | 0.97 | 0.76 | 235.37 | 165.53 |
| ENSG00000151687.14 | ANKAR | Protein coding | 0.95 | 0.74 | 16.07 | 9.88 |
| ENSG00000198964.13 | SGMS1 | Protein coding | 0.99 | 0.71 | 76.67 | 45.6 |
| ENSG00000119004.15 | CYP20A1 | Protein coding | 0.93 | 0.68 | 19.07 | 13.9 |
| ENSG00000276410.3 | HIST1H2BB | Protein coding | 0.99 | 0.66 | 497.04 | 293.11 |
| ENSG00000104375.16 | STK3 | Protein coding | 0.99 | 0.63 | 19.03 | 12.62 |
| ENSG00000274559.3 | CBWD1 | Protein coding | 0.97 | 0.61 | 10.65 | 7.42 |
FC, fold change.
The exLR d-signature in diagnosis of CRC, CRA, and healthy participants.
| Groups | AUC | 95% CI | Sensitivity (%) | Specificity (%) | Accuracy (%) |
|---|---|---|---|---|---|
| Training | 0.938 | 0.897–0.978 | 77.08 | 91.46 | 86.15 |
| Validation | 0.943 | 0.867–0.995 | 91.67 | 85.00 | 87.50 |
| Independent cohort | 0.947 | 0.801–1.000 | 71.43 | 95.65 | 86.49 |
| CRC vs. CRA | 0.853 | 0.776–0.930 | 76.19 | 84.72 | 79.83 |
| CRC vs. healthy | 0.983 | 0.969–0.997 | 92.50 | 94.44 | 89.47 |
| Stage I/II CRC vs. CRA | 0.882 | 0.809–0.955 | 85.71 | 81.13 | 82.11 |
| Stage I/II CRC vs. healthy | 0.990 | 0.979–1.000 | 95.00 | 96.23 | 92.48 |
| CEA-negative CRC vs. CRA | 0.870 | 0.790–0.950 | 76.19 | 87.81 | 80.72 |
| CEA-negative CRC vs. healthy | 0.988 | 0.974–1.000 | 92.50 | 97.56 | 92.56 |
| CRA vs. healthy training | 0.993 | 0.981–1.000 | 89.29 | 98.15 | 95.12 |
| CRA vs. healthy validation | 0.978 | 0.940–1.000 | 71.43 | 96.15 | 87.50 |
AUC, area under the curve; CI, confidence interval; CEA, carcinoembryonic antigen.
Figure 3Prediction performance of the exLR d-signature in subgroups. (A) The d-signature in distinguishing healthy, CRA, and CRC individuals. (B) The d-signature in control and stage I–IV CRC participants. The ROC curve for the d-signature in CRC and CRA (C), CRC and healthy (D), early-stage (stage I–II) CRC and CRA (E), early-stage (stage I–II) CRC and healthy (F), CEA-negative CRC and CRA (G), and CEA-negative CRC and healthy (H) cohorts.
Figure 4Potential of the exLR d-signature in differentiating CRA and healthy participants. (A) Unsupervised hierarchical clustering of the differentially expressed exLRs between CRA and healthy cohorts. (B) KEGG pathway enrichment analysis for the differentially expressed exLRs between CRA and healthy cohorts. (C, D) Unsupervised hierarchical clustering of the 7 genes selected for d-signature establishment in the training cohort (C) and validation cohort (D). (E, F) ROC curve for the exLR d-signature in the training (D) and validation (E) cohorts.
Figure 5Analyses of cell components, survival, and signaling pathways. (A) Heatmap of unsupervised hierarchical clustering of the 21 cell types in different groups. (B) Box plots of selected cell-type abundance between CRC, CRA, and healthy groups. Prognostic significance of selected cell types by (C) OS and (D) DFS. (E) ssGSEA score and statistical significance for selected KEGG pathways differing between CRC, CRA, and healthy groups. ***p < 0.001; **p < 0.01; *p < 0.05; NS, not significant.