| Literature DB >> 36059706 |
Ouxi Wang1,2, Di Shi1,3,4, Yaqi Li3,5, Xiaoyan Zhou1,3,4, Haidan Yan2, Qianlan Yao1,3,4.
Abstract
Background: Early diagnosis of colorectal cancer could significantly improve the prognosis and reduce mortality. However, indeterminate diagnosis is often met in pathology diagnosis in biopsy samples. Abnormal expression of long non-coding RNA (lncRNA) is associated with the initiation and progression of colorectal cancer. It is of great value and clinical significance to explore lncRNAs as candidate diagnostic biomarkers in colorectal cancer.Entities:
Keywords: biomarker; colorectal cancer; diagnostic; long non-coding RNA; relative expression analysis
Year: 2022 PMID: 36059706 PMCID: PMC9428707 DOI: 10.3389/fonc.2022.912882
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 5.738
The training datasets used in this study.
| Dataset | Platform | Normal | Adenoma | Cancer | Cancer adjacent |
|---|---|---|---|---|---|
|
| RNA-seq GPL11154 | – | 20 | – | – |
|
| RNA-seq GPL16791 | – | – | 10 | 10 |
|
| RNA-seq GPL11154 | – | – | 6 | 6 |
|
| RNA-seq GPL11154 | – | – | 17 | – |
|
| RNA-seq GPL17303 | – | – | 3 | 3 |
|
| RNA-seq GPL18573 | – | – | 6 | 3 |
|
| RNA-seq GPL11154 | – | – | 18 | 18 |
|
| RNA-seq | 155 | – | – | – |
|
| MicroArray GPL570 | – | 32 | – | |
|
| MicroArray GPL25373 | – | – | 40 | – |
|
| 155 | 52 | 100 | 40 | |
The performance of the lncRNA pair biomarkers in the validation dataset. .
| Platform | Dataset | Normal | Adenoma | Cancer | Cancer-adjacent | Specificity | Sensitivity |
|---|---|---|---|---|---|---|---|
|
| GSE76987 | – | 62(0.67) | 4(1) | – | 0.67 | 1 |
| GSE83687 | – | – | – | 61(0.77) | – | 0.77 | |
| GTEX | 153(1) | – | – | – | 1 | – | |
| GSE107422 | – | – | 110(0.97) | – | – | 0.97 | |
| TCGA | – | – | 471(0.87) | 41(0.80) | 1 | – | |
|
| 153(1) | 62(0.67) | 585(0.89) | 102(0.78) | 0.91 | 0.87 | |
|
| GSE8671 | – | 32(0.84) | – | – | – | 0.84 |
| GSE117606 | – | – | 34(1) | 134(1) | – | 1 | |
| GSE35144 | – | – | 27(0.92) | – | – | – | |
| GSE22598 | – | – | 17(0.88) | 17(0.88) | – | – | |
| GSE32323 | – | – | 17(0.88) | 17(0.88) | – | – | |
|
| – | 32(0.84) | 95(0.94) | 168(0.97) | 0.84 | 0.96 | |
|
| 153(1) | 94(0.72) | 680(0.90) | 270(0.90) | 0.89 | 0.9 |
The performance of the lncRNA pair biomarkers in independent inflammatory bowel disease and normal validation datasets. Accuracy is marked in parentheses.
| Dataset | Platform | Normal | IBD | Specificity |
|---|---|---|---|---|
| GSE47908 | MicroArray GPL570 | 15(0.93) | 39(0.95) | 0.94 |
| GSE16879 | MicroArray GPL570 | – | 61(0.97) | 0.97 |
| GSE14580 | MicroArray GPL570 | – | 24(1) | 1 |
| Total | 15(0.93) | 124(0.97) | 0.96 |
Figure 1The workflow of identifying candidate diagnostic biomarker of colorectal cancer (CRC).
Figure 2The geometric mean of the sensitivity and specificity of the top long non-coding RNA (lncRNA) pairs in the training data.
Six candidate diagnostic long non-coding RNA (lncRNA) pairs for colorectal cancer (CRC). In each lncRNA pair, gene B always has a higher expression level than gene A in CRC tissue samples compared with non-CRC tissue samples. If at least three of the six lncRNA pairs showed that the expression level of gene A is less than gene B, the tested sample was predicted as cancer.
| lncRNA Pair | Gene A Symbol (Ensembl ID) | Gene B Symbol (Ensembl ID) |
|---|---|---|
| 1 |
|
|
| 2 |
|
|
| 3 |
|
|
| 4 |
|
|
| 5 |
|
|
| 6 |
|
|
Performance of the lncRNA pair biomarkers in different stages, the microsatellite instability status, and the site of CRC in the TCGA dataset.
| Status | Number of Samples | Accuracy | |
|---|---|---|---|
| Stage | I | 78 | 0.87 |
| II | 183 | 0.82 | |
| III | 132 | 0.93 | |
| IV | 65 | 0.89 | |
| MSI Status | MSI | 47 | 0.82 |
| MSS | 63 | 0.95 | |
| Primary site | Left sided | 177 | 0.913 |
| Right sided | 198 | 0.853 |
The characteristics of the CRC tissue samples.
| SampleID | Gender | Age | TumorSize(cm) | TNM | Stage | TissueType |
|---|---|---|---|---|---|---|
| 1 | Male | 60 | 5.5*4.5*1.8 | T2N0M0 | IIA | Tumor and Tumor-adjacent |
| 2 | Female | 59 | 5*4.5*3 | T4AN0M0 | IIB | Tumor and Tumor-adjacent |
| 3 | Male | 60 | 4*7*1.2 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 4 | Male | 54 | 7.5*4.5*1 | T4AN0M0 | IIB | Tumor and Tumor-adjacent |
| 5 | Female | 44 | 3*2.5*1.7 | T4N0M0 | IIB | Tumor and Tumor-adjacent |
| 6 | Male | 63 | 2.5*2*0.3 | T2N0M0 | IIA | Tumor and Tumor-adjacent |
| 7 | Female | 56 | 3*2*0.8 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 8 | Male | 63 | 5.5*9.5*2 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 9 | Male | 54 | 1.8*2*0.8 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 10 | Male | 67 | 2.5*2*0.8 | T3N0MX | IIA | Tumor and Tumor-adjacent |
| 11 | Male | 64 | 3.5*3*0.8 | T2N0MX | IIA | Tumor and Tumor-adjacent |
| 12 | Female | 35 | 4*3.8*0.7 | T3N0MX | IIA | Tumor and Tumor-adjacent |
| 13 | Male | 55 | 3*2.5*1.8 | T3AN0M0 | IIA | Tumor and Tumor-adjacent |
| 14 | Female | 51 | 4*3*2.5 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 15 | Male | 57 | 3.8*3.5*2 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 16 | Male | 60 | 4*3.8*1 | T3N0MX | IIA | Tumor and Tumor-adjacent |
| 17 | Female | 78 | 6.5*6*4 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 18 | Female | 48 | 3*3*1 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 19 | Female | 62 | 3.7*4*0.8 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 20 | Female | 51 | 8*8*2.5 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 21 | Male | 81 | 5.5*4*1 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 22 | Female | 57 | 3.5*3*0.5 | T3N0MX | IIA | Tumor and Tumor-adjacent |
| 23 | Female | 55 | 1.5*1.8*0.8 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 24 | Female | 69 | 3.5*3.5*2 | T2N0M0 | IIA | Tumor and Tumor-adjacent |
| 25 | Male | 57 | 3.6*4.5*0.5 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 26 | Female | 67 | 2.5*2*1 | T4N0M0 | IIB | Tumor and Tumor-adjacent |
| 27 | Male | 77 | 3.5*3*0.8 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 28 | Male | 75 | 5*4.5*2 | T2N0MX | IIA | Tumor and Tumor-adjacent |
| 29 | Female | 72 | 4*4*1.5 | T3N0MX | IIA | Tumor and Tumor-adjacent |
| 30 | Male | 75 | 2.5*1.5*1 | T3N0M0 | IIA | Tumor and Tumor-adjacent |
| 31 | Male | 62 | 3.5*2.8*1 | T4N0M0 | II | Tumor |
| 32 | Male | 69 | 7*6 | T4NxM1 | IV | Tumor |
| 33 | Male | 73 | 4*3*1 | T4N2aM0 | IIIB | Tumor |
| 34 | Male | 80 | 5*4*2 | T2N0M0 | I | Tumor |
| 35 | Female | 75 | 2.5*2*1.5 | T4N0M0 | IIB | Tumor |
Figure 3(A) The pathway enrichment analysis of the lncRNA biomarkers. Left is the result of Gene Ontology (GO); right is the result of KEGG. (B) High expression of lncRNA TOB1-AS1 was associated with a worse prognosis outcome in CRC. (C) Association between the immune cell infiltration and the lncRNA biomarkers. *, **, and *** represent the P-values of 0.05, 0.01, and 0.001, respectively. The size of the circle represents the correlation coefficient. (D) The correlation of expression of lncRNA markers and driver genes in CRC. *, **, and *** represent the P-values of 0.05, 0.01, and 0.001, respectively. The size of the circle represents the size of the absolute value of log10 (P-value).