| Literature DB >> 33804927 |
Camila Meirelles S Silva1, Mateus C Barros-Filho2,3, Deysi Viviana T Wong4,5, Julia Bette H Mello6, Livia Maria S Nobre1, Carlos Wagner S Wanderley1, Larisse T Lucetti1, Heitor A Muniz1, Igor Kenned D Paiva1, Hellen Kuasne2, Daniel Paula P Ferreira7, Maria Perpétuo S S Cunha5, Carlos G Hirth5, Paulo Goberlânio B Silva5, Rosane O Sant'Ana5,8, Marcellus Henrique L P Souza9, Josiane S Quetz5, Silvia R Rogatto10,11,12, Roberto César P Lima-Junior1.
Abstract
Colorectal cancer (CRC) is a disease with high incidence and mortality. Colonoscopy is a gold standard among tests used for CRC traceability. However, serious complications, such as colon perforation, may occur. Non-invasive diagnostic procedures are an unmet need. We aimed to identify a plasma microRNA (miRNA) signature for CRC detection. Plasma samples were obtained from subjects (n = 109) at different stages of colorectal carcinogenesis. The patients were stratified into a non-cancer (27 healthy volunteers, 17 patients with hyperplastic polyps, 24 with adenomas), and a cancer group (20 CRC and 21 metastatic CRC). miRNAs (381) were screened by TaqMan Low-Density Array. A classifier based on four differentially expressed miRNAs (miR-28-3p, let-7e-5p, miR-106a-5p, and miR-542-5p) was able to discriminate cancer versus non-cancer cases. The overexpression of these miRNAs was confirmed by RT-qPCR, and a cross-study validation step was implemented using eight data series retrieved from Gene Expression Omnibus (GEO). In addition, another external data validation using CRC surgical specimens from The Cancer Genome Atlas (TCGA) was carried out. The predictive model's performance in the validation set was 76.5% accuracy, 59.4% sensitivity, and 86.8% specificity (area under the curve, AUC = 0.716). The employment of our model in the independent publicly available datasets confirmed a good discrimination performance in five of eight datasets (median AUC = 0.823). Applying this algorithm to the TCGA cohort, we found 99.5% accuracy, 99.7% sensitivity, and 90.9% specificity (AUC = 0.998) when the model was applied to solid colorectal tissues. Overall, we suggest a novel signature of four circulating miRNAs, i.e., miR-28-3p, let-7e-5p, miR-106a-5p, and miR-542-5p, as a predictive tool for the detection of CRC.Entities:
Keywords: blood; colorectal cancer; diagnosis; microRNA
Year: 2021 PMID: 33804927 PMCID: PMC8037203 DOI: 10.3390/cancers13071493
Source DB: PubMed Journal: Cancers (Basel) ISSN: 2072-6694 Impact factor: 6.639
Clinical–demographic characteristics.
| Variables | Groups | ||||||
|---|---|---|---|---|---|---|---|
| Total | Non-Cancer | Cancer | |||||
|
| % |
| % |
| % | ||
| Gender | |||||||
| Female | 61 | 55.96% | 40 | 65.57% | 21 | 34.43% | |
| Male | 48 | 44.04% | 28 | 58.34% | 20 | 41.66% | |
| Age | |||||||
| Up to 60 years | 59 | 54.12% | 40 | 67.79% | 19 | 32.21% | |
| >60 years | 50 | 45.88% | 28 | 56% | 22 | 44% | |
| Smoking status | |||||||
| Never smoker | 72 | 66.06% | 49 | 68.05% | 23 | 31.94% | |
| Former smoker | 25 | 22.93% | 13 | 52% | 12 | 48% | |
| Current smoker | 12 | 11.01% | 6 | 50% | 6 | 50% | |
| Medication | |||||||
| No | 59 | 54.12% | 40 | 68% | 19 | 32% | |
| Yes | 50 | 45.88% | 28 | 56% | 22 | 44% | |
Figure 1Study workflow to identify a diagnostic miRNA signature from plasma samples of patients at different stages of CRC development. CRC, colorectal cancer.
Figure 2Hierarchical clustering analysis and plots representing plasma miRNAs. (A) Unsupervised hierarchical clustering analysis of 292 circulating miRNAs (RT-qPCR-TaqMan Low-Density Array (TLDA) assay). (B) Differential expression of nine miRNAs in the plasma of cancer versus non-cancer cases. The boxplot displays the first quartile, median, and third quartiles (interquartile range) and the minimum and maximum values excluding outliers of the log2-normalized relative quantification of the miRNAs in plasma (RT-qPCR-TLDA assay). (C) Supervised hierarchical clustering analysis comprising the nine differentially expressed miRNAs. The dendrogram demonstrates a stratification of samples into two clusters (black and gray) associated with the cancer status. The lines in the heatmaps represent individual miRNAs, and the columns represent each sample; * p < 0.05; ** p < 0.01 (t-test).
Figure 3Training and validation of the circulating miRNA-based diagnostic classifier. (A) Cancer prediction models including one to six miRNAs (selected by the recursive elimination method) previously detected at higher levels in the blood samples of CRC patients. Representative graphs of overall yield accuracy and LOOCV estimative. (B) Application of the four-miR-based classifier (miR-106a + let-7e + miR-28 + miR-542) in the screening phase (evaluated by the TLDA assay). (C) The four-miR-based classifier applied to a subset of cases of the discovery set (screening phase) using individual RT-qPCR assays. (D) Application of the four-miR-based classifier to a group of samples independent of the screening phase (validation set) using individual RT-qPCR assays. The dotted line indicates the threshold above which a malignant status would be predicted. SVM: support vector machine; LOOCV: leave-one-out cross-validation; AUC: area under the ROC curve; CI95%: 95% confidence interval.
Classification performance of the four-miR-based classifier used to distinguish colorectal cancer from non-cancer individuals.
| Metric | TLDA Assay | Single Assays | |
|---|---|---|---|
| Screening Phase | Discovery Set | Validation Set | |
| Estimate (CI95%) | Estimate (CI95%) | Estimate (CI95%) | |
| Sensitivity | 88.9 (50.7–99.4) | 57.1 (20.2–88.2) | 59.4 (40.8–75.8) |
| Specificity | 86.7 (58.4–97.7) | 80.0 (51.4–94.7) | 86.8 (74–94.1) |
| PPV | 80.0 (44.2–96.5) | 57.1 (20.2–88.2) | 73.1 (51.9–87.6) |
| NPV | 92.9 (64.2–99.6) | 80 (51.4–94.7) | 78.0 (64.9–87.3) |
| AUC | 0.867 (0.710–1.000) | 0.743 (0.501–0.985) | 0.716 (0.600–0.832) |
CI95%: 95% confidence interval; PPV = positive predictive value; NPV = negative predictive value.
Figure 4Performance of the four-miR classifier tested in the Gene Expression Omnibus (GEO) dataset. (A) Database searching, inclusion and exclusion criteria. (B) Among 16 studies found in the GEO datasets, 7 fulfilled the criteria of number of samples (≥20 samples of both CRC and controls), 5 used serum samples and validated our four-miR classifier model, and 3 datasets (exosome, serum, and plasma samples) showed no significant association.
Figure 5Performance of the four-miR classifier tested in TCGA colorectal primary tumors and adjacent non-cancer tissues. The classifier designed to be a liquid biopsy method also demonstrated high power in discriminating cancer and non-cancer colorectal tissues of the TCGA dataset. The dotted line indicates the threshold above which a malignant status would be predicted. TCGA: The Cancer Genome Atlas; COAD: colon cancer cohort from TCGA; READ: rectal cancer cohort from TCGA.
Figure 6Biological pathways enriched with the mRNAs predicted to be targets of miR-106a-5p, let-7e-5p, miR-28-3p, and miR-542-5p. The colorectal cancer pathway (red star) is among the most significant pathways for three out of four tested miRNAs (miR-106a-5p, let-7e-5p, and miR-28-3p). p-Value expressed as −log10.