| Literature DB >> 30719423 |
Deqiang Zheng1, Yuanjie Ding1, Qing Ma2, Lei Zhao3, Xudong Guo1, Yi Shen1, Yan He1, Wenqiang Wei2, Fen Liu1.
Abstract
Introduction: Circulating microRNAs (miRNAs) are promising molecular biomarkers for the early detection of esophageal squamous cell carcinoma (ESCC). We investigated the serum miRNA expression profiles from microarray-based technologies and evaluated the diagnostic value of serum miRNAs as potential biomarkers for ESCC by using feature selection algorithms.Entities:
Keywords: Lasso logistic regression; esophageal squamous cell carcinoma; multiple-testing criterion; penalized support vector machine; serum microRNA
Year: 2019 PMID: 30719423 PMCID: PMC6348251 DOI: 10.3389/fonc.2018.00674
Source DB: PubMed Journal: Front Oncol ISSN: 2234-943X Impact factor: 6.244
Characteristics of the study subjects.
| Sex, | 1.000 | ||
| Male | 24 (46.2) | 24 (46.2) | |
| Female | 28 (53.8) | 28 (53.8) | |
| Age (years) | 59.2 (4.1) | 57.8 (5.5) | 0.066 |
| Smoking, | 0.516 | ||
| No | 35 (67.3) | 39 (75.0) | |
| Yes | 17 (32.7) | 13 (25.0) | |
| Drinking, | 0.093 | ||
| No | 37 (71.2) | 45 (86.5) | |
| Yes | 15 (28.8) | 7 (13.5) | |
| Stage, | |||
| ESCC I-II | 23 (44.2) | - | - |
| ESCC III-IV | 29 (55.8) | - |
ESCC I-II, ESCC with stage I-II; ESCC III-IV, ESCC with stage III-IV.
Figure 1Flowchart of identifying predictive miRNAs by performing representative feature selection algorithms.
The seven selected miRNAs by controlling FDR and FWER.
| miR-451a | 6.74 ± 2.72 | 4.08 ± 2.27 | 1.29 × 10−6 | 0.001 | 0.001 |
| miR-16-5p | 3.77 ± 2.42 | 1.57 ± 1.63 | 6.33 × 10−6 | 0.003 | 0.005 |
| miR-486-5p | 4.06 ± 2.59 | 1.90 ± 1.84 | 4.15 × 10−5 | 0.009 | 0.035 |
| miR-574-5p | 4.23 ± 2.11 | 2.74 ± 1.69 | 4.15 × 10−5 | 0.009 | 0.035 |
| miR-92a-3p | 3.46 ± 2.21 | 1.51 ± 1.85 | 5.80 × 10−5 | 0.010 | 0.049 |
| miR-107 | 1.67 ± 2.97 | 0.03 ± 0.38 | 2.62 × 10−5 | 0.037 | 0.211 |
| miR-320c | 2.96 ± 2.44 | 1.20 ± 1.55 | 3.81 × 10−4 | 0.046 | 0.319 |
FDR, false discovery rate; FWER, family-wise error rate.
Figure 2The heat map of the seven candidate miRNAs identified by the FDR method. Pseudocolors indicated expression levels on a log-2 scale from −3 to 3 standard deviations independently by Z-score transformation. Green: negative, under-expressed; red: positive, over-expressed.
Figure 3Serum levels of the seven selected miRNAs in ESCC cases and healthy controls. The miRNAs expression levels were log2-transformed.
The selection frequencies (%) of candidate miRNAs.
| let-7b-5p | 95 | 92 | 67 | 73 | 58 | 69 |
| miR-107 | 99 | 96 | 94 | 90 | 91 | 81 |
| miR-16-5p | 100 | 99 | 100 | 96 | 100 | 96 |
| miR-191-3p | 99 | 96 | 96 | 90 | 91 | 86 |
| miR-451a | 100 | 100 | 100 | 100 | 100 | 100 |
| miR-574-5p | 99 | 98 | 98 | 90 | 93 | 88 |
Lasso LR, Lasso logistic regression; HHSVM, the hybrid Huberized support vector machine; SESVM, support vector machine using squared-error loss; BD, binomial deviance; ME, misclassification error; MBL, marginal based loss.
Figure 4The ROC curves for the three serum miRNA-based panel as a diagnostic biomarker by using three classifiers. (A) Logistic regression. (B) Linear SVM. (C) SVM with the Radial Basis Function kernel.
Diagnostic performance in discriminating ESCC from the healthy.
| Logistic regression | 0.812 (0.704, 0.905) | 0.807 (0.110) | 0.791 (0.117) | 0.799 (0.047) |
| Linear SVM | 0.816 (0.704, 0.907) | 0.814 (0.104) | 0.807 (0.105) | 0.811 (0.046) |
| Radial SVM | 0.824 (0.696, 0.927) | 0.857 (0.096) | 0.746 (0.110) | 0.802 (0.049) |
| Logistic regression | 0.821 (0.685, 0.926) | 0.822 (0.120) | 0.787 (0.134) | 0.804 (0.057) |
| Linear SVM | 0.832 (0.706, 0.921) | 0.849 (0.113) | 0.774 (0.134) | 0.811 (0.051) |
| Radial SVM | 0.836 (0.686, 0.929) | 0.836 (0.111) | 0.779 (0.136) | 0.808 (0.053) |
AUC, area under the curve; CI, confidence interval; SD, standard deviation; SVM, support vector machine; Radial SVM, SVM with the Radial Basis Function kernel.
Figure 5Quantification of serum miR-16-5p, miR-451a, and miR-574-5p in the healthy controls and two different ESCC groups. (A) The box plot comparing the expression level of serum miR-16-5p, miR-451a, and miR-574-5p in the healthy control with ESCC patients with stage I-II (ESCC I-II) and with stage III-IV (ESCC III-IV). (B) The ROC curve for the three serum miRNA-based panel in differentiating ESCC I-II from the healthy by logistic regression. (C) The ROC curve for the three serum miRNA-based panel in differentiating ESCC III-IV from the healthy by logistic regression.
Diagnostic performance in differentiating ESCC subgroups from the healthy for the three-miRNA-based classifiers.
| Logistic regression | 0.769 (0.563, 0.971) | 0.756 (0.165) | 0.839 (0.126) | 0.781 (0.107) |
| Linear SVM | 0.787 (0.613, 0.979) | 0.791 (0.140) | 0.828 (0.130) | 0.803 (0.091) |
| Radial SVM | 0.779 (0.617, 0.971) | 0.737 (0.171) | 0.821 (0.156) | 0.763 (0.096) |
| Logistic regression | 0.805 (0.633, 0.938) | 0.752 (0.153) | 0.845 (0.122) | 0.786 (0.078) |
| Linear SVM | 0.801 (0.617, 0.951) | 0.794 (0.134) | 0.839 (0.093) | 0.811 (0.076) |
| Radial SVM | 0.816 (0.683, 0.942) | 0.793 (0.119) | 0.815 (0.095) | 0.801 (0.066) |
AUC, area under the curve; CI, confidence interval; SD, standard deviation; SVM, support vector machine; Radial SVM, SVM with the Radial Basis Function kernel.