| Literature DB >> 29607263 |
Xiaomeng Cui1,2, Zhangming Li3, Yilei Zhao4, Anqi Song5, Yunbo Shi1,2, Xin Hai4, Wenliang Zhu6.
Abstract
Prolonged life expectancy in humans has been accompanied by an increase in the prevalence of cancers. Breast cancer (BC) is the leading cause of cancer-related deaths. It accounts for one-fourth of all diagnosed cancers and affects one in eight females worldwide. Given the high BC prevalence, there is a practical need for demographic screening of the disease. In the present study, we re-analyzed a large microRNA (miRNA) expression dataset (GSE73002), with the goal of optimizing miRNA biomarker selection using neural network cascade (NNC) modeling. Our results identified numerous candidate miRNA biomarkers that are technically suitable for BC detection. We combined three miRNAs (miR-1246, miR-6756-5p, and miR-8073) into a single panel to generate an NNC model, which successfully detected BC with 97.1% accuracy in an independent validation cohort comprising 429 BC patients and 895 healthy controls. In contrast, at least seven miRNAs were merged in a multiple linear regression model to obtain equivalent diagnostic performance (96.4% accuracy in the independent validation set). Our findings suggested that suitable modeling can effectively reduce the number of miRNAs required in a biomarker panel without compromising prediction accuracy, thereby increasing the technical possibility of early detection of BC.Entities:
Keywords: Breast cancer; Diagnostic biomarker; Neural network cascade; microRNA
Year: 2018 PMID: 29607263 PMCID: PMC5875392 DOI: 10.7717/peerj.4551
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1AUC distribution and collinearity of miRNA expression.
(A) Frequency distribution of AUCs. (B) Collinearity network of the 82 miRNAs with AUC ≥0.95. An edge represents collinear expression between the two miRNAs (ρ2 > 0.5).
ROC curve analysis of individual miRNAs (training set).
| miRNA ID | AUC | Sensitivity (%) | Specificity (%) | ACC (%) |
|---|---|---|---|---|
| miR-197-5p | 0.961 | 90.7 | 95.9 | 94.2 |
| miR-1238-5p | 0.964 | 90.3 | 97.1 | 94.9 |
| miR-1246 | 0.967 | 89.8 | 91.7 | 91.1 |
| miR-3156-5p | 0.976 | 89.8 | 96.1 | 94.0 |
| miR-4532 | 0.968 | 89.8 | 98.7 | 95.8 |
| miR-6748-5p | 0.962 | 90.2 | 90.3 | 90.3 |
| miR-6756-5p | 0.975 | 92.7 | 97.2 | 95.7 |
| miR-8073 | 0.991 | 95.7 | 97.6 | 97.0 |
Figure 2Establishment of the NNC model.
(A) Illustration of the NNC model. L1–L3: Layers 1–3 of the NNC model; R1246: miR-1246; R6756-5p: miR-6756-5p; R8073: miR-8073; AUC values are shown above the layers. (B) ROC curve diagrams of the NNC and MLR models (training set).
Comparison between NNC and MLR models (training set).
| Model | AUC | Sensitivity (%) | Specificity (%) | ACC (%) |
|---|---|---|---|---|
| Layer 1 of NNC | 0.991 | 95.7 | 97.6 | 96.9 |
| Layer 2 of NNC | 0.995 | 95.8 | 98.5 | 97.6 |
| Layer 3 of NNC | 0.996 | 97.3 | 99.1 | 98.5 |
| MLR | 0.996 | 96.5 | 97.9 | 97.4 |
Figure 3Model validation.
(A) ROC curve diagram of the tenfold cross-validation of the NNC model. (B) ROC curve diagram of the NNC and MLR models (validation set). 10FCV: Tenfold cross-validation. (C) Accuracy evaluation of miR-8073, MLR, and NNC BC detection (validation set). R8073: miR-8073. ACC: accuracy rate; Se: sensitivity; Sp: specificity.