| Literature DB >> 26581577 |
Elena Landoni1, Rosalba Miceli2, Maurizio Callari3, Paola Tiberio3, Valentina Appierto3, Valentina Angeloni3, Luigi Mariani4, Maria Grazia Daidone3.
Abstract
BACKGROUND: Plasma miRNAs have the potential as cancer biomarkers but no consolidated guidelines for data mining in this field are available. The purpose of the study was to apply a supervised data analysis strategy in a context where prior knowledge is available, i.e., that of hemolysis-related miRNAs deregulation, so as to compare our results with existing evidence.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26581577 PMCID: PMC4650369 DOI: 10.1186/s12859-015-0820-9
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Results of the first step of class prediction performed in the training set raw data. a Bootstrap occurrences of the top 35 miRNAs included in the chosen model. b Egg-shaped plot. Node size and line thickness are proportional to the frequency of bootstrap occurrences and co-occurrences, respectively. A filter was applied to show only the features with at least 300 co-occurrences. c Bootstrap co-occurrences of the most interconnected miRNAs
Fig. 2Results of the second step of class prediction performed in the training set raw andratio data. ‘ROC space’ plot representing the classification performance of different models for class prediction in terms of false positive rate (FPR) and true positive rate (TPR) in the training set raw data (left panel) and ratio data (right panel). As true for the ROC curves, ideal models are those closest to the point (0,1), corresponding to 100 % sensitivity and specificity.
Model classification performance measures in the training and validation sets with raw and ratio data.
| Training set | Validation set | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Classication performance of the best performing groups of models | Parameters of the chosen model | Classication performance of the chosen model | ||||||||
| Group ID | N models | Sens | Spec | Youden index | N miR | SVM cost | SVM weights | Sens [CI] | Spec [CI] | Youden index [CI] |
| 1 | 16 | 0.85 | 0.96 | 0.81 | 35 | 10 | (0.5; 0.5) | 0.77 [0.54–0.92] | 0.77 [0.61–0.92] | 0.54 [0.23–0.81] |
| 2 | 5 | 0.77 | 1.00 | 0.77 | 35 | 1 | (0.5; 0.5) | 0.85 [0.61–1.00] | 0.81 [0.65–0.92] | 0.65 [0.38–0.85] |
| 3 | 2 | 0.77 | 0.96 | 0.73 | 30 | 1 | (0.5; 0.5) | 0.85 [0.61–1.00] | 0.85 [0.69–0.96] | 0.69 [0.42–0.88] |
| 4 | 16 | 0.85 | 0.88 | 0.73 | 40 | 10 | (0.5; 0.5) | 0.77 [0.54–0.92] | 0.73 [0.54–0.88 | 0.50 [0.19–0.77] |
| 5 | 5 | 0.77 | 0.92 | 0.69 | 35 | 1 | (0.4; 0.6) | 0.85 [0.61–1.00] | 0.73 [0.54–0.88] | 0.58 [0.31–0.81] |
| 6 | 16 | 0.85 | 0.85 | 0.69 | 50 | 10 | (0.5; 0.5) | 0.85 [0.61–1.00 | 0.69 [0.50–0.88 | 0.54 [0.27–0.81] |
| 7 | 19 | 0.77 | 0.88 | 0.65 | 40 | 1 | (0.4; 0.6) | 0.77 [0.54–0.92] | 0.69 [0.50–0.85] | 0.46 [0.15–0.73] |
| 8 | 5 | 0.69 | 0.92 | 0.61 | 3 | 100 | (0.4; 0.6) | 0.77 [0.54–0.92] | 0.96 [0.88–1.00] | 0.73 [0.46–0.92] |
| 9 | 20 | 0.77 | 0.85 | 0.61 | 5 | 10 | (0.4; 0.6) | 0.69 [0.46–0.92] | 0.92 [0.81–1.00] | 0.61 [0.35–0.85] |
|
| ||||||||||
| 1 | 1 | 0.92 | 0.81 | 0.73 | 500 (88) | 0.01 | (0.2; 0.8) | 0.92 [0.77–1.00] | 0.65 [0.46–0.85] | 0.58 [0.31–0.81] |
| 2 | 1 | 0.77 | 0.92 | 0.69 | 17 (16) | 0.01 | (0.3; 0.7) | 0.77 [0.54–0.92] | 0.92 [0.81–1.00] | 0.69 [0.42–0.92] |
| 3 | 1 | 0.69 | 1.00 | 0.69 | 90 (50) | 0.01 | (0.5; 0.5) | 0.69 [0.38–0.92] | 1.00 [1.00–1.00] | 0.69 [0.38–0.92] |
| 4 | 2 | 0.85 | 0.85 | 0.69 | 150 (66) | 0.01 | (0.2; 0.8) | 0.92 [0.77–1.00] | 0.69 [0.50–0.85] | 0.61 [0.38–0.81] |
| 5 | 35 | 0.69 | 0.96 | 0.65 | 4 (5) | 0.1 | (0.5; 0.5) | 0.77 [0.54–0.92] | 1.00 [1.00–1.00] | 0.77 [0.54–0.92] |
| 6 | 4 | 0.77 | 0.88 | 0.65 | 500 (88) | 0.01 | (0.4; 0.6) | 0.92 [0.77–1.00] | 0.77 [0.58–0.92] | 0.69 [0.46–0.88] |
| 7 | 3 | 0.85 | 0.81 | 0.65 | 600 (88) | 0.01 | (0.2; 0.8) | 0.92 [0.77–1.00] | 0.65 [0.46–0.85] | 0.58 [0.31–0.81] |
| 8 | 11 | 0.61 | 1.00 | 0.61 | 2 (3) | 0.1 | (0.5; 0.5) | 0.77 [0.54–0.92] | 1.00 [1.00–1.00] | 0.77 [0.54–0.92] |
| 9 | 23 | 0.69 | 0.92 | 0.61 | 3 (4) | 0.1 | (0.4; 0.6) | 0.77 [0.54–0.92] | 0.96 [0.88–1.00] | 0.73 [0.50–0.92] |
| 10 | 54 | 0.77 | 0.85 | 0.61 | 4 (5) | 0.1 | (0.3; 0.7) | 0.77 [0.54–0.92] | 0.88 [0.73–1.00] | 0.65 [0.38–0.88] |
| 11 | 18 | 0.61 | 0.96 | 0.58 | 3 (4) | 0.1 | (0.5; 0.5) | 0.77 [0.54–0.92] | 1.00 [1.00–1.00] | 0.77 [0.54–0.92] |
| 12 | 59 | 0.69 | 0.88 | 0.58 | 2 (3) | 10 | (0.4; 0.6) | 0.77 [0.54–0.92] | 1.00 [1.00–1.00] | 0.77 [0.54–0.92] |
In the last three columns, validation set classification performance measures are reported together with the corresponding bootstrap 95 % confidence intevals (CI)
Abbreviations: Group ID ID of the groups of best performing models (see also Fig. 2); N models number of models in each group, showing a specific classification performance, Sens sensitivity, Spec specificity, N miR number of miRNAs included in the model chosen in each group for containing the smallest number of miRNAs, SVM cost cost parameter of the linear SVM model, SVM weights weight parameter of the linear SVM model