| Literature DB >> 23369435 |
Fan Zhang1, Howard L Kaufman, Youping Deng, Renee Drabier.
Abstract
BACKGROUND: Breast cancer is worldwide the second most common type of cancer after lung cancer. Traditional mammography and Tissue Microarray has been studied for early cancer detection and cancer prediction. However, there is a need for more reliable diagnostic tools for early detection of breast cancer. This can be a challenge due to a number of factors and logistics. First, obtaining tissue biopsies can be difficult. Second, mammography may not detect small tumors, and is often unsatisfactory for younger women who typically have dense breast tissue. Lastly, breast cancer is not a single homogeneous disease but consists of multiple disease states, each arising from a distinct molecular mechanism and having a distinct clinical progression path which makes the disease difficult to detect and predict in early stages.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23369435 PMCID: PMC3552693 DOI: 10.1186/1755-8794-6-S1-S4
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
statistics of samples
| #health | #cancer | #total | |
|---|---|---|---|
| Training set | 32 | 33 | 65 |
| Testing set | 31 | 34 | 65 |
| Total | 63 | 67 | 130 |
performance comparison of SVM, SVM-RFE, and SVM-RFE-CV
| Measure | SVM | SVM-RFE | SVM-RFE-CV | |||
|---|---|---|---|---|---|---|
| Precision | 97.0% | 58.8% | 100% | 71.4% | 100% | 74.29% |
| Accuracy | 98.4% | 56.9% | 100% | 70.8% | 100% | 73.85% |
| Sensitivity | 100.0% | 58.8% | 100% | 73.5% | 100% | 76.47% |
| Specificity | 96.9% | 54.8% | 100% | 67.7% | 100% | 70.97% |
| AUC | 0.98 | 0.56 | 1.0 | 0.75 | 1.0 | 0.80 |
Figure 1Recursive feature elimination with automatic tuning of the number of features selected with cross-validation.
Figure 215 biomarkers predicting the healthy and breast cancer samples in testing set. X axis is the 15 biomarkers. Y-axis shows the 33 breast cancer and 32 healthy samples (H, healthy, blue; C, cancer, yellow).
gene expressions changes in 15-marker panel.
| ProbeID | Genesymbol | GeneID | direction | qvalue |
|---|---|---|---|---|
| 131318 | FAM135A | 57579 | up | 0.00775 |
| 134303 | NOD1 | 10392 | down | 0.00995 |
| 146885 | POMT2 | 29954 | down | 0.00459 |
| 154366 | LEFTY2 | 7044 | up | 0.00928 |
| 155372 | WISP1 | 8840 | down | 0.00197 |
| 162446 | FABP1 | 2168 | up | 0.00735 |
| 167465 | POLR3A | 11128 | down | 0.00754 |
| 167529 | ICA1 | 3382 | down | 0.00368 |
| 172360 | TMED8 | 283578 | up | 0.00582 |
| 189547 | C5orf20 | 140947 | down | 0.00941 |
| 206647 | ALG10 | 84920 | down | 0.00050 |
| 210406 | SLC33A1 | 9197 | down | 0.00483 |
| 211808 | PLCG1 | 5335 | down | 0.00317 |
| 222602 | LMOD3 | 56203 | up | 0.00508 |
| 230936 | FLJ44635 | 392490 | up | 0.00773 |
prediction result for the 15-marker panel
| Training set | Testing set | |||
|---|---|---|---|---|
| Predicted | Cancer | Normal | Cancer | Normal |
| Cancer | 33 | 0 | 26 | 9 |
| Normal | 0 | 32 | 8 | 22 |
| Precision | 100% | 74.29% | ||
| Accuracy | 100% | 73.85% | ||
| Sensitivity | 100% | 76.47% | ||
| Specificity | 100% | 70.97% | ||
Figure 3Our 15-marker panel compared to 4 best randomly selected 15-marker panels (solid lines, out of 42 candidates) and 4 worst randomly selected 15-marker panels (dotted lines, out of 42 candidates). The 15-marker panel was compared with the best four 15-marker panels (solid lines) and the worst four 15-marker panels (dotted lines) which were randomly selected out of the 42 candidates.