| Literature DB >> 19891795 |
Abstract
BACKGROUND: Digital mammography is one of the most promising options to diagnose breast cancer which is the most common cancer in women. However, its effectiveness is enfeebled due to the difficulty in distinguishing actual cancer lesions from benign abnormalities, which results in unnecessary biopsy referrals. To overcome this issue, computer aided diagnosis (CADx) using machine learning techniques have been studied worldwide. Since this is a classification problem and the number of features obtainable from a mammogram image is infinite, a feature selection method that is tailored for use in the CADx systems is needed.Entities:
Mesh:
Year: 2009 PMID: 19891795 PMCID: PMC2773916 DOI: 10.1186/1472-6947-9-S1-S1
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
Dataset Information
| MGH | 482 | 365 | 381 | 323 |
| WU | 154 | 115 | 41 | 98 |
| WFUSM | 163 | 255 | 188 | 159 |
| SHH | 324 | 380 | 207 | 140 |
| total | 1123 | 1115 | 817 | 720 |
MGH = Massachussetts General Hospital; WU = Washington University at Saint Louis; WFUSM = Wake Forest University School of Medicine; SHH = Sacred Heart Hospital
BI-RADS mammographic features
| mass shape | no mass(0), round(1), oval(2), lobulated(3), irregular(4) |
| mass margin | no mass(0), well circumscribed(1), microlobulated(2), obscured(3), ill-defined(4), spiculated(5) |
| calcification type | no calc.(0), milk of calcium-like(1), eggshell(2), skin(3), vascular(4), spherical(5), suture(6), coarse(7), large rod-like(8), round(9), dystrophic(10), punctate(11), indistinct(12), pleomorphic(13), fine branching(14) |
| calcification distribution | no calc.(0), diffuse(1), regional(2), segmental(3), linear(4), clustered(5) |
| density | 1, 2, 3, 4 |
| assessment | 1, 2, 3, 4, 5 |
density: 1 = sparser, 4 = denser;
Comparison of kernels in terms of maximum Az value of mass dataset
| linear | 0.90391 | 0.94571 | 0.92159 | 0.85718 | 0.87159 | 0.97150 | ||
| RBF | 0.88597 | 0.95716 | ||||||
| 10 | 5 | 10 | 10 | 10 | 10 | 10 | 10 | |
| 0.25 | 0.06 | 0.5 | 0.075 | 0.15 | 0.1 | 0.5 | 0.05 | |
Same tradeoff parameter value C is used for both linear and RBF kernels.
Comparison of kernels in terms of maximum Az value of calcification dataset
| linear | 0.72686 | 0.72625 | 0.89981 | 0.74046 | 0.77509 | 0.89603 | 0.92705 | |
| RBF | 0.88155 | |||||||
| 1 | 10 | 1 | 5 | 10 | 20 | 10 | 10 | |
| 1.5 | 0.1 | 1 | 0.05 | 0.4 | 0.05 | 0.15 | 0.05 | |
Same tradeoff parameter value C is used for both linear and RBF kernels.
Comparison of methods by maximum Az value using 8 features (Mass)
| SVM | 0.95821 | 0.97247 | 0.92252 | 0.97401 | |
| SVM-RFE | 0.97734 | 0.92252 | 0.97401 | ||
| ENSEMBLE | 0.72102 | 0.74859 | 0.67307 | 0.94292 | |
| JOIN (1) | 0.77944 | 0.88187 | 0.79655 | 0.92650 | |
| JOIN (2) | 0.72102 | 0.77365 | 0.79200 | 0.90262 | |
| JOIN (3) | 0.72102 | 0.75484 | 0.79200 | 0.86857 | |
| JOIN (4) | 0.72102 | 0.75484 | 0.75765 | 0.86861 | |
| JOIN (5) | 0.72102 | 0.71136 | 0.67307 | 0.73745 | |
| MSVM-RFE (bootstrap) | 5 | 0.95821 | 0.97247 | 0.92423 | 0.97401 |
| 10 | 0.95821 | 0.92288 | |||
| 15 | 0.95947 | 0.97457 | 0.92288 | 0.97401 | |
| 20 | 0.95947 | 0.97705 | 0.92315 | 0.97401 | |
| MSVM-RFE (boost) | 5 | 0.95821 | 0.97247 | 0.92314 | 0.97401 |
| 10 | 0.95821 | 0.97616 | 0.97401 | ||
| 15 | 0.95947 | 0.97247 | 0.92314 | 0.97401 | |
| 20 | 0.95947 | 0.97387 | 0.92314 | 0.97401 |
Numbers in parenthesis stands for cutoff value for JOIN method.
Comparison of methods by maximum Az value using 22 features (Calcification)
| SVM | 0.77497 | 0.91710 | 0.89738 | 0.94945 | |
| SVM-RFE | 0.77497 | 0.89859 | 0.95332 | ||
| ENSEMBLE | 0.68951 | 0.76647 | 0.72650 | 0.85677 | |
| JOIN (1) | 0.75259 | 0.92326 | 0.81433 | 0.91352 | |
| JOIN (2) | 0.72296 | 0.82307 | 0.72987 | 0.80400 | |
| JOIN (3) | 0.70815 | 0.76647 | 0.70059 | 0.67598 | |
| JOIN (4) | 0.58656 | 0.69779 | 0.65667 | 0.55964 | |
| JOIN (5) | 0.53520 | 0.63858 | 0.65667 | 0.51203 | |
| MSVM-RFE (bootstrap) | 5 | 0.77497 | 0.91710 | 0.89988 | |
| 10 | 0.91710 | 0.89786 | 0.95330 | ||
| 15 | 0.77497 | 0.92193 | 0.89738 | 0.95250 | |
| 20 | 0.77497 | 0.93305 | 0.95267 | ||
| MSVM-RFE (boost) | 5 | 0.77727 | 0.92097 | 0.89848 | 0.94945 |
| 10 | 0.77497 | 0.93063 | 0.90108 | 0.95292 | |
| 15 | 0.77497 | 0.92352 | 0.90133 | 0.95136 | |
| 20 | 0.77497 | 0.92105 | 0.89957 | 0.95256 |
Numbers in parenthesis stands for cutoff value for JOIN method.
Comparison of methods by maximum Az value using 8 features (Calcification)
| SVM | 0.91182 | 0.98765 | 0.94690 | 0.96595 | |
| SVM-RFE | 0.95100 | 0.96595 | |||
| ENSEMBLE | 0.53915 | 0.69512 | 0.56583 | 0.91392 | |
| JOIN (1) | 0.67508 | 0.71655 | 0.83947 | 0.93422 | |
| JOIN (2) | 0.57971 | 0.72941 | 0.76157 | 0.88733 | |
| JOIN (3) | 0.57971 | 0.72941 | 0.62686 | 0.73542 | |
| JOIN (4) | 0.54571 | 0.69512 | 0.62686 | 0.72464 | |
| JOIN (5) | 0.54571 | 0.69512 | 0.54077 | 0.66210 | |
| MSVM-RFE (bootstrap) | 5 | 0.91182 | 0.98765 | ||
| 10 | 0.91182 | 0.98765 | 0.95168 | 0.96595 | |
| 15 | 0.91182 | 0.98765 | 0.94690 | 0.96757 | |
| 20 | 0.91182 | 0.98765 | 0.94690 | 0.97348 | |
| MSVM-RFE (boost) | 5 | 0.91182 | 0.98765 | 0.94690 | 0.96595 |
| 10 | 0.91182 | 0.99259 | 0.94690 | 0.96595 | |
| 15 | 0.91182 | 0.99429 | 0.94690 | 0.96595 | |
| 20 | 0.91182 | 0.98765 | 0.94690 | 0.96595 |
Numbers in parenthesis stands for cutoff value for JOIN method.
Comparison of methods by maximum Az value using 22 features (Mass)
| SVM | 0.88805 | 0.93642 | 0.92474 | 0.94998 | |
| SVM-RFE | 0.88849 | 0.94173 | 0.93037 | 0.94998 | |
| ENSEMBLE | 0.81490 | 0.90299 | 0.80317 | 0.86155 | |
| JOIN (1) | 0.86728 | 0.92278 | 0.87638 | 0.90789 | |
| JOIN (2) | 0.83034 | 0.93886 | 0.89597 | 0.85132 | |
| JOIN (3) | 0.75098 | 0.87312 | 0.82694 | 0.83834 | |
| JOIN (4) | 0.74270 | 0.74262 | 0.66948 | 0.83834 | |
| JOIN (5) | 0.68776 | 0.71316 | 0.66948 | 0.80802 | |
| MSVM-RFE (bootstrap) | 5 | 0.89720 | 0.93729 | 0.92664 | 0.95087 |
| 10 | 0.88833 | 0.93666 | 0.92972 | 0.95016 | |
| 15 | 0.93746 | 0.93000 | 0.95076 | ||
| 20 | 0.89014 | 0.94290 | 0.92986 | 0.95066 | |
| MSVM-RFE (boost) | 5 | 0.88993 | 0.93987 | 0.94998 | |
| 10 | 0.88805 | 0.92812 | 0.94998 | ||
| 15 | 0.89092 | 0.94204 | 0.92789 | 0.94998 | |
| 20 | 0.88805 | 0.94197 | 0.92758 |
Numbers in parenthesis stands for cutoff value for JOIN method.