| Literature DB >> 25112586 |
Mehmet U S Ayvaci, Oguzhan Alagoz, Jagpreet Chhatwal, Alejandro Munoz del Rio, Edward A Sickles, Houssam Nassif, Karla Kerlikowske, Elizabeth S Burnside1.
Abstract
BACKGROUND: Increasing focus on potentially unnecessary diagnosis and treatment of certain breast cancers prompted our investigation of whether clinical and mammographic features predictive of invasive breast cancer versus ductal carcinoma in situ (DCIS) differ by age.Entities:
Mesh:
Year: 2014 PMID: 25112586 PMCID: PMC4138370 DOI: 10.1186/1471-2407-14-584
Source DB: PubMed Journal: BMC Cancer ISSN: 1471-2407 Impact factor: 4.430
Figure 1Patient population derived from consecutive image guided biopsies revealing cancer.
List of structured and extracted variables*
| Structured | Variables extracted using NLP |
|---|---|
| • Age | • Calcification distribution |
| • Family history (of breast cancer)† | • Calcification morphology¥ |
| • Personal history (of breast cancer) | • Mass margins |
| • Prior surgery‡ | • Mass shape |
| • Palpable lump | • Architectural distortion |
| • Breast density | • Focal asymmetric density |
| • BI-RADS assessment | |
| • Indication for exam if diagnostic | |
| • Principal mammography findingΨ | |
| • Mass size |
*These variables were used as input to the stepwise regression to produce the models for older and younger women.
†Defined as family history of breast cancer (Minor = one or more relatives more distant than first-degree relatives, Strong = one first-degree relative with unilateral postmenopausal breast cancer, Very Strong = more than one first-degree relative with unilateral postmenopausal breast cancer, one first-degree relative with bilateral breast cancer, or one first-degree with premenopausal breast cancer).
‡Defined as prior breast surgery of any kind.
ΨPrincipal mammographic finding: architectural distortion, calcifications, asymmetry (one view), focal asymmetry (two views), developing asymmetry, mass, single dilated duct, both calcifications and something else.
¥To overcome low frequency categories, features are grouped into high probability malignancy, intermediate and typically benign categories, as described in the Breast Imaging and Reporting Data System (BI-RADS) lexicon [18].
Proportion of DCIS in each age group
| Biopsies revealing DCIS | Biopsies revealing invasive carcinoma | Total biopsies | Total patients | DCIS percentage (%) and the 95% confidence interval | |
|---|---|---|---|---|---|
|
| 110 | 264 | 374 | 353 | 29.4 (25.0,34.2) |
|
| 170 | 398 | 568 | 538 | 29.9 (26.3, 33.8) |
|
| 132 | 401 | 533 | 493 | 24.8 (21.3,28.6) |
|
| 412 | 1063 | 1475 | 1384 | 27.9 (25.7,30.3) |
Multivariable model for older group using stepwise regression with AIC criterion*
| Risk factor | Beta | Odds ratio | 95% CI (Lower -Upper) | p value | |||||
|---|---|---|---|---|---|---|---|---|---|
| (Intercept) | −1.16 | 0.31 | 0.18 | - | 0.55 | 0.000 | *** | ||
|
|
| ** | |||||||
| No corresponding palpable mass | 0.00 | 1(referent) | |||||||
| Missing | −0.30 | 0.74 | 0.05 | - | 10.55 | 0.824 | |||
| Corresponding palpable mass | 0.80 | 2.22 | 1.12 | - | 4.41 | 0.022 | ** | ||
|
|
| ** | |||||||
| None | 0.00 | 1(referent) | |||||||
| Missing | −0.89 | 0.41 | 0.13 | - | 1.32 | 0.135 | |||
| Strong | −0.32 | 0.73 | 0.33 | - | 1.59 | 0.422 | |||
| Very strong | 1.66 | 5.24 | 0.84 | - | 32.78 | 0.076 | * | ||
|
|
| ||||||||
| Not present | 0.00 | 1(referent) | |||||||
| Missing | −0.36 | 0.70 | 0.07 | - | 6.82 | 0.759 | |||
| Present | 0.57 | 1.78 | 0.99 | - | 3.17 | 0.053 | * | ||
|
|
| *** | |||||||
| Calcifications or Single dilated duct | 0.00 | 1(referent) | |||||||
| Architectural distortion | 20.56 | Inf | 0.00 | - | Inf | 0.993 | |||
| Associated calcifications | 2.16 | 8.67 | 3.39 | - | 22.14 | 0.000 | *** | ||
| Missing | 2.10 | 8.14 | 3.88 | - | 17.09 | 0.000 | *** | ||
| Asymmetry or Focal asymmetry | 2.94 | 18.87 | 3.79 | - | 93.87 | 0.000 | *** | ||
| Mass | 3.04 | 20.93 | 9.20 | - | 47.65 | 0.000 | *** | ||
| Developing asymmetry | 2.80 | 16.45 | 1.78 | - | 151.95 | 0.014 | ** | ||
|
|
| ** | |||||||
| Not present | 0.00 | 1(referent) | |||||||
| Linear or Segmental | −3.11 | 0.04 | 0.00 | - | 0.49 | 0.011 | ** | ||
| Clustered | −0.69 | 0.50 | 0.22 | - | 1.18 | 0.113 | |||
| Regional or Scattered | −1.94 | 0.14 | 0.01 | - | 2.83 | 0.202 | |||
|
|
| *** | |||||||
| None | 0.00 | 1(referent) | |||||||
| Circumscribed | −2.51 | 0.08 | 0.01 | - | 0.45 | 0.004 | *** | ||
| Ill-Defined | 0.19 | 1.21 | 0.46 | - | 3.20 | 0.703 | |||
| Obscured | 0.10 | 1.10 | 0.11 | - | 11.31 | 0.935 | |||
| Spiculated | 28.70 | Inf | 0.00 | - | Inf | 0.983 | |||
|
|
| ** | |||||||
| None | 0.00 | 1(referent) | |||||||
| Irregular | 1.91 | 6.78 | 0.78 | - | 58.79 | 0.083 | * | ||
| Lobular or Oval | −0.13 | 0.87 | 0.24 | - | 3.16 | 0.838 | |||
| Round | −15.53 | 0.00 | 0.00 | - | Inf | 0.987 | |||
|
|
| * | |||||||
| Not present | 0.00 | 1(referent) | |||||||
| Present | 1.63 | 5.10 | 0.54 | - | 47.77 | 0.154 | |||
The model is presented in the order of inclusion into the model.
*Asterisks denote the level of significance such that: ***p-value < 0.001; **p-value < 0.05, and *p-value <0.1.
“Inf” (short for infinity) is inserted at places where the data for the corresponding variable is sparsely populated and produces a very high and unstable odds ratio.
Multivariable model for the middle group using stepwise regression with AIC criterion*
| Risk factor | Beta | Odds ratio | 95% CI (Lower -Upper) | p value | |||||
|---|---|---|---|---|---|---|---|---|---|
| (Intercept) | -1.367 | 0.255 | 0.16 | - | 0.41 | <0.001 | *** | ||
|
|
| *** | |||||||
| Calcifications or Single dilated duct | 0 | 1(referent) | |||||||
| Architectural distortion | 18.123 | Inf | 0 | - | Inf | 0.979 | |||
| Associated calcifications | 1.911 | 6.757 | 3.04 | - | 15.03 | <0.001 | *** | ||
| Missing | 0.562 | 1.754 | 0.95 | - | 3.23 | 0.072 | * | ||
| Asymmetry or Focal asymmetry | 2.212 | 9.13 | 1.83 | - | 45.5 | 0.007 | *** | ||
| Mass | 2.81 | 16.604 | 7.54 | - | 36.55 | <0.001 | *** | ||
| Developing asymmetry | 18.049 | Inf | 0 | - | Inf | 0.991 | |||
|
|
| *** | |||||||
| No corresponding palpable mass | 0 | 1(referent) | |||||||
| Missing | 1.01 | 2.74 | 0.68 | - | 11.1 | 0.158 | |||
| Corresponding palpable mass | 1.2 | 3.322 | 1.91 | - | 5.79 | <0.001 | *** | ||
|
|
| *** | |||||||
| None | 0 | 1(referent) | |||||||
| Circumscribed | 0.529 | 1.697 | 0.14 | - | 20.5 | 0.677 | |||
| Ill-Defined | 0.24 | 1.272 | 0.31 | - | 5.24 | 0.74 | |||
| Obscured | 16.662 | Inf | 0 | - | Inf | 0.991 | |||
| Spiculated | 2.6 | 13.463 | 3.03 | - | 59.76 | 0.001 | *** | ||
|
|
| ** | |||||||
| Not present | 0 | 1(referent) | |||||||
| Missing | 0.729 | 2.074 | 0.97 | - | 4.42 | 0.059 | * | ||
| Present | 0.627 | 1.871 | 1.07 | - | 3.28 | 0.029 | ** | ||
|
|
| * | |||||||
| None | 0 | 1(referent) | |||||||
| Irregular | 2.114 | 8.28 | 0.96 | - | 71.31 | 0.054 | * | ||
| Lobular or Oval | 0.728 | 2.072 | 0.36 | - | 11.88 | 0.414 | |||
| Round | 16.123 | Inf | 0 | - | Inf | 0.997 | |||
The model is presented in the order of inclusion into the model.
*Asterisks denote the level of significance such that: *** p-value < 0.001; **p-value < 0.05, and * p-value <0.1.
“Inf” (short for infinity) is inserted at places where the data for the corresponding variable is sparsely populated and produces a very high and unstable odds ratio.
Multivariable model for younger group using stepwise regression with AIC criterion*
| Risk factor | Beta | Odds ratio | 95% CI (Lower -Upper) | p value | |||||
|---|---|---|---|---|---|---|---|---|---|
| (Intercept) | −0.64 | 0.53 | 0.35 | - | 0.8 | 0.002 | *** | ||
|
|
| *** | |||||||
| No corresponding palpable mass | 0 | 1(referent) | |||||||
| Missing | −0.68 | 0.51 | 0.16 | - | 1.6 | 0.246 | |||
| Corresponding palpable mass | 1.21 | 3.36 | 1.79 | - | 6.32 | 0 | *** | ||
|
|
| *** | |||||||
| Calcifications or Single dilated duct | 0 | 1(referent) | |||||||
| Architectural distortion | 1.95 | 7.05 | 0.75 | - | 65.98 | 0.087 | * | ||
| Associated calcifications | 1.58 | 4.85 | 1.87 | - | 12.55 | 0.001 | *** | ||
| Missing | 1.02 | 2.76 | 1.34 | - | 5.7 | 0.006 | *** | ||
| Asymmetry or Focal asymmetry | 1.86 | 6.41 | 1.26 | - | 32.64 | 0.025 | ** | ||
| Mass | 2.74 | 15.51 | 4.97 | - | 48.35 | 0 | *** | ||
| Developing asymmetry | 16.5 | Inf | 0 | - | Inf | 0.997 | |||
|
|
| * | |||||||
| Not present | 0 | 1(referent) | |||||||
| Present | 1.78 | 5.91 | 0.67 | - | 52.13 | 0.11 | |||
|
|
| * | |||||||
| None | 0 | 1(referent) | |||||||
| Irregular | 15.83 | Inf | 0 | - | Inf | 0.986 | |||
| Lobular or Oval | 0.09 | 1.1 | 0.23 | - | 5.21 | 0.787 | |||
| Round | −19.53 | 0 | 0 | - | Inf | 0.996 | |||
|
|
| * | |||||||
| None | 0 | 1(referent) | |||||||
| 20-Oct | −0.97 | 0.38 | 0.03 | - | 4.61 | 0.447 | |||
| 20-50 | 1.7 | 5.47 | 1.17 | - | 25.69 | 0.031 | ** | ||
| <10 | −0.58 | 0.56 | 0.19 | - | 1.63 | 0.287 | |||
| > = 50 | −0.58 | 0.56 | 0.14 | - | 2.25 | 0.413 | |||
The model is presented in the order of inclusion into the model.
*Asterisks denote the level of significance such that: *** p-value < 0.001; **p-value < 0.05, and * p-value <0.1.
“Inf” (short for infinity) is inserted at places where the data for the corresponding variable is sparsely populated and produces a very high and unstable odds ratio.
Figure 2ROC curves for age specific models. Graph shows receiver operating characteristic (ROC) curves constructed from predictions from multivariable logistic regression models for older, middle, and younger group. AUC refers to area under the ROC curve and SE refers to standard error.
Figure 3Misclassification rates of models for older versus younger group at all possible thresholds. False negative rate (FNR) and false positive rate (FPR) for two of the age-based models: the older group (dashed lines) and the younger group (solid lines), are graphed for all threshold levels.