| Literature DB >> 27231580 |
Farahnaz Sadoughi1, Hadi Lotfnezhad Afshar2, Asiie Olfatbakhsh3, Neda Mehrdad3.
Abstract
BACKGROUND: Advances in treatment options of breast cancer and development of cancer research centers have necessitated the collection of many variables about breast cancer patients. Detection of important variables as predictors and outcomes among them, without applying an appropriate statistical method is a very challenging task. Because of recurrent nature of breast cancer occurring in different time intervals, there are usually more than one variable in the outcome set. For the prevention of this problem that causes multicollinearity, a statistical method named canonical correlation analysis (CCA) is a good solution.Entities:
Keywords: Breast Neoplasms; Data Mining; Neoplasm Recurrence; Statistics as Topic
Year: 2016 PMID: 27231580 PMCID: PMC4879760 DOI: 10.5812/ircmj.23131
Source DB: PubMed Journal: Iran Red Crescent Med J ISSN: 2074-1804 Impact factor: 0.611
List of Variables in Both Sets
| Variables |
|---|
|
|
| Age |
| Family history |
| Tumor size |
| Number of involved LN |
| LN positive |
| Number of removed LN |
| Pathology of tumor |
| Type of surgery |
| Tumor grade |
| Estrogen receptor |
| Progesterone receptor |
| Radiotherapy |
| Hormone therapy |
|
|
| DM, first three years |
| DM, 3 - 5 years |
| DM, more than 5 years |
| LRR, first three years |
| LRR, 3 - 5 years |
| LRR, more than 5 years |
Abbreviations: DM, distant metastasis; LN, lymph node; LRR, loco-regional recurrence.
aAll periods are time after diagnosis.
Transformation Rules and the Study Population Characteristics
| Variables | Coding | Values [ |
|---|---|---|
|
| ||
| > 50 | 0 | 219 (37.5) |
| ≤ 50 | 1 | 365 (62.5) |
|
| ||
| No | 0 | 485 (83) |
| First degree | 1 | 99 (17) |
|
| ||
| (not) < 2 | (0) 1 | 84 (14.4) |
| (not) 2 - 5 | (0) 1 | 268 (45.9) |
| (not) > 5 | (0) 1 | 232 (39.7) |
|
| ||
| (not) Nothing | (0) 1 | 231 (39.6) |
| (not) 1 - 3 | (0) 1 | 187 (32) |
| (not) 3 - 9 | (0) 1 | 112 (19.2) |
| (not) > 9 | (0) 1 | 54 (9.2) |
|
| ||
| No | 0 | 174 (29.8) |
| Yes | 1 | 410 (70.2) |
|
| ||
| Zero | 0 | 29 (5) |
| One or more | 1 | 381 (95) |
|
| ||
| LCIS | (0) 1 | 47 (8) |
| DCIS | (0) 1 | 72 (12.3) |
| IDC | (0) 1 | 272 (46.6) |
| ILC | (0) 1 | 95 (16.3) |
| Medullary | (0) 1 | 53 (9.1) |
| Micro invasive | (0) 1 | 24 (4.1) |
| Paget disease | (0) 1 | 4 (0.7) |
| Inflammatory | (0) 1 | 1 (0.2) |
| Other | (0) 1 | 16 (2.7) |
|
| ||
| MRM | (0) 1 | 399 (68.3) |
| BCS | (0) 1 | 135 (23.1) |
| Bilateral MRM | (0) 1 | 25 (4.3) |
| Bilateral BCS | (0) 1 | 23 (3.9) |
| MRM + BCS | (0) 1 | 1 (0.2) |
| Combined | (0) 1 | 1 (0.2) |
|
| ||
| First grade | 1 | 114 (19.5) |
| Second grade | 2 | 319 (54.6) |
| Third grade | 3 | 151 (25.9) |
|
| ||
| Negative | 0 | 241 (41.3) |
| Positive | 1 | 343 (58.7) |
|
| ||
| Negative | 0 | 242 (41.4) |
| Positive | 1 | 342 (58.6) |
|
| ||
| No | 0 | 220 (37.7) |
| Yes | 1 | 364 (62.3) |
|
| ||
| No | (0) 1 | 41 (7) |
| Tamoxifen | (0) 1 | 173 (29.6) |
| Raloxifene | (0) 1 | 21 (3.6) |
| Letrozole | (0) 1 | 46 (7.9) |
| Aromasin | (0) 1 | 18 (3.1) |
| Megace | (0) 1 | 16 |
| Combined | (0) 1 | 269 |
Abbreviations: BCS, breast conserving surgery; DCIS, ductal carcinoma in situ; IDC, invasive ductal carcinoma; ILC, invasive lobular carcinoma; LCIS, lobular carcinoma in situ; MRM, modified radical mastectomy; P, preservation.
aValues are presented as No. (%).
Figure 1.Illustration of the First Function in a Canonical Correlation Analysis With Three Predictors and Two Criterion Variables
Canonical Solution for Function 1
| Variables | Coef | R3 | R2s, % |
|---|---|---|---|
|
| -0.015 | -0.163 | 2.65 |
|
| -0.436 | -0.795[ | 63.2 |
|
| 0.007 | 0.126 | 1.58 |
|
| 0.085 | 0.414 | 17.13 |
|
| 0 | -0.512[ | 26.21 |
|
| 0.139 | 0.293 | 8.58 |
|
| -0.021 | -0.081 | 0.65 |
|
| 0.066 | 0.04 | 0.16 |
|
| 0 | -0.419 | 17.55 |
|
| 0.061 | -0.307 | 9.42 |
|
| -.021 | -0.155 | 2.4 |
|
| -.408 | -0.562[ | 31.58 |
|
| -0.259 | 0.122 | 1.48 |
|
| -0.373 | 0.492[ | 24.2 |
|
| -0.545 | -0.302 | 9.12 |
|
| -0.332 | -0.041 | 0.16 |
|
| -0.191 | -0.045 | 0.2 |
|
| -0.153 | -0.091 | 0.82 |
|
| -0.164 | 0.04 | 0.16 |
|
| 0 | 0.019 | 0.03 |
|
| 1.569 | 0.285 | 8.12 |
|
| 1.317 | 0.054 | 0.29 |
|
| 0.881 | -0.212 | 4.49 |
|
| 0.491 | -0.558[ | 31.13 |
|
| 0.144 | 0.019 | 0.03 |
|
| 0 | -0.119 | 1.41 |
|
| -0.01 | 0.033 | 0.1 |
|
| 0.414 | 0.599[ | 35.88 |
|
| -0.111 | 0.198 | 3.92 |
|
| -0.019 | -0.069 | 0.47 |
|
| 0.161 | 0.112 | 1.25 |
|
| 0.147 | 0.278 | 7.72 |
|
| 0.041 | -0.025 | 0.06 |
|
| 0.113 | 0.218 | 4.75 |
|
| 0.056 | 0.06 | .36 |
|
| -0.012 | 0.025 | 0.06 |
|
| 0 | -0.451[ | 20.34 |
|
| 9.12 | ||
|
| 0.173 | 0.209 | 4.36 |
|
| 0.175 | 0.2 | 4 |
|
| 0.074 | -0.058 | 0.33 |
|
| -0.437 | -0.356 | 12.67 |
|
| -0.564 | -0.479[ | 22.94 |
|
| -0.739 | -0.686[ | 47.05 |
Abbreviations: Coef, standardized canonical function coefficient; rs, structure coefficient; rs2, squared structure coefficient, R2c, squared canonical correlations.
aStructure coefficients (rs) greater than 0.45 are underlined.