| Literature DB >> 16111503 |
Amir R Razavi1, Hans Gill, Olle Stål, Marie Sundquist, Sten Thorstenson, Hans Ahlfeldt, Nosrat Shahsavar.
Abstract
BACKGROUND: A common approach in exploring register data is to find relationships between outcomes and predictors by using multiple regression analysis (MRA). If there is more than one outcome variable, the analysis must then be repeated, and the results combined in some arbitrary fashion. In contrast, Canonical Correlation Analysis (CCA) has the ability to analyze multiple outcomes at the same time. One essential outcome after breast cancer treatment is recurrence of the disease. It is important to understand the relationship between different predictors and recurrence, including the time interval until recurrence. This study describes the application of CCA to find important predictors for two different outcomes for breast cancer patients, loco-regional recurrence and occurrence of distant metastasis and to decrease the number of variables in the sets of predictors and outcomes without decreasing the predictive strength of the model.Entities:
Mesh:
Year: 2005 PMID: 16111503 PMCID: PMC1208892 DOI: 10.1186/1472-6947-5-29
Source DB: PubMed Journal: BMC Med Inform Decis Mak ISSN: 1472-6947 Impact factor: 2.796
List of variables in both sets
| Age | DM, first two years |
| Tumor location | DM, 2–4 years |
| Side | DM, more than 4 years |
| Tumor size * | LRR, first two years |
| LN involvement * | LRR, 2–4 years |
| LN involvement † | LRR, more than 4 years |
| Periglandular growth * | |
| NHG | |
| Multiple tumors * | |
| Estrogen receptor | |
| Progesterone receptor | |
| S-phase fraction | |
| DNA index | |
| DNA ploidy |
Abbreviations: LN: lymph node, NHG: Nottingham Histologic Grade, DM: Distant Metastasis, LRR: Loco-regional Recurrence
* from pathology report, † N0: Not palpable LN metastasis, ‡ all periods are time after diagnosis.
Transformation rules and the study population characteristics
| Age | >50 years | 0 | 177 |
| ≤ 50 years | 1 | 459 | |
| Tumor location | (not) Superior medial | (0)1 | 144 |
| (not) Inferior medial | (0)1 | 70 | |
| (not) Superior lateral | (0)1 | 368 | |
| (not) Inferior lateral | (0)1 | 112 | |
| (not) Nipple area | (0)1 | 58 | |
| Side | Left | 0 | 315 |
| Right | 1 | 322 | |
| Tumor Size | ≤ 20 mm | 0 | 233 |
| >20 mm | 1 | 404 | |
| LN involvement | No LN involvement | 0 | 373 |
| Positive LN involvement | 1 | 260 | |
| LN involvement (N0) | No palpable LN | 0 | 100 |
| Palpable and/or fixed LNs | 1 | 533 | |
| Periglandular growth | Absence of growth | 0 | 515 |
| Presence of growth | 1 | 122 | |
| Nottingham Histologic Grade | I | 1 | 145 |
| II | 2 | 228 | |
| III | 3 | 264 | |
| Multiple tumors | Absence of multiple tumors | 0 | 502 |
| Presence of multiple tumors | 1 | 134 | |
| Estrogen receptor | ≥ 0.3 fmol/mg | 0 | 181 |
| <0.3 fmol/mg | 1 | 456 | |
| Progesterone receptor | ≥ 0.3 fmol/mg | 0 | 232 |
| <0.3 fmol/mg | 1 | 405 | |
| S-phase fraction | <10% | 0 | 439 |
| ≥ 10% | 1 | 198 | |
| DNA index (DI) | 0.9 ≤ DI and DI < 1.3 | 0 | 345 |
| 0.9 > DI or DI ≥ 1.3 | 1 | 292 | |
| DNA ploidy | DNA diploidy or tetraploidy | 0 | 368 |
| DNA aneuploid | 1 | 269 |
Canonical Structure Matrix for Predictor and outcome Variates
| Age | .223 | ||
| Tumor location | |||
| Superior medial | .138 | DM, more than 4 years | .193 |
| Inferior medial | .159 | ||
| Superior lateral | -.056 | LRR, 2–4 years | -.030 |
| Inferior lateral | .155 | LRR, more than 4 years | -.013 |
| Nipple area | .160 | ||
| Side | -.017 | ||
| Multiple tumors * | .110 | ||
Abbreviations: LN: lymph node, NHG: Nottingham Histologic Grade, DM: Distant Metastasis, LRR: Loco-regional Recurrence
* from pathology report, † N0: Not palpable LN metastasis, ‡ all periods are time after diagnosis.
If the signs in the sets are the same, then if one increases the other also increases, and vice versa.
Figure 1The first canonical correlation solution. Variables are sorted by the absolute value of their loadings. Abbreviations: LN: lymph node, DM: Distant Metastasis, LRR: Loco-regional Recurrence. * N0: Not palpable LN metastasis, † from pathology report, ‡ all periods are time after diagnosis. If the signs in the sets are the same, then if one increases the other also increases, and vice versa.