| Literature DB >> 26900390 |
Hiroko K Solvang1, Arnoldo Frigessi2, Fateme Kaveh3, Margit L H Riis4, Torben Lüders4, Ida R K Bukholm5, Vessela N Kristensen6, Bettina K Andreassen7.
Abstract
Tumor size, as indicated by the T-category, is known as a strong prognostic indicator for breast cancer. It is common practice to distinguish the T1 and T2 groups at a tumor size of 2.0 cm. We investigated the 2.0-cm rule from a new point of view. Here, we try to find the optimal threshold based on the differences between the gene expression profiles of the T1 and T2 groups (as defined by the threshold). We developed a numerical algorithm to measure the overall differential gene expression between patients with smaller tumors and those with larger tumors among multiple expression datasets from different studies. We confirmed the performance of the proposed algorithm by a simulation study and then applied it to three different studies conducted at two Norwegian hospitals. We found that the maximum difference in gene expression is obtained at a threshold of 2.2-2.4 cm, and we confirmed that the optimum threshold was over 2.0 cm, as indicated by a validation study using five publicly available expression datasets. Furthermore, we observed a significant differentiation between the two threshold groups in terms of time to local recurrence for the Norwegian datasets. In addition, we performed an associated network and canonical pathway analyses for the genes differentially expressed between tumors below and above the given thresholds, 2.0 and 2.4 cm, using the Norwegian datasets. The associated network function illustrated a cellular assembly of the genes for the 2.0-cm threshold: an energy production for the 2.4-cm threshold and an enrichment in lipid metabolism based on the genes in the intersection for the 2.0- and 2.4-cm thresholds.Entities:
Keywords: Breast cancer; Differentially expressed; Microarray data; Optimization algorithm; T-category; Two-group comparison statistical test
Year: 2016 PMID: 26900390 PMCID: PMC4746218 DOI: 10.1186/s13637-015-0034-5
Source DB: PubMed Journal: EURASIP J Bioinform Syst Biol ISSN: 1687-4145
Fig. 1Flow diagram of the analysis process
Fig. 2Plots for Fisher’s score and maximum point indicating the optimal threshold 3.0 (left). Right panel indicates plots of scores for more precise regions around 3.0
Fig. 3Plots for DEDS and minimum point indicating the optimal threshold 3
Fig. 4Plots for Fisher’s score (upper) and DEDS (lower) (black small square 5 %, O 20 %, multiplication sign 40 %, triangle 60 %, and asterisk 80 %)
Fig. 5Fisher’s score (upper) found 2.5 cm and mDEDS (lower) found 2.2 cm as the optimal threshold
Fig. 6Plots obtained by TEP
Fig. 7Optimum thresholds by Fisher’s score (top), mDEDS (middle), and TEP (bottom) in the case of validation datasets
p values obtained by the log-rank test for survival time (months) and time to local recurrence (in months)
| Thresholds [cm] | Overall survival | BC specific | Local recurrence |
|---|---|---|---|
| 2.0 | 0.021 | 0.027 | 0.13 |
| 2.2 | 0.18 | 0.047 | 0.045 |
| 2.4 | 0.30 | 0.083 | 0.089 |
Fig. 8Survival curves in the case of local recurrence for each threshold 2.0 cm (top), 2.2 cm (middle), and 2.4 cm (bottom)
Estimated number of significant probes (genes) by SAM
| Size used for response | 2.4 cm | 2.0 cm | |||
|---|---|---|---|---|---|
| Statistics for SAM | # probes | FDR (%) | # probes | FDR (%) | |
| Data | mdata1 | 12 | 15.03 | 44 | 6.03 |
| mdata2 | 11,740 | 5.02 | 8036 | 4.81 | |
| mdata3 | 93 | 20.29 | 9 | 10.42 | |
Fig. 9Summary of unique probes for 2.4 and 2.0 cm and overlapped probes
Summary of pathway analyses
| (Part A) Biological functions enriched in 5618 unique probes separating tumors below and above size 2.4 cm | |
| Associated network functions | Score |
| Cellular assembly and organization, cellular compromise, protein synthesis | 26 |
| Cell signaling, nucleic acid metabolism, small molecule biochemistry | 24 |
| Energy production, nucleic acid metabolism, small molecule biochemistry | 24 |
| Hair and skin development and function, dermatological diseases and conditions, developmental disorder | 22 |
| Post-translational modification, gene expression, infectious disease | 22 |
| Top canonical pathways | −log (FDR-corrected |
| Neuropathic pain signaling in dorsal horn neurons | 1.13 |
| Role of NNFAT in cardiac hypertrophy | 1.13 |
| Melatonin signaling | 6.94 × 10−1 |
| Molecular mechanisms of cancer | 6.94 × 10−1 |
| Calcium-induced T lymphocyte apoptosis | 6.94 × 10−1 |
| Diseases and disorders | FDR-corrected |
| Cancer | 3.95 × 10−1–5.44 × 10−1 |
| Hematological disease | 3.95 × 10−1–5.44 × 10−1 |
| Immunological disease | 5.13 × 10−1–5.44 × 10−1 |
| Hypersensitivity response | 5.44 × 10−1–5.44 × 10−1 |
| Inflammatory response | 5.44 × 10−1–5.44 × 10−1 |
| Molecular and cellular functions | FDR-corrected |
| Gene expression | 2.03 × 10−1–5.44 × 10−1 |
| Cellular growth and proliferation | 2.03 × 10−1–5.44 × 10−1 |
| Energy production | 2.03 × 10−1–5.44 × 10−1 |
| Amino acid metabolism | 2.03 × 10−1–5.44 × 10−1 |
| Small molecule biochemistry | 2.03 × 10−1–5.44 × 10−1 |
| (Part B) Biological functions enriched in 1914 unique probes separating tumors below and above size 2.0 cm | |
| Associated network functions | Score |
| Antigen presentation, cellular movement, hematological system development and function | 29 |
| Cell assembly, and organization, cellular function and maintenance, protein synthesis | 29 |
| Gene expression, infectious disease, small molecule biochemistry | 29 |
| Cellular assembly and organization, cell signaling, gene expression | 29 |
| Post-translational modification, protein folding, cell death | 24 |
| Top canonical pathways | −log (FDR-corrected |
| Tight junction signaling | 9.13 × 10−1 |
| Germ cell-Sertoli cell junction signaling | 8.69 × 10−1 |
| Cfc42 signaling | 1.26 × 10−1 |
| Fatty acid biosynthesis | 8.69 × 10−1 |
| Integrin signaling | 8.69 × 10−1 |
| Diseases and disorders | FDR-corrected |
| Dermatological diseases and conditions | 6.58 × 10−3–2.73 × 10−1 |
| Genetic disorder | 6.58 × 10−3–2.73 × 10−1 |
| Infectious disease | 1.14 × 10−2–2.73 × 10−1 |
| Inflammatory disease | 3.38 × 10−2–2.73 × 10−1 |
| Inflammatory response | 3.38 × 10−2–2.73 × 10−1 |
| Molecular and cellular functions | FDR-corrected |
| Antigen presentation | 3.38 × 10−2–2.73 × 10−1 |
| Cell-to-cell signaling and interaction | 3.38 × 10−2–2.73 × 10−1 |
| Cellular compromise | 3.38 × 10−2–2.73 × 10−1 |
| Cellular function and maintenance | 3.38 × 10−2–2.73 × 10−1 |
| Cellular movement | 7.14 × 10−2–2.73 × 10−1 |
| (Part C) Biological functions enriched in 6112 overlapping probes separating tumors below and above size 2.4 cm and 2.0 cm | |
| Associated network functions | Score |
| Protein synthesis, post-translational modification, cancer | 26 |
| Cell signaling, nucleic acid metabolism, small molecule biochemistry | 24 |
| Lipid metabolism, small molecule biochemistry, vitamin, and mineral metabolism | 24 |
| Connective tissue development and function, embryonic development, skeletal and muscular system development and function | 24 |
| Cancer, dematological diseases and conditions, tumor morphology | 22 |
| Top canonical pathways | −log (FDR-corrected |
| Cytotoxic T lymphocyte-mediated apoptosis of target cells | 3.65 |
| Allograft rejection signaling | 2.77 |
| Nur77 signaling in T lymphocytes | 2.77 |
| Antigen presentation pathway | 2.77 |
| T helper cell differentiation | 1.75 |
| Diseases and disorders | FDR-corrected |
| Dermatological diseases and conditions | 1.98 × 10−7–1.89 × 10−1 |
| Respiratory disease | 7.98 × 10−5–1.89 × 10−1 |
| Cancer | 4.49 × 10−4–1.89 × 10−1 |
| Genetic disorder | 4.49 × 10−4–1.71 × 10−1 |
| Inflammatory response | 4.49 × 10−4–1.89 × 10−1 |
| Molecular and cellular functions | FDR-corrected |
| Cell-to-cell signaling and interaction | 2.68 × 10−4–1.89 × 10−1 |
| Cellular movement | 8.02 × 10−4–1.89 × 10−1 |
| Cellular growth and proliferation | 1.29 × 10−3–1.89 × 10−1 |
| Cellular development | 2.99 × 10−3–1.89 × 10−1 |
| Cell death | 4.1 × 10−3–1.89 × 10−1 |