| Literature DB >> 24886783 |
Andrea Woltmann1, Bowang Chen1, Jesús Lascorz1, Robert Johansson2, Jorunn E Eyfjörd3, Ute Hamann4, Jonas Manjer5, Kerstin Enquist-Olsson6, Roger Henriksson7, Stefan Herms8, Per Hoffmann8, Kari Hemminki9, Per Lenner2, Asta Försti9.
Abstract
Genome-wide association studies (GWASs) may help to understand the effects of genetic polymorphisms on breast cancer (BC) progression and survival. However, they give only a focused view, which cannot capture the tremendous complexity of this disease. Therefore, we investigated data from a previously conducted GWAS on BC survival for enriched pathways by different enrichment analysis tools using the two main annotation databases Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG). The goal was to identify the functional categories (GO terms and KEGG pathways) that are consistently overrepresented in a statistically significant way in the list of genes generated from the single nucleotide polymorphism (SNP) data. The SNPs with allelic p-value cut-offs 0.005 and 0.01 were annotated to the genes by excluding or including a 20 kb up-and down-stream sequence of the genes and analyzed by six different tools. We identified eleven consistently enriched categories, the most significant ones relating to cell adhesion and calcium ion binding. Moreover, we investigated the similarity between our GWAS and the enrichment analyses of twelve published gene expression signatures for breast cancer prognosis. Five of them were commonly used and commercially available, five were based on different aspects of metastasis formation and two were developed from meta-analyses of published prognostic signatures. This comparison revealed similarities between our GWAS data and the general and the specific brain metastasis gene signatures as well as the Oncotype DX signature. As metastasis formation is a strong indicator of a patient's prognosis, this result reflects the survival aspect of the conducted GWAS and supports cell adhesion and calcium signaling as important pathways in cancer progression.Entities:
Mesh:
Year: 2014 PMID: 24886783 PMCID: PMC4041745 DOI: 10.1371/journal.pone.0098229
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Flow chart of the pathway enrichment analysis of the GWAS on BC survival.
Number of SNPs and genes corresponding to allelic p-value cut-offs of the GWAS on BC survival.
|
| total No. of SNPs | No. of SNPs within a gene | No. of genes | No. of SNPs ±20 kb | No. of genes ±20 kb |
| <0.05 | 4572 | 1664 | 1015 | 2525 | 1576 |
| <0.01 | 3080 | 1137 | 737 | 1725 | 1143 |
| <0.005 | 1607 | 576 | 402 | 746 | 638 |
| <0.001 | 329 | 112 | 83 | 163 | 125 |
| <0.0001 | 40 | 9 | 9 | 12 | 10 |
Number of GO annotations and KEGG pathways enriched by six pathway enrichment tools for gene lists with allelic SNP p-value cut-offs 0.005 and 0.01.
| SNP | SNP | |||||||
| within a gene | gene locus ±20 kb | within a gene | gene locus ±20 kb | |||||
| Tool name | No. of GO Annotations | No. of KEGG pathways | No. of GO Annotations | No. of KEGG pathways | No. of GO Annotations | No. of KEGG pathways | No. of GO Annotations | No. of KEGG pathways |
| ConsensusPathDB | 159 | 16 | 170 | 14 | 161 | 25 | 176 | 21 |
| DAVID | 32 | 0 | 15 | 0 | 22 | 5 | 4 | 1 |
| FatiGO | 7 | 0 | 5 | 0 | 5 | 0 | 4 | 0 |
| GATHER | 20 | 3 | 10 | 2 | 33 | 5 | 23 | 4 |
| GeneCodis | 133 | 11 | 172 | 15 | 223 | 29 | 278 | 26 |
| WebGestalt | 50 | 15 | 24 | 17 | 77 | 37 | 52 | 37 |
Pathway enrichment p-value cut-off: 0.05.
Consistently enriched categories of the GWAS on BC survival.
| GO Annotations (6 tools) | Number of tools | Number of genes in category | Number of GWAS genes* in category | |
| GO:0005509 | calcium ion binding | 3 | 685 | 67 |
| GO:0007155 | cell adhesion | 4 | 958 | 52 |
|
| ||||
| KEGG 04520 | Adherens junction | 3 | 59 | 9 |
| KEGG 05412 | Arrhythmogenic right ventricular cardiomyopathy | 2 | 65 | 10 |
| KEGG 04360 | Axon guidance | 3 | 80 | 15 |
| KEGG 04020 | Calcium signaling pathway | 3 | 139 | 17 |
| KEGG 05414 | Dilated cardiomyopathy | 3 | 82 | 11 |
| KEGG 04512 | ECM-receptor interaction | 4 | 57 | 8 |
| KEGG 04510 | Focal adhesion | 3 | 134 | 15 |
| KEGG 00512 | O-Glycan biosynthesis | 2 | 11 | 7 |
| KEGG 05222 | Small cell lung cancer | 2 | 65 | 6 |
Only categories enriched in all four gene lists and by more than one tool were considered consistently enriched. * Genes present in the 0.01 gene list (allelic SNP p-value cut-off 0.01).
Gene overlap of the consistently enriched categories for all pathway genes.
| a | b | c | d | e | f | g | h | i | j | k | ||||
| ID | Category | Number of genes | 685 | 958 | 59 | 65 | 80 | 139 | 82 | 57 | 134 | 11 | 65 | |
|
| GO:0005509 | Calcium ion binding | 685 | – | 185 | 0 | 6 | 3 | 14 | 8 | 1 | 9 | 0 | 0 |
|
| GO:0007155 | Cell adhesion | 958 | – | 21 | 29 | 25 | 5 | 25 | 48 | 63 | 0 | 23 | |
|
| KEGG 04520 | Adherens junction | 59 | – | 8 | 9 | 2 | 1 | 0 | 17 | 0 | 0 | ||
|
| KEGG 05412 | Arrhythmogenic right ventricular cardiomyopathy | 65 | – | 1 | 5 | 51 | 21 | 23 | 0 | 6 | |||
|
| KEGG 04360 | Axon guidance | 80 | – | 2 | 1 | 1 | 20 | 0 | 2 | ||||
|
| KEGG 04020 | Calcium signaling pathway | 139 | – | 18 | 0 | 6 | 0 | 1 | |||||
|
| KEGG 05414 | Dilated cardiomyopathy | 82 | – | 21 | 23 | 0 | 6 | ||||||
|
| KEGG 04512 | ECM-receptor interaction | 57 | – | 42 | 0 | 18 | |||||||
|
| KEGG 04510 | Focal adhesion | 134 | – | 0 | 32 | ||||||||
|
| KEGG 00512 | O-Glycan biosynthesis | 11 | – | 0 | |||||||||
|
| KEGG 05222 | Small cell lung cancer | 65 | – |
Gene overlap of the consistently enriched categories based on the GWAS genes present in the 0.01 gene list.
| a | b | c | d | e | f | g | h | i | j | k | ||||
| ID | Category | Number of genes | 67 | 52 | 9 | 10 | 15 | 17 | 11 | 8 | 15 | 7 | 6 | |
|
| GO:0005509 | Calcium ion binding | 67 | – | 13 | 1 | 8 | 1 | 11 | 7 | 2 | 4 | 6 | 0 |
|
| GO:0007155 | Cell adhesion | 52 | – | 3 | 5 | 3 | 0 | 3 | 7 | 10 | 0 | 5 | |
|
| KEGG 04520 | Adherens junction | 9 | – | 2 | 1 | 0 | 0 | 0 | 2 | 0 | 0 | ||
|
| KEGG 05412 | Arrhythmogenic right ventricular cardiomyopathy | 10 | – | 1 | 2 | 8 | 3 | 4 | 0 | 1 | |||
|
| KEGG 04360 | Axon guidance | 15 | – | 1 | 1 | 1 | 2 | 0 | 1 | ||||
|
| KEGG 04020 | Calcium signaling pathway | 17 | – | 4 | 0 | 1 | 0 | 0 | |||||
|
| KEGG 05414 | Dilated cardiomyopathy | 11 | – | 3 | 3 | 0 | 1 | ||||||
|
| KEGG 04512 | ECM-receptor interaction | 8 | – | 7 | 0 | 5 | |||||||
|
| KEGG 04510 | Focal adhesion | 15 | – | 0 | 5 | ||||||||
|
| KEGG 00512 | O-Glycan biosynthesis | 7 | – | 0 | |||||||||
|
| KEGG 05222 | Small cell lung cancer | 6 | – |
Figure 2Top 25 GeneGO pathways enriched by the 0.01 gene list derived from the GWAS data.
red numbers = significant at FDR of 0.05.
Distribution of the seven generic terms among the 50 top pathways in the enrichment analyses of the 13 gene lists.
| signature name | No. ofgenes | cytoskeletonremodeling | celladhesion | cellcycle | neurophysio- logicalprocess | musclecontraction | immuneresponse | development | sum ofpathways | |||||||||
| total | % | total | % | total | % | total | % | total | % | total | % | total | % | total | % | |||
|
| 0.01 gene list | 737 | 6 | 12 | 6 | 12 | 0 | 0 | 6 | 12 | 3 | 6 | 1 | 2 | 7 | 14 | 29 | 58 |
|
| Mammaprint | 70 | 1 | 2 | 3 | 6 | 3 | 6 | 5 | 10 | 2 | 4 | 3 | 6 | 12 | 24 | 29 | 58 |
| Oncotype DX | 21 | 5 | 10 | 4 | 8 | 9 | 18 | 1 | 2 | 0 | 0 | 1 | 2 | 10 | 20 | 30 | 60 | |
| MapQuant | 97 | 4 | 8 | 1 | 2 | 17 | 34 | 0 | 0 | 0 | 0 | 2 | 4 | 1 | 2 | 25 | 50 | |
| Gene Search | 76 | 2 | 4 | 3 | 6 | 9 | 18 | 2 | 4 | 0 | 0 | 6 | 12 | 6 | 12 | 28 | 56 | |
| Wound responsesignature | 512 | 5 | 10 | 3 | 6 | 3 | 6 | 0 | 0 | 0 | 0 | 4 | 8 | 4 | 8 | 19 | 38 | |
|
| Lung metastasissignature | 54 | 4 | 8 | 5 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 16 | 32 | 11 | 22 | 36 | 72 |
| Brain metastasissignature | 243 | 2 | 4 | 6 | 12 | 0 | 0 | 0 | 0 | 1 | 2 | 18 | 36 | 11 | 22 | 38 | 76 | |
| Bone metastasissignature | 102 | 6 | 12 | 4 | 8 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 28 | 11 | 22 | 35 | 70 | |
|
| Invasivenesssignature | 186 | 4 | 8 | 1 | 2 | 1 | 2 | 1 | 2 | 0 | 0 | 8 | 16 | 7 | 14 | 22 | 44 |
| Generalmetastasissignature | 128 | 4 | 8 | 4 | 8 | 8 | 16 | 2 | 4 | 3 | 6 | 4 | 8 | 5 | 10 | 30 | 60 | |
|
| Meta genesignature | 376 | 0 | 0 | 0 | 0 | 17 | 34 | 0 | 0 | 0 | 0 | 5 | 10 | 3 | 6 | 25 | 50 |
| 374 GeneSet/consensusgenes | 374 | 5 | 10 | 3 | 6 | 17 | 34 | 0 | 0 | 0 | 0 | 3 | 6 | 3 | 6 | 31 | 62 | |
Prognostic gene expression signatures selected for pathway enrichment analysis.
| Signaturename | Author | Year ofpublication | No. ofgenes | Study design | Outcome |
| Mammaprint | Van’t Veer | 2002 | 70 | 78 patients with sporadic primary breast tumors:<5 cm, N0, age <55 years; 34 patients developeddistant metastasis <5 years vs. 44 patients:disease-free >5 years | Prognosis for distantmetastasis |
| Oncotype DX | Paik | 2004 | 21 | 668 tumors from patients: N0 and ER+,treated with tamoxifen | Prognosis for distantrecurrence/overall survival |
| MapQuant | Sotirou | 2006 | 97 | 64 samples: ER+, grade 1 vs. grade 3 | Prognosis for recurrence/relapse-free survival |
| Gene Search | Wang et al. | 2005 | 76 | 115 tumors: all N0; 80 samples ER+,35 ER-, analyzed separately fordistant tumor recurrence,then combined | Prognosis for distanttumor recurrence |
| Wound responsesignature | Chang | 2004 | 512 | 50 fibroblast culturesfrom 10anatomic sites: response offibroblast to serum exposure | Prognosis formetastasis/survival |
| Lung metastasissignature | Minn | 2005 | 54 | Comparison of highly and weaklylung-metastatic cell populationsderived from the breast cancercell line MDA-MB-231 | Prognosis for lungmetastasis |
| Brain metastasissignature | Bos | 2009 | 243 | Comparison of cell lines withdifferent metastatic potentialsderived from the breast cancercell lines MDA-MB-231 and CN34 | Prognosis for brainmetastasis |
| Bone metastasissignature | Kang | 2003 | 102 | MDA-MB-231 breast cancer cell line+12 derivative subpopulations withdifferent metastatic potentials | Prognosis for bonemetastasis |
| Invasivenesssignature | Liu | 2007 | 186 | CD44+CD24−/low breast cancer cellswith high tumorgenic capacity vs. cellsof normal breast epithelium | Prognosis for overall/metastasis-free survival |
| General metastasissignature | Ramaswamy | 2003 | 128 | 64 primary adenocarcinomas of diverseorigin (lung, breast, prostate,colorectal, uterus, ovary) vs. 12unmatched adenocarcinoma metastasis | Metastatic potential,clinical outcome |
| Meta genesignature | Györffy | 2009 | 376 | Meta-analysis of 20 published genesignatures on 7 breast cancermicroarray data sets (n = 1079) | Prognosis forrelapse-free survival |
| 374 GeneSet/consensus genes | Lauss | 2008 | 374 | Meta-analysis of 44 published genesignatures on 8 breastcancer microarray datasets (n = 1067) | Prognosis forsurvival |
Figure 3Top 25 GeneGO pathways enriched simultaneously by the 0.01 gene list and general metastasis signature; red numbers = significant at FDR of 0.05; green box = pathway significantly enriched by both gene lists; Pathway “Airway smooth muscle contraction in asthma” was placed at rank 7, pathway “Cytoskeleton remodeling_Cytoskeleton remodeling” was placed at rank 9.
Figure 4GeneGo pathway “Airway smooth muscle contraction in asthma”.
Barometers: 1 = 0.01 gene list; 2 = general metastasis signature. red = Calcium signaling pathway, blue = Smooth muscle contraction/relaxation.
Figure 5GeneGo pathway “Cytoskeleton remodeling”.
Barometer: 1 = 0.01 gene list; 2 = general metastasis signature; 3 = brain metastasis signature. orange = ECM-receptor interaction, purple = Adherens junction pathway, red = Calcium signaling pathway, pink = Focal adhesion pathway, yellow = TGF-β signaling pathway, green = Wnt signaling pathway, blue = Smooth muscle contraction/relaxation.
Figure 6GeneGo pathway “Neurophysiological process_Receptor-mediated axon growth repulsion”.
Barometers: 1 = 0.01 gene list; 2 = Oncotype DX. red = Calcium signaling pathway.