| Literature DB >> 26185765 |
Gabriel Wajnberg1, Benilton S Carvalho2, Carlos G Ferreira3, Fabio Passetti1.
Abstract
Copy number variation is a class of structural genomic modifications that includes the gain and loss of a specific genomic region, which may include an entire gene. Many studies have used low-resolution techniques to identify regions that are frequently lost or amplified in cancer. Usually, researchers choose to use proprietary or non-open-source software to detect these regions because the graphical interface tends to be easier to use. In this study, we combined two different open-source packages into an innovative strategy to identify novel copy number variations and pathways associated with cancer. We used a mesothelioma and ependymoma published datasets to assess our tool. We detected previously described and novel copy number variations that are associated with cancer chemotherapy resistance. We also identified altered pathways associated with these diseases, like cell adhesion in patients with mesothelioma and negative regulation of glutamatergic synaptic transmission in ependymoma patients. In conclusion, we present a novel strategy using open-source software to identify copy number variations and altered pathways associated with cancer.Entities:
Mesh:
Year: 2015 PMID: 26185765 PMCID: PMC4491549 DOI: 10.1155/2015/902419
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Comparison of SNP array data of mesothelioma [12] analyzed by DNAcopy and CGHcall with amplifications.
| DNAcopy | CGHcall | Christensen et al. [ | |||
|---|---|---|---|---|---|
| Chromosome arm | Patients | Chromosome arm | Patients | Chromosome arm | Patients |
| 1q | 6 | 1q | 5 | 1q | 7 |
| 15q | 7 | 15q | 5 | 15q | 4 |
| 8q | 4 | 8q | 3 | 8q | 9 |
| 7p | 3 | 7p | 4 | 7p | 5 |
| 8p | 4 | 5p | 3 | 5p | 5 |
| 10p | 4 | 10p | 4 | 10p | 0 |
| 9p | 0 | 9p | 4 | 9p | 0 |
| 6q | 3 | 6q | 3 | 6q | 0 |
| 6p | 0 | 6p | 4 | 6p | 0 |
Comparison of SNP array data of mesothelioma [12] analyzed by DNAcopy and CGHcall with lost regions.
| DNAcopy | CGHcall | Christensen et al. [ | |||
|---|---|---|---|---|---|
| Chromosome arm | Patients | Chromosome arm | Patients | Chromosome arm | Patients |
| 1p | 18 | 1p | 16 | 1p | 15 |
| 6q | 15 | 6q | 8 | 6q | 15 |
| 22q | 6 | 22q | 10 | 22q | 10 |
| 14q | 9 | 14q | 11 | 14q | 9 |
| 9p | 10 | 9p | 11 | 9p | 9 |
| 4q | 8 | 4q | 9 | 4q | 7 |
| 4p | 7 | 4p | 6 | 4p | 6 |
| 13q | 4 | 13q | 5 | 13q | 4 |
| 18q | 2 | 18q | 13 | 18q | 3 |
| 10p | 2 | 10p | 2 | 10p | 2 |
| 16q | 15 | 16q | 6 | 16q | 0 |
Figure 1Venn diagram comparing the number of chromosomal regions in which gained (a) or lost (b) regions were identified by DNAcopy and CGHcall and by Christensen and colleagues [12].
Figure 2KO:04514 from KEGG: colored if genes were identified in at least one patient with amplification (green), lost (red), both (yellow), and with probe with no hybridization (gray).
Gene set enrichment analysis using MGSA package with posterior probability ≥0.5 for the mesothelioma dataset.
| ID | Pathway | Posterior probability |
|---|---|---|
| KO:04740 | Olfactory transduction | 0.98 |
| GO:0042613 | MHC class II protein complex | 0.97 |
| GO:0034332 | Adherens junction organization | 0.93 |
| KO:03460 | Fanconi anemia pathway | 0.86 |
| KO:00350 | Tyrosine metabolism | 0.79 |
| KO:00565 | Ether lipid metabolism | 0.76 |
| KO:04514 | Cell adhesion molecules (CAMs) | 0.69 |
| KO:00650 | Butanoate metabolism | 0.53 |
| KO:05416 | Viral myocarditis | 0.53 |
KO IDs are from the KEGG database and GO IDs are from the Gene Ontology database.
Comparison of SNP array data of ependymoma [13] analyzed by DNAcopy and CGHcall with amplifications.
| DNAcopy | CGHcall | Kilday et al. [ | |||
|---|---|---|---|---|---|
| Chromosome arm | Patients | Chromosome arm | Patients | Chromosome arm | Patients |
| 1q | 9 | 1q | 15 | 1q | 14 |
| 2q | 7 | 2q | 9 | 2q | 11 |
| 2p | 4 | 2p | 10 | 2p | 11 |
| 19p | 4 | 19p | 12 | 19p | 10 |
| 5p | 5 | 5p | 9 | 5p | 10 |
| 17q | 4 | 17q | 7 | 17q | 5 |
| 18p | 3 | 18p | 12 | 18p | 10 |
| 17p | 2 | 17p | 7 | 17p | 5 |
| 19q | 3 | 19q | 10 | 19q | 9 |
| 15q | 4 | 15q | 13 | 15q | 8 |
| 6q | 5 | 6q | 10 | 6q | 8 |
| 6p | 5 | 6p | 9 | 6p | 8 |
| 12q | 5 | 12q | 8 | 12q | 6 |
| 21q | 5 | 21q | 11 | 21q | 6 |
| 22q | 6 | 22q | 12 | 22q | 7 |
| 7q | 4 | 7q | 9 | 7q | 7 |
| 7p | 4 | 7p | 8 | 7p | 7 |
| 9p | 5 | 9p | 9 | 9p | 7 |
| 9q | 5 | 9q | 9 | 9q | 7 |
| 20p | 4 | 20p | 12 | 20p | 6 |
| 20q | 4 | 20q | 13 | 20q | 6 |
| 11q | 5 | 11q | 5 | 11q | 7 |
| 12p | 5 | 12p | 8 | 12p | 6 |
| 8p | 4 | 8p | 6 | 8p | 6 |
| 8q | 5 | 8q | 6 | 8q | 6 |
| 4p | 4 | 4p | 9 | 4p | 6 |
| 4q | 5 | 4q | 9 | 4q | 6 |
| 14q | 6 | 14q | 9 | 14q | 8 |
| 13q | 7 | 13q | 7 | 13q | 0 |
| 3q | 6 | 3q | 7 | 3q | 0 |
| 3p | 4 | 3p | 7 | 3p | 0 |
| 11p | 0 | 11p | 11 | 11p | 7 |
| 5q | 0 | 5q | 10 | 5q | 10 |
| 10q | 6 | 10q | 10 | 10q | 0 |
| 10p | 4 | 10p | 9 | 10p | 0 |
| 18q | 4 | 18q | 0 | 18q | 10 |
Comparison of SNP array data of ependymoma [13] analyzed by DNAcopy and CGHcall with lost copies.
| DNAcopy | CGHcall | Kilday et al. [ | |||
|---|---|---|---|---|---|
| Chromosome arm | Patients | Chromosome arm | Patients | Chromosome arm | Patients |
| 6p | 8 | 6p | 10 | 6p | 9 |
| 6q | 9 | 6q | 11 | 6q | 8 |
| 22q | 6 | 22q | 7 | 22q | 6 |
| 16q | 6 | 16q | 7 | 16q | 6 |
| 3q | 8 | 10q | 7 | 10q | 7 |
| 2q | 11 | 2q | 11 | 2q | 0 |
| 14q | 7 | 14q | 7 | 14q | 0 |
| 21q | 7 | 21q | 7 | 21p | 0 |
| 3p | 6 | 3p | 8 | 3p | 0 |
| 13q | 6 | 13q | 9 | 13q | 0 |
| 9q | 9 | 9q | 9 | 9q | 0 |
| 9p | 0 | 9p | 7 | 9p | 0 |
| 11q | 0 | 11q | 7 | 11q | 0 |
| 5q | 0 | 5q | 8 | 5q | 0 |
| 1p | 9 | 1p | 14 | 1p | 0 |
| 3q | 0 | 3q | 8 | 3q | 0 |
| 11p | 0 | 11p | 9 | 11p | 0 |
| 10p | 0 | 10p | 6 | 10p | 0 |
Figure 3Venn diagram comparing the number of chromosomal regions in which gained (a) or lost (b) regions were identified by DNAcopy and CGHcall and by Kilday and colleagues [13].
Gene set enrichment analysis using MGSA package with posterior probability ≥ 0.5 for the ependymoma dataset.
| ID | Pathway | Posterior probability |
|---|---|---|
| GO:0004522 | Pancreatic ribonuclease activity | 0.97 |
| GO:0042613 | MHC class II protein complex | 0.84 |
| KO:04740 | Olfactory transduction | 0.79 |
| GO:0016339 | Calcium-dependent cell-cell adhesion | 0.73 |
| KO:04974 | Protein digestion and absorption | 0.66 |
| GO:0046703 | Natural killer cell lectin-like receptor binding | 0.65 |
| GO:0051967 | Negative regulation of synaptic transmission, glutamatergic | 0.55 |
KO IDs are from the KEGG database and GO IDs are from the Gene Ontology database.
Figure 4Functional interaction of genes with amplifications (yellow) in the “negative regulation of synaptic transmission, glutamatergic” GO pathway (GO:0051967).