| Literature DB >> 22396732 |
Annelies Fieuw1, Bram De Wilde, Frank Speleman, Jo Vandesompele, Katleen De Preter.
Abstract
BACKGROUND: Although the throughput of next generation sequencing is increasing and at the same time the cost is substantially reduced, for the majority of laboratories whole genome sequencing of large cohorts of cancer samples is still not feasible. In addition, the low number of genomes that are being sequenced is often problematic for the downstream interpretation of the significance of the variants. Targeted resequencing can partially circumvent this problem; by focusing on a limited number of candidate cancer genes to sequence, more samples can be included in the screening, hence resulting in substantial improvement of the statistical power. In this study, a successful strategy for prioritizing candidate genes for targeted resequencing of cancer genomes is presented.Entities:
Mesh:
Year: 2012 PMID: 22396732 PMCID: PMC3291573 DOI: 10.1371/journal.pone.0031333
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overview of the publically available tumor data sets, used in this study.
| Tumor entity | Copy number data | Gene expression data | Combined CN and GE data | Gene mutational data |
|
| 22 samples | 15 samples | 12 samples | 10 primary breast ductal adenocarcinomas |
| GSE22840 (GSE22839) | GSE22840 (GSE22544) | ±18000 genes sequenced | ||
| Sjöblom | ||||
|
| 38 samples | 19 samples | 19 samples | 11 liver metastases from colorectal carcinomas |
| GSE17047 | GSE17047 | ±18000 genes sequenced | ||
| Sjöblom | ||||
|
| 9 samples | 9 samples | 9 samples | 7 ovarian clear cell carcinomas |
| GSE19539 | GSE19539 | ±18000 genes sequenced | ||
| Jones | ||||
|
| 15 samples | 15 samples | 15 samples | 19 primary glioblastoma multiforme samples |
| GSE10878 | GSE10878 | ±20000 genes sequenced | ||
| Parsons | ||||
|
| 30 samples | 36 samples | / | 24 primary/metastases pancreas adenocarcinomas |
| GSE7599 | GSE15471 | ±20000 genes sequenced | ||
| Jones | ||||
|
| 60 samples | 77 samples | / | 14 pediatric classic medulloblastomas |
| GSE8634 | GSE21140 | ±21000 genes sequenced | ||
| Parsons |
one sample was excluded from this study due to a hypermutated profile caused by chemotherapeutic treatment.
CN: copy number; GE: gene expression.
Figure 1The number of mutated genes in relation to a certain number of top-ranked genes.
Mutation plots showing the amount of genes that need to be sequenced (y-axis) in order to find a certain number of mutated genes (depicted on the x-axis), for the six different tumor types. A: colon cancer; B: pancreas cancer; C: breast cancer; D: ovarian cancer: E: glioblastoma; F: medulloblastoma.
Figure 2PPV plot of the fitSNP strategy for the combined tumor entities.
A PPV plot for the fitSNP strategy, performed on the mutation data of all combined tumor entities, in function of different prioritization value cut-offs.