| Literature DB >> 24240121 |
Biao Liu1, Carl D Morrison, Candace S Johnson, Donald L Trump, Maochun Qin, Jeffrey C Conroy, Jianmin Wang, Song Liu.
Abstract
Accurate detection of somatic copy number variations (CNVs) is an essential part of cancer genome analysis, and plays an important role in oncotarget identifications. Next generation sequencing (NGS) holds the promise to revolutionize somatic CNV detection. In this review, we provide an overview of current analytic tools used for CNV detection in NGS-based cancer studies. We summarize the NGS data types used for CNV detection, decipher the principles for data preprocessing, segmentation, and interpretation, and discuss the challenges in somatic CNV detection. This review aims to provide a guide to the analytic tools used in NGS-based cancer CNV studies, and to discuss the important factors that researchers need to consider when analyzing NGS data for somatic CNV detections.Entities:
Mesh:
Year: 2013 PMID: 24240121 PMCID: PMC3875755 DOI: 10.18632/oncotarget.1537
Source DB: PubMed Journal: Oncotarget ISSN: 1949-2553
Available programs for detecting copy number variation in cancer genome using next generation sequencing data
| Platform | Program | Website | Ref. | Year | Language |
|---|---|---|---|---|---|
| WGS | SegSeq | [ | 2009 | MATLAB | |
| ReadDepth | [ | 2011 | R | ||
| BIC-seq | [ | 2011 | Perl/R | ||
| Patchwork | [ | 2013 | R | ||
| OncoSNP-SEQ | [ | 2013 | MATLAB | ||
| HMMcopy | / | / | R | ||
| CONSERTING | / | / | R | ||
| WES | ExomeCNV | [ | 2011 | R | |
| VarScan2 | [ | 2012 | Java | ||
| HAPSEG/ABSOLUTE | [ | 2012 | R | ||
| WGS&WES | Control_FREEC | [ | 2011 | C |
Figure 1The workflow chart that computational methods fall in for calling somatic copy number variations from next generation sequencing data
Major features of programs for detecting copy number variation in cancer genome using next generation sequencing data
| Programs | Data type | Data preprocessing | Segmentation | Interpretation | Sample information |
|---|---|---|---|---|---|
| SegSeq | RC | Matched normal; | Local change-point analysis with a subsequent merging procedure | Optimized cutoffs | / |
| ReadDepth | RD | Mappability correction; | CBS | Optimized cutoffs | / |
| BIC-seq | RD | Matched normal; | Minimizing BIC | Empirical cutoffs | / |
| Patchwork | RD | Normal genome; | CBS | Pattern Recognition and empirical cutoffs | Tumor purity |
| OncoSNP-SEQ | RC | Matched normal; | HMM | HMM | Tumor purity |
| HMMcopy | RC | Matched normal; | HMM | HMM | / |
| CONSERTING | RD | Matched normal; | Regression Tree | Empirical cutoff | / |
| ExomeCNV | RD | Matched normal | CBS | Optimized cutoff | Fixed tumor purity |
| VarScan | RD | Matched normal | CBS | Empirical cutoff | / |
| HAPSEG/ABSOLUTE | RD at SNP loci | Matched normal | probabilistic method | Pattern Matching and fit platform error model | Tumor purity |
| Control_FREEC | RC | Matched normal and/or GC and Mappability correction; | LASSO algorithm | Empirical cutoff | Tumor purity |
Abbreviations: RC, Read Counts; RD, Read Depth; BAF, B Allele Frequency; SNP, single nucleotide polymorphism; CBS, circular binary segmentation; HMM, hidden Markov model.
ExomeCNV uses only RD for calling CNV; it uses BAF for calling LOH.
The data is assumed to be in normal distribution if not specified.
Figure 2Diagram of detecting somatic CNV from sequencing data
(A) A normal human genome usually has two copies of its chromosomes (each copy or homologue from either parents), and contains loci with different genotypes (AA, AB, BA, and BB for loci 1-4, respectively). (B) A somatic CNV event (tandem duplication here) alters copy number of some genomic regions. (C) Pileup view of mapped reads. Altered relative read depth or read counts can be observed. Depending on copy numbers of two homologues in tumor genome, shifted B allele frequency might be observed at heterozygous loci (see the table. CN, copy number; BAF, B allele frequency).