| Literature DB >> 24303503 |
Yan Guo1, Quanghu Sheng, David C Samuels, Brian Lehmann, Joshua A Bauer, Jennifer Pietenpol, Yu Shyr.
Abstract
Exome sequencing using next-generation sequencing technologies is a cost-efficient approach to selectively sequencing coding regions of the human genome for detection of disease variants. One of the lesser known yet important applications of exome sequencing data is to identify copy number variation (CNV). There have been many exome CNV tools developed over the last few years, but the performance and accuracy of these programs have not been thoroughly evaluated. In this study, we systematically compared four popular exome CNV tools (CoNIFER, cn.MOPS, exomeCopy, and ExomeDepth) and evaluated their effectiveness against array comparative genome hybridization (array CGH) platforms. We found that exome CNV tools are capable of identifying CNVs, but they can have problems such as high false positives, low sensitivity, and duplication bias when compared to array CGH platforms. While exome CNV tools do serve their purpose for data mining, careful evaluation and additional validation is highly recommended. Based on all these results, we recommend CoNIFER and cn.MOPs for nonpaired exome CNV detection over the other two tools due to a low false-positive rate, although none of the four exome CNV tools performed at an outstanding level when compared to array CGH.Entities:
Mesh:
Year: 2013 PMID: 24303503 PMCID: PMC3835197 DOI: 10.1155/2013/915636
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Overview of the CNVs detected by array CGH and four algorithms. (a) Barplot of the duplication and deletion CNVs detected by five methods. (b) Boxplot of the CNV length detected by five methods.
Figure 2Barplot of duplication and deletion CNVs detected from each sample by five methods. The P value beside each method name was calculated by paired Wilcoxon signed rank tests following FDR correction. It indicated the detection bias between duplication and deletion CNVs of that method. Array CGH and exomeCopy showed unbiased duplication and deletion while CoNIFER had strong bias toward duplication. Cn.MOPS and ExomeDepth showed marginal bias toward duplication.
Kullback-Leibler test on similarity with array CGH.
| aCGH | cn.MOPS | exomeCopy | ExomeDepth | CoNIFER | |
|---|---|---|---|---|---|
| Deletion CNVs proportion similarity | |||||
| aCGH | 0 | 0.14 | 0.16 | 0.17 | 2.24 |
| cn.MOPS | 0.15 | 0 | 0.22 | 0.24 | 1.84 |
| exomeCopy | 0.16 | 0.23 | 0 | 0.2 | 1.88 |
| ExomeDepth | 0.22 | 0.27 | 0.23 | 0 | 2.56 |
| CoNIFER | 0.8 | 0.97 | 0.87 | 0.72 | 0 |
|
| |||||
| Duplication CNVs proportion similarity | |||||
| aCGH | 0 | 0.4 | 0.4 | 0.47 | 0.66 |
| cn.MOPS | 0.36 | 0 | 0.42 | 0.66 | 0.61 |
| exomeCopy | 0.42 | 0.59 | 0 | 0.18 | 0.26 |
| ExomeDepth | 0.55 | 1.06 | 0.25 | 0 | 0.55 |
| CoNIFER | 0.77 | 0.58 | 0.26 | 0.42 | 0 |
Figure 3Specificity of four algorithms for CNV detection. CoNIFER identified many fewer CNVs but with a high true positive rate at deletion detection. ExomeDepth and ExomeCopy showed comparable specificity with CoNIFER on duplication detection but many more false positives on deletion detection. Cn.MOPS showed best specificity at duplication detection and second best specificity at deletion identified many more CNVs than CoNIFER. Overall, cn.MOPS achieved the highest specificity among all four algorithms.