| Literature DB >> 25102989 |
Pubudu Saneth Samarakoon, Hanne Sørmo Sorte, Bjørn Evert Kristiansen, Tove Skodje, Ying Sheng, Geir E Tjønnfjord, Barbro Stadheim, Asbjørg Stray-Pedersen, Olaug Kristin Rødningen, Robert Lyle1.
Abstract
BACKGROUND: With advances in next generation sequencing technologies and genomic capture techniques, exome sequencing has become a cost-effective approach for mutation detection in genetic diseases. However, computational prediction of copy number variants (CNVs) from exome sequence data is a challenging task. Whilst numerous programs are available, they have different sensitivities, and have low sensitivity to detect smaller CNVs (1-4 exons). Additionally, exonic CNV discovery using standard aCGH has limitations due to the low probe density over exonic regions. The goal of our study was to develop a protocol to detect exonic CNVs (including shorter CNVs that cover 1-4 exons), combining computational prediction algorithms and a high-resolution custom CGH array.Entities:
Mesh:
Year: 2014 PMID: 25102989 PMCID: PMC4132917 DOI: 10.1186/1471-2164-15-661
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Exome CNV prediction programs used in the study
| Program | Reference |
|---|---|
| ExomeCNV |
|
| CONTRA |
|
| ExomeCopy |
|
| ExomeDepth |
|
| CoNIFER |
|
| XHMM |
|
Figure 1Count and length distributions of CNVs predicted by the programs in the study. (a) Length distribution of CNVs predicted by each program. (b) Count (number of CNVs) distribution of CNVs predicted by each program.
Figure 2exCNVs predicted by programs and overlapping exCNVs predicted by program combinations. Each dot represents an individual exome. (a) Number of exonic duplications predicted by each program. (b) Number of exonic deletions predicted by each program. (c) Number of overlapping duplications predicted by each program combination. (d) Number of overlapping deletions predicted by each program combination. Program combinations (c and d): 1, ExomeCopy/ExCopyDepth; 2, ExomeCopy/ExCopyDepth/CONTRA; 3, ExomeCopy/ExCopyDepth/CoNIFER; 4, ExomeCopy/ExCopyDepth/XHMM; 5, ExomeCopy/ExCopyDepth/ExomeDepth; 6, ExomeCopy/ExCopyDepth/ExomeDepth/XHMM; 7, ExomeCopy/ExCopyDepth/CONTRA/XHMM; 8, ExomeCopy/ExCopyDepth/CoNIFER/XHMM.
True positive (TP)/false positive (FP) CNV ratio predicted from each program
| Program | AvgTP | TP/FP ratio | Total CNV count 3 | Average CNVs per sample 4 |
|---|---|---|---|---|
| CoNIFER | 1.33 (1.33)1 | 0.92 (1.09)2 | 23 | 2.56 |
| ExCopyDepth | 28.11 (15.86)1 | 0.34 (0.51)2 | 422 | 46.89 |
| ExomeCopy | 289.33 (226.0)1 | 0.20 (0.21)2 | 11978 | 1330.89 |
| ExomeDepth | 1.11 (0.78)1 | 0.67 (0.70)2 | 17 | 1.89 |
| Intersection of ExomeCopy / ExCopyDepth | 12.89 (7.33)1 | 0.28 (0.33)2 | 218 | 24.22 |
| XHMM | 4.44 (3.56)1 | 0.32 (0.53)2 | 92 | 10.22 |
| CONTRA | 17.88 (16.56)1 | 0.20 (0.20)2 | 896 | 99.56 |
TP/FP ratio for each program was calculated using CNVs identified from 9, 1000 genomes samples run in both computational programs and exaCGH; Average true positive (AvgTP) = TP/9; TP/FP ratio = TP CNV count /FP CNV count; Average CNVs per sample = Total CNV count/9.
1Average true positive calculated by excluding CNVs in X and Y Chromosomes.
2TP/FP ratio calculated by excluding CNVs in X and Y Chromosomes.
3Total number of CNVs predicted by each program excluding CNVs in X and Y Chromosomes (CNV counts for each program including X and Y Chromosomes are presented in Figure 1b).
4Average CNVs per sample calculated from counts presented in total CNV count column in Table 2.
True positive (TP), false positive (FP) and TP/FP ratio for short CNVs (1–4 exons)
| Program | 1000 genomes exomes | Primary immunodeficiency patients | ||||
|---|---|---|---|---|---|---|
| TP | FP | TP/FP ratio | TP | FP | TP/FP ratio | |
| CONIFER | 0 | 0 | 4 | 6 | 0.67 | |
| XHMM | 0 | 4 | 7 | 32 | 0.22 | |
| ExomeDepth | 0 | 2 | 50 | 301 | 0.17 | |
| ExomeCopy | 914 | 6591 | 0.14 | 161 | 1076 | 0.15 |
| ExCopyDepth | 63 | 267 | 0.24 | 52 | 669 | 0.08 |
| Intersection of ExomeCopy/ExCopyDepth | 34 | 76 | 0.45 | 44 | 478 | 0.09 |
TP, True positive CNVs (CNVs identified both exaCGH and computational programs); FP, False positive CNVs (CNVs identified only by computational programs but not by exaCGH); TP/FP ratio, Ratio between true positive and false positive CNVs.
Figure 3Relative cumulative frequency for true positive CNVs. In order to clearly highlight the proportion of short CNVs (with 1–4 exons) predicted by each program, relative cumulative frequency distributions were presented using only the TP CNVs with 0 to 20 exons.
Analysis of copy number state (CNS) predicted by different program combinations
| Program combinations | Number of genomic regions with different CNS | Number of genomic regions with same CNS | % of genomic regions with same CNS |
|---|---|---|---|
| ExomeCopy, ExCopyDepth | 0 | 9 | 100.00 |
| ExomeCopy, ExCopyDepth, ExomeDepth | 1 | 37 | 97.37 |
| ExomeCopy, ExCopyDepth, ExomeDepth, CoNIFER | 3 | 3 | 50.00 |
| ExomeCopy, ExCopyDepth, ExomeDepth, CoNIFER, XHMM | 0 | 4 | 100.00 |
| ExomeCopy, ExCopyDepth, ExomeDepth, XHMM | 1 | 9 | 90.00 |
| ExomeCopy, ExomeDepth | 0 | 4 | 100.00 |
| ExCopyDepth, ExomeDepth | 3 | 31 | 91.18 |
| ExCopyDepth, ExomeDepth, XHMM | 0 | 4 | 100.00 |
Figure 4Sensitivity versus false positive rate for CNV prediction. (a) ExomeCopy. (b) ExCopyDepth. (c) Intersection of ExomeCopy and ExCopyDepth (overlapping CNVs predicted by ExomeCopy and ExCopydepth). (d) ExomeDepth. (e) CoNIFER. (f) XHMM. Sensitivity = true positive CNVs/(true positive CNVs + false negative CNVs). False positive rate = false positive CNVs/(false positive CNVs + true positive CNVs).