| Literature DB >> 33176676 |
Junhua Rao1, Lihua Peng2, Xinming Liang1, Hui Jiang1, Chunyu Geng1, Xia Zhao1, Xin Liu2,3,4,5, Guangyi Fan3,5, Fang Chen6,7,8, Feng Mu9,10.
Abstract
BACKGROUND: DNBSEQ™ platforms are new massively parallel sequencing (MPS) platforms that use DNA nanoball technology. Use of data generated from DNBSEQ™ platforms to detect single nucleotide variants (SNVs) and small insertions and deletions (indels) has proven to be quite effective, while the feasibility of copy number variants (CNVs) detection is unclear.Entities:
Keywords: Benchmark; Copy number variant (CNV); DNBSEQ; Whole-genome sequencing (WGS)
Mesh:
Year: 2020 PMID: 33176676 PMCID: PMC7659224 DOI: 10.1186/s12859-020-03859-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Statistics of CNV number and overall length of all 50 CNV sets
| Dataset | Number | Overall length (Mb) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| BreakDancer | CNVnator | Pindel | DELLY | LUMPY | BreakDancer | CNVnator | Pindel | DELLY | LUMPY | |
| BGISEQ-500_PE100 | 2329 | 5778 | 4924 | 1904 | 2378 | 6.62 | 27.07 | 12.03 | 23.38 | 18.10 |
| BGISEQ-500_PE150-1 | 1646 | 3617 | 4463 | 1624 | 1422 | 7.42 | 30.10 | 9.21 | 22.43 | 17.64 |
| BGISEQ-500_PE150-2 | 2265 | 4241 | 4310 | 2209 | 2202 | 7.68 | 32.37 | 8.24 | 28.01 | 22.37 |
| DNBSEQ-G400_PE100 | 2160 | 4354 | 4688 | 1745 | 1889 | 7.41 | 26.74 | 9.03 | 23.13 | 18.25 |
| DNBSEQ-G400_PE150_PCR-1 | 1576 | 2234 | 4577 | 1902 | 1609 | 9.11 | 22.79 | 11.81 | 23.61 | 17.62 |
| DNBSEQ-G400_PE150_PCR-2 | 1600 | 2209 | 4689 | 1900 | 1533 | 7.59 | 22.68 | 11.21 | 23.93 | 20.15 |
| DNBSEQ-G400_PE150_PCRfree-1 | 1519 | 2890 | 4221 | 1705 | 1463 | 8.53 | 25.78 | 12.07 | 25.51 | 18.58 |
| DNBSEQ-G400_PE150_PCRfree-2 | 1523 | 2860 | 4172 | 1713 | 1505 | 8.20 | 25.43 | 9.84 | 22.30 | 18.42 |
| HiSeq2500_PE150 | 968 | 2660 | 4045 | 1709 | 1380 | 7.03 | 26.49 | 10.68 | 29.73 | 22.25 |
| NovaSeq6000_PE150 | 897 | 1229 | 5730 | 1773 | 1350 | 6.32 | 22.54 | 9.64 | 28.52 | 22.02 |
Fig. 1Heatmap of CNV results across the genome. The numbers and lengths of deletions and duplications detected on ten WGS datasets by five tools is profiled through 10 Mb bins across the whole genome. The colour bar on the left indicates the number and length panel, the deletion and duplication, five tools and two platforms. The 23 chromosomes are arranged linearly on the x-axis. (i) BreakDancer; (ii) CNVnator; (iii) Pindel; (iv) DELLY; (v) LUMPY
Fig. 2Evaluation of CNVs from 50 datasets on two benchmarks. Dot plot shows the sensitivity and precision of CNVs on ten datasets using five tools compared with those using Benchmark1 (left) and Benchmark2 (right). Ten datasets are marked with different colours, and five tools are marked with different point types. Benchmark1, data by Ryan et al. 2014; Benchmark2, data by Peter et al. 2015
Fig. 3Comparison of CNVs by platforms. Box plot shows the number (upper), precision based on Benchmark1 (median) and precision based on Benchmark2 (lower) of common and specific CNVs between platforms by different tools (column). Precision1, precision based on Benchmark1; Precision2, precision based on Benchmark2; DNBSEQ, specific CNVs on DNBSEQ™ platforms; Illumina, specific CNVs on Illumina platforms; Common, common CNVs between the DNBSEQ™ and Illumina platforms
Fig. 4Precision of CNVs with different length ranges. Histogram shows the precision of CNVs detected by different tools (column) based on different benchmarks (row). Precision for CNVs is shown with different colour bars according to length ranges. Precision1, precision based on Benchmark1; Precision2, precision based on Benchmark2