| Literature DB >> 35546390 |
Abstract
BACKGROUND: Single-cell DNA sequencing is getting indispensable in the study of cell-specific cancer genomics. The performance of computational tools that tackle single-cell genome aberrations may be nevertheless undervalued or overvalued, owing to the insufficient size of benchmarking data. In silicon simulation is a cost-effective approach to generate as many single-cell genomes as possible in a controlled manner to make reliable and valid benchmarking.Entities:
Keywords: Copy number variation; Simulation; Single-cell sequencing
Mesh:
Substances:
Year: 2022 PMID: 35546390 PMCID: PMC9092674 DOI: 10.1186/s12864-022-08566-w
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 4.547
Overview of existing scDNA-Seq simulators
| Tool | Journal | Supported Variants | Other Facilities | |||
|---|---|---|---|---|---|---|
| SNV | Indel | CNV | Cell Cluster | Breakpoints | ||
| CellCoal [ | - | - | - | - | ||
| SCSsim [ | - | - | ||||
| SCSIM [ | - | - | - | - | ||
| SingleCellCNABenchmark [ | - | - | - | - | ||
| - | ||||||
Fig. 1Illustration of SCSilicon framework
Fig. 2Visualization of simulated SNP. A Heatmap of random selected 100 SNPs across single-cells. 0 is reference allele, 1 is alternative allele. B IGV plot of SNP events. C IGV inspection of individual SNP events
Fig. 3Visualization of simulated CNV. A-B Heatmap of two randomly generated CNV configuration across single-cells. The column and row represents the genome region bin and single cell, respectively. The value of the heatmap indicates the copy number, with blue, white, and red stands for copy number less than, equal to, and larger than 2, respectively. C IGV inspection of CNV breakpoint
Fig. 4Benchmarking of single-cell CNV Caller for CNV dataset1. A-E CNV Heatmap of noisy ground-truth, AneuFinder, SCOPE, SCYN, and clean ground-truth, respectively. The column and row represents the genome region bin and single cell, respectively. The value of the heatmap indicates the copy number, with blue, white, and red stands for copy number less than, equal to, and larger than 2, respectively. F-H Scatter plot and Pearson correlation between clean ground-truth CNV and estimated CNV of AneuFinder, SCOPE, and SCYN, respectively. I-K The CNV calling accuracy on loss, neutral, and gain bins, respectively