| Literature DB >> 25524895 |
John C Mu1, Marghoob Mohiyuddin2, Jian Li2, Narges Bani Asadi2, Mark B Gerstein2, Alexej Abyzov2, Wing H Wong1, Hugo Y K Lam2.
Abstract
SUMMARY: VarSim is a framework for assessing alignment and variant calling accuracy in high-throughput genome sequencing through simulation or real data. In contrast to simulating a random mutation spectrum, it synthesizes diploid genomes with germline and somatic mutations based on a realistic model. This model leverages information such as previously reported mutations to make the synthetic genomes biologically relevant. VarSim simulates and validates a wide range of variants, including single nucleotide variants, small indels and large structural variants. It is an automated, comprehensive compute framework supporting parallel computation and multiple read simulators. Furthermore, we developed a novel map data structure to validate read alignments, a strategy to compare variants binned in size ranges and a lightweight, interactive, graphical report to visualize validation results with detailed statistics. Thus far, it is the most comprehensive validation tool for secondary analysis in next generation sequencing.Entities:
Mesh:
Year: 2014 PMID: 25524895 PMCID: PMC4410653 DOI: 10.1093/bioinformatics/btu828
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.VarSim simulation and validation workflow. The germline workflow can be run with or without the somatic workflow
Fig. 2.Validation results for some popular secondary analysis tools