Literature DB >> 27164567

IntSIM: An Integrated Simulator of Next-Generation Sequencing Data.

Xiguo Yuan, Junying Zhang, Liying Yang.   

Abstract

OBJECTIVE: Next-generation sequencing data has been widely used for DNA variant discovery and tumor study through computational tools. Effective simulation of such data with many realistic features is very necessary for testing existing tools and guiding the development of new tools.
METHODS: We present an integrated simulation system, IntSIM, to simulate common DNA variants and to generate sequencing reads for mixture genomes. IntSIM has three novel features in comparison with other simulation programs: 1) it is able to simulate both germline and somatic variants in the same sequence, 2) it deals with tumor purity so as to generate reads corresponding to heterogeneous genomes and also produce tumor-normal matched samples, and 3) it simulates correlations among SNPs, among CNVs/CNAs based on HMM models trained from real sequencing genomes, and can simulates broad and focal CNV/CNA events.
RESULTS: The simulation data of IntSIM can reflect characteristics observed from real data and are consistent with input parameters. The IntSIM software package is freely available at http://intsim.sourceforge.net/.
CONCLUSION: Based on a great number of experiments, IntSIM performs better than other program for some scenarios, such as simulation of heterozygous SNPs, CNVs/CNAs, and can achieve some functions that other programs cannot achieve. SIGNIFICANCE: Simulation with IntSIM can be expected to evaluate performance of methods in detecting various types of variants, analyzing tumor samples, and especially providing a realistic assessment of effect of tumor purity on identification of somatic mutations.

Entities:  

Mesh:

Year:  2016        PMID: 27164567     DOI: 10.1109/TBME.2016.2560939

Source DB:  PubMed          Journal:  IEEE Trans Biomed Eng        ISSN: 0018-9294            Impact factor:   4.538


  12 in total

1.  SM-RCNV: a statistical method to detect recurrent copy number variations in sequenced samples.

Authors:  Yaoyao Li; Xiguo Yuan; Junying Zhang; Liying Yang; Jun Bai; Shan Jiang
Journal:  Genes Genomics       Date:  2019-02-18       Impact factor: 1.839

2.  A novel machine learning approach (svmSomatic) to distinguish somatic and germline mutations using next-generation sequencing data.

Authors:  Yu-Fang Mao; Xi-Guo Yuan; Yu-Peng Cun
Journal:  Zool Res       Date:  2021-03-18

Review 3.  A broad survey of DNA sequence data simulation tools.

Authors:  Shatha Alosaimi; Armand Bandiang; Noelle van Biljon; Denis Awany; Prisca K Thami; Milaine S S Tchamga; Anmol Kiran; Olfa Messaoud; Radia Ismaeel Mohammed Hassan; Jacquiline Mugo; Azza Ahmed; Christian D Bope; Imane Allali; Gaston K Mazandu; Nicola J Mulder; Emile R Chimusa
Journal:  Brief Funct Genomics       Date:  2020-01-22       Impact factor: 4.241

4.  SVEngine: an efficient and versatile simulator of genome structural variations with features of cancer clonal evolution.

Authors:  Li Charlie Xia; Dongmei Ai; Hojoon Lee; Noemi Andor; Chao Li; Nancy R Zhang; Hanlee P Ji
Journal:  Gigascience       Date:  2018-07-01       Impact factor: 6.524

5.  Pysim-sv: a package for simulating structural variation data with GC-biases.

Authors:  Yuchao Xia; Yun Liu; Minghua Deng; Ruibin Xi
Journal:  BMC Bioinformatics       Date:  2017-03-14       Impact factor: 3.169

6.  Accurate Inference of Tumor Purity and Absolute Copy Numbers From High-Throughput Sequencing Data.

Authors:  Xiguo Yuan; Zhe Li; Haiyong Zhao; Jun Bai; Junying Zhang
Journal:  Front Genet       Date:  2020-04-30       Impact factor: 4.599

7.  RKDOSCNV: A Local Kernel Density-Based Approach to the Detection of Copy Number Variations by Using Next-Generation Sequencing Data.

Authors:  Guojun Liu; Junying Zhang; Xiguo Yuan; Chao Wei
Journal:  Front Genet       Date:  2020-11-04       Impact factor: 4.599

8.  Next-generation forward genetic screens: using simulated data to improve the design of mapping-by-sequencing experiments in Arabidopsis.

Authors:  David Wilson-Sánchez; Samuel Daniel Lup; Raquel Sarmiento-Mañús; María Rosa Ponce; José Luis Micol
Journal:  Nucleic Acids Res       Date:  2019-12-02       Impact factor: 16.971

9.  UMI-Gen: A UMI-based read simulator for variant calling evaluation in paired-end sequencing NGS libraries.

Authors:  Vincent Sater; Pierre-Julien Viailly; Thierry Lecroq; Philippe Ruminy; Caroline Bérard; Élise Prieur-Gaston; Fabrice Jardin
Journal:  Comput Struct Biotechnol J       Date:  2020-08-27       Impact factor: 7.271

10.  CIRCNV: Detection of CNVs Based on a Circular Profile of Read Depth from Sequencing Data.

Authors:  Hai-Yong Zhao; Qi Li; Ye Tian; Yue-Hui Chen; Haque A K Alvi; Xi-Guo Yuan
Journal:  Biology (Basel)       Date:  2021-06-25
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.