Literature DB >> 33676403

SomatoSim: precision simulation of somatic single nucleotide variants.

Marwan A Hawari1, Celine S Hong2, Leslie G Biesecker1.   

Abstract

BACKGROUND: Somatic single nucleotide variants have gained increased attention because of their role in cancer development and the widespread use of high-throughput sequencing techniques. The necessity to accurately identify these variants in sequencing data has led to a proliferation of somatic variant calling tools. Additionally, the use of simulated data to assess the performance of these tools has become common practice, as there is no gold standard dataset for benchmarking performance. However, many existing somatic variant simulation tools are limited because they rely on generating entirely synthetic reads derived from a reference genome or because they do not allow for the precise customizability that would enable a more focused understanding of single nucleotide variant calling performance.
RESULTS: SomatoSim is a tool that lets users simulate somatic single nucleotide variants in sequence alignment map (SAM/BAM) files with full control of the specific variant positions, number of variants, variant allele fractions, depth of coverage, read quality, and base quality, among other parameters. SomatoSim accomplishes this through a three-stage process: variant selection, where candidate positions are selected for simulation, variant simulation, where reads are selected and mutated, and variant evaluation, where SomatoSim summarizes the simulation results.
CONCLUSIONS: SomatoSim is a user-friendly tool that offers a high level of customizability for simulating somatic single nucleotide variants. SomatoSim is available at https://github.com/BieseckerLab/SomatoSim .

Entities:  

Keywords:  Mosaicism; SNV; Simulation; Single nucleotide variants; Somatic variants

Mesh:

Substances:

Year:  2021        PMID: 33676403      PMCID: PMC7936459          DOI: 10.1186/s12859-021-04024-8

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  18 in total

1.  Extensive sequencing of seven human genomes to characterize benchmark reference materials.

Authors:  Justin M Zook; David Catoe; Jennifer McDaniel; Lindsay Vang; Noah Spies; Arend Sidow; Ziming Weng; Yuling Liu; Christopher E Mason; Noah Alexander; Elizabeth Henaff; Alexa B R McIntyre; Dhruva Chandramohan; Feng Chen; Erich Jaeger; Ali Moshrefi; Khoa Pham; William Stedman; Tiffany Liang; Michael Saghbini; Zeljko Dzakula; Alex Hastie; Han Cao; Gintaras Deikus; Eric Schadt; Robert Sebra; Ali Bashir; Rebecca M Truty; Christopher C Chang; Natali Gulbahce; Keyan Zhao; Srinka Ghosh; Fiona Hyland; Yutao Fu; Mark Chaisson; Chunlin Xiao; Jonathan Trow; Stephen T Sherry; Alexander W Zaranek; Madeleine Ball; Jason Bobe; Preston Estep; George M Church; Patrick Marks; Sofia Kyriazopoulou-Panagiotopoulou; Grace X Y Zheng; Michael Schnall-Levin; Heather S Ordonez; Patrice A Mudivarti; Kristina Giorda; Ying Sheng; Karoline Bjarnesdatter Rypdal; Marc Salit
Journal:  Sci Data       Date:  2016-06-07       Impact factor: 6.444

Review 2.  A genomic view of mosaicism and human disease.

Authors:  Leslie G Biesecker; Nancy B Spinner
Journal:  Nat Rev Genet       Date:  2013-05       Impact factor: 53.242

3.  SInC: an accurate and fast error-model based simulator for SNPs, Indels and CNVs coupled with a read generator for short-read sequence data.

Authors:  Swetansu Pattnaik; Saurabh Gupta; Arjun A Rao; Binay Panda
Journal:  BMC Bioinformatics       Date:  2014-02-05       Impact factor: 3.169

4.  Combining tumor genome simulation with crowdsourcing to benchmark somatic single-nucleotide-variant detection.

Authors:  Adam D Ewing; Kathleen E Houlahan; Yin Hu; Kyle Ellrott; Cristian Caloian; Takafumi N Yamaguchi; J Christopher Bare; Christine P'ng; Daryl Waggott; Veronica Y Sabelnykova; Michael R Kellen; Thea C Norman; David Haussler; Stephen H Friend; Gustavo Stolovitzky; Adam A Margolin; Joshua M Stuart; Paul C Boutros
Journal:  Nat Methods       Date:  2015-05-18       Impact factor: 28.547

5.  Xome-Blender: A novel cancer genome simulator.

Authors:  Roberto Semeraro; Valerio Orlandini; Alberto Magi
Journal:  PLoS One       Date:  2018-04-05       Impact factor: 3.240

6.  Effect of read-mapping biases on detecting allele-specific expression from RNA-sequencing data.

Authors:  Jacob F Degner; John C Marioni; Athma A Pai; Joseph K Pickrell; Everlyne Nkadori; Yoav Gilad; Jonathan K Pritchard
Journal:  Bioinformatics       Date:  2009-10-06       Impact factor: 6.937

7.  LoFreq: a sequence-quality aware, ultra-sensitive variant caller for uncovering cell-population heterogeneity from high-throughput sequencing datasets.

Authors:  Andreas Wilm; Pauline Poh Kim Aw; Denis Bertrand; Grace Hui Ting Yeo; Swee Hoe Ong; Chang Hua Wong; Chiea Chuen Khor; Rosemary Petric; Martin Lloyd Hibberd; Niranjan Nagarajan
Journal:  Nucleic Acids Res       Date:  2012-10-12       Impact factor: 16.971

Review 8.  A review of somatic single nucleotide variant calling algorithms for next-generation sequencing data.

Authors:  Chang Xu
Journal:  Comput Struct Biotechnol J       Date:  2018-02-06       Impact factor: 7.271

9.  GENCODE reference annotation for the human and mouse genomes.

Authors:  Adam Frankish; Mark Diekhans; Anne-Maud Ferreira; Rory Johnson; Irwin Jungreis; Jane Loveland; Jonathan M Mudge; Cristina Sisu; James Wright; Joel Armstrong; If Barnes; Andrew Berry; Alexandra Bignell; Silvia Carbonell Sala; Jacqueline Chrast; Fiona Cunningham; Tomás Di Domenico; Sarah Donaldson; Ian T Fiddes; Carlos García Girón; Jose Manuel Gonzalez; Tiago Grego; Matthew Hardy; Thibaut Hourlier; Toby Hunt; Osagie G Izuogu; Julien Lagarde; Fergal J Martin; Laura Martínez; Shamika Mohanan; Paul Muir; Fabio C P Navarro; Anne Parker; Baikang Pei; Fernando Pozo; Magali Ruffier; Bianca M Schmitt; Eloise Stapleton; Marie-Marthe Suner; Irina Sycheva; Barbara Uszczynska-Ratajczak; Jinuri Xu; Andrew Yates; Daniel Zerbino; Yan Zhang; Bronwen Aken; Jyoti S Choudhary; Mark Gerstein; Roderic Guigó; Tim J P Hubbard; Manolis Kellis; Benedict Paten; Alexandre Reymond; Michael L Tress; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

10.  COSMIC: the Catalogue Of Somatic Mutations In Cancer.

Authors:  John G Tate; Sally Bamford; Harry C Jubb; Zbyslaw Sondka; David M Beare; Nidhi Bindal; Harry Boutselakis; Charlotte G Cole; Celestino Creatore; Elisabeth Dawson; Peter Fish; Bhavana Harsha; Charlie Hathaway; Steve C Jupe; Chai Yin Kok; Kate Noble; Laura Ponting; Christopher C Ramshaw; Claire E Rye; Helen E Speedy; Ray Stefancsik; Sam L Thompson; Shicai Wang; Sari Ward; Peter J Campbell; Simon A Forbes
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  2 in total

1.  Low-level variant calling for non-matched samples using a position-based and nucleotide-specific approach.

Authors:  Jeffrey N Dudley; Celine S Hong; Marwan A Hawari; Jasmine Shwetar; Julie C Sapp; Justin Lack; Henoke Shiferaw; Jennifer J Johnston; Leslie G Biesecker
Journal:  BMC Bioinformatics       Date:  2021-04-08       Impact factor: 3.307

2.  MQuad enables clonal substructure discovery using single cell mitochondrial variants.

Authors:  Aaron Wing Cheung Kwok; Chen Qiao; Rongting Huang; Mai-Har Sham; Joshua W K Ho; Yuanhua Huang
Journal:  Nat Commun       Date:  2022-03-08       Impact factor: 14.919

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.