Literature DB >> 21029846

Genome simulation approaches for synthesizing in silico datasets for human genomics.

Marylyn D Ritchie1, William S Bush.   

Abstract

Simulated data is a necessary first step in the evaluation of new analytic methods because in simulated data the true effects are known. To successfully develop novel statistical and computational methods for genetic analysis, it is vital to simulate datasets consisting of single nucleotide polymorphisms (SNPs) spread throughout the genome at a density similar to that observed by new high-throughput molecular genomics studies. In addition, the simulation of environmental data and effects will be essential to properly formulate risk models for complex disorders. Data simulations are often criticized because they are much less noisy than natural biological data, as it is nearly impossible to simulate the multitude of possible sources of natural and experimental variability. However, simulating data in silico is the most straightforward way to test the true potential of new methods during development. Thus, advances that increase the complexity of data simulations will permit investigators to better assess new analytical methods. In this work, we will briefly describe some of the current approaches for the simulation of human genomics data describing the advantages and disadvantages of the various approaches. We will also include details on software packages available for data simulation. Finally, we will expand upon one particular approach for the creation of complex, human genomic datasets that uses a forward-time population simulation algorithm: genomeSIMLA. Many of the hallmark features of biological datasets can be synthesized in silico; still much research is needed to enhance our capabilities to create datasets that capture the natural complexity of biological datasets.
Copyright © 2010 Elsevier Inc. All rights reserved.

Entities:  

Mesh:

Year:  2010        PMID: 21029846     DOI: 10.1016/B978-0-12-380862-2.00001-1

Source DB:  PubMed          Journal:  Adv Genet        ISSN: 0065-2660            Impact factor:   1.944


  10 in total

1.  Next generation analytic tools for large scale genetic epidemiology studies of complex diseases.

Authors:  Leah E Mechanic; Huann-Sheng Chen; Christopher I Amos; Nilanjan Chatterjee; Nancy J Cox; Rao L Divi; Ruzong Fan; Emily L Harris; Kevin Jacobs; Peter Kraft; Suzanne M Leal; Kimberly McAllister; Jason H Moore; Dina N Paltoo; Michael A Province; Erin M Ramos; Marylyn D Ritchie; Kathryn Roeder; Daniel J Schaid; Matthew Stephens; Duncan C Thomas; Clarice R Weinberg; John S Witte; Shunpu Zhang; Sebastian Zöllner; Eric J Feuer; Elizabeth M Gillanders
Journal:  Genet Epidemiol       Date:  2011-12-06       Impact factor: 2.135

Review 2.  Computer simulations: tools for population and evolutionary genetics.

Authors:  Sean Hoban; Giorgio Bertorelle; Oscar E Gaggiotti
Journal:  Nat Rev Genet       Date:  2012-01-10       Impact factor: 53.242

3.  Multi-scale genetic dynamic modelling I : an algorithm to compute generators.

Authors:  Markus Kirkilionis; Ulrich Janus; Luca Sbano
Journal:  Theory Biosci       Date:  2011-04-13       Impact factor: 1.919

4.  Genetic Simulation Resources and the GSR Certification Program.

Authors:  Bo Peng; Man Chong Leong; Huann-Sheng Chen; Melissa Rotunno; Katy R Brignole; John Clarke; Leah E Mechanic
Journal:  Bioinformatics       Date:  2019-02-15       Impact factor: 6.937

5.  Mendel: the Swiss army knife of genetic analysis programs.

Authors:  Kenneth Lange; Jeanette C Papp; Janet S Sinsheimer; Ram Sripracha; Hua Zhou; Eric M Sobel
Journal:  Bioinformatics       Date:  2013-04-22       Impact factor: 6.937

6.  A heuristic method for simulating open-data of arbitrary complexity that can be used to compare and evaluate machine learning methods.

Authors:  Jason H Moore; Maksim Shestov; Peter Schmitt; Randal S Olson
Journal:  Pac Symp Biocomput       Date:  2018

7.  Genetic simulation tools for post-genome wide association studies of complex diseases.

Authors:  Huann-Sheng Chen; Carolyn M Hutter; Leah E Mechanic; Elizabeth M Gillanders; Eric J Feuer; Christopher I Amos; Vineet Bafna; Elizabeth R Hauser; Ryan D Hernandez; Chun Li; David A Liberles; Kimberly McAllister; Jason H Moore; Dina N Paltoo; George J Papanicolaou; Bo Peng; Marylyn D Ritchie; Gabriel Rosenfeld; John S Witte
Journal:  Genet Epidemiol       Date:  2014-11-04       Impact factor: 2.135

8.  Genetic data simulators and their applications: an overview.

Authors:  Bo Peng; Huann-Sheng Chen; Leah E Mechanic; Ben Racine; John Clarke; Elizabeth Gillanders; Eric J Feuer
Journal:  Genet Epidemiol       Date:  2014-12-13       Impact factor: 2.135

9.  Heuristic identification of biological architectures for simulating complex hierarchical genetic interactions.

Authors:  Jason H Moore; Ryan Amos; Jeff Kiralis; Peter C Andrews
Journal:  Genet Epidemiol       Date:  2014-11-13       Impact factor: 2.135

10.  SANTA-SIM: simulating viral sequence evolution dynamics under selection and recombination.

Authors:  Abbas Jariani; Christopher Warth; Koen Deforche; Pieter Libin; Alexei J Drummond; Andrew Rambaut; Frederick A Matsen Iv; Kristof Theys
Journal:  Virus Evol       Date:  2019-03-08
  10 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.