Literature DB >> 21212684

Simulating sequences of the human genome with rare variants.

Bo Peng1, Xiaoming Liu.   

Abstract

OBJECTIVE: Simulated samples have been widely used in the development of efficient statistical methods identifying genetic variants that predispose to human genetic diseases. Although it is well known that natural selection has a strong influence on the number and diversity of rare genetic variations in human populations, existing simulation methods are limited in their ability to simulate multi-locus selection models with realistic distributions of the random fitness effects of newly arising mutants.
METHODS: We developed a computer program to simulate large populations of gene sequences using a forward-time simulation approach. This program is capable of simulating several multi-locus fitness schemes with arbitrary diploid single-locus selection models with random or locus-specific fitness effects. Arbitrary quantitative trait or disease models can be applied to the simulated populations from which individual- or family-based samples can be drawn and analyzed.
RESULTS: Using realistic demographic and natural selection models estimated from empirical sequence data, datasets simulated using our method differ significantly in the number and diversity of rare variants from datasets simulated using existing methods that ignore natural selection. Our program thus provides a useful tool to simulate datasets with realistic distributions of rare genetic variants for the study of genetic diseases caused by such variants.
Copyright © 2011 S. Karger AG, Basel.

Entities:  

Mesh:

Year:  2011        PMID: 21212684      PMCID: PMC3164177          DOI: 10.1159/000323316

Source DB:  PubMed          Journal:  Hum Hered        ISSN: 0001-5652            Impact factor:   0.444


  26 in total

1.  Pooled association tests for rare variants in exon-resequencing studies.

Authors:  Alkes L Price; Gregory V Kryukov; Paul I W de Bakker; Shaun M Purcell; Jeff Staples; Lee-Jen Wei; Shamil R Sunyaev
Journal:  Am J Hum Genet       Date:  2010-05-13       Impact factor: 11.025

2.  A haplotype map of the human genome.

Authors: 
Journal:  Nature       Date:  2005-10-27       Impact factor: 49.962

3.  simuPOP: a forward-time population genetics simulation environment.

Authors:  Bo Peng; Marek Kimmel
Journal:  Bioinformatics       Date:  2005-07-14       Impact factor: 6.937

4.  Sequence-level population simulations over large genomic regions.

Authors:  Clive J Hoggart; Marc Chadeau-Hyam; Taane G Clark; Riccardo Lampariello; John C Whittaker; Maria De Iorio; David J Balding
Journal:  Genetics       Date:  2007-10-18       Impact factor: 4.562

5.  Power of deep, all-exon resequencing for discovery of human trait genes.

Authors:  Gregory V Kryukov; Alexander Shpunt; John A Stamatoyannopoulos; Shamil R Sunyaev
Journal:  Proc Natl Acad Sci U S A       Date:  2009-02-06       Impact factor: 11.205

Review 6.  Common and rare variants in multifactorial susceptibility to common diseases.

Authors:  Walter Bodmer; Carolina Bonilla
Journal:  Nat Genet       Date:  2008-06       Impact factor: 38.330

7.  Forward-time simulations of human populations with complex diseases.

Authors:  Bo Peng; Christopher I Amos; Marek Kimmel
Journal:  PLoS Genet       Date:  2007-02-15       Impact factor: 5.917

8.  Simulation of genomes: a review.

Authors:  Antonio Carvajal-Rodríguez
Journal:  Curr Genomics       Date:  2008-05       Impact factor: 2.236

9.  Fregene: simulation of realistic sequence-level data in populations and ascertained samples.

Authors:  Marc Chadeau-Hyam; Clive J Hoggart; Paul F O'Reilly; John C Whittaker; Maria De Iorio; David J Balding
Journal:  BMC Bioinformatics       Date:  2008-09-08       Impact factor: 3.169

10.  Assessing the evolutionary impact of amino acid mutations in the human genome.

Authors:  Adam R Boyko; Scott H Williamson; Amit R Indap; Jeremiah D Degenhardt; Ryan D Hernandez; Kirk E Lohmueller; Mark D Adams; Steffen Schmidt; John J Sninsky; Shamil R Sunyaev; Thomas J White; Rasmus Nielsen; Andrew G Clark; Carlos D Bustamante
Journal:  PLoS Genet       Date:  2008-05-30       Impact factor: 5.917

View more
  12 in total

1.  Generation of sequence-based data for pedigree-segregating Mendelian or Complex traits.

Authors:  Biao Li; Gao T Wang; Suzanne M Leal
Journal:  Bioinformatics       Date:  2015-07-14       Impact factor: 6.937

2.  Power and sample size calculations for high-throughput sequencing-based experiments.

Authors:  Chung-I Li; David C Samuels; Ying-Yong Zhao; Yu Shyr; Yan Guo
Journal:  Brief Bioinform       Date:  2018-11-27       Impact factor: 11.622

3.  SimRare: a program to generate and analyze sequence-based data for association studies of quantitative and qualitative traits.

Authors:  Biao Li; Gao Wang; Suzanne M Leal
Journal:  Bioinformatics       Date:  2012-08-22       Impact factor: 6.937

4.  A C++ template library for efficient forward-time population genetic simulation of large populations.

Authors:  Kevin R Thornton
Journal:  Genetics       Date:  2014-06-20       Impact factor: 4.562

5.  Power analysis and sample size estimation for sequence-based association studies.

Authors:  Gao T Wang; Biao Li; Regie P Lyn Santos-Cortez; Bo Peng; Suzanne M Leal
Journal:  Bioinformatics       Date:  2014-04-28       Impact factor: 6.937

6.  Genetic data simulators and their applications: an overview.

Authors:  Bo Peng; Huann-Sheng Chen; Leah E Mechanic; Ben Racine; John Clarke; Elizabeth Gillanders; Eric J Feuer
Journal:  Genet Epidemiol       Date:  2014-12-13       Impact factor: 2.135

7.  A fast and noise-resilient approach to detect rare-variant associations with deep sequencing data for complex disorders.

Authors:  Yee Him Cheung; Gao Wang; Suzanne M Leal; Shuang Wang
Journal:  Genet Epidemiol       Date:  2012-08-03       Impact factor: 2.135

8.  On Sample Size and Power Calculation for Variant Set-Based Association Tests.

Authors:  Baolin Wu; James S Pankow
Journal:  Ann Hum Genet       Date:  2016-02-01       Impact factor: 1.670

9.  Properties and modeling of GWAS when complex disease risk is due to non-complementing, deleterious mutations in genes of large effect.

Authors:  Kevin R Thornton; Andrew J Foran; Anthony D Long
Journal:  PLoS Genet       Date:  2013-02-21       Impact factor: 5.917

10.  SeqSIMLA: a sequence and phenotype simulation tool for complex disease studies.

Authors:  Ren-Hua Chung; Chung-Chin Shih
Journal:  BMC Bioinformatics       Date:  2013-06-20       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.