Literature DB >> 28957497

A multi-scenario genome-wide medical population genetics simulation framework.

Jacquiline W Mugo1, Ephifania Geza1,2, Joel Defo1, Samar S M Elsheikh1, Gaston K Mazandu1,2, Nicola J Mulder1, Emile R Chimusa3.   

Abstract

MOTIVATION: Recent technological advances in high-throughput sequencing and genotyping have facilitated an improved understanding of genomic structure and disease-associated genetic factors. In this context, simulation models can play a critical role in revealing various evolutionary and demographic effects on genomic variation, enabling researchers to assess existing and design novel analytical approaches. Although various simulation frameworks have been suggested, they do not account for natural selection in admixture processes. Most are tailored to a single chromosome or a genomic region, very few capture large-scale genomic data, and most are not accessible for genomic communities.
RESULTS: Here we develop a multi-scenario genome-wide medical population genetics simulation framework called 'FractalSIM'. FractalSIM has the capability to accurately mimic and generate genome-wide data under various genetic models on genetic diversity, genomic variation affecting diseases and DNA sequence patterns of admixed and/or homogeneous populations. Moreover, the framework accounts for natural selection in both homogeneous and admixture processes. The outputs of FractalSIM have been assessed using popular tools, and the results demonstrated its capability to accurately mimic real scenarios. They can be used to evaluate the performance of a range of genomic tools from ancestry inference to genome-wide association studies.
AVAILABILITY AND IMPLEMENTATION: The FractalSIM package is available at http://www.cbio.uct.ac.za/FractalSIM. CONTACT: emile.chimusa@uct.ac.za. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

Entities:  

Mesh:

Year:  2017        PMID: 28957497      PMCID: PMC5870573          DOI: 10.1093/bioinformatics/btx369

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  16 in total

1.  Modeling linkage disequilibrium and identifying recombination hotspots using single-nucleotide polymorphism data.

Authors:  Na Li; Matthew Stephens
Journal:  Genetics       Date:  2003-12       Impact factor: 4.562

2.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

3.  SIMCOAL 2.0: a program to simulate genomic diversity over large recombining regions in a subdivided population with a complex history.

Authors:  Guillaume Laval; Laurent Excoffier
Journal:  Bioinformatics       Date:  2004-04-29       Impact factor: 6.937

Review 4.  An overview of population genetic data simulation.

Authors:  Xiguo Yuan; David J Miller; Junying Zhang; David Herrington; Yue Wang
Journal:  J Comput Biol       Date:  2011-12-09       Impact factor: 1.479

5.  Principal components analysis corrects for stratification in genome-wide association studies.

Authors:  Alkes L Price; Nick J Patterson; Robert M Plenge; Michael E Weinblatt; Nancy A Shadick; David Reich
Journal:  Nat Genet       Date:  2006-07-23       Impact factor: 38.330

6.  HAPGEN2: simulation of multiple disease SNPs.

Authors:  Zhan Su; Jonathan Marchini; Peter Donnelly
Journal:  Bioinformatics       Date:  2011-06-08       Impact factor: 6.937

Review 7.  A comparison of tools for the simulation of genomic next-generation sequencing data.

Authors:  Merly Escalona; Sara Rocha; David Posada
Journal:  Nat Rev Genet       Date:  2016-06-20       Impact factor: 53.242

8.  Inference of locus-specific ancestry in closely related populations.

Authors:  Bogdan Pasaniuc; Sriram Sankararaman; Gad Kimmel; Eran Halperin
Journal:  Bioinformatics       Date:  2009-06-15       Impact factor: 6.937

9.  Designing genome-wide association studies: sample size, power, imputation, and the choice of genotyping chip.

Authors:  Chris C A Spencer; Zhan Su; Peter Donnelly; Jonathan Marchini
Journal:  PLoS Genet       Date:  2009-05-15       Impact factor: 5.917

10.  Determining ancestry proportions in complex admixture scenarios in South Africa using a novel proxy ancestry selection method.

Authors:  Emile R Chimusa; Michelle Daya; Marlo Möller; Raj Ramesar; Brenna M Henn; Paul D van Helden; Nicola J Mulder; Eileen G Hoal
Journal:  PLoS One       Date:  2013-09-16       Impact factor: 3.240

View more
  1 in total

1.  Identifying novel associations in GWAS by hierarchical Bayesian latent variable detection of differentially misclassified phenotypes.

Authors:  Afrah Shafquat; Ronald G Crystal; Jason G Mezey
Journal:  BMC Bioinformatics       Date:  2020-05-07       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.