| Literature DB >> 26040700 |
Alexis Dereeper1, Felix Homa2, Gwendoline Andres3, Guilhem Sempere4, Gautier Sarah2, Yann Hueber5, Jean-François Dufayard2, Manuel Ruiz6.
Abstract
SNiPlay is a web-based tool for detection, management and analysis of genetic variants including both single nucleotide polymorphisms (SNPs) and InDels. Version 3 now extends functionalities in order to easily manage and exploit SNPs derived from next generation sequencing technologies, such as GBS (genotyping by sequencing), WGRS (whole gre-sequencing) and RNA-Seq technologies. Based on the standard VCF (variant call format) format, the application offers an intuitive interface for filtering and comparing polymorphisms using user-defined sets of individuals and then establishing a reliable genotyping data matrix for further analyses. Namely, in addition to the various scaled-up analyses allowed by the application (genomic annotation of SNP, diversity analysis, haplotype reconstruction and network, linkage disequilibrium), SNiPlay3 proposes new modules for GWAS (genome-wide association studies), population stratification, distance tree analysis and visualization of SNP density. Additionally, we developed a suite of Galaxy wrappers for each step of the SNiPlay3 process, so that the complete pipeline can also be deployed on a Galaxy instance using the Galaxy ToolShed procedure and then be computed as a Galaxy workflow. SNiPlay is accessible at http://sniplay.southgreen.fr.Entities:
Mesh:
Year: 2015 PMID: 26040700 PMCID: PMC4489301 DOI: 10.1093/nar/gkv351
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of the SNiPlay3 process. (A) General schema of the process and graphical layouts of the different modules. For modules marked with an asterisk, analyses are computed gene by gene. Input and output file formats are indicated in red. (B) One of the SNiPlay Galaxy workflows: the SNP calling workflow based on the GATK package for data pre-processing.
Figure 2.Overview of population structure analyses. (A) Structure population inference using the Admixture software. (B) Multi-dimensional scaling (MDS) plot representation. (C) Colorized and customizable SNP-based distance tree, using InTreeGreat.
Figure 3.GWAS analyses in SNiPlay. (A) Data control: first step controls data concordance and outputs some statistics about genotypic (MAF distribution) and phenotypic (phenotypic values distribution) datasets. (B) QQ plot shows the expected distribution of association test statistics (X-axis) compared to the observed values (Y-axis). (C) Result interface displays an interactive Manhattan plot color-coded by chromosome that represents the association P-values between markers and the trait being measured. It supports zooming, which can be achieved by a ‘click, hold and drag mouse’ action on the region of interest.