| Literature DB >> 27267926 |
Guilhem Sempéré1,2, Florian Philippe3, Alexis Dereeper4,5, Manuel Ruiz4,6,7,8, Gautier Sarah4,9, Pierre Larmande4,3,7,10.
Abstract
BACKGROUND: Exploring the structure of genomes and analyzing their evolution is essential to understanding the ecological adaptation of organisms. However, with the large amounts of data being produced by next-generation sequencing, computational challenges arise in terms of storage, search, sharing, analysis and visualization. This is particularly true with regards to studies of genomic variation, which are currently lacking scalable and user-friendly data exploration solutions. DESCRIPTION: Here we present Gigwa, a web-based tool that provides an easy and intuitive way to explore large amounts of genotyping data by filtering it not only on the basis of variant features, including functional annotations, but also on genotype patterns. The data storage relies on MongoDB, which offers good scalability properties. Gigwa can handle multiple databases and may be deployed in either single- or multi-user mode. In addition, it provides a wide range of popular export formats.Entities:
Keywords: Genomic variations; HapMap; INDEL; MongoDB; NoSQL; SNP; VCF; Web interface
Mesh:
Year: 2016 PMID: 27267926 PMCID: PMC4897896 DOI: 10.1186/s13742-016-0131-8
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Fig. 1Action panel enabling project selection, progress indication, abort and export functionalities
Fig. 2Variant-filtering interface, allowing search criteria definition and result browsing
Fig 3Variant detail dialogue, providing variant metadata and genotype-level information
Fig. 4Response-time plot by tool for first benchmarked query (location-based filter). VCFtools is by far the slowest option
Fig. 5Response-time plot by tool for second benchmarked query (MAF filter). MySQL is by far the slowest option