Literature DB >> 26500154

BGT: efficient and flexible genotype query across many samples.

Heng Li1.   

Abstract

UNLABELLED: BGT is a compact format, a fast command line tool and a simple web application for efficient and convenient query of whole-genome genotypes and frequencies across tens to hundreds of thousands of samples. On real data, it encodes the haplotypes of 32 488 samples across 39.2 million SNPs into a 7.4 GB database and decodes up to 420 million genotypes per CPU second. The high performance enables real-time responses to complex queries.
AVAILABILITY AND IMPLEMENTATION: https://github.com/lh3/bgt.
© The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2015        PMID: 26500154      PMCID: PMC5963361          DOI: 10.1093/bioinformatics/btv613

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  4 in total

1.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

2.  Efficient haplotype matching and storage using the positional Burrows-Wheeler transform (PBWT).

Authors:  Richard Durbin
Journal:  Bioinformatics       Date:  2014-01-09       Impact factor: 6.937

3.  An integrated map of genetic variation from 1,092 human genomes.

Authors:  Goncalo R Abecasis; Adam Auton; Lisa D Brooks; Mark A DePristo; Richard M Durbin; Robert E Handsaker; Hyun Min Kang; Gabor T Marth; Gil A McVean
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

4.  GrabBlur--a framework to facilitate the secure exchange of whole-exome and -genome SNV data using VCF files.

Authors:  Björn Stade; Dominik Seelow; Ingo Thomsen; Michael Krawczak; Andre Franke
Journal:  BMC Genomics       Date:  2014-05-20       Impact factor: 3.969

  4 in total
  15 in total

1.  Accurate, scalable cohort variant calls using DeepVariant and GLnexus.

Authors:  Taedong Yun; Helen Li; Pi-Chuan Chang; Michael F Lin; Andrew Carroll; Cory Y McLean
Journal:  Bioinformatics       Date:  2021-01-05       Impact factor: 6.937

2.  Robust and rapid algorithms facilitate large-scale whole genome sequencing downstream analysis in an integrative framework.

Authors:  Miaoxin Li; Jiang Li; Mulin Jun Li; Zhicheng Pan; Jacob Shujui Hsu; Dajiang J Liu; Xiaowei Zhan; Junwen Wang; Youqiang Song; Pak Chung Sham
Journal:  Nucleic Acids Res       Date:  2017-05-19       Impact factor: 16.971

3.  SeqArray-a storage-efficient high-performance data format for WGS variant calls.

Authors:  Xiuwen Zheng; Stephanie M Gogarten; Michael Lawrence; Adrienne Stilp; Matthew P Conomos; Bruce S Weir; Cathy Laurie; David Levine
Journal:  Bioinformatics       Date:  2017-08-01       Impact factor: 6.937

4.  Sparse Allele Vectors and the Savvy Software Suite.

Authors:  Jonathon LeFaive; Albert V Smith; Hyun Min Kang; Gonçalo Abecasis
Journal:  Bioinformatics       Date:  2021-05-14       Impact factor: 6.931

5.  A graph extension of the positional Burrows-Wheeler transform and its applications.

Authors:  Adam M Novak; Erik Garrison; Benedict Paten
Journal:  Algorithms Mol Biol       Date:  2017-07-11       Impact factor: 1.405

6.  genozip: a fast and efficient compression tool for VCF files.

Authors:  Divon Lan; Raymond Tobler; Yassine Souilmi; Bastien Llamas
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

7.  Vertical lossless genomic data compression tools for assembled genomes: A systematic literature review.

Authors:  Kelvin V Kredens; Juliano V Martins; Osmar B Dordal; Mauri Ferrandin; Roberto H Herai; Edson E Scalabrin; Bráulio C Ávila
Journal:  PLoS One       Date:  2020-05-26       Impact factor: 3.240

8.  Vcfanno: fast, flexible annotation of genetic variants.

Authors:  Brent S Pedersen; Ryan M Layer; Aaron R Quinlan
Journal:  Genome Biol       Date:  2016-06-01       Impact factor: 13.583

9.  Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes.

Authors:  Jerome Kelleher; Alison M Etheridge; Gilean McVean
Journal:  PLoS Comput Biol       Date:  2016-05-04       Impact factor: 4.475

10.  htsget: a protocol for securely streaming genomic data.

Authors:  Jerome Kelleher; Mike Lin; C H Albach; Ewan Birney; Robert Davies; Marina Gourtovaia; David Glazer; Cristina Y Gonzalez; David K Jackson; Aaron Kemp; John Marshall; Andrew Nowak; Alexander Senf; Jaime M Tovar-Corona; Alexander Vikhorev; Thomas M Keane
Journal:  Bioinformatics       Date:  2019-01-01       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.