HTSlib: C library for reading/writing high-throughput sequencing data.

James K Bonfield, John Marshall, Petr Danecek, Heng Li, Valeriu Ohan, Andrew Whitwham, Thomas Keane, Robert M Davies.

Abstract

BACKGROUND: Since the original publication of the VCF and SAM formats, an explosion of software tools has been created to process these data files. To facilitate this, a library was produced out of the original SAMtools implementation, with a focus on performance and robustness. The file formats themselves have become international standards under the jurisdiction of the Global Alliance for Genomics and Health.
FINDINGS: We present a software library that provides programmatic access to sequencing alignment and variant formats. It was born out of the widely used SAMtools and BCFtools applications. Considerable improvements have been made to the original code, and many new features have been added, including newer access protocols, the CRAM file format, better indexing and iterators, and better use of threading.
CONCLUSION: Since the original SAMtools release, performance has been considerably improved, with a BAM read-write loop running 5 times faster and BAM to SAM conversion 13 times faster (both using 16 threads, compared to SAMtools 0.1.19). Widespread adoption has seen HTSlib downloaded >1 million times from GitHub and conda. The C library has been used directly by an estimated 900 GitHub projects and has been incorporated into Perl, Python, Rust, and R, significantly expanding the number of uses via other languages. HTSlib is open source and is freely available from htslib.org under the MIT/BSD license.
© The Author(s) 2021. Published by Oxford University Press GigaScience.

Keywords:  bcftools; data analysis; high-throughput sequencing; next generation sequencing; samtools; variant calling

Year:  2021        PMID: 33594436     DOI: 10.1093/gigascience/giab007

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


  26 in total

1.  Whole Genome Analysis of Dizygotic Twins With Autism Reveals Prevalent Transposon Insertion Within Neuronal Regulatory Elements: Potential Implications for Disease Etiology and Clinical Assessment.

Authors:  Kaan Okay; Pelin Ünal Varış; Süha Miral; Athanasia Pavlopoulou; Yavuz Oktay; Gökhan Karakülah
Journal:  J Autism Dev Disord       Date:  2022-06-27

2.  Protocol for unbiased, consolidated variant calling from whole exome sequencing data.

Authors:  Kleio-Maria Verrou; Georgios A Pavlopoulos; Panagiotis Moulos
Journal:  STAR Protoc       Date:  2022-05-30

3.  A spectrum of free software tools for processing the VCF variant call format: vcflib, bio-vcf, cyvcf2, hts-nim and slivar.

Authors:  Erik Garrison; Zev N Kronenberg; Eric T Dawson; Brent S Pedersen; Pjotr Prins
Journal:  PLoS Comput Biol       Date:  2022-05-31       Impact factor: 4.779

4.  Cross reactivity of neutralizing antibodies to the encephalitic California Serogroup orthobunyaviruses varies by virus and genetic relatedness.

Authors:  Alyssa B Evans; Karin E Peterson
Journal:  Sci Rep       Date:  2021-08-12       Impact factor: 4.996

5.  Chromatin-associated MRN complex protects highly transcribing genes from genomic instability.

Authors:  Kader Salifou; Callum Burnard; Poornima Basavarajaiah; Giuseppa Grasso; Marion Helsmoortel; Victor Mac; David Depierre; Céline Franckhauser; Emmanuelle Beyne; Xavier Contreras; Jérôme Dejardin; Sylvie Rouquier; Olivier Cuvier; Rosemary Kiernan
Journal:  Sci Adv       Date:  2021-05-21       Impact factor: 14.136

6.  BiSulfite Bolt: A bisulfite sequencing analysis platform.

Authors:  Colin Farrell; Michael Thompson; Anela Tosevska; Adewale Oyetunde; Matteo Pellegrini
Journal:  Gigascience       Date:  2021-05-08       Impact factor: 6.524

7.  Pyridylpiperazine-based allosteric inhibitors of RND-type multidrug efflux pumps.

Authors:  Coline Plé; Heng-Keat Tam; Anais Vieira Da Cruz; Nina Compagne; Juan-Carlos Jiménez-Castellanos; Reinke T Müller; Elizabeth Pradel; Wuen Ee Foong; Giuliano Malloci; Alexia Ballée; Moritz A Kirchner; Parisa Moshfegh; Adrien Herledan; Andrea Herrmann; Benoit Deprez; Nicolas Willand; Attilio Vittorio Vargiu; Klaas M Pos; Marion Flipo; Ruben C Hartkoorn
Journal:  Nat Commun       Date:  2022-01-10       Impact factor: 14.919

8.  The genome of the rice variety LTH provides insight into its universal susceptibility mechanism to worldwide rice blast fungal strains.

Authors:  Lei Yang; Mengfei Zhao; Gan Sha; Qiping Sun; Qiuwen Gong; Qun Yang; Kabin Xie; Meng Yuan; Jenny C Mortimer; Weibo Xie; Tong Wei; Zhensheng Kang; Guotian Li
Journal:  Comput Struct Biotechnol J       Date:  2022-02-10       Impact factor: 7.271

9.  The Prognostic Value and Immune Landscapes of a m6A/m5C/m1A-Related LncRNAs Signature in Head and Neck Squamous Cell Carcinoma.

Authors:  Enhao Wang; Yang Li; Ruijie Ming; Jiahui Wei; Peiyu Du; Peng Zhou; Shimin Zong; Hongjun Xiao
Journal:  Front Cell Dev Biol       Date:  2021-11-30

10.  Pheniqs 2.0: accurate, high-performance Bayesian decoding and confidence estimation for combinatorial barcode indexing.

Authors:  Lior Galanti; Dennis Shasha; Kristin C Gunsalus
Journal:  BMC Bioinformatics       Date:  2021-07-02       Impact factor: 3.169
