| Literature DB >> 35561197 |
Givanna H Putri1,2, Simon Anders3, Paul Theodor Pyl4, John E Pimanda1,2,5,6, Fabio Zanini1,2,7.
Abstract
SUMMARY: HTSeq 2.0 provides a more extensive application programming interface including a new representation for sparse genomic data, enhancements for htseq-count to suit single-cell omics, a new script for data using cell and molecular barcodes, improved documentation, testing and deployment, bug fixes and Python 3 support.Entities:
Mesh:
Year: 2022 PMID: 35561197 PMCID: PMC9113351 DOI: 10.1093/bioinformatics/btac166
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Major (A–C) Improvements to htseq-count. (A) Parallel processing on multicore architectures enables faster processing of single-cell data, where each cell is represented by a BAM file [typical for Smart-seq2 (Picelli et al. 2013) and viscRNA-Seq (Zanini )]. Note the new output formats available in HTSeq 2.0. (B) Conventional gene–cell matrix, which collapses reads that align to distinct exons of the same gene into a single gene count. (C) Additional attributes enable quantification at the exon level while retaining information on which gene each exon belongs to. (D, E) Sparse data representations in HTSeq 2.0. (D) StepVector represents piecewise-constant sparse genomic data. (E) StretchVector represents sparse islands of genomic data