Literature DB >> 27706213

SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation.

Wei Shen1, Shuai Le1, Yan Li2, Fuquan Hu1.   

Abstract

FASTA and FASTQ are basic and ubiquitous formats for storing nucleotide and protein sequences. Common manipulations of FASTA/Q file include converting, searching, filtering, deduplication, splitting, shuffling, and sampling. Existing tools only implement some of these manipulations, and not particularly efficiently, and some are only available for certain operating systems. Furthermore, the complicated installation process of required packages and running environments can render these programs less user friendly. This paper describes a cross-platform ultrafast comprehensive toolkit for FASTA/Q processing. SeqKit provides executable binary files for all major operating systems, including Windows, Linux, and Mac OSX, and can be directly used without any dependencies or pre-configurations. SeqKit demonstrates competitive performance in execution time and memory usage compared to similar tools. The efficiency and usability of SeqKit enable researchers to rapidly accomplish common FASTA/Q file manipulations. SeqKit is open source and available on Github at https://github.com/shenwei356/seqkit.

Entities:  

Mesh:

Year:  2016        PMID: 27706213      PMCID: PMC5051824          DOI: 10.1371/journal.pone.0163962

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  5 in total

1.  Rapid and sensitive protein similarity searches.

Authors:  D J Lipman; W R Pearson
Journal:  Science       Date:  1985-03-22       Impact factor: 47.728

2.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

Review 3.  The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants.

Authors:  Peter J A Cock; Christopher J Fields; Naohisa Goto; Michael L Heuer; Peter M Rice
Journal:  Nucleic Acids Res       Date:  2009-12-16       Impact factor: 16.971

4.  BEDTools: The Swiss-Army Tool for Genome Feature Analysis.

Authors:  Aaron R Quinlan
Journal:  Curr Protoc Bioinformatics       Date:  2014-09-08

5.  A novel algorithm for detecting multiple covariance and clustering of biological sequences.

Authors:  Wei Shen; Yan Li
Journal:  Sci Rep       Date:  2016-07-25       Impact factor: 4.379

  5 in total
  347 in total

1.  Firefly genomes illuminate parallel origins of bioluminescence in beetles.

Authors:  Timothy R Fallon; Sarah E Lower; Ching-Ho Chang; Manabu Bessho-Uehara; Gavin J Martin; Adam J Bewick; Megan Behringer; Humberto J Debat; Isaac Wong; John C Day; Anton Suvorov; Christian J Silva; Kathrin F Stanger-Hall; David W Hall; Robert J Schmitz; David R Nelson; Sara M Lewis; Shuji Shigenobu; Seth M Bybee; Amanda M Larracuente; Yuichi Oba; Jing-Ke Weng
Journal:  Elife       Date:  2018-10-16       Impact factor: 8.140

2.  Emergence of a Plant Pathogen in Europe Associated with Multiple Intercontinental Introductions.

Authors:  Blanca B Landa; Andreina I Castillo; Annalisa Giampetruzzi; Alexandra Kahn; Miguel Román-Écija; María Pilar Velasco-Amo; Juan A Navas-Cortés; Ester Marco-Noales; Silvia Barbé; Eduardo Moralejo; Helvecio D Coletta-Filho; Pasquale Saldarelli; Maria Saponari; Rodrigo P P Almeida
Journal:  Appl Environ Microbiol       Date:  2020-01-21       Impact factor: 4.792

3.  Liver transcriptome resources of four commercially exploited teleost species.

Authors:  André M Machado; Antonio Muñoz-Merida; Elza Fonseca; Ana Veríssimo; Rui Pinto; Mónica Felício; Rute R da Fonseca; Elsa Froufe; L Filipe C Castro
Journal:  Sci Data       Date:  2020-07-07       Impact factor: 6.444

4.  Identification and Characterization of Mycobacterial Species Using Whole-Genome Sequences.

Authors:  Marco A Riojas; Andrew M Frank; Samuel R Greenfield; Stephen P King; Conor J Meehan; Michael Strong; Alice R Wattam; Manzour Hernando Hazbón
Journal:  Methods Mol Biol       Date:  2021

5.  The tRNA pseudouridine synthase TruB1 regulates the maturation of let-7 miRNA.

Authors:  Ryota Kurimoto; Tomoki Chiba; Yoshiaki Ito; Takahide Matsushima; Yuki Yano; Kohei Miyata; Yuka Yashiro; Tsutomu Suzuki; Kozo Tomita; Hiroshi Asahara
Journal:  EMBO J       Date:  2020-09-14       Impact factor: 11.598

6.  Nanopore direct RNA sequencing maps the complexity of Arabidopsis mRNA processing and m6A modification.

Authors:  Matthew T Parker; Katarzyna Knop; Anna V Sherwood; Nicholas J Schurch; Katarzyna Mackinnon; Peter D Gould; Anthony Jw Hall; Geoffrey J Barton; Gordon G Simpson
Journal:  Elife       Date:  2020-01-14       Impact factor: 8.140

7.  Identification of a distinct lineage of aviadenovirus from crane feces.

Authors:  Yahiro Mukai; Yuriko Tomita; Kirill Kryukov; So Nakagawa; Makoto Ozawa; Tsutomu Matsui; Keizo Tomonaga; Tadashi Imanishi; Yoshihiro Kawaoka; Tokiko Watanabe; Masayuki Horie
Journal:  Virus Genes       Date:  2019-09-23       Impact factor: 2.332

8.  ZCWPW1 is recruited to recombination hotspots by PRDM9 and is essential for meiotic double strand break repair.

Authors:  Daniel Wells; Emmanuelle Bitoun; Daniela Moralli; Gang Zhang; Anjali Hinch; Julia Jankowska; Peter Donnelly; Catherine Green; Simon R Myers
Journal:  Elife       Date:  2020-08-03       Impact factor: 8.140

9.  Identification of a reptile lyssavirus in Anolis allogus provided novel insights into lyssavirus evolution.

Authors:  Masayuki Horie; Hiroshi Akashi; Masakado Kawata; Keizo Tomonaga
Journal:  Virus Genes       Date:  2020-11-07       Impact factor: 2.332

10.  The Cassandra retrotransposon landscape in sugar beet (Beta vulgaris) and related Amaranthaceae: recombination and re-shuffling lead to a high structural variability.

Authors:  Sophie Maiwald; Beatrice Weber; Kathrin M Seibt; Thomas Schmidt; Tony Heitkam
Journal:  Ann Bot       Date:  2021-01-01       Impact factor: 4.357

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.