
Minirmd: accurate and fast duplicate removal tool for short reads via multiple minimizers.

Yuansheng Liu, Xiaocai Zhang, Quan Zou, Xiangxiang Zeng.

Abstract

SUMMARY: Removing duplicate and near-duplicate reads generated by high-throughput sequencing technologies can reduce the computational resources required by downstream applications. Here we develop minirmd, a de novo tool that removes duplicate reads via multiple rounds of clustering using minimizers of different lengths. Experiments demonstrate that minirmd removes more near-duplicate reads than existing clustering approaches and is faster than existing multi-core tools. To the best of our knowledge, minirmd is the first tool to remove near-duplicates on the reverse-complementary strand.
AVAILABILITY AND IMPLEMENTATION: https://github.com/yuansliu/minirmd.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
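
The idea summarized in the abstract (bucket reads by a minimizer, compare reads only within a bucket, and repeat with a different minimizer length) can be illustrated with a small sketch. This is not the minirmd implementation: the function names, the lexicographic minimizer ordering, the Hamming-distance threshold, and the values of `ks` are all assumptions chosen for illustration, and reads are assumed to be equal-length strings over A, C, G, T.

```python
from collections import defaultdict

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string over A, C, G, T."""
    return seq.translate(_COMPLEMENT)[::-1]


def minimizer(seq, k):
    """Lexicographically smallest k-mer of seq (one simple ordering choice)."""
    return min(seq[i:i + k] for i in range(len(seq) - k + 1))


def canonical_key(seq, k):
    """Return (minimizer, oriented_read), choosing between the read and its
    reverse complement so that both strands map to the same pair."""
    rc = reverse_complement(seq)
    return min((minimizer(seq, k), seq), (minimizer(rc, k), rc))


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def remove_near_duplicates(reads, ks=(4, 3), max_mismatches=1):
    """Hypothetical multi-round filter: one clustering round per value in ks.

    A mismatch can change a read's minimizer and push it into a different
    bucket, which is why later rounds retry with another minimizer length.
    """
    kept = list(range(len(reads)))
    for k in ks:
        buckets = defaultdict(list)
        for idx in kept:
            key, oriented = canonical_key(reads[idx], k)
            buckets[key].append((idx, oriented))
        survivors = []
        for members in buckets.values():
            representatives = []
            for idx, oriented in members:
                is_duplicate = any(
                    len(rep) == len(oriented)
                    and hamming(rep, oriented) <= max_mismatches
                    for rep in representatives
                )
                if not is_duplicate:
                    representatives.append(oriented)
                    survivors.append(idx)
        kept = sorted(survivors)
    return [reads[i] for i in kept]
```

For example, given a read, its reverse complement, a one-mismatch variant, and an unrelated read, this sketch keeps only the first and the last. The greedy choice of the first read in each bucket as the representative is a simplification made here, not something the abstract specifies.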

Year:  2021        PMID: 33112385     DOI: 10.1093/bioinformatics/btaa915

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  Fast-HBR: Fast hash based duplicate read remover.

Authors:  Sami Altayyar; Abdel Monim Artoli
Journal:  Bioinformation       Date:  2022-01-31

Review 2.  Identify DNA-Binding Proteins Through the Extreme Gradient Boosting Algorithm.

Authors:  Ziye Zhao; Wen Yang; Yixiao Zhai; Yingjian Liang; Yuming Zhao
Journal:  Front Genet       Date:  2022-01-28       Impact factor: 4.599

Review 3.  Research on the Computational Prediction of Essential Genes.

Authors:  Yuxin Guo; Ying Ju; Dong Chen; Lihong Wang
Journal:  Front Cell Dev Biol       Date:  2021-12-06

4.  SparkGC: Spark based genome compression for large collections of genomes.

Authors:  Haichang Yao; Guangyong Hu; Shangdong Liu; Houzhi Fang; Yimu Ji
Journal:  BMC Bioinformatics       Date:  2022-07-25       Impact factor: 3.307

5.  Hamming-shifting graph of genomic short reads: Efficient construction and its application for compression.

Authors:  Yuansheng Liu; Jinyan Li
Journal:  PLoS Comput Biol       Date:  2021-07-19       Impact factor: 4.475

