| Literature DB >> 30475995 |
Matthew D Czajkowski1, Daniel P Vance1, Steven A Frese1,2, Giorgio Casaburi1.
Abstract
SUMMARY: The removal of human genomic reads from shotgun metagenomic sequencing is a critical step in protecting subject privacy. Freely available tools addressing this issue require advanced programing knowledge or are limited by analytical time and data load due to their server-based nature. Here, we compared the most cited tools for host-DNA removal using synthetic and real metagenomic datasets. Then, we integrated the most efficient pipeline in a graphical user interface to make these tools available without command line use. This interface, GenCoF, rapidly removes human genome contaminants from metagenomic datasets. Additionally, the tool offers quality-filtering, data reduction and interactive modification of any parameter in order to customize the analysis. GenCoF offers both quality and host-associated filtering in a non-commercial, freely available tool in a local, interactive and easy-to-use interface.Entities:
Mesh:
Year: 2019 PMID: 30475995 PMCID: PMC6596892 DOI: 10.1093/bioinformatics/bty963
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Analysis of program performance. (A) Average error rate of synthetic reads wrongly assigned as non-human. (B) Average CPU/Hour. (C) Average error rate of synthetic reads wrongly assigned as human. (D) Average error rate of real dataset. Error bars represent standard error