Literature DB >> 30292006

Virus detection in high-throughput sequencing data without a reference genome of the host.

Jochen Kruppa1, Wendy K Jo2, Erhard van der Vries3, Martin Ludlow2, Albert Osterhaus2, Wolfgang Baumgaertner4, Klaus Jung5.   

Abstract

Discovery of novel viruses in host samples is a multidisciplinary process which relies increasingly on next-generation sequencing (NGS) followed by computational analysis. A crucial step in this analysis is to separate host sequence reads from the sequence reads of the virus to be discovered. This becomes especially difficult if no reference genome of the host is available. Furthermore, if the total number of viral reads in a sample is low, de novo assembly of a virus which is a requirement for most existing pipelines is hard to realize. We present a new modular, computational pipeline for discovery of novel viruses in host samples. While existing pipelines rely on the availability of the hosts reference genome for filtering sequence reads, our new pipeline can also cope with cases for which no reference genome is available. As a further novelty of our method a decoy module is used to assess false classification rates in the discovery process. Additionally, viruses with a low read coverage can be identified and visually reviewed. We validate our pipeline on simulated data as well as two experimental samples with known virus content. For the experimental samples, we were able to reproduce the laboratory findings. Our newly developed pipeline is applicable for virus detection in a wide range of host species. The three modules we present can either be incorporated individually in other pipelines or be used as a stand-alone pipeline. We are the first to present a decoy approach within a virus detection pipeline that can be used to assess error rates so that the quality of the final result can be judged. We provide an implementation of our modules via Github. However, the principle of the modules can easily be re-implemented by other researchers.
Copyright © 2018 Elsevier B.V. All rights reserved.

Keywords:  Decoy database; Metagenomics; Read mapping; Reference genome; Virus discovery

Mesh:

Year:  2018        PMID: 30292006     DOI: 10.1016/j.meegid.2018.09.026

Source DB:  PubMed          Journal:  Infect Genet Evol        ISSN: 1567-1348            Impact factor:   3.342


  5 in total

Review 1.  2019 meeting of the global virus network.

Authors:  Ramesh Akkina; Robert Garry; Christian Bréchot; Heinz Ellerbrok; Hideki Hasegawa; Luis Menéndez-Arias; Natalia Mercer; Johan Neyts; Victor Romanowski; Joaquim Segalés; Anders Vahlne
Journal:  Antiviral Res       Date:  2019-11-04       Impact factor: 5.970

2.  Correcting the Estimation of Viral Taxa Distributions in Next-Generation Sequencing Data after Applying Artificial Neural Networks.

Authors:  Moritz Kohls; Magdalena Kircher; Jessica Krepel; Pamela Liebig; Klaus Jung
Journal:  Genes (Basel)       Date:  2021-10-31       Impact factor: 4.096

3.  Heat Stress Resistance Mechanisms of Two Cucumber Varieties from Different Regions.

Authors:  Bingwei Yu; Fangyan Ming; Yonggui Liang; Yixi Wang; Yuwei Gan; Zhengkun Qiu; Shuangshuang Yan; Bihao Cao
Journal:  Int J Mol Sci       Date:  2022-02-05       Impact factor: 5.923

4.  Clinical Application and Influencing Factor Analysis of Metagenomic Next-Generation Sequencing (mNGS) in ICU Patients With Sepsis.

Authors:  Limin Sun; Shuguang Zhang; Ziyue Yang; Fei Yang; Zhenhua Wang; Hongqiang Li; Yaoguang Li; Tongwen Sun
Journal:  Front Cell Infect Microbiol       Date:  2022-07-13       Impact factor: 6.073

5.  An evolutionary divergent pestivirus lacking the Npro gene systemically infects a whale species.

Authors:  Wendy K Jo; Cornelis van Elk; Marco van de Bildt; Peter van Run; Monique Petry; Sonja T Jesse; Klaus Jung; Martin Ludlow; Thijs Kuiken; Albert Osterhaus
Journal:  Emerg Microbes Infect       Date:  2019       Impact factor: 7.163

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.