Literature DB >> 23589648

Analyzing genome coverage profiles with applications to quality control in metagenomics.

Martin S Lindner1, Maximilian Kollock, Franziska Zickmann, Bernhard Y Renard.   

Abstract

MOTIVATION: Genome coverage, the number of sequencing reads mapped to a position in a genome, is an insightful indicator of irregularities within sequencing experiments. While the average genome coverage is frequently used within algorithms in computational genomics, the complete information available in coverage profiles (i.e. histograms over all coverages) is currently not exploited to its full extent. Thus, biases such as fragmented or erroneous reference genomes often remain unaccounted for. Making this information accessible can improve the quality of sequencing experiments and quantitative analyses.
RESULTS: We introduce a framework for fitting mixtures of probability distributions to genome coverage profiles. Besides commonly used distributions, we introduce distributions tailored to account for common artifacts. The mixture models are iteratively fitted based on the Expectation-Maximization algorithm. We introduce use cases with focus on metagenomics and develop new analysis strategies to assess the validity of a reference genome with respect to (meta-) genomic read data. The framework is evaluated on simulated data as well as applied to a large-scale metagenomic study, for which we compute the validity of 75 microbial genomes. The results indicate that the choice and quality of reference genomes is vital for metagenomic analyses and that validation of coverage profiles is crucial to avoid incorrect conclusions. AVAILABILITY: The code is freely available and can be downloaded from http://sourceforge.net/projects/fitgcp/. CONTACT: RenardB@rki.de SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Mesh:

Year:  2013        PMID: 23589648     DOI: 10.1093/bioinformatics/btt147

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  14 in total

1.  Introduction to Population Genomics Methods.

Authors:  Thibault Leroy; Quentin Rougemont
Journal:  Methods Mol Biol       Date:  2021

2.  Sequana coverage: detection and characterization of genomic variations using running median and mixture models.

Authors:  Dimitri Desvillechabrol; Christiane Bouchier; Sean Kennedy; Thomas Cokelaer
Journal:  Gigascience       Date:  2018-12-01       Impact factor: 6.524

3.  Stepwise large genome assembly approach: a case of Siberian larch (Larix sibirica Ledeb).

Authors:  Dmitry A Kuzmin; Sergey I Feranchuk; Vadim V Sharov; Alexander N Cybin; Stepan V Makolov; Yuliya A Putintseva; Natalya V Oreshkova; Konstantin V Krutovsky
Journal:  BMC Bioinformatics       Date:  2019-02-05       Impact factor: 3.169

4.  Birth of a W sex chromosome by horizontal transfer of Wolbachia bacterial symbiont genome.

Authors:  Sébastien Leclercq; Julien Thézé; Mohamed Amine Chebbi; Isabelle Giraud; Bouziane Moumen; Lise Ernenwein; Pierre Grève; Clément Gilbert; Richard Cordaux
Journal:  Proc Natl Acad Sci U S A       Date:  2016-12-06       Impact factor: 11.205

Review 5.  Music of metagenomics-a review of its applications, analysis pipeline, and associated tools.

Authors:  Bilal Wajid; Faria Anwar; Imran Wajid; Haseeb Nisar; Sharoze Meraj; Ali Zafar; Mustafa Kamal Al-Shawaqfeh; Ali Riza Ekti; Asia Khatoon; Jan S Suchodolski
Journal:  Funct Integr Genomics       Date:  2021-10-18       Impact factor: 3.410

6.  Low-bandwidth and non-compute intensive remote identification of microbes from raw sequencing reads.

Authors:  Laurent Gautier; Ole Lund
Journal:  PLoS One       Date:  2013-12-31       Impact factor: 3.240

7.  Metagenomic profiling of known and unknown microbes with microbeGPS.

Authors:  Martin S Lindner; Bernhard Y Renard
Journal:  PLoS One       Date:  2015-02-02       Impact factor: 3.240

8.  SLIMM: species level identification of microorganisms from metagenomes.

Authors:  Temesgen Hailemariam Dadi; Bernhard Y Renard; Lothar H Wieler; Torsten Semmler; Knut Reinert
Journal:  PeerJ       Date:  2017-03-28       Impact factor: 2.984

9.  Where did you come from, where did you go: Refining metagenomic analysis tools for horizontal gene transfer characterisation.

Authors:  Enrico Seiler; Kathrin Trappe; Bernhard Y Renard
Journal:  PLoS Comput Biol       Date:  2019-07-23       Impact factor: 4.475

10.  Pipasic: similarity and expression correction for strain-level identification and quantification in metaproteomics.

Authors:  Anke Penzlin; Martin S Lindner; Joerg Doellinger; Piotr Wojtek Dabrowski; Andreas Nitsche; Bernhard Y Renard
Journal:  Bioinformatics       Date:  2014-06-15       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.