Literature DB >> 22174281

Artificial functional difference between microbial communities caused by length difference of sequencing reads.

Quan Zhang1, Thomas G Doak, Yuzhen Ye.   

Abstract

Homology-based approaches are often used for the annotation of microbial communities, providing functional profiles that are used to characterize and compare the content and the functionality of microbial communities. Metagenomic reads are the starting data for these studies, however considerable differences are observed between the functional profiles-built from sequencing reads produced by different sequencing techniques-for even the same microbial community. Using simulation experiments, we show that such functional differences are likely to be caused by the actual difference in read lengths, and are not the results of a sampling bias of the sequencing techniques. Furthermore, the functional differences derived from different sequencing techniques cannot be fully explained by the read-count bias, i.e. 1) the higher fraction of unannotated shorter reads (i.e., "read length matters"), and 2) the different lengths of proteins in different functional categories. Instead, we show here that specific functional categories are under-annotated, because similarity-search-based functional annotation tools tend to miss more reads from functional categories that contain less conserved genes/proteins. In addition, the accuracy of functional annotation of short reads for different functions varies, further skewing the functional profiles. To address these issues, we present a simple yet efficient method to improve the frequency estimates of different functional categories in the functional profiles of metagenomes, based on the functional annotation of simulated reads from complete microbial genomes.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 22174281

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  7 in total

1.  Statistical correction for functional metagenomic profiling of a microbial community with short NGS reads.

Authors:  Ruofei Du; Zhide Fang
Journal:  J Appl Stat       Date:  2018-01-27       Impact factor: 1.404

Review 2.  Third-generation sequencing techniques and applications to drug discovery.

Authors:  Fatih Ozsolak
Journal:  Expert Opin Drug Discov       Date:  2012-02-02       Impact factor: 6.098

3.  Comparative analysis of functional metagenomic annotation and the mappability of short reads.

Authors:  Rogan Carr; Elhanan Borenstein
Journal:  PLoS One       Date:  2014-08-22       Impact factor: 3.240

4.  Moleculo Long-Read Sequencing Facilitates Assembly and Genomic Binning from Complex Soil Metagenomes.

Authors:  Richard Allen White; Eric M Bottos; Taniya Roy Chowdhury; Jeremy D Zucker; Colin J Brislawn; Carrie D Nicora; Sarah J Fansler; Kurt R Glaesemann; Kevin Glass; Janet K Jansson
Journal:  mSystems       Date:  2016-06-28       Impact factor: 6.496

5.  MinION™ nanopore sequencing of environmental metagenomes: a synthetic approach.

Authors:  Bonnie L Brown; Mick Watson; Samuel S Minot; Maria C Rivera; Rima B Franklin
Journal:  Gigascience       Date:  2017-03-01       Impact factor: 6.524

6.  An artificial functional family filter in homolog searching in next-generation sequencing metagenomics.

Authors:  Ruofei Du; Donald Mercante; Zhide Fang
Journal:  PLoS One       Date:  2013-03-14       Impact factor: 3.240

7.  A Statistical Approach to Correcting Cross-Annotations in a Metagenomic Functional Profile Generated by Short Reads.

Authors:  Ruofei Du; Donald Mercante; Lingling An; Zhide Fang
Journal:  J Biom Biostat       Date:  2014-11-10
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.