Literature DB >> 30505061

Statistical correction for functional metagenomic profiling of a microbial community with short NGS reads.

Ruofei Du1, Zhide Fang2.   

Abstract

By sequence homology search, the list of all the functions found and the counts of reads being aligned to them present the functional profile of a metagenomic sample. However, a significant obstacle has been observed in this approach due to the short read length associated with many next generation sequencing technologies. This includes artificial families, cross-annotations, length bias and conservation bias. The widely applied cutoff methods, such as BLAST E-value, are not able to solve the problems. Following the published successful procedures on the artificial families and the cross-annotation issue, we propose in this paper to use zero-truncated Poisson and Binomial (ZTP-Bin) hierarchical modelling to correct the length bias and the conservation bias. Goodness-of-fit of the modelling and cross-validation for the prediction using a bioinformatic simulated sample show the validity of this approach. Evaluated on an in vitro-simulated data set, the proposed modelling method outperforms other traditional methods. All three steps were then sequentially applied on real-life metagenomic samples to show that the proposed framework will lead to a more accurate functional profile of a short read metagenomic sample.

Entities:  

Keywords:  Primary 62F10; conservation bias; functional profiling; length bias; metagenomics; secondary 62P10; short reads

Year:  2018        PMID: 30505061      PMCID: PMC6261491          DOI: 10.1080/02664763.2018.1426741

Source DB:  PubMed          Journal:  J Appl Stat        ISSN: 0266-4763            Impact factor:   1.404


  30 in total

Review 1.  Review and future prospects for DNA barcoding methods in forensic palynology.

Authors:  Karen L Bell; Kevin S Burgess; Kazufusa C Okamoto; Roman Aranda; Berry J Brosi
Journal:  Forensic Sci Int Genet       Date:  2015-12-21       Impact factor: 4.882

2.  Average gene length is highly conserved in prokaryotes and eukaryotes and diverges only between the two kingdoms.

Authors:  Lin Xu; Hong Chen; Xiaohua Hu; Rongmei Zhang; Ze Zhang; Z W Luo
Journal:  Mol Biol Evol       Date:  2006-04-12       Impact factor: 16.240

3.  Functional metagenomic profiling of nine biomes.

Authors:  Elizabeth A Dinsdale; Robert A Edwards; Dana Hall; Florent Angly; Mya Breitbart; Jennifer M Brulc; Mike Furlan; Christelle Desnues; Matthew Haynes; Linlin Li; Lauren McDaniel; Mary Ann Moran; Karen E Nelson; Christina Nilsson; Robert Olson; John Paul; Beltran Rodriguez Brito; Yijun Ruan; Brandon K Swan; Rick Stevens; David L Valentine; Rebecca Vega Thurber; Linda Wegley; Bryan A White; Forest Rohwer
Journal:  Nature       Date:  2008-03-12       Impact factor: 49.962

4.  Microbial community gene expression in ocean surface waters.

Authors:  Jorge Frias-Lopez; Yanmei Shi; Gene W Tyson; Maureen L Coleman; Stephan C Schuster; Sallie W Chisholm; Edward F Delong
Journal:  Proc Natl Acad Sci U S A       Date:  2008-03-03       Impact factor: 11.205

Review 5.  Next-generation DNA sequencing methods.

Authors:  Elaine R Mardis
Journal:  Annu Rev Genomics Hum Genet       Date:  2008       Impact factor: 8.929

6.  Revisiting bovine pyometra--new insights into the disease using a culture-independent deep sequencing approach.

Authors:  Lif Rødtness Vesterby Knudsen; Cecilia Christensen Karstrup; Hanne Gervi Pedersen; Jørgen Steen Agerholm; Tim Kåre Jensen; Kirstine Klitgaard
Journal:  Vet Microbiol       Date:  2014-12-19       Impact factor: 3.293

Review 7.  Metagenomics for studying unculturable microorganisms: cutting the Gordian knot.

Authors:  Patrick D Schloss; Jo Handelsman
Journal:  Genome Biol       Date:  2005-08-01       Impact factor: 13.583

8.  Prokaryotic assemblages and metagenomes in pelagic zones of the South China Sea.

Authors:  Ching-Hung Tseng; Pei-Wen Chiang; Hung-Chun Lai; Fuh-Kwo Shiah; Ting-Chang Hsu; Yi-Lung Chen; Liang-Saw Wen; Chun-Mao Tseng; Wung-Yang Shieh; Isaam Saeed; Saman Halgamuge; Sen-Lin Tang
Journal:  BMC Genomics       Date:  2015-03-20       Impact factor: 3.969

9.  MetaSim: a sequencing simulator for genomics and metagenomics.

Authors:  Daniel C Richter; Felix Ott; Alexander F Auch; Ramona Schmid; Daniel H Huson
Journal:  PLoS One       Date:  2008-10-08       Impact factor: 3.240

10.  Genome Sequence of Porphyromonas gingivalis Strain A7A1-28.

Authors:  Gary Xie; Ryan P Chastain-Gross; Myriam Bélanger; Dibyendu Kumar; Joan A Whitlock; Li Liu; William G Farmerie; Collin L Zeng; Hajnalka E Daligault; Cliff S Han; Thomas S Brettin; Ann Progulske-Fox
Journal:  Genome Announc       Date:  2017-03-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.