Literature DB >> 19507285

An ORFome assembly approach to metagenomics sequences analysis.

Yuzhen Ye1, Haixu Tang.   

Abstract

Metagenomics is an emerging methodology for the direct genomic analysis of a mixed community of uncultured microorganisms. The current analyses of metagenomics data largely rely on the computational tools originally designed for microbial genomics projects. The challenge of assembling metagenomic sequences arises mainly from the short reads and the high species complexity of the community. Alternatively, individual (short) reads will be searched directly against databases of known genes (or proteins) to identify homologous sequences. The latter approach may have low sensitivity and specificity in identifying homologous sequences, which may further bias the subsequent diversity analysis. In this paper, we present a novel approach to metagenomic data analysis, called Metagenomic ORFome Assembly (MetaORFA). The whole computational framework consists of three steps. Each read from a metagenomics project will first be annotated with putative open reading frames (ORFs) that likely encode proteins. Next, the predicted ORFs are assembled into a collection of peptides using an EULER assembly method. Finally, the assembled peptides (i.e. ORFome) are used for database searching of homologs and subsequent diversity analysis. We applied MetaORFA approach to several metagenomics datasets with low coverage short reads. The results show that MetaORFA can produce long peptides even when the sequence coverage of reads is extremely low. Hence, the ORFome assembly significantly increases the sensitivity of homology searching, and may potentially improve the diversity analysis of the metagenomic data. This improvement is especially useful for metagenomic projects when the genome assembly does not work because of the low sequence coverage.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19507285      PMCID: PMC2829862          DOI: 10.1142/s0219720009004151

Source DB:  PubMed          Journal:  J Bioinform Comput Biol        ISSN: 0219-7200            Impact factor:   1.122


  41 in total

Review 1.  Environmental genomics: exploring the unmined richness of microbes to degrade xenobiotics.

Authors:  L Eyers; I George; L Schuler; B Stenuit; S N Agathos; Said El Fantroussi
Journal:  Appl Microbiol Biotechnol       Date:  2004-08-13       Impact factor: 4.813

2.  De novo repeat classification and fragment assembly.

Authors:  Pavel A Pevzner; Paul A Pevzner; Haixu Tang; Glenn Tesler
Journal:  Genome Res       Date:  2004-09       Impact factor: 9.043

Review 3.  Comparative analysis of environmental sequences: potential and challenges.

Authors:  Konrad U Foerstner; Christian von Mering; Peer Bork
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2006-03-29       Impact factor: 6.237

4.  Tandem repeats finder: a program to analyze DNA sequences.

Authors:  G Benson
Journal:  Nucleic Acids Res       Date:  1999-01-15       Impact factor: 16.971

5.  An obesity-associated gut microbiome with increased capacity for energy harvest.

Authors:  Peter J Turnbaugh; Ruth E Ley; Michael A Mahowald; Vincent Magrini; Elaine R Mardis; Jeffrey I Gordon
Journal:  Nature       Date:  2006-12-21       Impact factor: 49.962

6.  Metagenomic analysis of the human distal gut microbiome.

Authors:  Steven R Gill; Mihai Pop; Robert T Deboy; Paul B Eckburg; Peter J Turnbaugh; Buck S Samuel; Jeffrey I Gordon; David A Relman; Claire M Fraser-Liggett; Karen E Nelson
Journal:  Science       Date:  2006-06-02       Impact factor: 47.728

7.  ARACHNE: a whole-genome shotgun assembler.

Authors:  Serafim Batzoglou; David B Jaffe; Ken Stanley; Jonathan Butler; Sante Gnerre; Evan Mauceli; Bonnie Berger; Jill P Mesirov; Eric S Lander
Journal:  Genome Res       Date:  2002-01       Impact factor: 9.043

8.  Bioinformatics for whole-genome shotgun sequencing of microbial communities.

Authors:  Kevin Chen; Lior Pachter
Journal:  PLoS Comput Biol       Date:  2005-07       Impact factor: 4.475

9.  CAMERA: a community resource for metagenomics.

Authors:  Rekha Seshadri; Saul A Kravitz; Larry Smarr; Paul Gilna; Marvin Frazier
Journal:  PLoS Biol       Date:  2007-03       Impact factor: 8.029

10.  The marine viromes of four oceanic regions.

Authors:  Florent E Angly; Ben Felts; Mya Breitbart; Peter Salamon; Robert A Edwards; Craig Carlson; Amy M Chan; Matthew Haynes; Scott Kelley; Hong Liu; Joseph M Mahaffy; Jennifer E Mueller; Jim Nulton; Robert Olson; Rachel Parsons; Steve Rayhawk; Curtis A Suttle; Forest Rohwer
Journal:  PLoS Biol       Date:  2006-11       Impact factor: 8.029

View more
  16 in total

1.  Novel sequence-based method for identifying transcription factor binding sites in prokaryotic genomes.

Authors:  Gurmukh Sahota; Gary D Stormo
Journal:  Bioinformatics       Date:  2010-08-31       Impact factor: 6.937

2.  Metagenomics: Facts and Artifacts, and Computational Challenges*

Authors:  John C Wooley; Yuzhen Ye
Journal:  J Comput Sci Technol       Date:  2009-01       Impact factor: 1.571

Review 3.  A clinician's guide to microbiome analysis.

Authors:  Marcus J Claesson; Adam G Clooney; Paul W O'Toole
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2017-08-09       Impact factor: 46.802

Review 4.  A primer on metagenomics.

Authors:  John C Wooley; Adam Godzik; Iddo Friedberg
Journal:  PLoS Comput Biol       Date:  2010-02-26       Impact factor: 4.475

5.  MetaVelvet: an extension of Velvet assembler to de novo metagenome assembly from short sequence reads.

Authors:  Toshiaki Namiki; Tsuyoshi Hachiya; Hideaki Tanaka; Yasubumi Sakakibara
Journal:  Nucleic Acids Res       Date:  2012-07-19       Impact factor: 16.971

6.  SPA: a short peptide assembler for metagenomic data.

Authors:  Youngik Yang; Shibu Yooseph
Journal:  Nucleic Acids Res       Date:  2013-02-23       Impact factor: 16.971

Review 7.  Functional assignment of metagenomic data: challenges and applications.

Authors:  Tulika Prakash; Todd D Taylor
Journal:  Brief Bioinform       Date:  2012-07-06       Impact factor: 11.622

8.  Evaluating the fidelity of de novo short read metagenomic assembly using simulated data.

Authors:  Miguel Pignatelli; Andrés Moya
Journal:  PLoS One       Date:  2011-05-23       Impact factor: 3.240

9.  Stitching gene fragments with a network matching algorithm improves gene assembly for metagenomics.

Authors:  Yu-Wei Wu; Mina Rho; Thomas G Doak; Yuzhen Ye
Journal:  Bioinformatics       Date:  2012-09-15       Impact factor: 6.937

Review 10.  Bioinformatic approaches for functional annotation and pathway inference in metagenomics data.

Authors:  Carlotta De Filippo; Matteo Ramazzotti; Paolo Fontana; Duccio Cavalieri
Journal:  Brief Bioinform       Date:  2012-11       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.