Literature DB >> 30657979

Amino acid based de Bruijn graph algorithm for identifying complete coding genes from metagenomic and metatranscriptomic short reads.

Jiemeng Liu1,2, Qichao Lian1, Yamao Chen1, Ji Qi1.   

Abstract

Metagenomic studies, greatly promoted by the fast development of next-generation sequencing (NGS) technologies, uncover complex structures of microbial communities and their interactions with environment. As the majority of microbes lack information of genome sequences, it is essential to assemble prokaryotic genomes ab initio aiming to retrieve complete coding genes from various metabolic pathways. The complex nature of microbial composition and the burden of handling a vast amount of metagenomic data, bring great challenges to the development of effective and efficient bioinformatic tools. Here we present a protein assembler (MetaPA), based on de Bruijn graph searching on oligopeptide spaces and can be applied on both metagenomic and metatranscriptomic sequencing data. When public homologous protein sequences are involved to guide the assembling procedures, MetaPA assembles 85% of total proteins in complete sequences with high precision of 83% on real high-throughput sequencing datasets. Application of MetaPA on metatranscriptomic data successfully identifies the majority of actively transcribed genes validated in related studies. The results suggest that MetaPA has a good potential in both metagenomic and metatranscriptomic studies to characterize the composition and abundance of microbiota.
© The Author(s) 2019. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 30657979      PMCID: PMC6412133          DOI: 10.1093/nar/gkz017

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  36 in total

1.  An Eulerian path approach to DNA fragment assembly.

Authors:  P A Pevzner; H Tang; M S Waterman
Journal:  Proc Natl Acad Sci U S A       Date:  2001-08-14       Impact factor: 11.205

2.  IDBA-UD: a de novo assembler for single-cell and metagenomic sequencing data with highly uneven depth.

Authors:  Yu Peng; Henry C M Leung; S M Yiu; Francis Y L Chin
Journal:  Bioinformatics       Date:  2012-04-11       Impact factor: 6.937

3.  A human gut microbial gene catalogue established by metagenomic sequencing.

Authors:  Junjie Qin; Ruiqiang Li; Jeroen Raes; Manimozhiyan Arumugam; Kristoffer Solvsten Burgdorf; Chaysavanh Manichanh; Trine Nielsen; Nicolas Pons; Florence Levenez; Takuji Yamada; Daniel R Mende; Junhua Li; Junming Xu; Shaochuan Li; Dongfang Li; Jianjun Cao; Bo Wang; Huiqing Liang; Huisong Zheng; Yinlong Xie; Julien Tap; Patricia Lepage; Marcelo Bertalan; Jean-Michel Batto; Torben Hansen; Denis Le Paslier; Allan Linneberg; H Bjørn Nielsen; Eric Pelletier; Pierre Renault; Thomas Sicheritz-Ponten; Keith Turner; Hongmei Zhu; Chang Yu; Shengting Li; Min Jian; Yan Zhou; Yingrui Li; Xiuqing Zhang; Songgang Li; Nan Qin; Huanming Yang; Jian Wang; Søren Brunak; Joel Doré; Francisco Guarner; Karsten Kristiansen; Oluf Pedersen; Julian Parkhill; Jean Weissenbach; Peer Bork; S Dusko Ehrlich; Jun Wang
Journal:  Nature       Date:  2010-03-04       Impact factor: 49.962

4.  Archaea predominate among ammonia-oxidizing prokaryotes in soils.

Authors:  S Leininger; T Urich; M Schloter; L Schwark; J Qi; G W Nicol; J I Prosser; S C Schuster; C Schleper
Journal:  Nature       Date:  2006-08-17       Impact factor: 49.962

5.  Recovery of nearly 8,000 metagenome-assembled genomes substantially expands the tree of life.

Authors:  Donovan H Parks; Christian Rinke; Maria Chuvochina; Pierre-Alain Chaumeil; Ben J Woodcroft; Paul N Evans; Philip Hugenholtz; Gene W Tyson
Journal:  Nat Microbiol       Date:  2017-09-11       Impact factor: 17.745

6.  Toward molecular trait-based ecology through integration of biogeochemical, geographical and metagenomic data.

Authors:  Jeroen Raes; Ivica Letunic; Takuji Yamada; Lars Juhl Jensen; Peer Bork
Journal:  Mol Syst Biol       Date:  2011-03-15       Impact factor: 11.429

7.  OrfPredictor: predicting protein-coding regions in EST-derived sequences.

Authors:  Xiang Jia Min; Gregory Butler; Reginald Storms; Adrian Tsang
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

8.  Complex archaea that bridge the gap between prokaryotes and eukaryotes.

Authors:  Anja Spang; Jimmy H Saw; Steffen L Jørgensen; Katarzyna Zaremba-Niedzwiedzka; Joran Martijn; Anders E Lind; Roel van Eijk; Christa Schleper; Lionel Guy; Thijs J G Ettema
Journal:  Nature       Date:  2015-05-06       Impact factor: 49.962

9.  metaSPAdes: a new versatile metagenomic assembler.

Authors:  Sergey Nurk; Dmitry Meleshko; Anton Korobeynikov; Pavel A Pevzner
Journal:  Genome Res       Date:  2017-03-15       Impact factor: 9.043

10.  Bioprospecting metagenomes: glycosyl hydrolases for converting biomass.

Authors:  Luen-Luen Li; Sean R McCorkle; Sebastien Monchy; Safiyh Taghavi; Daniel van der Lelie
Journal:  Biotechnol Biofuels       Date:  2009-05-18       Impact factor: 6.040

View more
  1 in total

1.  GIMICA: host genetic and immune factors shaping human microbiota.

Authors:  Jing Tang; Xianglu Wu; Minjie Mou; Chuan Wang; Lidan Wang; Fengcheng Li; Maiyuan Guo; Jiayi Yin; Wenqin Xie; Xiaona Wang; Yingxiong Wang; Yubin Ding; Weiwei Xue; Feng Zhu
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.