Literature DB >> 28454776

Functional association prediction by community profiling.

Dazhi Jiao1, Wontack Han1, Yuzhen Ye2.   

Abstract

Recent years have witnessed unprecedented accumulation of DNA sequences and therefore protein sequences (predicted from DNA sequences), due to the advances of sequencing technology. One of the major sources of the hypothetical proteins is the metagenomics research. Current annotation of metagenomes (collections of short metagenomic sequences or assemblies) relies on similarity searches against known gene/protein families, based on which functional profiles of microbial communities can be built. This practice, however, leaves out the hypothetical proteins, which may outnumber the known proteins for many microbial communities. On the other hand, we may ask: what can we gain from the large number of metagenomes made available by the metagenomic studies, for the annotation of metagenomic sequences as well as functional annotation of hypothetical proteins in general? Here we propose a community profiling approach for predicting functional associations between proteins: two proteins are predicted to be associated if they share similar presence and absence profiles (called community profiles) across microbial communities. Community profiling is conceptually similar to the phylogenetic profiling approach to functional prediction, however with fundamental differences. We tested different profile construction methods, the selection of reference metagenomes, and correlation metrics, among others, to optimize the performance of this new approach. We demonstrated that the community profiling approach alone slightly outperforms the phylogenetic profiling approach for associating proteins in species that are well represented by sequenced genomes, and combining phylogenetic and community profiling further improves (though only marginally) the prediction of functional association. Further we showed that community profiling method significantly outperforms phylogenetic profiling, revealing more functional associations, when applied to a more recently sequenced bacterial genome.
Copyright © 2017 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Community profiling; Functional association prediction; Guilt-by-association; Metagenomics; Phylogenetic profiling

Mesh:

Year:  2017        PMID: 28454776      PMCID: PMC5643221          DOI: 10.1016/j.ymeth.2017.04.018

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  44 in total

1.  The use of gene clusters to infer functional coupling.

Authors:  R Overbeek; M Fonstein; M D'Souza; G D Pusch; N Maltsev
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

2.  Functional discovery via a compendium of expression profiles.

Authors:  T R Hughes; M J Marton; A R Jones; C J Roberts; R Stoughton; C D Armour; H A Bennett; E Coffey; H Dai; Y D He; M J Kidd; A M King; M R Meyer; D Slade; P Y Lum; S B Stepaniants; D D Shoemaker; D Gachotte; K Chakraburtty; J Simon; M Bard; S H Friend
Journal:  Cell       Date:  2000-07-07       Impact factor: 41.582

3.  Assigning protein functions by comparative genome analysis: protein phylogenetic profiles.

Authors:  M Pellegrini; E M Marcotte; M J Thompson; D Eisenberg; T O Yeates
Journal:  Proc Natl Acad Sci U S A       Date:  1999-04-13       Impact factor: 11.205

Review 4.  Predicting protein function from sequence and structure.

Authors:  David Lee; Oliver Redfern; Christine Orengo
Journal:  Nat Rev Mol Cell Biol       Date:  2007-12       Impact factor: 94.444

5.  Entropy-scaling search of massive biological data.

Authors:  Y William Yu; Noah M Daniels; David Christian Danko; Bonnie Berger
Journal:  Cell Syst       Date:  2015-08-26       Impact factor: 10.304

Review 6.  Taking it Personally: Personalized Utilization of the Human Microbiome in Health and Disease.

Authors:  Niv Zmora; David Zeevi; Tal Korem; Eran Segal; Eran Elinav
Journal:  Cell Host Microbe       Date:  2016-01-13       Impact factor: 21.023

7.  FragGeneScan: predicting genes in short and error-prone reads.

Authors:  Mina Rho; Haixu Tang; Yuzhen Ye
Journal:  Nucleic Acids Res       Date:  2010-08-30       Impact factor: 16.971

8.  IMG/M: the integrated metagenome data management and comparative analysis system.

Authors:  Victor M Markowitz; I-Min A Chen; Ken Chu; Ernest Szeto; Krishna Palaniappan; Yuri Grechkin; Anna Ratner; Biju Jacob; Amrita Pati; Marcel Huntemann; Konstantinos Liolios; Ioanna Pagani; Iain Anderson; Konstantinos Mavromatis; Natalia N Ivanova; Nikos C Kyrpides
Journal:  Nucleic Acids Res       Date:  2011-11-15       Impact factor: 16.971

9.  Detection of evolutionarily stable fragments of cellular pathways by hierarchical clustering of phyletic patterns.

Authors:  Galina V Glazko; Arcady R Mushegian
Journal:  Genome Biol       Date:  2004-04-27       Impact factor: 13.583

10.  The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST).

Authors:  Ross Overbeek; Robert Olson; Gordon D Pusch; Gary J Olsen; James J Davis; Terry Disz; Robert A Edwards; Svetlana Gerdes; Bruce Parrello; Maulik Shukla; Veronika Vonstein; Alice R Wattam; Fangfang Xia; Rick Stevens
Journal:  Nucleic Acids Res       Date:  2013-11-29       Impact factor: 16.971

View more
  3 in total

1.  Metabolomics as an Emerging Tool in the Search for Astrobiologically Relevant Biomarkers.

Authors:  Lauren Seyler; Elizabeth B Kujawinski; Armando Azua-Bustos; Michael D Lee; Jeffrey Marlow; Scott M Perl; Henderson James Cleaves Ii
Journal:  Astrobiology       Date:  2020-06-17       Impact factor: 4.335

2.  A repository of microbial marker genes related to human health and diseases for host phenotype prediction using microbiome data.

Authors:  Wontack Han; Yuzhen Ye
Journal:  Pac Symp Biocomput       Date:  2019

3.  Machine learning methods and systems for data-driven discovery in biomedical informatics.

Authors:  Sungroh Yoon; Seunghak Lee; Wei Wang
Journal:  Methods       Date:  2017-10-01       Impact factor: 3.608

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.