
UFO: a web server for ultra-fast functional profiling of whole genome protein sequences.

Peter Meinicke.

Abstract

BACKGROUND: Functional profiling is a key technique to characterize and compare the functional potential of entire genomes. Estimating profiles by assigning sequences to functional categories is computationally expensive, because it requires comparing all protein sequences from a genome with a usually large database of annotated sequences or sequence families.
DESCRIPTION: Based on machine learning techniques for Pfam domain detection, the UFO web server for ultra-fast functional profiling allows researchers to process large protein sequence collections instantaneously. Besides the frequencies of Pfam and GO categories, the user also obtains the sequence-specific assignments to Pfam domain families. In addition, a comparison with existing genomes provides dissimilarity scores with respect to 821 reference proteomes. For the underlying UFO domain detection, the results on 206 test genomes indicate a high sensitivity of the approach. In comparison with current state-of-the-art HMMs, the runtime measurements show a considerable speedup in the range of four orders of magnitude. For an average-sized prokaryotic genome, the computation of a functional profile together with its comparison typically requires about 10 seconds of processing time.
CONCLUSION: For the first time, the UFO web server makes it possible to get a quick overview of the functional inventory of newly sequenced organisms. The genome-scale comparison with a large number of precomputed profiles allows a first guess about functionally related organisms. The service is freely available and does not require user registration or specification of a valid email address.
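
The profiling and comparison steps described above can be illustrated with a minimal sketch. It is not the UFO implementation: the function names are hypothetical, the Pfam accessions serve only as illustrative labels, and the Euclidean distance is an assumption, since the abstract does not state which dissimilarity measure the server uses. The sketch turns per-sequence Pfam assignments into a frequency profile and ranks reference profiles by their distance to it.

```python
from collections import Counter
import math


def functional_profile(assignments, families):
    """Relative frequency of each Pfam family among all assigned domains.

    assignments: dict mapping a sequence ID to a list of Pfam accessions.
    families: fixed, ordered list of accessions defining the profile axes.
    """
    counts = Counter(fam for hits in assignments.values() for fam in hits)
    total = sum(counts[fam] for fam in families)
    if total == 0:
        # No domain was assigned to any sequence: return an all-zero profile.
        return [0.0] * len(families)
    return [counts[fam] / total for fam in families]


def dissimilarity(profile_a, profile_b):
    """Euclidean distance between two profiles (assumed measure)."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(profile_a, profile_b)))


def rank_references(query_profile, reference_profiles):
    """Sort reference proteomes from most to least similar to the query."""
    scored = [
        (name, dissimilarity(query_profile, profile))
        for name, profile in reference_profiles.items()
    ]
    return sorted(scored, key=lambda pair: pair[1])


# Toy data: three query proteins and two made-up reference profiles.
families = ["PF00005", "PF00072", "PF00106"]
assignments = {
    "prot1": ["PF00005"],
    "prot2": ["PF00005", "PF00072"],
    "prot3": [],  # a sequence without any detected domain
}
references = {
    "ref_A": [0.6, 0.4, 0.0],
    "ref_B": [0.1, 0.1, 0.8],
}

query = functional_profile(assignments, families)
for name, score in rank_references(query, references):
    print(f"{name}\t{score:.3f}")
```

With precomputed reference profiles, the comparison step is only a pass over fixed-length vectors, so most of the cost lies in the domain detection that produces the assignments.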

Year: 2009; PMID: 19725959; PMCID: PMC2744726; DOI: 10.1186/1471-2164-10-409

Source DB: PubMed; Journal: BMC Genomics; ISSN: 1471-2164; Impact factor: 3.969


