Literature DB >> 15273311

Proteome-wide functional classification and identification of prokaryotic transmembrane proteins by transmembrane topology similarity comparison.

Masafumi Arai1, Kosuke Okumura, Masanobu Satake, Toshio Shimizu.   

Abstract

We propose a new method for classifying and identifying transmembrane (TM) protein functions in proteome-scale by applying a single-linkage clustering method based on TM topology similarity, which is calculated simply from comparing the lengths of loop regions. In this study, we focused on 87 prokaryotic TM proteomes consisting of 31 proteobacteria, 22 gram-positive bacteria, 19 other bacteria, and 15 archaea. Prior to performing the clustering, we first categorized individual TM protein sequences as "known," "putative" (similar to "known" sequences), or "unknown" by using the homology search and the sequence similarity comparison against SWISS-PROT to assess the current status of the functional annotation of the TM proteomes based on sequence similarity only. More than three-quarters, that is, 75.7% of the TM protein sequences are functionally "unknown," with only 3.8% and 20.5% of them being classified as "known" and "putative," respectively. Using our clustering approach based on TM topology similarity, we succeeded in increasing the rate of TM protein sequences functionally classified and identified from 24.3% to 60.9%. Obtained clusters correspond well to functional superfamilies or families, and the functional classification and identification are successfully achieved by this approach. For example, in an obtained cluster of TM proteins with six TM segments, 109 sequences out of 119 sequences annotated as "ATP-binding cassette transporter" are properly included and 122 "unknown" sequences are also contained.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15273311      PMCID: PMC2279829          DOI: 10.1110/ps.04814404

Source DB:  PubMed          Journal:  Protein Sci        ISSN: 0961-8368            Impact factor:   6.725


  32 in total

1.  Transmembrane topology prediction methods: a re-assessment and improvement by a consensus method using a dataset of experimentally-characterized transmembrane topologies.

Authors:  Masami Ikeda; Masafumi Arai; Demelo M Lao; Toshio Shimizu
Journal:  In Silico Biol       Date:  2002

2.  The HMMTOP transmembrane topology prediction server.

Authors:  G E Tusnády; I Simon
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

3.  The KEGG resource for deciphering the genome.

Authors:  Minoru Kanehisa; Susumu Goto; Shuichi Kawashima; Yasushi Okuno; Masahiro Hattori
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

4.  GenBank: update.

Authors:  Dennis A Benson; Ilene Karsch-Mizrachi; David J Lipman; James Ostell; David L Wheeler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

5.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Authors:  Brigitte Boeckmann; Amos Bairoch; Rolf Apweiler; Marie-Claude Blatter; Anne Estreicher; Elisabeth Gasteiger; Maria J Martin; Karine Michoud; Claire O'Donovan; Isabelle Phan; Sandrine Pilbout; Michel Schneider
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

6.  Comprehensive analysis of transmembrane topologies in prokaryotic genomes.

Authors:  Masafumi Arai; Masami Ikeda; Toshio Shimizu
Journal:  Gene       Date:  2003-01-30       Impact factor: 3.688

7.  Identification of transmembrane protein functions by binary topology patterns.

Authors:  Yoshiaki Sugiyama; Natalia Polulyakh; Toshio Shimizu
Journal:  Protein Eng       Date:  2003-07

8.  The presence of signal peptide significantly affects transmembrane topology prediction.

Authors:  Demelo M Lao; Masafumi Arai; Masami Ikeda; Toshio Shimizu
Journal:  Bioinformatics       Date:  2002-12       Impact factor: 6.937

9.  A functional update of the Escherichia coli K-12 genome.

Authors:  M H Serres; S Gopal; L A Nahum; P Liang; T Gaasterland; M Riley
Journal:  Genome Biol       Date:  2001-08-20       Impact factor: 13.583

10.  The Pfam protein families database.

Authors:  Alex Bateman; Lachlan Coin; Richard Durbin; Robert D Finn; Volker Hollich; Sam Griffiths-Jones; Ajay Khanna; Mhairi Marshall; Simon Moxon; Erik L L Sonnhammer; David J Studholme; Corin Yeats; Sean R Eddy
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more
  1 in total

1.  Functional mining of transporters using synthetic selections.

Authors:  Hans J Genee; Anne P Bali; Søren D Petersen; Solvej Siedler; Mads T Bonde; Luisa S Gronenberg; Mette Kristensen; Scott J Harrison; Morten O A Sommer
Journal:  Nat Chem Biol       Date:  2016-10-03       Impact factor: 15.040

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.