Literature DB >> 18219596

In silico characterization of proteins: UniProt, InterPro and Integr8.

Nicola Jane Mulder1, Paul Kersey, Manuela Pruess, Rolf Apweiler.   

Abstract

Nucleic acid sequences from genome sequencing projects are submitted as raw data, from which biologists attempt to elucidate the function of the predicted gene products. The protein sequences are stored in public databases, such as the UniProt Knowledgebase (UniProtKB), where curators try to add predicted and experimental functional information. Protein function prediction can be done using sequence similarity searches, but an alternative approach is to use protein signatures, which classify proteins into families and domains. The major protein signature databases are available through the integrated InterPro database, which provides a classification of UniProtKB sequences. As well as characterization of proteins through protein families, many researchers are interested in analyzing the complete set of proteins from a genome (i.e. the proteome), and there are databases and resources that provide non-redundant proteome sets and analyses of proteins from organisms with completely sequenced genomes. This article reviews the tools and resources available on the web for single and large-scale protein characterization and whole proteome analysis.

Mesh:

Substances:

Year:  2007        PMID: 18219596     DOI: 10.1007/s12033-007-9003-x

Source DB:  PubMed          Journal:  Mol Biotechnol        ISSN: 1073-6085            Impact factor:   2.695


  48 in total

1.  Clustering of highly homologous sequences to reduce the size of large protein databases.

Authors:  W Li; L Jaroszewski; A Godzik
Journal:  Bioinformatics       Date:  2001-03       Impact factor: 6.937

2.  PIRSF: family classification system at the Protein Information Resource.

Authors:  Cathy H Wu; Anastasia Nikolskaya; Hongzhan Huang; Lai-Su L Yeh; Darren A Natale; C R Vinayaka; Zhang-Zhi Hu; Raja Mazumder; Sandeep Kumar; Panagiotis Kourtesis; Robert S Ledley; Baris E Suzek; Leslie Arminski; Yongxing Chen; Jian Zhang; Jorge Louie Cardenas; Sehee Chung; Jorge Castro-Alvear; Georgi Dinkov; Winona C Barker
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  PROSITE: a documented database using patterns and profiles as motif descriptors.

Authors:  Christian J A Sigrist; Lorenzo Cerutti; Nicolas Hulo; Alexandre Gattiker; Laurent Falquet; Marco Pagni; Amos Bairoch; Philipp Bucher
Journal:  Brief Bioinform       Date:  2002-09       Impact factor: 11.622

4.  Profile analysis.

Authors:  M Gribskov; R Lüthy; D Eisenberg
Journal:  Methods Enzymol       Date:  1990       Impact factor: 1.600

5.  Optimal alignments in linear space.

Authors:  E W Myers; W Miller
Journal:  Comput Appl Biosci       Date:  1988-03

6.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

7.  The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community.

Authors:  Seung Yon Rhee; William Beavis; Tanya Z Berardini; Guanghong Chen; David Dixon; Aisling Doyle; Margarita Garcia-Hernandez; Eva Huala; Gabriel Lander; Mary Montoya; Neil Miller; Lukas A Mueller; Suparna Mundodi; Leonore Reiser; Julie Tacklind; Dan C Weems; Yihe Wu; Iris Xu; Daniel Yoo; Jungwon Yoon; Peifen Zhang
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

8.  DDBJ in preparation for overview of research activities behind data submissions.

Authors:  Kousaku Okubo; Hideaki Sugawara; Takashi Gojobori; Yoshio Tateno
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution.

Authors:  Lesley H Greene; Tony E Lewis; Sarah Addou; Alison Cuff; Tim Dallman; Mark Dibley; Oliver Redfern; Frances Pearl; Rekha Nambudiry; Adam Reid; Ian Sillitoe; Corin Yeats; Janet M Thornton; Christine A Orengo
Journal:  Nucleic Acids Res       Date:  2006-11-29       Impact factor: 16.971

10.  InterProScan: protein domains identifier.

Authors:  E Quevillon; V Silventoinen; S Pillai; N Harte; N Mulder; R Apweiler; R Lopez
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

View more
  20 in total

Review 1.  Bioinformatic analyses of transmembrane transport: novel software for deducing protein phylogeny, topology, and evolution.

Authors:  Ming Ren Yen; Jeehye Choi; Milton H Saier
Journal:  J Mol Microbiol Biotechnol       Date:  2009-09-18

2.  Using comparative genomics to uncover new kinds of protein-based metabolic organelles in bacteria.

Authors:  Julien Jorda; David Lopez; Nicole M Wheatley; Todd O Yeates
Journal:  Protein Sci       Date:  2013-01-04       Impact factor: 6.725

Review 3.  Visualizing viral protein structures in cells using genetic probes for correlated light and electron microscopy.

Authors:  Horng D Ou; Thomas J Deerinck; Eric Bushong; Mark H Ellisman; Clodagh C O'Shea
Journal:  Methods       Date:  2015-06-09       Impact factor: 3.608

4.  GeMMA: functional subfamily classification within superfamilies of predicted protein structural domains.

Authors:  David A Lee; Robert Rentzsch; Christine Orengo
Journal:  Nucleic Acids Res       Date:  2009-11-18       Impact factor: 16.971

Review 5.  Bioinformatics and molecular modeling in glycobiology.

Authors:  Martin Frank; Siegfried Schloissnig
Journal:  Cell Mol Life Sci       Date:  2010-04-04       Impact factor: 9.261

6.  In silico prediction of antimalarial drug target candidates.

Authors:  Philipp Ludin; Ben Woodcroft; Stuart A Ralph; Pascal Mäser
Journal:  Int J Parasitol Drugs Drug Resist       Date:  2012-07-17       Impact factor: 4.077

7.  Male-specific region of the bovine Y chromosome is gene rich with a high transcriptomic activity in testis development.

Authors:  Ti-Cheng Chang; Yang Yang; Ernest F Retzel; Wan-Sheng Liu
Journal:  Proc Natl Acad Sci U S A       Date:  2013-07-10       Impact factor: 11.205

8.  Strepto-DB, a database for comparative genomics of group A (GAS) and B (GBS) streptococci, implemented with the novel database platform 'Open Genome Resource' (OGeR).

Authors:  Johannes Klein; Richard Münch; Ilona Biegler; Isam Haddad; Ida Retter; Dieter Jahn
Journal:  Nucleic Acids Res       Date:  2008-10-14       Impact factor: 16.971

Review 9.  From protein sequences to 3D-structures and beyond: the example of the UniProt knowledgebase.

Authors:  Ursula Hinz
Journal:  Cell Mol Life Sci       Date:  2009-12-31       Impact factor: 9.261

10.  The genome of the heartworm, Dirofilaria immitis, reveals drug and vaccine targets.

Authors:  Christelle Godel; Sujai Kumar; Georgios Koutsovoulos; Philipp Ludin; Daniel Nilsson; Francesco Comandatore; Nicola Wrobel; Marian Thompson; Christoph D Schmid; Susumu Goto; Frédéric Bringaud; Adrian Wolstenholme; Claudio Bandi; Christian Epe; Ronald Kaminsky; Mark Blaxter; Pascal Mäser
Journal:  FASEB J       Date:  2012-08-13       Impact factor: 5.191

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.