Literature DB >> 14681402

The SUPERFAMILY database in 2004: additions and improvements.

Martin Madera1, Christine Vogel, Sarah K Kummerfeld, Cyrus Chothia, Julian Gough.   

Abstract

The SUPERFAMILY database provides structural assignments to protein sequences and a framework for analysis of the results. At the core of the database is a library of profile Hidden Markov Models that represent all proteins of known structure. The library is based on the SCOP classification of proteins: each model corresponds to a SCOP domain and aims to represent an entire superfamily. We have applied the library to predicted proteins from all completely sequenced genomes (currently 154), the Swiss-Prot and TrEMBL databases and other sequence collections. Close to 60% of all proteins have at least one match, and one half of all residues are covered by assignments. All models and full results are available for download and online browsing at http://supfam.org. Users can study the distribution of their superfamily of interest across all completely sequenced genomes, investigate with which other superfamilies it combines and retrieve proteins in which it occurs. Alternatively, concentrating on a particular genome as a whole, it is possible first, to find out its superfamily composition, and secondly, to compare it with that of other genomes to detect superfamilies that are over- or under-represented. In addition, the webserver provides the following standard services: sequence search; keyword search for genomes, superfamilies and sequence identifiers; and multiple alignment of genomic, PDB and custom sequences.

Mesh:

Substances:

Year:  2004        PMID: 14681402      PMCID: PMC308851          DOI: 10.1093/nar/gkh117

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  20 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Comparison of sequence profiles. Strategies for structural predictions using sequence information.

Authors:  L Rychlewski; L Jaroszewski; W Li; A Godzik
Journal:  Protein Sci       Date:  2000-02       Impact factor: 6.725

3.  SUPERFAMILY: HMMs representing all proteins of known structure. SCOP sequence searches, alignments and genome assignments.

Authors:  Julian Gough; Cyrus Chothia
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

4.  Within the twilight zone: a sensitive profile-profile comparison tool based on information theory.

Authors:  Golan Yona; Michael Levitt
Journal:  J Mol Biol       Date:  2002-02-01       Impact factor: 5.469

5.  Domain combinations in archaeal, eubacterial and eukaryotic proteomes.

Authors:  G Apic; J Gough; S A Teichmann
Journal:  J Mol Biol       Date:  2001-07-06       Impact factor: 5.469

6.  InterProScan--an integration platform for the signature-recognition methods in InterPro.

Authors:  E M Zdobnov; R Apweiler
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

7.  The InterPro Database, 2003 brings increased coverage and new features.

Authors:  Nicola J Mulder; Rolf Apweiler; Teresa K Attwood; Amos Bairoch; Daniel Barrell; Alex Bateman; David Binns; Margaret Biswas; Paul Bradley; Peer Bork; Phillip Bucher; Richard R Copley; Emmanuel Courcelle; Ujjwal Das; Richard Durbin; Laurent Falquet; Wolfgang Fleischmann; Sam Griffiths-Jones; Daniel Haft; Nicola Harte; Nicolas Hulo; Daniel Kahn; Alexander Kanapin; Maria Krestyaninova; Rodrigo Lopez; Ivica Letunic; David Lonsdale; Ville Silventoinen; Sandra E Orchard; Marco Pagni; David Peyruc; Chris P Ponting; Jeremy D Selengut; Florence Servant; Christian J A Sigrist; Robert Vaughan; Evgueni M Zdobnov
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

8.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003.

Authors:  Brigitte Boeckmann; Amos Bairoch; Rolf Apweiler; Marie-Claude Blatter; Anne Estreicher; Elisabeth Gasteiger; Maria J Martin; Karine Michoud; Claire O'Donovan; Isabelle Phan; Sandrine Pilbout; Michel Schneider
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  A comparison of profile hidden Markov model procedures for remote homology detection.

Authors:  Martin Madera; Julian Gough
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

10.  Genome sequence of the dissimilatory metal ion-reducing bacterium Shewanella oneidensis.

Authors:  John F Heidelberg; Ian T Paulsen; Karen E Nelson; Eric J Gaidos; William C Nelson; Timothy D Read; Jonathan A Eisen; Rekha Seshadri; Naomi Ward; Barbara Methe; Rebecca A Clayton; Terry Meyer; Alexandre Tsapin; James Scott; Maureen Beanan; Lauren Brinkac; Sean Daugherty; Robert T DeBoy; Robert J Dodson; A Scott Durkin; Daniel H Haft; James F Kolonay; Ramana Madupu; Jeremy D Peterson; Lowell A Umayam; Owen White; Alex M Wolf; Jessica Vamathevan; Janice Weidman; Marjorie Impraim; Kathy Lee; Kristy Berry; Chris Lee; Jacob Mueller; Hoda Khouri; John Gill; Terry R Utterback; Lisa A McDonald; Tamara V Feldblyum; Hamilton O Smith; J Craig Venter; Kenneth H Nealson; Claire M Fraser
Journal:  Nat Biotechnol       Date:  2002-10-07       Impact factor: 54.908

View more
  109 in total

1.  An assessment of substitution scores for protein profile-profile comparison.

Authors:  Xugang Ye; Guoli Wang; Stephen F Altschul
Journal:  Bioinformatics       Date:  2011-10-13       Impact factor: 6.937

2.  Isolation and characterization of a novel v-SNARE family protein that interacts with a calcium-dependent protein kinase from the common ice plant, Mesembryanthemum crystallinum.

Authors:  E Wassim Chehab; O Rahul Patharkar; John C Cushman
Journal:  Planta       Date:  2007-03       Impact factor: 4.116

3.  Novel specificities emerge by stepwise duplication of functional modules.

Authors:  José B Pereira-Leal; Sarah A Teichmann
Journal:  Genome Res       Date:  2005-04       Impact factor: 9.043

4.  New genes from old: redeployment of dUTPase by herpesviruses.

Authors:  Andrew J Davison; Nigel D Stow
Journal:  J Virol       Date:  2005-10       Impact factor: 5.103

Review 5.  The limits of protein sequence comparison?

Authors:  William R Pearson; Michael L Sierk
Journal:  Curr Opin Struct Biol       Date:  2005-06       Impact factor: 6.809

6.  Assessing strategies for improved superfamily recognition.

Authors:  Ian Sillitoe; Mark Dibley; James Bray; Sarah Addou; Christine Orengo
Journal:  Protein Sci       Date:  2005-06-03       Impact factor: 6.725

7.  Structure of phage protein BC1872 from Bacillus cereus, a singleton with new fold.

Authors:  R Zhang; G Joachimiak; S Jiang; A Cipriani; F Collart; A Joachimiak
Journal:  Proteins       Date:  2006-07-01

8.  Crystal structure of YfeU protein from Haemophilus influenzae: a predicted etherase involved in peptidoglycan recycling.

Authors:  Y Kim; P Quartey; R Ng; T I Zarembinski; A Joachimiak
Journal:  J Struct Funct Genomics       Date:  2009-02-21

9.  Functional conservation of erythropoietin signaling in zebrafish.

Authors:  Noëlle Paffett-Lugassy; Nelson Hsia; Paula G Fraenkel; Barry Paw; Irene Leshinsky; Bruce Barut; Nathan Bahary; Jaime Caro; Robert Handin; Leonard I Zon
Journal:  Blood       Date:  2007-06-19       Impact factor: 22.113

10.  delayed flowering1 Encodes a basic leucine zipper protein that mediates floral inductive signals at the shoot apex in maize.

Authors:  Michael G Muszynski; Thao Dam; Bailin Li; David M Shirbroun; Zhenglin Hou; Edward Bruggemann; Rayeann Archibald; Evgueni V Ananiev; Olga N Danilevskaya
Journal:  Plant Physiol       Date:  2006-10-27       Impact factor: 8.340

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.