Literature DB >> 12824419

UniqueProt: Creating representative protein sequence sets.

Sven Mika1, Burkhard Rost.   

Abstract

UniqueProt is a practical and easy to use web service designed to create representative, unbiased data sets of protein sequences. The largest possible representative sets are found through a simple greedy algorithm using the HSSP-value to establish sequence similarity. UniqueProt is not a real clustering program in the sense that the 'representatives' are not at the centres of well-defined clusters since the definition of such clusters is problem-specific. Overall, UniqueProt is a reasonable fast solution for bias in data sets. The service is accessible at http://cubic.bioc.columbia.edu/services/uniqueprot; a command-line version for Linux is downloadable from this web site.

Mesh:

Substances:

Year:  2003        PMID: 12824419      PMCID: PMC169026          DOI: 10.1093/nar/gkg620

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  19 in total

1.  The Protein Data Bank.

Authors:  H M Berman; J Westbrook; Z Feng; G Gilliland; T N Bhat; H Weissig; I N Shindyalov; P E Bourne
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Assessing annotation transfer for genomics: quantifying the relations between protein sequence, structure and function through traditional and probabilistic scores.

Authors:  C A Wilson; J Kreychman; M Gerstein
Journal:  J Mol Biol       Date:  2000-03-17       Impact factor: 5.469

3.  Twilight zone of protein sequence alignments.

Authors:  B Rost
Journal:  Protein Eng       Date:  1999-02

4.  Enzyme function less conserved than anticipated.

Authors:  Burkhard Rost
Journal:  J Mol Biol       Date:  2002-04-26       Impact factor: 5.469

5.  Database of homology-derived protein structures and the structural meaning of sequence alignment.

Authors:  C Sander; R Schneider
Journal:  Proteins       Date:  1991

6.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

7.  Local alignment statistics.

Authors:  S F Altschul; W Gish
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

Review 8.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

9.  Enlarged representative set of protein structures.

Authors:  U Hobohm; C Sander
Journal:  Protein Sci       Date:  1994-03       Impact factor: 6.725

10.  Intermediate sequences increase the detection of homology between sequences.

Authors:  J Park; S A Teichmann; T Hubbard; C Chothia
Journal:  J Mol Biol       Date:  1997-10-17       Impact factor: 5.469

View more
  42 in total

1.  NLProt: extracting protein names and sequences from papers.

Authors:  Sven Mika; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

2.  Predicting transmembrane beta-barrels in proteomes.

Authors:  Henry R Bigelow; Donald S Petrey; Jinfeng Liu; Dariusz Przybylski; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2004-05-11       Impact factor: 16.971

3.  How do side chains orient globally in protein structures?

Authors:  Aimin Yan; Robert L Jernigan
Journal:  Proteins       Date:  2005-11-15

4.  Characterization of metalloproteins by high-throughput X-ray absorption spectroscopy.

Authors:  Wuxian Shi; Marco Punta; Jen Bohon; J Michael Sauder; Rhijuta D'Mello; Mike Sullivan; John Toomey; Don Abel; Marco Lippi; Andrea Passerini; Paolo Frasconi; Stephen K Burley; Burkhard Rost; Mark R Chance
Journal:  Genome Res       Date:  2011-04-11       Impact factor: 9.043

5.  LocTree3 prediction of localization.

Authors:  Tatyana Goldberg; Maximilian Hecht; Tobias Hamp; Timothy Karl; Guy Yachdav; Nadeem Ahmed; Uwe Altermann; Philipp Angerer; Sonja Ansorge; Kinga Balasz; Michael Bernhofer; Alexander Betz; Laura Cizmadija; Kieu Trinh Do; Julia Gerke; Robert Greil; Vadim Joerdens; Maximilian Hastreiter; Katharina Hembach; Max Herzog; Maria Kalemanov; Michael Kluge; Alice Meier; Hassan Nasir; Ulrich Neumaier; Verena Prade; Jonas Reeb; Aleksandr Sorokoumov; Ilira Troshani; Susann Vorberg; Sonja Waldraff; Jonas Zierer; Henrik Nielsen; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2014-05-21       Impact factor: 16.971

6.  SSpro/ACCpro 5: almost perfect prediction of protein secondary structure and relative solvent accessibility using profiles, machine learning and structural similarity.

Authors:  Christophe N Magnan; Pierre Baldi
Journal:  Bioinformatics       Date:  2014-05-24       Impact factor: 6.937

7.  TMSEG: Novel prediction of transmembrane helices.

Authors:  Michael Bernhofer; Edda Kloppmann; Jonas Reeb; Burkhard Rost
Journal:  Proteins       Date:  2016-09-16

8.  Knotted vs. unknotted proteins: evidence of knot-promoting loops.

Authors:  Raffaello Potestio; Cristian Micheletti; Henri Orland
Journal:  PLoS Comput Biol       Date:  2010-07-29       Impact factor: 4.475

9.  Protein secondary structure appears to be robust under in silico evolution while protein disorder appears not to be.

Authors:  Christian Schaefer; Avner Schlessinger; Burkhard Rost
Journal:  Bioinformatics       Date:  2010-01-16       Impact factor: 6.937

10.  Ab initio and homology based prediction of protein domains by recursive neural networks.

Authors:  Ian Walsh; Alberto J M Martin; Catherine Mooney; Enrico Rubagotti; Alessandro Vullo; Gianluca Pollastri
Journal:  BMC Bioinformatics       Date:  2009-06-26       Impact factor: 3.169

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.