Literature DB >> 17277330

SCOOP: a simple method for identification of novel protein superfamily relationships.

Alex Bateman1, Robert D Finn.   

Abstract

MOTIVATION: Profile searches of sequence databases are a sensitive way to detect sequence relationships. Sophisticated profile-profile comparison algorithms that have been recently introduced increase search sensitivity even further.
RESULTS: In this article, a simpler approach than profile-profile comparison is presented that has a comparable performance to state-of-the-art tools such as COMPASS, HHsearch and PRC. This approach is called SCOOP (Simple Comparison Of Outputs Program), and is shown to find known relationships between families in the Pfam database as well as detect novel distant relationships between families. Several novel discoveries are presented including the discovery that a domain of unknown function (DUF283) found in Dicer proteins is related to double-stranded RNA-binding domains. AVAILABILITY: SCOOP is freely available under a GNU GPL license from http://www.sanger.ac.uk/Users/agb/SCOOP/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2007        PMID: 17277330      PMCID: PMC2603044          DOI: 10.1093/bioinformatics/btm034

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  15 in total

1.  A ubiquitin-binding motif required for intramolecular monoubiquitylation, the CUE domain.

Authors:  Susan C Shih; Gali Prag; Smitha A Francis; Myra A Sutanto; James H Hurley; Linda Hicke
Journal:  EMBO J       Date:  2003-03-17       Impact factor: 11.598

2.  SMART 4.0: towards genomic data integration.

Authors:  Ivica Letunic; Richard R Copley; Steffen Schmidt; Francesca D Ciccarelli; Tobias Doerks; Jörg Schultz; Chris P Ponting; Peer Bork
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  COMPASS: a tool for comparison of multiple protein alignments with assessment of statistical significance.

Authors:  Ruslan Sadreyev; Nick Grishin
Journal:  J Mol Biol       Date:  2003-02-07       Impact factor: 5.469

4.  COACH: profile-profile alignment of protein families using hidden Markov models.

Authors:  Robert C Edgar; Kimmen Sjölander
Journal:  Bioinformatics       Date:  2004-02-12       Impact factor: 6.937

Review 5.  Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

Authors:  S F Altschul; T L Madden; A A Schäffer; J Zhang; Z Zhang; W Miller; D J Lipman
Journal:  Nucleic Acids Res       Date:  1997-09-01       Impact factor: 16.971

6.  Hidden Markov models in computational biology. Applications to protein modeling.

Authors:  A Krogh; M Brown; I S Mian; K Sjölander; D Haussler
Journal:  J Mol Biol       Date:  1994-02-04       Impact factor: 5.469

7.  The region on 9p associated with 46,XY sex reversal contains several transcripts expressed in the urogenital system and a novel doublesex-related domain.

Authors:  C Ottolenghi; R Veitia; L Quintana-Murci; D Torchard; L Scapoli; N Souleyreau-Therville; J Beckmann; M Fellous; K McElreavey
Journal:  Genomics       Date:  2000-03-01       Impact factor: 5.736

8.  The Universal Protein Resource (UniProt): an expanding universe of protein information.

Authors:  Cathy H Wu; Rolf Apweiler; Amos Bairoch; Darren A Natale; Winona C Barker; Brigitte Boeckmann; Serenella Ferro; Elisabeth Gasteiger; Hongzhan Huang; Rodrigo Lopez; Michele Magrane; Maria J Martin; Raja Mazumder; Claire O'Donovan; Nicole Redaschi; Baris Suzek
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

9.  Pfam: clans, web tools and services.

Authors:  Robert D Finn; Jaina Mistry; Benjamin Schuster-Böckler; Sam Griffiths-Jones; Volker Hollich; Timo Lassmann; Simon Moxon; Mhairi Marshall; Ajay Khanna; Richard Durbin; Sean R Eddy; Erik L L Sonnhammer; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2006-01-01       Impact factor: 16.971

10.  SCOP database in 2004: refinements integrate structure and sequence family data.

Authors:  Antonina Andreeva; Dave Howorth; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

View more
  19 in total

1.  Gene expression in superior temporal cortex of schizophrenia patients.

Authors:  C Sellmann; L Villarín Pildaín; A Schmitt; F Leonardi-Essmann; P F Durrenberger; R Spanagel; T Arzberger; H Kretzschmar; M Zink; O Gruber; M Herrera-Marschitz; R Reynolds; P Falkai; P J Gebicke-Haerter; F Matthäus
Journal:  Eur Arch Psychiatry Clin Neurosci       Date:  2013-11-28       Impact factor: 5.270

2.  Profiles of Natural and Designed Protein-Like Sequences Effectively Bridge Protein Sequence Gaps: Implications in Distant Homology Detection.

Authors:  Gayatri Kumar; Narayanaswamy Srinivasan; Sankaran Sandhya
Journal:  Methods Mol Biol       Date:  2022

3.  webPRC: the Profile Comparer for alignment-based searching of public domain databases.

Authors:  Bernd W Brandt; Jaap Heringa
Journal:  Nucleic Acids Res       Date:  2009-05-06       Impact factor: 16.971

4.  The Pfam protein families database.

Authors:  Robert D Finn; Jaina Mistry; John Tate; Penny Coggill; Andreas Heger; Joanne E Pollington; O Luke Gavin; Prasad Gunasekaran; Goran Ceric; Kristoffer Forslund; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2009-11-17       Impact factor: 16.971

5.  DUFs: families in search of function.

Authors:  Alex Bateman; Penny Coggill; Robert D Finn
Journal:  Acta Crystallogr Sect F Struct Biol Cryst Commun       Date:  2010-03-05

6.  RefProtDom: a protein database with improved domain boundaries and homology relationships.

Authors:  Mileidy W Gonzalez; William R Pearson
Journal:  Bioinformatics       Date:  2010-08-06       Impact factor: 6.937

7.  Rfam: Wikipedia, clans and the "decimal" release.

Authors:  Paul P Gardner; Jennifer Daub; John Tate; Benjamin L Moore; Isabelle H Osuch; Sam Griffiths-Jones; Robert D Finn; Eric P Nawrocki; Diana L Kolbe; Sean R Eddy; Alex Bateman
Journal:  Nucleic Acids Res       Date:  2010-11-09       Impact factor: 16.971

8.  Conotoxin protein classification using free scores of words and support vector machines.

Authors:  Nazar Zaki; Stefan Wolfsheimer; Gregory Nuel; Sawsan Khuri
Journal:  BMC Bioinformatics       Date:  2011-05-29       Impact factor: 3.169

9.  Bacterial pleckstrin homology domains: a prokaryotic origin for the PH domain.

Authors:  Qingping Xu; Alex Bateman; Robert D Finn; Polat Abdubek; Tamara Astakhova; Herbert L Axelrod; Constantina Bakolitsa; Dennis Carlton; Connie Chen; Hsiu-Ju Chiu; Michelle Chiu; Thomas Clayton; Debanu Das; Marc C Deller; Lian Duan; Kyle Ellrott; Dustin Ernst; Carol L Farr; Julie Feuerhelm; Joanna C Grant; Anna Grzechnik; Gye Won Han; Lukasz Jaroszewski; Kevin K Jin; Heath E Klock; Mark W Knuth; Piotr Kozbial; S Sri Krishna; Abhinav Kumar; David Marciano; Daniel McMullan; Mitchell D Miller; Andrew T Morse; Edward Nigoghossian; Amanda Nopakun; Linda Okach; Christina Puckett; Ron Reyes; Christopher L Rife; Natasha Sefcovic; Henry J Tien; Christine B Trame; Henry van den Bedem; Dana Weekes; Tiffany Wooten; Keith O Hodgson; John Wooley; Marc-André Elsliger; Ashley M Deacon; Adam Godzik; Scott A Lesley; Ian A Wilson
Journal:  J Mol Biol       Date:  2009-11-10       Impact factor: 5.469

10.  Phospholipid scramblases and Tubby-like proteins belong to a new superfamily of membrane tethered transcription factors.

Authors:  Alex Bateman; Robert D Finn; Peter J Sims; Therese Wiedmer; Andreas Biegert; Johannes Söding
Journal:  Bioinformatics       Date:  2008-11-13       Impact factor: 6.937

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.