Literature DB >> 10383472

Blocks+: a non-redundant database of protein alignment blocks derived from multiple compilations.

S Henikoff1, J G Henikoff, S Pietrokovski.   

Abstract

MOTIVATION: As databanks grow, sequence classification and prediction of function by searching protein family databases becomes increasingly valuable. The original Blocks Database, which contains ungapped multiple alignments for families documented in Prosite, can be searched to classify new sequences. However, Prosite is incomplete, and families from other databases are now available to expand coverage of the Blocks Database.
RESULTS: To take advantage of protein family information present in several existing compilations, we have used five databases to construct Blocks+, a unified database that is built on the PROTOMAT/BLOSUM scoring model and that can be searched using a single algorithm for consistent sequence classification. The LAMA blocks-versus-blocks searching program identifies overlapping protein families, making possible a non-redundant hierarchical compilation. Blocks+ consists of all blocks derived from PROSITE, blocks from Prints not present in PROSITE, blocks from Pfam-A not present in PROSITE or Prints, and so on for ProDom and Domo, for a total of 1995 protein families represented by 8909 blocks, doubling the coverage of the original Blocks Database. A challenge for any procedure aimed at non-redundancy is to retain related but distinct families while discarding those that are duplicates. We illustrate how using multiple compilations can minimize this potential problem by examining the SNF2 family of ATPases, which is detectably similar to distinct families of helicases and ATPases. AVAILABILITY: http://blocks.fhcrc.org/

Mesh:

Substances:

Year:  1999        PMID: 10383472     DOI: 10.1093/bioinformatics/15.6.471

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  74 in total

1.  Increased coverage of protein families with the blocks database servers.

Authors:  J G Henikoff; E A Greene; S Pietrokovski; S Henikoff
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  PRINTS-S: the database formerly known as PRINTS.

Authors:  T K Attwood; M D Croning; D R Flower; A P Lewis; J E Mabey; P Scordis; J N Selley; W Wright
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

3.  Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics.

Authors:  Y Kuroda; K Tani; Y Matsuo; S Yokoyama
Journal:  Protein Sci       Date:  2000-12       Impact factor: 6.725

4.  DNA sequence and comparison of virulence plasmids from Rhodococcus equi ATCC 33701 and 103.

Authors:  S Takai; S A Hines; T Sekizaki; V M Nicholson; D A Alperin; M Osaki; D Takamatsu; M Nakamura; K Suzuki; N Ogino; T Kakuda; H Dan; J F Prescott
Journal:  Infect Immun       Date:  2000-12       Impact factor: 3.441

5.  Roles of the recJ and recN genes in homologous recombination and DNA repair pathways of Neisseria gonorrhoeae.

Authors:  Eric P Skaar; Matthew P Lazio; H Steven Seifert
Journal:  J Bacteriol       Date:  2002-02       Impact factor: 3.490

6.  Sentra, a database of signal transduction proteins.

Authors:  Natalia Maltsev; E Marland; G X Yu; S Bhatnagar; R Lusk
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

7.  DIAN: a novel algorithm for genome ontological classification.

Authors:  Y Pouliot; J Gao; Q J Su; G G Liu; X B Ling
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

8.  GapA and CrmA coexpression is essential for Mycoplasma gallisepticum cytadherence and virulence.

Authors:  L Papazisi; S Frasca; M Gladd; X Liao; D Yogev; S J Geary
Journal:  Infect Immun       Date:  2002-12       Impact factor: 3.441

9.  FoldMiner: structural motif discovery using an improved superposition algorithm.

Authors:  Jessica Shapiro; Douglas Brutlag
Journal:  Protein Sci       Date:  2004-01       Impact factor: 6.725

10.  GFScan: a gene family search tool at genomic DNA level.

Authors:  Zhenyu Xuan; W Richard McCombie; Michael Q Zhang
Journal:  Genome Res       Date:  2002-07       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.