Literature DB >> 9223186

Pfam: a comprehensive database of protein domain families based on seed alignments.

E L Sonnhammer1, S R Eddy, R Durbin.   

Abstract

Databases of multiple sequence alignments are a valuable aid to protein sequence classification and analysis. One of the main challenges when constructing such a database is to simultaneously satisfy the conflicting demands of completeness on the one hand and quality of alignment and domain definitions on the other. The latter properties are best dealt with by manual approaches, whereas completeness in practice is only amenable to automatic methods. Herein we present a database based on hidden Markov model profiles (HMMs), which combines high quality and completeness. Our database, Pfam, consists of parts A and B. Pfam-A is curated and contains well-characterized protein domain families with high quality alignments, which are maintained by using manually checked seed alignments and HMMs to find and align all members. Pfam-B contains sequence families that were generated automatically by applying the Domainer algorithm to cluster and align the remaining protein sequences after removal of Pfam-A domains. By using Pfam, a large number of previously unannotated proteins from the Caenorhabditis elegans genome project were classified. We have also identified many novel family memberships in known proteins, including new kazal, Fibronectin type III, and response regulator receiver domains. Pfam-A families have permanent accession numbers and form a library of HMMs available for searching and automatic annotation of new protein sequences.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9223186     DOI: 10.1002/(sici)1097-0134(199707)28:3<405::aid-prot10>3.0.co;2-l

Source DB:  PubMed          Journal:  Proteins        ISSN: 0887-3585


  410 in total

1.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  TIGRFAMs: a protein family resource for the functional identification of proteins.

Authors:  D H Haft; B J Loftus; D L Richardson; F Yang; J A Eisen; I T Paulsen; O White
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

3.  A human protein containing multiple types of protease-inhibitory modules.

Authors:  M Trexler; L Bányai; L Patthy
Journal:  Proc Natl Acad Sci U S A       Date:  2001-03-13       Impact factor: 11.205

4.  Automated search of natively folded protein fragments for high-throughput structure determination in structural genomics.

Authors:  Y Kuroda; K Tani; Y Matsuo; S Yokoyama
Journal:  Protein Sci       Date:  2000-12       Impact factor: 6.725

5.  DNA sequence and comparison of virulence plasmids from Rhodococcus equi ATCC 33701 and 103.

Authors:  S Takai; S A Hines; T Sekizaki; V M Nicholson; D A Alperin; M Osaki; D Takamatsu; M Nakamura; K Suzuki; N Ogino; T Kakuda; H Dan; J F Prescott
Journal:  Infect Immun       Date:  2000-12       Impact factor: 3.441

6.  Novel genes coding for lithotrophic sulfur oxidation of Paracoccus pantotrophus GB17.

Authors:  C G Friedrich; A Quentmeier; F Bardischewsky; D Rother; R Kraft; S Kostka; H Prinz
Journal:  J Bacteriol       Date:  2000-09       Impact factor: 3.490

7.  Visual management of large scale data mining projects.

Authors:  I Shah; L Hunter
Journal:  Pac Symp Biocomput       Date:  2000

8.  Massive sequence comparisons as a help in annotating genomic sequences.

Authors:  A Louis; E Ollivier; J C Aude; J L Risler
Journal:  Genome Res       Date:  2001-07       Impact factor: 9.043

9.  Novel, thermostable family-13-like glycoside hydrolase from Methanococcus jannaschii.

Authors:  J W Kim; H A Terc; L O Flowers; M Whiteley; T L Peeples
Journal:  Folia Microbiol (Praha)       Date:  2001       Impact factor: 2.099

10.  Origin of fibronectin type II (FN2) modules: structural analyses of distantly-related members of the kringle family idey the kringle domain of neurotrypsin as a potential link between FN2 domains and kringles.

Authors:  O A Ozhogina; M Trexler; L Bányai; M Llinás; L Patthy
Journal:  Protein Sci       Date:  2001-10       Impact factor: 6.725

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.