| Literature DB >> 19455212 |
Anastasia N Nikolskaya1, Cecilia N Arighi, Hongzhan Huang, Winona C Barker, Cathy H Wu.
Abstract
The PIRSF protein classification system (http://pir.georgetown.edu/pirsf/) reflects evolutionary relationships of full-length proteins and domains. The primary PIRSF classification unit is the homeomorphic family, whose members are both homologous (evolved from a common ancestor) and homeomorphic (sharing full-length sequence similarity and a common domain architecture). PIRSF families are curated systematically based on literature review and integrative sequence and functional analysis, including sequence and structure similarity, domain architecture, functional association, genome context, and phyletic pattern. The results of classification and expert annotation are summarized in PIRSF family reports with graphical viewers for taxonomic distribution, domain architecture, family hierarchy, and multiple alignment and phylogenetic tree. The PIRSF system provides a comprehensive resource for bioinformatics analysis and comparative studies of protein function and evolution. Domain or fold-based searches allow identification of evolutionarily related protein families sharing domains or structural folds. Functional convergence and functional divergence are revealed by the relationships between protein classification and curated family functions. The taxonomic distribution allows the identification of lineage-specific or broadly conserved protein families and can reveal horizontal gene transfer. Here we demonstrate, with illustrative examples, how to use the web-based PIRSF system as a tool for functional and evolutionary studies of protein families.Entities:
Keywords: Domain architecture; Functional convergence; Functional divergence; Genome context; Protein family classification; Taxonomic distribution
Year: 2007 PMID: 19455212 PMCID: PMC2674652
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1PIRSF protein family classification and curation workflow
Genome context of [NiFe]-hydrogenase-3-type complex Eha in completely sequenced Euryarchaeotic genomes. The first 15 genes in the operon are shown
| PIRSF ID and Name | Mt | Mk | Mj | Mm |
|---|---|---|---|---|
| PIRSF005019: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaA | MTH384 | MK0477 | MJ0528 | MMP1448 |
| PIRSF019706: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaB | MTH385 | MK0476 | MJ0527 | MMP1449 |
| PIRSF036534: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaC | MTH386 | MK0475 | MJ0526.1 | MMP1450 |
| PIRSF006581: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaD | MTH387 | MK0474 | MJ0526 | MMP1451 |
| PIRSF036535: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaE | MTH388 | MK0473 | MJ0525 | MMP1452 |
| PIRSF019373: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaF | MTH389 | MK0472 | MJ0524 | MMP1453 |
| PIRSF019136: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaG | MTH390 | MK0471 | MJ0523 | MMP1454 |
| PIRSF036536: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaH | MTH391 | MK0470 | MJ0522 | MMP1455 |
| PIRSF036537: [NiFe]-hydrogenase-3-type complex Eha, membrane protein Ehal | MTH392 | -- | -- | -- |
| PIRSF000215, subfamily PIRSF500037: [NiFe]-hydrogenase-3-type complex, membrane subunit C/D/J | MTH393 | MK0468 | MJ0520 | MMP1457 |
| PIRSF036538: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaK | MTH394 | -- | MJ0519 | MMP1458 |
| PIRSF004953: [NiFe]-hydrogenase-3-type complex Eha, membrane protein EhaL | MTH395 | MK0466 | MJ0518 | MMP1459 |
| PIRSF005292: [NiFe]-hydrogenase-3-type complex Eha, hydrophilic subunit EhaM | MTH396 | MK0465 | MJ0517 | MMP1460 |
| PIRSF002913, subfamily PIRSF500034: [NiFe]-hydrogenase-3-type complex, small subunit | MTH397 | MK0464 | MJ0516 | MMP1461 |
| MTH398 | MK0463 | MJ0515 | MMP1462 |
* Mt, Methanobacterium thermoautotrophicum str. Delta H; Mj, Methanocaldococcus jannaschii DSM 2661; Mk, Methanopyrus kandleri AV19; Mm, Methanococcus maripaludis S2.
-- Genes encoding proteins not homologous to the Methanobacterium thermoautotrophicum counterparts, and therefore not members of the listed PIRFs.
Figure 2The PIRDAGbrowser view displaying PIRSF025009 family hierarchy with Yip1 and Yip4 subfamilies and protein membership.
Figure 3(A) Selected PIRSF response regulator families with CheY-like phosphoacceptor domain (PF00072); (B) domain display of the selected PIRSF families
Figure 5Functional convergence of cobaltochelatases in evolutionary unrelated PIRSF families that do not share common domain architecture or SCOP fold. Note that the number of PIRSF entries retrieved in this or any other search may change due to the addition of new annotations.
Figure 4The PIR tree and alignment view of PIRSF000886 metallophosphoesterase-fold proteins, showing sequence variation of the Vps29 subfamily
Figure 6(A) Chorismate mutase-related PIRSF families and (B) taxonomic distribution of family members