| Literature DB >> 15715905 |
David J Studholme1, J Allan Downie, Gail M Preston.
Abstract
BACKGROUND: Evolution of new complex biological behaviour tends to arise by novel combinations of existing building blocks. The functional and evolutionary building blocks of the proteome are protein domains, the function of a protein being dependent on its constituent domains. We clustered completely-sequenced proteomes of prokaryotes on the basis of their protein domain content, as defined by Pfam (release 16.0). This revealed that, although there was a correlation between phylogeny and domain content, other factors also have an influence. This observation motivated an investigation of the relationship between an organism's lifestyle and the complement of domains and domain architectures found within its proteome.Entities:
Mesh:
Year: 2005 PMID: 15715905 PMCID: PMC554113 DOI: 10.1186/1471-2164-6-17
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Clustering of complete prokaryotic proteomes based on their protein domain content. 100 jacknife trials were performed, each leaving out a random 10% of the data.
Pfam protein domain families found in phylogentically disparate plant-associated bacteria and not found in non-plant associated bacteria.
| Avirulence | |
| DspF | |
| DUF1427 | |
| DUF811 | |
| HrpE | |
| HrpF | |
| Ice_nucleation | |
| NolX | |
| VirK |
Pfam protein domain families restricted to plant-associated bacteria and eukaryotes.
| CBM_14 | |
| CD225 | |
| DUF726 | |
| DUF763 | |
| GDA1_CD39 | |
| Het-C | |
| PAX | |
| PPR | |
| Rhamnogal_lyase | |
| Ribosomal_60s | |
| RolB_RolC | |
| SBP56 | |
| ST7 |
Figure 2Examples of proteins containing a RolB/RolC domain.
Protein domain families over-represented in plant-associated proteobacteria.
| Domain family | Expected number of proteins | Observed number of proteins | P | |
| Pfam accesion | Pfam ID | |||
| PF00211 | Guanylate_cyc | 33.39 | 70 | 2.17E-008 |
| PF00296 | Bac_luciferase | 46.36 | 81 | 2.56E-006 |
| PF04828 | DUF636 | 36.58 | 65 | 1.40E-005 |
| PF04679 | DNA_ligase_A_C | 17.65 | 38 | 1.76E-005 |
| PF01068 | DNA_ligase_A_M | 24.03 | 47 | 2.18E-005 |
| PF02738 | Ald_Xan_dh_C2 | 35.72 | 63 | 2.33E-005 |
| PF03758 | SMP-30 | 19.35 | 40 | 2.63E-005 |
| PF01638 | DUF24 | 37 | 64 | 3.51E-005 |
| PF01757 | Acyl_transf_3 | 54.86 | 87 | 3.75E-005 |
| PF00067 | p450 | 24.24 | 46 | 5.31E-005 |
| PF02746 | MR_MLE_N | 50.18 | 80 | 6.30E-005 |
| PF02894 | GFO_IDH_MocA_C | 66.35 | 100 | 6.97E-005 |
| PF01799 | Fer2_2 | 31.26 | 55 | 7.69E-005 |
| PF06169 | DUF982 | 11.06 | 26 | 8.88E-005 |
| PF07536 | HWE_HK | 23.82 | 44 | 1.35E-004 |
| PF01022 | HTH_5 | 68.47 | 101 | 1.38E-004 |
| PF03573 | OprD | 14.03 | 30 | 1.41E-004 |
| PF00656 | Peptidase_C14 | 14.89 | 31 | 1.73E-004 |
| PF03459 | TOBE | 83.78 | 139 | 2.48E-004 |
| PF02627 | CMD | 51.89 | 79 | 2.75E-004 |
| PF01188 | MR_MLE | 56.78 | 85 | 2.79E-004 |
| PF07506 | RepB | 10.84 | 24 | 3.81E-004 |
| PF01261 | AP_endonuc_2 | 85.48 | 122 | 4.91E-004 |
| PF00150 | Cellulase | 11.06 | 24 | 4.97E-004 |
| PF01408 | GFO_IDH_MocA | 85.7 | 130 | 5.36E-004 |
| PF00941 | FAD_binding_5 | 21.05 | 38 | 5.58E-004 |
| PF01315 | Ald_Xan_dh_C | 29.35 | 49 | 5.60E-004 |
| PF00353 | HemolysinCabind | 36.15 | 57 | 8.12E-004 |
| PF06823 | DUF1236 | 8.93 | 20 | 9.64E-004 |
Domain architectures found in phytobacteria of two or more subdivisions of the Proteobacteria and not found in non-plant-associated bacteria.
| Hypothetical protein XCC1094. ( | ||
| VirK | VirK (Tiorf135 protein). ( | |
| DUF1427 | Hypothetical protein XCC2052. ( | |
| DUF1486 | Hypothetical protein. ( | |
| RepB | Msr9757 protein. ( | |
| DUF442~Lactamase_B | Metallo-beta-lactamase superfamily protein. ( | |
| GAF~Phytochrome | Phytochrome-like protein. ( | |
| Glyco_hydro_6~CBM_2 | Cellulose 1,4-beta-cellobiosidase. ( | |
| DUF811 | Hypothetical protein. ( | |
| Condensation~Condensation~AMP-binding~PP-binding~Condensation~AMP-binding~PP- binding~Condensation~AMP-binding~PP- binding~Condensation~AMP-binding~PP- binding~Condensation~AMP-binding~PP- binding~Thioesterase~Thioesterase | Probable peptide synthesis protein. ( | |
| NolX | HrpF protein. ( | |
| DUF802~DUF802 | Hypothetical protein XAC3753. ( | |
| Avirulence~Avirulence | Avirulence protein AvrXa7-3M. ( | |
| RgpF-RgpF | Mll4799 protein. ( | |
| TPR_2~TPR_1~Sulfotransfer_1 | TPR domain/sulfotransferase domain protein. ( |
Figure 3Examples of proteins containing phytochrome domains.
Figure 4Examples of proteins containing phytochrome domains.
Figure 5Genetic islands unique to . Genes encoding transposases are marked with an asterisk (*) and the asparaginyl tRNA gene is marked 'tAsn'. Black diamonds indicate genes encoding unique domain architectures [49].