| Literature DB >> 16539742 |
Abstract
BACKGROUND: Phytochromes are photoreceptors, discovered in plants, that control a wide variety of developmental processes. They have also been found in bacteria and fungi, but for many species their biological role remains obscure. This work concentrates on the phytochrome system of Agrobacterium tumefaciens, a non-photosynthetic soil bacterium with two phytochromes. To identify proteins that might share common functions with phytochromes, a co-distribution analysis was performed on the basis of protein sequences from 138 bacteria.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16539742 PMCID: PMC1552090 DOI: 10.1186/1471-2105-7-141
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schematic diagram for virtual co-distribution analysis. Each circle represents a protein which is designated with a letter and a digit. The letter stands for one of six different species. Protein homologs that arise from a global BLAST analyses are connected with lines. The virtual co-distribution analysis was performed with protein A1 as a query and all proteins of species A were used as target proteins. The species in which homologs of A-proteins are found are as follows: A1: B, C, D, E; A2: B, D, E, F; A3: B, D, F; A4: C, D, E, F; A5: B, F; A6: B, C, D; A7: B, D, F; A8: C, E; A9: C, F; A0: B.
Co-distribution analysis result of the example outlined in Fig. 1
| Query protein A1 from species A | |||||||
| global BLAST run with E value of 0.000001 | |||||||
| gi A1 taxid A, BLAST homologs found in k = 4 species | |||||||
| a: sorting range | 4 | ||||||
| b: gi of target protein | 6 | ||||||
| c: taxid of target protein | (above values for k and n for calculation of p) | ||||||
| l: number of species in which BLAST homologs were found | |||||||
| m: number of species identical with query protein | |||||||
| p: probability for co-distribution | |||||||
| e: 1 if the target protein is a direct BLAST ortholog of the query protein | |||||||
| f: protein annotation as | |||||||
| a | b | c | l | m | p | e | f |
| 1 | A1 | A | 4 | 4 | 6.67E-02 | 1 | function one protein |
| 2 | A6 | A | 3 | 3 | 2.00E-01 | 0 | function six protein |
| 3 | A8 | A | 2 | 2 | 4.00E-01 | 0 | function eight protein |
| 2 | A2 | A | 4 | 3 | 5.33E-01 | 1 | function two protein |
| 5 | A4 | A | 4 | 3 | 5.33E-01 | 0 | function four protein |
| 6 | A5 | A | 2 | 1 | 5.33E-01 | 0 | function five protein |
| 7 | A9 | A | 2 | 1 | 5.33E-01 | 0 | function nine protein |
| 8 | A3 | A | 3 | 2 | 6.00E-01 | 0 | function three protein |
| 9 | A7 | A | 3 | 2 | 6.00E-01 | 0 | function seven protein |
| 10 | A0 | A | 1 | 1 | 6.67E-01 | 0 | function zero protein |
Summary of all co-distribution analyses. Abbreviations for species names: S6803, Synechocystis PCC 6803; Agrtu, Agrobacterium tumefaciens; Deira, Deinococcus radiodurans; Pseae, Pseudomonas aeruginosa. In cases where the results are given by two numbers separated by "/", the first number stands for the number of proteins that are listed among the first 100 of the co-distribution tables, while the second number stands for the total of all proteins in this organisms. The numbers in parentheses are the corresponding % values.
| Query protein; species | global BLAST with E value of 0.000001 | global BLAST with E value of 10 | |
| D1 (photosynthesis); S6803 | number of proteins in analysis | 3153 | 3166 |
| BLAST homologs in number of species | 6 | 16 | |
| Remarks | 142 proteins have the same distribution as D1, for details see text | see text | |
| Agp1 (phytochrome 1); Agrtu | number of proteins in analysis | 5091 | 5096 |
| BLAST homologs in number of species | 65 | 65 | |
| co-distributed "two component sensors" | 20/55 (20%/1.08%) | 18/55 (18%/1.08%) | |
| co-distributed "two component response regulators" | 8/48 (8%/0.94%) | 7/48 (7%/0.94%) | |
| co-distributed "transcriptional regulators" | 13/312 (13%/6.12%) | 32/312 (32%/6.12%) | |
| Agp2 (phytochrome 2); Agrtu | number of proteins in analysis | 5091 | 5096 |
| BLAST homologs in number of species | 15 | 61 | |
| co-distributed "two component sensors" | 9/55 (9%/1.08%) | 15/55 (15%/1.08%) | |
| co-distributed "two component response regulators" | 1/48 (1%/0.94%) | 12/48 (12%/0.94%) | |
| co-distributed "transcriptional regulators" | 4/312(4%/6.12%) | 15/312(15%/6.12%) | |
| AgR (response regulator); Agrtu | number of proteins in analysis | 5091 | 5096 |
| BLAST homologs in number of species | 19 | 72 | |
| co-distributed "two component sensors" | 20/55 (20/1.08) | 24/55 (24/1.08) | |
| co-distributed "two component response regulators" | 4/48 (4%/0.94%) | 5/48 (7%/0.94%) | |
| co-distributed "transcriptional regulators" | 1/312 (1%/6.12%) | 10/312 (10%/6.12%) | |
| ExsF (response regulator); Agrtu | number of proteins in analysis | 5091 | 5096 |
| BLAST homologs in number of species | 7 | 10 | |
| co-distributed "two component sensors" | 6/55 (6/1.08) | 13/55 (13/1.08) | |
| co-distributed "two component response regulators" | 2/48 (2%/0.94%) | 9/48 (13%/0.94%) | |
| co-distributed "transcriptional regulators" | 5/312 (5%/6.12%) | 37/312 (37%/6.12%) | |
| Query protein; species | global BLAST with E value of 0.000001 | global BLAST with E value of 10 | |
| ExsG (histidine kinase); Agrtu | number of proteins in analysis | 5091 | 5096 |
| BLAST homologs in number of species | 17 | 80 | |
| co-distributed "two component sensors" | 10/55 (10/1.08) | 20/55 (20/1.08) | |
| co-distributed "two component response regulators" | 3/48 (3%/0.94%) | 6/48 (6%/0.94%) | |
| co-distributed "transcriptional regulators" | 5/312 (5%/6.12%) | 22/312 (22%/6.12%) | |
| BhpP (phytochrome); Deira | number of proteins in analysis | 3176 | 3182 |
| BLAST homologs in number of species | 30 | 57 | |
| co-distributed "histidine kinases" | 13/19 (13%/0.60%) | 12/19 (13%/0.60%) | |
| co-distributed "response regulators" | 11/26(11%/0.82%) | 6/26(6%/0.82%) | |
| BhpP (phytochrome); Pseae | number of proteins in analysis | 5549 | 5566 |
| BLAST homologs in number of species | 63 | 69 | |
| co-distributed "two-component sensors" | 20/51 (20%/0.92%) | 24/51 (24%/0.92%) | |
| co-distributed "two-component response regulators" | 19/56 (19%/1.01%) | 16/56 (16%/1.01%) | |
| Cph1 (phytochrome); S6803 | number of proteins in analysis | 3153 | 3166 |
| BLAST homologs in number of species | 62 | 62 | |
| co-distributed "histidine kinase" | 15/28 (15%/0.80%) | 13/28 (13%/0.80%) | |
| co-distributed "AraC/PatA/CheY/NarL/OmpR subfamily" | 16/31 (16%/0.89%) | 17/31 (17%/0.89%) | |
| Rcp1 (response regulator); S6803 | number of proteins in analysis | 3153 | 3166 |
| BLAST homologs in number of species | 21 | 62 | |
| co-distributed "histidine kinase" | 8/27 (8%/0.77%) | 17/27 (17%/0.77%) | |
| co-distributed "AraC/PatA/CheY/NarL/OmpR subfamily" | 1/31 (1%/0.89%) | 22/31 (22%/0.89%) |
Figure 2. Gene names are listed below and names of the encoded proteins listed above the diagram. The apg1, agR and exsG open reading frames belong to the same operon, the exsF open reading frame points to the opposite direction.
Co-distribution of selected A. tumefaciens proteins. The first column lists the names of the query proteins, and the first line lists the names of the target proteins. Numbers indicate the position of the target protein in the co-distribution lists. The first and second numbers are from global BLAST analysis with E values of 0.000001 and 10, respectively. Numbers < 100 are printed in bold, if both numbers are < 100, they are underlined.
| query\target | Agp1 | Agp2 | AgR | ExsF | ExsG |
| Agp1 | 1/1 | 2237/232 | 321/ | 2823/1336 | 2309/577 |
| Agp2 | 2143/ | 1/1 | 348/184 | ||
| AgR | 1/1 | 190/420 | |||
| ExsF | 2726/1513 | 1083/367 | 1/1 | ||
| ExsG | 1687/253 | 180/260 | 1/1 |
Co-distribution of Cph1 and Rcp1 from Synechocystis PCC6803. Co-distribution of Cph1 and Rcp1, presented as in Table 3.
| query\target | Cph1 | Rcp1 |
| Cph1 | 1/1 | 220/ |
| Rcp1 | 1/1 |
Phytochromes as query and selected target proteins. The co-distribution of selected proteins, presented as in Table 3.
| query\target | photolyase/cryptochrome | glutamate synthases | methionine synthase |
| AgrtuAgp1 | 156/17 | 4,6,2552 (923)/3,5,10 (988)1 | 3/87 |
| AgrtuAgp2 | 1855/2054 | 2767,2841,4656 (2999)/144,201,247 (950)1 | 1749/373 |
| DeiraBphP | n.a. | 45 (614)/13 (1593) 2 | 19/1534 |
| PseaBphP | 655/958 | 74 (273)/123 (235) 2 | 19/419 |
| S6803Cph1 | 420,421/630,756 | 7,8 (311)/12,53 (916) 3 | 5/304 |
1 three large subunits and one small subunit (positions of the latter are given in brackets)
2 one large subunit and one small subunit (positions of the latter are given in brackets)
3 two ferredoxin-dependent and one NADH-dependent enzymes (positions of the latter are given in brackets)
4 designated as 5-methyltetrahydrofolate-homocysteine methyltransferase