| Literature DB >> 19582169 |
Paul Francis Morris1, Laura Rose Schlosser, Katherine Diane Onasch, Tom Wittenschlaeger, Ryan Austin, Nicholas Provart.
Abstract
Complex enzymes with multiple catalytic activities are hypothesized to have evolved from more primitive precursors. Global analysis of the Phytophthora sojae genome using conservative criteria for evaluation of complex proteins identified 273 novel multifunctional proteins that were also conserved in P. ramorum. Each of these proteins contains combinations of protein motifs that are not present in bacterial, plant, animal, or fungal genomes. A subset of these proteins were also identified in the two diatom genomes, but the majority of these proteins have formed after the split between diatoms and oomycetes. Documentation of multiple cases of domain fusions that are common to both oomycetes and diatom genomes lends additional support for the hypothesis that oomycetes and diatoms are monophyletic. Bifunctional proteins that catalyze two steps in a metabolic pathway can be used to infer the interaction of orthologous proteins that exist as separate entities in other genomes. We postulated that the novel multifunctional proteins of oomycetes could function as potential Rosetta Stones to identify interacting proteins of conserved metabolic and regulatory networks in other eukaryotic genomes. However ortholog analysis of each domain within our set of 273 multifunctional proteins against 39 sequenced bacterial and eukaryotic genomes, identified only 18 candidate Rosetta Stone proteins. Thus the majority of multifunctional proteins are not Rosetta Stones, but they may nonetheless be useful in identifying novel metabolic and regulatory networks in oomycetes. Phylogenetic analysis of all the enzymes in three pathways with one or more novel multifunctional proteins was conducted to determine the probable origins of individual enzymes. These analyses revealed multiple examples of horizontal transfer from both bacterial genomes and the photosynthetic endosymbiont in the ancestral genome of Stramenopiles. The complexity of the phylogenetic origins of these metabolic pathways and the paucity of Rosetta Stones relative to the total number of multifunctional proteins suggests that the proteome of oomycetes has few features in common with other Kingdoms.Entities:
Mesh:
Year: 2009 PMID: 19582169 PMCID: PMC2705460 DOI: 10.1371/journal.pone.0006133
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Search strategy to identify novel multifunctional proteins in the Phytophthora sojae genome.
Multifunctional metabolic enzymes in oomycete and other eukaryotic genomes.
| Gene_ID | Metabolic Process | Catalytic activities |
|
|
|
| Ps108209 | amino acid catabolism | Methyl malonyl Coenzyme A mutase | none | P22033 | none |
| Ps108248 | aspartate metabolism | Aspartate kinase-homoserine dehydrogenase | Q9SA18 | none | none |
| Ps108383 | proline synthesis | Delta 1-pyrroline-5-carboxylate synthetase (P5CS) | P54887 | none | P54885 |
| Ps108492 | purine biosynthesis | AIR synthase related protein | A9Y5J1 | A8K8N7 | Q45U25 |
| Ps108690 | glutathione | 1-cys-peroxiredoxin | O04005 | P30041 | P34227 |
| Ps108696 | tryptophan | Anthranilate synthase component II | Q9LUB2 | None | P00937 |
| Ps108681 | fucose | GDP-L-fucose synthase | O49213 | Q13630 | none |
| Ps108780 | glycolysis | Pyruvate carboxylase | none | P11498 | P32327 |
| Ps108801 | lysine catabolism | Lysine-ketoglutarate reductase-saccharopine dehydrogenase | Q9SMZ4 | Q9UDR5 | P38999 |
| Ps108913 | Lipid biosynthesis | ATP citrate lyase | none | P53396 | none |
| Ps108918 | Acetyl CoA carboxylase | Q38970 | Q13085 | P32874 | |
| Ps108989 | mRNA synthesis | RNA polymerase II | P18616 | P24928 | P04050 |
| Ps109060 | histidine synthesis | Glutamine amidotransferase glutamine cyclase | Q9SZ30 | none | P33734 |
| Ps109110 | riboflavin metabolism | GTP cyclohydrolase II Dihydropteroate synthase | Q9SJY9 | none | Q99258 |
| Ps109141 | folic acid | 7,8-Dihydro-6-hydroxymethylpterin-pyrophosphokinase, | Q9SZV3 | none | P53848 |
| Ps109146 | Mo cofactor biosynthesis | Molybdenum cofactor biosynthesis protein 1 | Q39055 | Q9NZB8-5 | none |
| Ps109150 | ketone metabolism | Succinyl-CoA:3-ketoacid-coenzyme A transferase | none | P55809 | none |
| Ps109206 | urea cycle | Carbamoyl-phosphate synthetase III; | Q9SZV3 | none | Q2VQX0 |
| Ps109271 | mRNA synthesis | RNA polymerase | P38420 | B4DHJ3 | P08518 |
| Ps109291 | fatty acid degradation | Enoyl-CoA hydratase; Long chain 3-hydroxyacyl-CoA dehydrogenase | Q9ZPI6 | P40939 | P28817 |
| Ps109321 | tetrahydrofolate | Dihydrofolate reductase-thymidylate synthase | Q05762 | P04818 | None |
| Ps109425 | urea cycle | Carbamoyl phosphate synthetase | Q42601 | P31327 | PO7259 |
| Ps109486 | purine biosynthesis | Phosphoribosylaminoimidazole carboxamide formyltransferase activity | Q8RWT5 | P31939 | P54113 |
| Ps109918 | fatty acid metabolism | Acetyl-coenzyme A carboxylase | Q84MM8 | none | P36022 |
| Ps123952 | PABA | p-aminobenzoate synthase-glutamine-amidotransferase | Q9ZV26 | P49915 | P49915 |
| Ps128107 | Mo cofactor biosynthesis | Molybdopterin biosynthesis | Q39054 | Q9NQX3 | none |
| Ps129332 | adenine biosynthesis | ADE1 | P52420 | Q59HH3 | P07244 |
| Ps131873 | glucose metabolism | Phosphofructo-2 kinase, fructose 2,6 bisphosphatase | Q9MB58 | Q16875 | P32604 |
| Ps133964 | multi-functional tRNA | Glutamyl prolyl tRNA synthetase | O82462 | Q8NAJ6 | P46655 |
| Ps134345 | biotin synthesis | Bifunctional diaminopelargonate synthase-dethiobiotin synthetase | B0F481 | none | none |
| Ps134889 | sucrose metabolism | Phosphofructo-2 kinase, fructose 2,6 bisphosphatase | Q9MB58 | B2R6L2 | P40433 |
| Ps135279 | histidine biosynthesis | Phosphoribosyl-ATP pyrophosphohydrolase–Phosphoribosyl-AMP cyclohydrolase | novel | novel | novel |
| Ps135354 | pyrimidine metabolism | Orotidine 5′phosphate decarboxylase-Orotate phosphoribosyltransferase | none | none | none |
| Ps136410 | Aspartate biosynthesis | Aspartate kinase-homoserine dehydrogenase | Q9SA18 | None | P10869 |
| Ps137041 | RNA polymerase | DNA-directed RNA polymerase III | Q9LVH0 | O14802 | P040051 |
| Ps137320 | Topoisomerase/gyrase | Topoisomerase/gyrase | P30182 | P11388 | Q8TG58 |
| Ps141174 | glucose metabolism | Phosphofructo-2 kinase, fructose 2,6 bisphosphatase | Q9MB58 | Q16877 | P32604 |
| Ps143339 | amino acid biosynthesis | Pentafunctional aromatic amino acid synthesis | None | None | P08566 |
| Ps155429 | Aspartate biosynthesis | Aspartate kinase-homoserine dehydrogenase | Q9SA18 | None | P10869 |
Novel multifunctional proteins present in the P. sojae, P. tricornutum and T. pseudonana genomes.
|
|
|
| Domains |
| Ps108305 | Tp28241 | 32747 | Glyceraldehyde 3-phosphate dehydrogenase |
| Ps109321 | Tp13647 | 1690 | bifunctional dihydrofolate reductase-thymidylate synthase |
| Ps110986 | Tp268564 | no model in PT | Novel |
| Ps112102 | Tp35055 | 19901 | Adenylylsulfate kinase |
| Ps116824 | Tp5802 | 8996 | N terminal protein is upstream 5801 |
| Ps127510 | Tp8169 | 39696 | calmodulin and leucine rich repeats domains not present |
| Ps128154 | Tp260768 | 16649 | Beta-ketoacyl synthase &hydroxymethylglutaryl-CoA synthase |
| Ps129073 | Tp35637 | 40689 | Phosphodiesterase and phosphohydrolase |
| Ps129281 | Tp8768 | no | RNA Bindi55ng |
| Ps129631 | Tp21612 | 21612 | Hypothetical protein |
| Ps131310 | Tp21710 | no blast hit | ankyrin |
| Ps131558 | Tp264628 | Mannitol dehydrogenase | |
| Ps132073 | Tp24275 | 47805 | Heavy metal transport |
| Ps132790 | Tp3330 | 3120 | Cyclinc nucleotide binding, regulator of G protein |
| Ps134136 | Tp262528 | 39951 | Phosphatidylinositol 3- and 4-kinase, plekstrin |
| Ps134555 | Tp26966 | 36588 | Mov 3 family PWWP domain |
| Ps135354 | Tp37071 | 11740 | Pyrimidine metabolism |
| Ps138204 | Tp12191 | no direct hit | Hypothetical protein |
| Ps138371 | Tp263580 | 48692 | Cold shock protein |
| Ps138788 | Tp264631 | 48208 | Cyclin, N-terminal |
| Ps140109 | Tp262068 | No direct hit | Plekstrin, Zn-finger |
Figure 2Lysine biosynthetic pathway of oomycetes.
Gene model IDs for P. sojae are shown for each step on the pathway. The probable origin of each gene model based on PHYML analysis is listed as bacterial, metazoan, plant or unknown. The oomycete pathway closely mirrors the plant biosynthetic strategy, but has one fewer step and the majority of enzymes appear to be of bacterial origin.
Figure 3PHYML analysis of genes with homology to oomycete genes with predicted domains for arginine, ornithine or diaminopimelate decarboxylase activity.
Numbers at nodes indicate bootstrap support values (100 replications). Gene names are color coded to indicate the major phylogenetic groups: archaea, purple; bacteria, blue; plants, green; stramenopiles, red.
Figure 4Divergence of diatoms and oomycetes from a common ancestral genome.
Selective transfer of DNA from the endosymbiont of LUCA, and the continued acquisition of bacterial DNA by phagocytosis after the separation of oomycetes and diatoms has contributed to the divergence of these genomes.
Phylogenetic associations of enzyme domains in serine biosynthesis and sulfate assimilation pathways of P. sojae and P. ramorum.
| Process | Gene Function | GENE_ID | Clade Association |
| Serine Biosynthesis | |||
| 1 phosphoglycerate dehydrogenase | Ps142688a | metazoan | |
| 2 phosphoserine aminotransferase | Ps142688b | bacterial | |
| 3 phosphoserine phosphatase | Ps132144 Ps134157 Ps157034 | bacterial | |
| 4 serine-pyruvate aminotransferase | Ps109249 | bacterial | |
| Sulfate Assimilation/Metabolism | |||
| 1 Adenyl sulfate kinase | Ps112102a | indeterminant | |
| 2 ATP sulfuryase | Ps112102b | indeterminant | |
| 3 pyrrophosphatase | Ps112102c | indeterminant | |
| 4 phosphoadenosine phosphosulfate reductase | Ps156997a | plant/cyano | |
| 5 glutaredoxin | Ps156997b | bacterial | |
| 6 sulfite reductase ECM17 | Ps139493 | fungal | |
| 6 sulfite reductase MET10 | Ps139488 | fungal | |
| 7 cysteine synthase | Ps109172, Ps109175 | plant | |
| 8 cystathionine gamma lyase | Ps109222 | metazoan | |
| 9 cystathionine beta lyase | Ps120833 | indeterminant | |
Letter following Gene ID indicates a multifunctional protein where each of the domains was analyzed separately.
Clade association was assigned when bootstrap support was >70% of 100 replicates using PHYML parameters as described in methods.