| Literature DB >> 18096044 |
Martin Münsterkötter1, Gero Steinberg.
Abstract
BACKGROUND: The corn smut fungus Ustilago maydis is a well-established model system for molecular phytopathology. In addition, it recently became evident that U. maydis and humans share proteins and cellular processes that are not found in the standard fungal model Saccharomyces cerevisiae. This prompted us to do a comparative analysis of the predicted proteome of U. maydis, S. cerevisiae and humans.Entities:
Mesh:
Substances:
Year: 2007 PMID: 18096044 PMCID: PMC2262911 DOI: 10.1186/1471-2164-8-473
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General characteristics of the coding region
| Percent coding | (%) | 73.09 | 60.97 | 1.22 -2.33 |
| GC content | (%) | 39.58 | 56.26 | 52.10 |
| Average gene size | (bp) | 1451 | 1747 | 1526 |
| Average gene density | (kb/gene) | 1.99 | 2.86 | 125.43 |
| Protein-coding genes | 6127 | 6867 | 22762¶ | |
| Exons | 6482 | 9932 | 267941 | |
| Average exon size | (bp) | 1372 | 1208 | 297 |
| Exons per gene | 1.058 | 1.45 | 6.149 | |
| Genes with introns | (%) | 4.81 | 27.73 | 97.96 |
¶ the predicted number increases to 43576 when including splice variants.
Figure 1Global comparison of the predicted proteome sequence of the fungi U. maydis, S. cerevisiae and H. sapiens. Based on the Smith-Waterman comparison of all proteins from genome A against all proteins of genomes B the median e-value and the median %-identity value were calculated. Note that U. maydis is more similar to humans than to its fungal cousin, whereas the proteome of S. cerevisiae is more closely to that of the corn smut.
Analysis of coiled-coil domain containing representatives from U. maydis
| um03791 | Ums2 | Heat shock 70 kd protein | 507–539 | 3.8–7.4 |
| um05406 | Yup1 | t-SNARE | 249–286 | 3.2–14.1 |
| um11985 | Tpm2 | Tropomyosin | 1–76, 7–161 | 3.1–13 |
| um04372 | Dyn2 | Dynein heavy chain | 89–156, 311–387, 663–699 | 3.6–8.5 |
| um04555 | Myo5 | Myosin V | 958–1014, 1022–1078, 1079–1116 | 3.5–11.3 |
| um06338 | Clip1 | Microtubule binding protein | 561–607, 634–660, 661–708, 709–761, 822–867, 868–941, 1041–1082 | 10.0–16.9 |
| um04218 | Kin1 | Motor protein | 346–378, 441–551, 604–675, 688–736, 807–873, 874–905 | 9.4–14.3 |
¶ % identity over entire protein length to unrelated coiled-coil proteins
Figure 2Comparison of the proteome of U. maydis, S. cerevisiae and H. sapiens using cut-off criteria of 20 % identity over the total protein length.
Figure 3Genome-wide protein homology correlation of U. maydis against humans and S. cerevisiae using the % identity over entire protein length. Depicted are predicted U. maydis proteins that show > 20% identity to one partner and > 10% less identity to the other. Most proteins (2159) are conserved at similar levels in S. cerevisiae and H. sapiens (B). However, 587 proteins are more closely related to S. cerevisiae (A), whereas an even larger number of 651 are more similar to humans (C). Note that this analysis show "% identity over the total protein length" and thus does not include proteins that share sequence similarity only within a domain.
Figure 4Total number of predicted proteins that show significant similarity (> 20% identity to one partner and > 10% less identity to the other) over their entire length (grey bar) or within domains of at least 50 amino acids (black bar).
Functional classification of predicted proteins
| 1. Biosynthesis* | 75 | 1,7e-33 |
| 2. Spore formation/budding/cell polarity | 68 | 1.9e-15 |
| 3. Transport** | 49 | 1.3e-18 |
| 4. Ion homeostasis | 36 | 4.2e-12 |
| 5. Stress response$ | 35 | 7.5e-11 |
| 6. Detoxification/Resistance | 29 | 1.4e-10 |
| 7. Unknown function† | 96 | 1 |
| 1. mRNA splicing | 38 | 6.9e-17 |
| 2. Protein modification/degradation | 29 | 2.4e-04 |
| 3. G-protein-mediated signalling | 17 | 3,6e-06 |
| 4. Amino acid degradation | 14 | 4.4e-09 |
| 5. Oxidation of fatty acids | 10 | 1.1e-05 |
| 6. Unknown function† | 222 | 1 |
| 1. Biosynthesis, secondary§ | 105 | 3.0e-05 |
| 2. Protein fate, – modification | 89 | 7.4e-06 |
| 3. Oxidative stress response | 8 | 3.7e-03 |
| 4. tRNA modification | 7 | 3.4e-03 |
| 5. Unknown function† | 46 | 1 |
*Note that proteins can appear in more than one functional category. In addition, numerous proteins are assigned to additional functional classes that are not listed.
¶ Only those predicted proteins were considered that (1) show more than 20% identity with either in both organisms and (2) show at least 10 % less homology in the third partner
*Amino acids/vitamins/cofactors
**Amino acids/lipids/carbohydrates
$Salt and oxidative stress
§Polyamines, vitamins, co-factors and other secondary products
†Predicted proteins are not found in the COGs or FunCat database
Figure 5Proteins of the U. maydis – H. sapiens and S. cerevisiae – H. sapiens homology sets, which human counterpart have a predicted role in diseases. The classification was done according to main disease class in the genetic association database [47]. Note that U. maydis contains numerous genes that are thought to be involved in psychological disorders, such as ERCC1 and ERCC4.
Disease-related proteins with counterparts in U. maydis
| ACADM | acyl-CoA dehydrogenase | Um01049 | acyl-CoA dehydrogenase deficiency | 58.0 | 11.6 | [68] |
| ACADS | acyl-CoA dehydrogenase | Um04833 | acyl-CoA dehydrogenase deficiency | 35.2 | 11.2 | [69] |
| BZRP/PBR | Benzodiazepine receptor | Um06406 | Cancer | 27.4 | 10.1 | [70] |
| CES1 | Carboxylesterase | Um06071 | Cancer | 27.6 | 9.3 | [71] |
| CLPTM1 | unknown | Um02586 | Cleft lip development | 30.5 | 11.1 | [72] |
| DDC | Dopa decarboxylase | Um06083 | Parkinsons | 34.7 | 11.0 | [73] |
| DOCK3 | Dedicator of cytokinesis 3 | Um01178 | Attention deficit hyperactivity disorder | 23.1 | 12.3 | [74] |
| EPHX1 | Epoxide hydrolase | Um01938 | Cancer | 36.6 | 10.1 | [74] |
| GCDH | glutaryl-CoA dehydrogenase | Um01335 | glutaryl-CoA dehydrogenase deficiency | 54.1 | 13.9 | [75] |
| HEXA | Hexosaminidase A | Um00695 | Tay-Sachs disease | 20.1 | 12.7 | [76] |
| HMGCL | Methylglutaryl CoA lyase | Um02001 | methylglutaric CoA lyase deficiency | 43.6 | 11.4 | [77] |
| PIMT/PCMT1 | Methyl-transferase | Um11483 | Type 1 diabetes | 48.9 | 9.3 | [78] |
| PRSS16 | serine protease | Um10091 | Diabetes mellitus/susceptibility to autoimmunity | 20.7 | 6.6 | [79] |
| UGCG | Glucosyl-ceramide synthase | Um04496 | induced apoptosis | 22.3 | 8.1 | [80] |
¶ Identification tag for search at the MIPS server [13]
* Values are given as percent identity to human protein, all the U. maydis hits are best bidirectional
** Reference for the human protein/disease