| Literature DB >> 20615251 |
David L Joly1, Nicolas Feau, Philippe Tanguay, Richard C Hamelin.
Abstract
BACKGROUND: Obligate biotrophs such as rust fungi are believed to establish long-term relationships by modulating plant defenses through a plethora of effector proteins, whose most recognizable feature is the presence of a signal peptide for secretion. Since the phenotypes of these effectors extend to host cells, their genes are expected to be under accelerated evolution stimulated by host-pathogen coevolutionary arms races. Recently, whole genome sequence data has allowed the prediction of secretomes, facilitating the identification of putative effectors.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20615251 PMCID: PMC2996950 DOI: 10.1186/1471-2164-11-422
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of Melampsora cDNA libraries characteristics
| Species/Natural host | Material | Unisequences | Contigs | Singletons | Putative Sa (%) | Putative S+b (%) |
|---|---|---|---|---|---|---|
| Haustoria | 1148 | 427 | 721 | 54 (4.7) | 72 (6.3) | |
| 787 | 266 | 521 | 51 (6.5) | 92 (11.7) | ||
| 937 | 452 | 485 | 95 (10.1) | 131 (14.0) | ||
| 1547 | 571 | 976 | 98 (6.3) | 246 (15.9) | ||
| 1625 | 493 | 1132 | 107 (6.6) | 148 (9.1) |
aS: Initial predicted set of unisequences encoding putative secreted proteins.
bS+: Final set of unisequences encoding putative secreted proteins, following reassignments based on reciprocal BLAST.
Similarity of Melampsora unisequences to sequences from UniProtKB, BasidiomycotaDB and PuccinialesDB
| Library | % with homologues in | % with homologues in BasidiomycotaDB | % with homologues in PuccinialesDB | |||
|---|---|---|---|---|---|---|
| 22.6 | 13.9 | 21.5 | 13.9 | 29.6 | 23.6 | |
| 54.0 | 54.3 | 54.1 | 50.0 | 62.4 | 56.5 | |
| 56.2 | 50.4 | 54.7 | 45.8 | 64.9 | 56.5 | |
| 36.7 | 23.6 | 38.3 | 23.2 | 47.4 | 35.4 | |
| 47.4 | 43.9 | 49.3 | 37.2 | 59.6 | 54.7 | |
BasidiomycotaDB included Basidiomycota (excluding Pucciniales) EST sequence 6-frame translations and protein sequences from the non-redundant database of NCBI and gene models from Coprinus cinereus Okayama 7 (#130), Cryptococcus neoformans var. grubii serotype A, strain H99, Laccaria bicolor S238N-H82, Malassezia globosa CBS 7966, Phanerochaete chrysosporium RP78, Postia placenta MAD-698, Sporobolomyces roseus IAM 13481 and Ustilago maydis 521. PuccinialesDB included Pucciniales EST sequence 6-frame translations and protein sequences from the non-redundant database of NCBI and gene models from P. graminis f. sp. tritici CRL 75-36-700-3. BLASTX hits were considered significant when E-value ≤ 1e-4.
aNS: Final set of unisequences not predicted to encode putative secreted proteins.
bS+: Final set of unisequences encoding putative secreted proteins, following reassignments based on reciprocal BLAST.
Figure 1Gene Ontology classification of the . For each main GO category (A: Biological process; B: Molecular function; and C: Cellular component), percentages were based on the total number of ontologies found for unisequences encoding putative secreted proteins (S+) or non-secreted proteins (NS) (see values in D). Note that individual GO categories can have multiple mappings resulting in percentage values higher than 100%. Underlined GO categories are overrepresented in S+ compared with NS (***: significant at the 0.1% level; **: significant at the 1% level; *: significant at the 5% level).
Figure 2Serial analysis of gene expression: stage-specificity of putative secretome members of the . Similarities in serial analysis of gene expression (SAGE) patterns of 182 putative secretome members of the Melampsora unisequence dataset were determined using an average linkage hierarchical clustering according to the Spearman Rank Correlation. Corresponding tags were identified as described in the Methods section. Each row represents a tag, whereas each column corresponds to a SAGE library. 22H_C: 22 hours after inoculation (compatible interaction); 22H_I: 22 hours after inoculation (incompatible interaction); 2H: 2 hours after inoculation (germinating spores); 5D_C: 5 days after inoculation (compatible interaction); and 5D_I: 5 days after inoculation (incompatible interaction). Genes upregulated during the biotrophic or pre-biotrophic stages are highlighted in blue and yellow, respectively. Medium blue = genes specific to the biotrophic stage; Light blue = genes upregulated during the biotrophic stage; Dark yellow = genes upregulated in 22H_I; Medium yellow = genes upregulated in 22H_C; and Light yellow = genes upregulated in 2 H. The relative abundance of the SAGE tag in the library correlates with the intensity of the red color (black, not present; intense red, highly abundant). Brackets indicate clades containing homologues of haustorially expressed secreted proteins (HESPs) 735, 417, and 379, respectively.
Figure 3Accelerated sequence divergence of secreted proteins between predicted ORFs from the . The set of Melampsora predicted ORFs (2319 non-secreted ORFs, represented by white circles; and 267 secreted ORFs, represented by red circles) was compared with translated gene models of Puccinia graminis f. sp. tritici. Each radius ranges from 0 (center) to 100 (outer circle), representing BLASTP percent identity. A BLASTP hit was considered significant when E-value ≤ 1e-4. Position of ORFs along the circumference is random.
Figure 4Similarity of . Similarity of Melampsora A) non-secreted and B) secreted homologous gene groups (HGGs) to the proteomes of Puccinia graminis f. sp. tritici, Ustilago maydis and Sporobolomyces roseus. For each of the 369 Melampsora HGGs, a BLASTX search was performed against the proteomes of the other three species and SimiTri was used to plot the sequence similarity relationships between 417 consensus sequences derived from these HGGs and related species. Each tile in the graphics represents a unique consensus sequence and its relative position is computed from the raw BLAST scores derived above (with a cutoff of > 40). Hence, each tile's position indicates its degree of sequence similarity to each of the three selected databases. Sequences showing similarity to only one database are not shown. Sequences showing sequence similarity to only two databases appear on the lines joining the two databases. The position of positively selected HGGs is indicated by arrows. Tiles are colored by their highest BLASTX score for each of the databases: red ≥ 300; yellow ≥ 200; green ≥ 150; blue ≥ 100; and purple < 100.
Figure 5Elevated . The rates of non-synonymous nucleotide substitutions per non-synonymous site (dN) and the rates of synonymous nucleotide substitutions per synonymous site (dS) were calculated across all possible pairwise comparisons within each of the 369 HGGs using the maximum likelihood method implemented in the codeml program (runmode = -2) in the PAML 4 software package. The distribution of highest calculated ω (dN/dS ratios) among all pairwise comparisons within each HGG is shown. In cases where dN > 0 and dS = 0 (i.e. ω = ∞), the second higher ratio was selected.
Characteristics of the Melampsora homologous gene groups (HGGs) predicted to be under positive selection (at least one pairwise ω = dN/dS > 1.2)
| HGG | NS/Sa | Length | BLASTX | E-value | BLASTX | E-value | PFAM | Cys | ωb |
|---|---|---|---|---|---|---|---|---|---|
| 205 | NS | 115 | No hit | No hit | - | ND | 2.68 | ||
| 254 | NS | 200-202 | No hit | PGTT03363 | 1e-10 | - | ND | 2.59 | |
| 1278 | NS | 173-175 | XP_001211022 conserved hypothetical protein | 3e-07 | PGTT11073 | 1e-34 | - | ND | 1.88 |
| 28 | S | 161 | No hit | No hit | - | 0 | 1.38 | ||
| 92 | S | 271-276c (227)d | AAS45284 proline-rich antigen | 4e-09 | PGTT01207 | 2e-13 | CFEM | 6 (2)d | 1.43 |
| 729 | S | 269 | EAT81533 hypothetical protein | 7e-11 | PGTT12331 | 7e-61 | - | 8 | 1.32 |
| 747 | S | 163 | XP_757360 hypothetical protein | 1e-04 | PGTT02151 | 2e-14 | - | 4 | 1.95 |
| 4191 | S | 129 | No hit | No hit | - | 4 | 1.57 | ||
| 5606 | S | 119-123e | No hit | No hit | - | 6 | 1.76 | ||
| 5610 | S | 98 | No hit | No hit | - | 6g | 4.31 | ||
| 5617 | S | 168-169f | No hit | No hit | - | 0g | 1.23 | ||
| 5624 | S | 176 | No hit | No hit | - | 8 | 1.93 | ||
| 6067 | S | 285 | XP_758577 hypothetical protein | 6e-06 | PGTT13234 | 5e-21 | - | 10g | 1.27 |
BLASTX and PFAM hits were considered significant when E-value ≤ 1e-4 and 1e-5, respectively.
aNS: non-secreted; S: secreted.
bHighest calculated dN/dS ratio among all pairwise comparisons within each HGG.
cLength is 271 (M. medusae f. sp. deltoidae, M. medusae f. sp. tremuloidae and M. occidentalis), and 276 (M. larici-populina); an 86 amino acids insertion and a 5 amino acids minisatellite are also present in a M. medusae f. sp. deltoidae unisequence (length: 357) and a M. larici-populina unisequence (length: 281), respectively.
dA 44 amino acids deletion (including 4 Cys residues) is present in a M. medusae f. sp. deltoidae unisequence.
eLength is 119 (M. medusae f. sp. tremuloidae) and 123 (M. occidentalis).
fA 12 amino acids minisatellite is present in a M. occidentalis unisequence (length: 181).
gAdditional Cys residues are present before the signal peptide cleavage site.