| Literature DB >> 21435244 |
Junhuan Xu1, Rob Linning, John Fellers, Matthew Dickinson, Wenhan Zhu, Ivan Antonov, David L Joly, Michael E Donaldson, Tamar Eilam, Yehoshua Anikster, Travis Banks, Sarah Munro, Michael Mayo, Brian Wynhoven, Johar Ali, Richard Moore, Brent McCallum, Mark Borodovsky, Barry Saville, Guus Bakkeren.
Abstract
BACKGROUND: Rust fungi are biotrophic basidiomycete plant pathogens that cause major diseases on plants and trees world-wide, affecting agriculture and forestry. Their biotrophic nature precludes many established molecular genetic manipulations and lines of research. The generation of genomic resources for these microbes is leading to novel insights into biology such as interactions with the hosts and guiding directions for breakthrough research in plant pathology.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21435244 PMCID: PMC3074555 DOI: 10.1186/1471-2164-12-161
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
cDNA libraries produced and number of generated quality EST reads
| cDNA lib | source | designation a | # reads | unigenes b | singlets | contigs | predicted fungal c | stage-specific d |
|---|---|---|---|---|---|---|---|---|
| PT029 | aeciospores | pte | 1292 | 208 (16%) | 166 | 42 | 199 (95%) | 100 (48%) |
| PT030 | pycniospores | ptp | 4869 | 1252 (26%) | 807 | 445 | 1116 (89%) | 961 (77%) |
| PT031 | dormant teliospores | ptt | 3703 | 1165 (31%) | 878 | 287 | 1041 (89%) | 801 (69%) |
| PT033 | isolated haustoria | ptih | 4896 | 2886 (59%) | 2620 | 266 | 307 (11%) | 441 (15%) |
| PTDH | isolated haustoria | ptih | n/a | 1921 | 354 | 1567 | 1236 (64%) | 1069 (56%) |
| PTDG | germinated urediniospores | ptg | n/a | 1003 | 83 | 920 | 740 (74%) | 398 (40%) |
a life cycle stage-specific suffix, used at the end of all EST sequences: pte, aeciospores; ptp, pycniospores; ptt, teliospores; ptih, isolated haustoria; ptg, germinated urediniospores.
b non redundant sequences obtained via CAP3 analysis of life cycle stage-specific EST set (as percent of reads generated), yielding singlets and contigs
c number of unigenes predicted to be fungal (as percent of compiled unigenes for that stage)
d number of unigenes not represented in other stages (as percent of all unigenes from this library)
Figure 1Flow chart of . Gene prediction in the P. triticina EST sequences was done by the EST version of the GeneMarkS program [25]. The initial 2nd order model of protein coding regions were derived automatically; then from the set of predicted genes we selected a reduced set of those that were likely to be fungal genes (see text) and more specific 4th order model for fungal genes was derived. Frameshift detection in ESTs with one or more genes predicted in the same strand were analyzed by the frameshift prediction program GeneTack [28] (see text).
Figure 2Estimating . Distribution of numbers of EST/unigene sequences with 0, 1, 2 ... genes predicted by the 2nd order and the 4th order Markov model.
Similarity searches of Pt unigene sequences to related fungal databases
| < = -100 | 863 (867) | 490 (490) | 285 (285) | (3426) | 120 (120) | 79 (70) | 117 (117) | 68 (68) | 67 (67) |
| -100 < × ≤ -50 | 434 (442) | 749 (751) | 763 (763) | (678) | 151 (151) | 253 (254) | 434 (434) | 277 (277) | 249 (249) |
| -50 < × ≤ -20 | 452 (481) | 786 (815) | 957 (958) | (395) | 211 (213) | 377 (381) | 795 (796) | 533 (535) | 487 (489) |
| -20 < × ≤ -5 | 513 (612) | 753 (849) | 745 (764) | (250) | 270 (292) | 473 (516) | 758 (781) | 629 (656) | 656 (680) |
| TOTAL | 2262 (2402) | 2778 (2905) | 2750 (2770) | (4749) | 752 (776) | 1182 (1230) | 2104 (2128) | 1507 (1535) | 1459 (1485) |
a the 6,308 most-likely Pt unigene subset (as well as the complete 13,328 Pt unigene sequences: numbers in parentheses) were compared to available Pgt ESTs and genome sequences (upper and lower line in each cell, respectively), using the algorithms indicated
b results comparing the 20,567 and 18,241 (excluding mostly TE-related) gene calls, were identical
c early Pt genome Sanger and 454 sequences
d set of 3,297 published, most-likely P. striiformis f. sp. tritici EST sequences from NCBI dbEST [17,20,21]
e set of 16,831 predicted proteins from the Melampsora laricis-populina genome (http://genome.jgi-psf.org/Mellp1/Mellp1.home.html)
f predicted/valid transcripts (many matching ESTs, [43,56,57], Morrison et al. in preparation] and 364 additional EST unigenes not predicted in the Um genome at MIPS
g MUMDB (http://mips.helmholtz-muenchen.de/genre/proj/ustilago)
Figure 3Venn diagram depicting numbers of predicted putative fungal . The sizes of the circles reflect the pool size of unigenes for each stage: 369 Ptu (urediniospores), 697 Ptt (teliospores), 88 Pte (aeciospores) and 911 Ptp (pycniospores). One unigene was shared by all spore stages. All numbers reflect counts in that particular section only.
Stage-specific Pt unigene counts and representation in Pgt
| stage | # unique unigenes a | predicted fungal b | # matching | |
|---|---|---|---|---|
| ptp | 961 | 911 | 2246 | 563 (62%) |
| ptt | 801 | 697 | 2259 | 374 (54%) |
| pte | 100 | 88 | 818 | 47 (54%) |
| ptu | 486 | 369 | 1131 | 152 (41%) |
| ptg | 3308 | 475 (59%) | ||
| ptih | 1283 | 973 | 2430 | 292 (30%) |
a number of non-redundant sequences obtained from the indicated specific developmental stage only: ptp, pycniospores; ptt, teliospores; pte, aeciospores; ptu, dormant urediniospores; ptg, germinated urediniospores; ptih, isolated haustoria
b number of unigenes predicted to be fungal
c number of predicted fungal unigenes for that stage was normalized using quality reads in relation to the stage with the highest sampling, ptg (bold type)
d BLASTX search of Pgt genome-predicted proteins at e < 10-5 (percent of actual predicted fungal unigenes)
Figure 4Molecular relationships among some families of SSPs expressed during infection. (A) 31 related predicted SSPs common to four rust fungal species. Comparisons were seeded by Pt Contig5547 (arrow) which represents a gene expressed and, based on EST counts, possibly induced upon infection. Pt proteins predicted from the genome representing the unigenes were used in the comparison; several "unigenes" represented the same Pt protein indicating they did not collapse into one contig for various reasons (possibly representing allelic sequences). (B) Relationship of a small family of Puccinia-specific, small (~118 aa) SSPs; the two Pgt pairs are likely allelic proteins. Note that no proper gene calls exist (yet) for the Pt proteins and thus more members could exist. Relationships based on a ClustalW alignment (Additional file 10) were inferred using the Maximum Parsimony method (Close-Neighbor-Interchange algorithm, gaps and missing data were eliminated from the dataset). Bootstrap consensus tree (1000 replicates) with values at the nodes. SSPs indicated with an asterisk were used as outgroups. Phylogenetic analyses were conducted in MEGA4 [55].