| Literature DB >> 23236275 |
Robin A Ohm1, Nicolas Feau, Bernard Henrissat, Conrad L Schoch, Benjamin A Horwitz, Kerrie W Barry, Bradford J Condon, Alex C Copeland, Braham Dhillon, Fabian Glaser, Cedar N Hesse, Idit Kosti, Kurt LaButti, Erika A Lindquist, Susan Lucas, Asaf A Salamov, Rosie E Bradshaw, Lynda Ciuffetti, Richard C Hamelin, Gert H J Kema, Christopher Lawrence, James A Scott, Joseph W Spatafora, B Gillian Turgeon, Pierre J G M de Wit, Shaobin Zhong, Stephen B Goodwin, Igor V Grigoriev.
Abstract
The class Dothideomycetes is one of the largest groups of fungi with a high level of ecological diversity including many plant pathogens infecting a broad range of hosts. Here, we compare genome features of 18 members of this class, including 6 necrotrophs, 9 (hemi)biotrophs and 3 saprotrophs, to analyze genome structure, evolution, and the diverse strategies of pathogenesis. The Dothideomycetes most likely evolved from a common ancestor more than 280 million years ago. The 18 genome sequences differ dramatically in size due to variation in repetitive content, but show much less variation in number of (core) genes. Gene order appears to have been rearranged mostly within chromosomal boundaries by multiple inversions, in extant genomes frequently demarcated by adjacent simple repeats. Several Dothideomycetes contain one or more gene-poor, transposable element (TE)-rich putatively dispensable chromosomes of unknown function. The 18 Dothideomycetes offer an extensive catalogue of genes involved in cellulose degradation, proteolysis, secondary metabolism, and cysteine-rich small secreted proteins. Ancestors of the two major orders of plant pathogens in the Dothideomycetes, the Capnodiales and Pleosporales, may have had different modes of pathogenesis, with the former having fewer of these genes than the latter. Many of these genes are enriched in proximity to transposable elements, suggesting faster evolution because of the effects of repeat induced point (RIP) mutations. A syntenic block of genes, including oxidoreductases, is conserved in most Dothideomycetes and upregulated during infection in L. maculans, suggesting a possible function in response to oxidative stress.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23236275 PMCID: PMC3516569 DOI: 10.1371/journal.ppat.1003037
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Species used in this comparative study.
| Species | Taxonomy | Lifestyle | Host/substrate | Sequencing Center | Ref | |
| Class | Order | |||||
|
| ||||||
|
|
|
| Necrotrophic plant pathogen | Maize ( | JGI (USA) | (unpublished data) |
|
|
|
| Necrotrophic plant pathogen | Maize ( | JGI (USA) | (unpublished data) |
|
|
|
| Hemibiotrophic plant pathogen | Wheat, barley and other grasses | JGI (USA) | (unpublished data) |
|
|
|
| Hemibiotrophic plant pathogen | Maize ( | JGI (USA) | (unpublished data) |
|
|
|
| Necrotrophic plant pathogen |
| Washington State University (USA) |
|
|
|
|
| Necrotrophic plant pathogen | Barley ( | Curtin University, (Australia) |
|
|
|
|
| Necrotrophic plant pathogen | Wheat ( | BROAD (USA) |
|
|
|
|
| Hemibiotrophic plant pathogen |
| INRA/Genoscope (France) |
|
|
|
|
| Necrotrophic plant pathogen | Wheat ( | BROAD (USA) |
|
|
|
|
| Saprotrophic | Tree bark and wood | Oregon State University (USA) | |
|
|
|
| Saprotrophic | Various including plants | Oregon State University (USA) | |
|
|
|
| (Hemi)biotrophic plant pathogen | Poplar trees ( | University of British Columbia (Canada) | (unpublished data) |
|
|
|
| (Hemi)biotrophic plant pathogen | Poplar trees ( | JGI (USA) | (unpublished data) |
|
| Dothideomycetes | Capnodiales | Hemibiotrophic plant pathogen | Banana ( | JGI (USA) | (unpublished data) |
|
|
|
| Biotrophic plant pathogen | Tomato ( | Wageningen University & Research centre (The Netherlands) |
|
|
|
|
| Hemibiotrophic plant pathogen | Pines and other conifers | JGI (USA) |
|
|
|
|
| Hemibiotrophic plant pathogen | Wheat ( | JGI (USA) |
|
|
|
|
| Saprotrophic, extremophile | Ethanol vapor. Hard surfaces | JGI (USA) | (unpublished data) |
|
| ||||||
|
|
|
| Saprotrophic | Soil | AspGD consortium |
|
|
|
|
| Necrotrophic plant pathogen | Fruit and leaves of many species | BROAD |
|
|
|
|
| Saprotrophic | Dead plant material | BROAD |
|
|
|
|
| Necrotrophic plant pathogen | Wheat ( | BROAD |
|
|
|
|
| Hemibiotrophic plant pathogen/endophyte/saprotrophic | Many plant hosts; soil | BROAD |
|
|
|
|
| Hemibiotrophic plant pathogen | Rice ( | BROAD |
|
|
|
|
| Necrotrophic plant pathogen/saprotrophic | Many plant hosts; soil | JGI |
|
|
|
|
| Saprotrophic | Dead plant material, particularly after fires | BROAD |
|
|
|
|
| Saprotrophic | Fruit | SGD consortium |
|
|
|
|
| Necrotrophic plant pathogen | Many, primarily dicots | BROAD |
|
|
|
|
| Saprotrophic | Soil, woody surfaces | JGI |
|
|
|
|
| Necrotrophic plant pathogen | Many dicots | BROAD |
|
|
| ||||||
|
|
|
| Saprotrophic | Dung, plant material | BROAD |
|
|
|
|
| Animal pathogen | Humans | BROAD |
|
|
|
|
| Ectomycorrhizal | Many tree species | JGI |
|
|
|
|
| Obligate biotrophic plant pathogen | Poplar trees ( | JGI |
|
|
|
|
| Saprotrophic | Wood | JGI |
|
|
|
|
| Saprotrophic | Wood | JGI |
|
|
|
|
| Obligate biotrophic plant pathogen | Wheat ( | BROAD |
|
|
|
|
| Saprotrophic | Dead wood | JGI |
|
|
|
|
| Biotrophic plant pathogen | Maize ( | BROAD |
|
Additional genome-centric papers are planned.
Taxonomy and lifestyle of the 18 Dothideomycetes used in this study, as well as the outgroups for comparative purposes.
Figure 1Estimated phylogeny and divergence times of Dothideomycetes, based on sequences of three protein-coding genes.
Species with a sequenced genome that are included in this study are highlighted in dark blue. Vertical lines in blue and green indicate minimum and maximum ages for specific nodes, respectively. The age ranges for highlighted taxa are indicated by blocks with different shades of gray. Horizontal green lines indicate bootstrap recovery for specific nodes – thickened branches represent more than 70%, normal branches, 50–70% and less than 50% are indicated with dashed lines. In some cases relevant horizontal lines were stylistically extended to highlight node labels. Only families with multiple genomes are indicated. Orders, suborders and families that contain important plant-pathogenic species are colored brown and those containing majority lichenized species are green. Brown squares indicate plant pathogenic and green triangles lichenized species. Saprotrophs and fungi with other nutritional modes are not labeled.
Figure 2Phylogeny and genome characteristics of the 18 studied Dothideomycetes.
A. Genome-based phylogenetic tree of 18 Dothideomycetes computed using 51 conserved protein families. Bootstrap values are indicated on the branches. Lifestyles and strategies of pathogenesis (green circle for necrotrophs, orange circle for saprotrophs and blue circle for [hemi]biotrophs) are indicated. Aspergillus nidulans was used as an outgroup and its branch on the tree is not drawn to scale. B. Genome size and repeat content. Repeat content varies widely among Dothideomycetes, but in general the largest part consists of long terminal repeats. Asterisks indicate genomes that were sequenced exclusively with Illumina technology. Repeat content in these genomes is likely an underestimate. C. Number of predicted genes, broken down by level of conservation. D. Gene counts of classes that have been implicated in plant pathogenesis. Members of Capnodiales have fewer genes in these classes than Pleosporales and Hysteriales (with the exception of Cladosporium fulvum). This trend is also illustrated by the estimated gene counts for the last common ancestors of the indicated taxa (below the x-axis), which correspond to the taxa in (A). See also Figure S3. Bars on all graphs (B, C, and D) correspond to the organisms on the tree in (A).
Figure 3Whole-genome DNA comparison of Cochliobolus heterostrophus C5 to progressively distantly related organisms reveals the process leading from macrosynteny to mesosynteny.
A. Strains C4 and C5 of C. heterostrophus are progeny of C. heterostrophus backcrosses and show clear macrosynteny. B. When C. heterostrophus C5 is compared to C. sativus, macrosynteny is observed. However, intra-chromosomal inversions are observed in several comparisons of scaffold pairs. C. Numerous intra-chromosomal inversions have occurred in all scaffolds when compared to Setosphaeria turcica. D. A pattern of mesosynteny is observed when compared to Stagonospora nodorum. Syntenic regions are short and spread across the scaffold pairs. Scaffolds in this figure are not drawn to scale and only a subset of the scaffolds is depicted.
Figure 4Simulation of chromosome evolution leading to mesosynteny.
A. Two identical sequences show perfect macrosynteny. B. This is also the case for scaffold_1 of Cochliobolus heterostrophus C4 and scaffold_2 of C. heterostrophus C5, reflecting their close relationship as progeny. C. The two sequences from (A) have each undergone one random inversion. D. Scaffold_4 of C. heterostrophus C5 and scaffold_9 of C. sativus show a very similar pattern as in (C). E. The two sequences in (A) have each undergone 25 random inversions. F. Scaffold_8 of Setosphaeria turcica and part of scaffold_10 of C. heterostrophus C5 show a pattern of syntenic regions progressively spreading across the scaffolds similar to that in (E) G. The two sequences from (A) have each undergone 500 random inversions. Syntenic regions are short and spread homogeneously across the two scaffolds. H. Scaffold_1 of Dothistroma septosporum and scaffold_1 of Mycosphaerella populorum show a very similar pattern as in (G). Scaffolds in this figure are not drawn to scale.
Potential core and dispensable chromosomes in the genomes of Dothideomycetes.
| Organism | Category | Size excluding gaps (bp) | GC-content (%) | Gene count | Gene density (genes/Mbp) | Gene products with PFAM domain (%) | Repeat content (%) | Name |
|
| Core chromosomes | 35077646 | 52.33 | 10317 | 294.12 | 50.01 | 11.32 | |
| Dispensable chromosomes | 4602608 | 50.65 | 654 | 142.09 | 3.36 | 19.42 | chr_14, chr_15, chr_16, chr_17, chr_18, chr_19, chr_20, chr_21 | |
|
| Putative core scaffolds | 63951705 | 45.85 | 12699 | 198.57 | 50.76 | 37.12 | |
| Putative dispensable scaffolds | 9779308 | 40.71 | 408 | 41.72 | 2.45 | 56.72 | scaffold_11, scaffold_13, scaffold_14, scaffold_15, scaffold_16, scaffold_17, scaffold_18, scaffold_20, scaffold_21, scaffold_22, scaffold_23, scaffold_24, scaffold_25, scaffold_26 | |
|
| Putative core scaffolds | 42453795 | 45.55 | 12423 | 292.62 | 45.79 | 31.59 | |
| Putative dispensable scaffolds | 919483 | 35.42 | 42 | 45.68 | 9.52 | 89.36 | lm_SuperContig_22_v2, lm_SuperContig_29_v2 | |
|
| Putative core scaffolds | 38177662 | 51.46 | 11695 | 306.33 | 55.94 | 14.46 | |
| Putative dispensable scaffolds | 74746 | 41.92 | 7 | 93.65 | 14.29 | 72.32 | scaffold_28 | |
|
| Putative core scaffolds | 36907047 | 50.52 | 12380 | 335.44 | 51.61 | 2.77 | |
| Putative dispensable scaffolds | 142552 | 28.14 | 0 | 0.00 | 0.00 | 32.97 | scaffold_46 | |
|
| Putative core scaffolds | 36190278 | 49.81 | 13322 | 368.11 | 53.51 | 8.39 | |
| Putative dispensable scaffolds | 134153 | 40.04 | 14 | 104.36 | 7.14 | 85.38 | scaffold_27 |
Mycosphaerella graminicola has been shown previously to contain dispensable (i.e., not necessary for survival) chromosomes [24]. These chromosomes are smaller, less gene-dense and more repeat-rich than the core chromosomes. Proteins encoded by genes on these chromosomes less frequently contain a PFAM domain. Scaffolds with similar characteristics are also present in five other Dothideomycetes. Additional statistics for these scaffolds are given in Table S12.
Figure 5The full and core proteomes of the 18 Dothideomycetes.
A. The full proteome of the Dothideomycetes contains 215,225 proteins and for the majority of these the function according to KOG [93] is unknown or poorly characterized. B. The core proteome contains the 66,761 proteins from multi-gene families that had at least one member in each Dothideomycete. Relative to (A), this set of proteins has more KOG annotations than the full proteome. In particular genes involved in metabolism are over-represented.
Summary of expansion and depletion of PFAM domains and multi-gene families in various comparisons based on phylogeny and lifestyle (Table 1).
| PFAM domain | Multi-gene families | |||
| Comparison | Expanded (of which unique) | Depleted (of which absent) | Expanded (of which unique) | Depleted (of which absent) |
|
| 233 (2) | 37 (3) | 3358 (840) | 280 (116) |
| Dothideomycete plant pathogens versus other plant pathogens | 69 (10) | 21 (9) | 2098 (1411) | 1209 (1081) |
|
| 137 (39) | 67 (31) | 2995 (2468) | 2129 (1917) |
| Necrotrophic versus (hemi)biotrophic dothideomycete plant pathogens | 4 (4) | 21 (21) | 299 (299) | 1195 (1195) |
| Dothideomycete cereal pathogens versus other | 6 (6) | 14 (14) | 492 (492) | 359 (359) |
| Dothideomycete tree pathogens versus other | 4 (4) | 77 (77) | 1220 (974) | 2226 (2226) |
| Dothideomycete saprotrophs versus dothideomycete plant pathogens | 7 (7) | 25 (25) | 516 (511) | 551 (550) |
All expanded and depleted PFAM domains and multi-gene families (as well as the statistics) are given in Tables S14 and S15, respectively.
Figure 6Heat map of CAZY families in the Dothideomycetes.
Both the CAZY families and the organisms are hierarchically clustered. The clustering of organisms largely follows the phylogeny in Figure 2A. Notable exceptions are the observation that the biotroph C. fulvum clusters as an outgroup to the hemibiotrophs and saprotroph within the Capnodiales, and the observation that the two pathogens of Brassica spp. (L. maculans and A. brassicicola) cluster together.
Summary of gene classes that are over-represented in repeat regions (i.e., the 2000 bp flanking predicted transposable elements).
| TE repeat content (%) | Over-representation of gene classes in repeat regions | ||||
| Small secreted proteins | All secreted proteins | Secondary metabolism | Expanded orphan multi-gene families | ||
|
| 5.58 | • | |||
|
| 0.4 | ||||
|
| 44.24 | ||||
|
| 7.77 | • | • | • | |
|
| 0 | ||||
|
| 5.44 | • | • | • | |
|
| 0.67 | ||||
|
| 0.57 | ||||
|
| 30.93 | • | • | • | • |
|
| 38.97 | • | |||
|
| 11.66 | • | • | ||
|
| 20.81 | • | |||
|
| 3.56 | • | • | • | |
|
| 11.44 | • | • | • | • |
|
| 1.98 | • | |||
|
| 0.18 | ||||
|
| 11.16 | • | • | • | • |
|
| 2.37 | • | • | ||
See also Table S27 for more information. Genomes labeled with an asterisk (*) have been sequenced exclusively using Illumina technology.
Figure 7The RIP index (TpA/ApT) of genes as a function of the distance from a transposable element.
The RIP index is highest near the transposable elements and levels off after approximately 2000 bp, signifying that these regions are subjected to repeat induced point mutations.