| Literature DB >> 26643441 |
M A Van der Nest1, E T Steenkamp2, A R McTaggart3, C Trollip4, T Godlonton5, E Sauerman6, D Roodt7, K Naidoo8, M P A Coetzee9, P M Wilken10, M J Wingfield11, B D Wingfield12.
Abstract
BACKGROUND: Proteins in the Glycoside Hydrolase family 32 (GH32) are carbohydrate-active enzymes known as invertases that hydrolyse the glycosidic bonds of complex saccharides. Fungi rely on these enzymes to gain access to and utilize plant-derived sucrose. In fungi, GH32 invertase genes are found in higher copy numbers in the genomes of pathogens when compared to closely related saprophytes, suggesting an association between invertases and ecological strategy. The aim of this study was to investigate the distribution and evolution of GH32 invertases in the Ceratocystidaceae using a comparative genomics approach. This fungal family provides an interesting model to study the evolution of these genes, because it includes economically important pathogenic species such as Ceratocystis fimbriata, C. manginecans and C. albifundus, as well as saprophytic species such as Huntiella moniliformis, H. omanensis and H. savannae.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26643441 PMCID: PMC4672557 DOI: 10.1186/s12862-015-0550-7
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Genome information of the Huntiella, Ceratocystis and Sordariomycetes species included in this study
| Taxon | Assembly size (Mbp) | Number of scaffoldsa | Number of gene modelsb | Referencesc |
|---|---|---|---|---|
|
| 31.71 | 980 | 7 494 | Van der Nest et al. 2014. |
|
| 29.40 | 2 641 | 7 266 | Wilken et al. 2013 |
|
| 27.15 | 939 | 6 967 | Van der Nest et al. 2014. |
|
| 25.43 | 365 | 6 832 | Van der Nest et al. 2014. |
|
| 28.54 | 361 | 7 361 | This study |
|
| 31.50 | 1 638 | 8 395 | Van der Nest et al. 2014. |
|
| 54.42 | 13 | 9 521 | JGI |
|
| 56.23 | 786 | 15 755 | JGI |
|
| 47.67 | 686 | 16 992 | JGI |
|
| 33.69 | 235 | 10 364 | Xiao et al. 2012 Scientific Reports 2 |
|
| 34.90 | 37 | 11 124 | JGI |
|
| 51.60 | 653 | 12 006 | O’Connell et al. 2012 Nat Genet 2012 44:1060–5 |
|
| 49.08 | 10 235 | 16 172 | O’Connell et al. 2012 Nat Genet 2012 44:1060–5 |
|
| 42.38 | 135 | 13 657 | JGI |
|
| 32.27 | 32 | 9 651 | Zheng et al. 2011 Genome Biol 12:R116 |
|
| 43.90 | 26 | 11 609 | JGI |
|
| 37.55 | 398 | 11 173 | JGI |
|
| 54.01 | 2 334 | 11 685 | Blanco-Ulate et al. 2013 Genome Announc 1:e00390–13 |
|
| 43.83 | 12 | 14 813 | Wiemann et al. 2013 PLoS Pathog 9:e1003475 |
|
| 36.45 | 31 | 13 322 | Cuomo et al. 2007 Science 317:1400–2 |
|
| 61.36 | 114 | 17708 | JGI |
|
| 41.78 | 36 | 14 188 | JGI |
|
| 51.15 | 72 | 15 707 | JGI |
|
| 50.04 | 307 | 15 777 | JGI |
|
| 58.84 | 119 | 18 975 | JGI |
|
| 29.79 | 289 | 8 312 | DiGuistini et al. 2011 PNAS 108:2504–9 |
|
| 37.70 | 899 | 11 712 | JGI |
|
| 63.66 | 325 | 22 250 | JGI |
|
| 39.42 | 241 | 9 849 | Gao et al. 2011 PLoS Genet 7:e1001264 |
|
| 39.15 | 176 | 10 583 | Gao et al. 2011 PLoS Genet 7:e1001264 |
|
| 38.74 | 7 | 9 110 | Berka et al. 2011 Nature Biotech 29:922–927 |
|
| 41.04 | 20 | 10785 | JGI |
|
| 176.0 | 37.3 | 9 948 | JGI |
|
| 39.10 | 81 | 10 380 | Ellison et al. 2011 Genetics 189:55–69 |
| Ophiostoma piceae UAMH 11346 | 32.84 | 45 | 8 884 | Haridas et al. 2013 BMC Genomics 14:373 |
|
| 47.47 | 624 | 8 834 | Blanco-Ulate et al. 2013 Genome Announc 1, e00390–13 |
|
| 35.01 | 7 | 10588 | JGI |
|
| 43.45 | 29 | 9 411 | JGI |
|
| 40.66 | 153 | 9 204 | JGI |
|
| 32.74 | 109 | 11 942 | JGI |
|
| 30.99 | 69 | 10 954 | JGI |
|
| 31.18 | 251 | 11 338 | JGI |
|
| 36.91 | 6.00 | 9 813 | Berka et al. 2011 Nature Biotech 29:922–927 |
|
| 36.10 | 29.00 | 11 863 | JGI |
|
| 40.87 | 2 282 | 13 932 | JGI |
|
| 40.98 | 532 | 14 095 | JGI |
|
| 40.87 | 2 282 | 13 932 | JGI |
|
| 39.00 | 93 | 12 427 | JGI |
|
| 34.10 | 89 | 9 129 | JGI |
|
| 33.83 | 52 | 10 535 | Klosterman et al. 2011 PLoS Pathogens 7: e1002137 |
a Number of scaffolds/supercontigs that make up an individual genome assembly
b Number of the genes predicted for each genome assembly
c JGI = Joint Genome Institute portal (www.jgi.doe.gov)
Statistics of the Huntiella savannae genome assembly and gene annotations
| Summary data |
|
|---|---|
| Total reads before trim (bp) | 33 168 540 |
| Total reads after trim (bp) | 33 055 449 |
| Average length of reads before trim (bp) | 101 |
| Average length of reads after trim (bp) | 85.68 |
| Number of scaffolds | 361 |
| Total sequence length (Mb) | 28.54 |
| Largest scaffold (bp) | 1 009 760 |
| N50 Scaffold size (bp) | 229 095 |
| GC % | 47.39 |
| Predicted gene models | 7 687 |
| CEGMAa | 96.37 |
a Genome completeness was evaluated using the Core Eukaryotic Genes Mapping Approach (CEGMA) pipeline [38]
GH32 family members identified in Huntiella and Ceratocystis
| Gene Name | Taxona | Genomic Locationb | Length (bp) | Length (aa) |
|---|---|---|---|---|
| CmINV_V |
| JJRZ01000038:33284-34837 | 1945 | 627 |
| CmINV_CW |
| JJRZ01000038:36427-37962 | 1952 | 625 |
| CfINV_V |
| APWK02000925:4324-5817 | 1945 | 627 |
| CfINV_CW |
| APWK02000924:3-1490 | 1938 | 625 |
| CaINV_V |
| JSSU01001085:20903-22444 | 1829 | 627 |
| CaINV_CW |
| JSSU01001085:17498-19033 | 1952 | 625 |
| HmINV_CW |
| JMSH01000004:78747-80591 | 1848 | 615 |
| HaINV_CW |
| NODE_2:199823- 201667 | 1848 | 615 |
| HoINV_CW |
| JSUI01006495:45419-47263 | 1848 | 615 |
a The GenBank accesssion numbers for the C. manginecans, C. fimbriata and C. albifundus genomes are JJRZ01000000, APWK00000000 and JSSU00000000. Those for the H. moniliformis, H. savannae and H. omanensis genomes are JMSH00000000, LCZG00000000 and SUI00000000
b The name of the contig is followed by the nucleotide position of the gene within the contig
Fig. 1Glycoside hydrolase 32 (GH32) gene family expansions and contractions mapped onto the Sordariomycetes chronogram. Significant (P< 0.05) expansions (indicated with red lines) and contractions (indicated with green lines) were inferred using CAFE v3.1 (Computational Analysis of gene Family Evolution) [45]. The probable ancestral gene family size for each node is indicated within white circles, while the family sizes in extant species are indicated at the tips of terminal branches. Colour-coding designates the Sordariomycetes taxa to either order or family level. The chronogram was inferred in this study (see Additional file 2: Figure S1). The sequences from Dothideomycetes were used for outgroup purposes
Fig. 2Alignment of the conserved motifs of the glycoside hydrolase 32 (GH32) enzymes. These include conserved regions (labelled A-G) and various amino acids (shown with black stars). The N-terminal β-propeller module (indicated in the blue block) and the C-terminal β-sandwich module (indicated in the red block) are also highlighted. The translated sequences of one group of the Ceratocystis GH32 gene possess a trans-membrane domain (shown with dotted lines) characteristic of vacuolar invertases [5], while the translated sequences of the other Ceratocystis GH32 gene and the Huntiella GH32 gene possess an eukaryotic secretion signal (shown with dotted lines) needed for secretion [60]
Fig. 33D structure of the C. manginecans invertase (CmINV-CW). Roman numerals (I–V) show the five blades of the β-propeller module, while the C-terminal β-sandwich module is indicated in dark red. These structures were inferred with the Swiss-Model Web server (http://www.expasy.org/swissmod/SWISS-MODEL.html) by making use of a fructosyltransferase from Aspergillus japonicus (PDB id: 3lfi.1) as template
Fig. 4The predicted genes flanking the Glycoside hydrolase 32 (GH32) gene family members in Huntiella and Ceratocystis. Genes present on the scaffolds harbouring the putative invertases were predicted using AUGUSTUS [32] and annotated using Blast2GO [44]. Note that the genes are not drawn to scale. The Huntiella GH32 family gene is flanked by putative G1/S-specific cyclin Pcl5 (Colletotrichum orbiculare, ENH86823), RNAse P Rpr2/Rpp21 subunit domain-containing protein (Gaeumannomyces gramini, EPQ63823), Malate synthase-like protein (Acremonium chrysogenum, XP003651419), serine/threonine-protein kinase (Metarhizium acridum, EFY93082.1), nitrogen response regulator (Colletotrichum gloeosporioides, ELA29612.1), DEAD/DEAH box helicase (Colletotrichum sublineola, KDN64774), 2-dehydropantoate 2-reductase (Colletotrichum gloeosporioides, EQB48758), and structural maintenance of chromosomes 5 (Villosiclava virens, KDB17190) genes. The two Ceratocystis GH32 family genes were flanked by putative Phosphatidylinositol-specific phospholipase (Metarhizium anisopliae, KFG82763), putative WD domain-containing protein (Togninia minima, EOO00810.1), reverse transcriptases (Sclerotinia sclerotiorum, XP_001588999 and Blumeria graminis, CCU77161), transcription elongation factor 5 (Scedosporium apiospermum, KEZ42236), adenylate kinase (Magnaporthe oryzae, XP003716198), and Fot5 transposase (Colletotrichum gloeosporioides, ELA33194.1) genes
Fig. 5Maximum likelihood phylogeny of the Sordariomycetes Glycoside Hydrolase 32 (GH32) gene family. Representative sequences of the 8 groups that span the fungal GH32 gene phylogeny [2] were included in this study. GenBank accession numbers or sequence identifiers from genome projects for each of these sequences are provided in parentheses. Percentage bootstrap support (based on a 1000 repeats) is indicated at the internodes. The exon-intron structure of the genes is diagrammatically indicated next to each taxon where gaps within solid lines indicate intron positions. Colour-coding designates the groups previously identified [2]. The sequences from Arabidobsis thaliana were used for outgroup purposes
Fig. 6The inferred evolutionary history of the Ceratocystidaceae Glycoside hydrolase 32 (GH32) gene family and the orthology relationships among these genes. a The evolutionary tree shows nine homologous genes from six species (A). The Huntiella cell wall invertase genes are depicted as HsINV-CW, HmINV-CW and HoINV-CW, while the Ceratocystis vacuolar invertase genes are depicted as CaINV-V, CmINV-V and CfINV-V and those encoding the Ceratocystis cell wall invertases as CaINV-CW, CmINV-CW and CfINV-CW. As indicated by CAFE, the genome of the Ceratocystidaceae ancestor likely encoded two invertase (INV) genes, one of which (depicted by the grey line) was subsequently lost from both the Ceratocystis and Huntiella lineages (depicted by grey broken line) before the radiation of species. However, the remaining invertase gene (depicted in orange) was duplicated in the Ceratocystis ancestor resulting in the two invertase genes encoded by the genomes of the extant species. This duplication was also reconstructed using NOTUNG 2.6 which detects duplications based on gene tree to species tree reconciliation [89] (results not shown). All of the invertase genes in the extant Ceratocystis and Huntiella species thus evolved from the same ancestral gene in the Ceratocystidaceae ancestor (depicted by the orange line). The respective Ceratocystis genes each evolved through vertical decent after their emergence (i.e., gene duplication) in the last common ancestor. b Following the standard nomenclature for duplicated genes (reviewed by Koonin [36]), the Huntiella cell wall invertase genes share an orthologous relationship (i.e., orthologs are related via speciation and are derived via vertical decent from the common ancestor). The same is also true for the respective cell wall and vacuolar invertase genes of Ceratocystis, where each represent a set of orthologs. Because the duplication that gave rise to the Certocystis genes occurred before radiation of this genus, the Ceratocystis cell wall and vacuolar invertase genes represent outparalogs (i.e., homologs that derive from a gene duplication event that precedes lineage radiation/speciation) [36]. However, all of the Ceratocysistis invertase genes represent co-orthologs of the gene in Huntiella. This is because the lineage-specific duplication in Ceratocystis gave rise to a set of genes that are collectively orthologous to those of Huntiella [36]
Fig. 7Maximum likelihood phylogeny of the Fot5 DDD catalytic domain. This analysis was done using the WAG substitution model [49] and gamma correction to account for among site rate variation. The Ceratocystis Fot5 sequences are included in the grey area and indicated according to species (green dots = C. albifundus, blue dots = C. fimbriata, red dots = C. manginecans). The branch labelled with an asterisk received 81 % bootstrap support based on the analysis of 1000 pseudoreplicates (see Additional file 4: Figure S2 for full information regarding bootstrap support for the tree, as well as the sequence identifiers of putative Ceratocystis Fot5 homologs and Additional file 3: Table S2 for their genomic coordinates). GenBank accession numbers or for previously identified Fot5 homologs are: Fot2 [Genbank:JN624854, F. oxysporum), Fot5 [Genbank:CAE55867, F. oxysporum], Fot1 [Genbank:X64799, F. oxysporum], Fot4 [Genbank:AF076632, F. oxysporum], Fot9 [JGI:2517, F. graminearum], Fotyl [Genbank:CAG33729.1 Yarrowia lipolytica], Molly [Genbank:CAD32687, Parastagonospora nodorum], Ophio [Genbank:ABG26269, Ophiostoma novo-ulmi], PABRA [Genbank:ACY56713, Paracoccidioides brasiliensis], Pixie [Genbank:CAD32689, Parastagonospora nodorum], Pot2 [Genbank:CAA83918, Magnaporthe grisea], Pot3 [Genbank:AAC49418, M. grisea], SCSCL [Genbank:XP001592252, Sclerotinia sclerotiorum], Taf1 [Genbank:AAX83011, Aspergillus fumigatus], Tan1 [Genbank:U58946, Aspergillus awamori] USMA [Genbank:UM03882, Ustilago maydis), Flipper [Genbank:AAB63315, Botryotinia fuckeliana] and Cirt1 [Genbank:XP710204, Candida albicans]