| Literature DB >> 19019209 |
Jorge M C Mondego1, Marcelo F Carazzolle, Gustavo G L Costa, Eduardo F Formighieri, Lucas P Parizzi, Johana Rincones, Carolina Cotomacci, Dirce M Carraro, Anderson F Cunha, Helaine Carrer, Ramon O Vidal, Raíssa C Estrela, Odalys García, Daniela P T Thomazella, Bruno V de Oliveira, Acássia Bl Pires, Maria Carolina S Rio, Marcos Renato R Araújo, Marcos H de Moraes, Luis A B Castro, Karina P Gramacho, Marilda S Gonçalves, José P Moura Neto, Aristóteles Góes Neto, Luciana V Barbosa, Mark J Guiltinan, Bryan A Bailey, Lyndel W Meinhardt, Julio Cm Cascardo, Gonçalo A G Pereira.
Abstract
BACKGROUND: The basidiomycete fungus Moniliophthora perniciosa is the causal agent of Witches' Broom Disease (WBD) in cacao (Theobroma cacao). It is a hemibiotrophic pathogen that colonizes the apoplast of cacao's meristematic tissues as a biotrophic pathogen, switching to a saprotrophic lifestyle during later stages of infection. M. perniciosa, together with the related species M. roreri, are pathogens of aerial parts of the plant, an uncommon characteristic in the order Agaricales. A genome survey (1.9x coverage) of M. perniciosa was analyzed to evaluate the overall gene content of this phytopathogen.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19019209 PMCID: PMC2644716 DOI: 10.1186/1471-2164-9-548
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Flow diagram of bioinformatics procedures applied in M. perniciosa genome survey.
Comparison between assembly values and values calculated using Lander Waterman theory
| # Clusters (contigs+singlets) | 25,056 | 24,950 |
| # Contigs | 17,991 | 18,370 |
| # Gaps | - | 24,951 |
| Average cluster size (bp) | 1,065 | 1,152 |
| Average gap size (bp) | - | 413 |
Figure 2Number of Left ellipse: gene models predicted by ab initio methods. Right ellipse: gene models predicted by extrinsic methods. The intersection contains gene models detected by both methods. Underlined: number of gene models with BLASTX-NR E-value similarity ≤ 1e-10. In italics: number of gene models with BLASTX-NR E-value similarity > 1e-10.
Moniliophthora perniciosa predicted gene comparisons
| 9910 (60.6%) | 4811 (29.27%) | |
| 10649 (65.1%) | 3281 (19.97%) | |
| 9311 (57.0%) | 2021 (12.30%) | |
| 5904 (36.1%) | 57 (0.35%) | |
| 5645 (34.5%) | 39 (0.2%) | |
| 5488 (33.6%) | 42 (0.26%) | |
| 5454 (33.4%) | 78 (0.47%) | |
| 5127 (31.4%) | 16 (0.10%) | |
| 3296 (20.2%) | 4 (0.02%) | |
| All NR database | 11349 (69.5%) | _ |
* – Each M. perniciosa gene model was compared to all the proteins from the organisms listed in the table. BLASTX score was defined as 1e-10
Figure 3Correlation between the average number of M perniciosa gene models and the length of C. cinereus proteins.
Figure 4Correlation of various organisms' genome size and number of genes (Gene Density). Error bar in M. perniciosa data point depicts the Standard Deviation (SD ± 0.05) of Gene/Kbp ratio using ab initio predicitions, extrinsic predictions and the sum of both predictions (see text).
Moniliophthora perniciosa genome survey features
| Contigs | 17,991 |
| Contigs size (average) | 1.3 Kbp |
| Singlets | 7,065 |
| Singlets size (average) | 455 bp* |
| Sum of all reads | 75 Mbp |
| Assembly size | 26.7 Mbp |
| Estimated genome size | 39 Mbp |
| Coverage | 1.92× |
| Gene models | |
| | 13,640 |
| Extrinsic gene models | 12,249 |
| Total gene models | 16,329 |
| Gene density | 2.77 Kbp/gene (SD ± 0.39) |
| 0.36 gene/Kbp (SD ± 0.05) | |
| Average gene size | 651.4 bp |
| GC content overall | 47.7 |
| GC content coding | 49.7 |
| GC content non coding | 46.8 |
| Intron size (mode) | 52 bp |
| Intron size (average) | 63 bp |
| Exon size (mode) | 60 bp |
| Exon size (average) | 166 bp |
| AutoFACT annotation | |
| Classified (Hits) | 11,950 (73%) |
| Assigned (Conserved expressed proteins) | 7,416 (45%) |
| Unassigned (Conserved hypothetical proteins) | 4,534 (28%) |
| Unclassified (No Hits) | 4,379 (27%) |
* – Singlet size after LUCY quality trimming (phred ≥ 16).
MCL analysis of M. perniciosa gene models
| 1 | 175 | 1.24987 | 140 | Cytochrome P450 monooxygenase |
| 2 | 143 | 1.82632 | 78 | Gypsy like-retrotranspon |
| 4 | 84 | 1.35205 | 62 | Conserved hypothetical protein |
| 3 | 84 | 1.46889 | 57 | Alcohol oxidases and GMC oxidoreductase |
| 5 | 82 | 1.26479 | 65 | Gypsy like-retrotranspon |
| 6 | 76 | 1.57535 | 48 | Serine threonine kinase |
| 7 | 64 | 1.54312 | 41 | Conserved hypothetical protein |
| 8 | 59 | 1.54180 | 38 | Conserved hypothetical protein |
| 9 | 57 | 1.46370 | 39 | Proteins containing WD-40 and NACHT domain |
| 10 | 55 | - | - | |
| 11 | 52 | 1.34190 | 39 | Cytochrome P450 |
| 12 | 50 | 1.52169 | 33 | Serine threonine kinase |
| 13 | 46 | 1.18098 | 39 | Conserved hypothetical protein |
| 14 | 43 | 1.21555 | 35 | Cytochrome P450 |
| 15 | 41 | 1.45275 | 28 | Carboxylesterase/lipase |
| 16 | 39 | 1.67524 | 23 | Gypsy like-retrotranspon |
| 17 | 38 | 1.26598 | 30 | Hexose transporter |
| 19 | 35 | 1.11159 | 31 | MFS transporter |
| 18 | 35 | 1.43881 | 24 | Splicing factor |
| 20 | 32 | 1.17902 | 27 | Laccase – Mulitcopper oxidase |
a – Number of gene models present in each family
b – Normalization Factor used to normalize the number of gene models present in each family (see methods)
c – Normalization number of gene models present in each family
d – Manual annotation of MCL families according to BLAST similarity analysis
Top 20 CDD-PFAM domains in M. perniciosa proteins
| gnl|CDD|40168 | pfam00067, Cytochrome P450 | 256 | 1.57% | 1 | 1.3565 | 188.71427 | 1.15% | 1 |
| gnl|CDD|40170 | pfam00069, Protein kinase | 141 | 0.86% | 2 | 1.6059 | 87.80184 | 0.54% | 2 |
| gnl|CDD|40184 | pfam00083, Sugar (and other) transporters | 89 | 0.54% | 3 | 1.6902 | 52.65651 | 0.32% | 3 |
| gnl|CDD|40206 | pfam00106, adh_short, short chain dehydrogenase | 84 | 0.51% | 4 | 1.7141 | 49.00442 | 0.30% | 4 |
| gnl|CDD|40235 | pfam00135, COesterase, Carboxylesterase | 57 | 0.35% | 5 | 1.4738 | 38.67603 | 0.24% | 5 |
| gnl|CDD|40207 | pfam00107, ADH_zinc_N, Zinc-binding dehydrogenase | 43 | 0.26% | 8 | 1.2398 | 34.68221 | 0.21% | 6 |
| gnl|CDD|40813 | pfam00732, GMC_oxred_N, GMC oxidoreductase | 52 | 0.32% | 6 | 1.5402 | 33.76192 | 0.21% | 7 |
| gnl|CDD|40345 | pfam00248, Aldo_ket_red, Aldo/keto reductase family | 41 | 0.25% | 9 | 1.2881 | 31.83088 | 0.19% | 8 |
| gnl|CDD|41245 | pfam01185, Hydrophobin, Fungal hydrophobin | 26 | 0.16% | 21 | 0.8460 | 30.73119 | 0.19% | 9 |
| gnl|CDD|40253 | pfam00153, Mito_carr, Mitochondrial carrier protein | 38 | 0.23% | 12 | 1.3890 | 27.35860 | 0.17% | 10 |
| gnl|CDD|40748 | pfam00665, rve, Integrase core domain | 50 | 0.31% | 7 | 1.8943 | 26.39449 | 0.16% | 11 |
| gnl|CDD|40493 | pfam00400, WD40, WD domain, G-beta repeat | 35 | 0.21% | 13 | 1.5151 | 23.10130 | 0.14% | 12 |
| gnl|CDD|40179 | pfam00078, RVT, Reverse transcriptase | 39 | 0.24% | 11 | 1.7620 | 22.13448 | 0.14% | 13 |
| gnl|CDD|45101 | pfam05199, GMC_oxred_C, GMC oxidoreductase | 28 | 0.17% | 16 | 1.2741 | 21.97654 | 0.13% | 14 |
| gnl|CDD|40588 | pfam00501, AMP-binding, AMP-binding enzyme | 40 | 0.24% | 10 | 1.8537 | 21.57819 | 0.13% | 15 |
| gnl|CDD|41412 | pfam01360, Monooxygenase, Monooxygenase | 25 | 0.15% | 22 | 1.2253 | 20.40382 | 0.12% | 16 |
| gnl|CDD|40107 | pfam00005, ABC_tran, ABC transporter | 33 | 0.20% | 14 | 1.7198 | 19.18851 | 0.12% | 17 |
| gnl|CDD|40367 | pfam00270, DEAD, DEAD/DEAH box helicase | 30 | 0.18% | 15 | 1.5947 | 18.81230 | 0.12% | 18 |
| gnl|CDD|40419 | pfam00324, AA_permease, Amino acid permease | 26 | 0.16% | 20 | 1.4641 | 17.75798 | 0.11% | 19 |
| gnl|CDD|41783 | pfam01753, zf-MYND, MYND finger | 25 | 0.15% | 23 | 1.4288 | 17.49737 | 0.11% | 20 |
a – Number of CDD entry
b – Number and description of
PFAM domain
c – Number of gene models containing each CDD-PFAM domain
d – Percentage of gene models containing each CDD-PFAM domain in relation to total number of M. perniciosa gene models (proteins).
e – Non-normalized CDD-PFAM ranking
f – Factor used to normalize the number of gene models containing each CDD-PFAM domain (see methods)
g – Normalized number of gene models containing a CDD-PFAM domain
h – Percentage of normalized number of gene models containing each CDD-PFAM domain in relation to total number of M. perniciosa gene models (proteins)
i – Normalized M. perniciosa CDD-PFAM ranking
Figure 5Comparison of . (A) comparison between cytochrome P450 monooxygenases, carboxylesterases, deuterolysins, thaumatins and aegerolysins; (B) comparison of plant cell wall degrading enzymes from fungi that interact with plants. Mp = Moniliophthora perniciosa, Lb = Laccaria bicolor, Cc = Coprinopsis cinerea, Pc = Phanerochaete chrysosporium, Um = Ustilago maydis, Mg = Magnaporthe grisea, Cn = Cryptococcus neoformans, Fg = Fusarium graminearum, Nc = Neurospora crassa, Sc = Saccharomyces cerevisiae. The legend at the right refers to the nomenclature of plant cell wall degrading enzymes according to CAZy .
Figure 6Indole-3-acetate (IAA) biosynthesis pathways. M. perniciosa gene models are annotated in red.
Figure 7An illustrated hypothetical model for WBD. Model correlating classical symptoms of green and dry broom in the field (A, C and E), M. perniciosa development inside cacao (B: biotrophic stage, D: transition from biotrophic to saprotrophic stage, F: saprotrophic stage) and molecular and cellular events displayed by the fungus in each developmental stages, based on genes annotated in genome survey (right side of the panel). Notice in B (right side) the presence of biotrophic mycelia in the apoplast surrounded by intact living cells. Also notice in C the presence of biotrophic mycelia (arrowhead) and saprotrophic mycelia (double arrowhead) inside a necrotic region. Micrographs scales: B left side: 15 μM; B right side: 25 μM; D: 50 μM; F: 50 μM.