| Literature DB >> 23496816 |
Shima Khoshraftar1, Stacy Hung, Sadia Khan, Yunchen Gong, Vibha Tyagi, John Parkinson, Mohini Sain, Alan M Moses, Dinesh Christendat.
Abstract
BACKGROUND: The ascomycete fungus Ophiostoma ulmi was responsible for the initial pandemic of the massively destructive Dutch elm disease in Europe and North America in early 1910. Dutch elm disease has ravaged the elm tree population globally and is a major threat to the remaining elm population. O. ulmi is also associated with valuable biomaterials applications. It was recently discovered that proteins from O. ulmi can be used for efficient transformation of amylose in the production of bioplastics.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23496816 PMCID: PMC3618308 DOI: 10.1186/1471-2164-14-162
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
General characteristics of the genome
| Size assembled genome (Mb) | 31.5 |
| GC content overall (%) | 50.02 |
| GC content (coding) (%) | 57.8 |
| Protein coding genes | 8639 |
| Gene density (genes/kb) | 1/3642 |
| Average gene length (bp) | 1854 |
| Average number of introns per gene | 1.14 |
| Median intron size (bp) | 67 |
| Median exon size (bp) | 395 |
Figure 1Overview of genome annotation statistics. a) the distribution of the lengths of the genes of the O. ulmi with G. clavigera and N. crassa. b) the number of introns in each of the predicted genes in O. ulmi. Most of the predicted genes have either zero or one introns and a few of them have more than 5 introns. c) distribution of the length of the introns of the gene models. The majority of the introns are 60 bp to 200 bp and 3% of them are between 200 bp and 300 bp. Almost 8% are longer than 300 bp but are not shown in the figure for the purpose of clarity.
Figure 2Phylogenic tree. Phylogenetic tree between O. ulmi and three other species G. clavigera, N. crassa and S. cerevisiae. Branch length represents the phylogenetic distances in substitution per site.
Figure 3Distribution of coverage for the coding regions of the gene models. The coverage for the coding regions of the predicted genes is shown. Almost 80% have the coverage greater than 1.
Figure 4Examples of mRNA-seq coverage. The number of mRNA-seq reads mapped is plotted as a function of genomic coordinate. Eight predicted genes are displayed by red arrows, (o753 to o760). Yellow sections inside the red arrows refer to the intron parts of the gene models. A zoomed version of the area in which the intron appears is given in the box below the figure.
Figure 5Summary of genome annotation validation. The biggest circle shows the total number of gene models predicted and every other circle represents a subset of gene models that are supported by any of the three types of evidences. The gene models that have evidence from EST data from O. novo-ulmi are referred to by EST data in the figure, those having ortholog genes from at least one of the mentioned three species referred to by Orthologs and those with evidence from mRNA-seq data from O. ulmi referred to by mRNA-seq.
Comparison of the number of PHI-base pathogen genes found in the three species
| 610 | 3 | |
| 598 | 2 | |
| 611 | 7 |
PHI-base pathogen genes found in not in and
| PHI:48|CnLAC1|BAD91825|TX:5207|Cryptococcus neoformans|Reduced virulence | A laccase enzyme which catalyzes the synthesis of melanin in the presence of phenolic compounds [ |
| PHI:876|MGG_11671|EDK03349|TX:148305|Magnaporthe grisea|Reduced virulence | hypothetical protein similar to reverse transcriptase |
| PHI:1048|CTB7|ABK64184|TX:29003|Cercospora nicotianae|Reduced virulence | Encodes putative FAD/FMN- or NADPH-dependent oxidoreductases in the cercosporin toxin biosynthetic pathway of C. nicotianae [ |
Figure 6Metabolic network of . Node size corresponds to RNA-seq expression coverage, with background nodes (borderless) included for context of KEGG pathways. Groups of enzymes dotted-circled in black appear to be conserved in the fungus while enzymes dotted-circled in red appear to be absent in the fungus.