| Literature DB >> 19578406 |
Li-Jun Ma1, Ashraf S Ibrahim, Christopher Skory, Manfred G Grabherr, Gertraud Burger, Margi Butler, Marek Elias, Alexander Idnurm, B Franz Lang, Teruo Sone, Ayumi Abe, Sarah E Calvo, Luis M Corrochano, Reinhard Engels, Jianmin Fu, Wilhelm Hansberg, Jung-Mi Kim, Chinnappa D Kodira, Michael J Koehrsen, Bo Liu, Diego Miranda-Saavedra, Sinead O'Leary, Lucila Ortiz-Castellanos, Russell Poulter, Julio Rodriguez-Romero, José Ruiz-Herrera, Yao-Qing Shen, Qiandong Zeng, James Galagan, Bruce W Birren, Christina A Cuomo, Brian L Wickes.
Abstract
Rhizopus oryzae is the primary cause of mucormycosis, an emerging, life-threatening infection characterized by rapid angioinvasive growth with an overall mortality rate that exceeds 50%. As a representative of the paraphyletic basal group of the fungal kingdom called "zygomycetes," R. oryzae is also used as a model to study fungal evolution. Here we report the genome sequence of R. oryzae strain 99-880, isolated from a fatal case of mucormycosis. The highly repetitive 45.3 Mb genome assembly contains abundant transposable elements (TEs), comprising approximately 20% of the genome. We predicted 13,895 protein-coding genes not overlapping TEs, many of which are paralogous gene pairs. The order and genomic arrangement of the duplicated gene pairs and their common phylogenetic origin provide evidence for an ancestral whole-genome duplication (WGD) event. The WGD resulted in the duplication of nearly all subunits of the protein complexes associated with respiratory electron transport chains, the V-ATPase, and the ubiquitin-proteasome systems. The WGD, together with recent gene duplications, resulted in the expansion of multiple gene families related to cell growth and signal transduction, as well as secreted aspartic protease and subtilase protein families, which are known fungal virulence factors. The duplication of the ergosterol biosynthetic pathway, especially the major azole target, lanosterol 14alpha-demethylase (ERG11), could contribute to the variable responses of R. oryzae to different azole drugs, including voriconazole and posaconazole. Expanded families of cell-wall synthesis enzymes, essential for fungal cell integrity but absent in mammalian hosts, reveal potential targets for novel and R. oryzae-specific diagnostic and therapeutic treatments.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19578406 PMCID: PMC2699053 DOI: 10.1371/journal.pgen.1000549
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Relationship of major phyla within the fungal kingdom.
Phylogeny is shown as a dendrogram using H. sapiens (Metazoa) as the out-group. B. dendrobatidis (phylum Chytridiomycota) is a unicellular organism with flagellated spores. The terrestrial multicellular fungi include the monophyletic Dikaryomycota (Ascomycota and Basidiomycota) and the more basal fungal lineages, including R. oryzae. In contrast to the Dikaryomycota fungi that form hyphae divided by septa (white arrows), the hyphae of R. oryzae are multinucleate but not divided into separate cells (coenocytic).
Rhizopus oryzae genome statistics.
|
| |
| Total contig length (Mb) | 45.26 |
| Total scaffold length (Mb) | 46.09 |
| Average base coverage (Fold) | 11 |
| N50 contig (kb) | 303.66 |
| N50 scaffold (Mb) | 3.1 |
| Linkage groups | 15 |
| GC-content (%) | 35.6 |
| Coding (%) | 40.6 |
| Non-coding (%) | 32.6 |
|
| |
| Percent coding (%) | 39.0 |
| Average gene size (bp) | 1212 |
| Average gene density (kb/gene) | 2.6 |
| Protein-coding genes | 17,467 |
| Exons | 57,981 |
| Average exon size (bases) | 310 |
| Exons/gene | 3.3 |
| tRNA genes | 239 |
|
| |
| Introns | 40,514 |
| Introns/gene | 2.32 |
| Average intron length (base) | 79 |
| Intergenic regions | 17,546 |
| Average intergenic distance (bp) | 1420 |
Figure 2R. oryzae genomic structure showing duplicated regions retained after WGD and distribution of LTR transposable elements.
The length of the light blue background for each linkage group is defined by the optical map. For each chromosome, row a represents the genomic scaffolds positioned on the optical linkage groups. The red oval indicates linkage to telomeric repeat arrays. Row b displays the 256 duplicated regions capturing 648 gene pairs and spanning 12% of the genome. The shaded backgrounds around some duplicated regions illustrate the duplicated blocks by merging duplicated regions that are within 200 kb after discounting the transposon sequences. These extended duplicated blocks contain the same amount of the duplicates but span 23% of the genome. A pair of corresponding duplicated regions between linkage 2 and linkage 9 are shown in the zoomed images. The numbers in the gene boxes are gene IDs. Row c corresponds to the distribution of the LTR retroelements.
Transposable elements (TEs) in the R. oryzae genome.
| Elements | Total bases | % of assembly | Sequence identity (%) | EST |
|
|
|
| ||
| LTR elements / Ty3 | 3,700,795 | 8.03 | 97% | Yes |
| LINES | 1,742,093 | 3.78 | 97% | Yes |
| DIRS | 146,622 | 0.32 | 97% | Yes |
|
|
|
| ||
| Mariners | 1,666,728 | 3.62 | 98% | Yes |
| En/Spn | 314,481 | 0.68 | 98% | No |
| Tigger | 262,307 | 0.57 | 94% | No |
| Crypton | 191,823 | 0.42 | 98% | No |
| Helitron | 66,534 | 0.14 | 99% | No |
|
|
|
|
The genomic distribution of the representative elements was identified using the sensitive mode of RepeatMasker version open-3.0.8, with cross_match version 0.990329.
Sequence identity was computed based on the average identity of the full-length copies of each representative against the consensus sequence of each group.
EST reads overlap with the identified TEs (see Table S6).
Figure 3Estimation of duplication dates using P. blakesleeanus as an outgroup.
(A) An unrooted tree diagram for the duplicated gene pairs in R. oryzae and their homologous gene in P. blakesleeanus. Midpoint rooting is used to calculate of the relative age of each duplication (R) in relation to the root. The branch lengths as substitutions per site for the unrooted tree topology were calculated using the WAG evolutionary model [49] employing a maximum likelihood-based package, PhyML [50]. The distance between two duplicated genes in R. oryzae is t 1+t 2, and the distances between the duplicates and their orthologous gene in P. blaskesleeanus are t+t 3+t 1 and t+t 3+t 2, respectively. (B) The distribution of the relative duplication time for each duplicated region in comparison to the root (R). R is normalized within each duplicated region by averaging the divergences of all the duplicated gene pairs within the region. If the divergence time between R. oryzae and P. blakesleeanus is defined as t using midpoint rooting, approximately 78% of all these regions were estimated to be duplicated within one standard deviation (0.115) of the mean (0.386t).
Duplication of protein complexes in the R. oryzae genome*.
| Complexes | Respiratory chain complexes | V-ATPase | Ubiquitin–proteasome system | ||||||||||||
| Subunits | I | II | III | IV | ATPase | Total | V1 | V0 | Total | Alpha | Beta | ATPase | LID | Modifier | Total |
| Reference genes | 28 | 4 | 9 | 9 | 10 | 60 | 7 | 5 | 12 | 7 | 7 | 6 | 13 | 3 | 36 |
|
| 20 | 3 | 8 | 8 | 8 | 47 | 5 | 3 | 8 | 6 | 6 | 5 | 10 | 2 | 29 |
| % duplicated genes | 71.4 | 75.0 | 88.9 | 88.9 | 80.0 | 78.3 | 71.4 | 60.0 | 66.7 | 85.7 | 85.7 | 83.3 | 76.9 | 66.7 | 80.6 |
*: Duplicated protein complexes in R. oryzae retained after WGD. The reference nuclear genes of protein complexes from Saccharomyces cerevisiae or Neurospora crassa were used to identify homologous sequences in the R. oryzae proteome. We searched for homologous genes using BLASTP (1e–5) and manually checked for short proteins that usually have higher e-values.
Gene family expansion in the R. oryzae genome.
| Species | Cell wall synthesis | Protein hydrolysis | Cell signaling | |||
| CHS | CDA | SAP | Subtilases | GTPases | GTPase regulators | |
|
|
|
|
|
|
|
|
|
| 9 | 9 | 6 | 4 | 81 | 76 |
|
| 7 | 5 | 17 | 8 | 84 | 79 |
|
| 8 | 11 | 8 | 7 | — | — |
|
| 7 | 2 | 7 | 4 | 82 | 76 |
|
| 8 | 1 | 14 | 2 | — | — |
|
| 8 | 4 | 7 | 2 | 78 | 77 |
|
| 9 | 16 | 2 | 3 | 86 | 83 |
|
| 8 | 8 | 6 | 1 | 80 | 77 |
Expanded gene families in R. oryzae compared to selected dikaryotic fungal genomes.
—, not tested.
*: based on the SGD (http://www.yeastgenome.org/) and CGD (http://www.candidagenome.org/) annotation.
Figure 4RT–PCR of R. oryzae chitin synthases (CHSs).
Presence of a transcript was detected from mycelia grown with four different growth phases: 1L, 1-day-old liquid culture; 1S, 1-day-old agar plate; 2S, 2-day-old agar plate; and 3S, 3-day-old agar plate. Gene pairs retained after WGD as detected in the duplicated regions are shown in blue.