| Literature DB >> 26484270 |
Malali Gowda1, Meghana D Shirke1, H B Mahesh2, Pinal Chandarana3, Anantharamanan Rajamani1, Bharat B Chattoo3.
Abstract
The Indian subcontinent is the center of origin and diversity for rice (Oryza sativa L.). The O. sativa ssp. indica is a major food crop grown in India, which occupies the first and second position in area and production, respectively. Blast disease caused by Magnaporthe oryzae is a major constraint to rice production. Here, we report the analysis of genome architecture and sequence variation of two field isolates, B157 and MG01, of the blast fungus from southern India. The 40 Mb genome of B157 and 43 Mb genome of MG01 contained 11,344 and 11,733 predicted genes, respectively. Genomic comparisons unveiled a large set of SNPs and several isolate specific genes in the Indian blast isolates. Avr genes were analyzed in several sequenced Magnaporthe strains; this analysis revealed the presence of Avr-Pizt and Avr-Ace1 genes in all the sequenced isolates. Availability of whole genomes of field isolates from India will contribute to global efforts to understand genetic diversity of M. oryzae population and to track the emergence of virulent pathotypes.Entities:
Keywords: Genome comparison; Isolate specific genes; Magnaporthe; Next generation sequencing; Single nucleotide polymorphism
Year: 2015 PMID: 26484270 PMCID: PMC4583678 DOI: 10.1016/j.gdata.2015.06.018
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Genome assembly and annotation of M. oryzae strains, B157 and MG01.
| Isolate name | B157 | MG01 | B157 | MG01 |
| Illumina reads (millions) | 35.83 | 38.81 | 35.83 | 38.81 |
| Coverage (X) | 89 | 97 | 89 | 97 |
| No. of contigs | 6815 | 7722 | 5364 | 6046 |
| No. of scaffolds | 3534 | 7303 | 2508 | 3060 |
| Largest scaffold size (Kb) | 649 | 324 | 489 | 292 |
| N50 (Kb) | 91.16 | 55.03 | 92.4 | 54.6 |
| Assembly size (Mb) | 38 | 40 | 41 | 43 |
| % repeats | 3.01 | 3.23 | 10.4 | 10.39 |
| No. of predicted genes with Augustus | 11,340 | 11,744 | 12,535 | 13,135 |
| No. of predicted genes (> 200 bp) | – | – | 11,334 | 11,733 |
Fig. 1Major repeat elements in the genomes of M. oryzae strains B157, MG01 and 70-15.
Fig. 2Isolate specific genes from B157 and MG01 based on the reference (70-15) genome assembly.
Fig. 3Distribution of genes, SNPs and inter-chromosomal variations in Indian isolates (B157 and MG01) in comparison with 70-15 reference genome. The outermost circle shows the distribution of genes in MG01, and the second circle shows seven chromosomes of 70-15 with their sizes in 1 Kb intervals. The third circle shows the distribution of genes in B157, and the fourth and fifth circles show SNP distribution for MG01 and B157, respectively at 1 Kb intervals. The high SNP density regions are marked as peaks. The innermost circle shows possible translocation patterns in both B157 and MG01.
Host specificity factors and avirulence (Avr) genes in sequenced strains of M. oryzae.
| B157 | MG01 | 70-15 | Y34 | P131 | KJ201 | 4091-59-8 | |||
|---|---|---|---|---|---|---|---|---|---|
| PWL1 | − | + | − | − | − | − | − | − | − |
| PWL2 | − | + | + | + | + | + | − | + | − |
| PWL3 | − | − | + | + | + | + | + | + | + |
| PWL4 | − | − | + | + | + | + | + | + | + |
| Avr-CO39 | − | − | − | − | − | − | − | − | + |
| Avr-Piz-t | + | + | + | + | + | + | + | + | + |
| Avr-Pia | − | + | − | − | − | + | + | − | − |
| Avr-Pii | − | − | − | − | − | − | − | − | − |
| Avr-Pik | + | − | + | + | + | + | − | + | − |
| Avr-Pita | − | + | + | + | + | + | + | + | + |
| Avr ACE1 | + | + | + | + | + | + | + | + | + |
Fig. 4Sense and antisense transcript expression support for annotated genes of MG01. A, the overall distribution of genes validated by strand-specific RNA-sequence data. A1, number of genes with no RNA-seq evidence; A2, number of genes with RNA-seq evidence. A2 is subdivided into sense (A2.1), antisense (A2.2) and both sense and antisense (A2.3). All expressed genes were classified based on their strand being expressed into three main classes as sense (B), antisense (C) or both (D) (sense and antisense). Each main class was subcategorized into three subclasses based on the FPKM value (Low, medium and high). The FPKM value was categorized as low (if FPKM ≤ 10), medium (if FPKM > 10 to ≤ 200) or high (> 200).
Strand specific RNA (ssRNA) seq analysis for MG01 strain.
| Gene feature | B157 | MG01 |
|---|---|---|
| Total no. of genes predicted (> 200 bp) | 11,334 | 11,733 |
| No. of genes having RNA-seq evidence | 8672 | 8338 |
| a) No. of genes having sense expression | – | 7404 |
| b) No. of genes having antisense expression | – | 338 (58 |
| c) No. of genes having sense and antisense expression | – | 595 (75 |
| No. of genes with no RNA-seq evidence | 2662 | 3395 |
Genes are confirmed by MPSS and RL-SAGE data [13].
Genes confirmed by MPSS data [13].
Enterotoxin-A domain encoding genes in pathogenic fungi including the sequenced M. oryzae strains.
| Pathogenic fungi | Host | No. of genes |
|---|---|---|
| Plant | 1 | |
| Plant | 1 | |
| Plant | 1 | |
| Plant | 1 | |
| Plant | 7 | |
| 5 | ||
| Animal | 7 | |
| Animal | 16 | |
| Animal | 4 |