| Literature DB >> 26416668 |
Laura H Okagaki1, Cristiano C Nunes1, Joshua Sailsbery1, Brent Clay2, Doug Brown2, Titus John2, Yeonyee Oh1, Nelson Young3, Michael Fitzgerald4, Brian J Haas4, Qiandong Zeng4, Sarah Young4, Xian Adiconis4, Lin Fan4, Joshua Z Levin4, Thomas K Mitchell5, Patricia A Okubara6, Mark L Farman7, Linda M Kohn8, Bruce Birren4, Li-Jun Ma9, Ralph A Dean10.
Abstract
Magnaporthaceae is a family of ascomycetes that includes three fungi of great economic importance: Magnaporthe oryzae, Gaeumannomyces graminis var. tritici, and Magnaporthe poae. These three fungi cause widespread disease and loss in cereal and grass crops, including rice blast disease (M. oryzae), take-all disease in wheat and other grasses (G. graminis), and summer patch disease in turf grasses (M. poae). Here, we present the finished genome sequence for M. oryzae and draft sequences for M. poae and G. graminis var. tritici. We used multiple technologies to sequence and annotate the genomes of M. oryzae, M. poae, and G. graminis var. tritici. The M. oryzae genome is now finished to seven chromosomes whereas M. poae and G. graminis var. tritici are sequenced to 40.0× and 25.0× coverage respectively. Gene models were developed by the use of multiple computational techniques and further supported by RNAseq data. In addition, we performed preliminary analysis of genome architecture and repetitive element DNA.Entities:
Keywords: Gaeumannomyces; Magnaporthe; repetitive DNA; sequence; synteny
Mesh:
Year: 2015 PMID: 26416668 PMCID: PMC4683626 DOI: 10.1534/g3.115.020057
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Sequencing project summary
| Organism Name | |||
|---|---|---|---|
| Strain/isolate name | 70-15 | ATCC 64411 | R3-111a-1 |
| Assembly name | MG8 | Mag_poae_ATCC_64411_V1 | Gae_graminis_V2 |
| Mitochondrial/plasmid assembly name(s) | MG7_MITO | Mag_poae_ATCC_64411_V1_Mito | Gae_graminis_V1_Mito |
| Sequencing platforms | Sanger | Sanger/454/ABI | Sanger/454/ABI |
| Sequencing coverage | Finished to seven chromosomes | 40.0× | 25.0× |
| Genbank accession | AACU03000000 | ADBL01000000 | ADBI00000000 |
| Gene numbering | MGG_##### | MAPG_##### | GGTG_##### |
| NCBI project ID | 13840 | 37933 | 37931 |
Sequencing was performed at the Broad Institute as part of the Fungal Genome Initiative. NCBI, National Center for Biotechnology Information.
RNAseq reads per treatment
| Biological Treatment | |||
|---|---|---|---|
| 2° | − | − | 38,264,704 |
| 4° | 41,670,516 | 55,296,984 | − |
| 40° | − | − | 52,913,434 |
| 42° | 47,383,418 | 51,115,288 | − |
| NaCl (500 mM) | 51,560,152 | 49,951,750 | 46,568,806 |
| Light | 43,071,966 | − | − |
| Dark | 46,564,826 | 54,886,478 | − |
| Melanized | − | 44,351,958 | − |
| 1× PDB | − | − | 30,717,700 |
| V8 medium | − | − | 31,607,460 |
| Complete Medium | − | 108,842,14 | − |
Magnaporthaceae species were grown in complete medium before being subjected to different conditions for 15−60 min (2°, 4°, 40°, 42°, NaCl), 3-5 d (light, dark, 1× PDB, V8 medium, complete medium, and melanizing) before RNA extraction. RNA libraries were subjected to paired-end deep sequencing using GAII Illumina technology. RNAseq read were assembled and aligned to their respective genomes using Bowtie, TopHat, and Inchworm software. PDB, potato dextrose broth.
Genome statistics
| Organism Name | |||
|---|---|---|---|
| Genome size, bp | 41,027,733 | 39,503,331 | 43,618,147 |
| Contig N50, bp | 823,590 | 16,565 | 48,943 |
| Scaffold N50, bp | 6,606,598 | 3,426,601 | 6,703,616 |
| Contig count | 219 | 3,106 | 1,808 |
| Scaffold count | 8 | 205 | 513 |
| Protein-coding genes | 12,696 | 12,113 | 14,255 |
| Coding regions in the genome, % | 61.34 | 63.59 | 63.38 |
| Gene length median, bp | 1,755 | 1,823 | 1,711 |
| mRNA length median, bp | 1,556 | 1,584 | 1,494 |
| CDS length median, bp | 1,083 | 987 | 1,041 |
| Exon length median, bp | 374 | 412 | 370 |
| Intron length median, bp | 88 | 87 | 88 |
| Exon per spliced transcript | 3.22 | 3.3 | 3.21 |
| 5-UTR length median, bp | 267 | 301 | 246 |
| 3-UTR length median, bp | 298 | 343 | 304 |
| Intergenic region length median, bp | 742 | 654 | 580 |
| Contig gap length median, bp | 100 | 537 | 481 |
| Genome G+C content, % | 51.61 | 56.99 | 56.85 |
| Genic region G+C content, % | 54.09 | 58.77 | 59.8 |
| Intergenic region G+C content, % | 47.66 | 52.57 | 50.77 |
| mRNA G+C content, % | 55 | 59.31 | 60.67 |
| CDS G+C content, % | 57.63 | 61.65 | 62.76 |
| Exon G+C content, % | 55 | 59.31 | 60.67 |
| Intron G+C content, % | 46.58 | 53.76 | 53.03 |
| 5-UTR G+C content, % | 48.8 | 56.29 | 56.97 |
| 3-UTR G+C content, % | 46.4 | 51.75 | 51.91 |
| Genes with 5-UTR, % | 8,120 | 8,017 | 8,590 |
| Genes with 3-UTR | 8,153 | 7,749 | 8,802 |
| Spliced genes | 10,341 | 9,388 | 11,176 |
| Average exons per transcript | 2.78 | 2.76 | 2.71 |
| Alternatively spliced genes | 796 | 877 | 885 |
| rRNA genes | 40 | 26 | 13 |
| tRNA genes | 325 | 167 | 273 |
Magnaporthaceae species were sequenced using Sanger sequencing (M. oryzae), or Sanger sequencing, ABI, and 454 Next-Generation sequencing (M. poae and G. graminis var. tritici). M. oryzae was finished to seven chromosomes, whereas M. poae and G. graminis var. tritici were sequenced to 40-fold and 25-fold coverage, respectively. mRNA, messenger RNA; CDS, coding sequence; UTR, untranslated region; rRNA, ribosomal RNA; tRNA, transfer RNA.
Repetitive elements in the Magnaporthaceae
| Total Length, bp | Repetitive Content, % | Genome, % | Total Length, bp | Repetitive Content, % | Genome, % | Total Length, bp | Repetitive Content, % | Genome, % | |
|---|---|---|---|---|---|---|---|---|---|
| Class I (retrotransposon) | |||||||||
| LTR/Gypsy | 1790041 | 43.08 | 4.36 | 142221 | 32.83 | 0.36 | 1317401 | 44.99 | 3.02 |
| LTR/Copia | 375743 | 9.04 | 0.92 | 529308 | 18.08 | 1.21 | |||
| Unknown | 212069 | 5.10 | 0.52 | ||||||
| Subtotal | 2377853 | 57.23 | 5.80 | 142221 | 32.83 | 0.36 | 1846709 | 63.07 | 4.23 |
| Non-LTR retrotransposons | |||||||||
| LINE/Tad1 | 690463 | 16.62 | 1.68 | 342293 | 11.69 | 0.78 | |||
| Unknown | 135550 | 3.26 | 0.33 | 176681 | 40.78 | 0.45 | 13605 | 0.46 | 0.03 |
| 826013 | 19.88 | 2.01 | 176681 | 40.78 | 0.45 | 355898 | 12.16 | 0.82 | |
| Class II (DNA tranpsosons) | |||||||||
| DNA/TcMar-Fot1 | 514336 | 12.38 | 1.25 | 19406 | 4.48 | 0.05 | 157841 | 5.39 | 0.36 |
| DNA/PIF-Harbinger | 18302 | 0.63 | 0.04 | ||||||
| DNA/hAT-Ac | 17124 | 0.58 | 0.04 | ||||||
| Unknown | 306017 | 7.36 | 0.75 | 7139 | 1.65 | 0.02 | |||
| Subtotal | 820353 | 19.74 | 2.00 | 26545 | 6.13 | 0.07 | 193267 | 6.60 | 0.44 |
| Other | |||||||||
| Unknown | 131047 | 3.15 | 0.32 | 87794 | 20.26 | 0.22 | 532034 | 18.17 | 1.22 |
| Total | 4155266 | 10.13 | 433241 | 1.10 | 2927908 | 6.71 | |||
Repetitive elements were collapsed into parent families and classified by RECON. BLASTx was used to confirm classification. Unclassified families were further analyzed for TIRs and LTRs using EINVERTED and POLYDOT, respectively. LTRs, Long terminal repeats; TIRs, terminal inverted repeats.
Figure 1Genome synteny. CoGe genome synteny analysis software was used to compare the genomes of M. oryzae and M. poae (top), M. oryzae and G. graminis var. tritici (middle), and G. graminis var. tritici and M. poae (bottom). Regions of synteny are plotted as green dots. Highly syntenic regions appear as linear segments when plotted.