| Literature DB >> 21909385 |
Dario Cantu1, Manjula Govindarajulu, Alex Kozik, Meinan Wang, Xianming Chen, Kenji K Kojima, Jerzy Jurka, Richard W Michelmore, Jorge Dubcovsky.
Abstract
BACKGROUND: The wheat stripe rust fungus (Puccinia striiformis f. sp. tritici, PST) is responsible for significant yield losses in wheat production worldwide. In spite of its economic importance, the PST genomic sequence is not currently available. Fortunately Next Generation Sequencing (NGS) has radically improved sequencing speed and efficiency with a great reduction in costs compared to traditional sequencing technologies. We used Illumina sequencing to rapidly access the genomic sequence of the highly virulent PST race 130 (PST-130). METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2011 PMID: 21909385 PMCID: PMC3164196 DOI: 10.1371/journal.pone.0024230
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of raw and trimmed reads and assemblies of PST-130 genomic DNA.
| Sequencing run | GenBank accession number | No. of cycles | No. ofpaired-endreads | Filtered reads | |||
| Total No. | Length (nt) | ||||||
| Total | Average | Median | |||||
|
| SRX022476 | 85 | 27,999,777 | 18,671,826 | 1,364,151,052 | 73.1 | 79 |
|
| SRX022476 | 1,469,928,965 | 78.7 | 85 | |||
|
| SRX022218 | 101 | 25,809,472 | 20,906,479 | 1,943,054,211 | 92.9 | 101 |
|
| SRX022218 | 1,930,999,343 | 92.4 | 101 | |||
|
| AEEW00000000 | 29,178 | 64,782,816 | 2,220 | 900 d | ||
1 and 2 indicate different Illumina sequencing runs, and A and B indicate paired-end reads.
The number of paired-end reads needs to be multiplied by 2 to obtain the total number of reads.
The following 22 contigs (84 kb) include PST-130 mitochondrial DNA: PST130_238, PST130_239, PST130_275, PST130_6617, PST130_6630, PST130_6699, PST130_6903. PST130_7099, PST130_7122, PST130_7145, PST130_7255, PST130_10273, PST130_10278, PST130_20206, PST130_20452, PST130_20455, PST130_20456, PST130_20466, PST130_20619, PST130_28223, PST130_28226, PST130_28239.
N50 = 5,137 nucleotides.
Figure 1Coverage distribution by contig (log10 scale).
Contigs are ordered by coverage (calculated using Bowtie [10]). Some repetitive regions are assembled into single contigs resulting in higher coverage and an underestimation of the genome size. The horizontal line indicates the median coverage (60.4x) and the dotted line the 2-fold coverage (120.8x) used as the threshold to calculate the proportion of the genome represented by contigs assembled from duplicated regions.
Figure 2Microsynteny between PST-130 and PGTG contigs.
Microsynteny between PGTG [AAWC01001263 (A), AAWC01003253 (B), AAWC01000559 (C) [5]] and PST-130 (PST130_8308, PST130_8617, PST130_7101) contigs. Contig alignment and similarity visualization was done using ACT-Artemis Comparison Tool (http://www.sanger.ac.uk/resources/software/act/). Red and blue lines indicate similar regions between PST-130 and PGTG contigs (% identity ≥60). Blue lines indicate inversions. Arrows correspond to putative genes predicted by geneid in PST-130 contigs and annotated genes in PGTG contigs [5].
Summary of repeat contents in the PST-130 contigs.
| Transposable element class | Contig coverage(% assembled length | Major Superfamilies(% within class) | |
| DNA transposons | 8.2 | ||
|
| (13.6) | ||
|
| (10.3) | ||
|
| (9.1) | ||
|
| (6.7) | ||
|
| (6.4) | ||
|
| (5.0) | ||
|
| (1.2) | ||
| LTR retrotransposons | 7.2 | ||
|
| (52.1) | ||
|
| (32.3) | ||
|
| (5.3) | ||
| LINEs | 1.0 | ||
|
| (18.6) | ||
| SINEs | 0.01 | ||
| Satellites and microsatellites | 0.1 | ||
| Unclassified repeats | 1.3 | ||
| Total | 17.8 | ||
Total length of assembled contigs = 64.8 Mb.
The remaining DNA transposons (48%) are represented by ∼60 families of non-autonomous elements which could not be assigned to any specific superfamilies.
Figure 3Genes identified in the PST-130 contigs that encode proteins involved in the ergosterol biosynthetic pathway.
In red the IDs of the PST-130 manually curated genes. ERG1: squalene epoxidase (HQ698552); ERG2: C-8 sterol isomerase (HQ698553); ERG3: C-5 sterol desaturase (JN033203); ERG4: C-24 sterol reductase (partial clone, not submitted to GenBank); ERG5: C-22 sterol desaturase (JN033204); ERG6: C-24 sterol methyl-transferase (JN033205); ERG7: lanosterol synthase (JN033206); ERG11: lanosterol 14α-demethylase (HQ698554); ERG24: C-14 sterol reductase (HQ698555); ERG25: C-4 sterol methyloxidase (JN033207); ERG26: C-4 sterol decarboxylase (JN033208); ERG27: C-3 sterol ketoreductase (JN033209).