| Literature DB >> 18452608 |
Steven L Salzberg1, Daniel D Sommer, Michael C Schatz, Adam M Phillippy, Pablo D Rabinowicz, Seiji Tsuge, Ayako Furutani, Hirokazu Ochiai, Arthur L Delcher, David Kelley, Ramana Madupu, Daniela Puiu, Diana Radune, Martin Shumway, Cole Trapnell, Gudlur Aparna, Gopaljee Jha, Alok Pandey, Prabhu B Patil, Hiromichi Ishihara, Damien F Meyer, Boris Szurek, Valerie Verdier, Ralf Koebnik, J Maxwell Dow, Robert P Ryan, Hisae Hirata, Shinji Tsuyumu, Sang Won Lee, Young-Su Seo, Malinee Sriariyanum, Pamela C Ronald, Ramesh V Sonti, Marie-Anne Van Sluys, Jan E Leach, Frank F White, Adam J Bogdanove.
Abstract
BACKGROUND: Xanthomonas oryzae pv. oryzae causes bacterial blight of rice (Oryza sativa L.), a major disease that constrains production of this staple crop in many parts of the world. We report here on the complete genome sequence of strain PXO99A and its comparison to two previously sequenced strains, KACC10331 and MAFF311018, which are highly similar to one another.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18452608 PMCID: PMC2432079 DOI: 10.1186/1471-2164-9-204
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Comparison of 3 Xanthomonas oryzae pv. oryzae genomes
| PXO99A | KACC | MAFF | |
| Length (bp) | 5,240,075 | 4,941,439 | 4,940,217 |
| GC content (%) | 63.6 | 63.7 | 63.7 |
| Annotated genes | 5,083 | 4,637 | 4,372 |
| IS elements (complete/fragment) | 267 (683) | 252 (714) | 251 (712) |
| TAL effector genes | 19 | 15 | 17 |
Figure 1Circular representation of the . Rings illustrate, from outside to inside: protein coding genes (forward strand), protein coding genes (reverse strand), TAL effectors (green) and IS elements (red), and GC-skew plot showing (G-C)/(G+C) in 10 kilobase windows. Positive values of GC-skew indicate the leading strand of replication, negative values the lagging strand.
Figure 2A 38.8 kb region including nonfimbrial adhesin genes that is unique to PXO99. A: organization of the region in the PXO99A genome. Block arrows represent genes; inverted triangles represent insertion sequence elements. The region is flanked by DSP (dual specificity protein) and DBP (DNA binding protein) encoding genes, which are also present in MAFF and KACC. B: the corresponding locus in MAFF and KACC, missing the entire block of genes. The point of insertion/deletion maps to an ISXo5 insertion sequence element between DSP and DBP.
Figure 5Inversions and rearrangements in PXO99. The alignment shows regions of PXO99A that align to the same (red) or opposite (blue) strand of MAFF. Transposase genes and their orientation (+ or -) are shown at the sites of each rearrangement. Letters A-J indicate specific rearrangement events. A: the IS element ISXoo3 is composed of two distinct and independently conserved ORFs and is responsible for an inversion spanning coordinates 267869–5114959 (all coordinates refer to the PXO99A genome). B: ISXo8 occurs in opposite orientation at each end of a 2.6 Mbp inversion spanning positions 1356757–3898472. C: ISXo1 occurs in inverted copies at the endpoints of a 1.8 Mbp inversion spanning 1558996–3391786. D: a 33270 bp inverted region spanning 4394742–4428012 is flanked by oppositely-oriented copies of ISXo8. E: Each copy of the 212-kb duplication is flanked by ISXo5, which also occurs adjacent to two other translocations in this region. The duplication appears as two parallel diagonal lines in this box. F: ISXo8 also occurs in inverted copies at the boundaries of a 47540 bp segment that is translocated from approximately 4800000 to 685272. G: ISXoo3 flanks both ends of a 47540 bp translocation from approximately 1117000 to 4339239. H: A 9,862 bp region occurs in inverted copies at 217,455 and 4,305,307. MAFF311018 contains only one copy of this region. I,J: Segments spanning 96,753 bp (I) and 17,021 bp (J) are inverted with respect to MAFF311018 but not associated with transposases.
TAL effector genes in PXO99A
| 03922 | 559109..562222 | - | 21.5 | ||
| 00227 | 1645240..1649043 | - | 23.5 | ||
| 00223 | 1650351..1653557 | - | 14.5 | ||
| 00511 | 1860212..1862083 | + | 17.5 | N-term deletion, truncated | |
| 00505 | 1864934..1866895 | + | 17.5 | N-term deletion, truncated | |
| 00318 | 2083533..2085968 | - | 15.5 | ||
| 00572 | 2354996..2358139 | - | 22.5 | ||
| 00567 | 2360008..2362440 | - | 15.5 | ||
| 00546 | 2384284..2387193 | + | 19.5 | ||
| 05609 | 2388988..2392041 | + | 20.5 | N-term frameshift | |
| 05633 | 2683629..2686343 | + | 17.5 | ||
| 01085 | 2688137..2691088 | + | 19.5 | ||
| 06229 | 2895716..2898430 | + | 17.5 | Duplicate of | |
| 06234 | 2900224..2903175 | + | 19.5 | Duplicate of | |
| 02172 | 4101543..4104803 | + | 19.5 | ||
| 05714 | 4106597..4110244 | + | 26.5 | ||
| 05718 | 4112038..4114644 | + | 16.5 | ||
| 02269 | 4116438..4118642 | + | 12.5 | ||
| 02272 | 4120436..4123759 | + | 23.5 |
1The rice gene induced by the effector is in italics.
Figure 3Relationship of TAL effector genes in Xoo strains PXO99. The individual genes, distributed among nine loci in PXO99A and eight in MAFF, are represented by open arrows and labeled as described in the text. Pseudogenes (truncated genes or genes with early stop codons) are indicated by an apostrophe. Genes that have identical repeat regions based on number of repeats and identity at the twelfth and thirteenth codons are connected with a black dashed line. Blue dashed lines connect genes with nearly identical repeat regions (see text). Names of previously characterized genes are centered above or below the corresponding open arrow. Colored boxes indicate TAL gene clusters (not to scale), with the same color representing loci at the same relative positions in the two genomes. Locus 4 in PXO99A and locus 3 in MAFF are uniquely positioned in their respective genomes. The solid black rectangle and arrows beneath it represent the 212 kb direct repeat in the PXO99A genome.
Figure 4Alignment of PXO99. Notes: 1 * indicates a proposed deletion of the thirteenth codon in the repeat; 2, novel variable codons; 3, truncation; 4, six-codon deletion; 5, N-terminal frameshift; 6, five-codon deletion in repeat.
Figure 6Alignment of CRISPR elements from the PXO99. Spacers are numbered from right (S0) to left, with the oldest elements on the right. Gaps (green boxes) indicate the positions of additional spacers in the genomes not shown here. Red lines indicate spacers shared in all three genomes, heavy black lines indicate spacers shared in just two species, and thin black lines indicate spacers that are similar but not identical between two species.
Figure 7Compositional analysis of the PXO99A genome. Analysis of genome composition in 1000 bp windows. The red plot shows a X2 analysis, in which the trinucleotide composition of each window is compared to the overall composition. The green plot shows GC content for the same windows.