| Literature DB >> 24586171 |
Raul A Cernadas1, Erin L Doyle2, David O Niño-Liu3, Katherine E Wilkins4, Timothy Bancroft5, Li Wang1, Clarice L Schmidt3, Rico Caldo3, Bing Yang6, Frank F White7, Dan Nettleton5, Roger P Wise8, Adam J Bogdanove1.
Abstract
Bacterial leaf streak of rice, caused by Xanthomonas oryzae pv. oryzicola (Xoc) is an increasingly important yield constraint in this staple crop. A mesophyll colonizer, Xoc differs from X. oryzae pv. oryzae (Xoo), which invades xylem to cause bacterial blight of rice. Both produce multiple distinct TAL effectors, type III-delivered proteins that transactivate effector-specific host genes. A TAL effector finds its target(s) via a partially degenerate code whereby the modular effector amino acid sequence identifies nucleotide sequences to which the protein binds. Virulence contributions of some Xoo TAL effectors have been shown, and their relevant targets, susceptibility (S) genes, identified, but the role of TAL effectors in leaf streak is uncharacterized. We used host transcript profiling to compare leaf streak to blight and to probe functions of Xoc TAL effectors. We found that Xoc and Xoo induce almost completely different host transcriptional changes. Roughly one in three genes upregulated by the pathogens is preceded by a candidate TAL effector binding element. Experimental analysis of the 44 such genes predicted to be Xoc TAL effector targets verified nearly half, and identified most others as false predictions. None of the Xoc targets is a known bacterial blight S gene. Mutational analysis revealed that Tal2g, which activates two genes, contributes to lesion expansion and bacterial exudation. Use of designer TAL effectors discriminated a sulfate transporter gene as the S gene. Across all targets, basal expression tended to be higher than genome-average, and induction moderate. Finally, machine learning applied to real vs. falsely predicted targets yielded a classifier that recalled 92% of the real targets with 88% precision, providing a tool for better target prediction in the future. Our study expands the number of known TAL effector targets, identifies a new class of S gene, and improves our ability to predict functional targeting.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24586171 PMCID: PMC3937315 DOI: 10.1371/journal.ppat.1003972
Source DB: PubMed Journal: PLoS Pathog ISSN: 1553-7366 Impact factor: 6.823
Figure 1Rice transcriptional responses to Xanthomonas oryzae pv. oryzicola BLS256 (Xoc) or X. oryzae pv. oryzae PXO99A (Xoo).
Distribution of genes differentially expressed over a 96(see Materials and Methods) in response to either strain relative to a mock inoculation is shown. Each circle of the Venn diagram represents a different pairwise comparison of treatments, as indicated in non-bold text. Results are based on mixed linear model analysis using four biological replicates for each time point of the study and an estimated false discovery rate of 0.3. The intersections represent the genes differentially expressed uniquely in response to the different treatments, indicated in bold text. Note that differentially expressed uniquely in response to mock means differentially expressed similarly in Xoc and Xoo relative to mock, and differentially expressed uniquely in response to all three treatments means differentially expressed both in Xoc and Xoo relative to mock, but also differentially between Xoc and Xoo. Also, since differential expression in a given pairwise comparison is determined using a statistical cutoff, transitive predictions, i.e., A = B and B = C, therefore A = C, may not hold.
Figure 2Expression patterns of the most significantly differentially expressed rice genes.
Normalized least square means of signal intensities (y-axis) at 2, 4, 8, 24, and 96 h after inoculation (x-axis) with X. oryzae pv. oryzicola BLS256 (Xoc), X. oryzae pv. oryzae strain PXO99A (Xoo) or mock control are plotted for the genes most significantly differentially expressed relative to mock uniquely in response to Xoc (Xoc only), uniquely in response to Xoo (Xoo only), similarly in response to Xoc and Xoo (Xoc and Xoo similarly), and differently in response to Xoc and Xoo (Xoc and Xoo differently). Where two probe sets correspond to the same gene, the one with the lower q-value was selected for display. Locus IDs are given at right, omitting the prefix “LOC_Os”. Results were derived from a mixed linear model analysis with four replicates. Vertical bars represent standard error. Asterisks mark previously identified targets of Xoo TAL effectors, TFIIaγ1(Os01g73890) and OsSWEET11 (Os08g42350), activated by PthXo7 and PthXo1, respectively. Daggers flag Xoc TAL effector targets discovered in this study.
Figure 3Type III secretion system dependence of the most significant rice gene expression changes.
RT-PCR results reflecting transcript abundance are shown for rice genes identified by GeneChip expression analysis as the ten (or fewer) most significantly differentially expressed in response to (A) X. oryzae pv. oryzicola BLS256 (Xoc), (B) X. oryzae pv. oryzae strain PXO99A (Xoo), (C) Xoc and Xoo similarly, or (D) Xoc and Xoo to different extents. Leaf samples were harvested at 36 hours after inoculation with wild-type strains or with the type III secretion (T3S−) deficient derivatives. RT-PCR results for previously reported Xoo-induced genes, OsSWEET11 and TFIIAγ1 [9], [10], are omitted. An actin gene (Os04g57210) that is not differentially expressed was used as a reference for relative transcript abundance across samples. The experiment was repeated twice and yielded the same results.
Predicted X. oryzae pv. oryzicola BL256 TAL effector targets in rice (cv. Nipponbare) induced during infection and results of verification experiments.a
| TAL effector | Target Locus ID | Probe set ID(s) | Fold change 2–96 h Xoc | Fold Change Mock-Xoc 96 h |
| EBE Score | EBE rel. score | EBE rank | EBE to TLS | EBE to TXS | EBE to TATA box | EBE to Y patch | Induced by | Description | |
|
| Xag expressing the | ||||||||||||||
| Tal4a | 01g27210 | Os.7911.1.S1_at | 1.63 | 1.66 | 1.8E-01 | 29.22 | 2.85 | 341 | 253 | 143 | −50 | none | + | − | Glutathione S-transferase, putative, expressed |
| Tal6 | 01g31220 | Os.6438.1.S1_a_at | 1.48 | 1.53 | 8.0E-02 | 18.75 | 2.38 | 685 | 157 | 152 | none | 33 | − | + | Expressed protein |
| Os.6438.2.S1_x_at | 1.48 | 1.52 | 1.1E-02 | ||||||||||||
| Tal2d | 01g51040 | Os.53457.1.S1_at | 2.30 | 2.23 | 1.9E-01 | 14.32 | 2.19 | 324 | 527 | none | −328 | 0 | + | − | Transmembrane protein 16K, putative, expressed |
| Tal9b | 01g51040 | Os.53457.1.S1_at | 2.30 | 2.23 | 1.9E-01 | 14.07 | 2.81 | 275 | 18 | 0 | −299 | none | − | + | Transmembrane protein 16K, putative, expressed |
| Tal2g | 01g52130 | Os.41841.1.S1_at | 13.00 | 9.59 | 1.3E-06 | 13.94 | 1.97 | 77 | 427 | 58 | 28 | none | − | + | Sulfate transporter, putative, expressed |
| Tal3b | 01g53220 | Os.35681.1.S1_at | 3.50 | 4.12 | 2.2E-06 | 17.72 | 2.92 | 611 | 146 | −5 | −137 | none | nd | nd | HSF-type DNA-binding domain containing protein, expressed |
| Tal6 | 02g14770 | Os.2450.1.S1_a_at | 1.88 | 1.77 | 1.3E-02 | 18.48 | 2.35 | 569 | 92 | 48 | −70 | −37 | + | − | Phosphoenolpyruvate carboxylase, putative, expressed |
| Os.2450.3.S1_x_at | 1.85 | 1.55 | 6.8E-02 | ||||||||||||
| Tal11a | 02g15290 | Os.56119.1.S1_at | 1.72 | 4.93 | 4.1E-07 | 20.15 | 3.18 | 582 | 422 | none | −288 | none | + | − | VQ domain containing protein, putative, expressed |
| Tal5a | 02g15290 | Os.56119.1.S1_at | 1.72 | 4.93 | 4.1E-07 | 23.32 | 1.88 | 107 | 148 | 30 | −3 | −180 | − | + | VQ domain containing protein, putative, expressed |
| Tal7 | 02g15710 | OsAffx.2629.1.S1_at | 5.70 | 5.52 | 8.4E-02 | 15.75 | 1.94 | 265 | 951 | none | 150 | 434 | nd | nd | Plastocyanin-like domain containing protein, putative, expressed |
| Tal3b | 02g34970 | Os.47735.1.S1_at | 9.07 | 5.31 | 4.0E-07 | 15.33 | 2.53 | 75 | 110 | 29 | −282 | none | − | + | No apical meristem protein, putative, expressed |
| Tal2a | 02g43760 | Os.1349.1.S1_at | 1.25 | 1.45 | 1.7E-03 | 15.87 | 1.75 | 21 | 521 | none | −334 | −5 | nd | + | Ubiquitin carboxyl-terminal hydrolase, family 1, putative, expressed |
| OsAffx.2950.1.S1_s_at | 1.23 | 1.32 | 5.3E-03 | ||||||||||||
| Tal7 | 02g43760 | Os.1349.1.S1_at | 1.25 | 1.45 | 1.7E-03 | 16.45 | 2.03 | 547 | 628 | 341 | 17 | 117 | + | − | Ubiquitin carboxyl-terminal hydrolase, family 1, putative, expressed |
| OsAffx.2950.1.S1_s_at | 1.23 | 1.32 | 5.3E-03 | ||||||||||||
| Tal3c | 02g47660 | Os.7751.1.S1_at | 2.25 | 2.24 | 1.9E-03 | 10.93 | 1.92 | 53 | 140 | −63 | −98 | none | − | + | Basic helix-loop-helix, putative, expressed |
| Tal4c | 02g47660 | Os.7751.1.S1_at | 2.25 | 2.24 | 1.9E-03 | 24.73 | 3.01 | 434 | 367 | 310 | −25 | none | + | − | Basic helix-loop-helix, putative, expressed |
| Tal2c | 03g03034 | Os.10510.1.S1_at | 1.49 | 3.11 | 1.1E-02 | 19.55 | 1.83 | 0 | 142 | 114 | −779 | 6 | − | + | Flavonol synthase/flavanone 3-hydroxylase, putative, expressed |
| Os.53217.1.S1_x_at | 1.26 | 2.83 | 6.8E-02 | ||||||||||||
| Tal3b | 03g03034 | Os.10510.1.S1_at | 1.49 | 3.11 | 1.1E-02 | 16.73 | 2.76 | 258 | 759 | 567 | none | none | + | − | Flavonol synthase/flavanone 3-hydroxylase, putative, expressed |
| Os.53217.1.S1_x_at | 1.26 | 2.83 | 6.8E-02 | ||||||||||||
| Tal11a | 03g05370 | OsAffx.24978.1.S1_at | 10.98 | 13.02 | 5.2E-04 | 17.39 | 2.74 | 307 | 798 | 331 | 526 | none | + | − | Expressed protein |
| Tal3c | 03g07540 | OsAffx.3165.1.S1_at | 6.33 | 3.84 | 3.6E-02 | 12.33 | 2.17 | 350 | 248 | 99 | −625 | none | − | + | bHLH family protein, putative, expressed |
| Tal7 | 03g25490 | Os.34992.2.S1_at | 2.10 | 2.02 | 3.8E-05 | 16.32 | 2.01 | 494 | 199 | 30 | −363 | 9 | + | − | Cytochrome P450 72A1, putative, expressed |
| Tal4a | 03g37840 | Os.20541.1.S1_at | 2.24 | 1.96 | 2.2E-04 | 15.58 | 1.52 | 0 | 362 | 151 | −3 | none | − | + | Potassium transporter, putative, expressed |
| Tal2d | 04g49194 | Os.17316.1.S1_at | 22.42 | 10.49 | 3.9E-07 | 8.22 | 1.26 | 0 | 101 | 26 | −715 | none | − | + | Naringenin,2-oxoglutarate 3-dioxygenase, putative, expressed |
| Tal3a | 05g12450 | OsAffx.26856.1.S1_at | 1.81 | 1.34 | 2.3E-01 | 16.07 | 2.01 | 294 | 446 | 315 | none | none | + | − | Hydroquinone glucosyltransferase, putative, expressed |
| Tal3b | 05g27590 | Os.57186.1.S1_at | 2.40 | 4.42 | 3.4E-08 | 11.40 | 1.88 | 2 | 103 | 33 | −1 | none | − | + | Wound-induced protein WI12, putative, expressed |
| Tal11b | 06g14750 | OsAffx.15432.1.S1_at | 1.29 | 1.30 | 2.0E-01 | 12.44 | 2.77 | 129 | 313 | 195 | 160 | −15 | + | − | Phosphatidylinositol-4-phosphate 5-Kinase family protein, putative, expressed |
| Tal1c | 06g14750 | OsAffx.15432.1.S1_at | 1.29 | 1.30 | 2.0E-01 | 12.00 | 2.44 | 256 | 178 | 47 | none | none | + | − | Phosphatidylinositol-4-phosphate 5-Kinase family protein, putative, expressed |
| Tal2a | 06g14750 | OsAffx.15432.1.S1_at | 1.29 | 1.30 | 2.0E-01 | 17.59 | 1.94 | 88 | 79 | −22 | −618 | −33 | nd | − | Phosphatidylinositol-4-phosphate 5-Kinase family protein, putative, expressed |
| Tal4c | 06g37080 | Os.16282.1.A1_at | 5.54 | 7.15 | 2.7E-10 | 14.64 | 1.78 | 0 | 150 | 39 | −1 | none | − | + | L-ascorbate oxidase precursor, putative, expressed |
| OsAffx.15788.1.S1_at | 11.84 | 9.94 | 6.3E-09 | ||||||||||||
| Tal8 | 06g37080 | Os.16282.1.A1_at | 5.54 | 7.15 | 2.7E-10 | 19.92 | 2.32 | 605 | 661 | 560 | −36 | 548 | + | − | L-ascorbate oxidase precursor, putative, expressed |
| OsAffx.15788.1.S1_at | 11.84 | 9.94 | 6.3E-09 | ||||||||||||
| Tal2g | 06g46500 | Os.49496.1.S1_at | 6.40 | 6.88 | 4.3E-08 | 14.27 | 2.01 | 117 | 89 | 59 | −489 | −47 | − | + | Monocopper oxidase, putative, expressed |
| Tal11a | 06g47950 | OsAffx.15977.1.S1_s_at | 1.78 | 1.67 | 2.8E-02 | 16.20 | 2.55 | 19 | 527 | none | −328 | 0 | nd | nd | Tetratricopeptide-like helical, putative, expressed |
| Tal1c | 07g06970 | Os.49794.1.S1_at | 2.95 | 2.27 | 1.3E-02 | 5.97 | 1.22 | 0 | 216 | 24 | none | none | − | + | HEN1, putative, expressed |
| Tal3a | 07g06970 | Os.49794.1.S1_at | 2.95 | 2.27 | 1.3E-02 | 16.21 | 2.03 | 354 | 930 | 815 | 444 | none | + | − | HEN1, putative, expressed |
| Tal4c | 07g29750 | Os.46631.1.S1_x_at | 6.69 | 5.05 | 3.4E-07 | 25.09 | 3.05 | 557 | 233 | 30 | −5 | none | + | − | Glycosyl hydrolases family 16, putative, expressed |
| Tal4b | 07g34510 | Os.51294.1.S1_at | 0.95 | 1.00 | 2.8E-01 | 8.88 | 1.63 | 33 | 302 | 151 | −425 | none | nd | nd | Retrotransposon protein, putative, unclassified, expressed |
| Tal3b | 07g36430 | Os.31021.1.S1_at | 2.53 | 2.40 | 2.6E-02 | 15.78 | 2.6 | 108 | 117 | 31 | −4 | none | − | + | Expressed protein |
| Tal6 | 07g47790 | Os.8920.1.S1_at | 4.16 | 8.41 | 3.6E-02 | 13.38 | 1.7 | 8 | 798 | 610 | −192 | 694 | + | − | AP2 domain containing protein, expressed |
| Tal4a | 09g20220 | Os.4759.1.S1_at | 2.17 | 2.38 | 4.9E-02 | 28.93 | 2.83 | 280 | 170 | 139 | −751 | 34 | + | − | Glutathione S-transferase, putative, expressed |
| Tal2d | 09g23560 | Os.5983.1.S1_at | 2.19 | 5.02 | 2.8E-01 | 14.19 | 2.17 | 288 | 525 | none | none | 93 | nd | nd | Dehydrogenase, putative, expressed |
| Tal6 | 09g29100 | Os.18607.1.S1_at | 1.64 | 1.97 | 3.6E-02 | 17.00 | 2.13 | 167 | 0 | 0 | 0 | 0 | − | + | Cyclin, putative, expressed |
| Tal4b | 09g32100 | Os.16365.1.S1_at | 3.34 | 2.45 | 8.0E-03 | 8.15 | 1.5 | 16 | 270 | 84 | 21 | none | − | + | Expressed protein |
| Tal9a | 11g01480 | Os.18448.1.S1_s_at | 5.42 | 3.94 | 8.2E-06 | 19.71 | 2.56 | 206 | 776 | 621 | 365 | none | + | − | MYB family transcription factor, putative, expressed |
| OsAffx.30765.1.S1_at | 5.74 | 4.10 | 5.4E-06 | ||||||||||||
| Tal9a | 12g01490 | Os.18448.1.S1_at | 5.21 | 3.92 | 2.6E-05 | 19.71 | 2.56 | 205 | 302 | 191 | 151 | none | + | − | MYB family transcription factor, putative, expressed |
| Tal6 | 12g42970 | Os.11382.1.S1_at | 2.31 | 1.65 | 2.2E-04 | 16.84 | 2.14 | 139 | 132 | 30 | −565 | 12 | − | + | GATA zinc finger domain containing protein, expressed |
| Tal6 | 12g42970 | Os.11382.1.S1_at | 2.31 | 1.65 | 2.2E-04 | 18.27 | 2.32 | 411 | 107 | 5 | −590 | −13 | − | + | GATA zinc finger domain containing protein, expressed |
Expression values are from the GeneChip expression experiment; see Materials and Methods.
Prefix “LOC_Os” is omitted.
Fold change in transcript abundance in leaves at 96 h relative to 2 h after inoculation with X. oryzae pv. oryzicola BLS256 (Xoc).
Fold change in transcript abundance at 96 h in Xoc-inoculated leaves relative to mock-inoculated leaves.
Calculated for the comparison of transcript abundance in Xoc vs. mock inoculated leaves across all time points.
Score is according to Doyle et al. [32] except that new RVDs ‘SN’ and ‘YG’, present in Tal2g were assigned nucleotide association frequencies of ‘NN’ and ‘NG’, respectively (see text).
EBE relative score, ratio of the observed EBE score to the best possible score for the TAL effector [32].
EBE rank among the single best scoring sites for the TAL effector in each rice promoter [32].
Distance in bases from the 5′ end of the EBE to the translational start site (TLS) of the target locus; a positive value indicates a location downstream of the EBE.
Distance in bases from the 5′ end of the EBE to the transcriptional start site (TXS) based on cDNA evidence in the Rice Genome Annotation Project Release 7 (http://rice.plantbiology.msu.edu/); a positive value indicates a location downstream of the EBE; none, cDNA evidence of TXS missing.
Distance in bases from the 5′ end of the EBE to the nearest identified putative TATA box; a positive value indicates a location downstream of the EBE; none, putative TATA box not present.
Distance in bases from the 5′ end of the EBE to the nearest identified putative Y patch; a positive value indicates a location downstream of the EBE; none, putative Y patch not present.
Results of RT-PCR 48 h after inoculation, relative to a negative control inoculation (see Supplemental Figure S1); Xoc, X. oryzae pv. oryzicola BLS256; Xag, X. axonopodis pv glycines EB08; +, induced; −, not induced; nd, transcript not detected by RT-PCR (in each case, amplification by standard PCR from genomic DNA as template was confirmed).
Figure 4Virulence of X. oryzae pv. oryzicola BLS256 tal gene knockout strains.
(A) Suicide plasmid pSM7 (Table S8) used for tal gene knockouts by homologous recombination in BLS256. pSM7 harbors a 4.5-kb PstI fragment containing all but the first 80 bp of the ORF of tal gene aB4.5
[12] with an insertion of the EZ-Tn5
Figure 5Virulence contribution of X. oryzae pv. oryzicola BLS256 TAL effector Tal2g.
(A) Lengths of lesions caused by X. oryzae pv. oryzicola BLS256 (WT), the tal2g knockout derivative M27 carrying an empty plasmid vector (ev), and M27 carrying the vector with the cloned tal2g gene, measured as in Figure 4, but at 10 days after infiltration. The asterisk indicates a significant difference relative to WT (p<0.01). Error bars represent standard deviation (N≥10). (B) Total and surface (exudate) bacterial populations of leaves seven days after inoculation with the strains in panel A. The asterisk indicates a significant difference relative to WT (p<0.01). Error bars represent standard deviation (N≥6). Experiments were repeated three times with consistent results.
Figure 6Determination of Os01g52130 as the relevant target of Tal2g using designer TAL effectors.
(A) DNA sequence of the promoter regions of Tal2g induced genes Os06g46500 and Os01g52130 in rice cv. Nipponbare. The effector binding elements (EBEs) for Tal2g are in bold. The EBEs for designer TAL effectors dT434 targeting Os06g46500 and dT436 and dT437 targeting Os01g52130 are underlined and labeled above. Periods indicate transcriptional start sites and italics indicate translational start sites, per the Rice Genome Annotation Project (Release 7, http://rice.plantbiology.msu.edu). (B) Activation of Os06g46500 and Os1g52130 by Tal2g, and specific activation respectively of Os06g46500 and Os01g52130 by dT434, and dT436 or dT437. Shown are the results of RT-PCR amplification from leaf RNA isolated 48 h after inoculation by infiltration with X. oryzae pv. oryzicola BLS256 (WT), the tal2g knockout derivative M27 carrying an empty plasmid vector (ev), M27 carrying the vector with the cloned tal2g gene, or M27 carrying the vector with coding sequences for dT436, dT436, or dT437 as indicated. The actin gene Os04g57210 was used as a reference for relative transcript abundance across samples. (C) Rescue of the virulence defect of M27 by dT436 or dT437 but not dT434 in the lesion length assay. Lesion lengths were measured as in Figure 4, 10 days after inoculation with the indicated strains. Values labeled with the same letter are not significantly different and those labeled with different letters are (Student's t-test, p<0.01). Error bars represent standard deviation (N≥10). Experiments were repeated twice with consistent results. (D) A rice (cv. Nipponbare) leaf showing bacterial leaf streak symptoms two days after inoculation with a suspension of WT cells at an OD600 of 0.5 (approximately 1×108 CFU/ml) by infiltration using a needleless syringe over a 4 mm diameter leaf area, and labeled to indicate the site of inoculation, at which surface bacterial populations were quantified, and the three 12 mm long leaf sections in which total bacterial populations were quantified, as presented in panel E. (E) Restoration of the surface population and the total population spread of M27 to wild-type levels by dTAL437 but not dTAL434. Populations were quantified at 2, 5, 8 and 11 days after inoculation. Results are the means and standard deviations of samples from three leaves; nd, not detected. At each time point (not across time points), values labeled with the same letter are not significantly different, and those labeled with different letters are (Student's t-test p<0.0001).
Figure 7Functional characterization of selected rice promoter sequences similar to the verified Tal2g EBEs.
(A) Alignment of selected rice promoter sequences (from loci Os06g13880, Os12g36920, and Os05g10650; see text) and EBEs from the verified Tal2g targets Os01g52130 (OsSULTR3;6) and Os06g46500 with the corresponding sequence of repeat variable diresidues (RVD) of Tal2g. Position (Pos) is that of the 5′ end relative to the annotated transcriptional start site. Rare RVDs ‘YG’ and ‘SN’ of Tal2g are in bold. Scores were calculated according to [32], either substituting the nucleotide association frequencies of common RVDs ‘NN’ and ‘NG’ for the new RVDs ‘SN’ and ‘YG’ (“Sub Scores”) or using the default wild card specificity values for the new RVDs (“Def Scores”). An asterisk indicates that the score is outside the cutoff to be considered a candidate EBE for Tal2g, calculated independently for each scoring method. Nucleotide mismatches to the new RVDs using the substituted specificities are underlined, as is a (5′) mismatch in the 06g13880 sequence to the first RVD (‘NN’) of Tal2g. Whether a gene is induced (Ind) upon infection by Xanthomonas oryzae pv. oryzicola BLS256 is indicated by a plus or minus sign at right. (B) Activity of the selected sequences in an Agrobacterium-mediated transient transformation based reporter assay in Nicotiana benthamiana leaves [40]. In this assay, a TAL effector gene (none, tal2g, or avrBs3) driven by the 35S promoter is introduced together with the GUS gene under the control of a minimal promoter from the pepper Bs3 gene, with the test sequence inserted slightly upstream of the native EBE for AvrBs3 (AvrBs3 is the TAL effector from the pepper pathogen X. euvesicatoria that activates Bs3 upon infection). The inserted sequences are indicated by locus ID on the X axis; “(–)” indicates the minimal Bs3 promoter with only the AvrBs3 EBE and no added sequence. Error bars represent standard deviation (N = 3). Experiments were repeated twice with consistent results. (C) Activity and specificity of the EBEs from the two verified targets of Tal2g, as in panel B.
Figure 8Expression levels of probesets associated with X. oryzae pv. oryzicola BLS256 (Xoc) TAL effector targets relative to other probesets.
Individual box plots show average normalized expression values over time for probesets associated with verified (real) Xoc TAL effector targets, probesets associated with genes predicted but shown not to be targeted by an Xoc TAL effector (falsely predicted targets), all probesets differentially expressed (DE) in the mock vs. Xoc comparison at q≤0.3, or all probesets on the chip. The top row of plots shows data from mock-inoculated plants and the bottom row data from plants inoculated with Xoc. For each plot, the central bar indicates the median value and the top and bottom of the box indicate the 75th percentile and the 25th percentile, respectively. Whiskers indicate the most extreme data points above and below the median that are not outliers, calculated as ≤1.5*(75th percentile – 25th percentile) above the 75th percentile or below the 25th percentile. Outliers are plotted individually. Boxplots were made using the ‘boxplot()’ function of the statistical software package R (www.r-project.org).
Performance of a Naive Bayes classifier trained on all EBE features and of a logistic regression classifier trained on distance to transcriptional start site (TXS) using leave-one-out cross validation.a
| Features | Accuracy | Precision | Recall | F measure | MCC | AUC |
| All | .89 | .88 | .92 | .90 | .77 | .88 |
| Distance to TXS | .87 | .88 | .88 | .88 | .73 | .87 |
See text for features included. Accuracy, precision, and recall are at the maximum F measure obtained by varying the discrimination threshold. Using TP, TN, FP, FN to represent numbers of true positives, true negatives, false positives, and false negatives, respectively, accuracy is , precision is , recall is , F measure is , MCC (Matthews correlation coefficient) is , and AUC is the area under the receiver operating characteristic curve, a curve created by plotting TP vs. FP as the discrimination threshold is varied.