| Literature DB >> 21952044 |
Agnieszka Zmieńko1, Magdalena Guzowska-Nowowiejska, Radosław Urbaniak, Wojciech Pląder, Piotr Formanowicz, Marek Figlerowicz.
Abstract
Plastids are small organelles equipped with their own genomes (plastomes). Although these organelles are involved in numerous plant metabolic pathways, current knowledge about the transcriptional activity of plastomes is limited. To solve this problem, we constructed a plastid tiling microarray (PlasTi-microarray) consisting of 1629 oligonucleotide probes. The oligonucleotides were designed based on the cucumber chloroplast genomic sequence and targeted both strands of the plastome in a non-contiguous arrangement. Up to 4 specific probes were designed for each gene/exon, and the intergenic regions were covered regularly, with 70-nt intervals. We also developed a protocol for direct chemical labeling and hybridization of as little as 2 micrograms of chloroplast RNA. We used this protocol for profiling the expression of the cucumber chloroplast plastome on the PlasTi-microarray. Owing to the high sequence similarity of plant plastomes, the newly constructed microarray can be used to study plants other than cucumber. Comparative hybridization of chloroplast transcriptomes from cucumber, Arabidopsis, tomato and spinach showed that the PlasTi-microarray is highly versatile.Entities:
Year: 2011 PMID: 21952044 PMCID: PMC3195753 DOI: 10.1186/1746-4811-7-29
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Criteria for oligonucleotide probe selection
| Oligo selection criterion | probes satisfying the rule [%] | Rule symbol |
|---|---|---|
| Melting temperature = 70°C (+/- 8°C) | 97.18% | T |
| Nucleotide composition: oligonucleotide cannot have a contiguous single nucleotide base repeat longer than 8 bases AND probe GC content fits the 20-to-60% range AND each base cannot constitute more than 40% of the oligonucleotide sequence | 85.33% | C |
| Stem of potential hairpin structure cannot be longer than 8 bases | 99.20% | H |
| Minimum Hamming distance to non-target parts of genome > 25 | 99.45% | D |
| Substring: not more than 20 contiguous bases common to other (non-target) parts of genome | 99.02% | S |
Figure 1Presentation of the selected parameters in the final microarray probe set. A. Comparison of numbers of probes not meeting each design criterion (T - Melting Temperature, D - Hamming Distance, S - Substring, H - Hairpin, C - Nucleotide Composition, 1 type (2 types, 3 types, 4 types) - probes for which 1 criterion (2, 3 or 4 criteria, respectively) are not fulfilled, Any - probes that do not meet at least one design criterion). B. Frequency distribution of probe GC content (a component of the Nucleotide Composition criterion). C. Frequency distribution of probe melting temperatures. D. Frequency distribution of minimum Hamming distance of the probes, describing the probe specificity within the cucumber chloroplast genome.
Discrepancies from the probe genome perfect complementarities in inverted repeat regions
| Discrepancy | Number of probes affected |
|---|---|
| 70/70 > Identity ≥ 65/65, 0 Mismatches, 0 Gaps | 2 |
| Identity = 69/70, 1 Mismatch, 0 Gaps | 7 |
| Identity = 69/70 or 70/71, 0 Mismatches, 1 Gap | 4 |
| Identity = 68/70 or 70/72, 0 Mismatches, 2 Gaps | 2 |
| Identity = 70/76, 0 Mismatches, 6 Gaps | 1 |
Discrepancy occurs when the probe targets IRA and IRB in regions that are not perfectly identical. In this case, the probe is designed to match one IR and is not 100% complementary to the other one. Level of identity of the probe to the second IR region is presented as number of probe bases perfectly aligning to target region/total length of aligning probe sequence. Probe 5' or 3' ends that do not match the second IR, do not add up to the reported total alignment length. Number of internal mismatches and gaps in alignment is also shown
Evaluation of hybridization signal intensities on PlasTi-microarrays
| Experiment | A | B | C | D | E |
|---|---|---|---|---|---|
| 9.509 | 10.164 | 10.819 | 10.122 | 10.269 | |
| 0.508 | 0.797 | 0.784 | 0.814 | 0.770 | |
| 1.535 | 1.462 | 1.362 | 1.444 | 1.401 | |
| 287 (91.11%) | 254 (80.63%) | 286 (90.79%) | 268 (85.08%) | 277 (87.94%) | |
| 267 (20.32%) | 226 (17.20%) | 225 (17.12%) | 203 (15.45%) | 245 (18.65%) | |
| 5.540 | 8.276 | 8.571 | 8.317 | 7.989 | |
| 1.021 ± 0.160 | 0.999 ± 0.020 | 1.002 ± 0.027 | 1.002 ± 0.026 | 1.001 ± 0.034 | |
| 0.916 ± 0.043 | 0.993 ± 0.018 | 0.989 ± 0.013 | 0.990 ± 0.014 | 0.987 ± 0.010 | |
| 1.031 ± 0.155 | 1.000 ± 0.026 | 1.001 ± 0.027 | 1.001 ± 0.022 | 1.002 ± 0.029 | |
Amean is the mean intensity of all probes on all microarrays within the experiment. Anegmean is the mean intensity of all negative control spots on all microarrays within the experiment. Norm.min and Norm.max are normalized minimal and maximal intensity values. Athr is a threshold intensity value qualifying the probe as "detected" and was calculated from the formula Athr = Anegmean + 2 SD. Experiments A-E are briefly described in the Additional file 3.
Figure 2Patterns of probe signal intensities from five independent microarray experiments. Comparison of signal intensities of all 1629 microarray probes, obtained from five independent, separately normalized microarray experiments. Probes are ordered according to the genome coordinates of their target regions, on the plus strand (upper diagram) and minus strand (lower diagram). Probes covering IR regions have targets on both strands and are displayed twice; on the minus strand, they are masked by semitransparent blocks. Normalized probe intensity from each of five independent microarray experiments (A-E) is presented (Anorm). For each probe in each experiment, Anorm value was obtained by dividing probe intensity (average from two duplicates) by the corresponding Amean (defined in Table 2). This enabled signal intensity comparison across all five experiments. All microarrays were analyzed with no background correction, apart from experiment D, for which local background subtraction was applied, due to high background intensity on most microarrays in this set.
Figure 3Comparative analysis of plastid transcriptomes in cucumber. Gene expression fold change in flowers is presented relative to expression level in leaves. For each gene, an average signal from all probes targeting this gene is reported. Genes are grouped according to their function. A - only results with statistical significance (average p-value of all the probes per gene < 0.05) are presented. B - all gene expression data are displayed on the chloroplast metabolic pathway diagram. Expression changes transformed to logarithmic values are displayed in colors, according to the red-blue scale.
Figure 4Signal intensities of the probes targeting gene introns and intergenic regions. This diagram is a magnified part of a signal intensity diagram for experiment E. The diagram covers a 37,018 -base region of the minus strand, covered by 264 probes. Signals from the probes targeting gene coding regions are marked with asterisks, the operons are marked by yellow blocks, and one gene containing introns is marked by a green block. Signals from the probes targeting non-coding regions that exceed the threshold value (Athr = 10.5) are colored: gray - probes in the proximity of gene coding regions; red - signals from probes in the antisense orientation to genes coded on the plus strand; blue - signals from probes positioned with a distance larger than 600 bases from known coding regions.
Sequence similarity of cucumber PlasTi-microarray probes to plastid genomes of other plants
| Organism | ||||||
|---|---|---|---|---|---|---|
| All | SC coding | SC non-coding | IR coding | IR non-coding | ||
| A.tha | 1188 (72.9%) | 246 (93.9%) | 638 (60.1%) | 53 (100%) | 251 (99.6%) | 1035 (63.5%) |
| N.tab | 1253 (76.9%) | 250 (95.4%) | 706 (66.5%) | 53 (100%) | 244 (96.8%) | 1108 (68.0%) |
| S.lyc | 1212 (74.4%) | 250 (95.4%) | 666 (62.7%) | 53 (100%) | 243 (96.4%) | 1086 (66.7%) |
| S.ole | 927 (56.9%) | 214 (81.7%) | 470 (44.3%) | 49 (92.5%) | 194 (77.0%) | 807 (49.5%) |
| L.sat | 1225 (75.2%) | 252 (96.2%) | 682 (64.2%) | 52 (98.1%) | 239 (94.8%) | 1078 (66.2%) |
| M.tru | 1101 (67.6%) | 238 (90.8%) | 615 (57.9%) | 52 (98.1%) | 196 (77.8%) | 944 (57.9%) |
| L.jap | 1164 (71.4%) | 247 (94.3%) | 625 (58.9%) | 52 (98.1%) | 240 (95.2%) | 1032 (63.4%) |
| P.tri | 1287 (79.0%) | 245 (93.5%) | 740 (69.7%) | 53 (100%) | 249 (98.8%) | 1142 (70.1%) |
| H.vul | 927 (56.9%) | 219 (83.6%) | 474 (44.6%) | 49 (92.5%) | 185 (73.4%) | 802 (49.2.%) |
Each probe was used as a query in a blastn search against each plastid genome, with the E value threshold set at 1 × 10-3. C.sa - Cucumis sativus [GenBank: NC_007144], A. tha - Arabidopsis thaliana [GenBank: NC_000932], N.tab - Nicotiana tabacum [GenBank: NC_001879], S.lyc - Solanum lycopersicon [GenBank: NC_007898], S.ole - Spinacia oleracea [GenBank: NC_002202], L.sat - Lactuca sativa [GenBank: NC_007578], M.tru - Medicago truncatula [GenBank: NC_003119], L.jap - Lotus japonicus [GenBank: NC_002694], P.tri - Populus trichocarpa [GenBank: NC_009143], H.vul - Hordeum vulgare [GenBank: NC_008590]. For each genome, number of probes matching with E value lower than the threshold as well as number of probes matching with E ≤ 1 × 10-10 is reported.
Figure 5Similarity of cucumber probes to . Only results with E < 10-3 are presented. For perfectly matching probes, without gaps or mismatches, the alignment length is exactly 70 bases and the identity is 100%. The bubble size reflects the number of probes with the same alignment length and % identity. A, C, E - similarity to A. thaliana plastid genome; B, D, F - similarity to N. tabacum plastid genome.
Summary of CSH microarray experiments
| Array | Green Channel (Cy3) | Red Channel (Cy5) | No. of probes with both spot replicates flagged as "good" | Cy5/Cy3 total intensity ratio of probes | |
|---|---|---|---|---|---|
| "coding" | "non-coding" | ||||
| a | Arabidopsis | cucumber | 306 | 1251 | 1.96 |
| b | cucumber | Arabidopsis | 306 | 969 | 2.13 |
| c | tomato | cucumber | 308 | 1165 | 2.24 |
| d | cucumber | tomato | 307 | 1114 | 1.76 |
| e | spinach | cucumber | 307 | 1129 | 2.11 |
| f | cucumber | spinach | 308 | 1147 | 1.18 |
Figure 6Proportion of PlasTi-microarray probes with SNR ≥ 3 in CSH-based microarray experiments. Graphs present percentages of probes with SNR ≥ 3 in CSH-based microarray experiments, separately for "coding" (panels A, C, E) and "non-coding" probes (panels B, D, F). Each bar represents the value obtained from one channel of one microarray. Only probes with both replicate spots flagged as good were considered, and the SNR values were averaged between the replicates. a-f - arrays, as indicated in (Table 5). M -probes matching analyzed genome sequence with E ≤ 10-3, NM - probes not matching the genome sequence with E ≤ 10-3. Colors represent array channels: green - Cy3 channel, red - Cy5 channel.
Published plant plastome macro- and microarrays
| Organism | Array description | Experiment | Ref |
|---|---|---|---|
| tobacco | nylon macroarray | leaves of wild-type vs transplastomic tobacco lacking PEP | [ |
| tobacco | cDNA microarray; 220 PCR probes (71 - 2373 bp), each corresponding to a single known gene or an intergenic region | light- or dark-grown seedlings, RIP-chip analysis of MatK-bound RNAs | [ |
| cDNA microarray; 108 DNA fragments to detect all annotated plastid genes | analysis of knockout transformant for the arginine tRNA gene, trnR-CCG | [ | |
| cDNA microarray; 79 PCR probes (88 - 1646 bp) representing protein-coding genes | effects of the | [ | |
| Maize (used also for barley in CSH studies) | cDNA microarray; 248 overlapping PCR products (73 - 1653 bp) covering the whole plastid genome | identification of RNAs associated with PPR proteins in maize (CRP1, PPR4, PPR5) or whirly1 in barley, by RIP-chip | [ |
| cDNA microarray; PCR products (150 - 1500 bp) for 47 chloroplast, 9 mitochondrial, and 15 nuclear genes | analysis of nonphotosynthetic mutants carrying mutations in the | [ | |
| cDNA microarray; 193 PCR probes for protein coding genes and orfs | role of nuclear-encoded sigma factors in plastid transcriptome changes during the shift from dark to light | [ | |
| wheat (used also for barley and rice in CSH studies) | nylon macroarray; 67 PCR products (200 bp - 1259 bp) representing 60 wheat plastid genes (excluding tRNAs) and 7 nuclear genes related to plastid metabolism | germinating seeds and seedlings at three different | [ |
| Maize | cDNA microarray; PCR probes for 887 nuclear, 62 chloroplast, and 27 mitochondrial transcripts | comparison of chloroplasts and etioplasts in stage 2 semi-emerged leaf blades of one month-old plant | [ |
| tobacco, potato, tomato | oligonucleotide microarray; 128 probes (68-71 bases) representing tobacco genes, ycfs and orfs (+ 5 probes designed for potato and tomato due to insufficient homology of tobacco probes) | tomato fruit development and chloroplast-to-chromoplast conversion; potato tuber amyloplasts vs leaf chloroplasts comparison | [ |
| nylon macroarray; 96 PCR probes (75 - 400 bp), representing all | 12 different developmental stages and stress treatments | [ | |
| nylon macroarray; 94 PCR probes for genes encoding plastid proteins, tRNAs and rRNAs; data were complemented with analysis of published data from Affymetrix 22 K ATH1 array experiments | numerous nuclear | [ | |