| Literature DB >> 17727707 |
Johan Vallon-Christersson1, Johan Staaf, Anders Kvist, Patrik Medstrand, Ake Borg, Carlos Rovira.
Abstract
BACKGROUND: Recent studies revealed that many mammalian protein-coding genes also transcribe their complementary strands. This phenomenon raises questions regarding the validity of data obtained from double-stranded cDNA microarrays since hybridization to both strands may occur. Here, we wanted to analyze experimentally the incidence of antisense transcription in human cells and to estimate their influence on protein coding expression patterns obtained by double-stranded microarrays. Therefore, we profiled transcription of sense and antisense independently by using strand-specific cDNA microarrays.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17727707 PMCID: PMC2020490 DOI: 10.1186/1471-2164-8-295
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Strand-specific end modifications (amino-linkers) are incorporated into DNA by two parallel PCR reactions. After clean-up, the PCR product for each end-modified strand is printed separately onto the same microarray glass slide. The amino terminal groups are further coupled to the glass. The probes are digested with 5'-3' T7 gene 6 exonuclease and then immersed in boiling water. As a result of this treatment, only end-modifed strands remain attached to the surface. (b) Hybridization of labelled in vitro transcribed RNA from the β-lactamase gene with β-lactamase single-stranded sense and antisense DNA capture probes. Control spots containing ds-DNA probes or amino-modified PCR primers alone were also included on the array (not shown). Both probes and controls were spotted in 10 × 10 replicates. (c) Scatter plot showing the distribution of raw median pixel signal intensities from the hybridization performed in (b). Signals from all replicates are clearly discriminated according to the strand from which they originated (sense or antisense). Control spots for background intensities (printing buffer, PCR and amplification primers) demonstrate the specificity of the single stranded probes.
Figure 2Strand specificity of Histogram showing the distribution of log2 ratios after hybridization to single stranded C (blue), N (red), and double stranded CN (gray) capture probes with labelled Cy3- M13 forward and Cy5 – reverse sequencing primers (The log2 ratios for C and N probes from cDNA clones of plate IRAT3 were centered around -5 and 5 respectively, as expected since their cloning sites were reversed compared to the other clones, but to simplify the presentation the values are inverted). (b) log2 ratios for probes shown in panel matched on cDNA clone ID and sorted on ascending log2 ratio for the respective CN probe. (c) Mean log2ratios for β-lactamase spike-in sense (blue) and antisense (red) control features for hybridizations A to I. For individual features, log2 ratios were calculated using untransformed background corrected data (Median FG – Median BG). For mean log2 ratio calculations, only features where MedianFG > MedianBG in both channels, i.e. Cy3 and Cy5, and where SNR > = 3 in either channel, were included. Error bars denote +/-1SD. Mean log2ratio were calculated using values from between 66 and 70 replicated control spots.
Hybridization signals detected in sample channel and reference channel
| Detected N signals | Masked IMAGE ID | Putative antisense signals | C signals | |||||||||
| total* | reported** | repeats*** | double hits**** | self-complementarity+ | repeats, double, or self++ | reported+++ | total++++ | reported$ | new$$ | total$$$ | % paired N signal$$$$ | |
| UACC812 (A) | 281 | 59 | 148 | 61 | 7 | 151 | 31 | 353 | 27 | |||
| ZR7530 (B) | 316 | 67 | 149 | 57 | 8 | 153 | 33 | 353 | 35 | |||
| BT474 (C) | 610 | 134 | 252 | 74 | 11 | 257 | 50 | 478 | 64 | |||
| MDA361 (D) | 603 | 132 | 241 | 71 | 0 | 246 | 49 | 465 | 66 | |||
| MCF10A (E) | 556 | 114 | 241 | 72 | 11 | 246 | 46 | 473 | 57 | |||
| SKBR3 (F) | 449 | 94 | 192 | 65 | 8 | 196 | 41 | 431 | 48 | |||
| UACC893 (G) | 516 | 110 | 216 | 67 | 8 | 220 | 43 | 442 | 56 | |||
| MDA453 (H) | 562 | 115 | 233 | 72 | 9 | 238 | 47 | 460 | 60 | |||
| CAMA1 (I) | 695 | 151 | 276 | 76 | 11 | 282 | 53 | 501 | 74 | |||
| Mean | 509,78 | 108,44 | 216,44 | 68,33 | 9,22 | 221 | 43,67 | 288,8 | 64,78 (22,2%) | 224 | 439,56 | 54,14 |
| SD | 137,63 | 30,55 | 44,89 | 6,32 | 1,56 | 45,7 | 7,53 | 92,37 | 23,10 (1,2%) | 69,52 | 53,03 | 15,18 |
| Nmbr(union) | 776 | 169 | 300 | 78 | 11 | 306 | 60 | 470 | 109 (23,2%) | 361 | 511 | |
| Nmbr(intersection) | 244 | 52 | 125 | 55 | 7 | 128 | 27 | 116 | 25 (21,6%) | 91 | 315 | |
| Reference (A) | 753 | 169 | 296 | 78 | 11 | 303 | 62 | 510 | 81 | |||
| Reference (B) | 751 | 169 | 288 | 79 | 11 | 295 | 59 | 505 | 82 | |||
| Reference (C) | 813 | 185 | 303 | 77 | 11 | 309 | 65 | 513 | 90 | |||
| Reference (D) | 808 | 179 | 301 | 79 | 11 | 308 | 62 | 514 | 89 | |||
| Reference (E) | 806 | 178 | 301 | 80 | 11 | 307 | 62 | 513 | 89 | |||
| Reference (F) | 826 | 186 | 307 | 79 | 11 | 314 | 65 | 519 | 91 | |||
| Reference (G) | 745 | 169 | 288 | 78 | 11 | 294 | 62 | 502 | 80 | |||
| Reference (H) | 822 | 182 | 311 | 78 | 11 | 317 | 66 | 520 | 90 | |||
| Reference (I) | 874 | 196 | 316 | 79 | 11 | 323 | 64 | 524 | 97 | |||
| Mean | 799,78 | 179,22 | 301,22 | 78,56 | 11 | 307,8 | 63 | 492 | 116,22 (23,6%) | 375,8 | 513,33 | 87,79 |
| SD | 42,63 | 9,24 | 9,54 | 0,88 | 0 | 9,58 | 2,18 | 33,63 | 7,84 (0,4%) | 26,02 | 7,07 | 5,65 |
| Nmbr(union) | 885 | 198 | 322 | 81 | 11 | 329 | 66 | 556 | 132 (23,7%) | 424 | 525 | |
| Nmbr(intersection) | 673 | 153 | 269 | 75 | 11 | 275 | 57 | 398 | 96 (24,1%) | 302 | 489 | |
*Number of IMAGE clones where N probe SNR > 10
**Number of IMAGE clones previously reported (ref 9) as transcribed from antisense (out of Detected N signals total*)
***Number of IMAGE clones with > = 40 continuous bases homologous to repeats (out of Detected N signals total*)
****Number of IMAGE clones with > = 30 continuous bases on both sense- and antisense strand that match RefSeq (out of Detected N signals total*)
+Number of IMAGE clones with > = 30 continuous bases complementary between sense- and antisense transcript (out of Detected N signals total*)
++Number of IMAGE clones with either repeats***, double hits****, or self-complementarity+(out of Detected N signals total*)
+++Number of IMAGE clones that overlap between masked++and clones priviously reported** (out of Detected N signals total*)
++++Number of IMAGE clones with putative antisense transcription, i.e. those clones that are not masked++(out of Detected N signals total*)
$Number of IMAGE clones previously reported as transcribed from antisense out of, Putative antisense signals total++++(percentage within parenthesis)
$$Number of IMAGE clones not masked++ and not privously reported** as transcribed from antisense (out of Detected N signals total*)
$$$Number of IMAGE clones with putative sense transcription, i.e. where C probe SNR > 10, excluding clones that are masked++
$$$$Percent of IMAGE clones displaying N signal out of those that display C signals excluding clones that are masked++
Note: values are paired per clone, i.e. the number of IMAGE clones with C signal and N signal as percentage of the number of IMAGE clones with C signal
Figure 3Validation of putative antisense RNA transcripts. Northern blot hybridizations of total RNA with sense P32-labeled riboprobes corresponding to a sample set for which antisense RNAs were detected on the single-stranded microarray but had not preciously been reported. cDNAs correspond to: ubiquitin C partial cds CCRN4L (BC000449), src homology 2 domain containing transforming protein SHC1 (BC014158), mitogen activated protein kinase 2 MAP2K2 (BE543096), growth arrest and DNA-damage inducible protein gamma GACC45G (BC000465), c-AMP responsive element binding protein 3 CREB3 (BC010158), splicing factor SFRS10 (BC000451) and MCM4 minichromosome maintenance deficient 4 MCM4 (BC036035).
Figure 4Sense and antisense expression profiles. (a) Unsupervised cluster analysis of expression profiles detected by C (blue), N (red) and CN (grey) capture probes across 9 cell lines. A: UACC812; B: ZR75-30; C: BT474; D: MDA361; E MCF10A, F: SKBR3; G: UACC893; H: MDA453 and I: CAMA1. Columns in the gene expression matrix correspond to IMAGE clone IDs. Each cell line is represented by one row. Minimum presence per cDNA clone was set to 7 out of 9 in each data set. (b) Correlation between N- and C-probes across hybridizations A to I for 430 probe pairs, i.e. IMAGE clones. Frequency of Pearson correlation is shown as bars with a bin width of 0.05 (gray bars). For each frequency, the contribution of probe pairs from IMAGE clones previously reported as antisense transcribed is shown as striped areas. Correlation calculations are performed on log2ratios using within hybridization mean log2ratio per reporter across features with SNR > = 10 in either channel, i.e. Cy3 or Cy5. IMAGE clones were removed if their respective N- and C- probes were suspected to potentially cross-hybridize or if their number of missing values across hybridizations was > 2 for either N- or C- probe. (c) Schematic representation of the overlap between KHK (BC006233) and CR623121 transcripts and rational of the RT-PCR reaction aimed to verify the observed coordinated regulation of sense and antisense transcripts. Black arrows represent orientation of transcription for each RNA. Grey arrows mark positions for the PCR primers used in (d). Distance between primers positions refer to the final amplification products. (d) Relative expression of KHK and its antisense CR623121 detected by RT-PCR across all cell lines. Two amplicons of 406 and 626 bp corresponding to KHK sense and antisense transcripts were obtained. Lower panel: ethidium bromide staining after separation in 1% agarose. Band intensities were quantified with the Kodak 1D Image Analysis Software and mean intensities were plotted.
Figure 5(a) Cluster analysis of C (blue), N (red) and CN (grey) expression profiles where all samples were median-centred and treated as they were independent experiments. Minimum presence per cDNA clone was set to 7 out of 9 in each data set. (b) Histogram of pair-wise Euclidean distances over all samples for IMAGE clones. For each IMAGE in 9 dimensions, i.e. hybridization 1–9, the Euclidean distance between C and N is √(Σi = 1 (Ci-Ni)2) where Ci and Ni is the log2ratio for C and N probe respectively for hybridization i. (c) Histogram of distances between C and N probes for all IMAGE for each hybridization.