| Literature DB >> 23259405 |
Tingting Lu1, Chuanrang Zhu, Guojun Lu, Yunli Guo, Yan Zhou, Zhiyong Zhang, Yan Zhao, Wenjun Li, Ying Lu, Weihua Tang, Qi Feng, Bin Han.
Abstract
BACKGROUND: Cis-natural antisense transcripts (cis-NATs) are RNAs transcribed from the antisense strand of a gene locus, and are complementary to the RNA transcribed from the sense strand. Common techniques including microarray approach and analysis of transcriptome databases are the major ways to globally identify cis-NATs in various eukaryotic organisms. Genome-wide in silico analysis has identified a large number of cis-NATs that may generate endogenous short interfering RNAs (nat-siRNAs), which participate in important biogenesis mechanisms for transcriptional and post-transcriptional regulation in rice. However, the transcriptomes are yet to be deeply sequenced to comprehensively investigate cis-NATs.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23259405 PMCID: PMC3549290 DOI: 10.1186/1471-2164-13-721
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Statistics of 3819 NATs identified in rice
| 1 | 10,536 | 558 | 301 | 146 | 11 | 205 | 176 | 92 | 39 | 30 | 16 |
| 2 | 8,527 | 440 | 239 | 115 | 86 | 159 | 141 | 63 | 36 | 30 | 11 |
| 3 | 9,197 | 491 | 296 | 120 | 75 | 175 | 156 | 90 | 29 | 22 | 19 |
| 4 | 6,762 | 361 | 185 | 95 | 81 | 100 | 132 | 55 | 27 | 27 | 20 |
| 5 | 5,940 | 290 | 161 | 71 | 58 | 107 | 108 | 34 | 21 | 11 | 9 |
| 6 | 6166 | 299 | 170 | 69 | 60 | 111 | 100 | 38 | 23 | 13 | 14 |
| 7 | 5,862 | 278 | 163 | 63 | 52 | 90 | 96 | 30 | 24 | 19 | 19 |
| 8 | 5,290 | 239 | 138 | 51 | 50 | 103 | 74 | 27 | 19 | 10 | 6 |
| 9 | 4,256 | 189 | 100 | 34 | 55 | 70 | 71 | 10 | 16 | 10 | 12 |
| 10 | 4,144 | 221 | 128 | 41 | 52 | 85 | 74 | 24 | 19 | 10 | 9 |
| 11 | 4,861 | 231 | 139 | 47 | 45 | 92 | 75 | 25 | 23 | 7 | 9 |
| 12 | 4,472 | 222 | 129 | 46 | 47 | 81 | 72 | 24 | 20 | 15 | 10 |
| Total | 76,013 | 3,819 | 2,149 | 898 | 772 | 1,378 | 1,275 | 512 | 296 | 204 | 154 |
acis-NAT pairs without transposons.
b One transcript being entirely reverse-complementarily overlapped by the other.
c Convergent cis-NAT (with 3′-ends overlapped).
d Divergent cis-NAT (with 5′-ends overlapped).
e Coding sequence with PFAM domain-containing.
f Predicted as coding sequence but without any PFAM domain hits.
Numbers of NATs with expression evidence and nat-siRNAs, under four different conditions and in epidermal cells
| sda | 1,789 | 1,043 | 72 | 9 |
| STb | 1,572 | 949 | 58 | 5 |
| CDc | 1,668 | 1,003 | 66 | 5 |
| DTd | 1,668 | 986 | 75 | 5 |
| Co- | 1,072 (46.7%)e | 725 (58.6%)e | 10 (5.9%)e | 2 (15.4%)e |
| Total | 2,292 | 1,238 | 166 | 13 |
a 14-d-old-seedling.
b 14-d-old-seedling treated with 200 mM NaCl.
c 14-d-old-seedling grown under cold stress at 4°C for 24 h in darkness.
d 14-d-old-seedling treated with 20% PEG-6000.
e The percentage of co-cis-NATs in the corresponding total cis-NATs was calculated.
f Small RNAs were only enriched in the overlapping regions with more than five reads.
g The density of siRNAs in overlapping regions was > 5 times that in non-overlapping regions.
Figure 1Overview of 2292 one-to-one NATs. (A) Venn diagram showing the shared cis-NATs that expressed under normal (control), cold, salt and drought conditions. The numbers of cis-NATs that expressed under different conditions are noted in the corresponding parentheses. (B) Scatter plot showing comparison of transcripts expression ratio trends of 1072 co-expressed cis-NAT pairs between normal and cold stress conditions. Five subgroups 1, 2, 3, 4 and 5 are indicated by red, green, purple, blue and orange points, respectively. (C) Three sets of genes are categorized on the basis of components of each cis-NAT pair. Set I (blue bar): 767 cis-NAT pairs of protein-coding genes and non-protein-coding RNAs; Set II (red bar): 818 cis-NAT pairs of protein-coding genes and predicted CDS without any PFAM domain; Set III (green bar): 377 cis-NATs composed of both protein-coding genes. Y-axis represents the number of transcripts. X-axis represents 43 protein families that were the major enriched terms in either of three sets. Four proteins of pentatricopeptide repeat (PPR) families, leucine rich repeat, NB-ARC domain and UDP-glucoronosyl and UDP-glucosyl transferase, which exhibited distinctively enriched terms with P < 0.001 in Sets II and III, are highlighted in pink in the X-axis.
Figure 2An example of regulatory small RNAs derived from a -NAT pair. (A) A cis-NAT pair was formed with genes UDP-glucoronosyl and UDP-glucosyl transferase (Os05t0500000-00) and predicted CDS without any protein hits (Os05t0500101-01). Os05t0500000-00 was re-annotated by ssRNA-seq data (short blue lines) from normal condition in the plus strand. Os05t0500101-01 was re-annotated by ssRNA-seq data (short red lines) in the minus strand. The cis-NATs spawned five unique small RNAs, mainly from the plus strand (short green lines). (B) Under cold-stress conditions, expression of this cis-NAT pair was dramatically down-regulated. (C) On the left: Northern blot analysis of the cis-NATs confirmed differential expression levels under normal and cold treatment. On the right: the figure shows the FPKM results of this cis-NAT pair. FPKM refers to Fragments Per Kilobase of exon Models.
Figure 3An example of an overwhelmingly up-regulated NAT under abiotic stresses. (A) Expressions of a cis-NAT pair of an annotated gene Os09t0482800-02 and a novel ncRNA CUFF.14823.1 are dramatically up-regulated under cold-stress conditions. Expression levels of this cis-NAT pair under normal, cold, salt and drought conditions are relatively low and displayed. (B) The unique small RNAs that are generated mainly from the plus strand (short blue lines) of the cis-NAT pair of the gene Os09t0482800-02 and the ncRNA CUFF.14823.1 are shown. (C) Expressions of this cis-NAT pair were detected through semi-quantitative RT-PCR analysis. Rice actin expression was used as a control. The PCR products of the cis-NAT were amplified by 36 cycles, while actin products were amplified by 26 cycles. Real-time RT-PCR analysis of the cis-NATs was used to confirm the differential expressed genes under normal and cold conditions. Expression data from ssRNA-seq (FPKM, Fragments Per Kilobase of exon Models) are represented as gray blocks; and the data from real-time RT-PCR are indicated as orange lines.
Figure 4Expression patterns of eight one-to-one NATs and the distribution of small RNAs located in overlapped regions of these gene pairs. (A-H) Expression patterns of eight cis-NATs are displayed. The left Y-axis of each graph represents FPKM of each transcript. Blue bars indicate FPKM of plus- transcripts (above the X-axis); red bars indicate FPKM of its partner of minus- transcripts (below the X-axis). The right Y-axis indicates the number of small RNAs located in overlapped regions. The green curves represent unique nat-siRNAs, and the purple curves represent all nat-siRNAs. FPKM refers to Fragments Per Kilobase of exon Models.