| Literature DB >> 17040561 |
Huan Wang1, Nam-Hai Chua, Xiu-Jie Wang.
Abstract
BACKGROUND: Natural antisense transcripts (NATs) are coding or non-coding RNAs with sequence complementarity to other transcripts (sense transcripts). These RNAs could potentially regulate the expression of their sense partner(s) at either the transcriptional or post-transcriptional level. Experimental and computational methods have demonstrated the widespread occurrence of NATs in eukaryotes. However, most previous studies only focused on cis-NATs with little attention being paid to NATs that originate in trans.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17040561 PMCID: PMC1794575 DOI: 10.1186/gb-2006-7-10-r92
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Summary of trans-NAT pairs and their corresponding full-length cDNAs
| No. of | ||||
| Total | Both transcripts with FL-cDNA | One transcript with FL-cDNA | No matching FL-cDNA | |
| High-coverage | 368 | 162 | 117 | 89 |
| 100 nt | 952 | 496 | 327 | 129 |
| Total | 1,320 | 658 | 444 | 218 |
FL-cDNAs, full-length cDNAs.
Figure 1Annealed structure of a trans-NAT pair (At4g19270::At1g56530). The annealed structure of two transcripts was predicted by the hybrid program. Transcript At4g19270 is shown as the upper strand from 5' to 3', whilst transcript At1g56530 is shown as the lower strand from 3' to 5'. The paired region obtained by the blast search result is shown in red.
Expression analysis of trans-NAT pairs using MPSS data
| No. of | |||||
| Without MPSS tag | Single strand with MPSS tag | Both strands with MPSS tag (same tissue) | No. of total pairs | ||
| HC | 17 nt MPSS tag | 196 | 125 | 47 (37) | 368 |
| 20 nt MPSS tag | 197 | 115 | 56 (40) | ||
| Either 17 nt or 20 nt MPSS tag | 192 | 118 | 58 (50) | ||
| 100 nt | 17 nt MPSS tag | 276 | 436 | 240 (184) | 952 |
| 20 nt MPSS tag | 269 | 428 | 255 (199) | ||
| Either 17 nt or 20 nt MPSS tag | 252 | 430 | 270 (231) | ||
Data in parentheses are number of trans-NAT pairs with expression in the same tissue. HC and 100-nt refer to the 'high-coverage' and the '100 nt' trans-NAT pair categories, respectively.
Tissue specific MPSS data demonstrate co-expression pattern of some trans-NAT pairs
| Libraries | |||||||||||||||||
| ID | CAF | INF | LEF | ROF | SIF | API | AP3 | AGM | INS | ROS | SAP | SO4 | S52 | LES | GSE | CAS | SIS |
| Pair A | |||||||||||||||||
| At1g50020 | 0 | 18 | 6 | 1 | 0 | 12 | 22 | 1 | 17 | 0 | 0 | 0 | 0 | 0 | 73 | 0 | 0 |
| At1g04820 | 0 | 0 | 0 | 0 | 0 | 19 | 13 | 8 | 13 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 0 |
| Pair B | |||||||||||||||||
| At5g02370 | 0 | 0 | 0 | 0 | 0 | 36 | 15 | 20 | 16 | 0 | 56 | 1 | 19 | 29 | 21 | 28 | 39 |
| At3g09390 | 0 | 0 | 0 | 0 | 0 | 60 | 36 | 27 | 14 | 0 | 55 | 8 | 4 | 28 | 362 | 0 | 0 |
The MPSS data of each transcript within each tissue or differently treated plants are shown to reflect their expression levels
Co-expression analysis of trans-NAT pairs using Arabidopsis root cell in situ hybridization results
| No. of | |||
| Both transcripts with | One transcript with | No | |
| High-coverage | 35 (25) | 66 | 32 |
| 100 nt | 320 (212) | 169 | 36 |
| Total | 355 (237) | 235 | 68 |
Numbers of trans-NAT pairs with expression difference between sense and antisense transcripts less than two-fold according to the in situ hybridization data are shown in parentheses. *Expression difference ≤ 2-fold.
Figure 2Functional analysis of trans-NATs using GO. The percent of Arabidopsis annotated genes and genes involved in trans-NAT pairs in each functional category are shown.
Over-represented gene families or functional groups in Arabidopsis trans-NAT pairs
| Rank | N | X | P-adj | GO attribute |
| 1 | 43 | 167 | <0.001 | 0008194: UDP-glycosyl transferase activity |
| 2 | 197 | 2,634 | <0.001 | 0016757: transferase activity |
| 3 | 16 | 40 | <0.001 | 0016168: chlorophyll binding |
| 4 | 102 | 1,158 | <0.001 | 0005515: protein binding |
| 5 | 16 | 58 | <0.001 | 0042546: cell wall biosynthesis |
| 6 | 10 | 25 | <0.001 | 0030076: light-harvesting complex |
| 7 | 25 | 195 | 0.003 | 0006511: ubiquitin-dependent proteolysis |
| 8 | 42 | 437 | 0.003 | 0006464: protein modification |
| 9 | 53 | 611 | 0.003 | 0007165: signal transduction |
| 10 | 392 | 7,058 | 0.006 | 0003824: catalytic activity |
| 11 | 23 | 194 | 0.013 | 0009733: response to auxin stimulus |
N, number of transcripts from the same GO category involved in Arabidopsis trans-NAT pairs; P-adj, adjusted p value calculated by 1,000 null-hypothesis simulations using Fisher exact test; X, number of genes from the same GO category in Arabidopsis genome.
Phylogenetic conservation of Arabidopsis trans-NAT pairs
| No. of | |||
| Conserved in both poplar and rice | Conserved in poplar only | Conserved in rice only | |
| 1 | 0 | 0 | |
| 13 | 14 | 58 | |
| 320 | 134 | 229 | |
Figure 3Pairing relationship of transcript clusters in trans-NAT pairs.
Figure 4Networks of cis- and trans-NAT pairs formed by transcripts encoding UDP-glucosyl transferase family proteins in A. thaliana. Green ellipses represent UDP-glucosyl transferase transcripts involved in trans-NAT pairs only; red ellipses represent UDP-glucosyl transferase transcripts involved in cis-NAT pairs only; yellow ellipses represent UDP-glucosyl transferase trasncripts involved in both cis- and trans-NAT pairs. Transcripts from other protein families are shown as blue ellipses. Directed lines present the pairing relationship of two transcripts, with arrows pointing to UDP-glucosyl transferase transcripts.
siRNA matches on trans-NAT pairs
| No. of pairs carrying siRNAs on overlapping region | No. of siRNAs matching on overlapping region | |
| HC | 106 | 98 |
| 100 nt | 65 | 77 |
| Total | 171 | 148 |
HC and 100 nt refer to the 'high-coverage' and the '100 nt' trans-NAT categories, respectively.
Number of genes with alternative splicing in the Arabidopsis genome and in predicted trans-NAT pairs
| No. of TUs/pairs with alternative splicing | Total TUs/pairs | Proportion (percent) | |
| All annotated genes | 1,764 | 15,214 | 11.59 |
| 130 | 658 | 19.76 |
TU refers to transcription units with multiple sequences.