| Literature DB >> 18194570 |
Hailing Jin1, Vladimir Vacic, Thomas Girke, Stefano Lonardi, Jian-Kang Zhu.
Abstract
BACKGROUND: In spite of large intergenic spaces in plant and animal genomes, 7% to 30% of genes in the genomes encode overlapping cis-natural antisense transcripts (cis-NATs). The widespread occurrence of cis-NATs suggests an evolutionary advantage for this type of genomic arrangement. Experimental evidence for the regulation of two cis-NAT gene pairs by natural antisense transcripts-generated small interfering RNAs (nat-siRNAs) via the RNA interference (RNAi) pathway has been reported in Arabidopsis. However, the extent of siRNA-mediated regulation of cis-NAT genes is still unclear in any genome.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18194570 PMCID: PMC2262095 DOI: 10.1186/1471-2199-9-6
Source DB: PubMed Journal: BMC Mol Biol ISSN: 1471-2199 Impact factor: 2.946
List of Arabidopsis NATs. (Based on TAIR 6.0 dataset "AGI Genes", which contains gene sequences inclusive of introns and UTRs.)
| All NATs (pairs) | 1823 | 108 | 164 | 2095 | 402 ± 524 |
| cis-NATs (pairs) | 941 | 41 | 75 | 1057 | 472 ± 401 |
| | |||||
| PC/tRNA | 3 | 0 | 22 | 25 | 69 ± 11 |
| PC/ta-siRNA | 1 | 0 | 1 | 2 | 143 ± 158 |
| PC/snRNA | 3 | 2 | 1 | 6 | 241 ± 248 |
| PC/snoRNA | 0 | 0 | 1 | 1 | 121 ± 0 |
| PC/transp | 7 | 2 | 2 | 11 | 584 ± 1178 |
| transp/tRNA | 0 | 0 | 1 | 1 | 71 ± 0 |
| transp/transp | 1 | 2 | 0 | 3 | 81 ± 31 |
PC: protein coding gene
Figure 1Twelve genes that overlap with two antisense transcripts. The point ends represent the 3' end of the gene.
Figure 2Pearson correlation coefficients for the cis-NAT pairs and non-overlapping gene pairs plotted to form a density plot. The corresponding means, standard deviations and p-values are in Table 2.
Correlation Analysis of NAT Pairs
| 0.022 ± 0.011 | -0.006 ± 0.304 | 0.082 ± 0.298 | 7.55E-13 | 0.163 | |
| 0.022 ± 0.013 | 0.001 ± 0.352 | 0.082 ± 0.332 | 1.20E-08 | 0.351 | |
The mean Pearson and Spearman correlation coefficients with their standard deviations for randomized (Random, mean of 500 iterations), NAT and all non-overlapping gene pairs (Neighbor). A two-sample t-test was used to calculate p-values for differences in distribution of coefficients between the cis-NAT dataset and the other two datasets.
NAT Pairs with Opposite Differential Expression Changes
| Comp | 155 | 159.0 ± 9.0 | 0.703 |
| Exp Set | 593 | 585.3 ± 5.7 | 0.097 |
The number (N) of cis-NAT pairs with opposite differential expression changes are given for the stringent (Comp) and the less stringent (Exp Set) approach. The same counts were calculated for randomized gene pairs (1000 iterations). Their mean values and standard deviations (± SD) are provided. The p-values represent the number of times the randomized data generated a value ≥ the observed value (N) divided by 1000.
Total endogenous small RNAs from ASRP, MPSS, DB and our stress library dataset that match cis-NAT genes in Arabidopsis.
| Number of siRNAs that match | Stress | ASRP | MPSS | DB | Unique total |
| AGI genes | 8,862 | 65,439 | 31,095 | 118,825 | 213,117 |
| PC genes | 1,891 | 37,010 | 16,657 | 64,099 | 115,709 |
| NAT full-length genes | 87 | 879 | 497 | 2115 | 3492 |
| NAT overlapping regions | 16 | 167 | 76 | 598 | 844 |
Number of genes with small RNA hits in datasets.
| AGI genes | 2,596 | 9,320 | 8,195 | 11,516 | 15,291 |
| PC genes | 1,320 | 6,351 | 5,083 | 8,455 | 12,072 |
| NAT genes | 86 | 382 | 245 | 608 | 914 |
| NAT full-length genes | 69 | 298 | 193 | 448 | 646 |
| NAT overlapping regions | 15 | 65 | 37 | 127 | 180 |
AGI genes: from TAIR 6.0. PC genes: Protein-coding genes.
Summary of the numbers of reads from stress-treated small RNA libraries obtained by 454 sequencing
| Small RNA reads (>= 18) | 21203 | 26246 | 17824 | 12465 | 33289 | 37599 |
| Unique reads (duplicates removed) | 13542 | 15221 | 11262 | 8325 | 15774 | 26806 |
| Match genomic DNAs | 5431 | 5882 | 4514 | 4198 | 6236 | 9613 |
| Match genes + introns + UTR | 2917 | 3183 | 2444 | 2102 | 3933 | 5914 |
| Match genes - introns + UTR | 2814 | 3063 | 2357 | 2038 | 3873 | 5667 |
| Matching NAT genes | 30 | 22 | 19 | 19 | 17 | 24 |
Abiotic stress-treated libraries
Abio1: Cold treatment, 4°C
Abio2: Draught-treatment and ABA treatment
Abio3: Salt treatment and copper treatment
Abio4: UV treatment and heat treatment
Bio1: Pseudomonas syringae DC3000, and DC3000 (avrRpt2) infection
Bio2: A.b., and B. c. infection
Enrichment analysis of small RNAs in the negatively co-regulated NAT pairs
| NAT pairs | OL Hits | FL Hits | |
| Pearson correlation < -0.4 | 71 | 15 (21.1%) | 51 (71.3%) |
| Pearson correlation ≥ -0.4 | 595 | 74 (12.4%) | 370 (62.2%) |
| Total | 666 | 89 (13.3%) | 421 (63.2%) |
The p-value for the enrichment in the number of overlap regions (OL Hits) was 0.03499, and for the enrichment in the number of hits in the full-length NAT region (FL Hits) was 0.04992.
Figure 3An example of one putative nat-siRNA (black arrow, sequence is indicated in the box) that can potentially silence all three UDP-glycosyl transferase genes: At4g34131, At4g34135 and At4g34138. Blue arrows indicate two more target sites.
Figure 4Cellular component GO analysis. The bar plot shows the relative distribution of cellular component GO slim categories among cis-NAT genes, non-NAT genes and all genes in the Arabidopsis genome. The p-values from the hypergeometric distribution test for the NAT genes are in parentheses after the corresponding GO identifiers.