| Literature DB >> 22058132 |
Dijun Chen1, Chunhui Yuan, Jian Zhang, Zhao Zhang, Lin Bai, Yijun Meng, Ling-Ling Chen, Ming Chen.
Abstract
Natural antisense transcripts (NATs), as one type of regulatory RNAs, occur prevalently in plant genomes and play significant roles in physiological and pathological processes. Although their important biological functions have been reported widely, a comprehensive database is lacking up to now. Consequently, we constructed a plant NAT database (PlantNATsDB) involving approximately 2 million NAT pairs in 69 plant species. GO annotation and high-throughput small RNA sequencing data currently available were integrated to investigate the biological function of NATs. PlantNATsDB provides various user-friendly web interfaces to facilitate the presentation of NATs and an integrated, graphical network browser to display the complex networks formed by different NATs. Moreover, a 'Gene Set Analysis' module based on GO annotation was designed to dig out the statistical significantly overrepresented GO categories from the specific NAT network. PlantNATsDB is currently the most comprehensive resource of NATs in the plant kingdom, which can serve as a reference database to investigate the regulatory function of NATs. The PlantNATsDB is freely available at http://bis.zju.edu.cn/pnatdb/.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22058132 PMCID: PMC3245084 DOI: 10.1093/nar/gkr823
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.System overview of PlantNATsDB core framework. (A) Schematic presentation of the five different types of cis-NATs (natural antisense transcripts) (i.e. Divergent, Convergent, Containing, Nearby head-to-head and Nearby tail-to-tail) and the trans-NATs predicted by PlantNATsDB. The complementary regions are highlighted and linked with vertical lines. Sequences used for NAT prediction were retrieved from variously public databases, as detailed in the website page. All NATs predicted by PlantNATsDB were deposited in MySQL relational databases. (B) Highlighted features of PlantNATsDB, which integrates various data to evaluate the function of NATs. [B(a)] The 69 plant species currently available in PlantNATsDB. [B(b)] Network formed by different NATs displayed in the integrated network browser, which is based on Cytoscape Web program (31). Note that this network can be edited and used for further analysis, such as ‘Gene Set Analysis’. An example of the output for ‘Small RNA Expression’ of a NAT pair is shown in [B(c)] and ‘GO Annotation’ in [B(d)]. Please note that small RNAs are enriched in the overlapped region and the two genes of the NAT pair share very similar GO annotation. [B(e)] An example of the output for ‘Gene Set Analysis’ based on GO annotation. The enriched GO categories are listed in the table and the P-value indicating the significance of enrichment. The number of genes in each GO category is indicated and shown in the pie chart. Additional functional modules, such as ‘Browser’, ‘Searcher’ and ‘Viewer’ as detailed in the PlantNATsDB website.
Summary statistics of small RNA data sets in PlantNATsDB
| No. | Species | GEO Data sets | |
|---|---|---|---|
| Series | Samples | ||
| 1 | 15 | 80 | |
| 2 | 3 | 8 | |
| 3 | 2 | 4 | |
| 4 | 3 | 6 | |
| 5 | 1 | 2 | |
| 6 | 2 | 6 | |
| 7 | 2 | 5 | |
| 8 | 2 | 5 | |
| 9 | 2 | 6 | |
| 10 | 1 | 2 | |
| 11 | 6 | 38 | |
| 12 | 3 | 10 | |
| 13 | 1 | 2 | |
| 14 | 1 | 2 | |
| 15 | 1 | 1 | |
| 16 | 2 | 2 | |
| 17 | 2 | 5 | |
| 18 | 4 | 12 | |
| Total | 54 | 196 | |
aNumber of GEO Series or GEO Samples in each species, including biological and technical replicates. Detailed information of the data sets in each species is provided at the PlantNATsDB website.
Note that all small RNA data sets in this study were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) (26).
Statistical result of NATs predicted in this study
| No. | Species | Genes | NATs ( | No. | Species | Genes | NATs ( |
|---|---|---|---|---|---|---|---|
| 1 | 4063 (10) | 5 ( | 36 | 7712 (429) | 564 ( | ||
| 2 | 13 556 (655) | 141 ( | 37 | 45 554 (3962) | 3521 ( | ||
| 3 | 32 670 (4841) | 6757 (918, 5839) | 38 | 40 745 (10 153) | 144 088 (387, 143 701) | ||
| 4 | 33 239 (8049) | 7788 (3005, 4783) | 39 | 57 624 (30 799) | 409 789 (1186, 408 603) | ||
| 5 | 4785 (249) | 192 ( | 40 | 7805 (2773) | 1498 (1482, 16) | ||
| 6 | 25 532 (5363) | 107 933 (36, 107 897) | 41 | 7725 (3030) | 1790 (1620, 170) | ||
| 7 | 50 542 (20771) | 45 930 ( | 42 | 52 936 (4631) | 4802 ( | ||
| 8 | 14 727 (2138) | 6119 ( | 43 | 2259 (39) | 25 ( | ||
| 9 | 25 536 (3991) | 7302 (180, 7122) | 44 | 22 518 (1063) | 754 ( | ||
| 10 | 15 935 (6549) | 26 879 (1450, 25 429) | 45 | 11 954 (638) | 433 ( | ||
| 11 | 32 287 (2243) | 3554 ( | 46 | 35 938 (3976) | 24 396 (195, 24 201) | ||
| 12 | 26 081 (3451) | 7492 ( | 47 | 42 746 (22360) | 43 535 ( | ||
| 13 | 7511 (202) | 163 ( | 48 | 39 798 (10897) | 14 298 ( | ||
| 14 | 32 775 (6104) | 23 373 (1471, 21 902) | 49 | 41 377 (5001) | 13 107 (744, 12 363) | ||
| 15 | 9122 (387) | 340 ( | 50 | 27 852 (4642) | 26 163 (298, 25 865) | ||
| 16 | 10 727 (103) | 96 ( | 51 | 17 804 (2138) | 2142 ( | ||
| 17 | 10 617 (309) | 151 ( | 52 | 17 939 (356) | 233 ( | ||
| 18 | 12 248 (156) | 96 ( | 53 | 31 221 (2570) | 3348 (495, 2853) | ||
| 19 | 34 809 (10 622) | 117 786 (574, 117 212) | 54 | 42 377 (5311) | 7210 ( | ||
| 20 | 46 367 (11 352) | 78 339 (436, 77 903) | 55 | 1471 (52) | 32 ( | ||
| 21 | 50 081 (31 296) | 80 835 ( | 56 | 22 285 (2399) | 1558 (669, 889) | ||
| 22 | 9508 (667) | 426 ( | 57 | 28 167 ( | 1793 ( | ||
| 23 | 20 130 (2460) | 2255 ( | 58 | 14 512 (219) | 336 ( | ||
| 24 | 43 306 (6993) | 8503 ( | 59 | 31 972 (2849) | 2866 ( | ||
| 25 | 11 754 (57) | 31 ( | 60 | 34 496 (8231) | 145 374 (241, 145 133) | ||
| 26 | 12 505 (347) | 263 ( | 61 | 9275 (178) | 128 ( | ||
| 27 | 8047 (215) | 140 ( | 62 | 14 724 (889) | 1593 ( | ||
| 28 | 40 504 (7783) | 29 575 (126, 29 449) | 63 | 17 442 (1491) | 1224 ( | ||
| 29 | 34 945 (3631) | 4356 ( | 64 | 7165 (672) | 539 ( | ||
| 30 | 47 443 (14 342) | 30 308 (4454, 25 854) | 65 | 93 508 (32 258) | 120 316 ( | ||
| 31 | 50 962 (18 083) | 164 686 (1151, 163 535) | 66 | 19 333 (592) | 405 ( | ||
| 32 | 3627 (207) | 156 ( | 67 | 26 346 (11 898) | 108 392 (685, 107 707) | ||
| 33 | 10 547 (4717) | 11 881 (1573, 10 308) | 68 | 15 669 (7438) | 90 222 (273, 89 949) | ||
| 34 | 10 108 (4321) | 2338 (2189, 149) | 69 | 32 540 (6944) | 25 726 (1528, 24 198) | ||
| 35 | 27 501 (8885) | 160 109 (1032, 159 077) | Total | 1 746 886 (384 466) | 2 138 498 (28 398, 2 110 100) | ||
aNumber of genes used for NAT prediction in each species. The number of genes formed at least one NAT pair with other genes is shown in parenthesis.
bNumber of predicted NAT (cis- and trans-NAT) pairs in each species.
cThe total number in all species belonging to each categories.
Figure 2.The information page of the NAT pair formed by SRO5 (AT5G62520) and P5CDH (AT5G62530) (9). (A) Summary of the NAT information, including the type, sequences and length of overlapped region. The sequence of the overlapped region is highlighted below. (B) Detailed annotation of the two genes of this cis-NAT pair. (C) GO functional annotation of the two genes. The annotated GO terms are displayed in Venn chart and GO network graph. The GO network graph contains two types of nodes: those that represent the NAT pairs (triangle nodes) and those that represent GO hierarchical terms (circle nodes). The shared GO terms (red color) and specific GO terms (purple and green colors) are shown in different colors. Functional similarity of these two genes is represented by the percent of shared GO terms. (D) The expression of the small RNAs derived from the NAT pair. Small RNAs from different data sets are indicated by dots in different colors. The overlapped region is highlighted in the chart. Small RNA data sets can be added or removed for demonstration in the chart by clicking the buttons below. The enrichment score for small RNA generated from the overlapped region is calculated based on the specific data sets. Please note that there is no observation of enriched small RNA derived from the overlapped region because this NAT pair is specially formed in the salt stress condition and PlantNATsDB lacks such data sets.