Literature DB >> 22058132

PlantNATsDB: a comprehensive database of plant natural antisense transcripts.

Dijun Chen1, Chunhui Yuan, Jian Zhang, Zhao Zhang, Lin Bai, Yijun Meng, Ling-Ling Chen, Ming Chen.   

Abstract

Natural antisense transcripts (NATs), as one type of regulatory RNAs, occur prevalently in plant genomes and play significant roles in physiological and pathological processes. Although their important biological functions have been reported widely, a comprehensive database is lacking up to now. Consequently, we constructed a plant NAT database (PlantNATsDB) involving approximately 2 million NAT pairs in 69 plant species. GO annotation and high-throughput small RNA sequencing data currently available were integrated to investigate the biological function of NATs. PlantNATsDB provides various user-friendly web interfaces to facilitate the presentation of NATs and an integrated, graphical network browser to display the complex networks formed by different NATs. Moreover, a 'Gene Set Analysis' module based on GO annotation was designed to dig out the statistical significantly overrepresented GO categories from the specific NAT network. PlantNATsDB is currently the most comprehensive resource of NATs in the plant kingdom, which can serve as a reference database to investigate the regulatory function of NATs. The PlantNATsDB is freely available at http://bis.zju.edu.cn/pnatdb/.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 22058132      PMCID: PMC3245084          DOI: 10.1093/nar/gkr823

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Gene regulation at RNA level has been progressively shown to be more important and prevalent than previously presumed (1,2). With the advances of high-throughput experimental technologies and bioinformatics methods, an explosion of recent findings underscores both the predominance and complexity of regulatory RNA molecules in eukaryotes, including the discovery of ubiquitous regulatory short non-coding RNAs (ncRNAs) (3), including microRNAs (miRNAs), endogenous short interfering RNAs (siRNAs) and Piwi-interacting RNAs (piRNAs), and the functional long ncRNAs (1,4). Natural antisense transcripts (NATs), as a new member of regulatory RNAs, occur prevalently in prokaryote and eukaryote genomes, and play significant roles in physiological and/or pathological processes (5). NATs are a group of endogenous RNA molecules containing sequences that are complementary to other transcripts (5–7). This class of RNAs includes both protein- and non-coding transcripts. NATs can be grouped into two categories, cis-NATs and trans-NATs, based on whether they act in cis or trans. Cis-NAT pairs are transcribed from opposing DNA strands at the same genomic locus and have a variety of orientations and differing lengths of overlap between the perfect sequence complementary regions, whereas trans-NAT pairs are transcribed from different loci and form partial complementarily (5). Although underlying mechanistic insights are largely unknown, NATs have been implicated in many aspects of gene regulation including genomic imprinting, transcriptional interference, RNA masking, RNA editing, RNA interference (RNAi) and translational regulation (5,7,8). However, since the discovery of the founder example of cis-NATs, SRO5 and P5CDH, involving in the regulation of salt tolerance through RNAi pathway in Arabidopsis (Arabidopsis thaliana) (9), more and more examples of NATs have been shown to act together with endogenous siRNAs (nat–siRNAs) from the overlapping regions in both plant and animal species (10–16). Moreover, deep sequencing of small RNAs (sRNAs) together with bioinformatics analysis reveals that the overlap portions of NATs are the hotspots for siRNA generation (12,13,17,18), further indicating that NATs are an important biogenesis mechanism of endogenous siRNAs. These recent discoveries revealed the unexpected complexity of the regulatory networks formed by NATs (17). Whole-genome searches based on computational analysis have identified thousands of NAT pairs in multiple eukaryotes. Thus, standardized applications or databases are required for data description, deposition, organization, parsing and analysis, and also allowing for functional discovery by integrating other biological data. To date, there are just a few free available NAT databases, one of which, NATsDB (19), comprises 10 animal species. However, the existing databases mainly focus on cis-NATs and none of them expand to any plant species, although both cis-NATs and trans-NATs have been reported in several plant species including two model plants, the monocot rice (Oryza sativa) (17,18,20) and the eudicot Arabidopsis (18,21–23). Furthermore, the functional annotation and graphical visualization of the NATs is limited. In the current analysis, we developed a genome-scale computational pipeline to identify NATs in plant species. A convenient database of plant NATs (PlantNATsDB) was constructed, which contains 69 plant species and provides the most comprehensive data set to date. PlantNATsDB serves the plant research community by providing facilitated access to a huge amount of resources regarding the NATs as well as a variety of specific analysis tools including browsing, searching, viewing, downloading and so on. In addition, it integrates Gene Ontology (GO) annotation (24) and sRNA high-throughput sequencing data sets to evaluate and investigate the function of NATs. Moreover, a ‘Gene Set Analysis’ module based on GO annotation was implemented to excavate the statistical significantly overrepresented GO categories from the complex network formed by different NATs. PlantNATsDB provides an information rich and user-friendly interface and an integrated, graphical network browser to facilitate mining-specific functional NAT pairs (Figure 1). Detailed information is provided at the PlantNATsDB website (http://bis.zju.edu.cn/pnatdb/).
Figure 1.

System overview of PlantNATsDB core framework. (A) Schematic presentation of the five different types of cis-NATs (natural antisense transcripts) (i.e. Divergent, Convergent, Containing, Nearby head-to-head and Nearby tail-to-tail) and the trans-NATs predicted by PlantNATsDB. The complementary regions are highlighted and linked with vertical lines. Sequences used for NAT prediction were retrieved from variously public databases, as detailed in the website page. All NATs predicted by PlantNATsDB were deposited in MySQL relational databases. (B) Highlighted features of PlantNATsDB, which integrates various data to evaluate the function of NATs. [B(a)] The 69 plant species currently available in PlantNATsDB. [B(b)] Network formed by different NATs displayed in the integrated network browser, which is based on Cytoscape Web program (31). Note that this network can be edited and used for further analysis, such as ‘Gene Set Analysis’. An example of the output for ‘Small RNA Expression’ of a NAT pair is shown in [B(c)] and ‘GO Annotation’ in [B(d)]. Please note that small RNAs are enriched in the overlapped region and the two genes of the NAT pair share very similar GO annotation. [B(e)] An example of the output for ‘Gene Set Analysis’ based on GO annotation. The enriched GO categories are listed in the table and the P-value indicating the significance of enrichment. The number of genes in each GO category is indicated and shown in the pie chart. Additional functional modules, such as ‘Browser’, ‘Searcher’ and ‘Viewer’ as detailed in the PlantNATsDB website.

System overview of PlantNATsDB core framework. (A) Schematic presentation of the five different types of cis-NATs (natural antisense transcripts) (i.e. Divergent, Convergent, Containing, Nearby head-to-head and Nearby tail-to-tail) and the trans-NATs predicted by PlantNATsDB. The complementary regions are highlighted and linked with vertical lines. Sequences used for NAT prediction were retrieved from variously public databases, as detailed in the website page. All NATs predicted by PlantNATsDB were deposited in MySQL relational databases. (B) Highlighted features of PlantNATsDB, which integrates various data to evaluate the function of NATs. [B(a)] The 69 plant species currently available in PlantNATsDB. [B(b)] Network formed by different NATs displayed in the integrated network browser, which is based on Cytoscape Web program (31). Note that this network can be edited and used for further analysis, such as ‘Gene Set Analysis’. An example of the output for ‘Small RNA Expression’ of a NAT pair is shown in [B(c)] and ‘GO Annotation’ in [B(d)]. Please note that small RNAs are enriched in the overlapped region and the two genes of the NAT pair share very similar GO annotation. [B(e)] An example of the output for ‘Gene Set Analysis’ based on GO annotation. The enriched GO categories are listed in the table and the P-value indicating the significance of enrichment. The number of genes in each GO category is indicated and shown in the pie chart. Additional functional modules, such as ‘Browser’, ‘Searcher’ and ‘Viewer’ as detailed in the PlantNATsDB website.

DATABASE CONSTRUCTION

Data source

Of the 69 plant species, 27 have genomic information. For these 27 genomically sequenced species, the annotated transcription units (TUs) used for NAT prediction and other annotation information were downloaded from the specific genome-sequencing projects. Based on the fact that pseudogenes and transposons can form NATs with protein-coding genes (10,11,17), all the pseudogenes and transposons were retained for NAT prediction. For the remaining 42 plant species, the tentative consensus sequences (TCs), which can be used to provide putative genes with functional annotation similar to TUs were used for NAT prediction, and their related information were downloaded from The Gene Index Project (25). SRNA high-throughput sequencing data sets of each species were obtained from the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/) (26). All the sRNA data sets retrieved for this study were summarized in Table 1.
Table 1.

Summary statistics of small RNA data sets in PlantNATsDB

No.SpeciesGEO Data setsa
SeriesSamples
1Arabidopsis thaliana1580
2Arabidopsis lyrata38
3Brachypodium distachyon24
4Chlamydomonas reinhardtii36
5Citrus sinensis12
6Gossypium hirsutum26
7Glycine max25
8Medicago truncatula25
9Nicotiana benthamiana26
10Oryza sativa subsp. indica12
11Oryza sativa subsp. japonica638
12Physcomitrella patens310
13Prunus persica12
14Solanum lycopersicum12
15Selaginella moellendorffii11
16Triticum aestivum22
17Vitis vinifera25
18Zea mays412
Total54196

aNumber of GEO Series or GEO Samples in each species, including biological and technical replicates. Detailed information of the data sets in each species is provided at the PlantNATsDB website.

Note that all small RNA data sets in this study were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) (26).

Summary statistics of small RNA data sets in PlantNATsDB aNumber of GEO Series or GEO Samples in each species, including biological and technical replicates. Detailed information of the data sets in each species is provided at the PlantNATsDB website. Note that all small RNA data sets in this study were downloaded from the GEO database (http://www.ncbi.nlm.nih.gov/geo/) (26).

Prediction of NAT pairs

Prediction of NAT pairs was performed as previously described (17,18,22). Specifically, the following criteria were used to identify cis-NATs and trans-NATs, respectively. For cis-NATs, they can be grouped into five categories, namely: (i) Divergent (head to head or 5′ to 5′ overlap); (ii) Convergent (tail-to-tail or 3′ to 3′ overlap); (iii) Containing (full overlap); (iv) Nearby head-to-head (5′ close to 5′) and (v) Nearby tail-to-tail (3′ close to 3′) according to their relative orientation and degree of overlap (Figure 1A) (27). If a pair of transcripts is located in opposite strands at adjacent genomic loci and has at least 1 nt overlapping, or their distance on the chromosome is no >100 nt, then they were considered as a cis-NAT pair. In total, 27 plant species were subjected to cis-NAT prediction. For trans-NATs, BLASTN (ftp://ftp.ncbi.nlm.nih.gov/blast/executables/release/, Release 2.2.20) (28) was used to search for transcript pairs with high sequence complementary to each other and the following criteria should be satisfied for each transcript pair: (i) If the complementary region identified by BLAST covered more than half the length of either transcript, this transcript pair was designated to be a ‘high-coverage’ (HC) trans-NAT pair; (ii) If the two transcripts had a continuous complementary region >100 nt, they were classified as a ‘100-nt’ pair. Functional trans-NATs should form RNA–RNA duplexes in vivo. We therefore used DINAMelt (29) to verify whether the transcript pairs could melt into RNA–RNA duplexes in the complementary regions in silico. All the trans-NAT pairs based on BLAST search were further used to DINAMelt hybridization validation. The trans-NAT pair was retained if it satisfied: (i) the paired region indentified by DINAMelt should be coincident with the BLAST-based search; (ii) any bubble in the paired region predicted by DINAMelt should be no longer than 10% of the region. For the BLAST-based trans-NAT pairs that contain transcripts >10 kb, they were not applied to DINAMelt validation due to the heavy computational work. Instead, if the paired region identified by BLAST was >10% of its longer transcript, it was considered as verified trans-NAT. All the NAT pairs predicted in this study were summarized in Table 2.
Table 2.

Statistical result of NATs predicted in this study

No.SpeciesGenesaNATs (cis, trans)bNo.SpeciesGenesaNATs (cis, trans)b
1Allium cepa4063 (10)5 (NA, 5)36Nicotiana benthamiana7712 (429)564 (NA, 564)
2Aquilegia coerulea13 556 (655)141 (NA, 141)37Nicotiana tabacum45 554 (3962)3521 (NA, 3521)
3Arabidopsis lyrata32 670 (4841)6757 (918, 5839)38Oryza sativa subsp. indica40 745 (10 153)144 088 (387, 143 701)
4Arabidopsis thaliana33 239 (8049)7788 (3005, 4783)39Oryza sativa subsp. japonica57 624 (30 799)409 789 (1186, 408 603)
5Beta vulgaris4785 (249)192 (NA, 192)40Ostreococcus lucimarinus CCE99017805 (2773)1498 (1482, 16)
6Brachypodium distachyon25 532 (5363)107 933 (36, 107 897)41Ostreococcus tauri7725 (3030)1790 (1620, 170)
7Brassica napus50 542 (20771)45 930 (NA, 45 930)42Panicum virgatum52 936 (4631)4802 (NA, 4802)
8Capsicum annuum14 727 (2138)6119 (NA, 6119)43Petunia hybrida2259 (39)25 (NA, 25)
9Carica papaya25 536 (3991)7302 (180, 7122)44Phaseolus coccineus22 518 (1063)754 (NA, 754)
10Chlamydomonas reinhardtii15 935 (6549)26 879 (1450, 25 429)45Phaseolus vulgaris11 954 (638)433 (NA, 433)
11Citrus clementina32 287 (2243)3554 (NA, 3554)46Physcomitrella patens35 938 (3976)24 396 (195, 24 201)
12Citrus sinensis26 081 (3451)7492 (NA, 7492)47Picea abies42 746 (22360)43 535 (NA, 43 535)
13Coffea canephora7511 (202)163 (NA, 163)48Pinus taeda39 798 (10897)14 298 (NA, 14 298)
14Cucumis sativus32 775 (6104)23 373 (1471, 21 902)49Populus trichocarpa41 377 (5001)13 107 (744, 12 363)
15Ectocarpus siliculosus9122 (387)340 (NA, 340)50Prunus persica27 852 (4642)26 163 (298, 25 865)
16Euphorbia esula10 727 (103)96 (NA, 96)51Quercus robur17 804 (2138)2142 (NA, 2142)
17Festuca arundinacea10 617 (309)151 (NA, 151)52Raphanus sativus17 939 (356)233 (NA, 233)
18Festuca pratensis12 248 (156)96 (NA, 96)53Ricinus communis31 221 (2570)3348 (495, 2853)
19Fragaria vesca34 809 (10 622)117 786 (574, 117 212)54Saccharum officinarum42 377 (5311)7210 (NA, 7210)
20Glycine max46 367 (11 352)78 339 (436, 77 903)55Secale cereale1471 (52)32 (NA, 32)
21Gossypium hirsutum50 081 (31 296)80 835 (NA, 80 835)56Selaginella moellendorffii22 285 (2399)1558 (669, 889)
22Gossypium raimondii9508 (667)426 (NA, 426)57Solanum lycopersicum28 167 (2039)1793 (NA, 1793)
23Helianthus annuus20 130 (2460)2255 (NA, 2255)58Solanum melongena14 512 (219)336 (NA, 336)
24Hordeum vulgare43 306 (6993)8503 (NA, 8503)59Solanum tuberosum31 972 (2849)2866 (NA, 2866)
25Ipomoea nil11 754 (57)31 (NA, 31)60Sorghum bicolor34 496 (8231)145 374 (241, 145 133)
26Lactuca sativa12 505 (347)263 (NA, 263)61Striga hermonthica9275 (178)128 (NA, 128)
27Lactuca serriola8047 (215)140 (NA, 140)62Theobroma cacao14 724 (889)1593 (NA, 1593)
28Lotus japonicus40 504 (7783)29 575 (126, 29 449)63Triphysaria eriantha17 442 (1491)1224 (NA, 1224)
29Malus x domestica34 945 (3631)4356 (NA, 4356)64Triphysaria versicolor7165 (672)539 (NA, 539)
30Manihot esculenta47 443 (14 342)30 308 (4454, 25 854)65Triticum aestivum93 508 (32 258)120 316 (NA, 120 316)
31Medicago truncatula50 962 (18 083)164 686 (1151, 163 535)66Vigna unguiculata19 333 (592)405 (NA, 405)
32Mesembryanthemum crystallinum3627 (207)156 (NA, 156)67Vitis vinifera26 346 (11 898)108 392 (685, 107 707)
33Micromonas pusilla CCMP154510 547 (4717)11 881 (1573, 10 308)68Volvox carteri15 669 (7438)90 222 (273, 89 949)
34Micromonas sp. RCC29910 108 (4321)2338 (2189, 149)69Zea mays32 540 (6944)25 726 (1528, 24 198)
35Mimulus guttatus27 501 (8885)160 109 (1032, 159 077)Totalc1 746 886 (384 466)2 138 498 (28 398, 2 110 100)

aNumber of genes used for NAT prediction in each species. The number of genes formed at least one NAT pair with other genes is shown in parenthesis.

bNumber of predicted NAT (cis- and trans-NAT) pairs in each species.

cThe total number in all species belonging to each categories.

Statistical result of NATs predicted in this study aNumber of genes used for NAT prediction in each species. The number of genes formed at least one NAT pair with other genes is shown in parenthesis. bNumber of predicted NAT (cis- and trans-NAT) pairs in each species. cThe total number in all species belonging to each categories.

Small RNA analysis

SRNA sequences containing incomplete information (such as containing ‘N’) with length <18 or >28 were removed for further analysis. For each data set, the filtered sRNA sequences were mapped to all the gene models of the related plant species. All mapping steps were performed using the Bowtie algorithm (30) allowing no mismatch. Besides, for comparison, the normalized abundance of sRNAs from each data set was calculated as RPMs (reads per million), which divided the read number of each sRNA by the total reads from this data set, and multiplied by 106. For each NAT, an enrichment score was calculated to evaluate whether sRNAs were enriched in the overlapping region (17,18). The enrichment score E was calculated using the following formula: where So = the total normalized abundance of the sRNAs generated from the overlapping region, Lo = the total length of the paired region of the two transcripts of the NATs, Sa = the total normalized abundance of the sRNAs generated from these two transcripts and La = the total length of the two transcripts. Furthermore, a standard χ2 test (Pearson's chi-square test) was performed to test the significance of the enrichment.

Database implementation

All the predicted NATs and processed sRNAs were organized and stored in the MySQL database (http://www.mysql.com/). Besides, the gene sequence information, annotated gene models and their functional annotations, including GO annotations, were collected and stored in the database. These genes can also be linked to external genome browsers. PlantNATsDB was implemented in JSP language and deployed on the Apache Tomcat web server (http://tomcat.apache.org/). The integrated network browser is created by Cytoscape Web program (http://cytoscapeweb.cytoscape.org/) (31). JavaScript and adobe flash player are required in order to use the full functionality of PlantNATsDB. PlantNATsDB can be accessed through IE 6.0 or higher, Netscape 7.0 or higher, Safari, Opera, Chrome and Firefox from multiple platforms.

WEB INTERFACE AND DATABASE USAGE

Search modules

PlantNATsDB provides various query interfaces and graphical visualization tools to facilitate the retrieve and demonstration of NAT data. Four major search modules for retrieving NATs are designed: ‘Simple Searcher’, ‘Batched Searcher’, ‘Advanced Searcher’ and ‘BLAST Searcher’. Alternatively, users can get the entire NAT list by species in the ‘Browser’ module. The ‘Simple Searcher’ module allows users to enter any keyword in all fields for all data entries, including gene locus identifiers (IDs), gene aliases or any words in their annotation texts. The ‘Batched Searcher’ module supports gene set search, which allows users to enter a list of gene locus IDs or gene aliases. The ‘Advanced Searcher’ was designed to facilitate users to access any NAT data according multiple options such as the plant species, the types of NATs, the length of overlapping regions and the GO annotation. In addition, users can perform a BLAST sequence search to retrieve NAT data in the fourth module, ‘BLAST Searcher’. All the search results performed by the above search modules can be further used for functional investigation (see below).

NAT information page

For each NAT pair, PlantNATsDB provides rich annotation according to the relationships between the related two genes. The result page largely comprises four main parts, i.e. NAT summarization, gene information, GO annotation and sRNA expression. Generally, all parts are displayed vividly in the graphical fashion. Figure 2 shows the example of SRO5 and P5CDH cis-NAT pair (9). The first part is the summary of NAT information and the overlapping region is highlighted (Figure 2A). The second part shows the detailed annotation of the two genes (Figure 2B). The third part displays the GO functional assessment of this NAT pair based on the GO annotation of the two genes (Figure 2C). Functional NAT pairs are expected to have similar ‘Molecular Function’, involve in the related ‘Biological Process’ and/or locate in the same ‘Cellular Component’. Therefore, the same GO terms shared by the two genes are highlighted in the GO network graph. The information provided in this part is very useful for evaluating the function of NAT. The last part provides sRNA expression derived from the NAT pair (Figure 2D). Based on the finding that sRNAs were the important component in the NAT regulatory pathway (9), most, if not all, of the sRNA data sets currently available were collected and further processed and organized into the database (Table 1). Thus, these invaluable data sources will be of much help to users to inquire the function of NATs. Furthermore, a user-friendly interface is provided that allows users to add or remove data sets for analysis and to highlight different regions of the NAT.
Figure 2.

The information page of the NAT pair formed by SRO5 (AT5G62520) and P5CDH (AT5G62530) (9). (A) Summary of the NAT information, including the type, sequences and length of overlapped region. The sequence of the overlapped region is highlighted below. (B) Detailed annotation of the two genes of this cis-NAT pair. (C) GO functional annotation of the two genes. The annotated GO terms are displayed in Venn chart and GO network graph. The GO network graph contains two types of nodes: those that represent the NAT pairs (triangle nodes) and those that represent GO hierarchical terms (circle nodes). The shared GO terms (red color) and specific GO terms (purple and green colors) are shown in different colors. Functional similarity of these two genes is represented by the percent of shared GO terms. (D) The expression of the small RNAs derived from the NAT pair. Small RNAs from different data sets are indicated by dots in different colors. The overlapped region is highlighted in the chart. Small RNA data sets can be added or removed for demonstration in the chart by clicking the buttons below. The enrichment score for small RNA generated from the overlapped region is calculated based on the specific data sets. Please note that there is no observation of enriched small RNA derived from the overlapped region because this NAT pair is specially formed in the salt stress condition and PlantNATsDB lacks such data sets.

The information page of the NAT pair formed by SRO5 (AT5G62520) and P5CDH (AT5G62530) (9). (A) Summary of the NAT information, including the type, sequences and length of overlapped region. The sequence of the overlapped region is highlighted below. (B) Detailed annotation of the two genes of this cis-NAT pair. (C) GO functional annotation of the two genes. The annotated GO terms are displayed in Venn chart and GO network graph. The GO network graph contains two types of nodes: those that represent the NAT pairs (triangle nodes) and those that represent GO hierarchical terms (circle nodes). The shared GO terms (red color) and specific GO terms (purple and green colors) are shown in different colors. Functional similarity of these two genes is represented by the percent of shared GO terms. (D) The expression of the small RNAs derived from the NAT pair. Small RNAs from different data sets are indicated by dots in different colors. The overlapped region is highlighted in the chart. Small RNA data sets can be added or removed for demonstration in the chart by clicking the buttons below. The enrichment score for small RNA generated from the overlapped region is calculated based on the specific data sets. Please note that there is no observation of enriched small RNA derived from the overlapped region because this NAT pair is specially formed in the salt stress condition and PlantNATsDB lacks such data sets.

GO functional analysis module

Gene set analysis based on GO annotation (24) and statistical test is widely used to identify enriched GO categories and to explore the most important biological terms associated with the given gene set. A ‘Gene Set Analysis’ module (Figure 1B) has been developed for organizing a set of genes based on GO annotation, where the set of genes can be found by the search modules (see above) or collected in the network formed by NAT pairs (see below). Here, we used the combination of the χ2 test and Fisher's exact test to evaluate the significance of enrichment for GO category. Detailed methods can be referred from the PlantNATsDB website.

Graphical interaction network visualization

One gene may forms multiple NAT pairs with other antisense transcription partners, just as multiple paralogous genes may form RNA duplexes with the same antisense transcripts. Different NAT pairs might form complex regulatory networks in the related process (17). To this end, a graphical browser based on Cytoscape Web program (31) was developed to display the network formed by different NAT pairs (Figure 1B). Different types of nodes (genes) and relationships (NAT pairs) are colored distinctly. Moreover, the network graph can be edited (such as, to click/double-click/right-click the nodes/edges, to delete the nodes/edges, to apply distinct layouts and to export the graph in various formats) and all the genes contained in the network can be further subject to gene set analysis based on GO annotation (see above). In addition, users can use the toolkit of ‘My Network’, where genes or NAT pairs of interest may be stored temporarily on the server side during the session period and later retrieved in the ‘My Network’ page. There is a button to add selected genes or NAT pairs to ‘My Network’ in many pages of the website, which will greatly facilitate users’ digging out specific biological network formed by related NAT pairs involved in regulation of the interrelated process.

SUMMARY AND FUTURE DIRECTIONS

This work presents a comprehensive collection of plant NATs, which are organized and deposited in an online database named PlantNATsDB. The biological function of NAT pairs can be elucidated from the variously integrated data currently available. Moreover, vivid web interfaces are also designed to facilitate the presentation of NATs. PlantNATsDB serves the plant research community by providing a reference database to investigate the functions of NATs. In the near future, PlantNATsDB will collect and include more experimentally validated data and plan to make distinction between experimentally determined and predicted NATs. In addition, more useful and precise algorithms or tools will be designed to evaluate the functions of NAT pairs or to dig out functional NAT pairs based on GO network graphs and NATs-formed regulatory network. For example, it would be helpful to put such a regulatory subnetwork graph to the context of a larger network. Besides, some NAT pairs or subnetworks formed by NATs may be conserved between species. PlantNATsDB intends to allow users to select a specific family and to make comparisons within the family members. As new and improved high-throughput technologies are applied to a broader set of species, cell lines, tissues and conditions, more and more data sets will be generated, PlantNATsDB will be continuously maintained and timely updated to keep up with these improvements. In addition, gene expression data, such as ESTs (expression sequence tags), microarray and RNA-Seq data and degradome-sequencing data (32,33) will be integrated into PlantNATsDB to improve our understanding of the regulatory networks formed by NATs.

FUNDING

The National Natural Sciences Foundation of China (30971743, 31050110121, 31071659); The Ministry of Science and Technology of China (2009DFA32030); Program for New Century Excellent Talents in University of China (NCET-07-0740); Huazhong Agricultural University Scientific & Technological Self-innovation Foundation (2010SC07). Funding for Open Access charge: Partial waiver by Oxford University Press. Conflict of interest statement. None declared.
  33 in total

1.  Endogenous siRNAs derived from a pair of natural cis-antisense transcripts regulate salt tolerance in Arabidopsis.

Authors:  Omar Borsani; Jianhua Zhu; Paul E Verslues; Ramanjulu Sunkar; Jian-Kang Zhu
Journal:  Cell       Date:  2005-12-29       Impact factor: 41.582

2.  Two distinct mechanisms generate endogenous siRNAs from bidirectional transcription in Drosophila melanogaster.

Authors:  Katsutomo Okamura; Sudha Balla; Raquel Martin; Na Liu; Eric C Lai
Journal:  Nat Struct Mol Biol       Date:  2008-09       Impact factor: 15.369

Review 3.  In search of antisense.

Authors:  Giovanni Lavorgna; Dvir Dahary; Ben Lehner; Rotem Sorek; Christopher M Sanderson; Giorgio Casari
Journal:  Trends Biochem Sci       Date:  2004-02       Impact factor: 13.807

4.  An endogenous small interfering RNA pathway in Drosophila.

Authors:  Benjamin Czech; Colin D Malone; Rui Zhou; Alexander Stark; Catherine Schlingeheyde; Monica Dus; Norbert Perrimon; Manolis Kellis; James A Wohlschlegel; Ravi Sachidanandam; Gregory J Hannon; Julius Brennecke
Journal:  Nature       Date:  2008-05-07       Impact factor: 49.962

Review 5.  Small silencing RNAs: an expanding universe.

Authors:  Megha Ghildiyal; Phillip D Zamore
Journal:  Nat Rev Genet       Date:  2009-02       Impact factor: 53.242

6.  Endogenous siRNA and miRNA targets identified by sequencing of the Arabidopsis degradome.

Authors:  Charles Addo-Quaye; Tifani W Eshoo; David P Bartel; Michael J Axtell
Journal:  Curr Biol       Date:  2008-05-08       Impact factor: 10.834

7.  NATsDB: Natural Antisense Transcripts DataBase.

Authors:  Yong Zhang; Jiongtang Li; Lei Kong; Ge Gao; Qing-Rong Liu; Liping Wei
Journal:  Nucleic Acids Res       Date:  2006-11-01       Impact factor: 16.971

8.  Genome-wide prediction and identification of cis-natural antisense transcripts in Arabidopsis thaliana.

Authors:  Xiu-Jie Wang; Terry Gaasterland; Nam-Hai Chua
Journal:  Genome Biol       Date:  2005-03-15       Impact factor: 13.583

9.  Small RNAs and the regulation of cis-natural antisense transcripts in Arabidopsis.

Authors:  Hailing Jin; Vladimir Vacic; Thomas Girke; Stefano Lonardi; Jian-Kang Zhu
Journal:  BMC Mol Biol       Date:  2008-01-14       Impact factor: 2.946

10.  Endogenous siRNAs derived from transposons and mRNAs in Drosophila somatic cells.

Authors:  Megha Ghildiyal; Hervé Seitz; Michael D Horwich; Chengjian Li; Tingting Du; Soohyun Lee; Jia Xu; Ellen L W Kittler; Maria L Zapp; Zhiping Weng; Phillip D Zamore
Journal:  Science       Date:  2008-04-10       Impact factor: 47.728

View more
  36 in total

1.  PLncDB: plant long non-coding RNA database.

Authors:  Jingjing Jin; Jun Liu; Huan Wang; Limsoon Wong; Nam-Hai Chua
Journal:  Bioinformatics       Date:  2013-03-07       Impact factor: 6.937

Review 2.  Long Noncoding RNAs in Plants.

Authors:  Hsiao-Lin V Wang; Julia A Chekanova
Journal:  Adv Exp Med Biol       Date:  2017       Impact factor: 2.622

Review 3.  Mechanisms of small RNA generation from cis-NATs in response to environmental and developmental cues.

Authors:  Xiaoming Zhang; Yifan Lii; Zhigang Wu; Anton Polishko; Huiming Zhang; Viswanathan Chinnusamy; Stefano Lonardi; Jian-Kang Zhu; Renyi Liu; Hailing Jin
Journal:  Mol Plant       Date:  2013-03-15       Impact factor: 13.164

4.  Development of a rapid and inexpensive method to reveal natural antisense transcripts.

Authors:  Silvio Collani; Gianni Barcaccia
Journal:  Plant Methods       Date:  2012-09-12       Impact factor: 4.993

5.  Bioinformatics analysis of small RNAs in pima (Gossypium barbadense L.).

Authors:  Hongtao Hu; Dazhao Yu; Hong Liu
Journal:  PLoS One       Date:  2015-02-13       Impact factor: 3.240

6.  Genome-wide view of natural antisense transcripts in Arabidopsis thaliana.

Authors:  Chunhui Yuan; Jingjing Wang; Andrew P Harrison; Xianwen Meng; Dijun Chen; Ming Chen
Journal:  DNA Res       Date:  2015-04-28       Impact factor: 4.458

Review 7.  Long non-coding RNAs and their biological roles in plants.

Authors:  Xue Liu; Lili Hao; Dayong Li; Lihuang Zhu; Songnian Hu
Journal:  Genomics Proteomics Bioinformatics       Date:  2015-04-30       Impact factor: 7.691

Review 8.  Transcriptional regulation of osmotic stress tolerance in wheat (Triticum aestivum L.).

Authors:  Shabir H Wani; Prateek Tripathi; Abbu Zaid; Ghana S Challa; Anuj Kumar; Vinay Kumar; Jyoti Upadhyay; Rohit Joshi; Manoj Bhatt
Journal:  Plant Mol Biol       Date:  2018-08-14       Impact factor: 4.076

Review 9.  Epigenetic responses to stress: triple defense?

Authors:  Ruben Gutzat; Ortrun Mittelsten Scheid
Journal:  Curr Opin Plant Biol       Date:  2012-09-07       Impact factor: 7.834

10.  Prediction and identification of natural antisense transcripts and their small RNAs in soybean (Glycine max).

Authors:  Hu Zheng; Jiang Qiyan; Ni Zhiyong; Zhang Hui
Journal:  BMC Genomics       Date:  2013-04-24       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.