| Literature DB >> 26029219 |
Gang Wu1, Libin Zhang1, Yongtai Yin1, Jiangsheng Wu2, Longjiang Yu1, Yanhong Zhou1, Maoteng Li1.
Abstract
Raphanus sativus is an important Brassicaceae plant and also an edible vegetable with great economic value. However, currently there is not enough transcriptome information of R. sativus tissues, which impedes further functional genomics research on R. sativus. In this study, RNA-seq technology was employed to characterize the transcriptome of leaf tissues. Approximately 70 million clean pair-end reads were obtained and used for de novo assembly by Trinity program, which generated 68,086 unigenes with an average length of 576 bp. All the unigenes were annotated against GO and KEGG databases. In the meanwhile, we merged leaf sequencing data with existing root sequencing data and obtained better de novo assembly of R. sativus using Oases program. Accordingly, potential simple sequence repeats (SSRs), transcription factors (TFs) and enzyme codes were identified in R. sativus. Additionally, we detected a total of 3563 significantly differentially expressed genes (DEGs, P = 0.05) and tissue-specific biological processes between leaf and root tissues. Furthermore, a TFs-based regulation network was constructed using Cytoscape software. Taken together, these results not only provide a comprehensive genomic resource of R. sativus but also shed light on functional genomic and proteomic research on R. sativus in the future.Entities:
Keywords: RNA sequencing; Raphanus sativus; simple sequence repeats; transcription factor; transcriptome
Year: 2015 PMID: 26029219 PMCID: PMC4428447 DOI: 10.3389/fpls.2015.00198
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Sequencing, The clean reads distribution map of leaf tissue on B. rapa genome. (B) The de novo assembly result of leaf transcriptome. (C) Comparative analysis of unigenes length with or without hit against Nr and Swissprot databases. (D) E-value distribution of BLAST hits for each unique sequence against the Nr database. (E) Similarity distribution of the top BLAST hits for each sequence against the Nr database. (F) Species distribution is shown as a percentage of the total homologous sequences against the Nr database. (G) Comparative analysis of the leaf unigenes with the unigene and mRNA databases of R. sativus.
Blast results of the assembled leaf unigenes of .
| Nr | 52,677(77.37%) | ||
| Nt | 59,115(86.82%) | ||
| Pfam | 68,086 | 60,685 | 43,447(63.80%) |
| SwissProt | 35,915(52.75%) | ||
| Trembl | 52,525(77.15%) |
.
| Number of clean reads | 125,591,478 | 125,591,478 |
| Total unigenes generated | 103,222 | 106,874 |
| N50 length (bp) | 1250 | 1770 |
| Average unigene length (bp) | 786 | 1108 |
| Length of longest unigene | 7884 | 15,647 |
Blast results of Oases-assembled .
| Nr | 106,874 | 76,520 (71.6%) |
| COG | 42,171 (39.5%) | |
| Pfam | 62,843 (58.8%) | |
| SwissProt | 62,497 (58.5%) |
Figure 2The GO annotation of .
Figure 3Biological Process analysis of differentially expressed genes between leaf and root tissues. GO modules enriched with up-regulated DEGs (A) and down-regulated DEGs (B) were visualized by the Enrichment Map in Cytoscape. The red and yellow circles indicate the common and different biological processes between up-regulated and down-regulated DEGs, respectively.
Figure 4The SSR mining results in . The profiles of different SSR types in R. sativus.
SSRs distribution in the unigenes of .
| Di-nucleotide | 7431 | 7959 | 355 | 15,745 |
| Tri-nucleotide | 9373 | 6182 | 0 | 15,555 |
| Tetra-nucleotide | 540 | 12 | 0 | 552 |
| Penta-nucleotide | 13 | 0 | 0 | 13 |
| Hexa-nucleotide | 10 | 0 | 0 | 10 |
Figure 5Discovery of the transcripts encoding transcription factors in Length distribution of potential transcription factor genes. (B) Transcripts distribution in various transcription factor families.
Figure 6Construction of TFs-based regulation network by Cytoscape software. Sky blue and red circles represent identified TFs and genes involved in the network (A–C), respectively. The blue and green circles represent up-regulated and down-rugulated TFs in leaf.
Figure 7Functional characterization and classification of potential enzyme genes in the transcripts of .