| Literature DB >> 24595283 |
Yulan Peng1, Xinfen Gao1, Renyuan Li2, Guoxing Cao2.
Abstract
Youngia japonica, a weed species distributed worldwide, has been widely used in traditional Chinese medicine. It is an ideal plant for studying the evolution of Asteraceae plants because of its short life history and abundant source. However, little is known about its evolution and genetic diversity. In this study, de novo transcriptome sequencing was conducted for the first time for the comprehensive analysis of the genetic diversity of Y. japonica. The Y. japonica transcriptome was sequenced using Illumina paired-end sequencing technology. We produced 21,847,909 high-quality reads for Y. japonica and assembled them into contigs. A total of 51,850 unigenes were identified, among which 46,087 were annotated in the NCBI non-redundant protein database and 41,752 were annotated in the Swiss-Prot database. We mapped 9,125 unigenes onto 163 pathways using the Kyoto Encyclopedia of Genes and Genomes Pathway database. In addition, 3,648 simple sequence repeats (SSRs) were detected. Our data provide the most comprehensive transcriptome resource currently available for Y. japonica. C4 photosynthesis unigenes were found in the biological process of Y. japonica. There were 5596 unigenes related to defense response and 1344 ungienes related to signal transduction mechanisms (10.95%). These data provide insights into the genetic diversity of Y. japonica. Numerous SSRs contributed to the development of novel markers. These data may serve as a new valuable resource for genomic studies on Youngia and, more generally, Cichoraceae.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24595283 PMCID: PMC3942458 DOI: 10.1371/journal.pone.0090636
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sequencing and de novo assembling of plant transcriptome in Youngia japonica.
| Transcript Length | Total Number | Percentage |
| 200–300 | 14937 | 16.68% |
| 300–500 | 18703 | 20.88% |
| 500–1000 | 21949 | 24.51% |
| 1000–2000 | 23755 | 26.52% |
| 2000+ | 10220 | 11.41% |
| Total number | 89564 | |
| Total length | 88908044 | |
| N50 length | 1497 | |
| Mean length | 992.6761199 |
Figure 1Overview of the Youngia japonica transcriptome assembly. (A) Size distribution of contigs; (B) size distribution of unigenes.
Functional annotation of the Youngia japonica transcriptome.
| Anno Database | Annotated Number | 300≤length<1000 | length≥1000 |
| OG Annotation | 12269 | 4843 | 5639 |
| GO Annotation | 30610 | 13968 | 11594 |
| KEGG Annotation | 9125 | 4084 | 3179 |
| Swissprot Annotation | 28594 | 12826 | 11253 |
| TrEMBL Annotation | 36310 | 17277 | 12902 |
| Nr Annotation | 36517 | 17388 | 12889 |
| Nt Annotation | 27929 | 12052 | 11356 |
| All Annotated | 37734 | 17991 | 12968 |
Figure 2Functional annotation of assembled sequences based on gene ontology (GO) categorization.
Figure 3Clusters of orthologous group (COG) classification.
Figure 4KEGG biochemical mappings for Youngia japonica.
Summary of simple sequence repeat (SSR) types in Youngia japonica.
| Searching item | Numbers |
| Total number of sequences examined | 10251 |
| Total number of identified SSRs | 3648 |
| Number of sequences containing more than one SSR | 564 |
| Number of SSRs present in compound formation | 203 |
| Mono-nucleotide | 1341 |
| Di-nucleotide | 589 |
| Tri-nucleotide | 1466 |
| Tetra-nucleotide | 45 |
| Penta-nucleotide | 4 |
| Hexa-nucleotide | 0 |
Figure 5Characteristics of homology search of Youngia japonica unigenes.
(A) E-value distribution of the top Blastx hits against the non-redundant (Nr) protein database for each unigene; (B)E-value distribution of the top Blastn hits against the NCBI non-redundant nucleotide sequence (Nt) database; (C) number of unigenes matching the top nine species using Blastx in the Nr database.
Similarity distribution of unigenes of Youngia japonica aligned in the Nt and Nr databases.
| Similarity range | Number of unigenes (Nr) | Percent of unigenes (Nr) | Number of unigenes (Nt) | Percent of unigenes (Nt) |
| 23%–40% | 2382 | 6.52% | 0 | 0.00% |
| 40%–60% | 9666 | 26.48% | 0 | 0.00% |
| 60%–80% | 14144 | 38.74% | 1887 | 6.76% |
| 80%–100% | 8240 | 22.57% | 24199 | 86.69% |
| 100% | 2076 | 5.69% | 1827 | 6.55% |