| Literature DB >> 30365506 |
Delin Xu1,2, Hongbo Chen1, Murat Aci2, Yinchi Pan1, Yanni Shangguan1, Jie Ma1, Lin Li1, Gang Qian1, Qianxing Wang1.
Abstract
Bletilla striata is an endangered orchid that has been used for millennia as a medicinal herb, in cosmetics and as a horticultural plant. To construct the first nucleotide database for this species and to develop abundant EST-SSR markers for facilitating further studies, various tissues and organs of plants in the main developmental stages were harvested for mRNA isolation and subsequent RNA sequencing. A total of 106,054,784 clean reads were generated by using Illumina paired-end sequencing technology. The reads were assembled into 127,261 unigenes by the Trinity package; the unigenes had an average length of 612 bp and an N50 of 957 bp. Of these unigenes, 67,494 (51.86%) were annotated in a series of databases. Of these annotated unigenes, 41,818 and 24,615 were assigned to gene ontology categories and clusters of orthologous groups, respectively. Additionally, 20,764 (15.96%) unigenes were mapped onto 275 pathways using the KEGG database. In addition, 25,935 high-quality EST-SSR primer pairs were developed from the 15,433 unigenes by MISA mining. To validate the accuracy of the newly designed markers, 87 of 100 randomly selected primers were effectively amplified; 63 of those yielded PCR products of the expected size, and 25 yielded products with significant amounts of polymorphism among the 4 landraces. Furthermore, the transferability test of the 25 polymorphic markers was performed in 6 individuals of two closely related genus Phalaenopsis and dendrobium. Which results showed a total of 5 markers can successfully amplified among these populations. This research provides a comprehensive nucleotide database and lays a solid foundation for functional gene mining and genomic research in B. striata. The developed EST-SSR primers could facilitate phylogenetic studies and breeding.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30365506 PMCID: PMC6203367 DOI: 10.1371/journal.pone.0205954
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of the analysis of the de novo assembly and EST-SSR identification.
| Category | Items | Description |
|---|---|---|
| Raw reads | Total raw reads | 109,714,290 |
| Clean reads | Total clean reads | 106,054,784 |
| Total clean nucleotides (nt) | 13.26 G | |
| Q20 (%) | 95.69% | |
| Q30 (%) | 91.49% | |
| GC content (%) | 47.87% | |
| Unigenes | Total sequences | 127,261 |
| Total sequence bases (Kb) | 79,698 | |
| Largest (bp) | 21,950 | |
| Smallest (bp) | 201 | |
| Average (bp) | 612 | |
| N50 (bp) | 957 | |
| N90 (bp) | 249 | |
| Annotated | 67,494 | |
| EST-SSR | Total number of tested sequences | 127,261 |
| Total size of tested sequences (Kb) | 79,698 | |
| Total number of identified SSRs | 18,335 | |
| Number of SSR-containing sequences | 15,433 | |
| Number of sequences containing more than one SSR | 2,427 | |
| Number of SSRs present in compound formation | 778 |
Fig 1Length distribution of assembled unigenes.
The x-axis indicates the number of unigenes, and the y-axis indicates the length range.
Fig 2GO classification of unigenes in the categories of Biological Process, Cellular Component, and Molecular Function.
The left y-axis shows the number of unigenes annotated in each subcategory. The abscissa axis represents the subterms of Biological Process, Cellular Component, and Molecular Function, three different GO categories.
Fig 3KOG function classification of assembled unigenes.
The x-axis represents the 26 KOG groups. The y-axis shows the number of unigenes in each KOG functional category.
Characterization of SSR analysis in assembled unigenes.
| Repeat type | SSR number | Proportion | Distribution density | Number of motif | Main repeat |
|---|---|---|---|---|---|
| Mono- | 9,128 | 49.78 | 1/8.87 kb | 2 | A/T (98.55%) |
| Di- | 4,884 | 26.64 | 1/16.57 kb | 4 | AG/CT (67.28%) |
| Tri- | 4,166 | 22.72 | 1/19.42 kb | 10 | AAG/CTT (24.48%) |
| Tetra- | 116 | 0.63 | 1/697.62 kb | 28 | AAAT/ATTT (25.86%) |
| Penta- | 19 | 0.10 | 1/4259.13 kb | 15 | AAGAG/CTCTT (15.78%) |
| Hexa- | 22 | 0.12 | 1/3678.34 kb | 21 | - |
| Total | 18,335 | 100 | 1/4.41 kb | 78 | - |