Shaohua Zeng, Gong Xiao, Juan Guo, Zhangjun Fei, Yanqin Xu, Bruce A Roe, Ying Wang.
Abstract
BACKGROUND: Epimedium sagittatum (Sieb. et Zucc.) Maxim, a traditional Chinese medicinal plant species, has been used extensively as a genuine medicinal material. Certain Epimedium species are endangered due to commercial overexploitation, while sustainable application, conservation genetics, systematics, and marker-assisted selection (MAS) of Epimedium remain understudied because molecular markers are lacking. Here, we report a set of expressed sequence tags (ESTs) and the simple sequence repeats (SSRs) identified in these ESTs for E. sagittatum.
Year: 2010 PMID: 20141623 PMCID: PMC2829513 DOI: 10.1186/1471-2164-11-94
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Size distribution of E. sagittatum ESTs before and after assembly
| Items | 1-100 bp | 101-200 bp | 201-300 bp | 301-400 bp | >400 bp |
|---|---|---|---|---|---|
| Cleaned EST sequence | 17,801 | 50,516 | 147,708 | 1,334 | 21 |
| Singlet | 6,720 | 14,171 | 37,444 | 875 | 18 |
| Contig | 163 | 1,084 | 5,461 | 5,026 | 5,497 |
| Consensus | 6,883 | 15,255 | 42,905 | 5,901 | 5,515 |
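The Consensus row appears to be the per-bin sum of the Singlet and Contig rows. A minimal sketch that checks this arithmetic, using only the counts copied from the table (the variable names are illustrative, not from the source):

```python
# Counts per length bin, copied from the table above:
# 1-100, 101-200, 201-300, 301-400, >400 bp.
singlets = [6720, 14171, 37444, 875, 18]
contigs = [163, 1084, 5461, 5026, 5497]
consensus = [6883, 15255, 42905, 5901, 5515]

# Check that singlets + contigs reproduces the consensus row bin by bin.
combined = [s + c for s, c in zip(singlets, contigs)]
bins_match = combined == consensus

# Row totals, useful for cross-checking against the other tables.
total_singlets = sum(singlets)
total_contigs = sum(contigs)
total_consensus = sum(consensus)

print(bins_match, total_singlets, total_contigs, total_consensus)
```

The contig total computed here is the denominator to compare against the reads-per-consensus table.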
Number of ESTs in the assembled consensus sequences
| Number of reads per consensus | Number of consensuses |
|---|---|
| 2 | 7,509(43.6%) |
| 3 | 3,074(17.8%) |
| 4 | 1,600(9.3%) |
| 5 | 1,029(6.0%) |
| 6 | 667(3.9%) |
| 7 | 499(2.9%) |
| 8 | 385(2.2%) |
| 9 | 267(1.5%) |
| 10 | 192(1.1%) |
| 11-15 | 688(4.0%) |
| 16-20 | 352(2.0%) |
| 21-25 | 204(1.1%) |
| 26-30 | 138(0.8%) |
| 31-35 | 94(0.5%) |
| 36-40 | 81(0.5%) |
| 41-45 | 60(0.3%) |
| 46-50 | 43(0.2%) |
| >51 | 349(2.0%) |
Note: the numbers in parentheses are the percentages of all contigs that contain the given number of reads.
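As a rough check of the percentages in parentheses, the sketch below recomputes each share from the counts in the table, assuming the denominator is the sum of all rows (the names `contig_counts` and `pct` are illustrative):

```python
# (reads per consensus, number of contigs), copied from the table above.
contig_counts = [
    ("2", 7509), ("3", 3074), ("4", 1600), ("5", 1029), ("6", 667),
    ("7", 499), ("8", 385), ("9", 267), ("10", 192), ("11-15", 688),
    ("16-20", 352), ("21-25", 204), ("26-30", 138), ("31-35", 94),
    ("36-40", 81), ("41-45", 60), ("46-50", 43), (">51", 349),
]

# Assumed denominator: the total number of contigs across all rows.
total = sum(n for _, n in contig_counts)

# Percentage of all contigs in each reads-per-consensus class, to one decimal.
pct = {label: round(100 * n / total, 1) for label, n in contig_counts}

for label, n in contig_counts:
    print(f"{label:>6}  {n:>6,}  {pct[label]:.1f}%")
```

Under this assumption the row total equals the contig total of the size-distribution table, which supports reading the percentages as shares of contigs rather than of all consensus sequences.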
Figure 1. Pie chart representations of GO-annotation results of 22,295 . The total numbers of Epimedium consensus sequences annotated for each category are 12,242 for Biological Process (A), 13,807 for Molecular Function (B), and 12,308 for Cellular Component (C). Since one gene product can be assigned to more than one GO term, the total percentage in each category can exceed 100%.
Frequencies of repeat type with repeat numbers in EST-SSRs from E. sagittatum
| Motif length (rows) / Repeat number (columns) | 4 | 5 | 6 | 7 | 8 | 9 | 10 | >10 | Total | % |
|---|---|---|---|---|---|---|---|---|---|---|
| Di | - | - | 234 | 171 | 147 | 78 | 58 | 167 | 855 | 30.4 |
| Tri | - | 1,010 | 353 | 99 | 58 | 11 | 12 | 9 | 1,552 | 55.2 |
| Tetra | 156 | 41 | 4 | 1 | 2 | 0 | 0 | 0 | 204 | 7.3 |
| Penta | 53 | 5 | 1 | 1 | 2 | 0 | 0 | 0 | 62 | 2.2 |
| Hexa | 124 | 7 | 3 | 2 | 1 | 0 | 0 | 0 | 137 | 4.9 |
| total | 333 | 1,063 | 595 | 274 | 210 | 89 | 70 | 176 | 2,810 | |
| % | 11.9 | 37.8 | 21.2 | 9.8 | 7.5 | 3.2 | 2.5 | 6.3 | ||
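To illustrate how perfect SSRs of this kind can be located in a sequence, here is a minimal regex-based sketch. The minimum repeat counts (6 for di-, 5 for tri-, 4 for tetra-, penta-, and hexanucleotide motifs) are an assumption inferred from the smallest populated repeat number in each row of the table; the tool and exact criteria used by the authors are not stated in this excerpt, and `find_ssrs` is a hypothetical helper name.

```python
import re

# Assumed thresholds (motif length -> minimum repeat count), inferred from
# the table above; not confirmed as the authors' search parameters.
MIN_REPEATS = {2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One motif copy followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit,
            # e.g. "AA" (mononucleotide run) or "AGAG" (really "AG").
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)


example = "TTGC" + "AG" * 7 + "CCATT" + "AAG" * 5 + "GGC"
ssrs = find_ssrs(example)
print(ssrs)
```

This sketch reports only perfect repeats and does not merge compound or interrupted SSRs; tallying its output by motif length and repeat count would give a table with the same layout as the one above.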
Frequencies of different repeat motifs of di- and trinucleotide repeats in EST-SSRs from E. sagittatum
| Repeat motif (rows) / Repeat number (columns) | 5 | 6 | 7 | 8 | 9 | 10 | >10 | Total | % |
|---|---|---|---|---|---|---|---|---|---|
| AG/CT | - | 128 | 103 | 103 | 56 | 43 | 110 | 543 | 19.3 |
| AT/AT | - | 47 | 52 | 33 | 15 | 11 | 53 | 211 | 7.5 |
| AC/GT | - | 59 | 16 | 11 | 7 | 4 | 4 | 101 | 3.6 |
| AAG/CTT | 384 | 171 | 47 | 39 | 7 | 9 | 6 | 663 | 23.6 |
| ACC/GGT | 234 | 55 | 11 | 10 | 2 | 0 | 312 | 11.1 | |
| AAC/GTT | 116 | 23 | 18 | 5 | 1 | 1 | 1 | 165 | 5.9 |
| AGC/CGT | 58 | 25 | 9 | 1 | 0 | 93 | 3.3 | ||
| ACG/CTG | 51 | 16 | 3 | 1 | 0 | 71 | 2.5 | ||
| AGG/CCT | 37 | 26 | 4 | 0 | 67 | 2.4 | |||
| AAT/ATT | 47 | 14 | 3 | 1 | 0 | 65 | 2.3 | ||
| ACT/ATG | 45 | 7 | 2 | 1 | 1 | 1 | 57 | 2 | |
| AGT/ATC | 35 | 16 | 2 | 1 | 1 | 1 | 56 | 2 | |
| CCG/CGG | 3 | 3 | 0.1 | ||||||
Figure 2. Polyacrylamide gel electrophoresis of PCR fragments amplified by the EsESP30 primer pair. Lanes 1-55 correspond to the species listed in Additional file 3. M indicates the 25 bp ladder marker.