Eun Soo Seong1, Ji Hye Yoo2, Jae Hoo Choi2, Chang Heum Kim2, Mi Ran Jeon2, Byeong Ju Kang2, Jae Geun Lee3, Seon Kang Choi4, Bimal Kumar Ghimire5, Chang Yeon Yu2.
Abstract
Perilla frutescens is valuable as a medicinal plant as well as a natural medicine and functional food. However, comparative genomics analyses of P. frutescens are limited due to a lack of gene annotations and characterization. A full-length cDNA library from P. frutescens leaves was constructed to identify functional gene clusters and probable EST-SSR markers via analysis of 1,056 expressed sequence tags (ESTs). Unigenes were assembled and annotated using basic local alignment search tool (BLAST) homology searches and Gene Ontology (GO). Primer pairs were designed for a total of 18 simple sequence repeats (SSRs). This study is the first to report comparative genomics and EST-SSR markers from P. frutescens; the results will help gene discovery and provide an important source for functional genomics and molecular genetic research in this medicinal plant.
Year: 2015 PMID: 26664999 PMCID: PMC4668317 DOI: 10.1155/2015/679548
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Figure 1. Read length distribution from EST (expressed sequence tag) sequencing of Perilla frutescens. Read lengths ranged from 121 bp to 1,051 bp.
Figure 2. GC content distribution of unigenes. The GC content of unigenes ranged from 29.45% to 61.32%.
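GC content is the share of G and C bases in a sequence, expressed as a percentage. The following is a minimal sketch of that calculation, not the authors' pipeline. The function name `gc_content` is hypothetical, and the sketch assumes the sequence contains only unambiguous bases (no `N` or IUPAC codes). The example input is a primer sequence listed in the EST-SSR primer table of this record, used only because the unigene sequences themselves are not included here.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Assumption: the sequence contains only A, C, G and T; ambiguous
    bases would be counted in the length but never as G or C.
    """
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Left primer of Contig 67, taken from the EST-SSR primer table.
example = "AGCAACTGCGGGTAGCTAGA"
print(f"GC content: {gc_content(example):.2f}%")
```

Applied to every assembled unigene, the minimum and maximum of these values would give a range comparable to the one reported in the Figure 2 caption.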
Unigenes from EST (expressed sequence tag) sequencing of Perilla frutescens annotated against different databases.
| Annotation DB (method) | Hits | Hits (%) | No hits | No hits (%) |
|---|---|---|---|---|
| NT (BLASTn) | 312 | 90.96% | 31 | 9.04% |
| NR (BLASTx) | 322 | 93.88% | 21 | 6.12% |
| Uniprot + Swissprot (BLASTx) | 317 | 92.42% | 26 | 7.58% |
| COG (BLASTx) | 111 | 32.36% | 232 | 67.64% |
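In every row of the table, hits plus no hits add up to 343 unigenes, and each percentage is the corresponding count divided by that total. The short sketch below reproduces the arithmetic from the hit counts alone. The names `TOTAL_UNIGENES`, `HITS` and `hit_rates` are illustrative and do not come from the source.

```python
# Hits + no hits in each row of the annotation table (e.g. 312 + 31).
TOTAL_UNIGENES = 343

# Hit counts per annotation database, copied from the table.
HITS = {
    "NT (BLASTn)": 312,
    "NR (BLASTx)": 322,
    "Uniprot + Swissprot (BLASTx)": 317,
    "COG (BLASTx)": 111,
}


def hit_rates(hits, total):
    """Map each database to (hit %, no-hit %), rounded to two decimals."""
    return {
        db: (round(100 * n / total, 2), round(100 * (total - n) / total, 2))
        for db, n in hits.items()
    }


for db, (hit_pct, miss_pct) in hit_rates(HITS, TOTAL_UNIGENES).items():
    print(f"{db}: {hit_pct}% hits, {miss_pct}% no hits")
```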
List of species containing sequence matches to Perilla frutescens.
| Species (total: 38) | Genes (total: 322) |
|---|---|
|
| 185 |
|
| 78 |
|
| 6 |
|
| 4 |
|
| 4 |
|
| 4 |
|
| 3 |
|
| 2 |
|
| 2 |
|
| 2 |
|
| 2 |
|
| 2 |
|
| 2 |
|
| 2 |
|
| |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
|
| 1 |
Figure 3. Classification of unigenes by Gene Ontology (GO) analysis. Annotated genes are shown in three major clusters at hierarchy level 2 of the GO analysis.
EST-SSR primer pairs designed from the EST (expressed sequence tag) sequence database of Perilla frutescens.
| Number | Unigene ID | Repeat motif (repeat count) | Left primer sequence | Left primer Tm (°C) | Right primer sequence | Right primer Tm (°C) | Product size (bp) | Annotation (NR DB) |
|---|---|---|---|---|---|---|---|---|
| 1 | Contig 17 | ATCAT(8) | GAGAGTATAAACAAATCCAAAACAGC | 58.795 | AGCCGGTATATCCAATTCCC | 60.006 | 562 | PREDICTED: protein CURVATURE THYLAKOID 1B, chloroplastic [Sesamum indicum] |
| 2 | Contig 67 | A(29) | AGCAACTGCGGGTAGCTAGA | 60.176 | CAATCCGACCACAGTTGATG | 59.96 | 172 | PREDICTED: photosystem I subunit O [Sesamum indicum] |
| 3 | Perilla-1-1a_pTriplEx2-seq_C16 | GA(16) | AGCGTACTGTTGAAAGCGTG | 59.148 | CAGCAAACGTGCTCGAATTA | 60.014 | 247 | PREDICTED: uncharacterized protein LOC105172991 isoform X2 [Sesamum indicum] |
| 4 | Perilla-1-1a_pTriplEx2-seq_E18 | CTT(9) | GCCAATTTGAAGCTTTAGCC | 58.969 | GAATGTGAAGTGGGAACGCT | 60.119 | 773 | PREDICTED: GRF1-interacting factor 3 [Sesamum indicum] |
| 5 | Perilla-1-1a_pTriplEx2-seq_M02 | AGAATG(4) | TGGAGCAAGTGAAGCAACAG | 60.175 | CCTTTTCAGTGAGGAGCCAG | 59.982 | 191 | |
| 6 | Perilla-1-2a_pTriplEx2-seq_A02 | TTTTG(7) | AATGATGGGTGTGATGAGCA | 59.925 | AAAGAATTTGAAGGCGCAGA | 59.96 | 401 | PREDICTED: homeobox-leucine zipper protein HAT5-like [Sesamum indicum] |
| 7 | Perilla-1-3a_pTriplEx2-seq_B13 | ATCAT(8) | GAGAGTATAAACAAATCCAAAACAGC | 58.795 | CGGTATATCCAATTCCCACG | 60.031 | 559 | hypothetical protein MIMGU_mgv1a015066mg [Erythranthe guttata] |
| 8 | Perilla-1-3a_pTriplEx2-seq_L05 | CT(14) | CCCAAATTCACATCCACTGA | 59.343 | AACAACTGACATGGCCTTCC | 59.973 | 185 | PREDICTED: uncharacterized protein LOC105160440 [Sesamum indicum] |
| 9 | Perilla-1-3a_pTriplEx2-seq_L15 | TC(15) | CAGTTTTAACTTCGCCTCGC | 60.018 | CACTCGCAAAAAGGGGTAAG | 59.741 | 619 | PREDICTED: annexin D5 [Sesamum indicum] |
| 10 | Perilla-2-1a_pTriplEx2-seq_A19 | GGA(8) | GCTCCTCGCAGTAACTTTGG | 60.015 | TCATCTCTTGCTCTGTTTCCA | 58.583 | 107 | hypothetical protein MIMGU_mgv1a016040mg [Erythranthe guttata] |
| 11 | Perilla-2-2a_pTriplEx2-seq_A06 | CT(12) | CATTGGCCTTAAACTTCGGA | 60.067 | ATAAATGTGGATTGGGGCAA | 60.016 | 341 | hypothetical protein MIMGU_mgv1a020048mg [Erythranthe guttata] |
| 12 | Perilla-2-2a_pTriplEx2-seq_C18 | AG(14) | GGGGGATCATTTCCAGTCTT | 60.133 | GTGCCCACTGGTTCTTTGTT | 60.012 | 404 | hypothetical protein MIMGU_mgv1a012334mg [Erythranthe guttata] |
| 13 | Perilla-3-2a_pTriplEx2-seq_E14 | GATGACGATGAT(2) | CTTTCCAACCCTCCGAATTT | 60.291 | CGACGCCTGTCTCATCTACA | 60.008 | 522 | NAC transcription factor 1 [Salvia miltiorrhiza] |
| 14 | Perilla-3-2a_pTriplEx2-seq_O22 | GA(17) | GGGGATATGTTATGTTGCTTGTT | 59.179 | TCGCCGTACTTGATCCCTAC | 60.096 | 514 | PREDICTED: uncharacterized protein LOC105169169 isoform X1 [Sesamum indicum] |
| 15 | Perilla-3-3a_pTriplEx2-seq_B03 | CT(16) | CGAGTGTGTTCGTATGGGTG | 60.025 | AACGCGTACGGAACAGAGAC | 60.321 | 184 | hypothetical protein MIMGU_mgv1a014836mg [Erythranthe guttata] |
| 16 | Perilla-3-3a_pTriplEx2-seq_B23 | TCCTCTTCCTCTCC(2) | TAGTGTCGAAGCTCAATGGC | 59.028 | TGACCAGCATCAGCTTTCAC | 59.992 | 662 | PREDICTED: chloroplast stem-loop binding protein of 41 kDa a, chloroplastic [Sesamum indicum] |
| 17 | Perilla-3-3a_pTriplEx2-seq_F11 | GAG(9) | GAAAGACTGGTTGGCTCTGG | 59.844 | ATCCAAAATTCGTCCTGTGC | 59.939 | 381 | hypothetical protein MIMGU_mgv1a011207mg [Erythranthe guttata] |
| 18 | Perilla-3-3a_pTriplEx2-seq_P03 | GA(13) | AAAGCTGTTTGCCCTTGCTA | 60.018 | CTCAAATGGAGTCACGCAGA | 59.984 | 284 | hypothetical protein MIMGU_mgv1a016830mg [Erythranthe guttata] |
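The repeat motifs in the table are written as a unit followed by its repeat count, for example `GA(16)`. As an illustration of how such repeats can be located in a sequence, the sketch below scans for perfect tandem repeats with a regular expression. It is not the authors' method: this record does not name the SSR search tool or its thresholds, so the `MIN_REPEATS` values, the function names and the test sequence are all hypothetical.

```python
import re

# Hypothetical minimum repeat counts per motif length (1-6 bp).
# The thresholds actually used in the study are not given in this record.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_compound(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. 'AA', 'GAGA')."""
    n = len(motif)
    return any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect tandem repeats."""
    seq = seq.upper()
    found = []
    for k, min_rep in sorted(min_repeats.items()):
        # A k-base unit followed by at least (min_rep - 1) further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        i = 0
        while i < len(seq):
            m = pattern.match(seq, i)
            if m and not is_compound(m.group(1)):
                found.append((i, m.group(1), len(m.group(0)) // k))
                i = m.end()  # continue after the recorded repeat
            else:
                i += 1  # no usable repeat starts here
    return sorted(found)


# Synthetic test sequence (not a Perilla EST): a GA(16) and an A(12) repeat.
demo = "TTGC" + "GA" * 16 + "CCT" + "A" * 12 + "GT"
for start, motif, count in find_ssrs(demo):
    print(f"{motif}({count}) at position {start}")
```

Skipping compound motifs keeps a run such as `GAGAGAGA...` from also being reported as `GAGA` or `GAGAGA` repeats. Longer units like the 12- and 14-base motifs in rows 13 and 16 fall outside the 1-6 bp range assumed here and would need additional entries in `MIN_REPEATS`.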