| Literature DB >> 26571372 |
Yaling Liu1,2, Pengfei Zhang2, Meiling Song2, Junling Hou1, Mei Qing3, Wenquan Wang1,4, Chunsheng Liu1.
Abstract
Licorice is an important traditional Chinese medicine with clinical and industrial applications. Genetic resources of licorice are insufficient for analysis of molecular biology and genetic functions; as such, transcriptome sequencing must be conducted for functional characterization and development of molecular markers. In this study, transcriptome sequencing on the Illumina HiSeq 2500 sequencing platform generated a total of 5.41 Gb clean data. De novo assembly yielded a total of 46,641 unigenes. Comparison analysis using BLAST showed that the annotations of 29,614 unigenes were conserved. Further study revealed 773 genes related to biosynthesis of secondary metabolites of licorice, 40 genes involved in biosynthesis of the terpenoid backbone, and 16 genes associated with biosynthesis of glycyrrhizic acid. Analysis of unigenes larger than 1 Kb with a length of 11,702 nt presented 7,032 simple sequence repeats (SSR). Sixty-four of 69 randomly designed and synthesized SSR pairs were successfully amplified, 33 pairs of primers were polymorphism in in Glycyrrhiza uralensis Fisch., Glycyrrhiza inflata Bat., Glycyrrhiza glabra L. and Glycyrrhiza pallidiflora Maxim. This study not only presents the molecular biology data of licorice but also provides a basis for genetic diversity research and molecular marker-assisted breeding of licorice.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26571372 PMCID: PMC4646622 DOI: 10.1371/journal.pone.0143017
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Assembly results.
| Length Range (nt) | Contigs | Transcript | Unigene |
|---|---|---|---|
| 0–100 | 2960407 (95.05%) | 0 | 0 |
| 100–200 | 98089 (3.15%) | 0 | 0 |
| 200–300 | 19013 (0.61%) | 17,951 (20.58%) | 15,287 (32.78%) |
| 300–500 | 14,149 (0.45%) | 15,279 (17.51%) | 11,103 (23.81%) |
| 500–1000 | 10,723 (0.34%) | 17,416 (19.96) | 8,549 (18.33%) |
| 1000–2000 | 8,157 (0.26%) | 21,904 (25.11%) | 7,532 (16.15%) |
| 2000+ | 4,100 (0.13%) | 14,692 (16.84%) | 4,170 (8.94%) |
| Total Number | 3,114,638 | 87,242 | 46,641 |
| Total Length | 158,837,163 | 97,878,454 | 36,725,337 |
| N50 Length | 48 | 1,816 | 1,395 |
| Mean Length | 51.00 | 1121.92 | 787.40 |
Fig 1Frequency distribution of transcripts.
Fig 2Frequency distribution of unigenes.
Fig 3Functional annotation of unigenes based on GO categories.
Fig 4Functional classification of licorice based on COG.
Fig 5Distribution of pathway based on the KEGG database.
Enzymes related to glycyrrhizic acid metabolic pathways.
| EC | Enzyme name | Gene |
|---|---|---|
| EC2.3.1.9 | acetyl-CoA C-acetyltransferase | c12403.graph_c0 & c35153.graph_c0 |
| EC2.3.3.10 | hydroxymethylglutaryl-CoA synthase | c30007.graph_c0 |
| EC1.1.1.34 | hmg-CoA reductase | c33876.graph_c0 & c19615.graph_c0 & c21217.graph_c0 |
| EC1.1.1.88 | hydroxymethylglutaryl-CoA reductase | 0 |
| EC2.7.1.36 | mevalonate kinase | c29677.graph_c0 |
| EC2.7.4.2 | phosphomevalonate kinase | 0 |
| EC4.1.1.33 | diphosphomevalonate decarboxylase | c25954.graph_c0 |
| EC2.2.1.7 | 1-deoxyxylulose-5-phosphate synthase | c31138.graph_c0 & c34113.graph_c0 |
| EC1.1.1.267 | DXP reductiosomerase | 0 |
| EC2.7.7.60 | MEP cytidylytransferase | 0 |
| EC2.7.1.148 | 4-diphosphocytidyl-2-C-methyl-d-erythritol kinase | c27328.graph_c0 |
| EC4.6.1.12 | 2-C-methyl-D-erythritol 2,4-cyclodiphosphate synthase | c20723.graph_c0 |
| EC1.17.7.1 | 4-hydroxy-3-methylbut-2-enyl-diphosphate synthase | 0 |
| EC1.17.1.2 | 1-hydroxy-2-methyl-2-(E)-butenyl 4-diphosphate reductase | c26327.graph_c0 |
| EC2.5.1.10 | farnesyl diphosphate synthase | 0 |
| EC2.5.1.1 | dimethylallyl transtransferase | 0 |
| EC2.5.1.21 | farnesyl-diphosphate farnesyltransferase | c16912.graph_c0 |
| EC1.14.99.7 | squalene monooxygenase | c37309.graph_c0 & c5057.graph_c0 |
| EC5.4.99.- | β-amyrin synthase | 0 |
| EC5.3.3.2 | isopentenyl-PP isomerase | 0 |
Numbers of SSR repeat types in licorice.
| Repeat type | Number |
|---|---|
| Mono-nucleotide | 3394 |
| Di-nucleotide | 1692 |
| Tri-nucleotide | 1814 |
| Tetra-nucleotide | 101 |
| Penta-nucleotide | 19 |
| Hexa-nucleotide | 12 |
| All of SSR locus number | 7032 |
| Number of sequence contain SSR loci | 4861 |
| Number of sequence contain more than SSR loci | 1547 |
| Total number of sequences examined | 11702 |
| Total size of examined sequences (bp) | 22739272 |
| Total number of identified SSRs | 7032 |
| Number of SSR containing sequences | 4861 |
| Number of SSRs present in compound formation | 595 |
Amounts of different SSR repeat motifs in licorice.
| Motif | Repeat number | Frequency (%) |
|---|---|---|
| A/T | 3251 | 46.23 |
| C/G | 143 | 2.03 |
| AC/GT | 339 | 4.82 |
| AG/CT | 1114 | 15.84 |
| AT/AT | 238 | 3.38 |
| CG/CG | 1 | 0.01 |
| AAC/GGT | 320 | 4.55 |
| AAG/CTT | 438 | 6.23 |
| AAT/ATT | 187 | 2.66 |
| ACC/GTT | 194 | 2.76 |
| ACG/CGT | 36 | 0.51 |
| ACT/AGT | 35 | 0.50 |
| AGC/CTG | 161 | 2.29 |
| AGG/CCT | 181 | 2.57 |
| ATC/ATG | 198 | 2.82 |
| CCG/CGG | 64 | 0.91 |
| Others | 132 | 1.88 |
| Total | 7032 | 100 |
Fig 6Photograph of PCR amplification results for SSR markers in licorice.
The first line is the DNA ladder. The subsequent lines are the PCR products generated using different primers.