Shakeel Ahmed, Chuansong Zhan, Yanyan Yang, Xuekui Wang, Tewu Yang, Zeying Zhao, Qiyun Zhang, Xiaohua Li, Xuebo Hu.
Abstract
Atractylodes lancea (Thunb.) DC., known as "Cangzhu" in China, belongs to the Asteraceae family. In several countries of East and Southeast Asia (China, Thailand, Korea, Japan, etc.), its rhizome, commonly called rhizoma atractylodis, is used to treat many diseases because it contains a variety of sesquiterpenoids and other medicinally important components. Despite this medicinal value, sesquiterpenoid biosynthesis in the species remains largely unknown. In this study, we analyzed the transcriptomes of different tissues of the non-model plant A. lancea using short-read sequencing technology (Illumina). We obtained 62,352 high-quality unigenes with an average sequence length of 913 bp from the A. lancea transcripts. Among these, 43,049 (69.04%), 30,264 (48.53%), 26,233 (42.07%), 17,881 (28.67%) and 29,057 (46.60%) unigenes showed significant similarity (E-value < 1e-5) to known proteins in the Nr, SWISS-PROT, KEGG, COG, and GO databases, respectively. Of the total 62,352 unigenes, 43,049 open reading frames were predicted (Nr database). Using several bioinformatics tools, we identified all the enzymes that take part in the biosynthesis of terpenoids, including five known sesquiterpenoids, via the cytosolic mevalonic acid (MVA) and plastidial methylerythritol phosphate (MEP) pathways. We also found 6,864 simple sequence repeats (SSRs) with great potential as markers in A. lancea. This transcriptomic resource contributes to research on this medicinal plant, and in particular to mining genes for the different classes of terpenoids and other chemical compounds of medicinal and economic importance.
Year: 2016 PMID: 26990438 PMCID: PMC4798728 DOI: 10.1371/journal.pone.0151975
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1. Putative sesquiterpenoid biosynthetic pathway in Atractylodes lancea.
A flow diagram of the terpenoid backbone and sesquiterpenoid biosynthetic pathways in Atractylodes lancea. The structures of the chemicals in the pathway are shown in boxes. Green boxes represent the plastidial pathway, while black boxes show the pathway in the cytoplasm and mitochondria. The words on the boxes name the enzymes for each reaction, and the numbers in red give the number of transcripts for that specific gene. Reactions in cytoplasm, mitochondria and plastids are shown in green. Boxes with a red border show the structures of various sesquiterpenoids of A. lancea.
Statistics of sequencing and de novo assembly of the Atractylodes lancea transcriptome.
| Type | Sample | Total number | Total length (nt) | Mean length (nt) | N50 | Total consensus sequences | Distinct clusters | Distinct singletons |
|---|---|---|---|---|---|---|---|---|
| Contigs | Leaf | 112883 | 42287508 | 375 | 806 | 0 | 0 | 0 |
| Contigs | Root | 94663 | 37505566 | 396 | 837 | 0 | 0 | 0 |
| Contigs | Stem | 101679 | 39492194 | 388 | 359 | 0 | 0 | 0 |
| Unigenes | Leaf | 64106 | 43921277 | 685 | 1258 | 64106 | 19718 | 44388 |
| Unigenes | Root | 55409 | 37866604 | 684 | 1221 | 55409 | 16802 | 38607 |
| Unigenes | Stem | 56565 | 40135278 | 710 | 1328 | 56565 | 16947 | 39618 |
| Unigenes | Total | 62352 | 56923290 | 913 | 1494 | 62352 | 23974 | 38378 |
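The N50 column can be read with the usual definition in mind: the length L such that sequences of length L or longer account for at least half of the total assembled bases. The following is a minimal sketch under that assumption; the function name `n50` and the toy lengths are illustrative and are not taken from the study, whose assembly software may compute the statistic slightly differently.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    Assumes the common definition: the length of the sequence at which
    the running total, taken from longest to shortest, first reaches
    half of the summed length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Toy lengths for illustration only (not data from the study).
toy_lengths = [2, 3, 4, 5, 6, 7, 8, 9, 10]
print(n50(toy_lengths))  # total is 54; 10 + 9 + 8 = 27 reaches half, so N50 is 8
```

Because N50 weights long sequences more heavily than the mean does, it is normally larger than the mean length, as it is for the unigene rows of the table.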
Fig 2. Length distribution of unigenes in Atractylodes lancea.
The x-axis represents the size of the assembled sequences and the y-axis indicates the corresponding number of unigenes.
Fig 3. The species distribution of the non-redundant unigene annotation.
Each column shows the number of Atractylodes lancea unigenes homologous to sequences from another species. The numbers in parentheses indicate the percentage of homologous unigenes assigned to each species.
Statistics of annotations for assembled unigenes of Atractylodes lancea in different public databases.
| Database | Unigenes | Percentage(%) |
|---|---|---|
| NR | 43049 | 69.04 |
| SWISS-PROT | 30264 | 48.53 |
| KEGG | 26233 | 42.07 |
| COG | 17881 | 28.67 |
| GO | 29057 | 46.60 |
| ALL | 44482 | 71.34 |
Fig 4. Distribution of GO annotations of all unigenes.
The results were classified into three main categories: biological process, cellular component, and molecular function. The left y-axis indicates the percentage of genes in a specific category within its main category. The right y-axis indicates the number of genes in a category.
Fig 5. COG function classification of all unigenes.
The annotated unigenes are divided into functional orthologous groups, which are indicated by the letters A-Z and described beside the figure.
Fig 6. Differentially expressed gene profiling of the leaf, root and stem libraries of Atractylodes lancea.
The red and green columns indicate up- and down-regulated genes, respectively, in comparisons of the leaf, stem and root libraries of A. lancea. An FDR ≤ 0.05 and an absolute log2 fold change (log2FC) ≥ 1 were used as the thresholds for judging the significance of gene expression differences in the transcriptome data.
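The thresholds stated in the caption can be expressed as a simple filter. This is a minimal sketch, assuming each gene already has a log2 fold change and an FDR value computed upstream; the function names and the toy records are hypothetical and do not come from the study.

```python
def is_differentially_expressed(log2_fc, fdr, fdr_cutoff=0.05, log2_fc_cutoff=1.0):
    """Apply the caption's thresholds: FDR <= 0.05 and |log2FC| >= 1."""
    return fdr <= fdr_cutoff and abs(log2_fc) >= log2_fc_cutoff


def classify(log2_fc, fdr):
    """Label a gene as 'up', 'down' or 'not significant'."""
    if not is_differentially_expressed(log2_fc, fdr):
        return "not significant"
    return "up" if log2_fc > 0 else "down"


# Hypothetical records (name, log2FC, FDR), for illustration only.
toy_genes = [
    ("unigene_a", 2.3, 0.001),   # passes both thresholds, positive change
    ("unigene_b", -1.0, 0.05),   # sits exactly on both thresholds
    ("unigene_c", 3.0, 0.20),    # large change but FDR too high
    ("unigene_d", 0.4, 0.0001),  # significant FDR but change too small
]
for name, log2_fc, fdr in toy_genes:
    print(name, classify(log2_fc, fdr))
```

Both comparisons are inclusive, matching the "≤" and "≥" in the caption, so a gene sitting exactly on the thresholds is counted as differentially expressed.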
A summary of SSRs identified in Atractylodes lancea.
| Searching Items | Numbers |
|---|---|
| Total number of sequences examined | 62352 |
| Total size of examined sequences (nt) | 56923290 |
| Total number of identified cSSRs | 6864 |
| Number of sequences containing cSSRs | 5970 |
| Number of sequences containing more than one cSSR | 757 |
| Number of cSSRs present in compound formation | 303 |
| Mono-nucleotides | 570 |
| Di-nucleotides | 3122 |
| Tri-nucleotides | 2307 |
| Tetra-nucleotides | 130 |
| Penta-nucleotides | 303 |
| Hexa-nucleotides | 432 |
Fig 7. Quantity statistics of SSR classification.
The x-axis is the number of times the repeat unit is repeated; the y-axis is the number of SSRs from Atractylodes lancea. Di-nucleotide repeats were the most numerous category, and AG/CT was the most abundant di-nucleotide motif among the SSRs.
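The motif classes in the SSR table (mono- to hexa-nucleotide) can be illustrated with a small regular-expression scan. This is only a sketch: the minimum repeat counts in `MIN_REPEATS` are assumed values chosen for illustration, since the thresholds and the tool used in the study are not stated here, and `find_ssrs` is a hypothetical helper name.

```python
import re

# Assumed minimum repeat counts per motif length (illustrative only;
# the study's actual thresholds are not given in this record).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in MIN_REPEATS.items():
        # One motif of unit_len bases followed by at least min_rep - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            unit = match.group(1)
            # Skip motifs that are themselves repeats of a shorter motif
            # (e.g. "AA" or "AGAG"), which belong to a shorter motif class.
            if unit in (unit + unit)[1:-1]:
                continue
            hits.append((match.start(), unit, len(match.group(0)) // unit_len))
    return sorted(hits)


# Toy sequence for illustration only (not from the study).
toy_seq = "TTGC" + "AG" * 8 + "CCGT" + "A" * 12 + "GGC"
print(find_ssrs(toy_seq))  # [(4, 'AG', 8), (24, 'A', 12)]
```

The scan reports only perfect repeats. Compound SSRs, which the table counts separately, would need an extra step that merges hits lying within some maximum distance of each other.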
A summary of SNP results in Atractylodes lancea.
| SNP Type | Leaf | Root | Stem | Total |
|---|---|---|---|---|
| Transitions | 57839 | 51702 | 55579 | 165120 |
| AG | 29135 | 26042 | 28015 | 83192 |
| CT | 28704 | 25660 | 27564 | 81928 |
| Transversions | 33701 | 29976 | 31750 | 95427 |
| AC | 8428 | 7402 | 7902 | 23732 |
| AT | 9601 | 8564 | 8989 | 27154 |
| GC | 7281 | 6629 | 6952 | 20862 |
| GT | 8391 | 7381 | 7907 | 23679 |
| Total | 91540 | 81678 | 87329 | 260547 |
Fig 8. Statistics of SNP number.
The x-axis is the SNP type; the y-axis is the number of SNPs.
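The grouping used in the SNP table follows the standard distinction between transitions (purine to purine or pyrimidine to pyrimidine: AG, CT) and transversions (purine to pyrimidine or the reverse: AC, AT, GC, GT). The sketch below classifies a single substitution and recomputes the overall totals from the per-type "Total" column of the table; the function name `snp_class` is illustrative, and the transition/transversion ratio is derived arithmetic rather than a figure reported in this record.

```python
PURINES = {"A", "G"}
PYRIMIDINES = {"C", "T"}


def snp_class(ref, alt):
    """Classify a single-base substitution as a transition or transversion."""
    ref, alt = ref.upper(), alt.upper()
    bases = PURINES | PYRIMIDINES
    if ref not in bases or alt not in bases or ref == alt:
        raise ValueError("expected two different bases from A, C, G, T")
    same_class = {ref, alt} <= PURINES or {ref, alt} <= PYRIMIDINES
    return "transition" if same_class else "transversion"


# Per-type totals copied from the "Total" column of the SNP table.
transitions_total = 83192 + 81928                     # AG + CT
transversions_total = 23732 + 27154 + 20862 + 23679   # AC + AT + GC + GT

print(snp_class("A", "G"), snp_class("G", "T"))       # transition transversion
print(transitions_total, transversions_total)         # 165120 95427
print(round(transitions_total / transversions_total, 2))  # 1.73
```

The recomputed sums agree with the "Transitions" and "Transversions" rows of the table, and together they give the overall total of 260547 SNPs.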