| Literature DB >> 23056442 |
Mingying Liu1, Guirong Qiao, Jing Jiang, Huiqin Yang, Lihua Xie, Jinzhong Xie, Renying Zhuo.
Abstract
BACKGROUND: Bamboo occupies an important phylogenetic node in the grass family with remarkable sizes, woodiness and a striking life history. However, limited genetic research has focused on bamboo partially because of the lack of genomic resources. The advent of high-throughput sequencing technologies enables generation of genomic resources in a short time and at a minimal cost, and therefore provides a turning point for bamboo research. In the present study, we performed de novo transcriptome sequencing for the first time to produce a comprehensive dataset for the Ma bamboo (Dendrocalamus latiflorus Munro).Entities:
Mesh:
Year: 2012 PMID: 23056442 PMCID: PMC3463524 DOI: 10.1371/journal.pone.0046766
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of Illumina transcriptome sequencing for D. latiflorus Munro.
| Sample | Read Length | No. of Reads | Data | GC (%) | Q20 (%) |
| Ma bamboo | 72+72 | 15,138,726 | 2,179,976,544 | 49.48 | 94.67 |
Figure 1Overview of the Dendrocalamus latiflorus Munro transcriptome sequencing and assembly.
(A) Length distribution of D. latiflorus Munro transcripts. (B) Size distribution of D. latiflorus Munro unigenes. (C) Log-log plot showing the dependence of unigene lengths on the number of reads assembled into each unigene.
Summary of Illumina transcriptome assembly for D. latiflorus Munro.
| Scaffolds (unique genes) | ||
| Scaffold Length | Total length (bp) | Percentage |
| 0–300 | 30,162 (28,656) | 29.18% (42.00%) |
| 300–500 | 23,909 (19,546) | 23.13% (28.65%) |
| 500–1000 | 24,158 (13,652) | 23.37% (20.01%) |
| 1000–2000 | 19,236 (5,590) | 18.61% (8.19%) |
| 2000+ | 5,879 (785) | 5.69% (1.15%) |
| Total Length | 76,066,622 (33,216,845) | |
| Count | 103,354 (68,229) | |
| N50_Length | 1,132 (590) | |
| Mean_Length | 735.98 (486.84) | |
Functional annotation of the D. latiflorus Munro transcriptome.
| Annotated databases | All sequences | ≥300 bp | ≥1000 bp |
| nr_Annotation | 46,087 | 25,385 | 6,306 |
| nt_Annotation | 52,660 | 28,065 | 6,282 |
| swissprot_Annotation | 28,165 | 15,800 | 5,506 |
| GO_Annotation | 11,921 | 6,329 | 3,547 |
| kegg_Annotation | 45,649 | 25,222 | 6,304 |
| COG_Annotation | 10,147 | 5,560 | 2,959 |
| Total | 54,893 | 29,083 | 6,343 |
Figure 2Functional annotation of assembled sequences based on gene ontology (GO) categorization.
GO analysis was performed at the level 2 for three main categories (cellular component, molecular function and biological process).
Figure 3Clusters of orthologous groups (COG) classification.
In total, 10,147 of the 68,229 sequences with Nr hits were grouped into 25 COG classifications.
Figure 4Ma bamboo unigene similarity comparison with rice and millet and functional classification by GO analysis.
(A) Similarity search of Ma bamboo sequence against rice and millet. (B) Functional classification of Ma bamboo unigenes with and without homologs with rice and millet.
Number of bamboo FL-cDNAs and number of genes found in the rice genome that encode nine key enzymes in the lignin biosynthesis pathway.
| Enzymes | Ma bamboo | Moso bamboo | Rice |
| 4-coumarate-CoA ligase (4CL) | 35 | 2 | 26 |
| Caffeoyl caffeoyl-CoA O-methyltransferase (CCoAOMT) | 4 | 9 | 10 |
| Cinnamoyl-CoA reductase (CCR) | 10 | 7 | 18 |
| Caffeic acid O-methyltransferase (COMT) | 7 | 2 | 10 |
| Cinnamate-4-hydroxylase (C4H) | 4 | 1 | 4 |
| Cinnamoyl alcohol dehydrogenase (CAD) | 2 | 6 | 21 |
| Laccase | 34 | 5 | 23 |
| 5-hydroxyconiferyl aldehyde O-methyltransferase (AldOMT) | 0 | 1 | 7 |
| 3-deoxy-D-arabino-heptulosonate 7-phosphate synthase (DAHPS) | 7 | 2 | 8 |
The results were cited from Peng et al (2010).
Summary of simple sequence repeat (SSR) types in the D. latiflorus Munro transcriptome.
| Repeat motif | Number | Percentage (%) | |||||
|
| |||||||
| AC/CA/GT/TG | 36 | ||||||
| AG/GA/CT/TC | 152 | ||||||
| AT/TA | 13 | ||||||
| CG/GC | 6 | ||||||
|
|
|
| |||||
|
| |||||||
| AAC/AAG | 4 | ||||||
| GAA/AGA/ACA/CTT/TTC/TCT/TTG | 21 | ||||||
| TAA/ATT/TTA/ATA | 6 | ||||||
| ACC/CAC/CCA/GGT/GTG/TGG | 35 | ||||||
| ACG/GAC/CGA/CGT/GTC/TCG | 21 | ||||||
| TAC/TAG/GTA/TCA/ACT | 10 | ||||||
| AGC/CAG/GCA/TGC/CTG/GCT | 69 | ||||||
| AGG/GGA/GAG/TCC/CTC/CCT | 92 | ||||||
| ATC/CAT/GAT/ATG/TGA | 25 | ||||||
| CCG/CGG/CGC/GCC/GCG/GGC | 61 | ||||||
|
|
|
| |||||
|
| |||||||
| ATGC/ATGG/AAAT | 4 | ||||||
| TGGA | 1 | ||||||
| GAAA/GCAT/GTAG/GAGT/GCCG/GCAC | 6 | ||||||
| CTCC/CAAC | 4 | ||||||
|
|
|
| |||||
|
| |||||||
| ACTGG/ATTGT | 2 | ||||||
| CCCTG/CTTCC/CTGTG | 3 | ||||||
| GAGAG/GATGG | 2 | ||||||
|
|
|
| |||||
|
| |||||||
| CCACGG/TCAGGC/TTTTCT | 3 | ||||||
|
|
|
| |||||
|
|
|
| |||||
Number of the total SSRs detected in unigenes
The relative percentage of SSRs with different repeat motifs among the total SSRs