| Literature DB >> 23923022 |
Guilherme Toledo-Silva1, Claudio Benicio Cardoso-Silva, Liana Jank, Anete Pereira Souza.
Abstract
Guinea grass (Panicum maximum Jacq.) is a tropical African grass often used to feed beef cattle, which is an important economic activity in Brazil. Brazil is the leader in global meat exportation because of its exclusively pasture-raised bovine herds. Guinea grass also has potential uses in bioenergy production due to its elevated biomass generation through the C4 photosynthesis pathway. We generated approximately 13 Gb of data from Illumina sequencing of P. maximum leaves. Four different genotypes were sequenced, and the combined reads were assembled de novo into 38,192 unigenes and annotated; approximately 63% of the unigenes had homology to other proteins in the NCBI non-redundant protein database. Functional classification through COG (Clusters of Orthologous Groups), GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) analyses showed that the unigenes from Guinea grass leaves are involved in a wide range of biological processes and metabolic pathways, including C4 photosynthesis and lignocellulose generation, which are important for cattle grazing and bioenergy production. The most abundant transcripts were involved in carbon fixation, photosynthesis, RNA translation and heavy metal cellular homeostasis. Finally, we identified a number of potential molecular markers, including 5,035 microsatellites (SSRs) and 346,456 single nucleotide polymorphisms (SNPs). To the best of our knowledge, this is the first study to characterize the complete leaf transcriptome of P. maximum using high-throughput sequencing. The biological information provided here will aid in gene expression studies and marker-assisted selection-based breeding research in tropical grasses.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23923022 PMCID: PMC3726610 DOI: 10.1371/journal.pone.0070781
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of assembled transcripts and unigenes of P. maximum leaves.
| Total raw reads | 168,053,718 |
| Total data | 13.44 Gb |
| Total clean reads | 120,838,336 |
| Q20 bases | 96.32% |
| GC percentage | 47.32% |
| Total transcripts | 88.292 |
| Transcripts mean length | 942 bp |
| Transcripts N50 | 1272 bp |
| PE mapped reads (Bowtie) | 85.96% |
| Unigenes | 38.192 |
| Unigenes mean length | 758 bp |
| Unigenes N50 | 981 bp |
| Unigenes GC percentage | 48.42% |
| Predicted ORFs (transcripts) | 31.04% |
| Predicted ORFs (unigenes) | 29.32% |
Figure 1De novo assembly length distribution.
Histogram of the sequence-length distribution of transcripts, unigenes and unigenes with significant BLASTx hits in the NCBI nr database.
Annotation summary of 38,192 P. maximum unigenes.
| Database | Hits | Hits percentage |
| NCBI non-redundant proteins (nr) | 24,122 | 63.15% |
| SwissProt | 16,396 | 42.93% |
| COG | 22,473 | 58.84% |
| KEGG | 4,110 | 10.76% |
| Grass | 26,319 | 68.92% |
| Gene ontology | 18,995 | 49.73% |
Figure 2Gene ontology classification of Panicum maximum unigenes.
Distribution of the GO categories assigned to the P. maximum transcriptome. Transcripts were classified into three categories: cellular components, molecular functions and biological processes.
Figure 3Shared and unique unigenes of Panicum maximum determined based on RSEM analysis.
The 10 most abundant transcripts found in the Guinea grass leaf transcriptome.
| Putative gene | E-value | FPKM | UniProtKB |
| Transcript antisense to ribosomal RNA protein 2 | 2.00E-016 | 10702.87 | gi|74630365|sp|Q8TGM7.1|ART2_YEAST |
| Carbonic anhydrase | 1.00E-121 | 10021.35 | gi|729003|sp|P40880.1|CAHC_HORVU |
| Transcript antisense to ribosomal RNA protein 1 | 1.00E-011 | 7523.47 | gi|74644329|sp|Q8TGM6.1|TAR1_YEAST |
| Uncharacterized protein ycf76 | 2.00E-050 | 7410.46 | gi|75121187|sp|Q6ENQ6.1|YCF76_SACOF |
| Stem-specific protein TSJT1 | 1.00E-028 | 5228.23 | gi|136452|sp|P24805.1|TSJT1_TOBAC |
| Phosphoenolpyruvate carboxylase 1 | 0.0 | 5043.19 | gi|115608|sp|P04711.2|CAPP1_MAIZE |
| Chlorophyll a-b binding protein | 4.00E-139 | 4389.51 | gi|122246902|sp|Q10HD0.1|CB23_ORYSJ |
| Pyruvate, phosphate dikinase 1 | 0.0 | 4164.02 | gi|193806357|sp|P11155.2|PPDK1_MAIZE |
| Metallothionein-like protein 3A | 2.00E-008 | 3959.71 | gi|158512839|sp|A2 WLS0.1|MT3A_ORYSI |
| Metallothionein-like protein 1A | 8.00E-009 | 3775.59 | gi|158513336|sp|A2ZH20.1|MT1A_ORYSI |
List of genes comprising the C4 photosynthetic pathway found among P. maximum unigenes.
| Putative gene | KEGG orthology | Enzyme code | Unigenes |
| Carbonic anhydrase | K01673/K01674 | 4.2.1.1 | 8 |
| Phosphoenolpyruvate carboxylase | K01595 | 4.1.1.31 | 12 |
| Aspartate aminotransferase, cytosolic | K14454 | 2.6.1.1 | 1 |
| Phosphoenolpyruvate carboxykinase | K01610 | 4.1.1.49 | 3 |
| Pyruvate kinase | K00873 | 2.7.1.40 | 10 |
| Alanine transaminase | K00814 | 2.6.1.2 | 1 |
| Malate dehydrogenase (NADP+) | K00051 | 1.1.1.82 | 3 |
| Malate dehydrogenase (oxaloacetate-decarboxylating) | K00029 | 1.1.1.40 | 4 |
| Pyruvate, orthophosphate dikinase | K01006 | 2.7.9.1 | 1 |
| Malate dehydrogenase | K00025 | 1.1.1.37 | 1 |
| Malate dehydrogenase (decarboxylating) | K00028 | 1.1.1.39 | 2 |
| Ribulose-bisphosphate carboxylase | K01602 | 4.1.1.39 | 1 |
| Aspartate aminotransferase, chloroplastic | K00811 | 2.6.1.1 | 2 |
List of genes comprising the cellulose and lignin pathways found among P. maximum unigenes.
| Putative gene | KEGG orthology | Enzyme code | Unigenes |
|
| |||
| UTP–glucose-1-phosphate uridylyltransferase | K00963 | 2.7.7.9 | 2 |
| Sucrose synthase | K00695 | 2.4.1.13 | 3 |
| Cellulose synthase A | K10999 | 2.4.1.12 | 21 |
| Sterol 3beta-glucosyltransferase | K05841 | 2.4.1.173 | 4 |
|
| |||
| Phenylalanine ammonia-lyase | K10775 | 4.3.1.24 | 3 |
| Trans-cinnamate 4-monooxygenase | K00487 | 1.14.13.11 | 4 |
| 4-Coumarate-CoA ligase | K01904 | 6.2.1.12 | 10 |
| Cinnamoyl-CoA reductase | K09753 | 1.2.1.44 | 5 |
| Cinnamyl alcohol dehydrogenase | K00083 | 1.1.1.195 | 6 |
| Peroxidase | K00430 | 1.11.1.7 | 21 |
| Shikimate O-hydroxycinnamoyltransferase | K13065 | 2.3.1.133 | 8 |
| Caffeoyl-CoA O-methyltransferase | K00588 | 2.1.1.104 | 2 |
| Ferulate-5-hydroxylase | K09755 | 1.14.-.- | 2 |
Summary of putative SSRs found in P. maximum unigenes.
| Nucleotiderepeat | Differentmotifs | Number ofSSRs | % of totalSSRs |
| Dinucleotide | 4 | 451 | 8.9% |
| Trinucleotide | 10 | 4333 | 86% |
| Tetranucleotide | 30 | 184 | 3.6% |
| Pentanucleotide | 29 | 44 | 0.8% |
| Hexanucleotide | 23 | 23 | 0.4% |
| Total | 96 | 5035 | 100% |
Summary of putative SNPs found in P. maximum unigenes.
| SNP type | Count |
|
| 228,468 |
| A-G/G-A | 113,075 |
| C-T/T-C | 115,393 |
|
| 117,988 |
| A-C/C-A | 29,403 |
| A-T/T-A | 26,310 |
| T-G/G-T | 29,035 |
| G-C/C-G | 33,240 |
| Total | 346,456 |
Figure 4Shared and unique putative SNPs found in Panicum maximum unigenes.