| Literature DB >> 18096061 |
Tetsuya Sakurai1, Germán Plata, Fausto Rodríguez-Zapata, Motoaki Seki, Andrés Salcedo, Atsushi Toyoda, Atsushi Ishiwata, Joe Tohme, Yoshiyuki Sakaki, Kazuo Shinozaki, Manabu Ishitani.
Abstract
BACKGROUND: Cassava, an allotetraploid known for its remarkable tolerance to abiotic stresses is an important source of energy for humans and animals and a raw material for many industrial processes. A full-length cDNA library of cassava plants under normal, heat, drought, aluminum and post harvest physiological deterioration conditions was built; 19968 clones were sequence-characterized using expressed sequence tags (ESTs).Entities:
Mesh:
Substances:
Year: 2007 PMID: 18096061 PMCID: PMC2245942 DOI: 10.1186/1471-2229-7-66
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Summary of library properties and assembly results after sequencing the clones from both ends.
| Clones | 19968 |
| Sequence reads (trimmed) | 35400 |
| Contigs | 6355 |
| Singletons | 9026 |
| Scaffolds | 10577 |
| Fully sequenced transcripts | 2005 |
| Distinct genes | 7796 |
| Novel cassava transcripts | 6967 |
| Novel plant transcripts | 1521 |
Figure 1Cluster profile of the assembly of cassava ESTs. The graph presents the number of clones per assembled scaffold; it should be noticed that over 7000 transcripts are represented by a single clone in the full-length library.
Number of predicted transcripts according to the species-specific datasets downloaded from the given locations.
| 50698 | This paper | |
| 58036 | Joint Genome Institute [105] | |
| 31527 | TAIR [106] | |
| 62827 | TIGR [107] |
Figure 2Pathway map of starch and sucrose metabolism. Sequences presumed to have been captured in the full-length library are shown in red. Arabidopsis genes not captured in cassava with this library are presented in green.
Comparison of the number of genes per pathway in Arabidopsis and in the full-length cDNA library according to the automated annotation. The 40 KEGG pathways with the largest number of cassava genes are presented.
| ath03010 | Ribosome | 89 | 110 | 0.81 |
| ath00190 | Oxidative phosphorylation | 54 | 89 | 0.61 |
| ath00195 | Photosynthesis | 41 | 69 | 0.59 |
| ath00230 | Purine metabolism | 36 | 67 | 0.54 |
| ath00240 | Pyrimidine metabolism | 29 | 54 | 0.54 |
| ath03050 | Proteasome | 26 | 31 | 0.84 |
| ath00010 | Glycolysis/Gluconeogenesis | 22 | 22 | 1.00 |
| ath00710 | Carbon fixation | 22 | 24 | 0.92 |
| ath00500 | Starch and sucrose metabolism | 22 | 29 | 0.76 |
| ath00193 | ATP synthesis | 21 | 29 | 0.72 |
| ath00620 | Pyruvate metabolism | 19 | 24 | 0.79 |
| ath00970 | Aminoacyl-tRNA synthetases | 17 | 24 | 0.71 |
| ath00251 | Glutamate metabolism | 16 | 23 | 0.70 |
| ath00100 | Biosynthesis of steroids | 16 | 23 | 0.70 |
| ath00400 | Phenylalanine, tyrosine and tryptophan biosynthesis | 15 | 24 | 0.63 |
| ath00030 | Pentose phosphate pathway | 14 | 15 | 0.93 |
| ath00020 | Citrate cycle (TCA cycle) | 14 | 15 | 0.93 |
| ath00860 | Porphyrin and chlorophyll metabolism | 13 | 20 | 0.65 |
| ath03020 | RNA polymerase | 13 | 21 | 0.62 |
| ath00260 | Glycine, serine and threonine metabolism | 13 | 24 | 0.54 |
| ath00252 | Alanine and aspartate metabolism | 11 | 16 | 0.69 |
| ath00330 | Arginine and proline metabolism | 11 | 19 | 0.58 |
| ath03022 | Basal transcription factors | 11 | 21 | 0.52 |
| ath00670 | One carbon pool by folate | 10 | 11 | 0.91 |
| ath00052 | Galactose metabolism | 10 | 12 | 0.83 |
| ath03060 | Protein export | 10 | 13 | 0.77 |
| ath00051 | Fructose and mannose metabolism | 10 | 14 | 0.71 |
| ath00640 | Propanoate metabolism | 10 | 15 | 0.67 |
| ath00350 | Tyrosine metabolism | 10 | 17 | 0.59 |
| ath00071 | Fatty acid metabolism | 9 | 10 | 0.90 |
| ath00630 | Glyoxylate and dicarboxylate metabolism | 9 | 12 | 0.75 |
| ath00290 | Valine, leucine and isoleucine biosynthesis | 9 | 12 | 0.75 |
| ath00510 | N-Glycan biosynthesis | 9 | 15 | 0.60 |
| ath00380 | Tryptophan metabolism | 9 | 15 | 0.60 |
| ath00910 | Nitrogen metabolism | 9 | 16 | 0.56 |
| ath00900 | Terpenoid biosynthesis | 8 | 8 | 1.00 |
| ath00941 | Flavonoid biosynthesis | 8 | 9 | 0.89 |
| ath00360 | Phenylalanine metabolism | 8 | 10 | 0.80 |
| ath00940 | Stilbene, coumarine and lignin biosynthesis | 8 | 11 | 0.73 |
| ath00280 | Valine, leucine and isoleucine degradation | 8 | 16 | 0.50 |
Arabidopsis stress-induced genes identified by the RAFL microarray [33] captured in the cassava full-length library.
| FL5-3E18 | ATHERD10 | Aquaporin homolog | 17 |
| FL3-5J1 | AB004872 | Gamma tonoplast intrinsec protein 2 | 14 |
| FL5-3P12 | AB039929 | EXGT-A2 | 13 |
| FL5-2E17 | AB039928 | Beta-glucosidase homolog | 12 |
| AB039927 | Thiol protease | 11 | |
| FL5-3J4 | ATHRD19A | Heat shock protein dnaJ homolog | 11 |
| FL5-2123 | AB050546 | Ascorbate peroxidase | 11 |
| FL3-2C6 | AB044404 | Thioredoxin | 10 |
| DREB1A | AB050557 | EREBP/AP2 protein | 9 |
| FL2-5G7 | AB050558 | Catalase 3 (CAT3) | 8 |
| FL2-1C1 | AB050576 | Cysteine proteinase homolog | 8 |
| AB050560 | - | 6 | |
| FL5-2122 | AB050542 | DC 1.2 homolog | 6 |
| FL5-1N11 | AB050561 | Non-specific lipid transfer protein | 5 |
| FL3-27 | AB050562 | Cysteine proteinase inhibitor homolog | 5 |
| FL5-95 | AB050563 | Rice glyoxalase 1 homolog | 4 |
| FL5-94 | AB050550 | Enolase | 4 |
| FL2-5A4 | AB050564 | DEAD box ATPase/RNA helicase protein (DHR1) | 4 |
| FL3-5A3 | AB015098 | Putative cold acclimation protein | 3 |
| FL5-2G21 | AB044405 | Reversibly glycosylated polypeptide-3 | 3 |
| FL5-1A9 | AB046991 | Nodulin-like protein homolog | 3 |
| FL5-90 | AB050565 | β-amylase | 3 |
| FL3-3B1 | AB050566 | Hypothetical protein | 2 |
| AB050567 | Group II LEA protein | 1 | |
| AB050568 | Group II LEA protein | 1 | |
| AB050571 | - | 1 | |
| AB050551 | Membrane protein | 1 | |
| FL5-1F23 | AB050573 | Pyrroline-5-carboxylate synthetase | 1 |
| FL5-3M24 | AB007787 | LEA protein SAG21 homolog | 1 |
| FL5-1O3 | AB050574 | - | 1 |
| FL1-159 | AB050575 | HVA22 homolog | 1 |
| FL2-1H6 | AB050552 | Jasmonate-inducible protein homolog | 1 |
Figure 3Comparison of the annotation of 6566 cassava sequences with putative one-to-one orthologs and 4313 sequences without. The Gene Ontology terms overrepresented and under represented (p-value < 0.05) for the sequences shared between cassava and A. thaliana, P. trichocarpa or E. esula are presented according to legend. GO terms related to stress response are frequent among cassava genes without one-to-one orthologs in any of these three species. 302 redundant sequences produced by CAP3 were included in the analysis.
Figure 4Main GO categories in the annotation of 230 potential gene duplications in cassava.
Figure 5Reactive oxygen species processing in plant cells. Possible gene duplications in cassava are shown in bold and underlined. AOX, alternative oxidase; FNR, ferredoxin NADPH reductase; MAPKK, mitogen activated protein kinase kinase; MDAR, monodehydroascorbate reductase; GLR, glutaredoxin; GR, glutathione reductase; GCL, glutamate cysteine ligase; NTR, NADPH thioredoxin reductase; HSP20, heat shock protein 20; PSII, photosystem II; PQ, plastoquinone; Cytb6f, cytochrome b6f; PC, plastocyanin; PSI, photosystem I; Fd, ferredoxin; SOD, superoxide dismutase; ABA, abscisic acid; AsA, ascorbate; APX, ascorbate peroxidase; MDA, mohodehydroascorbate; DHA, dehydroascorbate; DHAR, DHA reductase; GSSG, oxidized glutathione; GSH, glutathione; Glu, glutamate; CAT, catalase; PrxR, peroxireductase; Trx, thioredoxin; Based on [72, 75, 104]
Conditions and tissues used for mRNA extraction.
| No treatment | 9, 11, 12 weeks | leaf | |
| No treatment | 9 month | root | |
| Drought shock | 7-weeks | leaf | 3, 6, 24, 72 hours |
| Heat | 9-weeks | leaf | 3, 6, 24, 72 hours |
| PPD | 9 month | root | 24, 48, 120 hours |
| High Al, low pH | 9 weeks | leaf | 3, 6, 24, 72 hours |
| High Al, low pH | 9 month | root | 6, 24, 48 hours |