| Literature DB >> 19159482 |
Xinguo Li1, Harry X Wu, Shannon K Dillon, Simon G Southerton.
Abstract
BACKGROUND: Wood is a major renewable natural resource for the timber, fibre and bioenergy industry. Pinus radiata D. Don is the most important commercial plantation tree species in Australia and several other countries; however, genomic resources for this species are very limited in public databases. Our primary objective was to sequence a large number of expressed sequence tags (ESTs) from genes involved in wood formation in radiata pine.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19159482 PMCID: PMC2636829 DOI: 10.1186/1471-2164-10-41
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Assembly of radiata pine xylogenesis ESTs from six cDNA libraries
| Assembly for six libraries a | 5,952 | 952 | 2,352 | 3,304 | 47.8 |
| Assembly for each library b | |||||
| Juvenile earlywood | 1,259 | 198 | 711 | 909 | 27.8 |
| Juvenile latewood | 1,636 | 241 | 935 | 1,176 | 28.1 |
| Transition earlywood | 694 | 92 | 410 | 502 | 27.7 |
| Transition latewood | 799 | 73 | 371 | 444 | 44.4 |
| Mature earlywood | 837 | 128 | 454 | 582 | 30.5 |
| Mature latewood | 727 | 65 | 559 | 624 | 14.2 |
| Total of each library | 5,952 | 797 | 3,440 | 4,237 | 28.8 d |
a: ESTs from the 5 ends of 5,952 clones were used in the assembly;
b: Only 5 end ESTs in each library were used in the assemblies;
c: Redundancy was estimated by: 1- (number of unigenes/number of ESTs);
d: The average redundancy from each of six cDNA libraries.
Figure 1Distribution of 952 contigs based on the number of clustered ESTs. The number of ESTs clustered in contigs reduces the efficiency of ESTs for representing different genes, thus can be regarded as an indicator for EST redundancy in cDNA library.
Summary of annotation and functional classification of 3,304 xylogenesis unigenes in radiata pine*.
| Contig | 952 | 942 | 10 | 593 | 272 | 524 | 453 | 349 |
| Singleton | 2,352 | 2,232 | 120 | 1,220 | 595 | 1,092 | 900 | 1,012 |
| Unigene | 3,304 | 3,174 | 130 | 1,813 | 867 | 1,616 | 1,353 | 1,361 |
*: Annotation was based on blastx with unigenes against UniProt; for the no hits a further blastn was performed against the TIGR database; functional classification was based on GO terms.
Figure 2Functional classification for the 1,843 unigenes which were assigned with GO terms. Among 3,304 unigenes from radiata pine xylogenesis, 1,813 unigenes were functionally assigned with GO terms as molecular function (A), cellular component (B) and biological process (C). For each of these GO terms, functions were further assigned with sub-category of GO terms. The total percentage of functional categories or sub-category may be over 100% due to possibly multiple functions from some unigenes.
The most enriched GO terms in the radiata pine xylogenesis genomic resource*.
| BP | GO:0044267 cellular protein metabolic process | 79 | 4.99 | 0.025 |
| BP | GO:0006810 transport | 47 | 2.97 | 0.006 |
| BP | GO:0016043 cellular component organization and biogenesis | 36 | 2.28 | 0.011 |
| BP | GO:0009056 catabolic process | 17 | 1.07 | 0.060 |
| CC | GO:0044444 cytoplasmic part | 99 | 6.26 | 0.000 |
| CC | GO:0032991 macromolecular complex | 46 | 2.91 | 0.001 |
| CC | GO:0043228 non-membrane-bound organelle | 23 | 1.45 | 0.074 |
| MF | GO:0000166 nucleotide binding | 55 | 3.48 | 0.088 |
| MF | GO:0005215 transporter activity | 25 | 1.58 | 0.059 |
| MF | GO:0008233 peptidase activity | 19 | 1.20 | 0.014 |
| MF | GO:0005198 structural molecule activity | 17 | 1.07 | 0.002 |
* A total of 1,582 unigenes with homologs and non-redundant accession numbers in UniProt are included in the DAVID functional annotation analysis, only the enriched GO terms including more than 1% of genes are listed.
Thirty highly abundant genes (or gene families) in the 5,952 xylogenesis ESTs of radiata pine.
| 175 | 2.94 | |
| 165 | 2.77 | |
| 102 | 1.71 | |
| 102 | 1.71 | |
| 89 | 1.50 | |
| 75 | 1.26 | |
| 58 | 0.97 | |
| 46 | 0.77 | |
| 41 | 0.69 | |
| 38 | 0.64 | |
| 38 | 0.64 | |
| 36 | 0.60 | |
| 35 | 0.59 | |
| 32 | 0.54 | |
| 29 | 0.49 | |
| 29 | 0.49 | |
| 29 | 0.49 | |
| 29 | 0.49 | |
| 29 | 0.49 | |
| 28 | 0.47 | |
| 27 | 0.45 | |
| 26 | 0.44 | |
| 25 | 0.42 | |
| 25 | 0.42 | |
| 24 | 0.40 | |
| 24 | 0.40 | |
| 23 | 0.39 | |
| 23 | 0.39 | |
| 21 | 0.35 | |
| 21 | 0.35 |
Thirty highly abundant protein families in the radiata pine xylogenesis EST resource.
| Cellulose_synt | Cellulose synthase | PF03552 | 108 | 1.81 |
| MIP | Major intrinsic protein | PF00230 | 90 | 1.51 |
| Pkinase | Protein kinase domian | PF00069 | 58 | 0.97 |
| Actin | Actin | PF00022 | 50 | 0.84 |
| Tubulin_C | Tubulin/FtsZ family, C-terminal domain | PF03953 | 45 | 0.76 |
| Cu_bind_like | Plastocyanin-like domain | PF02298 | 41 | 0.69 |
| P450 | Cytochrome P450 | PF00067 | 38 | 0.64 |
| Uiquitin | Ubiquitin family | PF00240 | 38 | 0.64 |
| S-AdoMet_synt_M | S-adenosylmethionine synthetase, Central domain | PF02772 | 37 | 0.62 |
| RRM_1 | RNA recognition motif | PF00076 | 35 | 0.59 |
| Meth_synt_2 | Cobalamin-independent synthase, Catalytic domain | PF01717 | 35 | 0.59 |
| DUF1218 | Unknown function protein | PF06749 | 34 | 0.57 |
| Methyltransf_3 | O-methyltransferases | PF01596 | 30 | 0.50 |
| PAR1 | Photoassimilate-responding protein | PF06521 | 30 | 0.50 |
| Ras | Ras superfamily | PF00071 | 28 | 0.47 |
| Hin1 | Harpin-induced protein 1 (Hin1) | PF07320 | 27 | 0.45 |
| Tryp_alpha_amyl | Protease inhibitor/seed storage/LTP family | PF00234 | 26 | 0.44 |
| Tubulin | Tubulin/FtsZ family, GTPases domain | PF00091 | 25 | 0.42 |
| Cu-oxidase_2 | Multicopper oxidase-like domains | PF07731 | 25 | 0.42 |
| Peroxidase | Peroxidase | PF00141 | 24 | 0.40 |
| Efhand | Signaling proteins and buffering/transport proteins | PF00036 | 23 | 0.39 |
| Mito_carr | Mitochondrial carrier protein | PF00153 | 23 | 0.39 |
| UQ_con | Ubiquitin-conjugating enzyme | PF00179 | 23 | 0.39 |
| Peptidase_C1 | Papain family cysteine protease | PF00112 | 21 | 0.35 |
| Glyco_hydro_9 | Glycosyl hydrolase family 9 | PF00759 | 21 | 0.35 |
| Arf | ADP ribosylation factor (Arf) | PF00025 | 20 | 0.34 |
| Thioredoxin | Thioredoxin | PF00085 | 20 | 0.34 |
| Ricin_B_lectin | Ricin-type beta-trefoil lectin domain | PF00652 | 20 | 0.34 |
| Glyco_hydro_19 | Chitinase class I | PF00182 | 19 | 0.32 |
| Epimerase | NAD dependent epimerase/dehydratase family | PF01370 | 19 | 0.32 |
Forty-one transcription factor families in the radiata pine xylogenesis EST resource.
| PHD | Cys4--His--Cys3 zinc finger | 55 | 15.36 |
| C3H | Zinc finger, C-x8-C-x5-C-x3-H type (and similar) | 38 | 10.61 |
| HB | Homeobox domain | 29 | 8.10 |
| C2H2 | Zinc finger, C2H2 type | 23 | 6.42 |
| NAC | No apical meristem (NAM) protein | 22 | 6.15 |
| MYB-related | N-terminal myb-domain | 20 | 5.59 |
| MYB | Myb-like DNA-binding domain | 16 | 4.47 |
| AP2-EREBP | AP2 domain | 15 | 4.19 |
| PcG | Polycomb group (PcG) proteins | 14 | 3.91 |
| bHLH | Helix-loop-helix DNA-binding domain | 11 | 3.07 |
| LIM | LIM domain | 11 | 3.07 |
| EIL | Ethylene insensitive1 (EIN3) | 10 | 2.79 |
| HMG | HMG (high mobility group) box | 8 | 2.23 |
| C2C2-GATA | GATA zinc finger | 7 | 1.96 |
| LUG | LEUNIG gene | 7 | 1.96 |
| WRKY | WRKY DNA-binding domain | 7 | 1.96 |
| bZIP | Basic leucine zipper (bZIP) motif | 6 | 1.68 |
| GARP-G2-like | GLK proteins | 5 | 1.40 |
| SBP | SBP-domain | 5 | 1.40 |
| Trihelix | Trihelix DNA-binding domain | 5 | 1.40 |
| MADS | DNA-binding and dimerisation domain | 4 | 1.12 |
| CCAAT-Dr1 | CCAAT-box-related motifs | 4 | 1.12 |
| ZIM | Zinc-finger protein expressed in inflorescence meristem | 4 | 1.12 |
| Alfin | Cys4 zinc finger and His/Cys3 | 3 | 0.84 |
| AUX-IAA | AUX/IAA family | 3 | 0.84 |
| TAZ | TAZ zinc finger | 3 | 0.84 |
| ABI3-VP1 | ABI3/VP1 proteins | 2 | 0.56 |
| AS2 | Asymmetric leaves2 | 2 | 0.56 |
| BES1 | BRI1-EMS-Suppressor 1 | 2 | 0.56 |
| C2C2-CO-like | CCT motif | 2 | 0.56 |
| CAMTA | Calmodulin-binding transcription activators | 2 | 0.56 |
| FHA | Forkhead domain | 2 | 0.56 |
| GRF | Growth-regulating factor1 | 2 | 0.56 |
| MBF1 | Multiprotein bridging factor 1 | 2 | 0.56 |
| ARF | Auxin response factor | 1 | 0.28 |
| ARID | AT-rich interaction domain | 1 | 0.28 |
| CCAAT-HAP3 | CCAAT-binding factor | 1 | 0.28 |
| HSF | Heat shock factor | 1 | 0.28 |
| Nin-like | Nodule inception protein | 1 | 0.28 |
| TLP | Tubby family | 1 | 0.28 |
| ZF-HD | ZF-HD class homeobox domain | 1 | 0.28 |
*: % is based on the 358 radiata pine ESTs with homologs in the PlantTFDB.
Figure 3Hierarchical clustering dendrogram trees for juvenile and mature wood cDNA libraries. ESTs or unigenes from each of the four libraries (juvenile earlywood-JE, juvenile latewood-JL, mature earlywood-ME and mature latewood-ML) were annotated against the UniProt and TIGR databases, and assigned with GO terms. The abundance of ESTs or unigenes in each GO term was calculated for each library, and further normalized as a percentage based on the number of ESTs or unigenes in each library. Statistical significance of EST or unigene abundance in each GO term among the four libraries was valuated by Chi Square tests and P-values. The normalized EST or unigene abundance was used to construct the hierarchical clustering dendrogram trees. A: the dendrogram tree for ESTs. B: the dendrogram tree for unigenes.
Figure 4Phylogenetic tree of CesA proteins from radiata pine and other species. Muscle software was used to align multiple protein sequences of CesAs derived from GenBank. The protein sequences are deduced amino acid sequences from full length cDNAs except for some radiata pine CesAs. GenBank accession numbers are as follows: Pinus radiata, PrCesA1 (AAT57672), PrCesA2 (AAQ63936), PrCesA3 (AAQ63930), PrCesA5 (AAQ63931), PrCesA6 (AAQ63932), PrCesA7 (AAQ63933), PrCesA8 (AAQ63934), PrCesA10 (AAQ63935), PrCesA11 (AAQ63929). PrCesA1 and PrCesA10 are full length sequences, PrCesA2 is nearly full length sequence, others are partial sequences; Pinus taeda, PtCesA1 (AAX18647), PtCesA2 (AAX18648), PtCesA3 (AAX18649); Populus tremuloides, PtrCesA2 (AAM26299), PtrCesA3 (AAQ08987), PtrCesA4 (AAO25536), PtrCesA5 (AAL23710), PtrCesA6 (AAP40636), PtrCesA7 (AAO25581); Populus tremula × Populus tremuloides, PttCesA1 (AAT09894), PttCesA2 (AAT09895), PttCesA3-1 (AAT09896), PttCesA3-2 (AAT09897); Populus tomentosa, PtoCesA1 (AAY21910); Populus canescens, PcaCesA1 (AAC78476); Arabidopsis thaliana, AtCesA1 (O48946), AtCesA2 (O48947), AtCesA3 (Q941L0), AtCesA4 (Q84JA6), AtCesA5 (Q8L778), AtCesA6 (Q94JQ6), AtCesA7 (Q9SWW6), AtCesA8 (Q8LPK5), AtCesA9 (Q9SJ22), AtCesA10 (Q9SKJ5). Red colour indicates secondary cell wall CesAs, blue colour indicates primary cell wall CesAs.
Figure 5A wood core and SilviScan profile of wood density for a 32-year-old radiata pine tree. The cross section of wood core sampled at 1.3 m height shows the annual change in width and colour of earlywood and latewood within each ring. Wood density is the most important wood trait for the sawmill timber industry. SilviScan profiling for wood density of radiata pine revealed the distinct characteristics of juvenile wood, particularly juvenile earlywood, compared to transition and mature wood. SilviScan analysis was performed in the method of Evans, et al, 2000 [78].
Comparisons of EST abundance of genes related to lignin biosynthesis in different wood development stages.
| EW a | LWb | P-value e | JW c | MW d | P-value e | |
| 16 | 13 | > 0.05 | 9 | 12 | 0.04 | |
| 10 | 8 | > 0.05 | 4 | 4 | > 0.05 | |
| 10 | 6 | > 0.05 | 8 | 5 | > 0.05 | |
| 6 | 3 | 0.07 | 1 | 4 | 0.04 | |
| 7 | 2 | > 0.05 | 2 | 3 | > 0.05 | |
| 2 | 6 | > 0.05 | 3 | 3 | > 0.05 | |
| 1 | 1 | > 0.05 | 2 | 0 | > 0.05 | |
| 0 | 2 | > 0.05 | 2 | 0 | > 0.05 | |
| 11 | 5 | 0.08 | 0 | 7 | 0.001 | |
| 16 | 20 | > 0.05 | 16 | 10 | > 0.05 | |
| 15 | 31 | 0.05 | 19 | 11 | > 0.05 | |
| 4 | 3 | > 0.05 | 3 | 1 | > 0.05 | |
| 7 | 7 | > 0.05 | 10 | 0 | 0.01 | |
a: The number of ESTs for the combined earlywood (EW) libraries. Total ESTs in the combined earlywood libraries from juvenile, transition and mature age are 2,790;
b: The number of ESTs for the combined latewood (LW) libraries. Total ESTs in the combined latewood libraries from juvenile, transition and mature age are 3,169;
c: The number of ESTs for the combined juvenile wood (JW) libraries. Total ESTs in the combined JW libraries from earlywood and latewood tissues are 2,902;
d: The number of ESTs for the combined mature wood (MW) libraries. Total ESTs in the combined MW libraries from earlywood and latewood tissues are 1,564;
e: Statistical significance according to Audic and Claverie 1997 [79].
RT-MLPA validation of four genes preferentially represented in earlywood and latewood.
| EW/LW | Normalized ratio | P-value | JE/JL * | P-value | TE/TL ** | P-value | |
| 34/11 | 3.51 | 0.001 | 1.36 | < 0.001 | 13.5 | < 0.001 | |
| 11/1 | 12.49 | 0.001 | 2.59 | < 0.001 | 5.74 | < 0.001 | |
| 13/3 | 4.92 | 0.005 | 4.33 | < 0.001 | |||
| 2/17 | 0.13 | 0.001 | 0.55 | < 0.001 | |||
*: JE, juvenile earlywood; JL, juvenile latewood;
**: TE, transition earlywood; TL, transition latewood.