| Literature DB >> 31111606 |
Vikash Kumar1, Matthieu Hainaut2,3, Nicolas Delhomme1, Chanaka Mannapperuma4, Peter Immerzeel1,5, Nathaniel R Street4, Bernard Henrissat2,3, Ewa J Mellerowicz1.
Abstract
Carbohydrate-active enzymes (CAZymes) catalyze the formation and modification of glycoproteins, glycolipids, starch, secondary metabolites and cell wall biopolymers. They are key enzymes for the biosynthesis of food and renewable biomass. Woody biomass is particularly important for long-term carbon storage and as an abundant renewable natural resource for many industrial applications. This study presents a re-annotation of CAZyme genes in the current Populus trichocarpa genome assembly and in silico functional characterization, based on high-resolution RNA-Seq data sets. Altogether, 1914 CAZyme and expansin genes were annotated in 101 families. About 1797 of these genes were found expressed in at least one Populus organ. We identified genes involved in the biosynthesis of different cell wall polymers and their paralogs. Whereas similar families exist in poplar and Arabidopsis thaliana (with the exception of CBM13 found only in poplar), a few families had significantly different copy numbers between the two species. To identify the transcriptional coordination and functional relatedness within the CAZymes and other proteins, we performed co-expression network analysis of CAZymes in wood-forming tissues using the AspWood database (http://aspwood.popgenie.org/aspwood-v3.0/) for Populus tremula. This provided an overview of the transcriptional changes in CAZymes during the transition from primary to secondary wall formation, and the clustering of transcripts into potential regulons. Candidate enzymes involved in the biosynthesis of polysaccharides were identified along with many tissue-specific uncharacterized genes and transcription factors. These collections offer a rich source of targets for the modification of secondary cell wall biosynthesis and other developmental processes in woody plants.Entities:
Keywords: carbohydrate metabolism; cell wall; comparative genomics; genome sequencing; vegetative development; wood formation
Mesh:
Substances:
Year: 2019 PMID: 31111606 PMCID: PMC6852159 DOI: 10.1111/tpj.14417
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 6.417
CAZyme families and the number of CAZymes models per family detected in Populus trichocarpa v3.0 and Arabidopsis thaliana v10.0 genomes (November 2018)
| CAZyme families |
|
|
| CAZyme families |
|
|
| ||
|---|---|---|---|---|---|---|---|---|---|
| v.10 | v.3 | v.3 vs v.1 | v.10 | v.3 | v.3 vs v.1 | ||||
| AA1 | 36 | 78 | 78 | 3.3 |
| ||||
| AA5 | 7 | 11 | 11 | 0.1 | GH63 | 2 | 1 | −2 | 0.9 |
| AA6 | 4 | 9 | 9 | 0.5 | GH77 | 2 | 3 | 0 | 0.1 |
| AA7 | 27 | 14 | 14 | 11.6 | GH79 | 3 | 6 | −1 | 0.2 |
| CE6 | 2 | 4 | 4 | 0.2 | GH81 | 2 | 1 | −3 | 0.9 |
| CE8 | 67 | 89 | 0 | 0.7 | GH85 | 2 | 2 | 2 | 0.2 |
| CE11 | 4 | 2 | 2 | 1.8 | GH89 | 1 | 3 | 1 | 0.4 |
| CE13 | 12 | 11 | 1 | 1.5 | GH95 | 1 | 4 | 0 | 0.9 |
| PL1 | 26 | 31 | 3 | 0.8 | GH100 | 9 | 16 | −3 | 0.2 |
| PL4 | 8 | 11 | 0 | 0.1 | GH116 | 4 | 5 | 5 | 0.1 |
| EXPN | 36 | 65 | 23 | 0.8 | GH146 | 2 | 3 | 3 | 0.1 |
| (CBM13)1–2 | 0 | 6 | 6 | 4.0 | GH152 | 19 | 40 | 40 | 1.5 |
| CBM18‐(EXPN)0–1 | 1 | 5 | 5 | 0.9 | GT1 | 122 | 281 | −45 | 15.6 |
| CBM18‐GH19 | 9 | 15 | 15 | 0.1 | GT2 | 42 | 60 | −14 | 0.1 |
| CBM20 | 2 | 7 | 7 | 1.2 | GT4 | 24 | 38 | −3 | 0.1 |
| (CBM20)2‐GH77 | 1 | 2 | 2 | 0.1 | GT5 | 6 | 7 | −6 | 0.3 |
| (CBM22)1–4‐GH10 | 11 | 8 | 8 | 2.6 | GT8 | 42 | 56 | −5 | 0.4 |
| CBM32 | 1 | 1 | 1 | 0.1 | GT10 | 3 | 4 | 0 | 0.1 |
| CBM43 | 31 | 34 | 34 | 1.7 | GT13 | 1 | 2 | −1 | 0.1 |
| GH17‐CBM43 | 28 | 43 | 43 | 0.1 | GT14 | 11 | 16 | 2 | 0.1 |
| CBM45 | 2 | 2 | 2 | 0.2 | GT16 | 1 | 1 | 0 | 0.1 |
| (CBM45)2‐GH13 | 1 | 1 | 1 | 1.6 | GT17 | 7 | 5 | 0 | 1.8 |
| CBM48 | 8 | 18 | 18 | 0.9 | GT19 | 1 | 1 | 0 | 0.1 |
| (CBM48)1–2‐GH13 | 7 | 10 | 10 | 0.1 | GT20 | 11 | 13 | 0 | 0.4 |
| GH9‐CBM49 | 3 | 3 | 3 | 0.3 | GT22 | 3 | 5 | 1 | 0.1 |
| CBM50 | 1 | 12 | 12 | 5.7 | GT24 | 1 | 1 | −1 | 0.1 |
| (CBM53)3‐GT5 | 3 | 1 | 1 | 2.1 | GT28 | 4 | 5 | −1 | 0.1 |
| CBM57 | 4 | 92 | 92 | 51.1 | GT29 | 3 | 5 | 0 | 0.1 |
| GH1 | 48 | 49 | −1 | 3.8 | GT30 | 1 | 1 | 0 | 0.1 |
| GH2 | 2 | 3 | 0 | 0.1 | GT31 | 33 | 48 | −4 | 0.1 |
| GH3 | 16 | 27 | −6 | 0.2 | GT32 | 6 | 5 | −1 | 1 |
| GH5 | 13 | 24 | 2 | 0.4 | GT33 | 1 | 1 | 0 | 0.1 |
| GH9 | 29 | 32 | −1 | 1.6 | GT34 | 8 | 9 | 1 | 0.4 |
| GH10 | 12 | 8 | 1 | 3.4 | GT35 | 2 | 5 | −1 | 0.4 |
| GH13 | 10 | 16 | −5 | 0.1 | GT37 | 10 | 7 | −1 | 2.6 |
| GH14 | 9 | 14 | 2 | 0.1 | GT41 | 2 | 4 | −1 | 0.2 |
| GH16 | 33 | 43 | 2 | 0.5 | GT43 | 4 | 7 | 0 | 0.1 |
| GH17 | 51 | 88 | −9 | 0.6 | GT47 | 39 | 70 | −5 | 0.8 |
| GH18 | 11 | 28 | −2 | 2.3 | GT48 | 13 | 13 | −6 | 1.2 |
| GH19 | 14 | 21 | 0 | 0.1 | GT50 | 1 | 1 | 0 | 0.1 |
| GH20 | 3 | 6 | 0 | 0.2 | GT57 | 2 | 3 | −1 | 0.1 |
| GH27 | 4 | 9 | 1 | 0.5 | GT58 | 1 | 1 | 0 | 0.1 |
| GH28 | 68 | 85 | −4 | 1.4 | GT59 | 1 | 1 | 0 | 0.1 |
| GH29 | 1 | 3 | 0 | 0.4 | GT61 | 8 | 10 | 0 | 0.2 |
| GH31 | 5 | 12 | −2 | 0.8 | GT64 | 3 | 5 | 1 | 0.1 |
| GH32 | 8 | 8 | −1 | 0.7 | GT66 | 2 | 4 | −2 | 0.2 |
| GH33 | 1 | 1 | 1 | 0.1 | GT75 | 5 | 10 | −1 | 0.3 |
| GH35 | 18 | 23 | 1 | 0.3 | GT76 | 1 | 2 | 2 | 0.1 |
| GH36 | 6 | 17 | −2 | 1.9 | GT77 | 19 | 19 | 3 | 1.7 |
| GH37 | 1 | 4 | 0 | 0.9 | GT90 | 9 | 7 | 7 | 1.8 |
| GH38 | 4 | 9 | 1 | 0.5 | GT92 | 5 | 3 | 3 | 1.8 |
| GH43 | 2 | 1 | −1 | 0.9 | GT95 | 3 | 6 | 6 | 0.2 |
| GH47 | 5 | 6 | ‐1 | 0.2 | GT96 | 2 | 1 | 1 | 0.9 |
| GH51 | 2 | 6 | 3 | 0.8 | GT106 | 34 | 44 | 44 | 0.5 |
Families in P. trichocarpa not reported by Geisler‐Lee et al. (2006) or in A. thaliana not listed in the CAZy database (http://www.cazy.org/). The identified A. thaliana genes include CBM32 (At5g49570) and AA family genes listed in Table S8.
* and **, families differing in relative gene frequencies between P. trichocarpa and A. thaliana, taking into account the total number of genes in these species, 41377 and 27416, respectively (*P ≤ 0.10; **P ≤ 0.05, χ2 test).
CAZymes with specific expression pattern in one organ/tissue of aspen and expression value ≥1.2a
| Potri ID | Pt name | CAZy family | Expression (VST) | Specific in: |
|---|---|---|---|---|
| Potri.002G202100 | GH28 | 1.27 | Expanding flowers | |
| Potri.003G223500 | EXPN | 1.21 | Drought‐stressed leaves | |
| Potri.015G040700 | AA1 | 1.29 | ||
| Potri.008G010700 | GH28 | 1.20 | Petiole | |
| Potri.016G107900 | AA1 | 1.25 | Roots | |
| Potri.001G351600 | CBM43 | 1.24 | Mature seeds | |
| Potri.001G223700 | GH1 | 1.24 | ||
| Potri.014G082000 | EXPN | 2.02 | Developing phloem | |
| Potri.011G077400 | XTH10 | GH16 | 2.10 | |
| Potri.005G054000 | CBM18‐EXPN | 2.00 | Developing xylem |
Expression data from Sundell et al. (2015) and Immanen et al. (2016). All organs/tissues considered as listed in Table S9.
Figure 1Heat map of CAZyme expression patterns showing that the majority of CAZymes expressed in wood‐forming tissues of aspen (1187 genes) have maximum expression (shown as red color) at specific wood developmental stages, defined by the position (section number) on the bottom. Different developmental zones were defined based on the expression of marker genes (Sundell et al., 2017). The expression profiles were sorted by tissue specificity and maximum expression in corresponding tissue. Abbreviations: CA–RE, cambium and radial expansion zone; PW–SW, primary to secondary wall transition; SW, secondary wall formation zone. Data analyzed from AspWood database (http://aspwood.popgenie.org/aspwood-v3.0/).
Figure 2Variability in cell wall composition and in the CAZyme transcriptome related to the biosynthesis and degradation of different polymers across wood developmental zones. (a) Count of gene models associated with specific wood developmental zones (bar graph) and the composition of each group according to predicted functions (pie charts). Data based on Sundell et al. (2017). Gene metabolic classification as listed in Table S12. ‘Sugar’ category includes enzymes related to sucrose, raffinose, stachyose, galactinol and trehalose metabolism. Abbreviations: CA–RE, cambium–radial expansion zone; PW–SW, primary to secondary wall transition; SW, secondary wall formation zone. (b and c) Analysis of cell wall composition in different samples of developing secondary phloem and xylem by alditol acetates (excluding Glc) and uronic acid (UA) contents (b) and Klason lignin content (c). Cell wall samples were obtained by the sequential scraping of bark and wood surfaces exposed after peeling the bark. Alignment between transcriptome and cell wall samplings is shown by the gray shadow. Data in (b) and (c) are mean values from n = 3 technical replicates ± SE.
Figure 3The co‐expression networks of CAZymes in wood‐forming aspen tissues (http://aspwood.popgenie.org/aspwood-v3.0/). The CAZyme gene co‐expression at a context likelihood of relatedness threshold of 5 or higher was extracted as a network from popgenie (http://popgenie.org/exnet) and visualized in cytoscape 3.4.0. (a) The genes colored by developmental zones where they have peak expression as shown in Fig. 1. (b) The same network colored by the CAZyme metabolic function. ‘Uncharacterized’ class includes members of families with probable function in cell wall biosynthesis. The ‘Sugar’ category includes enzymes related to sucrose, raffinose, stachyose, galactinol and trehalose metabolism. Lists of all genes in the networks are given in Table S14. The guide genes used in the primary wall forming zone are marked by hexagons and those of the secondary wall are marked by diamonds. The first neighbors of the guide genes are listed in Tables S15 and S16.
Figure 4The difference in neighborhoods of genes encoding proteins of secondary wall xylan synthase complex Pt GT43B and Pt GT43C (a), and secondary wall cellulose synthase complex PtCesA8‐A and PtCesA8‐B (b). The co‐expressed genes were selected at threshold = 5 (http://aspwood.popgenie.org/aspwood-v3.0/). The blue nodes are unique neighbors of Pt GT43C (in a) and PtCesA8‐B (in b) and the orange nodes are unique for Pt GT43B (in a) and PtCesA8‐A (in b). The pink nodes are neighbors in common for both genes in (a) and (b). The transcription factors are marked by rectangles. Unless otherwise indicated, the Populus gene names are used in the figures.
Transcription factors co‐regulated with genes encoding enzymes synthesizing cellulose, mannan and xylan in secondary walls in the developing wood of aspen (AspWood, http://aspwood.popgenie.org/aspwood-v3.0/)
| Potri. ID | Potri name | AT ID | AT name | TF class | CesA7‐A Potri.006G181900 | CesA8‐A Potri.011G069600 | CesA8‐B Potri.004G059600 | GT43B Potri.016G086400 | GT43C Potri.007G047500 | GT47A‐1 Potri.001G068100 | GATL1‐B Potri.002G132900 | GATL1‐A Potri.014G040300 | GT47C Potri.009G006500 | CSLA1 Potri.008G026400 | GUX1‐A Potri.007G107200 | GUX1‐B Potri.005G061600 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 001G099800 | MYB10 | 1G63910 | MYB103 | MYB | + | + | + | |||||||||
| 001G112200 | KNAT7 | 1G62990 | IXR11, KNAT7 | TALE | + | + | + | |||||||||
| 001G258400 | C3H | + | + | + | + | + | ||||||||||
| 002G119400 | 3G49940 | LBD38 | LBD | + | + | + | + | + | ||||||||
| 002G257800 | 4G28610 | PHR1 | G2‐like | + | ||||||||||||
| 006G253800 | 5G25390 | SHN3 | ERF | + | + | + | + | + | ||||||||
| 007G135300 | NAC156 | 4G28500 | ANAC073, NAC073, SND2 | NAC | + | + | + | + | + | + | + | + | ||||
| 008G117500 | 1G68670 | HHO2 | G2‐like | + | + | + | + | |||||||||
| 009G045900 | 5G60440 | AGL62 | M‐type_MADS | + | + | + | + | + | + | |||||||
| 017G016700 | NAC154 | 4G28500 | ANAC073, NAC073, SND2 | NAC | + | + | + | + | + | + | ||||||
| 002G031000 | BLH3 | 2G16400 | BLH7 | TALE | + | + | ||||||||||
| 002G178700 | WND2B, SND1‐B2 | 2G46770 | ANAC043, EMB2301, NST1 | NAC | + | |||||||||||
| 003G132000 | MYB128 | 1G63910 | MYB103 | MYB | + | |||||||||||
| 006G138500 | 5G20730 | ARF7,BIP,IAA21,‐23,‐25, MSG1,NPH4,TIR5 | ARF | – | ||||||||||||
| 008G038900 | 4G32890 | GATA9 | GATA | + | ||||||||||||
| 001G141100 | 5G46690 | bHLH071 | bHLH | + | ||||||||||||
| 012G055700 | 1G73830 | BEE3 | bHLH | + | ||||||||||||
| 005G221400 | 2G42410 | ZFP11 | C2H2 | + |
+, positive; –, negative correlation