| Literature DB >> 19811687 |
Zhanyou Xu1, Dandan Zhang, Jun Hu, Xin Zhou, Xia Ye, Kristen L Reichel, Nathan R Stewart, Ryan D Syrenne, Xiaohan Yang, Peng Gao, Weibing Shi, Crissa Doeppke, Robert W Sykes, Jason N Burris, Joseph J Bozell, Max Zong-Ming Cheng, Douglas G Hayes, Nicole Labbe, Mark Davis, C Neal Stewart, Joshua S Yuan.
Abstract
BACKGROUND: As a major component of plant cell wall, lignin plays important roles in mechanical support, water transport, and stress responses. As the main cause for the recalcitrance of plant cell wall, lignin modification has been a major task for bioenergy feedstock improvement. The study of the evolution and function of lignin biosynthesis genes thus has two-fold implications. First, the lignin biosynthesis pathway provides an excellent model to study the coordinative evolution of a biochemical pathway in plants. Second, understanding the function and evolution of lignin biosynthesis genes will guide us to develop better strategies for bioenergy feedstock improvement.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19811687 PMCID: PMC3226193 DOI: 10.1186/1471-2105-10-S11-S3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Schema of the experiment.
Identification of lignin biosynthesis genes
| Enzymes | Domain investigated | E-value | NSH**** | NGH**** | NGH/NSH (%)**** |
|---|---|---|---|---|---|
| C3H | PTHR19383:SF44 | 1.00E-30 | 993 | 14 | 1.5 |
| F5H | PTHR19383:SF46 | 1.00E-30 | 990 | 15 | 1.5 |
| C4H | PTHR19383:SF33 | 1.00E-30 | 530 | 18 | 3.4 |
| 4CL | PTHR11968:SF43 | 1.00E-30 | 2212 | 120 | 5.4 |
| CAD | PTHR11695:SF38 | 1.00E-30 | 1022 | 110 | 10.8 |
| HCT | PF02458 | 1.00E-30 | 279 | 37 | 13.3 |
| CCOAMT | PTHR10509 | 1.00E-30 | 61 | 55 | 90.2 |
| CCR* | PTHR10366:SF9 | 1.00E-24 | 207 | 207 | 100.0 |
| PAL** | PF00221/TIGR01226 | 1.00E-30 | 55 | 55 | 100.0 |
| COMT*** | PIRSF005739 | 1.00E-08 | 187 | 187 | 100.0 |
| Total | 11 | Na | 6536 | 818 | Na |
Summary results from different steps of gene identification including Blastp and InterproScan analysis are presented along with the functional domains used for protein identification.
* We used E value cutoff of e-24 for CCR gene identification, which is supported by the functional domain data.
**The identification of PAL genes involved two domains.
*** The E value cutoff for COMT gene identification is e-08, which is supported by the functional domain data.
****Number of sequence homologs (NSH), Number of gene homologs (NGH), and Pentage of the NGH in the NSH were included.
CAD: Cinnamyl alcohol dehydrogenase
PAL: L-Phenylalanine ammonia-lyase
CCOAMT: Cinnamoyl CoA O-methyltransferase
COMT: Caffeic acid O-methyltransferase
4CL: 4-Coumarate:coenzyme A(CoA) ligase
CCR: Cinnamoyl-CoA reductase
C4H: Cinnamate-4-hydroxylase
HCT: Hydroxycinnamoyl transferase
C3H: 4-Coumarate 3-hydroxylase
F5H: Ferulate 5-hydroxylase
Summary of the species and genome resources used in the study
| Sample | Common Name | Name | Classification | Total transcripts | URL reference for genome |
|---|---|---|---|---|---|
| 1 | mushroom | Symbiotic fungus | 19,036 | ||
| 2 | microalgae | Prasinophyte | |||
| 3 | microalgae | Prasinophyte | |||
| 4 | microalgae | Prasinophyte | |||
| 5 | diatom | Bacillariophyte | 10025 | ||
| 6 | diatom | Bacillariophyte | 11390 | ||
| 7 | green algae | Chlorophyte | 14,598 | ||
| 8 | alga | Chlorophyte | 15,544 | ||
| 9 | moss | Bryophyte | 35938 | ||
| 10 | spike moss | Lycophytes | 34,697 | ||
| 11 | Sorghum | Monocot | 34496 | ||
| 12 | Arabidopsis | Dicot | 32016 | ||
| 13 | Poplar | Dicot | 45555 | ||
| 14 | Rice | Monocot | 67,393 | ||
| 15 | Medicago | Dicot | 44830 | ||
| Total | 388,667 |
Summary of number of lignin biosynthesis genes in each gene family across the 15 species studied.
| Species | CAD | CCoAMT | 4CL | CCR | PAL | C4H | HCT | COMT | C3H | F5H | Total |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | |
| 2 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 3 | |
| 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 4 | |
| 1 | 1 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 5 | |
| 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | |
| 4 | 2 | 0 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 10 | |
| 2 | 1 | 5 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 10 | |
| Volvox | 3 | 2 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 7 |
| Physcomitrella | 4 | 2 | 11 | 7 | 14 | 4 | 4 | 3 | 1 | 0 | 50 |
| Spike moss | 18 | 8 | 26 | 29 | 2 | 2 | 6 | 28 | 2 | 0 | 121 |
| Arabidopsis | 9 | 7 | 13 | 7 | 4 | 1 | 1 | 16 | 3 | 2 | 63 |
| Medicago | 21 | 4 | 10 | 18 | 4 | 1 | 6 | 26 | 1 | 3 | 93 |
| Sorghum | 14 | 7 | 15 | 44 | 8 | 3 | 4 | 41 | 2 | 3 | 141 |
| Rice | 5 | 11 | 16 | 55 | 14 | 4 | 9 | 38 | 1 | 3 | 157 |
| Poplar | 21 | 7 | 22 | 40 | 6 | 3 | 7 | 35 | 4 | 4 | 149 |
| Total | 110 | 55 | 120 | 207 | 55 | 18 | 37 | 187 | 14 | 15 | 818 |
Figure 2Phylogenetic analysis of C3H genes from seven species. The figure shows a clear distinction of three subgroups with the exception of one Arabidopsis genes. The groups include lower plant species (moss and spike moss), monocots (sorghum and rice), and dicots (poplar, Medicago, and Arabidopsis). The species is indicated by the first two or three letter of the gene name as follows; AT: Arabidopsis; MTR: Medicago; SEL: spike moss; OS: rice; POP: poplar; PHY: Physcomitrella; SB: sorghum.
Figure 3Phylogenetic analysis of F5H genes from five species. The analysis revealed clear separation between monocot and dicot species. The species is indicated by the first two or three letter of the gene names as follows: AT: Arabidopsis; MTR: Medicago; OS: rice; POP: poplar; SB: sorghum. The shaded area is a dicot group of genes. The gray box includes all dicot genes.
Figure 4Phylogenetic analysis of PAL genes from eight species including . The species is indicated by the first two letters of the gene names as described in Figure 2. AT: Arabidopsis; MTR: Medicago; SEL: spike moss; OS: rice; POP: poplar; PHY: Physcomitrella; SB: sorghum; LAC: mushroom; VOL: Volvox. The two Laccaria genes were circled by a blue box.
Distribution of the ten lignin biosynthesis gene families in rice and sorghum chromosomes
| Gene Family | Number of Chromosome in Sorghum | Number of Genes in Sorghum | Chromosome Name in Sorghum | Number of Chromosome In Rice | Number of Genes in Rice | Chromosome Name in Rice |
|---|---|---|---|---|---|---|
| C3H | 1 | 2 | 9 | 1 | 2 | 5 |
| C4H | 3 | 4 | 2,3,4 | 3 | 4 | 1, 2, 5 |
| CAD | 3 | 5 | 4,6,7 | 4 | 5 | 2,4,8,10 |
| CCOAMT | 3 | 3 | 2,7,10 | 3 | 3 | 6,8,9 |
| F5H | 3 | 3 | 1,2,5 | 3 | 3 | 3,6,10 |
| HCT | 3 | 4 | 2,4,6 | 4 | 4 | 2,6,4,9 |
| PAL | 3 | 7 | 1,4,6, | 4 | 7 | 2,4,11,12 |
| 4CL | 6 | 12 | 1,2,3,4,7,10 | 7 | 12 | 1,2,3,6,7,8,10 |
| COMT | 6 | 9 | 2,3,4,5,7,8 | 6 | 9 | 1,2,4,6,8,11 |
| CCR | 7 | 12 | 1,2,3,4,7,9,10 | 9 | 12 | 1,2,3,5,6,7,8,9,10 |
| Total | 38 | 44 |
The data summarize how many genes exist for each family and how many chromosomes contain these genes.
Figure 5Collinearity map for rice and sorghum based on the orthologs of lignin biosynthesis genes.
Figure 6Cluster analysis of lignin biosynthesis genes under different treatments. The logarithm 2 transformed ratios were used, and the color schema is as shown in the figure. The gene ID is shown on the right and the different treatments including insect, flood, salt, PEG (for osmotic stress), and low temperature are labeled. The blue box indicates that the paralog genes share a similar gene expression pattern (possible RR mode), while the pink box indicates that the paralog genes have evolved different gene expression patterns (possible RN mode).
Figure 7Biomass composition analysis and phylogenetic analysis of the COMT gene family. Panel A shows principle component analysis (PCA) of the MBMS data. Panel B shows the wet chemistry analysis of the sample. Panel C shows phylogenetic analysis of COMT gene families in all dicot species.