| Literature DB >> 30081833 |
Hengfu Yin1,2, Hao-Bo Guo3, David J Weston1, Anne M Borland1,4, Priya Ranjan1,5, Paul E Abraham5,6, Sara S Jawdy1,5, James Wachira7, Gerald A Tuskan1,5, Timothy J Tschaplinski1,5, Stan D Wullschleger8, Hong Guo3, Robert L Hettich5,6, Stephen M Gross9,10, Zhong Wang9,11,12, Axel Visel9,11,12, Xiaohan Yang13,14.
Abstract
BACKGROUND: Crassulacean acid metabolism (CAM) enhances plant water-use efficiency through an inverse day/night pattern of stomatal closure/opening that facilitates nocturnal CO2 uptake. CAM has evolved independently in over 35 plant lineages, accounting for ~ 6% of all higher plants. Agave species are highly heat- and drought-tolerant, and have been domesticated as model CAM crops for beverage, fiber, and biofuel production in semi-arid and arid regions. However, the genomic basis of evolutionary innovation of CAM in genus Agave is largely unknown.Entities:
Keywords: Circadian rhythm; Comparative genomics; Crassulacean acid metabolism; Photosynthesis; Positive selection; Transcriptome
Mesh:
Substances:
Year: 2018 PMID: 30081833 PMCID: PMC6090859 DOI: 10.1186/s12864-018-4964-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Temporal expression of CAM and gene co-expression modules in Agave americana. a Diel expression pattern of selected co-expression modules in mature leaf, as identified from network analysis of RNA-Seq data with relevance to CAM physiology. The black and white bars indicate nighttime and daytime, respectively. b Diel expression pattern of other modules in mature leaf with distinct profiles. c Expression pattern in young leaves sampled at 3 time points and non-leaf tissues sampled at one time point (9 am). The number in the parentheses is the number of transcripts in each individual module
Biological processes over-represented (p < 0.02) in the co-expression module M11
| GO ID | GO Term | Corrected |
|---|---|---|
| GO:0009738 | Abscisic acid-activated signaling pathway | 4.2E-08 |
| GO:0016311 | Dephosphorylation | 6.2E-08 |
| GO:0071215 | Cellular response to abscisic acid stimulus | 9.0E-08 |
| GO:0009737 | Response to abscisic acid | 9.3E-08 |
| GO:0097306 | Cellular response to alcohol | 5.1E-07 |
| GO:0071396 | Cellular response to lipid | 6.6E-07 |
| GO:0006470 | Protein dephosphorylation | 7.0E-07 |
| GO:0097305 | Response to alcohol | 7.8E-07 |
| GO:0009611 | Response to wounding | 4.6E-06 |
| GO:0009745 | Sucrose mediated signaling | 4.8E-04 |
| GO:0009753 | Response to jasmonic acid | 6.3E-04 |
| GO:0019722 | Calcium-mediated signaling | 1.7E-03 |
| GO:0009788 | Negative regulation of abscisic acid-activated signaling pathway | 1.8E-03 |
| GO:1901420 | Negative regulation of response to alcohol | 1.8E-03 |
| GO:0009694 | Jasmonic acid metabolic process | 2.5E-03 |
| GO:1901419 | Regulation of response to alcohol | 2.6E-03 |
| GO:0009787 | Regulation of abscisic acid-activated signaling pathway | 2.6E-03 |
| GO:0009968 | Negative regulation of signal transduction | 2.7E-03 |
| GO:0010648 | Negative regulation of cell communication | 2.7E-03 |
| GO:0023057 | Negative regulation of signaling | 2.7E-03 |
| GO:0019856 | Pyrimidine nucleobase biosynthetic process | 2.7E-03 |
| GO:0010224 | Response to UV-B | 5.0E-03 |
| GO:0019932 | Second-messenger-mediated signaling | 6.0E-03 |
| GO:0010243 | Response to organonitrogen compound | 6.1E-03 |
| GO:0006835 | Dicarboxylic acid transport | 6.3E-03 |
| GO:0009875 | Pollen-pistil interaction | 6.3E-03 |
| GO:0010200 | Response to chitin | 6.5E-03 |
| GO:0009695 | Jasmonic acid biosynthetic process | 7.0E-03 |
| GO:0006206 | Pyrimidine nucleobase metabolic process | 7.4E-03 |
| GO:0006984 | ER-nucleus signaling pathway | 8.1E-03 |
| GO:0071324 | Cellular response to disaccharide stimulus | 8.3E-03 |
| GO:0071329 | Cellular response to sucrose stimulus | 8.3E-03 |
| GO:0042538 | Hyperosmotic salinity response | 9.6E-03 |
| GO:0030968 | Endoplasmic reticulum unfolded protein response | 1.1E-02 |
| GO:0015743 | Malate transport | 1.2E-02 |
| GO:0034620 | Cellular response to unfolded protein | 1.3E-02 |
| GO:0035967 | Cellular response to topologically incorrect protein | 1.3E-02 |
| GO:0010118 | Stomatal movement | 1.3E-02 |
| GO:0006986 | Response to unfolded protein | 1.3E-02 |
| GO:0048544 | Recognition of pollen | 1.5E-02 |
| GO:0008037 | Cell recognition | 1.6E-02 |
| GO:0009827 | Plant-type cell wall modification | 1.8E-02 |
| GO:0015740 | C4-dicarboxylate transport | 1.9E-02 |
Fig. 2Comparative analysis of protein sequences among CAM and non-CAM plant species. a Plant species used in comparative genomics analysis. b Ortholog groups in 15 plant species as identified by OrthoMCL. Number of ortholog groups is listed in each of the ortholog clades. c Percent of ortholog clade were predicted to be transcription factors in Agave americana. “a” and “b” indicate that the transcription factors are over-represented (p < 0.05) and under-represented (p < 0.05), respectively. d Percent of ortholog clade undergoing positive selection (i.e., nonsynonymous to synonymous substitution ratio (Ka/Ks) > 1, as calculated from Agave-Arabidopsis gene pairs with a sliding window of 50 amino acids). “*” indicates that the ortholog clade was over-represented (p < 0.0001) by Agave genes with Ka/Ks ratio greater than 1. Clade NVP:C3:CAM:C4 is shared by NVP, C3, CAM, and C4; NVP:C3:CAM shared only by NVP, C3, and CAM; C3:CAM:C4 shared only by C3, CAM, and C4; C3:CAM shared only by C3 and CAM; and CAM-only is specific to CAM species
Fig. 3Positive selection region in phosphoenolpyruvate carboxylase kinase (PPCK1). a Ka/Ks profile of Agave americana (Aa) versus Arabidopsis thaliana (At); b superimposed structures in Aa and At, with the positive selection region highlighted; (c) Ka/Ks profile of Aa versus Oryza sativa (Os); d superimposed structures in Aa and Os, with the positive selection region highlighted; e Ka/Ks profile of Aa versus Zea mays (Zm); f superimposed structures in Aa and Zm. An ATP substrate that may bind to the Aa PPCK1 is marked by an arrow. The proteins are colored in grey for Aa, blue for At, green for Os and red for Zm. g A snapshot of PPCK1 (Aam048341) structure model revealing the positive selected sites. The PPCK1 model bound with an ATP substrate (blue surface) is after a 1-us MD simulation. K42 (codon 124, at the C-end of β3) and N73 (codon219, at the N-end of β4) are located at the two strands connecting to the αC helix, the orientation of which is known to be involved in the activation of the kinase
The positively selected sites of PPCK1 and CT-BMY under models from HYPHY (REL, FUBAR). The sites were listed as positively selected sites if they had a posterior probability greater than 80%
| Codon(FUBAR) | α | β | β-α | Posterior Prob β > α | Emp. Bayes Factor | PSRF | Neff | |
| PPCK1 | 124* | 0.81 | 2.82 | 2.01 | 0.83 | 10.81 | 1.00 | 1292.65 |
| 219* | 0.81 | 2.71 | 1.90 | 0.82 | 10.27 | 1.00 | 1327.29 | |
| CT-BMY | 73* | 0.60 | 1.93 | 1.34 | 0.83 | 15.97 | 1.01 | 491.79 |
| Codon(REL) | E[dS] | E[dN] | Normalized E[dN-dS] | Posterior Probability | Bayes Factor | |||
| PPCK1 | 124* | 0.96 | 3.43 | 2.47 | 0.95 | 307.87 | ||
| 219* | 0.97 | 3.41 | 2.44 | 0.94 | 268.09 | |||
| 267 | 1.07 | 3.31 | 2.24 | 0.90 | 155.82 | |||
| 273 | 1.06 | 3.32 | 2.25 | 0.90 | 165.39 | |||
| 275 | 1.09 | 3.14 | 2.05 | 0.85 | 97.75 | |||
| 276 | 1.07 | 2.85 | 1.78 | 0.77 | 58.35 | |||
| CT-BMY | 73* | 0.94 | 0.85 | −0.0961 | 0.75 | 52.6 |
The * sites indicated they were identified by both models
Fig. 4Diel shift in gene expression pattern between Agave americana and Arabidopsis thaliana. a Morning-to-night shift with peak expression during morning in Arabidopsis and during night in Agave. b Afternoon-to-night shift with peak expression during afternoon in Arabidopsis and during late night in Agave. See gene annotation in Additional file 14: Table S11 and Additional file 15: Table S12
Distribution of gene co-expression modules in the gene sets with positive selection and diel shift in gene expression pattern, respectively, in Agave americana. Gene set 1a includes Agave genes with Ka/Ks ratio greater than one in the Agave-Arabidopsis, Agave-Oryza and Agave-Zea orthologous gene pairs. Gene set 1b includes Agave genes with Ka/Ks ratio greater than one in Agave-Arabidopsis and Agave-Oryza but not Agave-Zea pairs. Gene sets 2a and 2b includes Agave genes with morning-to-night shift and afternoon-to-night shift, respectively, in expression pattern as compared with the orthologous genes in Arabidopsis. The numbers represent the observed and expected (in parentheses) number of genes
| Co-expression module | Gene set 1a | Gene set 1b | Gene set 2a | Gene set 2b |
|---|---|---|---|---|
| M01 | 14 (27) | 17 (16) | 10 (4)* | 5 (4) |
| M02 | 13 (7)* | 4 (4) | 2 (1) | 0 (1) |
| M03 | 16 (7) ** | 10 (4)* | 0 (1) | 0 (1) |
| M04 | 1 (5) | 0 (3) | 0 (1) | 0 (1) |
| M05 | 0 (4) | 0 (2) | 1 (1) | 0 (1) |
| M06 | 1 (4) | 1 (2) | 0 (1) | 3 (1)* |
| M07 | 5 (10) | 1 (6) | 1 (1) | 2 (1) |
| M08 | 2 (2) | 0 (1) | 0 (0) | 0 (0) |
| M09 | 1 (9) | 2 (5) | 0 (1) | 1 (1) |
| M10 | 49 (28)** | 23 (16) | 0 (4) | 0 (4) |
| M11 | 5 (5) | 2 (3) | 4 (1)* | 0 (1) |
| M12 | 0 (3) | 0 (2) | 0 (0) | 0 (0) |
| M13 | 3 (11) | 2 (6) | 3 (2) | 10 (1)** |
| M14 | 2 (8) | 2 (5) | 0 (1) | 0 (1) |
| M15 | 0 (2) | 1 (1) | 0 (0) | 0 (0) |
| M16 | 1 (2) | 1 (1) | 1 (0) | 0 (0) |
| Non-module | 47 (25)** | 28 (15)* | 0 (4) | 0 (3) |
| Total | 160 | 94 | 22 | 21 |
*Overrepresentation (FDR adjusted p-value< 0.05, cumulative Poisson distribution) of co-expression modules in each category. **Overrepresentation (FDR adjusted p-value< 0.01, cumulative Poisson distribution) of co-expression modules in each category
Fig. 5Diel gene expression pattern of circadian system genes in Agave americana and Arabidopsis thaliana. In the circular heatmaps, the outer and inner rings represent Agave americana and Arabidopsis thaliana, respectively. The black and white half-circles inside the circular heatmaps indicate night-time and day-time, respectively. Full gene names are listed in Additional file 18: Table S13
Fig. 6Functionally annotated Agave genes showing positive selection and diel rewiring of expression pattern relative to C3 plants. a Genes involved in circadian clock. A gene involved in stomatal opening. c Genes involved in carboxylation, malate transport, decarboxylation, and starch/sugar metabolism. d Genes involved in photosynthetic electron transport chain. Red circles indicate positive selection. Green circles indicate morning-to-night shift in peak gene expression. Yellow circles indicate afternoon-to-night shift in peak gene expression. AGP16, Arabinogalactan protein 16; AKT2, Arabidopsis Shaker family K+ channels 2/3; CT-BMY, Chloroplast Beta-Amylase; ELF3, Early Flowering 3; GAUT7, Galacturonosyltransferase 7; LHY, Late Elongated Hypocotyl; LUX, Phytoclock 1; PPCK1, Phosphoenlpyruvate Carboxylase Kinase 1; PPDK-RP, Pyruvate orthophosphate dikinase regulatory protein; SS2, Starch Synthase 2; TDT, Tonoplast Dicarboxylate Transporter