| Literature DB >> 25879735 |
Yong Jia1, Darren C J Wong2,3, Crystal Sweetman4,5, John B Bruning6, Christopher M Ford7.
Abstract
BACKGROUND: Sorbitol dehydrogenase (SDH, EC 1.1.1.14) is the key enzyme involved in sorbitol metabolism in higher plants. SDH genes in some Rosaceae species could be divided into two groups. L-idonate-5-dehydrogenase (LIDH, EC 1.1.1.264) is involved in tartaric acid (TA) synthesis in Vitis vinifera and is highly homologous to plant SDHs. Despite efforts to understand the biological functions of plant SDH, the evolutionary history of plant SDH genes and their phylogenetic relationship with the V. vinifera LIDH gene have not been characterized.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25879735 PMCID: PMC4404067 DOI: 10.1186/s12870-015-0478-5
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Figure 1Distribution of SDH homologous genes in higher plants. Closely related species were specified accordingly. The gene abundance heat map was based on the total copy number of SDH genes in each species. SDHs of P. bretschneideri [39] and E. japonica (loquat) [35] were obtained from literature; additional SDHs may be identified in these two species when complete genome information becomes available. The classification of SDH Class I and SDH Class II was based on the phylogenetic analysis carried out in the present study.
Figure 2Phylogenetic tree showing the evolutionary history of the angiosperm SDH family. A: A simplified schematic phylogeny of the SDH family inferred by MEGA 6.0 [97] software using the Neighbour Joining method. Values (as percentage, cutoff value 50) of Internal branch test (1000 replicates) supports are indicated above the corresponding branches. B: The Maximum Likelihood phylogeny of the SDH family developed by MEGA 6.0 [97] software using the selected best-fitting substitution model JTT + G [99]. 1000 times Bootstraping supports (cut off at 0.5) are displayed above corresponding branch. Closely related species are annotated accordingly. The V. vinifera LIDH (GSVIVT01010646001) is also marked.
Amino acid sequence identity between different SDH groups
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| 83.4 (71-99.7) | 75.2 (67-83) | 78.5 (71-86) | 77.5 (71-83) | 48.0 (44-50) | 40.9 (38-43) |
|
| 79.0 (71-99) | 73.2 (68-80) | 71.0 (67-74) | 46.4 (43-49) | 39.3 (37-42) | |
|
| 86.7 (83-99.7) | 75.7 (72-79) | 48.0 (47-50) | 41.4 (40-43) | ||
|
| 88.4 (86-93) | 47.4 (46-49) | 41.5 (40-45) | |||
|
| 87.8 (82-99.8) | 42.3 (39-44) | ||||
|
| 65.5 (48-99.7) |
SDH sequences were divided into six groups (Class I, Class II, A. coerulea, Monocot, Mammal and Yeast SDHs) according to the phylogenetic analysis carried out in the present study (Figure 2). The amino acid sequence identity (as percentage) was obtained using all-vs-all BLAST tool. The average pair-wise identity between each group is presented, followed by the identity range (in bracket).
Figure 3Multiple sequence alignment of plant SDH family. ESPript output was obtained with the sequence alignment of plant SDHs and human SDH. Secondary structures were inferred using human SDH (PDB: 1PL8) as a template, with springs representing helices and arrows representing beta-strands. Sequences were grouped into 1 (1PL8 and core Eudicot SDH Class I), 2 (core Eudicot SDH Class II), 3 (A.coerlea SDH) and 4 (monocot SDH). Amino acid site numbering above the alignment is according to LIDH (Q1PSI9) without the first 20 amino acids. Adjacent similarity amino acid sites were boxed in blue frame. Similarity calculations were based on the complete SDH alignments but only partial sequences for SDH Class I and SDH Class II were displayed. The active site residues identified in this study are marked with red triangles. Conserved domains are indicated above the alignment.
Gene duplication patterns of plant SDH
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
|
| cs9 | Cs9g16680.1 (orange1.1g017426m) | I | Tandem | 16143063 | 16147624 |
| cs9 | Cs9g16690.1 (orange1.1g048013m) | I | Tandem | 16150122 | 16154404 | |
| cs9 | Cs9g16660.1 (orange1.1g017793m) | II | WGD or Sgm | 16135216 | 16138066 | |
|
| eg11 | Eucgr. K00213.1 | I | Tandem | 2624187 | 2627945 |
| eg11 | Eucgr.K00212.1 | II | Tandem | 2615486 | 2618589 | |
|
| md1 | MDP0000786110 | I | Tandem | 25191824 | 25193641 |
| md1 | MDP0000873573 | I | Tandem | 25182502 | 25183812 | |
| md1 | MDP0000707567 | I | Tandem | 25180931 | 25182241 | |
| md1 | MDP0000515106 | I | Tandem | 25177288 | 25178612 | |
| md1 | MDP0000250546 | I | Tandem | 25173127 | 25174375 | |
| md1 | MDP0000874667 | I | Tandem | 25157544 | 25158783 | |
| md1 | MDP0000638442 | I | WGD or Sgm | 25149134 | 25150444 | |
| md1 | MDP0000123910 | I | WGD or Sgm | 25087036 | 25088743 | |
| md1 | MDP0000305455 | II | Dispersed | 14150327 | 14159200 | |
| md7 | MDP0000188052 | I | Tandem | 23301490 | 23302735 | |
| md7 | MDP0000171573 | I | WGD or Sgm | 23281847 | 23283529 | |
| md7 | MDP0000188054 | I | Proximal | 23310942 | 23312187 | |
| md7 | MDP0000167088 | I | Tandem | 23405354 | 23406795 | |
| md7 | MDP0000807470 | I | WGD or Sgm | 23390960 | 23392683 | |
| md14 | MDP0000759646 | I | Dispersed | 24043122 | 24044360 | |
|
| Pm5 | Pm019393 | I | WGD or Sgm | 23673441 | 23675177 |
| Pm6 | Pm021180 | II | Tandem | 7217228 | 7219256 | |
| Pm6 | Pm021179 | II | Tandem | 7217228 | 7225304 | |
|
| pp2 | ppa007458m|PACid:17644502 | I | WGD or Sgm | 24766424 | 24768515 |
| pp4 | ppa007327m|PACid:17655491 | II | WGD or Sgm | 17729024 | 17731238 | |
| pp8 | ppa007343m|PACid:17644328 | II | Tandem | 15254677 | 15256888 | |
| pp8 | ppa007374m|PACid:17655656 | II | Tandem | 15249947 | 15251989 | |
|
| pt12 | POPTR_0012s13780 | II | WGD or Sgm | 13789342 | 13787442 |
| pt12 | POPTR_0012s13790 | I | WGD or Sgm | 13790093 | 13792804 | |
|
| st01 | PGSC0003DMC400055323 | I | Dispersed | 1594220 | 1598967 |
| st06 | PGSC0003DMC400043871 | II | WGD or Sgm | 24156879 | 24158593 | |
|
| tc03 | Tc03_g019280 | I | WGD or Sgm | 18300080 | 18303115 |
| tc03 | Tc03_g019270 | II | WGD or Sgm | 18298897 | 18296706 | |
|
| vv16 | GSVIVT01010642001 | I | WGD or Sgm | 15653874 | 15651701 |
| vv16 | GSVIVT01010646001 | II | Tandem | 15675560 | 15678887 | |
| vv16 | GSVIVT01010644001 | II | Tandem | 15666264 | 15664425 |
SDH gene duplication patterns were characterized by the duplicate_gene_classifier program in the MCScanX package. “WGD or Sgm” refers to Whole Genome Duplication or segmental duplication. “SDH Class” is defined according to the present phylogenetic analysis. Notably, MDP0000149907 from M. domestica could not be anchored in any chromosome and was therefore absent in this table.
Figure 4Identification of collinear gene pairs among plant SDH families. A circular plot of SDH gene family collinearity. Collinear SDH genes are linked by red curved lines. SDH genes located at each position in corresponding chromosomes are indicated. Family collinearity is shown in the genomic collinearity background. Only those chromosomes containing SDH genes are included.
Natural selection tests of plant SDH
|
|
|
|
|
|
|---|---|---|---|---|
|
| ||||
| w(mono)=w (Aer)=w(sdhC1)=w(sdhC2) | 1 | -30147.4 | w(mono)=w(Aer)=w(sdhC1)=w(sdhC2)=0.10492 | Not Allowed (NA) |
|
| ||||
| w(mono)=w(Aer)=w(sdhC1)≠w(sdhC2) (two ratios) | 2 | -30141.1 | w(mono)=w(Aer)=w(sdhC1)=0.10415, w(sdhC2)=859.33956 | NA |
| w(mono)≠w(Aer)≠w(sdhC1)=w(sdhC2) (three ratios) | 3 | -30147.3 | w(mono)=0.10510, w(Aer)=0.10821, w(sdhC1)=w(sdhC2)=0.06935 | NA |
| w(mono)≠w(Aer)≠w(sdhC1)≠w(sdhC2) (four ratios) | 4 | -30140.8 | w(mono)=0.10428, w(Aer)=0.09731, w(sdhC1)=0.0001, w(sdhC2)=999 | NA |
| w(mono)=w(Aer)=w(sdhC1)≠w(sdhC2) (two ratios with w(sdhC2) fixed to 1) | 1 | -30141.4 | w(mono)=w(Aer)=w(sdhC1)=0.10424 (w(sdhC2)=1) | NA |
|
| ||||
| M1:Neutral (2 site classes) | 2 | -29650.0 | p0=0.87775 (p1=1-p0=0.12225); w0=0.07628 (w1=1) | NA |
| M2:Selection (3 site classes) | 3 | -29650.0 | p0=0.87775, p1=0.07499 (p2=1-p0-p1=0.04726); w0=0.07628 (w1=1), w2=1 | None |
|
| ||||
| Model A Null (4 site classes) | 3 | -29643.2 | p0=0.33951, p1=0.04783 (p2+p3=0.61266); w0=0.07544 | NA |
| Model A (4 site classes) | 4 | -29640.9 | p0=0.82864, p1=0.11666 (p2+p3=0.0547), w0=0.07544 (w1=1), w2=132.6226 | Sites for foreground lineage: 42H,43F,112G, 113S,116T, 270Q (p > 0.99); |
All calculations were implemented using codeml at PAML4.7. Different models were specified according to the software instruction. “np” refers to the number of parameters, “l = (ln L)” refers to the log value of the likelihood. The estimated parameters w and p refer to the Ka/Ks ratio and the percentage of the corresponding site classes respectively. In the one-ratio model M0 and the Branch-specific models, w(mono), w(Aer), w(sdhC1) and w(sdhC2) stand for the w ratios for the monocot, A. coerulea, SDH Class I and SDH Class II branches respectively. In the Site-specific models and the Branch-site models, w0, w1 and w2 represent the w ratios for the specific site classes in respective models (see the Methods section for more details). For the Branch-site models, the SDH Class II branch was specified as the foreground branch. Amino acid site numbering is according to LIDH (Uniprot No: Q1PSI9) without the first 20 amino acids.
Figure 5Homology models of Vv_LIDH and Vv_SDH and proposed reaction mechanisms. A. Structure superimposition of Vv_LIDH_idonate (green) and Vv_SDH_sorbitol (yellow) in Ribbon forms. B. The proposed reaction mechanism for Vv_LIDH on the oxidation of L-idonate into 5-keto-D-gluconate (5KGA). C. Superimposition of the active site residues of Vv_LIDH (green) and Vv_SDH (yellow). The distances (Å) between corresponding atoms are labelled. Target active site residues are shown in stick forms and labelled correspondingly. D. Hydrophobicity variance at Y42H between Vv_LIDH (green) and Vv_SDH (yellow) with red and white colours representing the highest hydrophobicity and the lowest hydrophobicity respectively. (All amino acid site numbering is according to LIDH (UniProt No: Q1PSI9) without the first 20 amino acids).
Figure 6Transcript and gene co-expression profiles of SDH in different plants. A. Expression profiles for Class I and Class II SDH genes in various tissues and developmental stages of V. vinifera. Class I and II SDH genes were moderately to highly expressed in most tissues (Log2 intensity > 10; 50th percentile of all gene expression values, see Methods). The heatmap was adjusted to colour ranges between log2 intensity of 5 (blue), 10 (white) and 15 (red) to illustrate low, moderate and high expression when compared to all other genes respectively. B. Expression profiles for Class I and Class II SDH gene in citrus. The heatmap was adjusted to colour ranges between log2 intensity of 4 (blue), 10 (white) and 14 (red) to illustrate low, moderate and high expression when compared to all other genes respectively. C. Heatmap of selected enriched GO terms (−log10 (adj. p-value) for genes co-expressed with SDHs from A. thaliana (At), V. vinifera (Vv), C. sinensis (Cs), P .trichocarpa [84], O. sativa (Os) and associated frequencies in the plants tested. Light and dark orange denote enrichment scores between 1 and 3 respectively. Highly enriched scores (>5) are coloured in red. Grey colour denotes no significant enrichment.