| Literature DB >> 23167570 |
Hanane Moummou1, Yvonne Kallberg, Libert Brice Tonfack, Bengt Persson, Benoît van der Rest.
Abstract
BACKGROUND: Short-chain dehydrogenases/reductases (SDRs) form one of the largest and oldest NAD(P)(H) dependent oxidoreductase families. Despite a conserved 'Rossmann-fold' structure, members of the SDR superfamily exhibit low sequence similarities, which constituted a bottleneck in terms of identification. Recent classification methods, relying on hidden-Markov models (HMMs), improved identification and enabled the construction of a nomenclature. However, functional annotations of plant SDRs remain scarce.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23167570 PMCID: PMC3541173 DOI: 10.1186/1471-2229-12-219
Source DB: PubMed Journal: BMC Plant Biol ISSN: 1471-2229 Impact factor: 4.215
Reference and size of the analyzed genomes
| Chlorophyte | Chlre4.1_Augustus9 | 15935 | [ | |
| Moss | proteins. Phypa1_1.FilteredModels | 35938 | [ | |
| Lycophyte | Selmo1_GeneModels_FilteredModels3 | 22285 | [ | |
| Eudicot | TAIR9 | 27379 | [ | |
| Eudicot | Populus.trichocarpa.v2.0 | 41377 | [ | |
| Eudicot | 12X March 2010 release | 26346 | [ | |
| Eudicot | Glyma1_pacId | 46367 | [ | |
| Monocot | MSU Rice Genome Annotation (Osa1) Release 6.1 | 40577 | [ | |
| Monocot | ZmB73_4a.53_working_translations | 102202 | [ | |
| Monocot | Sorbi1_GeneModels_Sbi1_4_aa | 34496 | [ |
The number of loci corresponds to the protein coding genes predicted by the annotation.
Figure 1Decision rules used to make an inventory of plant SDRs using three sets of HMM. All the HMM sets were run independently on the 10 predicted proteomes. The complete inventory and the ambiguous predictions are included as supplementary material (Additional file 2: Table S2 and Additional file 1: Table S1).
Figure 2Distribution of the SDR families in the analyzed plant genomes represented as a heat map. The heat map was built on a distribution matrix deduced from the inventory classification shown in Additional file 2: Table S2. The blue to red color gradient reflects the number of SDR listed in each family in the different genomes; the absence of family is indicated with a white square. The names of the families were deduced from the ‘SDR Nomenclature Initiative’ HMMs or by a representative gene accession for orphan families not recognized by a specific HMM.
Distribution of SDRs in different plants
| | ||||||
|---|---|---|---|---|---|---|
| 90 | 1 | 72 | 8 | 7 | ||
| 122 | 4 | 106 | 16 | 20 | ||
| 95 | 2 | 88 | 14 | 6 | ||
| 145 | 4 | 138 | 15 | 13 | ||
| 110 | 2 | 95 | 10 | 10 | ||
| 97 | 3 | 113 | 7 | 10 | ||
| 106 | 2* | 114 | 7 | 8 | ||
| 64 | 1 | 55 | 5 | 17 | ||
| 59 | 2 | 55 | 0 | 10 | ||
| 41 | 1 | 21 | 1 | 4 | ||
Families with low scores and no structural data (listed in Table S2) were omitted. *The presence of divergent SDRs in Sorghum bicolor was deduced from the Sbicolor_79_peptide annotation.
Classification of plant SDRs
| AT4G23420 | SDR7C | 49,4 | ||
| AT1G67730 | SDR12C | β-ketoacyl reductase (fatty acids elongation) | 48,4 | |
| AT3G12800 | SDR17C | - | 64,1 | |
| AT4G05530 | SDR25C | SDRA-IBR1 (indole-3-butyric acid response 1) | 67,7 | |
| AT3G03330 | SDR34C | - | 56,1 | |
| AT3G06060 | SDR35C | - | 47,9 | |
| AT4G09750 | SDR40C | - | 70,8 | |
| AT1G54870 | SDR57C | - | 58,0 | |
| AT5G06060 | SDR65C | Tropinone Reductase | 53,3 | |
| AT3G03980 | SDR68C | | 57,0 | |
| AT5G54190 | SDR73C | Protochlorophyllide Oxidoreductase | 74,5 | |
| AT3G50560 | SDR84C | - | 60,4 | |
| AT1G52340 | SDR110C | ABA2 (xanthoxin oxidase), Tasselseed2, Secoisolariciresinol dehydrogenase, Momilactone A synthase, Isopiperitenol dehydrogenase | 47,1 | |
| AT3G61220 | SDR114C | 45,4 | ||
| AT5G50600 | SDR119C | Hydroxysteroid Dehydrogenase | 44,4 | |
| AT3G55290 | SDR132C | 62,4 | ||
| AT1G24360 | SDR152C | FAS-II- β-ketoacyl reductase (FabG) | 68,3 | |
| AT1G10310 | SDR357C | Pterin aldehyde reductase (folate salvage) | 70,0 | |
| AT5G10050 | SDR368C | - | 45,8 | |
| AT4G27760 | SDR369C | 57,2 | ||
| AT2G05990 | SDR87D | Enoyl-ACP reductase (ENR) | 75,0 | |
| AT1G49670 | - | - | 50,6 | |
| AT3G01980 | - | 57,8 | ||
| AT4G13250 | - | NYC1/NOL (chlorophyll b reductase) | 48,1 | |
| AT4G20760 | - | - | 61,8 | |
| AT5G04070 | - | - | 52,7 | |
| AT4G10960 | SDR1E | UDP-D-glucose/UDP-D-galactose 4-epimerase,
UDP-arabinose 4-epimerase | 55,4 | |
| AT1G78570 | SDR2E | NDP-L-rhamnose synthase/epimerase | 74,7 | |
| AT5G66280 | SDR3E | GDP-mannose 4,6-dehydratase | 72,3 | |
| AT1G17890 | SDR4E | GDP-4-keto-6-deoxymannose-3,5-epimerase-4-reductase | 73,1 | |
| AT2G28760 | SDR6E | UDP-xylose synthase, UDP-glucuronic acid decarboxylase | 69,7 | |
| AT2G20360 | SDR22E | - | 60,1 | |
| AT1G47290 | SDR31E | 3β-hydroxysteroid-dehydrogenase/decarboxylase | 48,2 | |
| AT2G33630 | SDR42E | - | 66,2 | |
| AT4G30440 | SDR50E | UDP-D-glucuronate 4-epimerase | 61,3 | |
| AT4G33030 | SDR52E | UDP-sulfoquinovose synthase | 73,8 | |
| AT1G08200 | SDR67E | UDP-D-apiose/UDP-D-xylose synthase | 81,9 | |
| AT5G28840 | SDR93E | GDP-D-mannose 3′,5′-epimerase | 87,4 | |
| AT5G42800 | SDR108E | Dihydroflavonol 4-reductase, Anthocyanidin reductase,
Cinnamoyl-CoA reductase, Phenylacetaldehyde reductase,
Eutypine reductase | 36,6 | |
| GRMZM2G086773 | SDR115E | HC-toxin reductase | 55,0 | |
| AT5G22500 | SDR117E | fatty-acyl-CoA reductase | 46,8 | |
| AT4G24220 | SDR75U | VEIN PATTERNING 1 (VEP1), progesterone 5β-reductase | 53,4 | |
| AT4G35250 | SDR81U | - | 76,4 | |
| AT1G09340 | SDR83U | Chloroplast stem-loop binding protein | 50,7 | |
| AT5G18660 | SDR98U | 3,8-divinyl protochlorophyllide a 8-vinyl reductase | 62,5 | |
| AT5G02240 | SDR358U | - | 68,9 | |
| AT1G32100 (PLR-IFR) | SDR460A | Pinoresinol reductase, Isoflavone reductase, Vestitone reductase, Phenylcoumaran benzylic ether reductase, Eugenol synthase | 45,3 | |
| AT4G33360 | - | Farnesol NAD dehydrogenase | 63,2 | |
| AT4G00560 | - | 56,5 |
Each family was associated with a representative gene and, when possible, with a specific SDR nomenclature initiative HMM. Information on the occurrence of SDRs in different genomes are reported by the taxon name (ViridP: Viridiplantae; LandP: Embryophytae; TracheoP: Tracheophytae; FlowerP: Magnioliophyta). Average pairwise identities were calculated from the sequences of plant genomes. Ambiguously predicted SDRs and families absent in flowering plants were omitted. *: occurrence in Viriplantae was deduced from the presence of homologues in other Green Algae genomes.
Figure 3Phylogenetic tree of the SDR108E and SDR115E families. The blue arrow indicates the node at the origin of the ‘AnR, 4-DFR and SDR115E’ branch. Amino acid sequences recognized by the SDR108E and SDR115E HMMs were aligned with ClustalW algorithm. The evolutionary history was inferred using the Neighbor-Joining method. The percentages of replicate trees in which the associated taxa clustered together in the bootstrap test (500 replicates) are shown next to the branches. Full references of sequences compressed in different clusters are provided as supplemental data (Additional file 3: figure 1A). Consistent trees were obtained using the Maximum Likelihood method or rooting the tree with other SDR families (SDR1E, SDR6E, SDR31E) as outgroups.
Figure 4Diversification patterns of plant SDR families deduced from Principal Component Analysis. PCA was calculated on the distribution matrix shown in Figure 2. A) Scatter plot deduced from the two first components: the first and second axes respectively participate for 79% and 9% of the diversity. B) Contribution of different genomes (expressed as vectors) in the first and second axes values. Angiosperms genomes follow the order (anticlockwise): G. max, Z. mays, A. thaliana, S. bicolor, P. trichocarpa, O. sativa, V. vinifera .
Figure 5Phylogenetic tree of the SDR110C family. Amino acids sequences recognized by the SDR110C HMM were aligned with ClustalW algorithm. The evolutionary history and the bootstrap test (500 replicates) were computed as described for SDR108E (Figure 3). Full references of sequences compressed in different clusters are provided as supplemental data (Additional file 3: Figure S1B).
Figure 6Phylogenetic trees of the chlorophyllase b (NYC1/NOL) family. Amino acids sequences from the AT4G13250 family were aligned with ClustalW algorithm. The evolutionary history and the bootstrap test (500 replicates) were computed as described for SDR108E (Figure 3).
Figure 7Phylogenetic trees of three families involved in lipid primary metabolism. (A) SDR152C-FasII-β-keto-reductase (β-KR); (B): SDR87D-FasII-Enoyl-ACP-reductase (ENR); (C) SDR52E UDP-sulfoquinovose synthase (SQD1). Amino acids sequences recognized by the SDR152C, SDR87D and SDR52E HMMs were aligned with ClustalW algorithm. The evolutionary history and the bootstrap test (500 replicates) were computed as described for SDR108E (Figure 3).