| Literature DB >> 27893862 |
Carlos Gaona-López1, Adriana Julián-Sánchez1, Héctor Riveros-Rosas1.
Abstract
BACKGROUND: Alcohol dehydrogenase (ADH) activity is widely distributed in the three domains of life. Currently, there are three non-homologous NAD(P)+-dependent ADH families reported: Type I ADH comprises Zn-dependent ADHs; type II ADH comprises short-chain ADHs described first in Drosophila; and, type III ADH comprises iron-containing ADHs (FeADHs). These three families arose independently throughout evolution and possess different structures and mechanisms of reaction. While types I and II ADHs have been extensively studied, analyses about the evolution and diversity of (type III) FeADHs have not been published yet. Therefore in this work, a phylogenetic analysis of FeADHs was performed to get insights into the evolution of this protein family, as well as explore the diversity of FeADHs in eukaryotes. PRINCIPALEntities:
Mesh:
Substances:
Year: 2016 PMID: 27893862 PMCID: PMC5125639 DOI: 10.1371/journal.pone.0166851
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Unrooted tree constructed with protein sequences that possess homology to iron-dependent ADHs.
2459 nonredundant protein sequences were retrieved from Protein Data Bank, Swiss Prot database, NCBI’s Conserved Domain Database, and Pfam database (using RP15 option to allow maximum representation of divergent proteins). Amino acid sequences were ascribed to protein families as considered by Pfam database (A) or NCBI’s Conserved Domain Database (B).
Protein subfamilies that comprise the FeADH family.
| CDD Protein Subfamily | Reported activity [properties]/Characterized proteins [accession number] | Reported structure | Phyletic distribution | Reference | ||
|---|---|---|---|---|---|---|
| Bacteria | Eukarya | Archaea | ||||
| LPO cd08176 | Yes | Yes | Yes | |||
| FucO | 2BL4; 2BI4; 1RRM | (Euglenozoa; Heterolobosea; | [ | |||
| DhaT | 3BFJ | [ | ||||
| DhaT | — | [ | ||||
| ADH3 ( | 4FR2 | [ | ||||
| DhaT | — | [ | ||||
| MDH | — | [ | ||||
| ADH2 | 3OX4 | [ | ||||
| ADH4 | — | [ | ||||
| ADH4 Schizosaccharomyces pombe [Q09669] | — | [ | ||||
| YiaY | — | [ | ||||
| MAR cd08177 | Yes | Yes | Yes | |||
| MacA1 Rhodococcus opacus 1CP [O84992] | — | (Haptophyceae; Stramenopiles; | [ | |||
| TfdFI (Reut_D6463) | — | [ | ||||
| TfdFII (Reut_D6471) | — | [ | ||||
| TcpD (Reut_A1589) | — | [ | ||||
| HxqD (Reut_B4129) | 3JZD | [ | ||||
| HqoD (Reut_B4694) | — | [ | ||||
| MAR (Ncgl2952 locus) | 3IV7 | [ | ||||
| MAR (Ncgl1112 locus) | — | [ | ||||
| DxnE (Swit_4891 locus) | — | [ | ||||
| TftE | — | [ | ||||
| LinF | — | [ | ||||
| CcaD | — | [ | ||||
| GraC | 3W5S | [ | ||||
| MacA (Atu2528 locus) | 3HL0 | Unpublished | ||||
| FUM7 | — | [ | ||||
| AAD-C cd08178 | Yes | Yes | No | |||
| ADHE Geobacillus thermoglucosidasius NCIMB 11955 [WP_013877698; C-terminal domain, 435–867 aa] [crystallized with Zn2+] | 3ZDR | (Amoebozoa; Alveolata; Diplomonadida; Cryptophyta; | [ | |||
| ADH2 | — | [ | ||||
| ADHE | — | [ | ||||
| NADPH-BDH cd08179 | Yes | No | Yes | |||
| AdhA | — | [ | ||||
| ADH1 | — | [ | ||||
| PDD cd08180 | Yes | No | No | |||
| — | [ | |||||
| PDD-like cd08181 | Yes | Yes | No | |||
| TM0920 gene of | 1O2D; 1VHD | (Diplomonadida) | [ | |||
| HEPD cd08182 | Yes | Yes | Yes | |||
| phpC | — | (Stramenopiles) | [ | |||
| fomC Streptomyces fradiae [ACG70833] | — | [ | ||||
| BDH cd08187 | Yes | Yes | No | |||
| YqhD Escherichia coli [Q46856] [crystallized with NADP+, Zn2+] | 1OJ7; 4QGS | (Stramenopiles; Amoebozoa; Parabasalidea) | [ | |||
| BDH (TM0820 locus) | 1VLJ | Unpublished | ||||
| HOT cd08190 | Yes | Yes | Yes | |||
| ADHFE1 Homo sapiens [Q8IWW8] | — | (Ichthyosporea; Apusozoa; Stramenopiles; Amoebozoa; Rhizaria; | [ | |||
| ADHFE1 Rattus norvegicus [Q4QQW3] | — | |||||
| HVD cd08193 | Yes | Yes | No | |||
| CpnD | — | (Haptophyceae; | [ | |||
| FeADH1 cd08185 | None characterized | — | Yes | No | Yes | |
| FeADH2 cd08183 | None characterized | — | Yes | Yes | Yes | |
| (Alveolata; Stramenopiles; Rhodophyta; | ||||||
| FeADH3 cd08184 | Yes | No | No | |||
| KdnB | — | [ | ||||
| FeADH4 cd08188 | None characterized | Yes | No | Yes | ||
| FeADH5 cd08189 | None characterized | Yes | Yes | No | ||
| (Euglenozoa) | ||||||
| FeADH6 cd08194 | None characterized | Yes | Yes | Yes | ||
| (Alveolata; Haptophyceae; Rhizaria; Ichthyosporea; | ||||||
| FeADH8 cd08186 | Yes | Yes | Yes | |||
| — | (Diplomonadida) | [ | ||||
| Thermococcus sp. AN1 [AAB63011] | — | |||||
| — | ||||||
| FeADH7 cd08192 | Yes | Yes | Yes | |||
| — | (Haptophyceae; Stramenopiles) | |||||
| HHD FeADH10 cd08191 | Yes | No | No | |||
| ChnD1 Brevibacterium sp. HCU [AAK73161] | — | [ | ||||
1 Found in Lokiarchaeum sp. GC14_75.
2 Personal communication from Dr. David Schleheck, University of Konstanz.
3 Found in Thaumarchaeota archeon SCGC AB-539-E09.
4 Found in Posidonia oceanica, a Mediterranean seagrass.
5 Found in Nannochloropsis gaditana, an oleaginous microalgae.
Fig 2Phylogenetic analysis of 538 Fe-ADH protein sequences retrieved from the NCBI’s Conserved Domain Database (CDD).
The unrooted phylogenetic tree was inferred using the Maximum Likelihood method based on the Le-Gascuel model [49]. Branches are colored according to the Conserved Domain Database Fe-ADH subfamily they belong. The tree with the highest log likelihood (-2505413,5328) is shown. Similar trees were obtained with maximum-parsimony, minimum-evolution and neighbour-joining methods. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.8682)). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. There were a total of 783 positions in the final dataset. The proportion of replicate trees in which the associated taxa clustered together in a bootstrap test (500 replicates) is given in color next to selected branches. Rectangles and triangles adjacent to each Fe-ADH subfamily name, indicate the presence of protein sequences from archaea domain (triangles), or eukarya domain (rectangles with A (animals), F (fungi), V (viridiplantae), and P (other eukaryotes) in each subfamily. Protein sequences from bacteria are present in all FeADH subfamilies.
Fig 3Phylogenetic analysis of 867 Fe-ADH protein sequences from eukaryotes plus 352 non-redundant sequences retrieved from the NCBI’s Conserved Domain Database (CDD).
The evolutionary history was inferred using the Maximum Likelihood method based on the Le-Gascuel model [1]. The tree with the highest log likelihood (-3414819.0869) is shown. Initial tree(s) for the heuristic search was/were obtained automatically applying Neighbor-Join and BioNJ algorithms to a matrix of pairwise distances estimated using a JTT model, and then selecting the topology with superior log likelihood value. A discrete Gamma distribution was used to model evolutionary rate differences among sites (5 categories (+G, parameter = 0.4901)). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The analysis involved 1219 amino acid sequences. There were a total of 996 positions in the final dataset.
Number of FeADH proteins from plants found in different subfamilies.
| Organism | Number of genes | |||
|---|---|---|---|---|
| cd08178 | cd08176 | cd08183 | cd08193 | |
| AAD-C | LPO | FeADH2 | HVD | |
| 3 | 0 | 0 | 0 | |
| 1 | 0 | 0 | 0 | |
| 2 | 0 | 0 | 0 | |
| 2 | 0 | 0 | 0 | |
| 1 | ||||
| 2 | 0 | 0 | 0 | |
| 0 | 1 | 0 | 0 | |
| 0 | 1 | 1 | 0 | |
| 0 | 0 | 1 | 0 | |
| 0 | 0 | 1 | 0 | |
| 0 | 0 | 1 | 0 | |
| 0 | 0 | 1 | 0 | |
| 0 | 0 | 0 | 1? | |
Fig 4Comparison of the different FeADH proteins with a known three-dimensional structure.
These proteins belong to five different FeADH protein subfamilies (sorted inside blue rectangles according to the protein subfamily to which they belong). Below each structure the scientific name of the organisms where the protein is found, as well as the PDB accession number is indicated in parenthesis. In a red rectangle are included representative structures of proteins of homolog protein families that belong to the DHQ-FeADH protein superfamily (cd07766). For reference, a structure prediction (performed with I-TASSER server; [77]) of human ADHFE1 (accession NP_653251), which belongs to HOT subfamily (cd08190), is also included. Numbers in dark red show sequence identity (I) and similarity (S) between human ADHFE1 sequence and the indicated proteins. Secondary structure elements are colored in rainbow successive colors, starting from blue for the N-terminus and ending with red at the C-terminus. Protein structures were drawn using UCSF Chimera version 1.9 [78]
Fig 5Multiple structure-based sequence alignment of FeADHs with a known 3D structure (residues 1–250 according to human ADHFE1).
These proteins belong to five different subfamilies of the FeADH family. For comparison, ADHFE1 sequence from human is included in the alignment, as well as four glycerol dehydrogenase sequences with a known three-dimensional structure. PDB accession number of each sequence is indicated at the left side of alignment, whereas the protein subfamily to which each sequence belongs, is in the right side of the alignment. Conserved β-strands and α-helices for each structure are indicated in yellow and green, respectively. Residue position determinant for coenzyme specificity is indicated with a red square. Residues involved in the binding of Fe atom are highlighted in pink; residues involved in the binding of Zinc atom in glicerol dehydrogenases are highlighted in grey. Amino acid residues from human ADHFE1 sequence, highlighted in blue and grey indicate positions that belong to the N-terminal or C-terminal domains, respectively. The three-dimensional alignment of FeADH structures was performed using the VAST tool at the NCBI’s server [43].
Fig 6Multiple structure-based sequence alignment of FeADHs with a known 3D structure (residues 251–467 according to human ADHFE1).
For additional details see caption of Fig 5.
Fig 7Sequence logos of selected positions in different FeADH subfamilies.
The sequences of FeADH family were sorted in subfamilies according to the results of the phylogenetic analysis. Numbering is according to the sequence of human ADHFE1. Residue in position 81 is determinant for coenzyme preference; residues in positions 242, 246, 330 and 357 are involved in metal binding. The amino acid residue coloring scheme was according to their chemical properties: polar (G, S, T, Y, C), green; neutral (Q, N), purple; basic (K, R, H), blue; acidic (D, E), red; and hydrophobic (A, V, L, I, P, M, W, F), black. FeADH subfamilies, whose members putatively use NADP+ as coenzyme, are enclosed with a blue box, those that use NAD+ as coenzyme, are enclosed with a red box, and those that use both NAD+ and NADP+, are enclosed in a green box. FeADH subfamilies with experimental support for coenzyme preference are indicated with an asterisk. Sequence logos were made using WebLogo 3 (http://weblogo.threeplusone.com) [51].