Literature DB >> 25071412

Searching for convergent evolution in manganese superoxidase dismutase using hydrophobic cluster analysis.

Heng Xiang1, Ruizhi Zhang2, Nengzhang Li1, Charles R Vossbrinck3.   

Abstract

There are numerous examples of convergent evolution in nature. Major ecological adaptations such as flight, loss of limbs in vertebrates, pesticide resistance, adaptation to a parasitic way of life, etc., have all evolved more than once, as seen by their analogous functions in separate taxa. But what about protein evolution? Does the environment have a strong enough influence on intracellular processes that enzymes and other functional proteins play, to evolve similar functional roles separately in different organisms? Manganese Superoxide Dismutase (MnSOD) is a manganesedependant metallo-enzyme which plays a crucial role in protecting cells from anti-oxidative stress by eliminating reactive (superoxide) oxygen species. It is a ubiquitous housekeeping enzyme found in nearly all organisms. In this study we compare phylogenies based on MnSOD protein sequences to those based on scores from Hydrophobic Cluster Analysis (HCA). We calculated HCA similarity values for each pair of taxa to obtain a pair-wise distance matrix. A UPGMA tree based on the HCA distance matrix and a common tree based on the primary protein sequence for MnSOD was constructed. Differences between these two trees within animals, enterobacteriaceae, planctomycetes and cyanobacteria are presented and cited as possible examples of convergence. We note that several residue changes result in changes in hydrophobicity at positions which apparently are under the effect of positive selection.

Entities:  

Keywords:  convergence; hydrophobic cluster analysis; manganese superoxide dismutase; phylogeny; structure

Year:  2014        PMID: 25071412      PMCID: PMC4094605          DOI: 10.1590/s1415-47572014005000008

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Superoxide dismutases are a group of metalloenzymes that are present in the vast majority of organisms thus far studied. They are responsible for the dismutation of reactive superoxide (oxygen molecule with an extra electron O2−) to peroxide and water as follows: 2 O2− + 2 H2O = O2 +H2O2 and 2 OH− (Fridovich, 1995). In concert with catalase, the peroxide is further to broken down to H2O and O2 (Belinky ). There are three separate superoxide dismutases based on the metal cofactor associated with the enzyme: iron SODs (FeSODs) found in prokaryotes, protists and plants, copper/zinc SODs (Cu/ZnSODs) found in bacteria and the cytosol of eukaryotes, and manganese SODs (MnSODs) found in prokaryotes and the mitochondrial matrix of eukaryotes (Natvig ). MnSODs are usually found as homodimers in bacteria or homotetramers in eukaryotes (Natvig ). Each MnSOD monomer contains two domains (C- and N-terminal) connected by a loop. The C-terminal domain consists of a three-stranded antiparallel beta-sheet and four helices, while the N-terminal one consists of two helices folded into an antiparallel hairpin, along with a left-handed twist (Guan ). It has been shown that superoxide acts against oxidative stress and that SOD mutants exhibit additional requirements for methionine and lysine, and may have shortened life spans and are oxygen intolerant (Fridovich, 1995; Alscher ). Because of its crucial role in preventing damage by superoxide ions, MnSODs have been found in nearly all species reported to date. Hydrophobic Cluster Analysis (HCA) was developed for structural comparisons between proteins based on primary sequence information, in cases where X-ray crystal structure was difficult if not impossible to obtain (Gaboriaud ). It has been pointed out that the sequence variation based on the 20 essential amino acids is much greater than the number of observed variations in protein secondary and tertiary structure. That is to say that common 2-D and 3-D folding patterns can consist of very different amino acid sequences (Lemesle-Varloot ). There is, for example, an 80% similarity based on HCA in the hydrophobic regions between human hemoglobin α-chain and lupin leghemoglobin while there is only a 15% sequence identity (Gaboriaud ). A previous study applied HCA to the analysis of a tandem mutation of superoxide dismutase from the microsporidium Nosema bombycis to look for structural differences between these isozymes (Xiang ). The goal was to understand whether this duplication represents the evolution of an SOD with different functional characteristics, or simply a duplication of the same protein to produce more SOD for the same purpose. The aim of this present study was to compare phylogenies based on a combination of (maximum likelihood, maximum parsimony, neighbor joining and Bayesian) analyses of amino acid sequence data, from a wide variety of (48) organisms, with one based on values obtained through HCA to look for structural and possible functional convergence within these proteins.

Materials and Methods

Obtaining sequence data

Amino acid sequences of 48 homologous MnSODs (manganese superoxide dismutase) were obtained from GenBank and their accession numbers are listed as follow: Anabaena variabilis (YP_321963), Apis mellifera (XP_393570), Arabidopsis thaliana (NP_187703), Aspergillus fumigatus cytoplasm (Q92450), Aspergillus fumigatus mitochondria (EAL89634), Blastopirellula marina (ZP_01089421), Blumeria graminis (AAL56985), Candida albicans (EAK99504), Candidatus Nitrosopumilus maritimus (ZP_02024558), Cenarchaeum symbiosum (ABK76972), Comamonas testosteroni (ZP_01519759), Cordyceps militaris (AAO47725), Crocosphaera watsonii (ZP_00514026), Danio rerio (NP_956270), Debaryomyces hansenii (CAG90212), Delftia acidovorans (ZP_01581157), Escherichia coli (P00448), Gallus gallus (NP_989542), Gibberella zeae cytoplasm (EAA69573), Gibberella zeae mitochondria (EAA73238), Gloeobacter violaceus (NP_923628), Homo sapiens mitochondria (P04179), Hyphomonas neptunium (YP_760332), Kluyveromyces lactis (CAG99194), Leptolyngbya boryana (P50058), Magnaporthe grisea cytoplasm (EAA53420), Magnaporthe grisea mitochondria (EAA48554), Methylobacillus flagellatus (YP_544498), Nematostella vectensis (XP_001641079), Neurospora crassa cytoplasm (EAA32343), Neurospora crassa mitochondria (EAA30249), Nodularia spumigena (ZP_01630049), Nostoc punctiforme (ZP_00112125), Oryza sativa (AAA57131), Paracoccidioides brasiliensis (AAW83518), Penicillium chrysogenum (AAC36585), Planctomyces maris (ZP_01856783), Saccharomyces cerevisiae (NP_011872), Schizosaccharomyces pombe (CAB62411), Stigmatella aurantiaca (ZP_01463295), Tatumella ptyseos (AAQ14590), Thermosynechococcus elongatus (NP_680827), Tribolium castaneum (XP_972440), Trichodesmium erythraeum (YP_723986), Wolinella succinogenes (NP_907960), Xenopus laevis (NP_001083968), Yarrowia lipolytica (CAG82903) and Yersinia enterocolitica (P53655).

Hydrophobic cluster analysis

The software developed for Hydrophobic Cluster Analysis creates a two dimensional plot of the protein based on the amino acid sequence. The HCA plots of each MnSOD protein were drawn using the Drawhca program (Woodcock ). This plot is presented, as it would be on a cylinder with 3.6 amino acids per turn shown as a classical α-helix. After five turns, the residues i and i + 18 are placed at the same locations on the cylinder. Then, the cylinder is separated along its axis and unrolled. As some adjacent residues would be widely cut, the representation is duplicated making it easier to follow and allowing a visualization of the environment of each amino acid residue. The HCA plot is then defined by encircling the adjacent hydrophobic residues (tryptophans, tyrosines, methionines, phenylalanines, isoleucines, leucines, valines) and marking the prolines (asterisk) and glycines (closed rhomb) as presenting loops and the cystines as involving disulphide bonds (Gaboriaud ). We then developed a numerical score between each amino acid sequence pair to compare their structure similarity. To do this, Perl scripts were written to align the sequences (fasta format) based on their correspondence to the HCA plot and obtained the CR numbers by counting corresponding hydrophobic residues between each pair of sequences. HCA similarity scores were obtained by calculating the number of hydrophobic amino acids via the following formula: HCA similarity score (%) = (2CR x 100)/(RC1 + RC2), where RC1 (RC2) is the number of hydrophobic residues in protein 1 (protein 2), and CR is the number of hydrophobic residues which are in correspondence between the two sequences (Gaboriaud ).

Constructing and comparing trees

Based on the HCA similarity scores between each pair of MnSOD proteins, we obtained a distance matrix (distance=1-HCA similarity percentage), which was then used to construct a HCA tree by UPGMA method of PAUP software (Swofford, 2002). We also used the nearest neighor interchange (NNI) search algorithm in PAUP to complete a Maximum Parsimony tree for the sequence identities of MnSOD proteins, after their amino acid sequences were aligned using CLUSTAL X software (Thompson ). Based on this multiple sequence alignment, a Neighbor Joining tree was reconstructed through the Poisson model of MEGA 4 (Tamura ). The WAG matrix (Whelan and Goldman, 2001) with a gamma of 1.092, which was considered as the most suitable substitution model using ProtTest (Abascal ), was used to construct a Maximum Likelihood tree using PhyML (Guindon and Gascuel, 2003). All bootstrap values were acquired using 500 replicates. A phylogram based on Bayesian analysis was also constructed by Phylobayes 3.3f (Lartillot ), using the WAG model with a default gamma distribution and a saving frequency Convergent evolution in MnSOD of 100 generations. Due to their similar topologies, the four phylogenetic trees (MP, NJ, ML and Bayesian) were combined into one common tree using the MacClade software package (Maddison and Maddison, 2000). Visual comparisons between the HCA tree and the common tree were accomplished using the Treejuxtaposer program (Munzner ). This program can be used to browse many trees and compare them side by side to examine topological differences by controlling the BCN (Best Corresponding Node) score. The lowest BCN threshold was chosen to check the differences between each pair of trees.

Selective test

In order to find out whether the observed hydrophobic cluster convergences are under natural selection, DnaSP v5 software (Rozas ) was used to calculate the ratios of the nonsynonymous (Ka) to synonymous (Ks) substitution rates for pairwise genes. The selection of each individual residue within these genes was also analyzed by the site models (M1a, M2a, M7, and M8) of the CODEML software from PAML (Yang, 1997). The posterior probabilities were calculated using the Bates empirical Bates (BEB) algorithm.

Results

Trees comparison

Figure 1 is a phylogenetic comparison of MnSOD based on Maximum Parsimony, Neighbor Joining, Maximum Likelihood, and Bayesian analysis (see Supplementary Material Figures S1–S4 for individual trees) with one based on the pair-wise comparison using hydrophobic cluster analysis of the same protein. The common tree presented in Figure 1 shows relatively high bootstrap values for most of the branch points and high agreement for the analyses performed. Phylogenetic relationships based on amino acid sequence (common tree) of this protein correspond well with analyses using other genes (Ciccarelli ). All three analyses separated the taxa into the three accepted domains of life; Eukarya, Archaea and Bacteria. The eukaryotes are separated into three common kingdoms (Fungi, Animalia, and Plantae). The fungi show a dichotomy into cytosolic and mitochondrial forms of their MnSODs as previously reported (Frealle ). The fungal taxa presented are all Ascomycetes and conform to the relationships presented in earlier works constructing a 6-gene phylogeny (James ). Interestingly the HCA tree conforms better to the 6-gene tree in one minor respect, with Candida albicans and Debaryomyces hansenii being sister taxa (Kurtzman and Robnett, 1998).
Figure 1

Comparison of phylogenetic trees, based on the amino acid sequences of the MnSOD gene (left, Common tree) and the pairwise hydrophobic cluster analysis of the same gene (right, HCA tree). All organisms were classified clearly in both trees. Nodes with bootstrap values of maximum likelihood, neighbor joining and maximum parsimony methods, and Bayesian posterior probabilities respectively are shown in the consensus tree (order: ML/NJ/MP/Bayesian). The species indicated in red show differences between two trees as determined by the Tree Juxtaposer program based on the lowest BCN threshold.

Comparison of phylogenetic trees, based on the amino acid sequences of the MnSOD gene (left, Common tree) and the pairwise hydrophobic cluster analysis of the same gene (right, HCA tree). All organisms were classified clearly in both trees. Nodes with bootstrap values of maximum likelihood, neighbor joining and maximum parsimony methods, and Bayesian posterior probabilities respectively are shown in the consensus tree (order: ML/NJ/MP/Bayesian). The species indicated in red show differences between two trees as determined by the Tree Juxtaposer program based on the lowest BCN threshold. While we did not see differences between the two trees at the higher levels of domain and kingdom, except perhaps for the Archaea (Figure 1), we did see differences at the phylum to generic levels. The red branches shown in Figure 1, especially the five groups of Table 1 highlight significant differences between the common tree and that created by HCA values.
Table 1

Differences between sequence identity scores and HCA similarity scores for MnSOD genes within four groups. Non-synonymous (Ka) and synonymous (Ks) substitution rates for pairwise MnSOD were also shown. n.a., not available.

GroupSequence 1Sequence 2HCA similarity score (%)Amino acid identitiy score (%)KaKsKa/Ks
1H. sapiensG. gallus95830.0881.1290.078
H. sapiensX. laevis94.4810.125n.a.n.a.
H. sapiensD. rerio96.5800.139n.a.n.a.
G. gallusX. laevis96.5820.1201.5520.078
G. gallusD. rerio95.8790.1294.6100.028
X. laevisD. rerio93.7790.1462.3180.063
2E. coliT. ptyseos97.6880.0770.8470.091
E. coliY. enterocolitica98.5850.1000.9870.102
T. ptyseosY. enterocolitica96.1820.1201.4750.082
3C. symbiosumC. N. maritimus82.6550.367n.a.n.a.
C. symbiosumD. acidovorans85.7500.4721.1780.400
C. symbiosumC. testosteroni82.8460.4812.0070.240
C. symbiosumM. flagellatus84.6520.4632.5730.180
C. symbiosumB. marina84.6550.404n.a.n.a.
C. symbiosumP. maris85550.367n.a.n.a.
C. N. maritimusD. acidovorans81.7450.524n.a.n.a.
C. N. maritimusC. testosteroni80.6430.547n.a.n.a.
C. N. maritimusM. flagellatus83.5450.525n.a.n.a.
C. N. maritimusB. marina80.6490.479n.a.n.a.
C. N. maritimusP. maris80.9490.496n.a.n.a.
D. acidovoransC. testosteroni93.9710.2100.6200.339
D. acidovoransM. flagellatus92.9700.2201.7120.128
D. acidovoransB. marina92.9680.2331.6360.142
D. acidovoransP. maris91.9660.2501.7250.145
C. testosteroniM. flagellatus93690.2442.0240.120
C. testosteroniB. marina93620.2921.5480.189
C. testosteroniP. maris92.1630.289n.a.n.a.
M. flagellatusB. marina96.8690.250n.a.n.a.
M. flagellatusP. maris81.5690.2401.8470.130
B. marinaP. maris97.5790.166n.a.n.a.
4C. watsoniiA. variabilis81.2460.451n.a.n.a.
C. watsoniiL. boryana87470.4982.4380.204
C. watsoniiT. erythraeum81.8460.4681.3270.352
A. variabilisL. boryana88.6550.302n.a.n.a.
A. variabilisT. erythraeum80.9520.3932.6520.148
L. boryanaT. erythraeum86.4570.452n.a.n.a.
Differences between sequence identity scores and HCA similarity scores for MnSOD genes within four groups. Non-synonymous (Ka) and synonymous (Ks) substitution rates for pairwise MnSOD were also shown. n.a., not available.

Animal relationships

The relationships among the Metazoa, as seen in the common tree, show a close correlation with long standing conventional wisdom and with previous phylogenies based on other molecular data. We can clearly see from the comparisons of Figure 1 and the scores of Table 1 and Figure 2 that, although the human (Homo sapiens) MnSOD sequence is more similar to bird (Gallus gallus MnSOD) (83%) than Danio rerio MnSOD (80%), based on amino acid sequence, the structure based on HCA of H. sapiens MnSOD is more similar to that of D. rerio MnSOD (96.5%) than G. gallus MnSOD (95.0%).
Figure 2

Comparison of the HCA structure and amino acid sequence of four vertebrates MnSODs. Note: The similarity in HCA between fish (D. rerio) and man (H. sapiens). (A) HCA plot of MnSOD. The vertical red box shows a region of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationshipof the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal box encloses a region corresponding to the regions enclosed by the vertical box in A. The green boxes enclose the positively selected residues detected by PAML in Table 2.

Comparison of the HCA structure and amino acid sequence of four vertebrates MnSODs. Note: The similarity in HCA between fish (D. rerio) and man (H. sapiens). (A) HCA plot of MnSOD. The vertical red box shows a region of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationshipof the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal box encloses a region corresponding to the regions enclosed by the vertical box in A. The green boxes enclose the positively selected residues detected by PAML in Table 2.
Table 2

Positive selection tests for putative convergent MnSOD genes using site models of PAML. The residues in bold mean the posterior probabilities are > 0.95 and only the values > 0.5 are presented to indicate the positive selective sites. The residues in underline correspond to the colored ones in the HCA plots and amino acid alignment figures. (A) Animal (H. sapiens, G. gallus, X. laevis, and D. rerio) in Figure 2; (B) Enterobacteriaceae (E. coli, T. ptyseos, and Y. enterocolitica) in Figure 3; (C) Archaea and relatives (C. symbiosum, C.N. maritimus, D. acidovorans, C .testosteroni, M. flagellatus, B. marina, and P. maris) in Figure 4; (D) Cyanobacteria (A. variabilis, L. boryana, T. erythraeum, and C. watsonii) in Figure 5.

(A) ModellParametersPositively selected residues
M0: one ratio−2140.72ω = 0.066Not allowed
M1a: neutral−2109.36p0 = 0.907, p1 = 0.093Not allowed
M2a: selection−2109.36p0 = 0.907, p1 = 0.076, p2 = 0.017, ω2 = 1.0007V 8C 14Q 140A 158R
M7: beta−2105.09p = 0.210, q = 2.019Not allowed
M8: beta&ω−2103.47p0 = 0.978, p = 0.277, q = 3.501, (p1 = 0.022), ωs = 2.9687V 8C 14Q 140A 158R
(B)
M0: one ratio−1515.00ω = 0.091Not allowed
M1a: neutral−1494.10p0 = 0.879, p1 = 0.121Not allowed
M2a: selection−1493.71p0 = 0.902, p1 = 0.000, p2 = 0.098, ω2 = 1.50151N 53P 60K 71V 91K 96Q 105R 119K 134L 166M 182R 205A 206K
M7: beta−1495.72p = 0.106, q = 0.626Not allowed
M8: beta&ω−1493.73p0 = 0.904, p = 4.691, q = 99.000, (p1 = 0.096), ωs = 1.53151N 53P 60K 71V 91K 96Q 105R 119K 134L 166M 182R 205A 206K
(C)
M0: one ratio−3813.11ω = 0.003Not allowed
M1a: neutral−3751.00p0 = 0.849, p1 = 0.151Not allowed
M2a: selection−3751.00p0 = 0.849, p1 = 0.151, p2 = 0.000, ω2 = 1.00064G 112S 140V
M7: beta−3708.81p = 0.513, q = 28.496Not allowed
M8: beta&ω−3708.53p0 = 0.935, p = 0.595, q = 8.550, (p1 = 0.065), ωs = 2.48153G 64G 65Q 69A 103G 112S 123K 140V 149E 162E 168L 213K
(D)
M0: one ratio−3060.23ω = 0.024Not allowed
M1a: neutral−3033.20p0 = 0.797, p1 = 0.203Not allowed
M2a: selection−3033.20p0 = 0.797, p1 = 0.004, p2 = 0.199, ω2 = 1.000None
M7: beta−3007.44p = 0.621, q = 10.021Not allowed
M8: beta&ω−3007.44p0 = 1.000, p = 0.621, q = 10.022, (p1 = 0.00001), ωs = 10.672None
The vertical red box of Figure 2A, shows the similarities among the vertebrate proteins. Figure 2B shows a single key amino acid change (Alanine A to Valine V) which might have occurred in the mammalian (human) sequence resulting in a structural change in the MnSOD protein.

Enterobacteriaceae

Figure 3A is a HCA plot of three members of the proteobacteria in the family Enterobacteriaceae which show differences between the two phylogenies seen in Figure 1. In Figure 3A we see two contour regions (the vertical red boxes) showing similarities between Escherichia coli and Yersinia enterocolitica suggesting convergent changes in tertiary structure between MnSODs of these two species.
Figure 3

Comparison of the HCA structure and amino acid sequence of Enterobacteriaceae MnSODs. Note: The similarity in HCA between E. coli and Y. enterocolitica. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequence alignment of MnSODs. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2.

Comparison of the HCA structure and amino acid sequence of Enterobacteriaceae MnSODs. Note: The similarity in HCA between E. coli and Y. enterocolitica. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequence alignment of MnSODs. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2. Comparison of the HCA structure and amino acid sequence of seven MnSODs from the groups Archaea, beta-Proteobacteria and Planctomycetes. Note: A variety of amino acid changes among similar HCA analyses. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2. Comparison of the HCA structure and amino acid sequence of MnSODs of four Cyanobacteria. Note: C. watsonii shows the same contour as L. boryana and A. variabilis but presents different amino acids, this implying possible convergence in parallel evolution. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2. Figure 3B shows the changes of amino acid residues that result in these two differences. In addition we can see from Figure 3B a greater degree of amino acid similarity between E. coli and T. ptyseos (88%) than between E. coli and Y. enterocolitica (85%) and between T. ptyseos and Y. enterocolica (82%) (Table 1). Clearly Y. enterocolitica and E. coli have more structural similarity based on HCA (98.5%), although Y. enterocolitica is the more distantly related of the three phylogenetically.

Archaea

As Archaea, Cenarchaeum symbiosum and Candidatus Nitrosopumilus maritimus cluster together phylogenetically, while based on HCA, C. symbiosum clusters with the Planctomycetes/Proteobacteria clade (Figure 1). The HCA similarity scores shown in Table 1 suggest that the structure of C. symbiosum MnSOD is more similar with that of Planctomycetes and beta-Proteobacteria MnSODs (82.8%–85.7%) than to C. N. maritimus MnSOD (82.6%), this implying structural convergence. Two boxes of Figure 4A, enclosed in red, show regions of similarity between the archean C. symbiosum and the Proteobacteria, while C. N. maritimus shows a different contour. Figure 4B, shows the amino acid residues (in red) that correspond to these differences, with the hydrophobic residues indicated in blue. We infer from these results that the archean C. symbiosum MnSOD may function in a manner more similar to the Planctomycetes and beta-Proteo-bacteria MnSODs than to the other archean C. N. maritimus MnSOD.
Figure 4

Comparison of the HCA structure and amino acid sequence of seven MnSODs from the groups Archaea, beta-Proteobacteria and Planctomycetes. Note: A variety of amino acid changes among similar HCA analyses. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2.

Blue-green algae

The relationships among the cyanobacteria show high bootstrap values in the common tree and significant differences with the HCA tree. In the common tree, Crocosphaera watsonii is the sister taxon to the remaining cyanobacteria, but based on HCA analysis, C. watsonii is the sister group to Leptolyngbya boryana and Anabaena variabilis (Figure 1). We see evidence for convergence based on the comparison of amino acid substitutions as shown in Figure 5B vs. the hydrophobic cluster analysis of Figure 5A. Crocosphaera watsonii shows the same contour as outlined by the red boxes of Figure 5A as A. variabilis and L. boryana, while the amino acid sequences found in these contours (Figure 5B) show only 46% and 47% sequence similarities, respectively (Table 1). In addition, Trichodesmium erythraeum, a taxon more closely related to A. variabilis and L. boryana than C. watsonii, shows a different contour in Figure 5A than the other three blue-green algae, providing further evidence for convergence, at least at the contour level.
Figure 5

Comparison of the HCA structure and amino acid sequence of MnSODs of four Cyanobacteria. Note: C. watsonii shows the same contour as L. boryana and A. variabilis but presents different amino acids, this implying possible convergence in parallel evolution. (A) HCA plot of MnSOD. The vertical red boxes show regions of significant differences based on HCA. Amino acid changes resulting in changes in the HCA plot are in red (hydrophilic) or blue (hydrophobic). The phylogram to the left indicates the likely evolutionary relationship of the taxa in order to compare with changes in hydrophobicity. (B) Amino acid sequenced alignment of MnSOD. The black background shows regions where two or more amino acids are totally identical, while the gray background indicates changes resulting in similar amino acids. The positions in red (hydrophilic) or blue (hydrophobic) background show changes in hydrophobicity. The red horizontal boxes enclose regions corresponding to the regions enclosed by the vertical boxes in A. The green boxes enclose the positively selected residues detected by PAML in Table 2.

Selection analysis

A selection analysis revealing Ka values (non-synonymous nucleotide changes) lower than Ks (synonymous nucleotide changes), i.e. Ka/Ks < 1, indicates that mutations in nucleotides resulting in amino acid changes are less frequent than silent changes, this implying that the respective gene is under constraint, impairing change at the protein level. This is referred to as purifying selection. In contrast, Ka values larger than Ks values (Ka/Ks > 1), imply that the gene is under positive selection, and that a change in amino acid composition from a previous state, at that position, is driven by evolutionary processes. Although some Ks values were not available, all of the obtained Ka/Ks ratios were less than 1 (Table 1), indicating purifying selection for all MnSOD genes in our analysis. Nevertheless, as shown in Table 2, several positively selected residues were detected through PAML software. These changes correspond to the HCA plots and sequence alignment maps shown in Figures 2–5, where colored residues show variations in hydrophobicity. These are changes at Alanine at position 140 of H. sapiens, Valine at position 71 and Leucine at position 134 of E. coli, Serine at position 112, and Lysine at the 123 and 213 sites in C. symbiosum. These appear to be under positive selection based on PAML analysis (boxed residues in Table 2). This selection may contribute to the mutations of amino acids resulting in the changes of protein structure and function. Positive selection tests for putative convergent MnSOD genes using site models of PAML. The residues in bold mean the posterior probabilities are > 0.95 and only the values > 0.5 are presented to indicate the positive selective sites. The residues in underline correspond to the colored ones in the HCA plots and amino acid alignment figures. (A) Animal (H. sapiens, G. gallus, X. laevis, and D. rerio) in Figure 2; (B) Enterobacteriaceae (E. coli, T. ptyseos, and Y. enterocolitica) in Figure 3; (C) Archaea and relatives (C. symbiosum, C.N. maritimus, D. acidovorans, C .testosteroni, M. flagellatus, B. marina, and P. maris) in Figure 4; (D) Cyanobacteria (A. variabilis, L. boryana, T. erythraeum, and C. watsonii) in Figure 5.

Discussion

The presentation of the HCA tree for MnSODs gives a visual representation to look for taxa where these proteins may have deviated structurally from their phylogenetic position. Hydrophobic cluster analysis allows us to observe convergent evolution not only by looking for reversions back to the same amino acid but also reversions to the same HCA contour. If this method can be automated then one could search nucleotide and protein databases for similarities in protein structure among unrelated organisms. The question is not whether these differences between phylogeny and HCA are minor or major changes, but whether they represent functional convergence at all. We believe that we may have observed some structural convergence at the molecular level regarding the blue-green algae, the Archaea, the Enterobacteriaceae and the animals, but no functional correlation can be made at this time. The relationship between these contours and the structure/function of proteins needs to be verified using X-ray crystal analysis, enzyme kinetics, etc. Since there is a large production of superoxide in the process of photosynthesis, testing levels of photo-inhibition on various blue-green algae, which may have evolved to different exposures to light, vs. HCA, might be one possible study. From the standpoint of phylogenetic analysis, the HCA only provides a distance matrix without parameters or statistics to check for accuracy or reproducibility. At the same time, trees based on HCA for the MnSOD proteins follow closely enough those based on the evolutionary history, so that one must wonder about the reason(s) for the noticeable deviations from this relationship.
  19 in total

1.  A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach.

Authors:  S Whelan; N Goldman
Journal:  Mol Biol Evol       Date:  2001-05       Impact factor: 16.240

2.  DnaSP, DNA polymorphism analyses by the coalescent and other methods.

Authors:  Julio Rozas; Juan C Sánchez-DelBarrio; Xavier Messeguer; Ricardo Rozas
Journal:  Bioinformatics       Date:  2003-12-12       Impact factor: 6.937

3.  Detection of secondary structure elements in proteins by hydrophobic cluster analysis.

Authors:  S Woodcock; J P Mornon; B Henrissat
Journal:  Protein Eng       Date:  1992-10

4.  ProtTest: selection of best-fit models of protein evolution.

Authors:  Federico Abascal; Rafael Zardoya; David Posada
Journal:  Bioinformatics       Date:  2005-01-12       Impact factor: 6.937

5.  Toward automatic reconstruction of a highly resolved tree of life.

Authors:  Francesca D Ciccarelli; Tobias Doerks; Christian von Mering; Christopher J Creevey; Berend Snel; Peer Bork
Journal:  Science       Date:  2006-03-03       Impact factor: 47.728

6.  The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools.

Authors:  J D Thompson; T J Gibson; F Plewniak; F Jeanmougin; D G Higgins
Journal:  Nucleic Acids Res       Date:  1997-12-15       Impact factor: 16.971

7.  Hydrophobic cluster analysis: an efficient new way to compare and analyse amino acid sequences.

Authors:  C Gaboriaud; V Bissery; T Benchetrit; J P Mornon
Journal:  FEBS Lett       Date:  1987-11-16       Impact factor: 4.124

8.  Manganese superoxide dismutase based phylogeny of pathogenic fungi.

Authors:  Emilie Fréalle; Christophe Noël; Nicole Nolard; Françoise Symoens; Maria-Sueli Felipe; Eduardo Dei-Cas; Daniel Camus; Eric Viscogliosi; Laurence Delhaes
Journal:  Mol Phylogenet Evol       Date:  2006-05-12       Impact factor: 4.286

9.  Identification and phylogeny of ascomycetous yeasts from analysis of nuclear large subunit (26S) ribosomal DNA partial sequences.

Authors:  C P Kurtzman; C J Robnett
Journal:  Antonie Van Leeuwenhoek       Date:  1998-05       Impact factor: 2.271

Review 10.  Superoxide radical and superoxide dismutases.

Authors:  I Fridovich
Journal:  Annu Rev Biochem       Date:  1995       Impact factor: 23.643

View more
  1 in total

Review 1.  On the Origin of Superoxide Dismutase: An Evolutionary Perspective of Superoxide-Mediated Redox Signaling.

Authors:  Adam J Case
Journal:  Antioxidants (Basel)       Date:  2017-10-30
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.