Literature DB >> 31092607

Contraction of the ROS Scavenging Enzyme Glutathione S-Transferase Gene Family in Cetaceans.

Ran Tian1,2, Inge Seim2,3, Wenhua Ren1, Shixia Xu4, Guang Yang4.   

Abstract

Cetaceans are a group of marine mammals whose ancestors were adaptated for life on land. Life in an aquatic environment poses many challenges for air-breathing mammals. Diving marine mammals have adapted to rapid reoxygenation and reactive oxygen species (ROS)-mediated reperfusion injury. Here, we considered the evolution of the glutathione transferase (GST) gene family which has important roles in the detoxification of endogenously-derived ROS and environmental pollutants. We characterized the cytosolic GST gene family in 21 mammalian species; cetaceans, sirenians, pinnipeds, and their terrestrial relatives. All seven GST classes were identified, showing that GSTs are ubiquitous in mammals. Some GST genes are the product of lineage-specific duplications and losses, in line with a birth-and-death evolutionary model. We detected sites with signatures of positive selection that possibly influence GST structure and function, suggesting that adaptive evolution of GST genes is important for defending mammals from various types of noxious environmental compounds. We also found evidence for loss of alpha and mu GST subclass genes in cetacean lineages. Notably, cetaceans have retained a homolog of at least one of the genes GSTA1, GSTA4, and GSTM1; GSTs that are present in both the cytosol and mitochondria. The observed variation in number and selection pressure on GST genes suggest that the gene family structure is dynamic within cetaceans.
Copyright © 2019 Tian et al.

Entities:  

Keywords:  GST; cetaceans; gene family; glutathione transferase; oxidative stress adaptation

Mesh:

Substances:

Year:  2019        PMID: 31092607      PMCID: PMC6643896          DOI: 10.1534/g3.119.400224

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


One of the classical examples of evolution is the return of terrestrial vertebrates to an aquatic environment – manifested by functional (secondary) adaptations in species whose ancestors departed an aquatic environment hundreds of millions of years earlier. The order Cetacea (whales, dolphins, and porpoises) is a model group of air-breathing marine mammals that transitioned to an aquatic lifestyle approximately 55 million years ago (Uhen 2007). The majority of cetaceans make shallow, short dives. This includes the common dolphin (Delphinus delphis) which usually dives down to 200 m and remains submerged for 5 min (Schreer and Kovacs 1997). Some cetaceans are capable of deep, long dives. For example, the sperm whale (Physeter catodon) can dive to a depth of 3,000 m and stay underwater for at least 138 min (Schreer and Kovacs 1997). Regardless of their diving abilities, all cetaceans face the tremendous challenge posed by a lack of oxygen during dives, so-called asphyxia (the integration of hypoxia, hypercapnia, and acidosis) (Elsner and Gooden 1983). In response, cetaceans have numerous adaptations of the respiratory system, such as improved oxygen delivery and storage in blood and muscle, as well as increased activity of glycolytic enzymes (Ramirez ). Cardiovascular adaptations, including bradycardia (slowed heart rate) and peripheral vasoconstriction, also play a vital role in the conservation of oxygen in cetaceans. They augment or maintain blood flow to the central nervous system and heart but reduce flow in peripheral tissues such as kidney, liver, and skeletal muscle (ischemia). In terrestrial mammals reperfusion injury occurs when blood flow and oxygen delivery is restored to ischemic tissues, resulting in oxidative stress (Panneton 2013; Zenteno-Savín ; Cantú-Medellín ). Oxidative stress reflects an imbalance between the generation of reactive oxygen species (ROS) and the cell’s ability to detoxify reactive intermediates and repair damage (Birben ). Oxidative stress can damage cells by lipid peroxidation and alteration of protein and nucleic acid structures (Apel and Hirt 2004). The current evidence suggests that diving marine mammals and hibernating terrestrial mammals (e.g., ground-squirrels and some bats) have adapted to rapid reoxygenation and ROS-mediated reperfusion injury (Hermes-Lima ). Oxidative damage is limited in cetaceans due to an intrinsic protection against ROS by scavenging enzymes and nonenzymatic antioxidants. Ceteceans have increased blood levels of the reduced form of glutathione (GSH), one of the most important nonenzymatic ROS scavengers (Wilhelm Filho ; García-Castañeda ). Similarily, the blood levels of vitamin E (α-tocopherol), which acts as a nonenzymatic antioxidant by protecting against peroxidation (Niwa 1999), is elevated in bottlenose dolphins (Tursiops truncatus) compared to its terrestrial sister taxa (Kasamatsu ). ROS scavenging enzymes include glutathione peroxidase (GPX) which consumes hydrogen peroxide, glutathione reductase (GRS) which recycles glutathione from glutathione disulfide, superoxide dismutase (SOD) which scavenges superoxide radicals, and glutathione-S-transferase (GST) which catalyzes the conjugation of glutathione (Birben ; Wilhelm Filho ; Dröge 2002). Studies have revealed that antioxidant enzyme activies is higher in ceteceans compared to terrestrial mammals (Birben ; Wilhelm Filho ). Taken together, it is now recognized that cetaceans reduce reperfusion injury in various ways, however the genetic changes associated with these adaptations remain elusive. Gene family innovation may enable species to adapt to novel or stressful environments (Kondrashov 2012). Here, we consider the ROS scavenging enzyme gene family glutathione transferase (GST; EC 2.5.1.18). A handful of studies suggest that GST gene gain/loss and positive selection facilitate the adaptation to changing environments (Ding ; Low ; Monticolo ; Khan 2014; Liu ). The GST family is abundant and widely distributed in vertebrates, plants, insects, and microbes (Board and Menon 2013). In addition to conjugating GSH with reactive electrophilic compounds, some GSTs can also deteoxify hydroperoxides (Sherratt and Hayes 2002). Mammalian GSTs have been divided into three structurally distinct superfamily classes with separate evolutionary origins (Board and Menon 2013): cytosolic, mitochondrial, and microsomal transferases. Cytosolic GSTs represent the largest class and consists of seven distinct subclasses: alpha (α; encoded by human chr 6), mu (μ; chr 1), theta (θ; chr 22), pi (π; chr 11), zeta (ζ; chr 14), sigma (σ; chr 4), and omega (ω; chr 10). The number of genes in each subclass varies across the phylogenetic tree. For example, human alpha (GSTA1 to GSTA5) and mu (GSTM1 to GSTM5) have five enzymes each, omega (GSTO1 and GSTO2) two members each, and zeta (GSTZ) has only one member (Table S1). Members within each GST subclass share greater than 40% amino acid sequence identity, while members of different subclasses share less than 25% identity (Wu and Dong 2012). Although diverse mammals have retained each cytosolic GSTs subclass, there is variation in the number of genes in each subclass. For example, there is one pi subclass gene (GSTP) in humans and two pi subclass genes in mice (Bammler ). In this study, we performed a comparative genomics analysis of cytosolic GST genes in 21 mammals, including seven cetaceans, to improve our knowledge on the genetic and evolutionary dynamics of the GST superfamily in mammals.

Materials and Methods

Sequence retrieval

We obtained known human GST genes by perusing research articles and recent reviews (Board and Menon 2013; Morel ) and by downloading coding sequences (CDS) from GenBank (Benson ). GenBank accession numbers are listed in Table S1. Employing human protein sequences as queries, BLASTn searches were performed using an in-house Python script (see Supplemental Material, File 2) on local databases constructed from downloaded genomic sequences from 20 species. These included seven cetaceans: bottlenose dolphin (Tursiops truncatus), killer whale (Orcinus orca), Yangtze river dolphin (Lipotes vexillifer), Yangtze finless porpoise (Neophocaena asiaeorientalis asiaeorientalis), minke whale (Balaena acutorostrata), bowhead whale (Balaena mysticetus), and sperm whale (Physeter macrocephalus); two pinnipeds: Weddell seal (Leptonychotes weddellii), Pacific walrus (Odobenus rosmarus divergens); one sirenian: Florida manatee (Trichechus manatus latirostris); and ten terrestrial mammals: cow (Bos taurus), Tibetan yak (Bos mutus), sheep (Ovis aries), Tibetan antelope (Pantholops hodgsonii), dog (Canis lupus familiaris), horse (Equus caballus), a bat (little brown bat; Myotis lucifugus), mouse (Mus musculus), naked mole rat (Heterocephalus glaber), and (African) elephant (Loxodonta africana). The completeness of the annotated gene set of the 20 specices was assessed using Benchmarking Universal Single-Copy Orthologs (BUSCO v3.0) with mammalian-specific single-copy orthologs (mammalia_odb9) (Simão ). Genome sequencing and assembly information of each species are listed in Table S2. To identify GST genes, we set the BLASTn parameter E-value cut-off to 10 and hits with the highest score, lowest E-value, and a length of ≥ 150bp were retained. We next retrieved multiple non-redundant hits by extending 1,000 bp at both 5′ and 3′ ends to find exon boundaries, following the canonical gt/ag exon/intron junction rule (Cheng ). We compared the genomic locations of each coding sequence among all genes to filter out repeated sequences with the same location on the same scaffold. The online resource GENEWISE (http://www.ebi.ac.uk/Tools/psa/genewise) (Birney ) was employed to identify open reading frame (ORF) for all obtained sequences. Finally, all predicted GST sequences were verified by BLAST against their respective species’ genome to acquire complete GST gene sets. Mammalian GSTs were named according to the human GST nomenclature (Nebert and Vasiliou 2004). All identified GST genes were categorized into three categories – based on amino acid composition, unique motifs, and BLAST and alignment results: 1) intact gene, a complete CDS region with the canonical structure typical of GST families; 2) partial gene, putative functional protein, but missing a start codon and/or stop codon; 3) pseudogene, highly similar to functional orthologs but with (a) inactivating mutation(s) and/or stop codon(s). To achieve a high accuracy in identifying GST genes in mammals, we used Genomicus v93.01 (Nguyen ) to identify genes flanking the GST gene clusters in human and searched the mammalian genomes using BLAST to identify orthologous genomic regions. This facilitated the identification of the correct arrangement and orientation of the GST genes in each species.

Phylogenetic analyses

The phylogenetic relationships between putative GST members in each subclass were estimated by maximum likelihood and Bayesian methods, as implemented in RAxML v8.0.26 (Stamatakis 2014) and MrBayes v3.1.2 (Ronquist and Huelsenbeck 2003). Nucleotide sequences were aligned using MUSCLE v3.6 (Edgar 2004), implemented in SeaView v4.5.4 (Gouy ), and manually corrected upon inspection. MrModeltest was used to estimate the best-fit model of nucleotide substitution (SYM+G) and amino acid substitution (JTT+G4) (Nylander 2009). For the RAxML analyses the ML phylogeny was estimated with 1,000 bootstrap replicates. For MrBayes analyses we performed two simultaneous independent runs for 50 million iterations of a Markov Chain, with six simultaneous chains and sampling every 1,000 generations. A consensus tree was obtained after discarding the first 25% trees as burn-in.

Gene family analyses

To identify expanding and contracting gene ortholog groups across the mammalian phylogeny we estimated the gene numbers on internal branches using a random birth and death process model implemented in the software CAFÉ v3.0, a tool for the statistical analysis of the evolution of the size of gene families (De Bie ). We defined gene gain and loss by comparing cluster size differences between ancestors and each terminal branch among the phylogenetic tree. An ultrametric tree, based on the concatenated orthogroups, was estimated with BEAST v1.10 using Markov chain Monte Carlo (MCMC) with fossil calibrations and a Yule tree prior (Suchard ). Molecular timescales were obtained from TimeTree (http://www.timetree.org) (Kumar ). The analyses ran for 10 million generations, with a sample frequency of 1,000 and a 10% burn-in.

Adaptive evolution analyses

To evaluate the positive selection of GST genes during mammalian evolution, codon substitution models implemented in the codeml program in PAML v4.4 (Yang 2007) were applied to GST gene alignments. Two pairs of site-specific modes were compared using the likelihood ratio test (LRT): M8 (beta & ω) vs. M8a (beta & ω = 1) (Swanson ). M8 estimates the beta-distribution for ω and takes into account positively selected sites (ω > 1), with the neutral model M8a not ‘allowing’ a site with ω > 1. We next employed branch-site models (test 2) to explicitly assess the rate of evolution on a site along a specific lineage of a tree: branch-site model (Ma) vs. branch-site model with fixed ω1 = 1 (Ma0) (Zhang ). The Ma model assumes that sites in the foreground branch (i.e., the branch of interest) are under positive selection. When the LRT was significant (P < 0.05) under the M8 and Ma tests, codon sites under positive selection were assessed using the Bayes Empirical Bayes (BEB) method (Yang ). The species tree was used as the guide tree in all analyses (Figure S1). Multiple testing for positive selection on genes was corrected by performing a false discovery rate (FDR, Benjamini-Hochberg) test at a cutoff of 0.05 (Benjamini and Hochberg 1995). We only accepted positively selected sites with a posterior probability (PP) > 0.80. A series of models implemented in HyPhy (http://www.datamonkey.org) (Pond and Muse 2005) were also used to estimate ratios of nonsynonymous (d ) and synonymous (d ) based on a maximum likelihood (ML) framework (Pond and Frost 2005a). These models tested were Single Likelihood Ancestor Counting (SLAC), Fixed Effect Likelihood (FEL), and Random Effect Likelihood (REL) (Pond and Frost 2005b). As a criterion to identify candidates under selection, we used a Bayes Factor > 50 for REL and P-value of 0.10 for SLAC and FEL. The program TreeSAAP (Selection on Amino Acid Properties using Phylogenetic trees) v3.2 (Woolley ) was used to evaluate amino acid residue replacement during evolution, taking into account physicochemical properties and the assumption of random replacement under a neutral model of evolution. TreeSAAP complements d/d analysis, which does not distinguish between different types of non-synonymous substitutions. InterProScan 5 was used to annotate positively selected sites found in functional protein domains (Jones ). We also used PyMOL v2.3 (http://pymol.sourceforge.net/) to load and manipulate PDB files, highlighting the position of selected residues.

Comparison of mammalian niches

To investigate the potential links between the molecular evolution of GST genes and ecological adaptations, we assigned habitats (aquatic or terrestrial) according to data in the literature (Uhen 2007). We used Clade Model C (CmC) to identify the level of divergent selection among clade with different ecological niches and to test what partition of clades best fit the data (Bielawski and Yang 2004). CmC, which allows site variation among a priori defined foreground (cetacean, pinnipeds, sirenians) and background partitions (Figure S1), was compared with the M2a_rel null mode which does not allow variation among divisions. Taking into account the phylogenetic relationships between mammals, phylogenetic ANOVA (Garland ) using the function phylANOVA implemented in the R package ‘phytools’ (Revell 2012) was performed to test for differences in the number of GST genes between marine and terrestrial mammals.

Data availability

The authors affirm that all data necessary for confirming the conclusions of this article are represented fully within the article and its tables and figures. Supplemental material available at FigShare: https://doi.org/10.25387/g3.7403828.

Results

Cytosolic glutathione transferase gene repertoires in mammals

To examine the evolution of cytosolic GST genes in mammals we interrogated public genomic data of the 21 species, representing all major mammalian taxa. We identified a total of 448 GST genes (333 intact genes, 22 partial genes, and 93 pseudogenes) (Table 1). Amino acid sequence identity between GST paralogous was more than 50%, whereas the identity between all subclasses was less than 30% (Table S3). It has been reported that a low-quality genome can effect gene family analyses by introducing frame shifting errors in coding sequences (Young ). In our study we employed genomes with good reported genome assemblies in an effort to minimize this potential bias. In agreement, BUSCO analysis on the protein gene sets showed that most, except for bowhead whale (74.6%) and Weddell seal (87.3%), included >95% complete sequences of mammalian universal single-copy orthologs (n = 4,104) (Table S2). This suggests that the genome assembly quality is similar between the marine and the terrestrial species.
Table 1

Overview of cytosolic glutathione transferase (GST) genes in 21 mammals

SubclassAlpha (GSTA)Mu (GSTM)Theta (GSTT)Pi (GSTP)Zeta (GSTZ)Omega (GSTO)Sigma (HPGDS)Total number of GSTsPseudogene proportion
Human12(7:5:0)5(0:5:0)3(0:3:0)1(0:1:0)1(0:1:0)2(0:2:0)1(0:1:0)25(7:18:0)0.28
Mouse6(1:5:0)10(4:6:0)3(0:3:0)7(0:7:0)1(0:1:0)2(0:2:0)1(0:1:0)30(5:25:0)0.17
Naked mole rat4(1:3:0)10(0:10:0)2(0:2:0)2(0:2:0)1(0:1:0)2(1:1:0)1(0:1:0)22(2:20:0)0.09
Bottlenose dolphin3(1:0:2)5(3:1:1)2(1:1:0)3(1:0:2)1(0:0:1)2(0:1:1)1(0:1:0)17(6:4:7)0.35
Killer whale3(1:2:0)5(4:1:0)2(0:2:0)3(1:2:0)1(0:1:0)2(0:2:0)1(0:1:0)17(6:11:0)0.35
Yangtze finless porpoise3(1:2:0)5(4:1:0)2(1:1:0)3(1:0:2)1(0:0:1)2(0:2:0)1(0:1:0)17(7:7:3)0.41
Yangtze river dolphin3(1:2:0)5(4:1:0)2(1:1:0)3(1:2:0)1(0:1:0)2(0:2:0)1(0:1:0)17(7:10:0)0.41
Sperm whale3(1:2:0)5(4:1:0)2(0:1:1)3(1:2:0)1(0:1:0)2(0:2:0)1(0:1:0)17(6:10:1)0.35
Minke whale3(1:2:0)5(3:1:1)2(1:1:0)3(1:2:0)1(0:1:0)2(0:1:1)1(0:1:0)17(6:9:2)0.35
Bowhead whale2(0:2:0)5(3:1:1)2(1:1:0)3(1:0:2)1(0:0:1)2(0:2:0)1(0:1:0)16(5:7:4)0.31
Cow6(2:4:0)8(2:6:0)2(0:2:0)2(0:2:0)1(0:1:0)3(0:3:0)1(0:1:0)23(4:19:0)0.17
Tibetan yak6(1:5:0)6(0:6:0)3(0:3:0)2(0:2:0)1(0:1:0)3(0:3:0)1(0:1:0)22(1:21:0)0.05
Sheep7(1:6:0)8(2:6:0)2(0:2:0)2(0:2:0)1(0:1:0)4(0:4:0)1(0:1:0)25(3:22:0)0.12
Tibetan antelope6(1:5:0)6(0:6:0)2(0:2:0)2(0:2:0)1(0:1:0)3(0:3:0)1(0:1:0)21(1:20:0)0.05
Weddell seal4(1:2:1)5(2:3:0)2(0:2:0)7(1:4:2)1(0:1:0)2(0:2:0)1(0:1:0)22(4:15:3)0.18
Pacific walrus5(1:4:0)8(2:6:0)2(0:2:0)6(1:5:0)1(0:1:0)2(0:2:0)1(0:1:0)25(4:21:0)0.16
Dog6(1:5:0)5(3:2:0)2(0:2:0)12(6:6:0)1(0:1:0)2(0:2:0)1(0:1:0)29(10:19:0)0.34
Horse10(1:9:0)6(2:4:0)2(0:2:0)2(0:2:0)1(0:1:0)2(0:2:0)1(0:1:0)24(3:21:0)0.13
Microbat6(0:6:0)5(1:3:1)4(0:4:0)6(5:1:0)1(0:1:0)2(0:2:0)1(0:1:0)25(6:18:1)0.24
Florida manatee4(0:4:0)8(0:8:0)2(0:2:0)1(0:1:0)1(0:1:0)2(0:2:0)1(0:1:0)19(0:19:0)0.00
Elephant3(0:3:0)8(0:8:0)2(0:1:1)1(0:1:0)1(0:1:0)2(0:2:0)1(0:1:0)18(0:17:1)0.00

The number outside brackets is the number of genes in a subclass, while the numbers in the brackets, separated by a colon, indicate the number of pseudogenes, intact gene, and partial genes. The orders of cetacean are shadowed.

The number outside brackets is the number of genes in a subclass, while the numbers in the brackets, separated by a colon, indicate the number of pseudogenes, intact gene, and partial genes. The orders of cetacean are shadowed.

The phylogenetic relationships of cytosolic GST genes

We next wished to classify the 448 cytosolic GST genes into their respective subclass. Phylogenetic relationships were reconstructed using maximum likelihood and Bayesian methods. Both topologies yielded similar branch patterns, indicating a reliable tree structure (Figure S2). Each GST subclass was clustered into a monophyletic group with high node bootstrap values (94–100% of bootstrapping) (Figure S2). However, subtrees of species within each subclass were not clearly resolved (< 50% node bootstrap support) (Figure S2) – especially for the alpha and mu subclasses where duplication events were common. According to the phylogenic tree, the cytosolic GST gene family is highly conserved in mammals (Figure S2). We detected 105 alpha subclass (81 genes and 24 pseudogenes), 133 mu subclass (90 genes and 43 pseudogenes), 47 theta subclass (42 genes and 5 pseudogenes), 74 pi (54 genes and 20 pseudogenes), 21 zeta subclass (21 genes and 0 pseudogenes), 47 omega subclass (46 genes and 1 pseudogenes), and 21 sigma subclass (21 genes and 0 pseudogenes) genes in the 21 species examined. Notably, ML and Bayesian phylogenies based on the 448 GST genes did not arrange the seven subclasses into well-supported clades (< 50% node bootstrap support). This could result from the 93 pseudogenes and 22 partial GST genes that are likely no longer under natural selection pressure. Therefore, only the 333 intact (complete CDS) sequences were used to infer the phylogenetic tree and determine the evolutionary relationship of subclasses. The tree generated using an amino acid substitution model (Figure S3) was consistent with the tree structure obtained using nucleotide sequences (Figure 1). According to the new phylogeny, the mu subclass clustered with the pi subclass but the bootstrap support value and posterior probability were low (BS = 18%; PP = 0.34, Figure 1). The alpha subclass grouped with sigma with high bootstrapping and posterior probability (BS = 91%; PP = 0.58, Figure 1). The mu, pi, alpha, and sigma subclasses were much more closely related to each other, with 98% bootstrap support and 0.50 posterior probability (Figure 1). The theta subclass was placed as sister to a clade containing mu, pi, alpha, and sigma genes with high support (BS = 85%; PP = 1.00, Figure 1). Moreover, these five subclasses formed a monophyletic sister clade to the zeta subclass. The phylogenetic tree recovered the omega subclass as the most diverged lineage within GSTs (Figure 1).
Figure 1

Phylogenetic tree of GST gene family in mammals. Maximum likelihood and Bayesian phylograms describing phylogenetic relationships among 333 intact (complete coding sequences) mammalian GSTs (21 species). Numbers on nodes correspond to maximum likelihood bootstrap support values and Bayesian posterior probabilities. The GST subclasses are indicated by different colors: alpha (red), mu (orange), pi (turquoise), omega (azure), sigma (green), zeta (purple), and theta (blue).

Phylogenetic tree of GST gene family in mammals. Maximum likelihood and Bayesian phylograms describing phylogenetic relationships among 333 intact (complete coding sequences) mammalian GSTs (21 species). Numbers on nodes correspond to maximum likelihood bootstrap support values and Bayesian posterior probabilities. The GST subclasses are indicated by different colors: alpha (red), mu (orange), pi (turquoise), omega (azure), sigma (green), zeta (purple), and theta (blue). Further strengthening our GST gene predictions, all seven cytosolic GST subclasses showed conserved synteny and similar arrangement across mammalian genomes (Figure 2). Duplicated genes within each subclass were arranged in a cluster, with two or more gene copies in tandem in the subclasses alpha, mu, theta, pi, and omega. The zeta and sigma classes had a single copy per species. The subclasses were flanked by the same pair of genes in all species (Figure 2), with the exception of the dog where pi subclass genes were present on separate scaffolds and possibly reflecting a sequencing or genome assembly artifact.
Figure 2

Genomic organization of GST genes in 21 mammalian species. The arrowed boxes represent genes and directions of transcription. GST genes are shown in orange, while flanking genes are indicated in green, purple, blue, and pink. Filled boxes: intact genes; empty boxes: partial gene, and empty boxes with a vertical line: pseudogenes (P). Connecting horizontal lines indicate genes on the same chromosome/genomic scaffold. Gene family sizes for ancestral states are shown along each node in the phylogenetic tree.

Genomic organization of GST genes in 21 mammalian species. The arrowed boxes represent genes and directions of transcription. GST genes are shown in orange, while flanking genes are indicated in green, purple, blue, and pink. Filled boxes: intact genes; empty boxes: partial gene, and empty boxes with a vertical line: pseudogenes (P). Connecting horizontal lines indicate genes on the same chromosome/genomic scaffold. Gene family sizes for ancestral states are shown along each node in the phylogenetic tree.

Lineage-specific gene duplications and deletions

We next contrasted the number of GST gene copy number in mammalian lineages. Of the 21 species studied, mouse (25 intact GSTs), naked mole rat (20 intact GSTs), Tibetan yak (21 intact GSTs), Tibetan antelope (20 intact GSTs), sheep (22 intact GSTs), Pacific walrus (21 intact GSTs), and horse (21 intact GSTs) have the largest GST repertoires (Table 1, Figure 2). On the opposite spectrum, cetaceans appear to have the smallest GST repertoire – about ten functional GSTs per species (Table 1, Figure 2). In agreement, the fraction of GST pseudogenes is the highest in cetaceans (mean, 36%, Table 1), which is three times higher than terrestrial artiodactyls (mean, 11%, Table 1). Considering the alpha subclass, we found that the cetacean alpha subclass consists of two functional GSTs, but their relatives (i.e., artiodactyls) have at least four intact alpha GSTs. Sheep (6 intact GSTAs), horse (9 intact GSTAs), and bat (6 intact GSTAs) have a relatively large number of alpha GSTs. Similarly, only two functional mu GSTs were identified in cetaceans. In contrast, the number of functional GSTM genes in artiodactyls (6) is almost six times that of cetaceans. Notably, a large group of species, including all rodents and two afrotherians (West Indian manatee and African elephant), harbor six to ten functional mu GSTs (Table 1, Figure 2). Additionally, bat (4 intact GSTTs) has the largest gene number of theta GSTs, while the lowest gene copy number was found in cetaceans (just one gene in six cetaceans; two in killer whale). Almost all mammalian species have two omega subclass genes, with the exception of artiodactyls (3 or 4 genes) (Table 1, Figure 2). The zeta and sigma subclass gene number appear to be more conserved in mammals, with one copy identified, respectively. Furthermore, the gene gain and loss of GSTs at each ancestral node was estimated by the software CAFÉ (De Bie ). We found that the terrestrial groups have similar GST gene repertoires – carnivorans (mean 24.8), rodents (mean 25), artiodactyls (mean 22.29) (Welch’s t-test, P > 0.05). In contrast, the number of intact GST genes in cetaceans (mean 16.92) was significantly lower than that of the terrestrial groups (Welch’s t-test, P < 0.001) (Figure 2). To compare gene numbers and account for statistical non-independence of closely-related species, we performed phylogenetic ANOVA (phylANOVA). After accounting for phylogeny, habitat (aquatic vs. terrestrial) was a signficant predictor of the gene copy number of all cytosolic GSTs combined (phylANOVA; F = 23.135, P = 0.009) (Figure 3A) and alpha-class GSTs (GSTA) alone (phylANOVA; F = 21.599, P = 0.007), with a smaller number of genes apparent in cetaceans (Figure 3B). In agreement, when we compared each marine order (Cetacea, Pinnipedia, Sirenia) to terrestrial species separately, cytosolic gene copy numbers was only significantly different between cetacean and terrestrial species (Figure 3C). This included all cytosolic GST combined (phylANOVA; F = 142.458, P = 0.001), GSTA alone (phylANOVA; F = 24.300, P = 0.026), and GSTM alone (phylANOVA; F = 27.557, P = 0.011).
Figure 3

Differences in cytosolic GST genes between mammals inhabing aquatic and terrestrial habitats. (A) All GSTs genes combined, comparing aquatic and terrestrial mammals. Green dots indicate terrestrial species; blue dots, cetacean species; gray dots, pinnipeds; purple dots, sirenians. (B) alpha-class GSTs (GSTA) genes, comparing aquatic and terrestrial mammals. Annotated as in (A). (C) Comparison of all GST genes, alpha-class GSTs (GSTA) genes, and mu-class GSTs (GSTM) genes in ceteaceans and terrestrial mammals. Bar chart shows mean± SEM. Blue bars indicates cetacean species; green bars, terrestrial species. We compared each pair of distributions by phylogenetic ANOVA (phylANOVA) tests, which control for shared ancestry. The p-value for each test is shown above each plot.

Differences in cytosolic GST genes between mammals inhabing aquatic and terrestrial habitats. (A) All GSTs genes combined, comparing aquatic and terrestrial mammals. Green dots indicate terrestrial species; blue dots, cetacean species; gray dots, pinnipeds; purple dots, sirenians. (B) alpha-class GSTs (GSTA) genes, comparing aquatic and terrestrial mammals. Annotated as in (A). (C) Comparison of all GST genes, alpha-class GSTs (GSTA) genes, and mu-class GSTs (GSTM) genes in ceteaceans and terrestrial mammals. Bar chart shows mean± SEM. Blue bars indicates cetacean species; green bars, terrestrial species. We compared each pair of distributions by phylogenetic ANOVA (phylANOVA) tests, which control for shared ancestry. The p-value for each test is shown above each plot.

Evolutionary model of the cytosolic GST gene family in mammals

To investigate the possible role of natural selection on the evolutionary history of the cytosolic GST gene family, a series of site-specific and branch-specific evolutionary models were evaluated. Site-specific selection tests implemented in PAML (Yang 2007) were performed to assess the selective pressure acting on mammalian GSTs. Site-specific positive selection with posterior probabilities > 0.80 were detected for GSTA1 (16 sites), GSTM1 (5 sites), GSTO1 (14 sites), GSTO2 (3 sites), GSTP1 (2 sites), GSTP2 (6 sites), GSTT2 (1 sites), and GSTZ1 (3 sites) (Table S4); suggesting that GSTs have evolved under diversifying selection in mammals. Similarly, positively selected codons in eight genes were identified using SLAC (23 codons), FEL (28), and REL (27) models implemented in Datamonkey (Pond and Muse 2005). In total, 38 codons from eight genes (GSTA1: 10; GSTM1: 1; GSTO1: 9; GSTO2: 5; GSTP1: 2; GSTP2: 5; GSTT2: 3; and GSTZ1: 3) with evidence of positive selection were detected with at least two of the ML methods (Table 2). Among these codons, 31 sites were also detected by amino-acid level selection analyses using TreeSAAP (Woolley ). Intriguingly, site enrichment analysis revealed that the sites under positive selection (28/38, 73%) are located or close to a GSH binding site, a substrate binding pocket, a C-terminal domain interface, or a N-terminal domain interface (Table S5, Figure S4).
Table 2

Amino acid sites under positive selection detected by ML methods

GeneSite PositionPAMLDatamonkeyTreeSAAP
M8aSLACbFELcRELdRadical Changes in AA PropertieseTotal
GSTA136Pα, c, pHi, αc, P5
49Ns, RF, Pc, h, F, p, Ra, P8
96pK’, F, Ra, Hp, Ht, P4
1000
103p1
121pK’, Ra, Ht3
208Pα, Pc, pK’, F, P5
2120
215Br, RF, h, pHi, Hnc, p, αc, Et8
222pK’1
GSTM1107Ns, RF, h, p, P5
GSTO123Br, Bl, RF, Pc, h, F, p, Et, P9
470
69K0, F, P3
125αc1
127pK’1
128Ns, Br, Bl, RF, Pc, h, F, p, El, Ra, Hp, Ht, Et, P14
216Ns, Br, RF, Pc, h, F, p, K0, Hnc, El, αc, αn, Esm, Et, P15
226pK’1
227Pα, pHi, P3
GSTO214Pα1
230
43Ns, αn, pHi, Br, RF, h, Hnc, p, Esm, Et10
141Pα, Pc, P3
164Ns, Pc, F, Ra, P5
GSTP112Pc, K0, pHi, αc4
40Pα, Br, Pc, h, F, p, Et, P8
GSTP211Pα, Pc, αn, Ra, P5
12K0, pHi, αc3
760
111Ns, pHi, Esm3
121h, pHi, p3
GSTT225αc, Pr, Et, pHi, Esm5
80Pα1
2370
GSTZ171Bl, Pc, F, αm, P5
125K0, Pc2
1370

Sites detected under selection in M8 with posterior probabilities > 80% in the BEB analyses.

Codons with P values < 0.2.

Codons with P values < 0.2.

Codons with Bayes factors > 50.

Radical changes in amino acid properties under category 6-8 were detected in TreeSAAP. Physicochemical amino acid properties available in TreeSAAP are as following: : Power to be C-term., α-helix; : Power to be in the N-terminal of an α-helix; : Buriedness; : Helical contact energy; : Long-range non-bonded energy; : Short and medium range non-bonded energy; : Total non-bonding Energy; : Mean r.m.s. fluctuation displacement; :Hydropathy; : Normal consensus hydrophobicity; : Surrounding hydrophobicity; : Thermodynamic transfer hydrophobicity; : Compressibility; : Refractive index; : Molecular volume; : Molecular weight; : Average number of surrounding residues; : α- helical tendencies; : β-structure tendencies; : Coil tendencies; : Turn tendencies; : Polarity; : Isoelectric point; : Equilibrium Constant of ionization for COOH; : Polar requirement; : Solvent accessible reduction ratio; : Chromatographic index; : Partial specific volume.

Sites detected under selection in M8 with posterior probabilities > 80% in the BEB analyses. Codons with P values < 0.2. Codons with P values < 0.2. Codons with Bayes factors > 50. Radical changes in amino acid properties under category 6-8 were detected in TreeSAAP. Physicochemical amino acid properties available in TreeSAAP are as following: : Power to be C-term., α-helix; : Power to be in the N-terminal of an α-helix; : Buriedness; : Helical contact energy; : Long-range non-bonded energy; : Short and medium range non-bonded energy; : Total non-bonding Energy; : Mean r.m.s. fluctuation displacement; :Hydropathy; : Normal consensus hydrophobicity; : Surrounding hydrophobicity; : Thermodynamic transfer hydrophobicity; : Compressibility; : Refractive index; : Molecular volume; : Molecular weight; : Average number of surrounding residues; : α- helical tendencies; : β-structure tendencies; : Coil tendencies; : Turn tendencies; : Polarity; : Isoelectric point; : Equilibrium Constant of ionization for COOH; : Polar requirement; : Solvent accessible reduction ratio; : Chromatographic index; : Partial specific volume. We also employed the PAML branch-site model (Yang 2007) to identify episodic adaptations that affect amino acids along specific lineages. Few internal branches and several terminal branches showed evidence of positive selection after FDR correction (Table 3, Figure S5). In cetaceans the lineage leading to GSTO1 in bottlenose dolphin and GSTP2 in sperm whale were under selection. In pinnipeds the branches leading to GSTM1 in Pacific walrus and its ancestor, GSTP2 in the ancestor, as well as GSTA1 in Weddell seal were under positive selection. GSTA1 and GSTM1 were under positive selection only in pinnipeds (Pacific walrus and Weddell seal). The lineage leading to human GSTA4, elephant and bat GSTT1, cow, horse and manatee GSTT2, sheep and antelope GSTP2, naked mole rat GSTO1, as well as cetartiodactyla (includes whales and dolphins, and even-toed ungulates) HPGDS were also under positive selection.
Table 3

Selective pattern analyzed by Branch-site model

GenesBranch-site modelsa-lnLb2ΔlnLp-valueω valuesPositively selected sitec
GSTA1Teminal branch of Lwe
ma4319.495ω0 = 0.092 ω1 = 1.0 ω2 = 338.76245-0.962*, 48-0.875, 52-0.875, 59-0.800, 64-0.853, 70-0.848, 184-0.864, 185-0.998**, 186-0.895, 201-0.827
ma04343.65848.325<0.001ω0 = 0.079 ω1 = 1.0 ω2 = 1.0
GSTA4Teminal branch of Hsa
ma2757.115ω0 = 0.102 ω1 = 1.0 ω2 = 99942-0.947, 185-0.961*
ma02762.89211.555<0.001ω0 = 0.096 ω1 = 1.0 ω2 = 1.0
GSTM1Teminal branch of Odi
ma3851.226ω0 = 0.104 ω1 = 1.0 ω2 = 24.37235 D 0.996**, 44 W 0.870, 82 H 0.869, 94 R 0.860, 166 R 0.873, 215 N 0.989*,
ma03855.0227.5920.006ω0 = 0.104 ω1 = 1.0 ω2 = 1.0
LCA of Lwe and Odi
ma3856.774ω0 = 0.113 ω1 = 1.0 ω2 = 88.80293 I 0.943
ma03860.4997.4500.006ω0 = 0.111 ω1 = 1.0 ω2 = 1.0
GSTT1Teminal branch of Laf
ma3407.906ω0 = 0.12 ω1 = 1.0 ω2 = 999.0172-0.943, 219-0.939,
ma03411.6907.5650.006ω0 = 0.119 ω1 = 1.0 ω2 = 1.0
Teminal branch of Mlu
ma3408.749ω0 = 0.126 ω1 = 1.0 ω2 = 999218-0.925,
ma03412.4907.4830.006ω0 = 0.12 ω1 = 1.0 ω2 =1.0
GSTT2Teminal branch of Bta
Ma3764.990ω0 = 0.104 ω1 = 1.0, ω2 = 999237-0.876, 239-0.987*
Ma03780.48930.999<0.001ω0 = 0.106, ω1 = 1.0, ω2 = 1
Teminal branch of Eca
Ma3765.226ω0 = 0.102 ω1 = 1.0, ω2 = 99918- 0.977*, 19- 0.971*, 21-0.964*, 24-0.895, 26-0.926, 30-0.936, 143-0.941
Ma03775.41820.384<0.001ω0 = 0.098 ω1 = 1.0, ω2 = 1
Teminal branch of Tla
ma3770.135ω0 = 0.096 ω1 = 1.0 ω2 = 99913-0.936, 28-0.953*, 89-0.922, 93-0.986*, 190-0.866
ma03776.14212.014<0.001ω0 = 0.091 ω1 = 1.0 ω2 = 1.0
GSTP2Teminal branch of Oar
ma2704.119ω0 = 0.069 ω1 = 1.0 ω2 = 999113-0.989*, 114-0.961*, 205-0.804, 210-0.951*
ma02717.93427.631<0.001ω0 = 0.063 ω1 = 1.0 ω2 = 1.0
LCA of Lwe and Odi
ma2711.299ω0 = 0.059 ω1 = 1.0 ω2 = 10.047106-0.890, 135-0.953*, 144-0.869, 174-0.987*, 208-0.878,
ma02714.8007.0020.008ω0 = 0.059 ω1 = 1.0 ω2 = 1.0
Teminal branch of Pho
ma2707.268ω0 = 0.056 ω1 = 1.0 ω2 = 15.7883-0.814, 4-0.997**, 6-0.995**, 7-0.952*, 8-0.808
ma02711.2517.9650.005ω0 = 0.055 ω1 = 1.0 ω2 = 1.0
Teminal branch of Pma
ma2710.120ω0 = 0.062 ω1 = 1.0 ω2 = 108.2441-0.933, 110-0.908, 117-0.941, 210-0.911
ma02717.79315.347<0.001ω0 = 0.062 ω1 = 1.0 ω2 = 1.0
GSTO1Teminal branch of Hgl
ma4450.655ω0 = 0.065 ω1 = 1.0 ω2 = 18.711-0.988*, 21-0.873, 24-0.853, 195-0.974*, 213-0.936
ma04454.9598.6080.003ω0 = 0.061 ω1 = 1.0 ω2 = 1.0
Teminal branch of Ttr
ma4453.983ω0 = 0.082 ω1 = 1.0 ω2 = 99947-0.991**,
ma04458.9289.8900.002ω0 = 0.082 ω1 = 1.0 ω2 = 1.0
GSTZ1Teminal branch of Laf
ma3360.900ω0 = 0.077 ω1 = 1.0 ω2 = 34.47714-0.815, 181-0.819, 183-0.826
ma03365.5549.3060.002ω0 = 0.074 ω1 = 1.0 ω2 = 1.0
HPGDSLCA of Cetartiodactyla
ma2891.856ω0 = 0.105 ω1 = 1.0 ω2 = 163.254183-0.980*,
ma02896.0248.3350.004ω0 = 0.106 ω1 = 1.0 ω2 = 1.0

Bta: Bos taurus; Eca: Equus caballus; Hgl: Heterocephalus glaber; Has: Homo sapiens; Lwe: Leptonychotes weddellii; Laf: Loxodonta africana; Mmu: Mus musculus; Mlu: Myotis lucifugus; Nph: Neophocaena phocaenoides; Odi: Odobenus rosmarus divergens; Oor: Orcinus orca; Oar: Ovis aries; Pho: Pantholops hodgsonii; Pma: Physeter macrocephalus; Tla: Trichechus manatus latirostris; Ttr: Tursiops truncatus.

lnL is the log-likelihood score.

Codons with posterior probabilities (pp) >80% in the BEB analyses. * pp >95%, ** pp >99%.

Bta: Bos taurus; Eca: Equus caballus; Hgl: Heterocephalus glaber; Has: Homo sapiens; Lwe: Leptonychotes weddellii; Laf: Loxodonta africana; Mmu: Mus musculus; Mlu: Myotis lucifugus; Nph: Neophocaena phocaenoides; Odi: Odobenus rosmarus divergens; Oor: Orcinus orca; Oar: Ovis aries; Pho: Pantholops hodgsonii; Pma: Physeter macrocephalus; Tla: Trichechus manatus latirostris; Ttr: Tursiops truncatus. lnL is the log-likelihood score. Codons with posterior probabilities (pp) >80% in the BEB analyses. * pp >95%, ** pp >99%. A more thorough investigation of the evolutionary history of mammalian GSTs was conducted by extending the analysis to comparing marine and terrestrial species. Clade model C allows for more than two clades to be defined as separate partitions, estimating ω separately for each partition. A model which assumes divergent selection along the cetacean, pinniped, sirenian, and marine mammal branches fitted the data better than the null model in the case of the GST genes GSTA1, GSTA4, GSTM1, GSTM3, and GSTT1 (P < 0.05, Table S6), suggesting there is a divergent selection pressure between marine and terrestrial mammals and within marine mammal groups.

Discussion

Molecular evolution of GST genes

Several previous surveys have documented the distribution of GST genes. Nebert and Vasiliou (2004) provided a comprehensive assessment of GST superfamily genes. Other studies examined the phylogeny (Pearson 2005) and evolution (da Fonseca ) of the superfamily across the tree of life. In the present study, we expanded on the previous surveys by performing a comprehensive search for cytosolic GST genes in 21 mammals representative of all major mammalian taxa. Of these, 15 species had no previous information on their cytosolic GST gene repertoire (Figure 2). We provide evidence for positive selection acting on several GST genes in divergent taxonomic groups (GSTA1, GSTM1, GSTO1, GSTO2, GSTP1, GSTP2, GSTT2, and GSTZ1), indicative of pervasive adaptive evolution. Considering that cytosolic GST genes play critical roles in the detoxification and metabolic activation of xenobiotics (Board and Menon 2013), these changes may reflect previous and ongoing adaptions to diverse environments by mammals. In addition, 31 positively selected sites with radical amino acid changes were identified by gene- and protein-level selection analyses. Interestingly, 73% of the total amino acids under positive selection were found to be located in or close to functional domains. These results indicate that positively selected amino acid changes might play an important role in modulating the specificity or potency of detoxification and antioxidant defenses during mammalian evolution. For example, residue 45 of GSTA1 is known to be involved in GSH binding (Balogh ), while GSTA1 residues 212 and 215 are close to a residue (216) important for thiolester substrate hydrolysis (Hederos ). Residue 107 (a histidine) of GSTM1 is a second substrate-binding site and has five radical amino acid changes in mammals. Site-directed mutatagenesis of this residue to an aspargine caused a 50% reduction in catalytic activity of human GSTM1 (Patskovsky ).

Mammalian cytosolic GSTs phylogeny and birth-and-death evolution

The phylogenetic relationships among 333 cytosolic GSTs were reconstructed by two independent methods which gave a similar estimation of mammalian phylogeny. The branch pattern indicated that the omega, theta, and zeta subclasses are ancient in mammals, while the alpha, mu, pi, and sigma subclasses evolved later (Figure 1). This is in agreement with previous studies based on structural and functional data (Armstrong 1997; da Fonseca ; Frova 2006). The omega subclass has a cysteine residue at the active site, while the theta and zeta subclasses employ catalytic serine hydroxyl to activate GSH. They are thus predicted to be the progenitors of GSTs (Frova 2006). According to our phylogenetic tree, the omega subclass harbors the most ancient genes (Figure 1). Omega GSTs have strong homology to glutaredoxins, the predicted ancestors of the N-terminal topology of GST (Frova 2006; Oakley 2005). Taken together, it is thus reasonable to assume that the omega subclass evolved earlier. A switch from serine to tyrosine in the alpha, mu, pi, and sigma subclasses is another evolutionary scenario of GSTs which would explain why these four subclasses cluster together in the tree with high bootstrapping (Figure 1). It would appear that the sigma subclass diverged before the mammalian alpha, mu, and pi group due to its presence in invertebrates and vertebrates (Frova 2006). In our study, however, sigma appeared as the sister group of the alpha subclass. This scenario is in line with a previous study (da Fonseca ). The mammalian sigma subclass, known as prostaglandin synthases, has a hydrophilic interface with a lock-and-key motif – similar to the alpha, mu, and pi subclasses (Sheehan ). Therefore, the observed clustering might be related to their specialized structure and function in mammals. On the other hand, our result also suggests that subclass mu proteins arose most recently. The theta subclass was placed as sister to a clade containing mu, pi, alpha, and sigma subclass with high support – supporting the prediction that alpha, mu, pi, and sigma subclasses arose from the duplication of theta subclass (Armstrong 1997). The GST gene family has been previously described in various prokaryotes and eukaryotes (Pearson 2005). In this study, we examined cytosolic GST gene repertories in 21 species representing all major mammalian taxa. Our results show an unequal copy number of GST genes across the mammalian phylogeny, as has previously been suggested (da Fonseca ; Pearson 2005). For instance, the largest gene expansion was observed in the mouse (30 copies), whereas only 16 GSTs were identified in bowhead whale. This could be due to a lineage or species-specific duplication or deletions of this gene family in mammals. In support of this possibility, a diverse pseudogene proportion was found in mammalian GSTs; ranging from 41% in Yangtze river dolphin and Yangtze finless porpoise to 0% in the African elephant and Florida manatee (Table 1). Studies of gene duplicates have shown that new genes are usually created by gene duplication (Lynch and Conery 2000; Nei and Rooney 2005). Some duplicated genes are maintained in the genome for a long time, while others are nonfunctional or deleted from the genome. This phenomenon is termed the birth-and-death evolution model (Lynch and Conery 2000; Nei and Rooney 2005). We argue that this model is supported by the gene duplication events observed in our data of mammalian GSTs. It is also important to note that extensive duplication (12 paralogous) of pi GSTs was found in the dog, with half being pseudogenes. It has been reported that most duplicated genes tend to experience a brief period of relaxed selection early in their history, often resulting in non-functionalization or pseudogenes (Lynch and Conery 2000). Therefore, the high fraction of pi GSTs pseudogenes in dog further supports a birth-and-death model of GSTs evolution. Notably, seven pi subclass paralogous identified in mouse are apparently intact and functional, as is the case for the naked mole rat mu GSTs (10 copies). This could be the outcome of adaptations to environmental toxins (mouse) and very low oxygen levels in underground burrows (naked mole rat) via diversification of duplicated copies. Moreover, our results reveal extensive positive selection in GSTP2 along five mammalian lineages (Figure S5), suggestive of episodic selection pressure – possibly in response to changes in xenobiotic exposure. In contrast, there is no evidence of positive selection in GSTP1 along specific lineages, indicating a functional conservation which is consistent with a critical role of GSTP1 in ethacrynic acid metabolism in the liver (Henderson ). These results reveal that divergent selective regimes occurred in paralogs within a cytosolic GST class.

Divergent selection of mammalian cytosolic GSTs

Numerous studies have reported that gene family evolution is closely tied with environments. This includes opsin genes in cichlid fish (Henderson ), hemoglobins in vertebrates (Nery ), keratin-associated proteins in vertebrates (Khan 2014; Sun ), and olfactory receptor genes in mammals (Hughes ; Niimura ). An interesting result of our analysis is the correlation between distinct ecological milieus and cytosolic GST gene copy number (Table 1). For example, in Carnivora 29 genes were identified in dog, while 22 and 25 genes were found in the Weddell seal and Pacific walrus, respectively. In Cetartiodactyla, 21 to 25 genes were found in Artiodactyla, while 16 to 17 genes were found in Cetacea. We also show that hypoxia-tolerant species with different ecological niches show evidence of divergent selection (Table S6), further suggesting that habitat plays a role in GST gene family evolution. Marine mammals (cetaceans, pinnipeds, and sirenians) encompass phenotypic convergences that accommodate the challenges of aquatic life. They present a similar respiratory and cardiovascular solutions, such as improved oxygen storage, to low oxygen levels (Ramirez ). Our data suggests that their shared adaptations are not reflected in the evolution of the GST gene family. A total of four, three, and one positively selected genes were identified along cetaceans, pinnipeds, and sirenians, respectively (Figure S5). This suggests that there is a difference in the evolutionary history of the GST family in marine mammals. There are several non-mutually exclusive scenarios that could explain this pattern: 1) Antioxidant status is directly related to diving capacity. Marine mammals that perform shallow/short and deep/long divers might experience different oxidative stress challenges, suggesting distinct mechanisms to maintain redox balance (Cantú-Medellín ); 2) Pinnipeds and sirenians possess enhanced enzymatic antioxidant capacities, while non-enzymatic antioxidant (e.g., levels of glutathione) seems to play an important role in cetaceans, indicating a different strategy for antioxidant defenses adaptation (Ninfali and Aluigi 1998; Wilhelm Filho ); 3) We do not rule out the potential impact of different number of species and the length of branches used in this study. As an ever-increasing amount of high-quality genome assemblies are generated, future studies is likely to resolve these issues.

Oxidative stress adaptation in cetaceans

Cetaceans are faced by chronic oxidant stress stemming from chemical pollutants in aquatic environment and reoxygenation following hypoxia (diving) (Valavanidis ; Li and Jackson 2002). Therefore, it might be expected that cetaceans have a large GST repertoire. However, we found that the number of cytosolic GST genes in cetaceans was significantly smaller than terrestrials (Figure 3C). The contraction of alpha GSTs is striking, with only two functional GSTAs (GSTA1 and GSTA4) in cetaceans. Similarly, five mu GSTs were identified in cetaceans, but only one gene (GSTM1) appears intact compared with 5-10 GSTM genes in other mammals (Figure 2). We observed that ancestral branches of cetaceans also showed reduced GST repertoires, which suggested that contraction of cetacean GSTs could be related to aquatic adaptations after the divergence of cetacea from artiodactyla approximately 55 Mya (Thewissen ). The presence of four GSTM pseudogenes provides further evidence for the contraction of the cetacean locus from a large GST family in the ancestral cetacean genome. Moreover, these gene losses in cetaceans are probably a consequence of relaxed selection after adaptations of aquatic environment (Table S7). This raises the probability of an alternative gene family responsible for enhanced oxidative stress resistance or perhaps selection on particular genes responsible for detecting toxicants and activating oxidative defenses, rather than a large gene repertoire. Regarding the former explanation, it has been reported that the peroxiredoxin (PRDX) and glutathione peroxidase (GPX) gene families have expanded in whale lineages (Yim ; Zhou ). On the other hand, it is also possible that the retained GSTs in cetaceans have improved or essential antioxidative properties. For example, single nucleotide polymorphisms that cause amino acid substitutions in GSTA1 alters its activity toward xenobiotics (Coles and Kadlubar 2005). GSTA4 protects against oxidative stress by clearing toxic lipid peroxidation by-products (Hayes ). GSTM1 is critical for the detoxification of various oxidants, as well as carcinogens and toxins (McIlwain ). Supporting an essential role of these cytosolic GST genes (GSTA1, GSTA4, and GSTM1), human and rodent studies have reported that these GSTs are also present in mitochondria where they likely play a role in protecting against mitochondrial injury during oxidative stress (Gallagher ; Raza ) We hypothesize that the cetacean homologs, in particular GSTA1 which was intact in all seven cetaceans, are retained because of their essential role in protecting against oxidative stress in both the cytosol and mitochondria. That is, complete loss of these genes is not tolerated. This hypothesis awaits experimental proof, including an assessment of the tissue/organelle location and function of cetacean GSTs. We also show evidence of positive selection on the retained GSTs in cetaceans. These observations lead us to speculate that widely dispersed xenobiotics in aquatic ecosystems and high oxidative stress drive the adaptive evolution of retained functional and essential GST genes in cetaceans. Previous studies demonstrated that gene loss in cetaceans could play an important role for natural phenotypic adaptations. For example, loss of genes with hair- and epidermis-related functions contributes to their unique skin morphology, a thicker epidermis and hairlessness (Sharma ). Some investigators have reported that gene loss may carry detrimental fitness consequences in modern environments. An intriguing recent example includes the loss of paraoxonase 1 (PON1) in marine mammals which likely eliminates their main defense against neurotoxicity from man-made organophosphorus compounds (Meyer ). Loss of GST genes in cetaceans may also have negative consequences. For instance, Gsta3 knockout mice are not only sensitive to acute cytotoxic and genotoxic effect of aflatoxin B1 (Ilic ), but also have increased oxidative stress marker levels (Crawford ), suggesting that GSTA3 loss in cetaceans might also weaken their oxidative damage defenses. Nevertheless, more research is necessary to validate whether a cetacean gene loss event is deleterious. In conclusion, we here characterized the cytosolic glutathione-S-transferase (GST) gene family in 21 mammalian species. In particular, our study shows that the gene family has contracted in cetaceans despite the important role of GST genes in the protection against various stressors. Our findings add another piece to the puzzle of understanding how ceteaceans adapt to an oxygen-poor aquatic environment. An ever-increasing amount of genomic research data and associated tools is likely to address this important research question.
  75 in total

1.  The human glutathione transferase alpha locus: genomic organization of the gene cluster and functional characterization of the genetic polymorphism in the hGSTA1 promoter.

Authors:  Fabrice Morel; Claudine Rauch; Brian Coles; Eric Le Ferrec; André Guillouzo
Journal:  Pharmacogenetics       Date:  2002-06

2.  Datamonkey: rapid detection of selective pressure on individual sites of codon alignments.

Authors:  Sergei L Kosakovsky Pond; Simon D W Frost
Journal:  Bioinformatics       Date:  2005-02-15       Impact factor: 6.937

3.  Acceleration of Olfactory Receptor Gene Loss in Primate Evolution: Possible Link to Anatomical Change in Sensory Systems and Dietary Transition.

Authors:  Yoshihito Niimura; Atsushi Matsui; Kazushige Touhara
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

4.  Diving and asphyxia. A comparative study of animals and man.

Authors:  R Elsner; B Gooden
Journal:  Monogr Physiol Soc       Date:  1983

Review 5.  Reactive species mechanisms of cellular hypoxia-reoxygenation injury.

Authors:  Chuanyu Li; Robert M Jackson
Journal:  Am J Physiol Cell Physiol       Date:  2002-02       Impact factor: 4.249

6.  Several glutathione S-transferase isozymes that protect against oxidative injury are expressed in human liver mitochondria.

Authors:  Evan P Gallagher; James L Gardner; David S Barber
Journal:  Biochem Pharmacol       Date:  2006-04-17       Impact factor: 5.858

7.  Structural analysis of a glutathione transferase A1-1 mutant tailored for high catalytic efficiency with toxic alkenals.

Authors:  Larissa M Balogh; Isolde Le Trong; Kimberly A Kripps; Kaspars Tars; Ronald E Stenkamp; Bengt Mannervik; William M Atkins
Journal:  Biochemistry       Date:  2009-08-18       Impact factor: 3.162

8.  Comparison between the antioxidant status of terrestrial and diving mammals.

Authors:  D Wilhelm Filho; F Sell; L Ribeiro; M Ghislandi; F Carrasquedo; C G Fraga; J P Wallauer; P C Simões-Lopes; M M Uhart
Journal:  Comp Biochem Physiol A Mol Integr Physiol       Date:  2002-11       Impact factor: 2.320

Review 9.  Glutathione S-transferase polymorphisms: cancer incidence and therapy.

Authors:  C C McIlwain; D M Townsend; K D Tew
Journal:  Oncogene       Date:  2006-03-13       Impact factor: 9.867

10.  MUSCLE: a multiple sequence alignment method with reduced time and space complexity.

Authors:  Robert C Edgar
Journal:  BMC Bioinformatics       Date:  2004-08-19       Impact factor: 3.169

View more
  1 in total

1.  Integrated Full-Length Transcriptome and MicroRNA Sequencing Approaches Provide Insights Into Salt Tolerance in Mangrove (Sonneratia apetala Buch.-Ham.).

Authors:  Beibei Chen; Zeyi Ding; Xiang Zhou; Yue Wang; Fei Huang; Jiaxin Sun; Jinhui Chen; Weidong Han
Journal:  Front Genet       Date:  2022-07-11       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.