Vittoria Roncalli1, Matthew C Cieslak1, Yale Passamaneck2, Andrew E Christie1, Petra H Lenz1.
Abstract
Detoxification is a fundamental cellular stress defense mechanism, which allows an organism to survive or even thrive in the presence of environmental toxins and/or pollutants. The glutathione S-transferase (GST) superfamily is a set of enzymes involved in the detoxification process. This highly diverse protein superfamily is characterized by multiple gene duplications, with over 40 GST genes reported in some insects. However, less is known about the GST superfamily in marine organisms, including crustaceans. The availability of two de novo transcriptomes for the copepod Calanus finmarchicus provided an opportunity for an in-depth study of the GST superfamily in a marine crustacean. The transcriptomes were searched for putative GST-encoding transcripts using known GST proteins from three arthropods as queries. The identified transcripts were then translated into proteins, analyzed for structural domains, and annotated using reciprocal BLAST analysis. Mining the two transcriptomes yielded a total of 41 predicted GST proteins belonging to the cytosolic, mitochondrial or microsomal classes. Phylogenetic analysis of the cytosolic GSTs validated their annotation into six different subclasses. The predicted proteins are likely to represent the products of distinct genes, suggesting that the diversity of GSTs in C. finmarchicus exceeds or rivals that described for insects. Analysis of relative gene expression in different developmental stages indicated low levels of GST expression in embryos, and relatively high expression in late copepodites and adult females for several cytosolic GSTs. A diverse diet and complex life history are factors that might be driving the multiplicity of GSTs in C. finmarchicus, as this copepod is commonly exposed to a variety of natural toxins. Hence, diversity in detoxification pathway proteins may well be key to their survival.
Year: 2015 PMID: 25945801 PMCID: PMC4422733 DOI: 10.1371/journal.pone.0123322
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Search results from in silico mining of a de novo transcriptome from Calanus finmarchicus using glutathione S-transferase (GST) queries obtained from the copepod Tigriopus japonicus, the cladoceran Daphnia pulex and the insect Drosophila melanogaster.
| Class | Subclass | Transcript accession number | Transcript length (nt) |
|---|---|---|---|
| Cytosolic | Delta/Theta/Epsilon | GAXK01204954 | 888 |
| | | GAXK01204965 | 921 |
| | | GAXK01204950 | 852 |
| | | GAXK01204940 | 965 |
| | | GAXK01204947 | 1182 |
| | | GAXK01204957 | 902 |
| | | | 764 |
| | | GAXK01204953 | 991 |
| | | GAXK01073468 | 401 |
| | | GAXK01096295 | 914 |
| | | GAXK01035521 | 823 |
| | Mu | GAXK01204944 | 957 |
| | | GAXK01204956 | 741 |
| | | GAXK01204948 | 742 |
| | | GAXK01204952 | 1686 |
| | | GAXK01204958 | 785 |
| | Omega | GAXK01204960 | 984 |
| | | GAXK01016325 | 1410 |
| | | GAXK01164502 | 894 |
| | Sigma | | 1346 |
| | | | 839 |
| | | GAXK01204943 | 815 |
| | | GAXK01204951 | 733 |
| | | GAXK01204959 | 955 |
| | | GAXK01204946 | 1022 |
| | | GAXK01204964 | 771 |
| | | GAXK01204961 | 884 |
| | | GAXK01204942 | 756 |
| | | GAXK01204966 | 881 |
| | Zeta | GAXK01204939 | 1033 |
| | | GAXK01204941 | 2790 |
| | | GAXK01084871 | 359 |
| Mitochondrial | Kappa | GAXK01046934 | 1108 |
| Microsomal | mGST-1 | GAXK01178264 | 771 |
| | | GAXK01081966 | 347 |
| | mGST-3 | GAXK01204963 | 54 |
| | | GAXK01204967 | 465 |
| | | GAXK01204962 | 7866 |
| | | | 680 |
Transcript lengths (base pairs) are given for each transcript identified.
†Length in nucleotides.
Accession numbers for query proteins: GST Delta, T. japonicus (ACE81244, ACE81245); GST Theta, T. japonicus (ACE81253); GST Epsilon, D. melanogaster (AAF57701); GST Mu, T. japonicus (ACE81251, ACE81252, ACE81254); GST Omega, T. japonicus (ACE81246); GST Sigma, T. japonicus (AAY89316); GST Zeta, T. japonicus (ACE81250); mitochondrial GST Kappa, D. pulex (EFX86155); microsomal GST-1, T. japonicus (ACE81248); microsomal GST-3, T. japonicus (ACE81249).
*Queries for the Delta, Theta and Epsilon subclasses returned the identical list of 11 transcripts.
C. finmarchicus transcripts whose accession numbers are shown in bold font have sequence support from expressed sequence tag data (see text).
Annotation of putative glutathione S-transferase-encoding transcripts from Table 1, using reciprocal BLAST results and protein domain analysis.
| Class-subclass and assigned protein name | Transcript accession No. | Deduced protein length (aa) | Structural domains | Species | Protein accession No. | E-value | % amino acid identity/similarity |
|---|---|---|---|---|---|---|---|
| Cytosolic: Delta | | | | | | | |
| Calfi-Delta-I | GAXK01204953 | 217 | GSTN, GSTC | | ACO12967 | 8.9e-44 | 40/78 |
| Calfi-Delta-II | GAXK01204968 | 217 | GSTN, GSTC | | ACO12967 | 8.9e-36 | 43/77 |
| Calfi-Delta-III | GAXK01204940 | 218 | GSTN, GSTC | | ACO12967 | 7.0e-33 | 39/80 |
| Calfi-Delta-IV | GAXK01204965 | 220 | GSTN, GSTC | | EFX81633 | 2.8e-29 | 69/70 |
| Calfi-Delta-V | GAXK01204954 | 221 | GSTN, GSTC | | ACE81245 | 2.8e-29 | 57/86 |
| Calfi-Delta-VI | GAXK01204957 | 237 | GSTN, GSTC | | ACO12967 | 1.1e-28 | 38/80 |
| Calfi-Delta-VII | GAXK01204947 | 262 | GSTN, GSTC | | ADD38060 | 3.2e-25 | 59/89 |
| Calfi-Delta-VIII | GAXK01073468 | 113 | GSTN | | ACE81245 | 7.5e-27 | 33/44 |
| Calfi-Delta-IX | GAXK01204950 | 178 | GSTC | | ACO15541 | 3.2e-17 | 44/68 |
| Calfi-Delta-X | GAXK01035521 | 201 | GSTN, GSTC | | ACO15749 | 2.2e-10 | 21/54 |
| Calfi-Delta-XI | GAXK01204939 | 330 | GSTN, GSTC | | ADD38823 | 1.3e-07 | 27/54 |
| Cytosolic: Theta | | | | | | | |
| Calfi-Theta-I | GAXK01096295 | 262 | GSTN, GSTC | | AEB91980 | 1.3e-79 | 45/72 |
| Cytosolic: Mu | | | | | | | |
| Calfi-Mu-I | GAXK01204944 | 222 | GSTN, GSTC | | ACO15225 | 2.2e-105 | 65/89 |
| Calfi-Mu-II | GAXK01204956 | 222 | GSTN, GSTC | | AGJ70295 | 3.1e-95 | 65/85 |
| Calfi-Mu-III | GAXK01204948 | 222 | GSTN, GSTC | | ACE81254 | 1.0e-90 | 48/69 |
| Calfi-Mu-IV | GAXK01204952 | 222 | GSTN, GSTC | | ACE81254 | 9.5e-90 | 58/82 |
| Calfi-Mu-V | GAXK01204958 | 222 | GSTN, GSTC | | ACE81254 | 4.1e-59 | 45/76 |
| Cytosolic: Omega | | | | | | | |
| Calfi-Omega-I | GAXK01204960 | 268 | GSTN, GSTC | | BAN21163 | 5.3e-46 | 36/64 |
| Calfi-Omega-II | GAXK01164502 | 235 | GSTN | | EGI63780 | 3.2e-32 | 35/64 |
| Calfi-Omega-III | GAXK01016325 | 272 | GSTN | | AFZ78680 | 2.6e-51 | 36/68 |
| Cytosolic: Sigma | | | | | | | |
| Calfi-Sigma-I | GAXK01204964 | 200 | GSTN, GSTC | | EFX82687 | 3.1e-44 | 36/70 |
| Calfi-Sigma-II | GAXK01204942 | 204 | GSTN, GSTC | | EFX82672 | 2.0e-42 | 40/67 |
| Calfi-Sigma-III | GAXK01204943 | 216 | GSTN, GSTC | | AAY89316 | 7.2e-45 | 40/74 |
| Calfi-Sigma-IV | GAXK01204959 | 217 | GSTN, GSTC | | XP_003694330 | 8.2e-38 | 35/71 |
| Calfi-Sigma-V | GAXK01204951 | 218 | GSTN, GSTC | | AAY89316 | 5.0e-33 | 38/70 |
| Calfi-Sigma-VI | GAXK01204961 | 218 | GSTN, GSTC | | EFX82672 | 5.1e-39 | 38/69 |
| Calfi-Sigma-VII | GAXK01204946 | 225 | GSTN, GSTC | | EFX63772 | 2.1e-39 | 38/76 |
| Calfi-Sigma-VIII | GAXK01204966 | 239 | GSTN, GSTC | | EFX82672 | 1.8e-62 | 45/75 |
| Calfi-Sigma-IX | GAXK01204949 | 196 | GSTN, GSTC | | AGZ95070 | 1.2e-35 | 35/51 |
| Calfi-Sigma-X | GAXK01204945 | 223 | GSTN, GSTC | | XP_00370954 | 1.1e-37 | 36/56 |
| Cytosolic: Zeta | | | | | | | |
| Calfi-Zeta-I | GAXK01204941 | 225 | GSTN, GSTC | | NP_731358 | 5.0e-84 | 54/82 |
| Calfi-Zeta-II | GAXK01084871 | 113 | GSTC | | AFI99067 | 6.1e-27 | 22/32 |
| Mitochondrial: Kappa | | | | | | | |
| Calfi-Kappa-I | GAXK01046934 | 256 | THX | | ADV59554 | 1.2e-65 | 36/61 |
| Microsomal: mGST-1 | | | | | | | |
| Calfi-mGST-1-I | GAXK01178264 | 93 | MAPEG | | XP_002023020 | 4.0e-42 | 36/61 |
| Calfi-mGST-1-II | GAXK01081966 | 93 | MAPEG | | XP_00453691 | 3.2e-26 | 31/52 |
| Microsomal: mGST-3 | | | | | | | |
| Calfi-mGST-3-I | GAXK01204963 | 156 | MAPEG | | AGN29624 | 3.2e-60 | 65/84 |
| Calfi-mGST-3-II | GAXK01204967 | 156 | MAPEG | | EFX85348 | 7.1e-37 | 46/77 |
| Calfi-mGST-3-III | GAXK01204955 | 264 | MAPEG | | EFX85347 | 2.2e-34 | 42/73 |
| Calfi-mGST-3-IV | GAXK01204962 | 145 | MAPEG | | AGN29624 | 5.1e-49 | 61/75 |
BLAST searches were limited to NCBI non-redundant protein database for arthropods (taxid: 6656).
†Length in amino acids.
* Predicted full-length protein flanked by “stop” codons at both N- and C-terminals.
** Putative full-length protein flanked by a “methionine” at the N-terminal, and a “stop” codon at the C-terminal. Identification of full-length is based on presence of expected structural domains and similarity to full-length proteins.
*** Partial protein with either the N-terminal “methionine” or C-terminal “stop” codon missing.
**** Protein originally identified as full-length, but prediction corrected to partial after alignment with their most similar transcript in the Norwegian Sea transcriptome (see text).
Abbreviations: GSTN, GST amino (N)-terminal domain; GSTC, GST carboxyl (C)-terminal domain; THX, thioredoxin-like domain; MAPEG, membrane-associated proteins in eicosanoid and glutathione metabolism domain.
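The full-length versus partial distinction in the footnotes above can be expressed as a small decision rule. The following is a minimal sketch, not the authors' pipeline: it assumes a single reading frame that has already been translated end to end with `*` marking stop codons, and it takes the longest stop-free segment as the candidate protein. The function name and the example sequences are hypothetical.

```python
def classify_deduced_protein(frame: str) -> tuple[str, str]:
    """Label a translated reading frame as full-length, putative full-length or partial.

    `frame` is one reading frame translated end to end, with '*' for stop codons.
    Assumption: the candidate protein lies in the longest stop-free segment.
    """
    segments = frame.split("*")
    idx = max(range(len(segments)), key=lambda i: len(segments[i]))
    segment = segments[idx]

    upstream_stop = idx > 0                      # a stop codon precedes the segment
    downstream_stop = idx < len(segments) - 1    # a stop codon follows the segment
    met = segment.find("M")
    has_met = met != -1

    if has_met and downstream_stop:
        # Start methionine and terminal stop are both present.
        label = "full-length" if upstream_stop else "putative full-length"
        return label, segment[met:]

    # Either the start methionine or the terminal stop is missing.
    protein = segment[met:] if (has_met and upstream_stop) else segment
    return "partial", protein


if __name__ == "__main__":
    for example in ("KL*RTMAAAK*GG", "MAAAKLL*", "AAAKLLG"):
        print(example, classify_deduced_protein(example))
```

In the record itself, putative full-length calls also rely on the expected structural domains and on similarity to full-length proteins, which this sketch does not check.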
Fig 1. Alignment of selected Calanus finmarchicus Delta GST (Calfi-Delta-IV) and Mu GST (Calfi-Mu-III) proteins with their top arthropod protein BLAST hits.
(A) Alignment of D. pulex Delta GST (Dappu-Delta) (Accession No. ; 222 amino acids long) and Calfi-Delta-IV (220 amino acids long). (B) Alignment of the T. japonicus Mu GST (Tigja-Mu; Accession No. ; 221 amino acids long) and Calfi-Mu-III (222 amino acids long). In each panel, “*” located beneath the alignment indicates residues that are identical in the two sequences, while “:” and “.” indicate conservatively substituted (similar) amino acids shared between the protein pairs. Amino acids highlighted in black are the ones predicted by SMART analysis to form the conserved amino (N)-terminal domain (GSTN); amino acids highlighted in red represent the conserved carboxyl (C)-terminal domain (GSTC).
Fig 2. Alignment of Calanus finmarchicus microsomal glutathione S-transferase subclass 1 proteins with mGST-1s from other crustaceans.
(A) Alignment of C. finmarchicus putative microsomal GST-1 proteins (Calfi-mGST-1-I and Calfi-mGST-1-II) with the T. japonicus query used in their discovery (Tigja-mGST-1; Accession No. ). Highlighted in green are amino acids in the conserved MAPEG structural domain identified using SMART software. The abbreviation “TM” indicates predicted transmembrane regions in the C. finmarchicus mGST-1 proteins. The “*” located beneath each alignment indicates residues that are identical in the two sequences, while “:” and “.” indicate conservatively substituted (similar) amino acids shared between the protein pairs. (B) Multiple alignments of C. finmarchicus microsomal GST-1 proteins (Calfi-mGST-1-I and Calfi-mGST-1-II) with publicly available mGST-1s from the crustaceans C. clemensi (Calcl), C. rogercresseyi (Calro), L. salmonis (Lepsa), T. japonicus (Tigja) and D. pulex (Dappu). The conserved motif consisting of 16 amino acids (VERVRRXHLNDXENIX, where the three Xs represent variable residues) is highlighted in blue. The non-conservative substitution found only in C. finmarchicus is highlighted in pink.
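The 16-residue motif described in panel B can be searched for with a regular expression in which each X matches any single residue. This is only an illustrative sketch: the function name is hypothetical and the test sequences are made up, not taken from the figure.

```python
import re

# VERVRRXHLNDXENIX, with each X treated as any one uppercase amino acid letter.
MGST1_MOTIF = re.compile(r"VERVRR[A-Z]HLND[A-Z]ENI[A-Z]")


def find_mgst1_motif(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for every motif occurrence."""
    return [(m.start(), m.group()) for m in MGST1_MOTIF.finditer(protein.upper())]


if __name__ == "__main__":
    # Made-up sequences for illustration only.
    print(find_mgst1_motif("MAAVERVRRAHLNDLENIPTT"))  # motif present
    print(find_mgst1_motif("MAAVERVRKAHLNDLENIPTT"))  # fixed position changed, no hit
```

A strict pattern like this would miss a sequence carrying a substitution at one of the fixed positions, such as the C. finmarchicus substitution noted in the caption; relaxing the affected position in the pattern would be one way to accommodate it.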
Number of genes in different classes and subclasses of the glutathione S-transferase superfamily in the crustaceans Calanus finmarchicus, Tigriopus japonicus [15] and Daphnia pulex [14] and the insect Drosophila melanogaster [53].
| Class | Subclass | C. finmarchicus | T. japonicus | D. pulex | D. melanogaster |
|---|---|---|---|---|---|
| Cytosolic | Delta | 11 | 2 | 4 | 11 |
| | Theta | 1 | 1 | 1 | 4 |
| | Epsilon | 0 | 0 | 0 | 14 |
| | Mu | 5 | 3 | 6 | 0 |
| | Omega | 3 | 1 | 1 | 4 |
| | Sigma | 10 | 1 | 10 | 1 |
| | Zeta | 2 | 1 | 3 | 2 |
| Mitochondrial | Kappa | 1 | 1 | 2 | 0 |
| Microsomal | mGST-1 | 2 | 1 | 2 | 4 |
| | mGST-3 | 4 | 1 | 2 | 0 |
| Total GSTs | | 39 | 12 | 31 | 40 |
* Proteins deduced from transcriptome data.
** Proteins deduced from genomic data.
† Predicted from the Gulf of Maine transcriptome only.
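The totals row of the gene-count table can be checked by summing the subclass counts per species. The sketch below transcribes the counts from the table and assumes that the four count columns follow the order in which the species are named in the caption (C. finmarchicus, T. japonicus, D. pulex, D. melanogaster); the variable and function names are illustrative.

```python
# Column order is an assumption taken from the table caption.
SPECIES = ("C. finmarchicus", "T. japonicus", "D. pulex", "D. melanogaster")

COUNTS = {
    ("Cytosolic", "Delta"): (11, 2, 4, 11),
    ("Cytosolic", "Theta"): (1, 1, 1, 4),
    ("Cytosolic", "Epsilon"): (0, 0, 0, 14),
    ("Cytosolic", "Mu"): (5, 3, 6, 0),
    ("Cytosolic", "Omega"): (3, 1, 1, 4),
    ("Cytosolic", "Sigma"): (10, 1, 10, 1),
    ("Cytosolic", "Zeta"): (2, 1, 3, 2),
    ("Mitochondrial", "Kappa"): (1, 1, 2, 0),
    ("Microsomal", "mGST-1"): (2, 1, 2, 4),
    ("Microsomal", "mGST-3"): (4, 1, 2, 0),
}

REPORTED_TOTALS = (39, 12, 31, 40)  # "Total GSTs" row of the table


def column_totals(counts):
    """Sum every subclass row, giving one total per species column."""
    return tuple(sum(column) for column in zip(*counts.values()))


def class_totals(counts):
    """Sum the rows within each GST class (cytosolic, mitochondrial, microsomal)."""
    totals = {}
    for (gst_class, _subclass), row in counts.items():
        previous = totals.get(gst_class, (0,) * len(row))
        totals[gst_class] = tuple(a + b for a, b in zip(previous, row))
    return totals


if __name__ == "__main__":
    print(dict(zip(SPECIES, column_totals(COUNTS))))
    print(class_totals(COUNTS))
```

The per-class sums show that the cytosolic class accounts for most of the predicted GSTs in every species listed.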
Fig 3. Phylogenetic tree for cytosolic GSTs from the crustacean Calanus finmarchicus and other selected crustacean and insect species.
The consensus Bayesian likelihood tree shows the relationships between cytosolic GSTs from C. finmarchicus (Cf, in color) and those from the insect D. melanogaster (Dm), the copepod T. japonicus (Tj), and the cladoceran D. pulex (Dp). The tree was built using an analysis of 10,000,000 generations in MrBayes, excluding the initial 2,500,000 generations as burn-in. Bootstrap values were calculated using RAxML with 1,000 iterations. For 73 branches, Bayesian posterior probabilities were greater than 0.5, with 68% of those between 0.9 and 1 (data not shown). 73 branches had bootstrap values greater than 50% (color-coded circles).
Fig 4. Relative expression of selected Calanus finmarchicus cytosolic, mitochondrial and microsomal GST-encoding transcripts across six developmental stages.
Relative expression measured in 2011 (black bars) and 2012 (gray bars) for nine GSTs is shown for embryos, early nauplii (NI-II), late nauplii (NV-VI), early copepodites (CI-II), late copepodites (CV), and adult females as log2-transformed RPKM (reads per kilobase per million mapped reads). (A) Cytosolic GSTs belonging to the Delta (A1-A2), Mu (A3), Omega (A4) and Sigma (A5-A6) subclasses. (B) Mitochondrial Kappa GST class. (C) Microsomal GST subclass 1 (C1) and subclass 3 (C2). Error bars for 2011 (black) are standard deviations of two technical replicates for each stage, while error bars for 2012 (gray) are standard deviations of three biological replicates.
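For readers unfamiliar with the unit, RPKM normalizes a read count by transcript length (in kilobases) and by sequencing depth (in millions of mapped reads). The sketch below applies the standard definition and the log2 transform named in the caption; the function names and the example numbers are hypothetical and are not taken from the study.

```python
import math


def rpkm(mapped_reads: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """Reads per kilobase of transcript per million mapped reads."""
    if transcript_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("transcript length and total mapped reads must be positive")
    return mapped_reads * 1e9 / (transcript_length_bp * total_mapped_reads)


def log2_rpkm(mapped_reads: int, transcript_length_bp: int, total_mapped_reads: int) -> float:
    """log2 of RPKM; undefined (raises ValueError) when no reads map to the transcript."""
    return math.log2(rpkm(mapped_reads, transcript_length_bp, total_mapped_reads))


if __name__ == "__main__":
    # Hypothetical example: 500 reads on a 1,000 bp transcript, 10 million mapped reads.
    print(rpkm(500, 1000, 10_000_000))
    print(log2_rpkm(500, 1000, 10_000_000))
```

How transcripts with zero mapped reads were handled before the log2 transform is not stated in the caption, so the sketch leaves that case as an error rather than assuming a pseudocount.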
Comparison between putative Calanus finmarchicus glutathione S-transferases (GSTs)* identified via transcriptome mining of two de novo assemblies representing populations from the Gulf of Maine [34] and the Norwegian Sea [39].
| Class | Subclass | Protein name | Gulf of Maine transcript accession No. | Gulf of Maine deduced protein type | Norwegian Sea transcript accession No. | Norwegian Sea deduced protein type | % amino acid identity between proteins |
|---|---|---|---|---|---|---|---|
| Cytosolic | Delta | Calfi-Delta-I | GAXK01204953 | F | GBFB01125513 | F | 99 |
| | | Calfi-Delta-II | GAXK01204968 | F | GBFB01087404 | F | 93 |
| | | Calfi-Delta-III | GAXK01204940 | F | GBFB01113062 | F | 98 |
| | | Calfi-Delta-V | GAXK01204954 | F | GBFB01106634 | F | 99 |
| | | Calfi-Delta-VI | GAXK01204957 | F | GBFB01102119 | F | 93 |
| | | Calfi-Delta-VII | GAXK01204947 | F | GBFB01126821 | F | 99 |
| | | Calfi-Delta-VIII | GAXK01073468 | P | GBFB01085692 | F | 99 |
| | | Calfi-Delta-IX | GAXK01204950 | P | GBFB01111155 | F | 98 |
| | | Calfi-Delta-X | GAXK01035521 | F | GBFB01031301 | P | 93 |
| | Theta | Calfi-Theta-I | GAXK01096295 | F | GBFB01082889 | F | 99 |
| | Mu | Calfi-Mu-I | GAXK01204944 | F | GBFB01130857 | F | 100 |
| | | Calfi-Mu-II | GAXK01204956 | F | GBFB01104663 | F | 93 |
| | | Calfi-Mu-III | GAXK01204948 | F | GBFB01069639 | F | 97 |
| | | Calfi-Mu-IV | GAXK01204952 | F | GBFB01171394 | F | 98 |
| | | Calfi-Mu-V | GAXK01204958 | F | GBFB01086262 | P | 200 |
| | Omega | Calfi-Omega-I | GAXK01204960 | F | GBFB01112247 | F | 94 |
| | | Calfi-Omega-II | GAXK01164502 | P | GBFB01122297 | P | 98 |
| | | Calfi-Omega-III | GAXK01016325 | P | GBFB01061154 | P | 99 |
| | Sigma | Calfi-Sigma-I | GAXK01204964 | F | GBFB01117919 | F | 92 |
| | | Calfi-Sigma-II | GAXK01204942 | F | GBFB01053851 | F | 99 |
| | | Calfi-Sigma-III | GAXK01204943 | F | GBFB01057086 | F | 99 |
| | | Calfi-Sigma-IV | GAXK01204959 | F | GBFB01080562 | F | 97 |
| | | Calfi-Sigma-V | GAXK01204951 | F | GBFB01211675 | P | 99 |
| | | Calfi-Sigma-VI | GAXK01204961 | P | GBFB01107763 | F | 96 |
| | | Calfi-Sigma-VII | GAXK01204946 | F | GBFB01147064 | F | 100 |
| | | Calfi-Sigma-VIII | GAXK01204966 | F | GBFB01103677 | P | 98 |
| | | Calfi-Sigma-IX | GAXK01204949 | F | GBFB01105091 | F | 99 |
| | | Calfi-Sigma-X | GAXK01204945 | F | GBFB01125239 | F | 97 |
| | Zeta | Calfi-Zeta-I | GAXK01204941 | F | GBFB01157033 | P | 100 |
| | | Calfi-Zeta-II | GAXK01084871 | P | GBFB01237836 | P | 100 |
| Mitochondrial | Kappa | Calfi-Kappa-I | GAXK01046934 | F | GBFB01121887 | F | 99 |
| Microsomal | mGST-1 | Calfi-mGST-1-I | GAXK01178264 | F | GBFB01067142 | F | 100 |
| | mGST-3 | Calfi-mGST-3-I | GAXK01204963 | F | GBFB01089387 | F | 99 |
| | | Calfi-mGST-3-II | GAXK01204967 | F | GBFB01094405 | F | 100 |
| | | Calfi-mGST-3-III | GAXK01204955 | F | GBFB01076955 | F | 100 |
| | | Calfi-mGST-3-IV | GAXK01204962 | P | GBFB01082093 | F | 99 |
*GST transcripts listed showed >90% amino acid identity between proteins.
†In Table 2 these proteins were identified as putative full-length because they are flanked by a “methionine” at the N-terminal and a “stop” codon at the C-terminal, and because their structural domains are conserved. However, alignment of each protein with its counterpart from the Norwegian Sea transcriptome suggests that they are partial proteins.
Putative Calanus finmarchicus glutathione S-transferases (GSTs) showing differences between the Gulf of Maine [34] and the Norwegian Sea [39] de novo assemblies.
| Type of variation | Protein name | Gulf of Maine transcript accession No. | Gulf of Maine deduced protein type | Norwegian Sea transcript accession No. | Norwegian Sea deduced protein type | % amino acid identity between proteins |
|---|---|---|---|---|---|---|
| Genetic variation | Calfi-Delta-IV | GAXK01204965 | F | GBFB01111154 | F | 48 |
| | Calfi-Delta-XI | GAXK01204939 | F | GBFB01141707 | F | 88 |
| Additional gene | Calfi-Omega-IV | GAXK01138968 | P | GBFB01119512 | F | 99 |
*This transcript was not identified as encoding a GST protein in the original screening of the Gulf of Maine transcriptome, but rather was detected via a query with its Norwegian Sea counterpart.