Literature DB >> 31193317

Analysis of haloarchaeal twin-arginine translocase pathway reveals the diversity of the machineries.

Deepanjan Ghosh1,2, Debjyoti Boral1,2, Koteswara Rao Vankudoth1,2, Sureshkumar Ramasamy1.   

Abstract

The twin-arginine translocase (Tat) pathway transports folded proteins across the plasma membrane and plays a critical role in protein transport in haloarchaea. Computational analysis and previous experimental evidence suggested that the Tat pathway transports almost the entire secretome in haloarchaea. The TatC, receptor component of this pathway shows greater variation in membrane topology in haloarchaea than in other organisms. The presence of a unique fourteen-transmembrane TatC homolog (TatCt) in haloarchaea, over and above the expected TatC topological variants, indicates a strong correlation between the additional homologs and the large number of substrates transported via the haloarchaeal Tat pathway. Various combinations of TatC homologs with different topologies-TatCo, TatCt, TatCn, and TatCx have been observed in haloarchaea. In this report, on the basis of these combinations we have segregated all haloarchaeal Tat substrates into two groups. The first group consists of substrates that are transported by TatCt alone, whereas the second group consists of substrates that are transported by the other TatC homologs (TatCo, TatCn, and TatCx). The various haloarchaea TatA components also shows the possible segregation towards the substrates. We have also identified the possible homologs for Tat substrate chaperones, which act as a quality-control mechanism for proper protein folding. Further sequence analysis implies that the two TatC domains of TatCt complement each other's functionally. Substrate analysis also revealed subtle differences between the substrates being transported by various homologs: further experimental analysis is therefore required for better understanding of the complexities of the haloarchaeal Tat pathway.

Entities:  

Keywords:  Microbiology

Year:  2019        PMID: 31193317      PMCID: PMC6525301          DOI: 10.1016/j.heliyon.2019.e01587

Source DB:  PubMed          Journal:  Heliyon        ISSN: 2405-8440


Introduction

The twin-arginine translocase (Tat) pathway exclusively transports folded proteins across the plasma membrane [1, 2, 3, 4], unlike the Sec pathway, which transports proteins in an unfolded form [5] and requires ATP for the protein translocation. In almost all prokaryotes, the Tat pathway is responsible for transporting fewer proteins across the membrane than the Sec pathway. Archaea are known to be evolutionarily distinct from both bacteria and eukarya in many of their cellular characteristics. In the case of halophilic archaea, the Tat pathway has a decisive role in transporting almost 50% of secretome [6, 7, 8]. This might be because of the high cytoplasmic salt concentration in halophilic archaea, which these organisms maintain in order to balance the high sodium concentration of the environment. The high cytoplasmic salt concentration results in faster folding of proteins to prevent aggregation, so relatively few unfolded proteins are transported. The transport of folded proteins also ensures that the proteins are not folded in the extracellular environment, where chaperones might not be available and the chance of misfolding is therefore much higher [9]. Tat translocation is carried out by integral membrane proteins in the TatA and TatC families [1, 10]. In some organisms, such as archaea and Gram-positive bacteria with low guanine-cytosine (GC) genomic content, single TatA and TatC components are sufficient to mediate transport through the Tat pathway (the TatAC system). For example, Bacillus subtilis contains two TatC genes (denoted as TatCd and TatCy), which coordinate functions with their respective TatA partners (TatAd and TatAy). However, in other cases, Tat-mediated translocation involves another member of the TatA family [11], termed TatB. The Tat pathway in Escherichia coli consists of three distinct membrane-localized proteins (TatA, TatB, and TatC) and is the most studied model in which all the three components are essential for Tat functioning. TatABC systems are present in Gram-negative bacteria (including E. coli) [12, 13], in Gram-positive bacteria with high GC-content genomes, and in plant chloroplasts [14]. TatC is the receptor component that recognizes the characteristic RR-signal in the substrate N-terminal [15, 16]. TatC or the TatBC complex interacts with the substrate and recruits TatA. TatA forms an appropriately sized pore through which the substrate is transported across the membrane [17]. In plants, the thylakoid Tat system is required for the assembly of many essential components, such as photosystem II and the cytochrome b6f complex [18, 19]. The recently solved crystal structures of TatC from the thermophilic bacteria Aquifex aeolicus [20, 21] confirmed the presence of six transmembrane (TM) domains and the orientation of the amino and carboxy termini towards the cytoplasmic side of the membrane [12, 22, 23]. The structure of TatAd from Bacillus subtilis has been determined by NMR to be composed of a single TM domain and a cytosolically located amphipathic helix [24]. Common Tat substrates include respiratory redox enzymes, bacterial virulence factors, lipoproteins [25, 26, 27], and proteins involved in maintaining cell-wall integrity. These are mostly complexes containing cofactors, and many form oligomeric assemblies. Delivery of these proteins is monitored by a special class of cytoplasmic chaperones that bind specifically to the Tat signal of their substrate protein masking the signal sequence thereby ensuring proper maturation and co-factor loading. These chaperones, dubbed redox-enzyme maturation proteins (REMP), prevent the futile export of immature protein [28, 29, 30]. In this study, we examined 20 halophilic archaea whose complete proteomes were available in the Universal Protein Resource (UniProt) database. One or two TatA homologs were found in each organism. In contrast, many of the organisms contained two to four TatC homologs. We investigated the implications of the presence of these TatC homologs in these organisms. Experiments have shown that an atypical TatC homolog with fourteen TM domains (TatCt) is present in halophilic archaea [31]. Unlike other organisms, the archaeal cell membrane phospholipid is composed of branched isoprene units linked by ether groups to glycerol. It may have some advantages to possess the novel protein transport components such as TatCt may be the result of the unusual membrane phospholipid structure. We believe that the variety in TatC homologs would have some significance for the substrates being transported [26]. To assess this, we analyzed the topology, horizontal gene transfer, and substrate diversity for the different TatC homologs. We also analyzed the TatA components with respect to E. coli and A. aeolicus TatA.

Materials and methods

Distribution of Tat components and topology assignment

All data on Tat pathway components was obtained via a query search in the UniProt database and was tabulated by organism. The amino acid sequences for TatC in these organisms were retrieved and were analyzed with the TMHMM Server v. 2.0 [32], which segregated the TatC homologs into different topologies. Different TM prediction servers were used to reinforce these results: the DAS-TM filter server, the HMMTOP server, and SCAMPI [33, 34, 35].

Sequence analysis and GC-content analysis

All TatC homologs were aligned using ClustalW and a neighbor-joining tree with a bootstrap value of 10,000 was generated. The sequence analysis was performed by Clustal-Omega-generated multiple sequence alignment and the relative positions of secondary structures were marked according to the TMHMM data. The GC content of the TatC gene along with fifty upstream and fifty downstream genes were plotted as box plots. For comparison of TatC versus genomic GC content, the entire gene chromosome sequence was used to obtain the GC content. The two datasets were compared using a two tailed z-test.

Substrate analysis

The complete proteomes for all twenty haloarchaea were downloaded from the UniProt database. The Tat substrates were identified using the TatFind server. To segregate the substrates, a bidirectional BLAST was performed against each other using standalone BLAST version 2.2.29. An E-value cutoff of 0.5 was imposed; all hits with less than 50% query coverage were removed. From these hits only the bidirectional hits (BETs) were considered for analysis (Fig. S2).

Signal sequence analysis

Multiple sequence alignment files for the RR signal in each dataset were generated from the TatFind data with ClustalW. TheWebLogo was generated at http://weblogo.berkeley.edu/logo.cgi.

Pfam domain analysis

The Pfam hidden Markov models (HMMs) present in each substrate were identified through a Pfam batch sequence search. The unique and common HMMs were then identified in the different groups. Venn diagrams were generated with http://bioinfogp.cnb.csic.es/tools/venny/.

Chaperon identification

Sequences for the known tat related chaperons were obtained from UniProt and blasted against the proteomes of these organisms using standalone BLAST (blast-2.2.29+) with an E-value cutoff of 1. Sequences were blasted and homologs for each of them were obtained. Standard BLASTp output provided pairwise sequence alignment using which the signature sequence conservation was checked.

Results and discussion

The archaeal Tat system generally consists of TatA and TatC, but is devoid of TatB. The Tat pathway is present in about half of the Euryarchaeota members and several of the Crenarchaeota species that have been sequenced. The number of substrates transported by the Tat system in archaea is generally comparable to that for bacteria, with many archaea, such as Sulfolobus tokodaii and Archaeoglobus fulgidus, encoding only a few known Tat substrates, primarily cofactor-containing redox proteins [36, 37]. But, in the case of haloarchaea, a substantially greater number of substrates are translocated via the Tat pathway. Sequenced and annotated data for twenty haloarchaeal species available in UniProt were used for this analysis.

Membrane topological variants of TatC

The amino acid sequences for TatC in haloarchaeal were retrieved and using TMHMM server v. 2.0 [32] the TMs were assigned. The homologs were segregated into different topologies. This analysis segregated the TatC homologs into four classes TatC with 6-TM helices (TatCo), TatC with 6-TM helices and possessing a long N-terminal cytoplasmic loop (TatCn), TatC with 10-TM helices (TatCx) and TatC with 14-TMhelices (TatCt) based on the major differences in membrane topology, specifically the number of TM helices and the length of the N-terminal cytoplasmic region. Different TM-prediction servers were used to validate these results: the DAS-TM filter server [34], the HMMTOP server [33], and SCAMPI [35]. The results of the analyses from these servers confirmed the results we obtained from the TMHMM server, except in few cases and on an average all the proteins fell into one of the topological categories described above (Fig. 1 and Table 1) [23].
Fig. 1

Schematic representation of different TatC topology and grouped in to different groups based on combinations. TatCt + TatCo, TatCt + TatCn and TatCt + TatCx which are denoted in group 1, Group 2 and group 3, respectively.

Table 1

Analysis of Tat Pathway receptor component and number of substrates present in Haloarchaea that were sequenced and annotated.

OrganismNo. of tat substratesNo. oftatCTatCtTatCxTatCnTatC0tatC (id)
Halophilic
Halalkalicoccus jeotgali103/4212211D8J5Y4 (6),D8J5Y5 (14)
Haloarcula hispanica122/3860211G0HQX6 (10),G0HQX5 (14)
Haloarcula marismortui148/4237211Q5UYP5(10),Q5UYP6(14)
Halobacterium salinarum (strain ATCC 29341)72/2578211B0R7G7 (6),B0R7G6 (14)
Halobacterium salinarum (strain ATCC 700922)63/2426211B0R7G7 (6),B0R7G6 (14)
Haloferax mediterranei171/4712312I3R116 (6L),I3R115 (14),M0IS38(6L)
Haloferax volcanii143/4813322aL9UI27(6L),D4GZD0 (6L),D4GZC9 (14),L9UGM9 (14)
Halogeometricum borinquense161/44384121L9UQ35 (6L),E4NRH6(14),E4NRH7(6L),E4NPB6(6)
Halomicrobium mukohataei125/3343211C7P1B7(14),C7P1B8(10)
Halopiger xanaduensis194/4221211F8D3F7 (6L),F8D3F6 (14)
Haloquadratum walsbyi (strain DSM 16790)64/2546211Q18E63 (6L),Q18E62 (14)
Haloquadratum walsbyi (strain DSM 16854)59/2638211G0LFQ9 (14),G0LFQ8 (6L)
Halorhabdus utahensis100/3001211C7NVD1 (10),C7NVD2 (14)
Halorubrum lacusprofundi98/3497211B9LTY5 (14),B9LTY6 (6L)
Haloterrigena turkmenica230/5116211D2RTM5 (14),D2RTM6 (6L)
Halovivax ruber132/3099211L0IGT4 (6L),L0ID04 (14)
Natrialba magadii228/4936211D3SVL3 (6L),D3SVL2 (14)
Natrinemapellirubrum171/5026211L0JIB8(6L),L0JEW5 (14)
Natrinema sp.142/4296211I7CEK6 (6L),I7CPH1(14)
Natronobacterium gregoryi138/3624211L0AHR8 (6L),L0AFL9 (14)
Natronomonas moolapensis69/2721211M1XNN2(6),M1XZE0 (14)
Natronomonas pharaonis106/2764211Q3ISX0 (6),Q3ISW9(14)
Others
Aquifex aeolicus15/155311O67305 (6)
Acidianus hospitalis2/232922F4B4X0 (6),F4B6B8(6)
Desulfurococcus kamchatkensis0/14700
Sulfobus ilandicus3/26610
Archaeoglobus veneficus7/206522F2KP54(6),F2KQ26 (6)
Methanobrevibacter smithii0/17830
Picrophilus torridus2/153511Q6KZY6 (6)

Same type.

Schematic representation of different TatC topology and grouped in to different groups based on combinations. TatCt + TatCo, TatCt + TatCn and TatCt + TatCx which are denoted in group 1, Group 2 and group 3, respectively. Analysis of Tat Pathway receptor component and number of substrates present in Haloarchaea that were sequenced and annotated. Same type. The expected 6-TM topology of TatC has been confirmed through analysis of its crystal structure [20, 21] and the existence of a 14-TM homolog has been demonstrated experimentally [26]. Multiple sequence alignment additionally confirmed that the TM predictions were appropriate (data not shown). The consistent prediction by all tools of a TatC homolog with 10-TM helices strongly suggests the existence of a 10-TM TatC homolog. The existence of this TatCx topological class is further supported by previous studies in H. marismortui [26]. Topology similar to that of the TatCn category has been commonly observed in TatC from chloroplasts [38]. The TatCt topological class is noteworthy because it is unique to haloarchaea. Further investigation via BLASTp analysis indicated that TatCt comprises two putative TatC domains. Multiple sequence analysis and TMHMM data confirmed that the first six TM helices at the N-terminal and the last six helices at the C-terminal had significant homology with the normal TatC domains (Fig. 2 and Table 2). Two TM helices and a large cytoplasmic loop separate these two homologous domains. No homologs could be identified for the middle two TM helices. For TatCx, the first six N-terminal TM helices form a characteristic TatC domain. No sequence homology was observed for the last four TM helices with any of the sequences. We then examined the distribution of these four TatC topology classes among the haloarchaea species. It was observed that all the species encompasses atleast one copy of TatCt and combination with TatCo, TatCn or TatCx (Table 1). Thus, we defined three main groups (Fig. 1): TatCt + TatCo (Group I), TatCt + TatCn (Group II) and TatCt + TatCx (Group III).
Fig. 2

The alignment between N and C terminal TatC domain of TatCt.

Table 2

Percentage similarity between N-terminal and C-terminal TatC domains of TatCt of Holoarchaea.

OrganismPercentage similarity
Halalkalicoccus jeotgali_D8J5Y520
Haloarcula hispanica_G0HQX521
Haloarcula marismortui_Q5UYP621
Halobacterium salinarum_B0R7G623
Haloferax mediterranei_I3R11523
Haloferax volcanii_D4GZC922
Haloferax volcanii_L9UGM926
Halogeometricum borinquense_E4NRH622
Halomicrobium mukohataei_C7P1B723
Halopiger xanaduensis_F8D3F621
Haloquadratum walsbyi_Q18E6222
Haloquadratum walsbyi_G0LFQ922
Halorhabdus utahensis_C7NVD222
Halorubrum lacusprofundi_B9LTY523
Haloterrigena turkmenica_D2RTM522
Halovivax ruber_L0ID0422
Natrialba magadi_D3SVL222
Natrinema pellirubrum_L0JEW520
Natrinema sp._I7CPH122
Natronobacterium gregoryi_L0AFL924
Natronomonas moolapensis_M1XZE023
Natronomonas pharaonis_Q3ISW925
The alignment between N and C terminal TatC domain of TatCt. Percentage similarity between N-terminal and C-terminal TatC domains of TatCt of Holoarchaea.

Sequence and phylogenetic analysis of TatC

A neighbor-joining tree generated from aligned TatC homologs was analyzed for all unique TatCt homologs, to assess the clustering of organisms in the three main groups outlined above. We found that the organisms with TatCt + TatCx and TatCt + TatCo were formed the distinct cluster (Fig. 3). The organisms with TatCt + TatCn did not cluster together but were widely distributed.
Fig. 3

Phylogenetic analysis of all TatCt from haloarchaea. Group 3 TatCt shown in bracket and group 1 members are highlighted; the remaining members from group 2 group.

Phylogenetic analysis of all TatCt from haloarchaea. Group 3 TatCt shown in bracket and group 1 members are highlighted; the remaining members from group 2 group. The sequences were further analyzed by multiple sequence alignment against E. coli and A. aeolicus, for which crystal structure is available, to determine the crucial residues that were actively involved in Tat signal interaction and dimer formation, and explore whether these residues were conserved across sequences. This analysis provides an insight into the probable oligomerization state of the different classes of TatC in archaea (Fig S1 and Table ST1). The TM regions were marked to ensure that the sequences conserved in the analysis were in proximity to the reported regions of the protein. A. aeolicus Glu165, which is exposed at the center of the concave face and thus places an ionizable group in the hydrophobic interior of the bilayer [20, 21], was replaced by glutamine and was conserved across TatCo, TatCn, TatCx and the N-terminal domain of TatCt. However, in the C-terminal TatCt domain, Glu165 was replaced with asparagine or aspartic acid. This forms hydrogen-bond networks with Ser107 and Tyr85 [39]. Both these residues were conserved across TatCo, TatCn and TatCx; they were only partially conserved in the two TatC domains of TatCt - Ser107 in the N-terminal TatCt domain and Tyr85 in the C-terminal TatCt domain. This evidence indicates that the difference in membrane properties in the haloarchaea Tat pathway might be due to the conservation of glutamine instead of glutamic acid. In addition, the differential conservation of the three residues Glu165, Ser107, and Tyr85 in the two Tat domains of TatCt might enable complementary functioning of the two domains for active transport. Glu96 is another critical residue for Tat transport. This residue was conserved across TatCo, TatCn, TatCx and the C-terminal Tat domain of TatCt. A similar distribution was found even in case of the residues involved in signal binding in Aquifex [40]. The Pro42, Phe87, Pro90 and Leu92 residues were conserved in TatCo, TatCn, TatCx and C-terminal Tat domain of TatCt but only Pro90 was conserved in TatCt. Pro57 and Pro48 were conserved in the N-terminal Tat domain of TatCt instead of Pro42. We have also investigated the residues involved in TatC dimerization and interaction with other Tat pathway membrane components. While the Asp205 was conserved across all TatC classes except for the C-terminal TatC domain of TatCt, the Tyr30 residue was not conserved in TatCo, TatCn or the N-terminal TatC domain of TatCt. We further surveyed GC-content for these TatC homologs in order to test the possibility of horizontal gene transfer to haloarchaea. The GC content of the TatC gene and 50 upstream and 50 downstream genes were plotted. For comparison of TatC versus genomic GC content, the entire gene chromosome sequence was used to obtain the GC content. The two datasets were compared using a two tailed z-test. It is clear from Fig. 4 that there is no horizontal gene transfer. No significant difference was observed between the individual GC content versus the genomic GC content for any of the TatC homologs (z = 1.044; two-tailed p = 0.2965), thereby indicating that these TatC gene homologs may have evolved within the haloarchaea and consequently the presence of a correlation between the different TatC homologs and the types of substrate transported by them.
Fig. 4

The whisker plot representation of GC content of fifty genes flanking either side of the TatC (represented as filled dot) of respective organism.

The whisker plot representation of GC content of fifty genes flanking either side of the TatC (represented as filled dot) of respective organism.

Haloarchaeal TatA

The size of TatA components in the selected Haloarchaea ranges from 75 to 145 amino acids. Phylogenetic analysis of unique TatA sequences (aligned with E. coli TatA and TatE and A. aeolicus TatA1 and TatA2) established that unlike TatCt, the TatA do not segregate according to the groups based on TatC combinations (Fig. 5). The alignment of E. coli TatA structure with haloarchaeal showed that the first TM region and the amphipathic helix region is highly conserved across all TatA components, but there was very little similarity in the n-terminal region (Fig. S2). TM residue Gln8 of E. coli TatA points inward, resulting in a short hydrophobic pore in the center of the complex. Different views were proposed based on simulations of the TatA complex in lipid bilayers indicate that the short TM domain distorts the membrane [41]. This residue is replaced with Glu in all haloarchaea. The E. coli Gly21 remains conserved in haloarchaea that keeps TM helix and amphipathic helix at right angles except in the case of TatA1 of Halomicrobiu mmukohataei (UniProt id C7P2I0). The movement of C-terminal portion of the amphipathic helix is considered important for TatA function and Phe39, mutation (F39A) of which causes TatA inactivation [42], is conserved across haloarchaea. Halorubrum lacusprofundi TatA (UniProt id B9LS70) is an exception which has a non-aromatic amino acid leucine. Phylogenetic analysis also showed that these two TatA homologs (C7P2I0 and B9LS70) segregate out, which may suggest different substrate interaction or inactive TatA homologs as these organisms contain another TatA homolog. Presence of a predicted coiled coil motif in the C-terminal of Natronomonas moolapensis TatA homolog (UniProt id M1XKM7) might play additional function in recognition.
Fig. 5

Phylogenetic analysis of TatA/E from 20 haloarcheal species studies. Group 1 members are highlighted in yellow, group 2 in green and group 3 in blue.

Phylogenetic analysis of TatA/E from 20 haloarcheal species studies. Group 1 members are highlighted in yellow, group 2 in green and group 3 in blue.

Tat substrate distribution among the different groups of haloarchaea

As discussed above, the three groups of species had TatCt in common; thus we hypothesized that the Tat substrates in each group would be translocated either by TatCt or by TatCo, TatCn or TatCx. The Tat substrates were identified from proteome of all twenty haloarchaea using the TATFIND 1.4 server [6]. To segregate the substrates, a bidirectional BLAST was performed against each other using standalone BLAST version 2.2.29. From these hits, only the bidirectional hits (BETs) were considered for analysis. The BETs were considered as shared sets and the substrates which had no hits were termed as unique sets. Furthermore, the substrates translocated by TatCt might exhibit common characteristics across all three groups in terms of signal and functional domains. We conducted extensive sequence analysis of the Tat substrates for these haloarchaea to check for any unique features exhibited by each group of substrates: those potentially transported by TatCt and those transported by the other TatC homologs [26]. From a total of 1,287,495 proteins, 2670 substrates were identified for the Group II (TatCt + TatCn), Group III (TatCt + TatCx), and Group I datasets (TatCt + TatCo) (Fig. S3).

Tat signal motif analysis

The Tat signal motif consists of three basic domains: a positively charged region at the N-terminal, a hydrophobic core and a more polar region that contains the cleavage site for a signal peptidase [6, 36, 43, 44]. The TATFIND server identifies Tat substrates and also provides information about the N-terminal signature RR signal and the middle hydrophobic region. The canonical RR signal is the region, which directly interacts with the TatC receptor [45, 46]; thorough analysis of this region was conducted for all Tat substrates in our dataset. It was clear that the phenylalanine in the fifth position of the signal sequence was highly conserved in almost all the Tat substrates examined (Fig. 6a), however, there were also differences in the mean hydrophobicity (data not shown). These results indicate that the signal region clearly plays a role in recognition, but that recognition is not solely due to either the RR signal or the hydrophobicity score of the signal region alone. The multiple sequence alignments of fifty amino acids of the N-terminal clearly showed a high occurrence of AGC sequences at the C-terminal of the hydrophobic region in a shared set of Tat substrates, but this was not observed in a unique set of substrates. This AGC sequence forms a distinctive recognition site called the ‘lipobox’ [31] and was depicted via the WebLogos (Fig. 6b). In the mature lipoprotein, this cysteine residue is attached to a membrane-associated lipid anchor. Analysis using the TatLipo server [7] confirmed that most of the substrates in the shared set were lipobox positive and were probably lipoproteins. This feature therefore clearly distinguishes between the shared (exported by more than one class of TatC) substrate datasets and the unique (exported by one type of TatC) substrate datasets. It is also probable that TatCt mainly transports proteins from the shared set, which are probably lipoproteins, whereas the other TatC homolog classes may transport the remaining unique set of substrates, which may be secreted proteins.
Fig. 6

a) Weblogo representation of Tat signal motifs in each class of substrates. b) Weblogo representation of Tat signal peptide regions from different classes of substrates showing the occurrence of lipobox motif (AGC) in the shared substrates. WebLogo was generated at http://weblogo.berkeley.edu/logo.cgi.

a) Weblogo representation of Tat signal motifs in each class of substrates. b) Weblogo representation of Tat signal peptide regions from different classes of substrates showing the occurrence of lipobox motif (AGC) in the shared substrates. WebLogo was generated at http://weblogo.berkeley.edu/logo.cgi.

Functional analysis of the Tat pathway of haloarchaea

We assessed the substrates in our dataset for specific functional roles, but in haloarchaea only a few have been experimentally tested and most are uncharacterized. We analyzed each substrate using the Pfam database to identify the different domains, repeats and signature motifs. The unique and common HMMs were identified in the different Tat groups (Table 3). A total of 891 unique families were identified among the 2670 haloarchaea Tat substrates. From the analysis, it can be inferred that there is more domain diversity across the unique set and that the shared set has many domains in common (Fig. 7 and Table 3).
Table 3

Distribution of PFAM families.

DatasetsNumber of pfam familiesNumber of proteins with pfam families predicted
Group 1 shared sets109231
Group 1 unique sets4729
Group 2 shared sets237555
Group 2 unique sets208144
Group 3 shared sets136276
Group 3 unique sets4731
Fig. 7

Venn diagrammatic representation of the Pfam family distribution in shared and unique group of substrates. Venn diagrams were generated with http://bioinfogp.cnb.csic.es/tools/venny/.

Distribution of PFAM families. Venn diagrammatic representation of the Pfam family distribution in shared and unique group of substrates. Venn diagrams were generated with http://bioinfogp.cnb.csic.es/tools/venny/. The Pfam families for the shared set of substrates were then compared across datasets. Fifty eight families were common to all the groups for the shared set, but only two families were common to all the groups for the unique set (Fig. 7). To assess the significance of the shared and unique sets of families, the distribution of these domains across the groups were analyzed by re-mapping the list of Pfam families to the substrates in each set and then mapping all the families present in these proteins. Although there were many unique Pfam families in both sets, most of the shared substrates contained one or more common Pfam families. The 58 frequently occurring Pfam families found in all the shared sets were mostly metal-binding domains, periplasmic substrate-binding domains that are responsible for ion transport, glycosidases, dehydrogenases and domains responsible for the biosynthesis of pyrimidine and trypsin-2. There were also four domains of unknown function and four from PfamB. It was quite clear that the proteins in the shared substrates set were mostly membrane-associated proteins, which correlates with the presence of lipobox in the signal region. The unique substrates set generally had lyase or hydrolase-type domains. Interestingly, a group of unique substrates in the group II were identified as 7TM-GPCRs, PAZ domains, Cox II, and a few metal-binding domains. However, most of the Tat substrates present in the unique substrates set were secreted proteins.

Protein-folding quality control

The quality-control mechanism for substrates transported by the Tat machinery is well established. In the bacterial system, this process is taken care of by chaperones that referred as REMP [47], which mask the twin-arginine signal and ensure proper folding or substrate maturation with appropriate cofactor loading [48]. In the case of halophilic archaea, the presence of such a control mechanism may be even more vital because the Tat pathway transports almost the entire secretome. We therefore identified chaperone homologs from the proteomes of these organisms. Homologs for E. coli REMP such as DmsD, HyaE, HybE, NapD, NarJ, NarW, TorD and YcdY were identified. Pairwise sequence analysis showed that the key sequence motif shown in Fig. 8, Fig. S4 and Table ST2 was partially conserved and the protein sizes were comparable in most of the cases, indicating that the haloarchaeal Tat pathway uses chaperones similar to their bacterial counterparts.
Fig. 8

REMP motifs E. coli TorD aligned to identified archaeal TorD homologs.

REMP motifs E. coli TorD aligned to identified archaeal TorD homologs.

Conclusions

Overall, these analyses provide insight into the diversity of the indispensable Tat pathway in the haloarchaeal system. The evolution of the Tat pathway itself suggests that there may be a fundamental conflict between the substrates transported and the Sec export mechanism in organisms such as haloarchaea that live in extreme habitats. The diversity and multiple unique topologies of the TatC receptor in haloarchaea indicate that they may exist specifically for transport of large numbers and a wide variety of Tat substrates. Conservation in the N-terminal region of TatA homologs to that of E. coli implies similar mechanism of transport but diversity in the C-terminal region could be useful in interaction with the wide range of substrates. This is a requirement for haloarchaeal proteins; they would otherwise aggregate after exiting the ribosome because haloarchaea have a high intracellular concentration of positive ions in order to maintain osmotic balance against the high extracellular sodium concentration. The quality of protein folding is a major concern for these organisms, and is positively facilitated by the sophisticated Tat pathway components. Unlike in other systems, many substrate groups are present here based on the class of receptor required for the translocation. This may be necessary for differential translocation dynamics and to ensure the stringent folding quality. Further invivo and invitro studies are required for dissecting the dynamics of the Tat pathway in haloarchaea.

Declarations

Author contribution statement

Deepanjan Ghosh: Conceived and designed the experiments; Performed the experiments; Analyzed and interpreted the data; Wrote the paper. Debjyoti Boral: Performed the experiments; Analyzed and interpreted the data. Koteswara Rao Vankudoth: Analyzed and interpreted the data; Contributed reagents, materials, analysis tools or data; Wrote the paper. Sureshkumar Ramasamy: Conceived and designed the experiments; Contributed reagents, materials, analysis tools or data; Wrote the paper.

Funding statement

Deepanjan Ghosh was supported by the Council of Scientific and Industrial Research (CSIR) New Delhi with a Senior Research Fellowship. Debjyoti Boral is supported by University Grant Commision (UGC), India.

Competing interest statement

The authors declare no conflict of interest.

Additional information

No additional information is available for this paper.
  47 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

Review 2.  The Tat protein export pathway.

Authors:  B C Berks; F Sargent; T Palmer
Journal:  Mol Microbiol       Date:  2000-01       Impact factor: 3.501

3.  Chloroplast TatC plays a direct role in thylakoid (Delta)pH-dependent protein transport.

Authors:  H Mori; E J Summer; K Cline
Journal:  FEBS Lett       Date:  2001-07-13       Impact factor: 4.124

4.  The Rieske Fe/S protein of the cytochrome b6/f complex in chloroplasts: missing link in the evolution of protein transport pathways in chloroplasts?

Authors:  S Molik; I Karnauchov; C Weidlich; R G Herrmann; R B Klösgen
Journal:  J Biol Chem       Date:  2001-08-28       Impact factor: 5.157

5.  Functional complexity of the twin-arginine translocase TatC component revealed by site-directed mutagenesis.

Authors:  Grant Buchanan; Erik de Leeuw; Nicola R Stanley; Margaret Wexler; Ben C Berks; Frank Sargent; Tracy Palmer
Journal:  Mol Microbiol       Date:  2002-03       Impact factor: 3.501

6.  The HMMTOP transmembrane topology prediction server.

Authors:  G E Tusnády; I Simon
Journal:  Bioinformatics       Date:  2001-09       Impact factor: 6.937

7.  On filtering false positive transmembrane protein predictions.

Authors:  Miklos Cserzö; Frank Eisenhaber; Birgit Eisenhaber; Istvan Simon
Journal:  Protein Eng       Date:  2002-09

Review 8.  Protein transport in the halophilic archaeon Halobacterium sp. NRC-1: a major role for the twin-arginine translocation pathway?

Authors:  Albert Bolhuis
Journal:  Microbiology       Date:  2002-11       Impact factor: 2.777

9.  Adaptation of protein secretion to extremely high-salt conditions by extensive use of the twin-arginine translocation pathway.

Authors:  R Wesley Rose; Thomas Brüser; Jessica C Kissinger; Mechthild Pohlschröder
Journal:  Mol Microbiol       Date:  2002-08       Impact factor: 3.501

10.  TatC is a specificity determinant for protein secretion via the twin-arginine translocation pathway.

Authors:  J D Jongbloed; U Martin; H Antelmann; M Hecker; H Tjalsma; G Venema; S Bron; J M van Dijl; J Müller
Journal:  J Biol Chem       Date:  2000-12-29       Impact factor: 5.157

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.