Literature DB >> 19907126

Reflections on protein splicing: structures, functions and mechanisms.

Yasuhiro Anraku1, Yoshinori Satow.   

Abstract

Twenty years ago, evidence that one gene produces two enzymes via protein splicing emerged from structural and expression studies of the VMA1 gene in Saccharomyces cerevisiae. VMA1 consists of a single open reading frame and contains two independent genetic information for Vma1p (a catalytic 70-kDa subunit of the vacuolar H(+)-ATPase) and VDE (a 50-kDa DNA endonuclease) as an in-frame spliced insert in the gene. Protein splicing is a posttranslational cellular process, in which an intervening polypeptide termed as the VMA1 intein is self-catalytically excised out from a nascent 120-kDa VMA1 precursor and two flanking polypeptides of the N- and C-exteins are ligated to produce the mature Vma1p. Subsequent studies have demonstrated that protein splicing is not unique to the VMA1 precursor and there are many operons in nature, which implement genetic information editing at protein level. To elucidate its structure-directed chemical mechanisms, a series of biochemical and crystal structural studies has been carried out with the use of various VMA1 recombinants. This article summarizes a VDE-mediated self-catalytic mechanism for protein splicing that is triggered and terminated solely via thiazolidine intermediates with tetrahedral configurations formed within the splicing sites where proton ingress and egress are driven by balanced protonation and deprotonation.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19907126      PMCID: PMC3621562          DOI: 10.2183/pjab.85.409

Source DB:  PubMed          Journal:  Proc Jpn Acad Ser B Phys Biol Sci        ISSN: 0386-2208            Impact factor:   3.493


Introduction

In 1990, the Anraku Laboratory discovered evidence that the Saccharomyces cerevisiae VMA1 gene[1)] expresses the nascent 120-kDa translational product of VMA1 from which the 50-kDa VDE endonuclease (VMA1-derived endonuclease[2)] or the VMA1 intein) is excised out and the N- and C-terminal exteins thus formed are ligated to produce the mature 70-kDa catalytic subunit of the vacuolar H+-ATPase (see Figs. 1 and 2). This posttranslational splicing is rapid and found to occur both in vivo and in vitro systems.[3)] Since the first discovery of the VMA1 intein, more than 100 putative inteins have been identified in eubacteria, archaea, and eukaea species.[4)]
Fig. 1

VMA1 gene and products of the protein splicing.

Fig. 2

Sequences of the VMA1 protein and recombinants. The N- and C-terminal sequences of the recombinants are aligned with the VMA1 sequence of which the residues from Cys284 to Asn737 correspond to the VDE intein. Residues subjected to the mutation studies are indicated with bold characters. The N- and C-terminal side polypeptides are indicated with Nt and Ct, respectively. The replacements in the X10-series VDE recombinants are underlined. Splicing activity and crystallization for each recombinant are indicated with a corresponding sign (+ or −) in the columns of “Splicing” and “Crystal”.

VMA1 gene and products of the protein splicing. Sequences of the VMA1 protein and recombinants. The N- and C-terminal sequences of the recombinants are aligned with the VMA1 sequence of which the residues from Cys284 to Asn737 correspond to the VDE intein. Residues subjected to the mutation studies are indicated with bold characters. The N- and C-terminal side polypeptides are indicated with Nt and Ct, respectively. The replacements in the X10-series VDE recombinants are underlined. Splicing activity and crystallization for each recombinant are indicated with a corresponding sign (+ or −) in the columns of “Splicing” and “Crystal”. The comprehensive studies conducted by the Anraku group have shown that the reaction is folding-dependent and auto-catalytic, and takes place within the single molecule.[5)–7)] The mutagenesis study also indicated that the N-extein residues play unique and important roles in protein splicing.[8)] The Perler, Xu, and Paulus laboratories have demonstrated that the reaction involves several chemical steps including N→S acyl shift, transesterification, succinimide formation, and S→N acyl shift.[9)–11)] A large amount of biochemical and physicochemical information has been accumulated and eventually one recognizes that the splicing reaction is controlled in a rapid, accurate, and solely structure-dependent manner and also exquisitely coordinated so as to yield the correctly spliced protein as the primary product. Side-reactions such as cleavage at either of the two splicing junctions seem to occur only when inteins are inserted into foreign contexts or made to function under nonphysiological conditions.[9),12)–15)] Contemporary reviews on these subjects are available in References.[16)–18)] In 1999, the Anraku group adopted finally a crucial strategy for dissecting chemical mechanisms by which protein splicing regulates its complicated stepwise processes faithfully without any formation of side products. Crystal structural studies were thus implemented after designing splicing-inactive and slowly spliceable VDE recombinant precursors in soluble forms to address a precise chemical mechanism for protein splicing.[19)–21)]

The VMA1 gene

Nucleotide sequencing of VMA1 in a BamHI-XbaI fragment from the yeast hybridizing clone RH151 (nucleotide residues at −585 to +3497) revealed that the gene consists of a single open reading frame capable of encoding a polypeptide of 1,071 amino-acid residues (ref. 1; see also Fig. 2). The most upstream ATG (nucleotide residues at 1–3) among four possible candidates was determined to be the initiation codon based on the predicted N-terminal amino-acid sequence (residues 1–40). In the 5′-flanking region of the sequence, the TATA-like sequence at nucleotide position −79 and the consensus sequences for the yeast transcription initiation site at nucleotide positions −24 and −20 are located. The whole sequence stretches over 3,213 base-pairs without the yeast mRNA splicing consensus sequences. In the 3′-flanking region, a set of sequences homologous to the yeast transcription termination signal is found. The homology search indicated that the N- and C-terminal regions of the nascent precursor (residues at Met1Gly283 and Cys738–Asp1,071, respectively) showed high identity to those of the catalytic subunits of carrot[22)] and Neurospora crassa[23)] vacuolar membrane H+-ATPases. The in-frame insert with 454 amino-acid residues at Cys284Asn737 was determined to become a VDE portion after splicing[9),14)] Based on these pieces of structural information, the VMA1 gene is an ordinary operon containing information for two independent enzymes without any redundancy or shortage of genetic codons and transcribed to yield a single species of mRNA.[1)]

Essential residues for protein splicing

The first mutagenic study[24)] and alignment search for the VMA1 intein family[25)] suggested that the excision takes place at the peptide bonds between the splicing junctions of Gly283-Cys284 and Asp737-Cys738 (see also Fig. 2). This prediction was soon confirmed by the chemical determination of the N- and C-terminal amino-acid residues of VDE.[9),14)] Since the splicing reaction is catalyzed by the in-frame insert, VDE, Kawasaki et al. carried out a systematic search for crucial amino acids involved in the whole process of the reaction.[6)] A random mutagenesis strategy was used to identify such residues throughout the entire region of VDE and its neighboring elements and seven defective-mutants with single amino-acid replacements were mapped systematically in the three core regions: Gly283Val and Cys284Tyr at the N-terminal splicing junction and another two (Asn737Ser and Asn737Lys) are at the C-terminal splicing junction,[6)] in agreement with the previous results.[14),24)] The other three mutations are new and assigned at amino-acid positions His362, Val403, and Ser639 at the third core region. The His362 is conserved in every predicted splicing precursor,[25)] and the His362Leu mutation gives a severer defect in protein splicing though a recombinant VDE constructed from the His362Leu mutant, expressed in E. coli and purified shows normal DNA endonuclease activity. [6)] This finding indicates that active sites for the protein splicing and endonuclease activities reside apart in different regions. The His362 residue contributes to the first cleavage at the N-terminal splicing junction[6)] whereas the Val403 and Ser639 residues do not participate in an excision-ligation reaction but they have structural roles in regulation of the splicing activity.[21)]

Biochemical features of protein splicing

Kawasaki et al. established a folding-dependent in vitro splicing assay with the use of various VDE recombinant precursors and proved that the splicing reaction catalyses the joining of N-extein Gly283 and C-extein Cys738.[5)] In addition, with the use of purified XC-VDE-His (see Fig. 2) as a substrate, the splicing reaction proceeds accurately in the neutral pH even in the presence of strong protease inhibitors.[7)] To examine a possibility whether the reaction occurs within a single molecule or needs two molecules for splicing, two different species of VDE propeptides (MIIYVG-VDE-CGER—C-terminus and N-terminus—MIIYVG-VDE-CGER; see Fig. 2) were mixed in an equal amount for splicing assay. No swapping of the exteins was taken place as no intact 120-kDa VMA1 precursor was detected, supporting that the reaction is an intra-molecular process.[7)] All the results indicate that the splicing reaction involves bond rearrangements rather than bond cleavage followed by resynthesis. In other words, a new chemical principle that has never been uncovered in nature is adopted for the world of protein splicing.[5),7),14)]

Schematic diagram for a mechanism of protein splicing reaction

We have proposed a schematic diagram for the VDE-mediated protein splicing reaction (refs. 21, 26; see Fig. 3). In the initial N→S acyl shift, the thiazolidine intermediate at the Gly283-Cys284 junction is formed and then resolved into the thioester intermediate formed at Cys284. In the next step, the branched intermediate[13)] is formed by the transesterification that involves a nucleophilic attack on the thioester carbon atom by the side chain of conserved Cys738. In the third step, the intervening intein region from Cys284 to Asn737 is excised out by the peptide-bond cleavage that is triggered by a succinimide formation at conserved Asn737. The transient thioester linkage between the exteins undergoes the final S→N acyl rearrangement via the second thiazolidine intermediate.
Fig. 3

Schematic diagram for the mechanism of the protein-splicing reaction. N- and C-extein polypeptides in the N- and C-terminal sides of the VDE intein are indicated by the boxes and their N and C labels, respectively. In the initial N→S acyl shift, the thiazolidine intermediate at the Gly283-Cys284 junction is formed and then resolved into the thioester intermediate formed at Cys284. The nucleophilic attack on the thioester by the conserved Cys738 side-chain causes the transesterification that forms the branched intermediate. In the next step, the intervening region from Cys284 to Asn737 is excised out by the peptide bond cleavage upon the succinimide formation at conserved Asn737. The transient ligation product with the Gly283-Cys738 thioester linkage then undergoes the final S→N acyl shift via the second thiazolidine intermediate.

Schematic diagram for the mechanism of the protein-splicing reaction. N- and C-extein polypeptides in the N- and C-terminal sides of the VDE intein are indicated by the boxes and their N and C labels, respectively. In the initial N→S acyl shift, the thiazolidine intermediate at the Gly283-Cys284 junction is formed and then resolved into the thioester intermediate formed at Cys284. The nucleophilic attack on the thioester by the conserved Cys738 side-chain causes the transesterification that forms the branched intermediate. In the next step, the intervening region from Cys284 to Asn737 is excised out by the peptide bond cleavage upon the succinimide formation at conserved Asn737. The transient ligation product with the Gly283-Cys738 thioester linkage then undergoes the final S→N acyl shift via the second thiazolidine intermediate.

Design of VDE recombinants for structure-directed splicing

The structure of the VDE protein expressed in E. coli,[27)] referred to as generic VDE hereafter, revealed a bilobed domain structure: one domain bears residues requisite for the endonuclease activity, and the other those for the protein splicing. In the protein-splicing domain, the conserved N- and C-terminal residues Cys284, His736 and Asn737, as well as His362 that is essential for the splicing reaction,[6)] are assembled in a location suitable for taking part in the reaction. The unspliced XC-VDE recombinant bearing six N-extein and four C-extein residues was expressed largely as insoluble inclusion bodies.[5)] Hence, Mizutani and Satow designed the soluble X10-series VDE recombinants for crystallographic studies, by adopting the extended segments of 11 N-extein and 10 C-extein residues from the VMA1 sequence (ref. 20; see also Fig. 2). In order to reduce reaction speed in this series, mutations of Cys284Ser and His362Asn are introduced in the X10SNC recombinant; His362Asn and Cys738Ser in the X10CNS; Cys284Ser, His362Asn, and Cys738Ser in the X10SNS; and Cys284Ser, His362Asn, Asn737Ser, and Cys738Ser in the X10SSS. Purified X10-series samples were subjected to protein splicing assay.[20)] The X10SSS product gave a single 53-kDa band in the SDS-PAGE analysis and showed a MALDI-TOF mass of 53,010, in good agreement with the predicted mass of 53,078 for the intact X10SSS. Therefore, the X10SSS retains its N-and C-extein propeptides, and is a splicing-inactive precursor. The X10CNS showed that the product exists mainly as a 50-kDa fraction accompanied by a minor fraction of 53 kDa in the SDS-PAGE.[20)] The purified 53-kDa product was incubated for 12 hrs at 4 °C and ultrafiltrated with 10-kDa cutoff. The molecular mass of the filtrate fraction was determined to be 2,100 with the mass spectrometry, in exact agreement with that predicted for the spliced peptide (SNSDAIIYVG-SGERGNEMAE). The N-terminal amino-acid sequence of the filtrate was determined to be SNSDA, indicating that the protein splicing certainly but slowly occurred in the X10CNS. The X10SNC product was also observed as 50-kDa and 53-kDa bands in the SDS-PAGE. The X10SNS product was purified as a single-band protein of 53 kDa, whereas the 50-kDa band appeared after the crystallization.

Three-dimensional structural analyses

Crystals of the X10SSS and X10SNS recombinants were prepared at 20 °C within a few days after the purification, and grew to mature sizes within a week.[20),21)] The X10SSS crystals were dissolved into a solution, and the resultant solution gave again the 53-kDa band in the SDS-PAGE. The X10SNS crystals were obtained after the crystallization for 3 and 7 days at 20 °C, and the dissolved solutions gave the 53-kDa and 50-kDa bands. The amount of the 50-kDa band increased with time, indicating that the splicing reaction proceeded slowly even in the crystals. [21),26)] The X10SNS protein was aggregated as an intact precursor protein into the crystalline lattice isomorphous to the X10SSS crystal, and then underwent the splicing process, releasing the spliced peptide. The three-dimensional structure of the X10SSS crystal flash-cooled to 110 K in diffraction experiments was determined at 2.1 Å resolution (Protein Data Bank accession code of 1JVA, ref. 20). The X10SNS structure from the data obtained at 31 hrs after the crystallization setup was similarly obtained at 2.9 Å resolution (accession code 1UM2, ref. 21). The structures of two molecules A and B in the asymmetric unit in the space group P1 are nearly identical as shown in Fig. 4, consisting of the domain I (residues 463–699), the DNA recognition subdomain of the domain II (residues 369–437), and the splicing subdomain of the domain II (residues 284–368, 438–462, and 700–737). The residues at the C-terminal-side ends of helices α4 (486–501) and α7 (595–609) bear the endonuclease activity, and supposedly retain the domain I structure requisite for the endonuclease activity. Between the X10SSS, X10SNS, and generic VDE[27)] structures, the splicing junctions give marginal differences. These indicate that the overall structures of the splicing junctions are virtually unalterable, even in the course of the splicing reaction, by the N- and C-extein extension residues (see Figs. 4, 5 and 6). The splicing junctions of the X10SSS and X10SNS are superposed for comparison in Fig. 7.
Fig. 4

Structure of the X10SSS precursor protein. X10SSS molecules A and B are viewed along the non-crystallographic twofold axis. The C- and N-extein residues are illustrated as ball-and-stick models. Molecule B is highlighted by colors; red, the N-extein residues; green, the splicing sub-domain P of the domain II; light green, the DNA-recognition subdomain D of the domain II; blue, the domain I; yellow, the C-extein residues.

Fig. 5

Stereo drawing of the splicing site of the X10SSS precursor. The N- and C-extein residues are drawn in red and yellow, respectively, and their linkages to the intein in green are indicated by blue bonds. Hydrogen bonds involved with the splicing junctions are indicated by dotted light blue lines, with their donor-acceptor distances in Å.

Fig. 6

Stereo view of the splicing site of the X10SNS spliced product molecules. The electron-density map contoured at the 3σ level by omitting 281–283 and 738–741 residues is superposed on the extein residues of molecule B. The N- and C-extein residues are ligated by the Gly283-Ser738 peptide bond.

Fig. 7

Stereo drawing of the superposition of the splicing sites of the X10SSS precursor and X10SNS spliced product molecules. The X10SSS residues are drawn in grey. The N- and C-extein residues of the X10SNS products are drawn in red and yellow, respectively, their linkage at Gly283-Ser738 in blue, and the intein residues in green.

Structure of the X10SSS precursor protein. X10SSS molecules A and B are viewed along the non-crystallographic twofold axis. The C- and N-extein residues are illustrated as ball-and-stick models. Molecule B is highlighted by colors; red, the N-extein residues; green, the splicing sub-domain P of the domain II; light green, the DNA-recognition subdomain D of the domain II; blue, the domain I; yellow, the C-extein residues. Stereo drawing of the splicing site of the X10SSS precursor. The N- and C-extein residues are drawn in red and yellow, respectively, and their linkages to the intein in green are indicated by blue bonds. Hydrogen bonds involved with the splicing junctions are indicated by dotted light blue lines, with their donor-acceptor distances in Å. Stereo view of the splicing site of the X10SNS spliced product molecules. The electron-density map contoured at the 3σ level by omitting 281–283 and 738–741 residues is superposed on the extein residues of molecule B. The N- and C-extein residues are ligated by the Gly283-Ser738 peptide bond. Stereo drawing of the superposition of the splicing sites of the X10SSS precursor and X10SNS spliced product molecules. The X10SSS residues are drawn in grey. The N- and C-extein residues of the X10SNS products are drawn in red and yellow, respectively, their linkage at Gly283-Ser738 in blue, and the intein residues in green.

Precursor structure of the splicing junctions

The protein-splicing site is situated at the bottom of the β-sheet region of the domain II (Fig. 4). In the X10SSS precursor, the N-extein residues of Val282 and Gly283 are in a β-strand conformation, and located in the vicinity of Asn362 and Ser738 as expected (Fig. 5). The Val282 O atom is hydrogen-bonded to the Asn359 side-chain. Atomic contacts are observed among hydrophobic residues Ile280, Val282, Pro324, and Tyr714. The scissile peptide bond between Gly283 and Ser284 is in an ordinary trans conformation, and its Ser284 N atom is hydrogen-bonded also to the Asn359 side-chain. This trans conformation was also observed in the structure of the XA-VDE[19)] that bears an Ala284 residue and extra N-extein residues MIIYVG adopted from the VMA1 sequence, and also in the VMA29 VDE structure[28)] that has an Ala284 residue, extra N-terminal residues MKAEEGKLEG from the maltose binding protein, and extra C-extein residues CGER from the VMA1 sequence. The main-chain conformation of Ser738 in the C-extein is stabilized by the hydrogen bonds between its N and Ile717 O atoms and between its O and Thr718 Oγ1 atoms (Fig. 5). Residues 739–741 are in the β-strand conformation, forming a short anti-parallel β-sheet along with the N-extein residues 281–283. The formation of this β-sheet brings the residues of the N- and C-terminal junctions in close proximity; bringing the Gly283 carbonyl O atom into contact with the Ser738 side-chain Oγ atom. The side chains of Glu740 and Arg741 are in electrostatic interaction with residues Lys322 and Glu443, respectively. These Glu443 and Arg741 side-chains are clustering together with charged residues of Asp399, Arg401, Arg445, and Asp446. The mutagenesis study[8)] indicated that the N-extein residues of Ile279, Ile280, and Tyr281 possibly interact with the residues Val733, Val734, and Val735 that precede the C-terminal junction. The strands Ile279Tyr281 and Val733Val735 are not directly involved in atomic contact, but brought into a cluster together with hydrophobic residues Pro324, Ile717, and Phe727. This hydrophobic cluster contributes to the conformational construction of these strands.

Ligated structure of the splicing junctions

The ligated extein segments in the splicing junctions (residues 282–285, 359–362, and 736–739) are clearly delineated in the X10SNS structure as shown in Fig. 6.[21)] Asn737 of the excised intein was reported to turn up in an approximately equimolar amount of asparagine and succinimide forms.[10)] The Asn737 structure is interpretable either as an asparagine or as a succinimide, but an asparagine is better fitted into the density (Fig. 6). The mutated His362Asn has its side-chain conformation almost identical to that of the His362 side-chain of the generic VDE.[27)] The Gly283, Ser738, and Gly739 region of the ligated segment is in an atypical turn conformation (Fig. 6). This turn structure involves a hydrogen bond between the Gly283 N and Gly739 O atoms as in the X10SSS structure. The Ser738 main-chain is stabilized also by the hydrogen bond to the Thr718 Oγ1. Residues 739–741 form a short antiparallel β-sheet along with the N-extein residues 282 and 283 (Fig. 6). This β-sheet is originated from the Gly283-Ser738 linkage formed by the S→N acyl shift. The side chains of His736 and Asn737 are still in the vicinity of the 283–738 linkage; the Gly283 O atom close to the Asn737 O, and the His736 Nδ1 atom to the Asn737 O (see Fig. 7).

Comparison of the splicing junctions

The X10-series recombinants show notable large differences in the structure of the N- and C-extein residues, 280–283 and 739–741, to the VMA29 VDE structure[28)] in which Gly283 with the bound zinc ion near Cys 738 shows a distorted bond angle at the Cα atom and a distorted inverse-γγ turn at the scissile peptide bond to Ala284. It is reported that the VMA1 product splices correctly even in the presence of the chelating reagent,[5)–7)] and that the addition of zinc ion rather inhibits the protein splicing in the case of the RecA intein.[29)] Therefore, the X10SSS structure with the β-strand-conformed Ser284 side-chain in contact with the Ser738 side-chain is more suitable for understanding the splicing reaction. As for the structural analyses of the splicing reaction, the crystal structures were reported for Mycobacterium xenopi GyrA[30)] and Hedgehog auto-processing domain.[31)] The VDE residues Cys284, His736, Asn737, and Cys738 that have been reported to be essential for the splicing reaction,[6)] are super-posable on the corresponding residues of the GyrA and Hedgehog proteins, having common side-chain conformations.

Scissile bond structure triggering the first reaction step

The first step involves the chemically unfavorable N→S acyl shift that breaks an amide bond and forms a high-energy thioester linkage at the N-terminal scissile bond, and hence raises a still unresolved issue of driving forces. The GyrA crystal structure contains an extra Ala(−1) residue introduced immediately precedent to the conserved N-terminal Ser1.[30)] The peptide bond between Ala(−1) and Ser1, however, is in an energetically unfavorable cis conformation, suggesting that the cis conformation would destabilize the bond and hence generate a driving force for the first step of the reaction. The NMR structure of a splicing-active precursor of the GyrA intein showed that the amide of the Phe(−1)-Cys1 scissile bond is highly polarized and possibly in a distorted state, providing additional support for the idea that the first step in protein splicing is facilitated by destabilizing the scissile bond.[32)] In the X10SSS structure, the trans conformation for the Gly283-Ser284 scissile bond is restrained by the contact between the Gly283 main-chain O and Ser738 side-chain Oγ atoms, and also by the hydrogen bond between the Ser284 main-chain N and Asn359 side-chain Oδ atoms (Fig. 5). This contact is well suited for the formation of the thiazolidine intermediate in the N→S acyl shift (see Fig. 8b). Therefore, the highly distorted conformation seems to be not essential for the reaction in the VMA1 translational protein that has the small and flexible Gly283 residue in the scissile bond, although it is not precluded that destabilizing conformations indiscernible in the X10SSS structure might transiently take place in the actual reaction.
Fig. 8

Chemical mechanism for the protein-splicing reaction via thiazolidine intermediates. Broken lines represent possible hydrogen bonds. (a) The N→S acyl shift is initiated by the deprotonation of the Cys284 Sγ atom that is converted to a nucleophile by the Gly283 O atom. (b) The thiazolidine intermediate contains a tetrahedral configuration at the Gly283 C atom. The imidazole ring of His362 is in a location suitable for protonating the imine nitrogen in the thiazolidine ring. (c) The N→S acyl shift via the thiazolidine intermediate forms the thioester intermediate that undergoes transesterification by a nucleophilic attack of the Cys738 Sγ atom. (d) The tetrahedral intermediate of thioester transfer has the Gly283-Cys738 linkage. (e) The branched intermediate with a thioester linkage between the N- and C-exteins is formed by resolving the tetrahedral intermediate. (f) The Asn737-Cys738 peptide bond is cleaved upon the Asn737 cyclization. The ring structure in the succinimide intermediate is then hydrolyzed by a water molecule. (g) The thus-formed terminal carboxyl group of Asn737 in the vicinity of the His736 side-chain causes the deprotonation from the terminal Cys738 N atom, and triggers the S→N acyl shift at the thioester linkage in the second thioester intermediate. (h) The oxyanion at the tetrahedral Gly283 C atom in the second thiazolidine intermediate is stabilized by the N-terminal Cys284 N atom. The ring structure is resolved through the protonation of the Cys284 N atom. (i) The S→N acyl shift via the second thiazolidine intermediate forms the final products, the Cys284–Asn737 intein polypeptide and the N- and C-extein-ligated polypeptide.

Chemical mechanism for the protein-splicing reaction via thiazolidine intermediates. Broken lines represent possible hydrogen bonds. (a) The N→S acyl shift is initiated by the deprotonation of the Cys284 Sγ atom that is converted to a nucleophile by the Gly283 O atom. (b) The thiazolidine intermediate contains a tetrahedral configuration at the Gly283 C atom. The imidazole ring of His362 is in a location suitable for protonating the imine nitrogen in the thiazolidine ring. (c) The N→S acyl shift via the thiazolidine intermediate forms the thioester intermediate that undergoes transesterification by a nucleophilic attack of the Cys738 Sγ atom. (d) The tetrahedral intermediate of thioester transfer has the Gly283-Cys738 linkage. (e) The branched intermediate with a thioester linkage between the N- and C-exteins is formed by resolving the tetrahedral intermediate. (f) The Asn737-Cys738 peptide bond is cleaved upon the Asn737 cyclization. The ring structure in the succinimide intermediate is then hydrolyzed by a water molecule. (g) The thus-formed terminal carboxyl group of Asn737 in the vicinity of the His736 side-chain causes the deprotonation from the terminal Cys738 N atom, and triggers the S→N acyl shift at the thioester linkage in the second thioester intermediate. (h) The oxyanion at the tetrahedral Gly283 C atom in the second thiazolidine intermediate is stabilized by the N-terminal Cys284 N atom. The ring structure is resolved through the protonation of the Cys284 N atom. (i) The S→N acyl shift via the second thiazolidine intermediate forms the final products, the Cys284Asn737 intein polypeptide and the N- and C-extein-ligated polypeptide.

Structural roles of Val403 and Ser639 in protein splicing

Splicing-defective mutants of the GST-Vma1 fusion protein, with a replacement of either Gly283Val, Cys284Tyr, His362Leu, Val403Asp, Ser639Pro, Asn737Ser, or Asn737Lys, have been identified from the random mutagenesis study.[6)] The crystal structures clearly indicate that the replacements in the key residues of Cys284, His362, and Asn737 have direct effects on the splicing reaction (see Fig. 7). The Gly283Val replacement causes a steric hindrance with the catalytic His362 side-chain. The hydrophobic side-chain of Val403 located at about 19 Å from Cys284 is in interaction with that of Ile442 which is located in an anti-parallel β-sheet formed by the Tyr437Glu443 and Glu363Pro369 strands. The Val403Asp replacement brings a polar side-chain into this hydrophobic environment and affects the spatial arrangement of Ile442 which is in interaction with His362. The Ser639Pro replacement abolishes a hydrogen bond between the Ser639 N and Ser635 O atoms, and disrupts the helix formation at His626Leu640, hindering the folding of the domain II.

Chemical mechanism of the splicing reaction

Based on the crystal structures of the X10SSS precursor and X10SNS spliced products, we proposed the chemical mechanism of the splicing reaction, [20),21),26)] which is summarized in Fig. 8. The X10-series VDE recombinants show that the activity of the splicing is retained by the Cys284Ser and Cys738Ser replacements, as clearly indicated by the X10SNS spliced products (Fig. 6).

Initiation of the N→S acyl shift

In the close proximity of the Ser284 Oγ atom of the X10SSS precursor, the Gly283 O and Ser738 Oγ atoms are located (Fig. 5). The N→S acyl shift was reported even for the recombinant with the Cys738Ala replacement.[10)] This indicates that the Gly283 O atom as an anionic form, rather than the Cys738 Sγ atom, is able to abstract a proton from the Cys284 side-chain and activate its Sγ atom in the initial step of the N→S acyl shift as exemplified in Fig. 8a.

Thiazolidine intermediate

The resultant Cys284thiolate nucleophilically attacks on the Gly283 C atom, forming an intermediate which has a five-membered ring structure, called as thiazolidine, at the Gly283-Cys284 junction (Fig. 8b). The thiazolidine ring from the reported crystal structure of its derivative[33)] is nicely superposable on the junction. The Gly283 C atom embedded in the thiazolidine ring would have a tetrahedral configuration with a hydroxy Gly283 O atom which is protonated upon the Cys284 Sγ activation. When the Cys284 Sγ atom is activated by the other catalytic atom such as the Cys738 Sγ atom, the Gly283 C atom would have an oxyanion Gly283 O atom instead of the hydroxy group, as discussed by Paulus.[11)] The His362 side-chain is located in the vicinity of the 283–284 peptide bond. Though His362 was identified by the leucine replacement as the essential residue for the splicing reaction[6)] the X10CNS bearing the His362Asn replacement is proved to retain the splicing activity.[21)] When an imidazole ring is substituted for the amide group of the X10SNS Asn362 side-chain (Figs. 5 and 7), the resultant His362 Nδ1 atom comes close to the Ser284 N atom. Therefore, the imidazole ring of His362 is so located as to stabilize the hydroxy or oxyanion group of the thiazolidine intermediate and then as to protonate the Cys284 N atom (Fig. 8b).

Thioester intermediate

The protonation to the Cys284 N promotes the breakdown of the thiazolidine ring possibly stabilized by the His362 side-chain, and the thiazolidine intermediate is resolved into the Cys284 amino group and the Gly283-Cys284 thioester linkage toward the thioester intermediate (Fig. 8c). The reaction steps proposed here for this N→S acyl shift are entirely driven by protonation and deprotonation, in which proton ingress and egress are balanced within the splicing site.

Tetrahedral intermediate via transesterication

After the formation of the thioester intermediate, its acyl group is to be directed to the Cys738 side-chain, and its linkage with the Cys284 side-chain is to be transferred to the Cys738 sidechain. In this transesterification step, the protonated Cys284 amino-group that is formed by the opening of the thiazolidine ring activates the Cys738 Sγ atom into another nucleophile, which then attacks on the Gly283 thioester carbonyl carbon. Since the X10-series recombinants supposedly retain common precursory and intermediate structures, spatial arrangements of the splicing-site residues are essential also for the transesterification. The atomic contact between the Oγ atoms of the replaced Ser284 and Ser738 provides a structural basis for chemical activity of the Cys738 thiol side-chain in the course of the transesterification that forms the second tetrahedral configuration of an oxybissulfanyl C atom at Gly283 (Fig. 8d).

Branched intermediate and succinimide formation

The resultant tetrahedral intermediate of the thioester transfer is resolved into the Cys284 thiolate and the branched intermediate[13)] in which the N-extein acyl group is bonded to the Cys738 side-chain with the thioester linkage (Fig. 8e). The extein regions of this branched intermediate are then cleaved off from the intein region upon the succinimide formation at Asn737 (Fig. 8f).

Final S→N acyl shift via the second thioester and thiazolidine intermediates

In the final S→N acyl shift step of the protein splicing, the thioester structure from the branched intermediate that is resolved by the hydrolysis of the succinimide (Fig. 8g) is further transformed into a peptide-bond linkage between Gly283 and Cys738. As the thioester bond thus formed is chemically unstable, the S→N acyl shift reaction would proceed more rapidly in the hydrophobic environment of the splicing site, and the spontaneous hydrolysis be prevented by the balanced deprotonation and protonation as in the initial N→S acyl shift. In the beginning of the S→N acyl shift, the neutral Cys738 amino-group formed by the succinimide hydrolysis donates its proton to the carbonyl oxygen of Asn737 in the close vicinity of the side chain of essential His736, and the thus-formed Cys738 N nucleophile attacks on the Gly283 C atom. The resultant intermediate with the tetrahedral Gly283 C (Fig. 8h) has again a thiazolidine ring structure as is formed in the initial N→S acyl shift. The Ser284 N atom of the X10SNS product is closely located to the Gly283 O of the N-extein segment so that an oxyanion of this second thiazolidine intermediate is stabilized by the Cys284 N. The thiazolidine intermediate is finally resolved into a Cys738 thiol and the peptide bond between the N- and C-exteins, releasing the splicing product of the ligated exteins (Figs. 6 and 8i). Ludwig et al.[34)] have recently detected a thiazoline ring formation via a thiazolidine intermediate when the splicing assay using an artificial semi-synthetic protein trans-splicing system was carried out with a mutated intein in which the thioester formation step was blocked. This detection justifies our proposal stating the thiazolidine formation in the course of the initial N→S acyl shift (Fig. 8b). The final S→N acyl shift also involves the thiazolidine formation (Fig. 8h), and is the reverse reaction of the justified N→S acyl shift step.

Conclusion

Protein splicing is a novel biological reaction that implements the editing of genetic information transfer at a protein level and proceeds autocatalytically in a single molecule of precursor polypeptides. Based on the crystal structures we proposed and highlighted that the formation of the thiazolidine intermediate is a key step for the first and final acyl shifts, both of which are entirely driven by protonation and deprotonation within the splicing sites. This proposal has directed further theoretical studies with the use of ab initio quantum molecular dynamics. The work conducted on protein splicing has already opened a new era for investigating the origin and evolutional past of the selfish genetic element VDE in chromosomes as well as for providing novel technologies in protein chemistry.[26)] We conclude that protein splicing requires catalytic residues Cys284, His362 and Asn737 as the cis-acting elements whereas Gly283 and Cys738 as the trans effectors. In addition, the short antiparallel β-sheet formed between the N-extein residues 281–283 and the C-extein residues 739–741 plays a key structural role in stabilizing the splicing junctions. Val403 is located at the long distance from Cys284 in the splicing site, and its nonconservative mutation to aspartate hinders protein splicing by causing a structural perturbation on the catalytic residues. The splicing mechanism functions only when these cis elements in the parasitic intein domain combine with the trans effectors in the host extein domains of the single spliced propeptide. Our work establishes the first milestone of thiazolidine chemistry for the world of protein splicing.
  31 in total

1.  Structural insights into the protein splicing mechanism of PI-SceI.

Authors:  B W Poland; M Q Xu; F A Quiocho
Journal:  J Biol Chem       Date:  2000-06-02       Impact factor: 5.157

Review 2.  Protein splicing and related forms of protein autoprocessing.

Authors:  H Paulus
Journal:  Annu Rev Biochem       Date:  2000       Impact factor: 23.643

3.  Folding-dependent in vitro protein splicing of the Saccharomyces cerevisiae VMA1 protozyme.

Authors:  M Kawasaki; S Makino; H Matsuzawa; Y Satow; Y Ohya; Y Anraku
Journal:  Biochem Biophys Res Commun       Date:  1996-05-24       Impact factor: 3.575

4.  Protein splicing in the yeast Vma1 protozyme: evidence for an intramolecular reaction.

Authors:  M Kawasaki; Y Satow; Y Ohya; Y Anraku
Journal:  FEBS Lett       Date:  1997-08-04       Impact factor: 4.124

5.  Conserved sequence features of inteins (protein introns) and their use in identifying new inteins and related proteins.

Authors:  S Pietrokovski
Journal:  Protein Sci       Date:  1994-12       Impact factor: 6.725

6.  In vitro protein splicing of purified precursor and the identification of a branched intermediate.

Authors:  M Q Xu; M W Southworth; F B Mersha; L J Hornstra; F B Perler
Journal:  Cell       Date:  1993-12-31       Impact factor: 41.582

Review 7.  Protein splicing: its chemistry and biology.

Authors:  Y Anraku
Journal:  Genes Cells       Date:  1997-06       Impact factor: 1.891

8.  Protein splicing involving the Saccharomyces cerevisiae VMA intein. The steps in the splicing pathway, side reactions leading to protein cleavage, and establishment of an in vitro splicing system.

Authors:  S Chong; Y Shao; H Paulus; J Benner; F B Perler; M Q Xu
Journal:  J Biol Chem       Date:  1996-09-06       Impact factor: 5.157

9.  Homing of a DNA endonuclease gene by meiotic gene conversion in Saccharomyces cerevisiae.

Authors:  F S Gimble; J Thorner
Journal:  Nature       Date:  1992-05-28       Impact factor: 49.962

10.  Isolation of genes encoding the Neurospora vacuolar ATPase. Analysis of vma-1 encoding the 67-kDa subunit reveals homology to other ATPases.

Authors:  E J Bowman; K Tenney; B J Bowman
Journal:  J Biol Chem       Date:  1988-10-05       Impact factor: 5.157

View more
  4 in total

1.  A conserved threonine spring-loads precursor for intein splicing.

Authors:  Albert K Dearden; Brian Callahan; Patrick Van Roey; Zhong Li; Utsav Kumar; Marlene Belfort; Saroj K Nayak
Journal:  Protein Sci       Date:  2013-03-26       Impact factor: 6.725

2.  A unique self-truncation of bacterial GH5 endoglucanases leads to enhanced activity and thermostability.

Authors:  Mei-Huey Wu; Mu-Rong Kao; Chen-Wei Li; Su-May Yu; Tuan-Hua David Ho
Journal:  BMC Biol       Date:  2022-06-09       Impact factor: 7.364

3.  Antifragility and Tinkering in Biology (and in Business) Flexibility Provides an Efficient Epigenetic Way to Manage Risk.

Authors:  Antoine Danchin; Philippe M Binder; Stanislas Noria
Journal:  Genes (Basel)       Date:  2011-11-29       Impact factor: 4.096

4.  Discovery and Characterization of an Aberrant Small Form of Glycoprotein I of Herpes Simplex Virus Type I in Cell Culture.

Authors:  Xixi Gui; Wuchao Zhang; Peng Gao; Yongning Zhang; Lei Zhou; Xinna Ge; Xin Guo; John W Wills; Jun Han; Hanchun Yang
Journal:  Microbiol Spectr       Date:  2022-03-29
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.