Literature DB >> 30211279

Construction, expression, and characterization of AG11-843 and AG11-1581.

Xie Yan1, Yan-Tao Yang1, Wei Shi1, Xia Ai1, Xu-Guang Xi1,2.   

Abstract

This data article contains descriptive and experimental data on the construction, expression, and simple characterization of AG11-843 and AG11-1581. AG1 is an important member of the DUF1220 protein family. It׳s hard to get the recombinant protein because of its DNA sequence. The DNA sequence were optimized by proper design, cloned by overlap PCR and constructed into expression vector. AG11-843 and AG11-1581.were over expressed in Escherichia coli, purified and analyzed by dynamic light scattering and gel filtration analysis. An effective technique is provided to construct and express proteins with complicated sequences.

Entities:  

Keywords:  AG1; DUF1220; Gene synthesis; PCR; Plasmid construction; Recombinant protein expression

Year:  2018        PMID: 30211279      PMCID: PMC6134160          DOI: 10.1016/j.dib.2018.08.094

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table Value of the data A method of amino acid and DNA sequence optimization, synthesis, recombinant protein expression for proteins with complicated sequences is provided. Sequence analysis and synonymous codon substitution was used for sequence optimization. Overlap-PCR was used for the sequence synthesis. The recombinant proteins AG11–843 and AG11–1581 were expressed and purified for further analysis. The existence state in solution of AG11-843 and AG11-1581 were analyzed by DLS and gel filtration.

Data

There were several repeat sequences in AG1 coding sequence which is the obstacle for cloning (Fig. 1). Synonymous codon substitution was used to optimize the sequence and then the sequences were cloned into expression vectors (Fig. 2, Table 1). The recombinant proteins were purified (Fig. 3) and analyzed by DLS and gel filtration (Fig. 4).
Fig. 1

AG1 amino acid sequence comparison. Internal highly conserved sequence repeats are highlighted for each group. Cyan residues: 1–38, 245–281, and 489–525 (CRSA), which share 100% identity; Red residues: 38–112 and 282–356 (CRS1), which share 99% identity; Blue residues: 113–187 and 357–431 (CRS2), which share 100% identity; Green residues: 188–244 and 432–488 (CRS3), which share 100% identity. Furthermore, CRS1 and CRS2 share 75.7% identity, CRS2 and CRS3 share 58.7% identity, and CRS1 and CRS3 share 61.3% identity.

Fig. 2

The construction process used for the pET-15b-sumo-AG11–843 and pET-15b-sumo-AG11–1581 vectors. (A) The strategy for the synthesis and assembly of the AG11–450 fragment using overlap extension PCR. The AG11–450 PCR product was cloned into pMD19-T (Fig. S1A). (B) The strategy for the synthesis and assembly of the AG1450–843 fragment using overlap extension PCR, followed by cloning of the fragment into pMD19-T (Fig. S1B). (C) The AG11–450 and AG1450–843 fragments were mixed with pET-15b-sumo vector and sealed together via ligation of their restriction sites to form the recombinant expression plasmid (pET-15b-sumo-AG11–843). (D) Enzymatic digestion of the recombinant vector (electrophoresed on a 1% w/v agarose gel). Lane 1: pET-15b-sumo-AG11–843 digested with EcoRI and XhoI; Lane 2: 5000 bp DNA size marker. (E) The AG1450–843 and AG11–450 fragments were used to assemble the AG11–843 fragment in the third, fourth, and fifth PCR reactions. (F) The pET-15b-sumo-AG11–843 plasmid was used to assemble the template for the sixth PCR reaction, obtaining the AG1917–1581 and AG1185–843 fragments. (G) The AG11–450, AG1450–917, and AG1917-1581 were mixed with pET-15b-sumo vector and sealed via ligation of their restriction sites, to form the recombinant plasmid (pET-15b-sumo-AG11–1581). (H) Enzymatic digestion of the recombinant vector. Lane 1: pET-15b-sumo-AG11–1581 digested with EcoRI and XhoI; Lane 2: 5000 bp DNA size marker.

Table 1

List of synthesis primers for AG11–843 and AG11–1581 construction.

Primer namePrimer sequence 5′→3′
DUF1AAACACGTTGGTTTCTCTCTGGACGTTGGTGAAATCGAGAAGAAAGGTAAAGGTAAGAA
DUF2ACCACGACGACGTTCCTTCTTAGAACGACGACCACGACGCTTCTTACCTTTACCTTTCT
DUF3GAAGGAACGTCGTCGTGGTCGTAAAGAAGGTGAAGAAGACCAGAACCCGCCGTGCCCGC
DUF4ACTTCCGGACCCTTCTCGTCCAGCAGTTCACGAGACAGACGCGGGCACGGCGGGTTCTG
DUF5ACGAGAAGGGTCCGGAAGTTCTGCAGGACTCTCTGGACCGTTCTTACTCTACCCCGTCT
DUF6AACGGTACGGCTGGCAAGAGTCGGTCAGTTCCAGGCAACCAGACGGGGTAGAGTAAGAA
DUF7TCTTGCCAGCCGTACCGTTCTGCGTTCTACGTTCTGGAACAGCAGCGTGTTGGTCTGGC
DUF8TTCAACTTCCTGGTACTTCTCGATTTCGTCCATGTCAACCGCCAGACCAACACGCTGCT
DUF9GAAGTACCAGGAAGTTGAAGAAGACCAGGACCCGTCTTGCCCACGCTTATCGCGCGAAT
DUF10AGTGAATCTTGTAGCACCTCAGGCTCTTTTTCATCAAGCAATTCGCGCGATAAGCGTGG
DUF11AGGTGCTACAAGATTCACTTGATCGGTGTTATTCAACACCCTCAGGATACCTGGAACTG
DUF12GAGAGTAAACCGCAGAAGAGTACGGCTGACCCAGGTCCGGCAGTTCCAGGTATCCTGAG
DUF13TCTTCTGCGGTTTACTCTCTGGAAGAACAGTACCTGGGTCTGGCGCTGGACGTTGACCG
DUF14CGGACCCTGGTCTTCTTCTTCTTCCTGGTCCTTCTTGGTACGGTCAACGTCCAGCGCCA
DUF15AAGAAGAAGACCAGGGTCCGCCATGCCCCAGGCTCAGCAGGGAGCTGCTGGAGGTAGTA
DUF16AGGGAGCTGCTGGAGGTAGTAGAGCCTGAAGTCTTGCAGGACTCACTGGATAGATGTTA
DUF17ACTCACTGGATAGATGTTATTCAACTCCTTCCAGTTGTCTTGAACAGCCTGACTCCT
DUF18TCTTGAACAGCCTGACTCCTGCCAGCCCTATGGAAGTTCCTTTTATGCATTGGAGGAAA
DUF19AGTTCCTTTTATGCATTGGAGGAAAAACACGTTGGTTTCTCTCTGGACGTTGGTGAAA
Kpn1-FATTCAACACCCTCAGGGTACCTGGAACTGCCGGA
Kpn1-RTCCGGCAGTTCCAGGTACCCTGAGGGTGTTGAAT
DUF-FGAATTCCATATGAAACACGTTGGTTTCTCTCTGGACGTT
DUF-RGCCTCGAGTTAGTCGACACCTTCTTTACGACCACGACGACGTTCCTTCT
Fig. 3

SDS-PAGE analysis of the purified AG11–843 and AG11–1581 recombinant proteins using a 10% polyacrylamide gel. 1: AG11–843 recombinant protein; 2: undigested AG11–843 recombinant protein containing sumo tag; 4: undigested AG11–1581 recombinant protein containing sumo tag; 5: AG11–1581 recombinant protein; M: standard protein size marker.

Fig. 4

The quaternary structure of the recombinant AG11–843 and AG11–1581 proteins in solution. (A) Dynamic light scattering analysis of recombinant AG11–843. (B) Dynamic light scattering analysis of recombinant AG11–1581. (C) Size exclusion chromatography analysis of AG11–843 and AG11–1581 using a Superdex 200 10/300 GL column.

AG1 amino acid sequence comparison. Internal highly conserved sequence repeats are highlighted for each group. Cyan residues: 1–38, 245–281, and 489–525 (CRSA), which share 100% identity; Red residues: 38–112 and 282–356 (CRS1), which share 99% identity; Blue residues: 113–187 and 357–431 (CRS2), which share 100% identity; Green residues: 188–244 and 432–488 (CRS3), which share 100% identity. Furthermore, CRS1 and CRS2 share 75.7% identity, CRS2 and CRS3 share 58.7% identity, and CRS1 and CRS3 share 61.3% identity. The construction process used for the pET-15b-sumo-AG11–843 and pET-15b-sumo-AG11–1581 vectors. (A) The strategy for the synthesis and assembly of the AG11–450 fragment using overlap extension PCR. The AG11–450 PCR product was cloned into pMD19-T (Fig. S1A). (B) The strategy for the synthesis and assembly of the AG1450–843 fragment using overlap extension PCR, followed by cloning of the fragment into pMD19-T (Fig. S1B). (C) The AG11–450 and AG1450–843 fragments were mixed with pET-15b-sumo vector and sealed together via ligation of their restriction sites to form the recombinant expression plasmid (pET-15b-sumo-AG11–843). (D) Enzymatic digestion of the recombinant vector (electrophoresed on a 1% w/v agarose gel). Lane 1: pET-15b-sumo-AG11–843 digested with EcoRI and XhoI; Lane 2: 5000 bp DNA size marker. (E) The AG1450–843 and AG11–450 fragments were used to assemble the AG11–843 fragment in the third, fourth, and fifth PCR reactions. (F) The pET-15b-sumo-AG11–843 plasmid was used to assemble the template for the sixth PCR reaction, obtaining the AG1917–1581 and AG1185–843 fragments. (G) The AG11–450, AG1450–917, and AG1917-1581 were mixed with pET-15b-sumo vector and sealed via ligation of their restriction sites, to form the recombinant plasmid (pET-15b-sumo-AG11–1581). (H) Enzymatic digestion of the recombinant vector. Lane 1: pET-15b-sumo-AG11–1581 digested with EcoRI and XhoI; Lane 2: 5000 bp DNA size marker. List of synthesis primers for AG11–843 and AG11–1581 construction. SDS-PAGE analysis of the purified AG11–843 and AG11–1581 recombinant proteins using a 10% polyacrylamide gel. 1: AG11–843 recombinant protein; 2: undigested AG11–843 recombinant protein containing sumo tag; 4: undigested AG11–1581 recombinant protein containing sumo tag; 5: AG11–1581 recombinant protein; M: standard protein size marker. The quaternary structure of the recombinant AG11–843 and AG11–1581 proteins in solution. (A) Dynamic light scattering analysis of recombinant AG11–843. (B) Dynamic light scattering analysis of recombinant AG11–1581. (C) Size exclusion chromatography analysis of AG11–843 and AG11–1581 using a Superdex 200 10/300 GL column.

Experimental design, materials, and methods

Gene sequence analysis

The gene synthesis product in this study is that of the 1862 bp human AG1 gene (GenBank accession no: AF380580.1) which encodes the 615 amino acid DUF1220 AG1 protein fragment (http://www.uniprot.org/uniprot/Q8IX72). The AG1 gene and amino acid sequences are highly repetitive (Fig. 1). In order to increase the speed and efficiency of gene synthesis, we modified the AG1 nucleotide sequence. However, these changes did not affect the amino acid sequence. This codon optimization allows us to exploit the frequently used codons in Escherichia coli to obtain high level gene expression. Moreover, in order to improve the efficiency of gene transcription and RNA stability, the GC content of the synthetic gene was held at 52.9%. (Table 2, Table 3).
Table 2

Purify analysis of AG11–843 and AG11–1581.

StepProtein nameTotal proteina(mg)Target protein(mg)Purityb(%)Yield(%)
Cell lysatecAG11–8431012
AG11–1581896
SupernatantAG11–843209.2100
AG11–1581182.3100
Ni2+ column elutionAG11–84324.620.7849.9
AG11–158116.712.4746.8
HiTrap Q elutionAG11–84315.514.6947.0
AG11–15819.28.5924.7
Final productAG11–84314.814.4976.9
AG11–15818.68.0934.4

Total protein was determined by NanoDrop ND-2000.

Protein purity was estimated by SDS-PAGE image analysis.

Lysate was obtained from cells of a 1.5 L culture.

Table 3

The characteristic constants of the AG11–843 and AG11–1581 recombinant proteins.

AnalysisAG11–843AG11–1581
Length281 aa525 aa
Molecular32,225.3760,130.98
1 microgram31.031 pMoles16.630 pMoles
Molar Extinction coefficient16,44033,000
A A[280]corr.to1.96 mg/ml1.82 mg/ml
A[280]of 1 mg/ml0.51 AU0.55 AU
Isoelectric Point5.194.87
Charge at pH 7− 9.06− 27.98
Purify analysis of AG11–843 and AG11–1581. Total protein was determined by NanoDrop ND-2000. Protein purity was estimated by SDS-PAGE image analysis. Lysate was obtained from cells of a 1.5 L culture. The characteristic constants of the AG11–843 and AG11–1581 recombinant proteins.

Oligonucleotide design and purification

The amino acid sequences of two human AG1 fragments (1–1581 bp, 1–527 aa and 1–843 bp, 1–281 aa) were obtained from GenBank (GenBank accession nos: AAO15403.1 and AAX85105.1). The amino acid and nucleotide sequences of these proteins were analyzed and codon-optimized using ClustalW 2.1, Vector NTI Viewer 4.0.1, Sequencher_v4.1., ExPASy Bioinformatics Resource Portal (http://www.expasy.org/), and NCBI Blast (https://blast.ncbi.nlm.nih.gov/Blast.cgi) software packages. The design of the synthetic assembly oligonucleotides was similar to that of Xiong et al. [1], whereby each optimized DNA sequence was divided into economically sized oligonucleotides approximately 57–59 bases long that had 17–19 overlapping bases at both the 5′ and 3′ ends, leaving a 21 base gap between the overlapping regions. In addition, two outer amplification primers containing different restriction enzyme binding sites were designed for each gene to facilitate cloning. Both the AG11–843 and AG11–1581 sequences contained 23 oligoes. The oligonucleotides listed in Table 1 were from Sangon Biotech (Shanghai) Co., Ltd.

Rapid preparation of DUF1220 AG11–843

We used the single overlap extension method as well as the two-step successive PCR method to synthesize the duplicated DUF1220 AG1 gene. Firstly, we mixed 14 (DUF1-DUF12, DUF-F, and Kpn1-R) and 12 (DUF13- DUF19, DUF-R, and Kpn1-F) chemically synthesized single stranded oligonucleotides (1 μM) in separate reaction tubes, followed by hybridization and extension to form the long dsDNA AG11–450 (511 bp) and AG1450–843 (429 bp) constructs containing the appropriate restriction enzyme sites. AG11–450 contained EcoRI and KpnI sites, while AG1450–843 contained KpnI and XhoI sites. The PCR reactions were conducted for 25 cycles with 5 U Pfu polymerase (NEB) in a final volume of 100 μl, in presence of 1× Pfu buffer and 200 μM dNTP. The PCR conditions were 10 s at 90 °C, 10 s at 60 °C, and 50 s at 72 °C for each cycle, followed by extension for 10 min at 72 °C, unless stated otherwise. The AG11–450 and AG1450–843 gene fragments were then cloned into a simple pMD19-T vector and sequenced. The pMD19-T-AG11–450 plasmid was then digested with EcoRI and KpnI, while the pMD19-T-AG1450–843 was digested with Kpn 1 and XhoI, followed by separation on a 1% agarose gel. The digestion products (AG11–450 and AG1450–843) were excised from the gel with a blade, and a purification kit (CoWin Biosciences) was used according to the manufacturer׳s instructions. Then, the AG11–450 and AG1450–843gene fragments were cloned together into a pET-15b-sumo vector which contains 6× His tag and SUMO fusion tags. The molecular cloning of the synthesized DNA fragments was performed according to the standard procedures [1].

High efficiency preparation of DUF1220 AG11–1581

Gene AG11–1581 is composed of two repeats of the AG11–843 fragment, meaning it can be built with the AG11–450 and AG1450–843 fragments expressed in the pET-15b-sumo-AG11–843 plasmid. Firstly, the AG1450–843 and AG11–450 fragments and the pET-15b-sumo-AG11–843 plasmid were used to assemble the template for the third, fourth, and sixth PCR reaction. The two outermost oligonucleotide primers used were Kpn1-F and DUF1, DUF2 and DUF6, and DUF5 and DUF-R, respectively. Secondly, the DNA segment from the Kpn1-DUF1 and DUF2-DUF6 reactions were mixed and used to assemble the template for the fifth PCR reaction, which was carried out using the Kpn1-F and DUF6 oligonucleotides as the two outermost primers. All of the PCR reactions used 5 U Pfu polymerase and 200 μM dNTP and were performed with the following program: 98 °C for 1 min, then 25 cycles of 10 s at 90 °C, 10 s at 58 °C, and 50 s at 72 °C. Thirdly, the AG11–450 fragment was digested with EcoRI and KpnI, while the AG1450–917 and AG1917–1581 fragments were digested with KpnI/PstI and PstI/XhoI, respectively. Then, the digested products purified as described. The three purified DNA fragments were mixed with pET-15b-sumo vector, and the four DNA strands were sealed together at their sticky ends by DNA ligase to form the recombinant plasmid (pET-15b-sumo-AG11–1581). The pET-15b-sumo-AG11–843 and pET-15b-sumo-AG11–1581 sequences were then identified by PCR, double enzyme digesting, and sequencing.

Protein expression and purification

All proteins were expressed in E. coli BL21(DE3) cells. Cells inoculated in 10 ml of LB containing 100 μg/ml of ampicillin. Cultures were grown by shaking at 200 rpm at 37 °C until the absorbance at 600 nm (A600) was ~ 1.0. This starter culture was then inoculated into 1.5 L of the same LB medium and grown as above until A600 = 0.8–1. Then, 0.3 mM IPTG was added, and incubation was continued for 18 h at 18 °C. Cells were then pelleted by centrifugation and re-suspended in lysis buffer (20 mM Tris–HCl, pH 7.5, 1000 mM NaCl, 10% glycerol (v/v)). The cells were sonicated and then centrifuged at 12,000 rpm for 30 min. The samples were loaded on to a Ni2+-charged IMAC column (GE Healthcare), bound with 120 ml of lysis buffer, and washed with 240 ml of washing buffer (20 mM Tris–HCl, pH 7.5, 300 mM NaCl, 10% glycerol (v/v), 50 mM imidazole). Then, the protein was eluted from the Ni2+ affinity column with elution buffer (20 mM Tris–HCl, pH 7.5, 300 mM NaCl, 10% glycerol (v/v), 500 mM imidazole). The eluted protein was incubated with SUMO protease (Invitrogen, Beijing) for 5 h at 18 °C to yield the mature proteins with two extra N-terminal amino acid residues (GluPhe). Protein was subsequently loaded on to a High Q Sepharose 6 Fast Flow column (GE Healthcare) and eluted with a 300–1000 mM NaCl gradient in buffer H (20 mM Tris–HCl, pH 7.5, 10% glycerol, 2 mM dithiothreitol (DTT)). The eluted fraction containing AG11–843 or AG11–1581 was collected, concentrated. The final purified protein was dialyzed against the storage buffer (20 mM Tris–HCl, pH7.5, 300 mM NaCl, 10% glycerol (v/v), 2 mM DTT) and was stored at − 80 °C until use.

Dynamic light scattering (DLS)

DLS measurements were performed at 25 °C with a DynaPro NanoStar instrument (Wyatt Technology Corporation) with a 20 μl micro-cuvette and a thermostat cell holder. The scattered light was collected at an angle of 90°. All buffers were filtered using a 0.22 μm filter membrane, and the samples were centrifuged (13,000g for 30 min at 4 °C). The measurement recording times ranged from 3 to 5 min (averaging 20–30 cycles every 10 s), and the data were analyzed with Dynamics 7.0 software using regularization arithmetic calculations (Wyatt Technology Corporation). The molecular weight (Mw) was calculated from the hydrodynamic radius using the empirical formula: Mw = (1.68 * R)2.34, where R is the hydrodynamic radius (in nm) and Mw is the molecular weight (in kDa). The protein concentrations used were 5, 10, 15, and 20 μM in buffer (300 mM NaCl, 20 mM Tris–HCl, pH7.5, 5% glycerol (v/v), 2 mM DTT), with a total sample volume of 50 μl.

Size exclusion chromatography (SEC)

SEC was carried out at constant temperature room (18 °C) using a fast protein liquid chromatography (FPLC) system (ÄKTA Purifier, GE Healthcare) on an analytical grade 24 ml Superdex 200 10/30 GL column (GE Healthcare). The same buffers were used as described in the DLS methods above. Fractions (0.3 ml) were collected at a flow of 0.3 ml/min, and the absorbance was surveyed at 260 and 280 nm. Experimental method is showed Shi et al. [2].
Subject areabiology
More specific subject areaMolecular biology, protein science
Type of dataTable, graph, figure
How data was acquiredEMSA, SDS-PAGE,
Dynamic light scattering (DynaPro NanoStar instrument, Wyatt Technology Corporation, USA)
Size exclusion chromatography (ÄKTA Purifier, GE Healthcare, USA)
Data formatRaw and analyzed
Experimental factorsnone
Experimental featuresDLS, gel filtration
Data source locationCollege of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
Data accessibilityThe data are available with the article.
Related research articleConstruction, expression, and characterization of AG11–843 and AG11–1581.
  2 in total

1.  A simple, rapid, high-fidelity and cost-effective PCR-based two-step DNA synthesis method for long gene sequences.

Authors:  Ai-Sheng Xiong; Quan-Hong Yao; Ri-He Peng; Xian Li; Hui-Qin Fan; Zong-Ming Cheng; Yi Li
Journal:  Nucleic Acids Res       Date:  2004-07-07       Impact factor: 16.971

2.  A helical bundle in the N-terminal domain of the BLM helicase mediates dimer and potentially hexamer formation.

Authors:  Jing Shi; Wei-Fei Chen; Bo Zhang; San-Hong Fan; Xia Ai; Na-Nv Liu; Stephane Rety; Xu-Guang Xi
Journal:  J Biol Chem       Date:  2017-02-22       Impact factor: 5.157

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.