Literature DB >> 21901127

A biobrick library for cloning custom eukaryotic plasmids.

Marco Constante1, Raik Grünberg, Mark Isalan.   

Abstract

Researchers often require customised variations of plasmids that are not commercially available. Here we demonstrate the applicability and versatility of standard synthetic biological parts (biobricks) to build custom plasmids. For this purpose we have built a collection of 52 parts that include multiple cloning sites (MCS) and common protein tags, protein reporters and selection markers, amongst others. Importantly, most of the parts are designed in a format to allow fusions that maintain the reading frame. We illustrate the collection by building several model contructs, including concatemers of protein binding-site motifs, and a variety of plasmids for eukaryotic stable cloning and chromosomal insertion. For example, in 3 biobrick iterations, we make a cerulean-reporter plasmid for cloning fluorescent protein fusions. Furthermore, we use the collection to implement a recombinase-mediated DNA insertion (RMDI), allowing chromosomal site-directed exchange of genes. By making one recipient stable cell line, many standardised cell lines can subsequently be generated, by fluorescent fusion-gene exchange. We propose that this biobrick collection may be distributed peer-to-peer as a stand-alone library, in addition to its distribution through the Registry of Standard Biological Parts (http://partsregistry.org/).

Entities:  

Mesh:

Year:  2011        PMID: 21901127      PMCID: PMC3161993          DOI: 10.1371/journal.pone.0023685

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The construction of specific plasmid DNA sequences is a routine technique in molecular biology laboratories [1]. The most widely used DNA sequences (for instance, a promoter followed by a fluorescent protein, a multiple cloning site and a polyadenylation signal) are easily found in commercially-available plasmids [2], [3]. However, when requirements start to become more stringent, multiple plasmid modifications are required and, although many changes may be relatively simple to perform, multiple modifications may become time-consuming cloning challenges. Complex, multi-factor plasmids have to be built frequently for the applications of synthetic biology [4], [5], [6], often with the combinatorial use of different DNA parts [7]. To simplify such types of constructions an idempotent cloning system has recently been developed by Tom Knight [8] (described in the BioBrick Foundation Request for Comments #10, BBF RFC 10; http://biobricks.org/). Briefly, this system uses a specific set of restriction enzyme sites at the 3′ and 5′ ends of each DNA cassette (‘biobrick’), such that a biobrick ‘A’ may be fused with a biobrick ‘B’ to produce ‘AB’. ‘AB’ contains an uncleavable ‘scar’ sequence between ‘A’ and ‘B’ and, importantly, the exact same set of restriction enzyme sites as the initial biobricks, at the 3′ and 5′ ends. In other words, every biobrick fusion product is itself a new biobrick and may even be used iteratively for the assembly of concatemers (See Figure 1). Given the physical idempotent characteristics of the system, biobricks may be fused together in any combination of parts, with few restrictions in the number of the biobricks, and no restrictions on the order (‘BA’ would be as simple to construct as ‘AB’).
Figure 1

The biobrick assembly principle[8], [10].

(A) Each biobrick part has the same prefix and suffix, containing restriction enzyme sites. (B) Following restriction digests, a two-insert ligation into the biobrick vector results in a biobrick fusion. (C) The new biobrick part regenerates the original prefix and suffix, but contains an in-frame Thr-Arg scar in protein-coding fusions. (D) MS2 binding site concatemers (MS2 BS), built with iterative biobrick assembly, from 1 to 12-copies (4 steps). M = marker (1 kb ladder). The upstream and downstream sequences between the primer annealing sites and the biobricks contribute 312 bp, while each MS2 BS is 39 bp.

The biobrick assembly principle[8], [10].

(A) Each biobrick part has the same prefix and suffix, containing restriction enzyme sites. (B) Following restriction digests, a two-insert ligation into the biobrick vector results in a biobrick fusion. (C) The new biobrick part regenerates the original prefix and suffix, but contains an in-frame Thr-Arg scar in protein-coding fusions. (D) MS2 binding site concatemers (MS2 BS), built with iterative biobrick assembly, from 1 to 12-copies (4 steps). M = marker (1 kb ladder). The upstream and downstream sequences between the primer annealing sites and the biobricks contribute 312 bp, while each MS2 BS is 39 bp. The RFC 10 Biobrick format [8] is itself very useful and has formed the core of engineering challenges such as the annual International Genetically Engineered Machine (iGEM) competitions [9], where students are asked to engineer systems using biobricks. It is a well-documented system with a large and growing collection of parts that use the prefix GAATTCGCGGCCGCTTCTAGAG (or GAATTCGCGGCCGCTTCTAG for protein coding parts starting with ATG) and the suffix TACTAGTAGCGGCCGCTGCAG. However, the original scheme has certain limitations, such as the difficulty of fusing parts coding for protein domains, since the fusion ‘scar’ changes the codon reading frame from biobrick ‘A’ to biobrick ‘B’. Phillips and Silver [10] resolved this issue by presenting a slight modification to the initial design (removing the last G of the prefix and the first T of the suffix thereby conserving the reading frame in the scar). Special attention is required when preparing new parts in this format (BBF RFC 23), since the removal of the last G of the prefix means that a Dam methylation site may be formed if the part starts with TC, which blocks restriction with Xba I. Since the scar resulting from a RFC 23 fusion (ACTAGA) codes for threonine-arginine, a chimeric protein ‘AB’ may be assembled from two RFC 23 biobricks ‘A’ and B’. When preparing such RFC 23 protein fusions, users should be aware that the arginine from the scar may be problematic since it is positively charged. It is of interest to note that biobricks in the RFC 23 format may still be assembled with biobricks in the RFC 10 format without requiring new restriction enzymes. Although such a RFC 23-RFC 10 cross-fusion does not maintain the 6-bp scar required for the maintenance of a codon frame, N-terminal fusions of RFC 23 protein coding parts may still be performed as long as the frameshift is corrected by an adaptor part. In addition, the RFC 23-RFC 10 cross-fusion may be useful for assemblies that do not require the maintenance of a reading frame or a specific scar size, such as linking non-coding biobricks (e.g. transcription factor binding sites). We refer to this compatibility between RFC 23 and RFC 10 as ‘assembly compatibility’, that should not be confused with ‘RFC 23 compatibility’ that is reserved for fusion between biobricks that maintain the reading frame. Further biobrick formats (with only limited cross-compability) were later proposed and are documented as BBF RFCs. Engineering with the Biobrick system is usually accompanied with an abstraction hierarchy perspective. With this perspective in mind, one uses ‘DNA’ to design basic ‘parts’ that may then be assembled into a single functional ‘device’ with a specific function. Finally, several of these ‘devices’ may be put together to design a synthetic ‘system’ with a high-level functionality. This hierarchy facilitates synthetic engineering since, ideally, a researcher working, for instance, at the ‘system’ level does not need to know the specific details on how to engineer and build the ‘devices’ used in that ‘system’. Synthetic biologists are now in the process of developing and characterizing basic ‘parts’ or part sets (‘devices’ [11]) that may be used to engineer biological machines [12], [13], [14] and to model them [15]. These parts are available from the Registry of Standard Biological Parts (http://partsregistry.org/) which is supported by the BioBricks Foundation (http://bbf.openwetware.org/). Although the registry is available to any researcher, whether or not they work in the field of synthetic biology, we believe that the size of the registry, the existence of different biobrick formats (not always compatible with each other), and the limited communication of this resource to non-synthetic biologists, may hamper the evolution and distribution of biobricks. We therefore developed a cloning tool that should be of interest outside the field of synthetic biology; a biobrick toolkit for the construction of custom eukaryotic expression plasmids, using frequently-used components. We originally noted that many biobrick parts that one would require for eukaryotic projects were apparently missing from the registry. These parts included: several multiple cloning sites with different reading frames; an extensive range of reporter proteins; eukaryotic selection markers; eukaryotic internal ribosomal entry sites; and protein epitope tags. Importantly, most of these parts should be compatible with each other [10], [16]. In addition, the novel use of MCS biobricks facilitates the cloning of DNA fragments of interest (e.g. open reading frames, promoters), without having to first remove any internal cleavage sites of the biobrick enzymes in order to render them ‘biobrick compatible’. This work is intended to highlight the potential of the biobrick system to construct custom plasmids for eukaryotic cell lines, for both synthetic and non-synthetic biologists. Since many parts that we consider important did not exist in the registry, we have built a distribution of 9 previously existing and 43 new biobricks. We demonstrate the utility of the new biobricks with some examples of assemblies. In addition, we use the created library to establish a gene-switching system, from tdTomato to EGFP expression, using recombinase-mediated DNA insertion (RMDI). Finally, we propose that this collection be distributed peer-to-peer as an independent library, as well as being available from the registry, and discuss some advantages in having this type of collection.

Materials and Methods

Preparation of parts

New parts were designed following the recommendations in BBF RFC 23. Briefly, parts were formatted to contain the prefix 5′-GAATTCGCGGCCGCTTCTAGA-3′ and the suffix 5′-ACTAGTAGCGGCCGCTGCAG-3′. Construction of the parts was done using 1) oligonucleotide inserts, 2) PCR with oligonucleotide-directed mutagenesis when required [1] or 3) synthesized directly by GenScript Corporation (Piscataway, NJ). Importantly, the part sequences do not start with a TC since this forms a Dam methylase site (GATC) and digestion with XbaI can be inhibited. The Biobrick sequences are available at the Registry of Standard Biological Parts website (http://partsregistry.org/), at GenBank (http://www.ncbi.nlm.nih.gov/genbank/) and in the Supporting Information S1.

Biobrick assemblies

Assemblies were performed similarly to the method presented in the Biobrick assembly kit from New England Biolabs (NEB; Ref. E0546S) and the Gingko Bioworks manual (http://ginkgobioworks.com/support/). Alternatively, one may use the adapted streamlined protocol we have previously reported [14]. Briefly, 500 ng of the upstream part are digested with 20000 U of EcoRI (NEB) and 10000 U of SpeI (NEB) in NEB EcoRI buffer. At the same time, 500 ng of the downstream part are digested with 20000 U of XbaI (NEB) and 20000 U of PstI (NEB) in NEB Buffer 3. Finally and concomitantly, 500 ng of the destination plasmid are digested with 20000 U of EcoRI and 20000 U of PstI in NEB EcoRI Buffer. Digestions are performed at 37°C for 1 hour followed by heat inactivation at 80°C for 20 minutes. Note that the recipient pSB1A* plasmids in the library contain a ccdB death cassette that is removed in the digestion. Furthermore, each assembly should contain an antibiotic resistance gene that is not present in either the upstream or the downstream part (e.g. if the upstream part is in the pSB1AK3 plasmid, containing ampicillin and kanamycin resistance, and the downstream part is in the pSB1AC3 plasmid, containing ampicillin and chloramphenicol resistance, the destination plasmid should be the pSB1AT3 plasmid since it contains the tetracycline resistance not present in the other plasmids). Since the destination plasmid contains the ccdB death cassette, a two-way ligation of the fragments can be done using the Roche rapid DNA ligation kit, according to the manufacturer's instructions, without need for gel isolation or other purifications. Briefly, 4 µl of the upstream part digestion, 4 µl of the downstream part digestion and 2 µl of the destination plasmid digestion are used. 2 µl of the ligation product are transformed in one shot competent Top10 E. coli (Invitrogen), according to the manufacturer's instructions, and are subsequently plated onto LB agar plates. The plates contain antibiotic for the resistance provided by the destination plasmid (in the example above one should use LB agar plates containing tetracycline) and are incubated overnight. PCRs of Biobricks (from plasmid DNA or bacterial colony) were performed using NEB Taq polymerase, using the manufacturer's instructions, and using the standard sequencing primers BBa_G00100 (5′-TGCCACCTGACGTCTAAGAA-3′) and BBa_G00101 (5′-ATTACCGCCTTTGAGTGAGC-3′) with 20 cycles at 55°C annealing temperature, with appropriate denaturation and elongation steps. PCR products were analyzed on agarose gels and positive colonies were grown overnight for DNA isolation using the Qiagen spin miniprep kit according to the manufacturer's instructions. All clones were verified by DNA sequencing.

Cell culture and transfection

HEK293 cells were purchased from ATCC. Cells were propagated in Dulbecco's Modified Eagle Medium (Gibco) supplemented with 10% fetal bovine serum (Gibco) and 1% penicillin-streptomycin (Gibco). Transfection of the cells was performed with lipofectamine 2000 (Invitrogen), according to the manufacturer's instructions, in 6-well plates: cells were incubated for 4 hours with 4 µg of DNA and 10 µl of lipofectamine reagent. Stable clones were prepared by selection with 0.5 mg/ml G418 (Sigma).

Cloning small Espin into the MCS of a custom-made plasmid

Rat small Espin (sEspin) [17] was amplified by PCR, from a plasmid kindly provided by the group of Hernán López-Schier, using the primers sEspinF (5′-AAGAGGGATCCATGAACTCCC-3′) and sEspinR (5′-CTTCTTACCGGTTTACTTAGGGATCTCCCCCTTC-3′) at 55°C annealing temperature. The PCR product was cloned into the Biobrick custom plasmid using the restriction enzymes BamHI and AgeI.

Microscopy

After transfection, cells were allowed to recover overnight and were dispensed into glass bottom culture dishes (MatTek). 24 h later, the medium was replaced with PBS for observation under the microscope.

Recombinase-mediated DNA inclusion plasmid preparation

The plasmid used for the creation of the stable clone expressing tdTomato was prepared using an assembly of the biobricks between the CMV promoter and polyadenylation (pA) signal. The assembled plasmid was digested with XbaI and PstI and cloned into pEGFP-C1 digested with NheI and PstI, thereby removing the EGFP sequence and providing the CMV upstream and pA downstream of the biobrick insert. Neomycin resistance used for selection is provided by the pEGFP-C1 plasmid backbone under the expression of the SV40 early promoter. The EGFP plasmid for inclusion was prepared using the biobrick collection.

Flow cytometry

Flow-assisted cell sorting (FACS) analysis was performed with FACSCanto (BD) and cell separation with FACSARIA II (BD).

Results

Library

To create a distribution for the construction of custom plasmids, we have put together a set of 9 existing and 43 new biobricks and plasmid backbones. We believe that these should be useful for the creation of custom-made plasmids for use in eukaryotic cell lines (see Table 1). Importantly, a set of multiple cloning site (MCS) biobricks, covering all three different frames has been built, essential for the combination of the biobrick assembly strategy with classical cloning strategies (Fig. 2).
Table 1

List of biobrick parts for eukaryotic plasmids.

GroupNicknameDescriptionRegistryGenBank
BackbonespSB1A3*High copy number plasmid carrying ampicillin resistance.pSB1A3
pSB1AK3*High copy number plasmid carrying ampicillin and kanamycin resistance.pSB1AK3
pSB1AC3*High copy number plasmid carrying ampicillin and chloramphenicol resistance.pSB1AC3
pSB1AT3*High copy number plasmid carrying ampicillin and tetracyclin resistance.pSB1AT3
ConstructionKozakSimple Kozak sequence protein head domain [26] BBa_J96000JN204869
StopTail domain with stop codons in all three framesBBa_J96001JN204870
CMV*Cytomegalovirus immediate-early promoterBBa_I712004
SV40pA*Eukaryotic – derived from SV40 early poly A signal sequenceBBa_J52016
MCS1aMultiple cloning site version 1, first frameBBa_J96002JN204871
MCS1bMultiple cloning site version 1, second frameBBa_J96003JN204872
MCS1cMultiple cloning site version 1, third frameBBa_J96004JN204873
MCS2aMultiple cloning site version 2, first frameBBa_J96005JN204874
MCS2bMultiple cloning site version 2, second frameBBa_J96006JN204875
MCS2cMultiple cloning site version 2, third frameBBa_J96007JN204876
MCS3aMultiple cloning site version 3, first frameBBa_J96008JN204877
MCS3bMultiple cloning site version 3, second frameBBa_J96009JN204878
MCS3cMultiple cloning site version 3, third frameBBa_J96010JN204879
SelectionFNeomycinResistance to G418/neomycin gene [1] BBa_J96011JN204880
FPuromycinResistance to puromycin gene [1] BBa_J96012JN204881
FHSTKHerpes simplex thymidine kinase conferring toxicity to ganciclovir [27] BBa_J96013JN204882
ReporterstdTomatoEngineered red fluorescent protein [28] BBa_J96029JN204883
EGFPEngineered green fluorescent protein [29] BBa_J96031JN204884
CeruleanEngineered cyan fluorescent protein [19] BBa_J96032JN204885
EBFP2Engineered blue fluorescent protein [30] BBa_J96033JN204886
mCherry*Engineered red fluorescent protein [28] BBa_J63000
RLuciferaseRenilla luciferase gene [31] BBa_J96034JN204887
TagsFlagFLAG affinity tag [32] BBa_J96035JN204888
HAHA affinity tag [33] BBa_J96036JN204889
HisHis affinity tag [34] BBa_J96037JN204890
StrepIIStrep II affinity tag [35] BBa_J96038JN204891
LocalizationSPMembrane or secretion: IgK leader peptide with Kozak [36] BBa_J96014JN204892
TMDPDGF Receptor Transmembrane Domain [36] BBa_J96015JN204893
MyristoylationMyristoylation signal sequence with Kozak [37] BBa_J96016JN204894
NLS*Nuclear Localization Signal from SV40 [38] BBa_J63008
RecombinationLoxpLox p sequence [39] BBa_J96017JN204895
Lox66Directional lox sequence compatible with lox71 [40] BBa_J96018JN204896
Lox71Directional lox sequence compatible with lox66 [40] BBa_J96019JN204897
CistronIRESInternal ribosomal entry site [41] BBa_J96040JN204898
P2ASelf cleaving 2A peptide [42] BBa_J96041JN204899
T2ASelf cleaving 2A peptide [42] BBa_J96042JN204900
PS3Short DNA sequence for ribosome recruitment (mini-IRES) [43] BBa_J96043JN204901
PS4Short DNA sequence for ribosome recruitment (mini-IRES) [43] BBa_J96044JN204902
OthersLinker24 aa flexible linker, rich in Gly and Ser.BBa_J96020JN204903
Spacer1*Randomized DNA spacerBBa_J96021
Spacer2Randomized DNA spacerBBa_J96022JN204904
MS2MS2 phage coat domain binding to RNA at the MS2 binding site sequence [44] BBa_J96023JN204905
MS2BSMS2 phage coat binding site sequence [44] BBa_J96024JN204906
LambdaNLambda N peptide sequence binding to RNA at the boxB binding site [45] BBa_J96025JN204907
BoxBLambda N peptide binding site sequence [45] BBa_J96026JN204908
TEVSiteTEV tobacco etch virus protease cleavage site [46] BBa_J96027JN204909
d1PESTMouse ornithine decarboxylase PEST sequence 1 hr half-life [47] BBa_96046JN204910
d2PESTMouse ornithine decarboxylase PEST sequence 2 hr half-life [47] BBa_96047JN204911

Asterisks (*) indicate previously existing Biobricks. “Registry” refers to the Registry reference, whereas “GenBank” refers to the GenBank accession number.

Figure 2

Multiple Cloning Site (MCS) biobricks.

The uniqueness of each cloning site is dependent on whether the remaining biobricks and backbones used for the custom plasmid also contain the restriction site. Blunt end restriction enzymes are represented in italic. These biobricks link classical cloning to the biobrick system.

Multiple Cloning Site (MCS) biobricks.

The uniqueness of each cloning site is dependent on whether the remaining biobricks and backbones used for the custom plasmid also contain the restriction site. Blunt end restriction enzymes are represented in italic. These biobricks link classical cloning to the biobrick system. Asterisks (*) indicate previously existing Biobricks. “Registry” refers to the Registry reference, whereas “GenBank” refers to the GenBank accession number. To facilitate flexible construction of custom-made plasmids, all biobricks are in the same format, containing the same prefix and suffix, as described in the Methods section. Biobricks may be divided into: (i) Non-coding (not intended for containing coding sequences; lengths are often not in multiples of three, or have a stop codons in the sequence). (ii) Head domains (that contain a Kozak sequence and are used to start the translation of the protein of interest). (iii) Internal domains (the coding regions for the desired domains (e.g. reporter proteins). (iv) Tail domains (containing the stop codon used to stop the translation of the protein of interest). (v) Translation units (containing the Kozak sequence, an internal domain and a stop codon). Table 2 indicates whether a given biobrick maintains the codon frame, whether it contains a Kozak sequence or a stop codon, and shows the restriction sites present in each biobrick or plasmid backbone. In this table we explicitly present those elements that may not be used for biofusion assembly (highlighted in bold).
Table 2

Characteristics of the collection plasmids.

NameRestriction SitesKozakStopBackboneSize(bp)3nWell
Backbones pSB1A3 ---2157*-1A
pSB1AK3 ClaI, HindIII, SmaI, XhoI---3189*-1B
pSB1AC3 SacI, XhoI---3055*-1C
pSB1AT3 BamHI, ClaI, HindIII, SalI, SphI, XhoI---3446*-1D
ConstructionKozakYESNAK12YES1E
Stop N YES AK11N1F
CMV SacIN YES AK654YES1G
SV40pA SphIN YES AK228YES1H
MCS1a ApaI, XhoI, BamHI, HindIIINNAK32N2A
MCS1bApaI, XhoI, BamHI, HindIIINNAT33YES2B
MCS1c ApaI, XhoI, BamHI, HindIIINNAT34N2C
MCS2aBglII, SacI, KpnI, AgeINNAT30YES2D
MCS2b BglII, SacI, KpnI, AgeINNAT31N2E
MCS2c BglII, SacI, KpnI, AgeINNAT32N2F
MCS3aBsrGI, StuI, SalI, SphI, ClaINNAC39YES2G
MCS3b BsrGI, StuI, SalI, SphI, ClaINNAC40N2H
MCS3c BsrGI, StuI, SalI, SphI, ClaINYESAC41N3A
Selection FNeomycin SphIYES YES AK801YES4A
FPuromycin StuIYES YES AC606YES4B
FHSTK ApaI, SmaI, SphIYES YES AC1137YES4C
ReporterstdTomatoNNAK1425YES5A
EGFPBsrGINNAC714YES5B
mCeruleanBsrGI, ClaINNAC714YES5C
EBFP2NNAC714YES5D
mCherryNNA705YES5E
RLuciferaseBsrGI, SphINNAK930YES5F
TagsFlagNNAC24YES6A
HANNAC27YES6B
HisNNAC18YES6C
StrepIINNAC24YES6D
LocalizationSPYESNAC69YES7A
TMDNNAC147YES7B
MyristYESNAC48YES7C
NLSNNA21YES7D
RecombinationLoxpNNAC36YES8A
Lox66NNAC36YES8B
Lox71NNAC36YES8C
Cistron IRES ApaI, HindIII, KpnIN YES AC504YES9A
P2ASmaINNAC42YES9B
T2ANNAC54YES9C
PS3 BsrGIN YES AC50N9D
PS4 N YES AC48YES9E
OthersLinkerNNAC72YES10A
Spacer1 N YES A72YES10B
Spacer2 NNAK70N10C
MS2BglII, SacI, SalINNAK387YES10D
MS2BSSalINNAC39YES10E
LambdaNNNAC66YES10F
BoxBApaINNAC21YES10G
TEVSiteNNAC21YES10H
d1PEST N YES AT129YES11A
d2PEST N YES AT129YES11B

Characteristics of the collection plasmids. In bold we present all plasmids that, when fused, do not allow for a continuous coding sequence (either because they contain a stop signal or because the length in bp is not a multiple of 3: “3n”). A (ampicillin), C (chloramphenicol), K (kanamycin) and T (tetracycline) indicate the resistances provided by each backbone. The position of samples in the distribution is indicated by “Well”. Asterisks (*) indicate that the presented size refers to the plasmid size instead of the biobrick size.

Characteristics of the collection plasmids. In bold we present all plasmids that, when fused, do not allow for a continuous coding sequence (either because they contain a stop signal or because the length in bp is not a multiple of 3: “3n”). A (ampicillin), C (chloramphenicol), K (kanamycin) and T (tetracycline) indicate the resistances provided by each backbone. The position of samples in the distribution is indicated by “Well”. Asterisks (*) indicate that the presented size refers to the plasmid size instead of the biobrick size. Whereas the biobrick assembly workflow will work for all parts in the collection, for classical cloning it is up to the user to check which biobricks will be used to assemble a plasmid. For example, the user must check whether a given restriction enzyme site is present only once (e.g. in the MCS site region) or whether it is not unique and is also found in another biobrick or plasmid backbone.

Constructing simple assemblies

The biobrick assembly principle is inherently iterative, allowing longer and longer poly-biobrick constructs to be built in a stepwise fashion (Fig. 1). This is particularly useful for making related constructs with variable copies of a motif, such as a transcription factor binding sites. To illustrate this, we show how the binding sites of the bacterial MS2 phage coat protein, which can be used to repress mRNA translation in synthetic biology applications [18], can be conveniently concatenated with biobrick assembly (Fig. 1D). One of the simplest expression plasmids that one might construct using the distribution is a plasmid such as that presented in Figure 3A; a simple EGFP expression plasmid, used to label cells, as may be observed in Figure 3B. A user will typically want to create a translation unit and insert it under a promoter with a polyadenylation sequence in the end. The translation unit will typically be composed of a head domain (in this case the Kozak sequence alone) followed by one or more internal domains (in this example the EGFP reporter coding sequence) and finished with a tail domain (here the simple stop codon biobrick).
Figure 3

Examples of biobrick assemblies.

(A) Structure of a classical plasmid for EGFP expression using 5 biobricks. (B) Structure of bicistronic custom plasmid. 8 biobricks are linked together to make a construct for C-terminal fusions to the blue fluorescent protein, cerulean, using classical restriction enzyme multiple cloning sites (MCS). For illustration, the actin-bundling binding protein sEspin is cloned into the MCS, resulting in a fusion with cerulean. (C) Fluorescence microscopy image of the EGFP construct in panel A, after transient transfection into HEK293 cells. (D) A fluorescence microscopy image of the Cerulean-sEspin fusion construct in panel B, allows the visualisation of stress fiber-like structures in a HEK293T cell. Scale bars are indicated below each image.

Examples of biobrick assemblies.

(A) Structure of a classical plasmid for EGFP expression using 5 biobricks. (B) Structure of bicistronic custom plasmid. 8 biobricks are linked together to make a construct for C-terminal fusions to the blue fluorescent protein, cerulean, using classical restriction enzyme multiple cloning sites (MCS). For illustration, the actin-bundling binding protein sEspin is cloned into the MCS, resulting in a fusion with cerulean. (C) Fluorescence microscopy image of the EGFP construct in panel A, after transient transfection into HEK293 cells. (D) A fluorescence microscopy image of the Cerulean-sEspin fusion construct in panel B, allows the visualisation of stress fiber-like structures in a HEK293T cell. Scale bars are indicated below each image.

Constructing a MCS plasmid for classical cloning

Although many commercially-available plasmids contain MCSs to allow fusing proteins of interest to fluorescent proteins, many combinations of colours, N-C orientations, stable cell-line selection genes or half-life modifications are simply not available. To illustrate this we have created a more complex plasmid, containing two cistrons, that is not commercially-available. In the first cistron, we assembled the blue fluorescent protein cerulean [19] before a complex MCS (assembled using MCS1 and MCS2). We subsequently cloned an actin-binding protein, rat sEspin [17], using the BamHI and AgeI restriction sites (Figure 3C). By fluorescent microscopy, we observed Cerulean-sEspin localization in actin bundle stress fiber-like structures [17] (Fig. 3D). In the second cistron we included the neomycin resistance gene for positive selection. The example shown is just one of many customized cloning vectors that can be generated with this platform, and requires just three biobrick cloning iterations (first round: CMV-Kozak, Cerulean-MCS1a, MCS2a-IRES, Neomycin-pA; second round: CMV-Kozak-Cerulean-MCS1a, MCS2a-IRES-Neomycin-pA; third round: CMV-Kozak-Cerulean-MCS1a-MCS2a-IRES-Neomycin-pA). The use of MCSs links the biobrick format to classical cloning strategies, which should increase their appeal to researchers who simply wish to ‘cut and paste’ their DNA cassettes with standard restriction enzymes.

Recombinase-Mediated DNA Insertion

The collection presented here may be used to obtain stable cloning systems, for chromosomal integration in eukaryotic cells, such as Recombinase-Mediated DNA Insertion (RMDI) [20]. We implemented RMDI by establishing a stable cell line expressing tdTomato, and containing an heteromeric lox site in its coding sequence (lox66 [21]; see scheme in Figure 4A). Through Cre-mediated recombination we inserted DNA from a donor plasmid, containing EGFP preceded by a compatible heteromeric site (lox71). Thus, red expression was converted to green expression.
Figure 4

Recombinase-mediated DNA insertion with biobricks.

(A) Schematic view of Cre-mediated recombination between lox66 and lox71, resulting in the insertion of EGFP and creating a mutated lox site (loxM) and a loxP site. This prevents the original tdTomato from being expressed while allowing EGFP expression. (B, C and D) Flow cytometry analysis of a stable cell line expressing tdTomato, with various recombinase or cell sorting treatments. (B) The untreated cell line (−Cre) contains mainly red-positive cells (72%). (C) Upon recombinase treatment (+Cre), the amount of green cells increases in quadrants 2 and 4 (Q2, Q4). (D) By sorting the green cells (Q2 and Q4 in the previous panel), using flow cytometry (+Sort), and growing to confluency, the resulting cells are enriched for the newly-generated green cells (20%; Q4).

Recombinase-mediated DNA insertion with biobricks.

(A) Schematic view of Cre-mediated recombination between lox66 and lox71, resulting in the insertion of EGFP and creating a mutated lox site (loxM) and a loxP site. This prevents the original tdTomato from being expressed while allowing EGFP expression. (B, C and D) Flow cytometry analysis of a stable cell line expressing tdTomato, with various recombinase or cell sorting treatments. (B) The untreated cell line (−Cre) contains mainly red-positive cells (72%). (C) Upon recombinase treatment (+Cre), the amount of green cells increases in quadrants 2 and 4 (Q2, Q4). (D) By sorting the green cells (Q2 and Q4 in the previous panel), using flow cytometry (+Sort), and growing to confluency, the resulting cells are enriched for the newly-generated green cells (20%; Q4). Since the EGFP plasmid did not contain a promoter, cells should express EGFP alone only if inserted at the correct RMDI site. Expression from random insertions next to genomic promoters (expected to be rare events) would be associated with both green and red expression. However, in the early stages after transfection, even correctly recombined cells would be expected to have some residual red expression, until the levels of tdTomato were degraded or diluted by cell division. Indeed, after three days we observed a cell population expressing both EGFP and tdTomato by flow cytometry (0.23% for RMDI when using both the EGFP plasmid and Cre versus 0.08% for random insertion in the control with the EGFP plasmid without Cre; Fig. 4B and C). Thus, there appeared to be some Cre-induced increase in GFP fluorescence, but it was not clear whether the effects were site-specific or due to background fluorescence. Therefore, to investigate whether the tdTomato levels would subsequently fall, indicating correct RMDI, we sorted the EGFP-positive population in Figure 4C, grew them to confluency and re-analysed the final population. We thus obtained a population of cells where RMDI had indeed occurred, with high levels of EGFP and low levels of tdTomato, similar to non-fluorescent controls (20% green-only cells; Fig. 4D). When considering only the EGFP-positive population (Q2+Q4) we note a marked shift in the Q4 population (8% of Q2+Q4 before sorting to 83% of Q2+Q4 after sorting and regrowth). This suggests that the insertion of the EGFP plasmid occurred at the intended lox site, thereby disrupting tdTomato, but that extra time was required for tdTomato levels to fall, via dilution and degradation, during regrowth of the sorted cells. The 17% of Q2+Q4 found in Q2 is an EGFP-tdTomato double-positive population (4% of the total cell population; Fig. 4D), likely arising from cells containing multiple chromosomal copies of the initial tdTomato construct: here recombination occurred in one or more instances, but not in every copy. Random integrations of GFP near promoter regions are likely to be rare but may also contribute to this population. We also observed tdTomato-positive EGFP-negative (32%) and tdTomato-EGFP double-negative (45%) populations. These are likely arising from cells that were unmodified and yet were carried-through the EGFP cell sorting step (false positives) or from cells that downregulated the CMV promoter [22].

Discussion

In this work, we have created a distribution of DNA fragments compatible with the Standard Registry of Biological Parts [8]. This distribution is intended for the specific use of creating custom-made plasmids, with a focus on use in eukaryotic cell lines, particularly because many previous parts have focused on prokaryotic components. We have therefore constructed biobricks for elements that are routinely used in plasmids for transfection in eukaryotic cell lines. Using these elements, we provide three examples of applications of this system: the assembly of sequence motif concatemers; the assembly of vectors for transient and stable transfection; and a Cre-based Recombinase-Mediated DNA Insertion. Although the present work makes use of existing concepts, such as the standardization of DNA parts [23] and classical cloning for expression in eukaryotic cell lines [1], the construction of the required elements into one collection and the combination of both cloning concepts makes this method a distinct new resource for preparing eukaryotic contructs. To provide this tool to users, we propose that the collection presented here be available as a stand-alone biobrick distribution exchanged in a peer-to-peer fashion, in addition to its availability through the Registry of Standard Biological Parts (http://partsregistry.org/). A researcher may request access to the Registry to obtain a large collection of biobricks which are a valuable resource, especially to synthetic biologists. However, the size of the Registry database may be detrimental in some instances, especially for sporadic users. One example of this is the impossibility - at the moment - to search the registry database for biobricks in a specific format. In fact, special care must be taken when using biobricks from the registry since it is not always clear whether two internal protein domain biobricks may be fused in frame, because of the different biobrick formats [10], [16]. This is in part due to the definition of the registry database of biobricks as being ‘compatible’ with a given format when meaning they may be assembled together even if the coding frame is not maintained, whereas other users may consider ‘compatibility’ as an indication that the fusion conserves the codon frame. In contrast, RFC 23 compatibility in the small collection presented here – in the sense of codon reading frame conservation – is ensured except where otherwise explicitly stated. Another advantage of having an independent small distribution for a specific function is the possibility to exchange it in 96-well plates and to be sure that one is using sequence-verified components. For example, when sequence-verifying the registry, certain discrepancies have been found [24]. The bacteria containing the plasmids can be grown and shipped in wells filled with agar medium, allowing direct sharing between peers, without need for advanced robotics or pipetting systems. We believe that other similar biobrick libraries should be created to be self-contained. That is, all parts in such libraries should be compatible and sufficient to be used for a specific function. For instance one could envisage kinase-phosphatase or signalling cascade libraries; transcription factor libraries; metabolic enzyme libraries. Such a library-based form of distribution may be one of several solutions for the growing size of the Registry that may rapidly reach a limit in the cost-efficiency of its own distribution (the registry is >13 000 parts and growing [25]). Ultimately, we hope to stimulate debate in this growing open standard for biological engineering. FASTA DNA sequences of the Biobrick Collection. Prefix and Suffix are presented in capitals. (DOC) Click here for additional data file.
  40 in total

1.  Engineering ingenuity at iGEM.

Authors:  Catherine Goodman
Journal:  Nat Chem Biol       Date:  2008-01       Impact factor: 15.040

2.  Analysis of bacteriophage N protein and peptide binding to boxB RNA using polyacrylamide gel coelectrophoresis (PACE).

Authors:  C D Cilley; J R Williamson
Journal:  RNA       Date:  1997-01       Impact factor: 4.942

3.  Building blocks for protein interaction devices.

Authors:  Raik Grünberg; Tony S Ferrar; Almer M van der Sloot; Marco Constante; Luis Serrano
Journal:  Nucleic Acids Res       Date:  2010-03-09       Impact factor: 16.971

4.  The encephalomyocarditis virus internal ribosome entry site allows efficient coexpression of two genes from a recombinant provirus in cultured cells and in embryos.

Authors:  I R Ghattas; J R Sanes; J E Majors
Journal:  Mol Cell Biol       Date:  1991-12       Impact factor: 4.272

5.  Correction of multi-gene deficiency in vivo using a single 'self-cleaving' 2A peptide-based retroviral vector.

Authors:  Andrea L Szymczak; Creg J Workman; Yao Wang; Kate M Vignali; Smaroula Dilioglou; Elio F Vanin; Dario A A Vignali
Journal:  Nat Biotechnol       Date:  2004-04-04       Impact factor: 54.908

6.  Purification of a RAS-responsive adenylyl cyclase complex from Saccharomyces cerevisiae by use of an epitope addition method.

Authors:  J Field; J Nikawa; D Broek; B MacDonald; L Rodgers; I A Wilson; R A Lerner; M Wigler
Journal:  Mol Cell Biol       Date:  1988-05       Impact factor: 4.272

7.  Exploration of new chromophore structures leads to the identification of improved blue fluorescent proteins.

Authors:  Hui-wang Ai; Nathan C Shaner; Zihao Cheng; Roger Y Tsien; Robert E Campbell
Journal:  Biochemistry       Date:  2007-04-20       Impact factor: 3.162

8.  Standard biological parts knowledgebase.

Authors:  Michal Galdzicki; Cesar Rodriguez; Deepak Chandran; Herbert M Sauro; John H Gennari
Journal:  PLoS One       Date:  2011-02-24       Impact factor: 3.240

9.  Small espin: a third actin-bundling protein and potential forked protein ortholog in brush border microvilli.

Authors:  J R Bartles; L Zheng; A Li; A Wierda; B Chen
Journal:  J Cell Biol       Date:  1998-10-05       Impact factor: 10.539

10.  Targeted development of registries of biological parts.

Authors:  Jean Peccoud; Megan F Blauvelt; Yizhi Cai; Kristal L Cooper; Oswald Crasta; Emily C DeLalla; Clive Evans; Otto Folkerts; Blair M Lyons; Shrinivasrao P Mane; Rebecca Shelton; Matthew A Sweede; Sally A Waldon
Journal:  PLoS One       Date:  2008-07-16       Impact factor: 3.240

View more
  7 in total

1.  Impact of upstream and downstream constraints on a signaling module's ultrasensitivity.

Authors:  Edgar Altszyler; Alejandra Ventura; Alejandro Colman-Lerner; Ariel Chernomoretz
Journal:  Phys Biol       Date:  2014-10-14       Impact factor: 2.583

2.  High-throughput, cost-effective verification of structural DNA assembly.

Authors:  Yandi Dharmadi; Kedar Patel; Elaine Shapland; Daniel Hollis; Todd Slaby; Nicole Klinkner; Jed Dean; Sunil S Chandran
Journal:  Nucleic Acids Res       Date:  2013-11-06       Impact factor: 16.971

3.  A computational pipeline for identifying kinetic motifs to aid in the design and improvement of synthetic gene circuits.

Authors:  Austin W T Chiang; Ming-Jing Hwang
Journal:  BMC Bioinformatics       Date:  2013-10-22       Impact factor: 3.169

4.  The synthetic biology future.

Authors:  Roy D Sleator
Journal:  Bioengineered       Date:  2014-02-21       Impact factor: 3.269

Review 5.  The PLOS ONE synthetic biology collection: six years and counting.

Authors:  Jean Peccoud; Mark Isalan
Journal:  PLoS One       Date:  2012-08-15       Impact factor: 3.240

6.  The Constructor: a web application optimizing cloning strategies based on modules from the registry of standard biological parts.

Authors:  Matthijn C Hesselman; Jasper J Koehorst; Thijs Slijkhuis; Dorett I Odoni; Floor Hugenholtz; Mark W J van Passel
Journal:  J Biol Eng       Date:  2012-09-04       Impact factor: 4.355

Review 7.  Expression of codon optimized genes in microbial systems: current industrial applications and perspectives.

Authors:  Claudia Elena; Pablo Ravasi; María E Castelli; Salvador Peirú; Hugo G Menzella
Journal:  Front Microbiol       Date:  2014-02-04       Impact factor: 5.640

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.