| Literature DB >> 29359196 |
Hualan Liu1, Morgan N Price1, Robert Jordan Waters1, Jayashree Ray1, Hans K Carlson1, Jacob S Lamson1, Romy Chakraborty2, Adam P Arkin1,3, Adam M Deutschbauer1.
Abstract
Transposon mutagenesis coupled to next-generation sequencing (TnSeq) is a powerful approach for discovering the functions of bacterial genes. However, the development of a suitable TnSeq strategy for a given bacterium can be costly and time-consuming. To meet this challenge, we describe a part-based strategy for constructing libraries of hundreds of transposon delivery vectors, which we term "magic pools." Within a magic pool, each transposon vector has a different combination of upstream sequences (promoters and ribosome binding sites) and antibiotic resistance markers as well as a random DNA barcode sequence, which allows the tracking of each vector during mutagenesis experiments. To identify an efficient vector for a given bacterium, we mutagenize it with a magic pool and sequence the resulting insertions; we then use this efficient vector to generate a large mutant library. We used the magic pool strategy to construct transposon mutant libraries in five genera of bacteria, including three genera of the phylum Bacteroidetes. IMPORTANCE Molecular genetics is indispensable for interrogating the physiology of bacteria. However, the development of a functional genetic system for any given bacterium can be time-consuming. Here, we present a streamlined approach for identifying an effective transposon mutagenesis system for a new bacterium. Our strategy first involves the construction of hundreds of different transposon vector variants, which we term a "magic pool." The efficacy of each vector in a magic pool is monitored in parallel using a unique DNA barcode that is introduced into each vector design. Using archived DNA "parts," we next reassemble an effective vector for making a whole-genome transposon mutant library that is suitable for large-scale interrogation of gene function using competitive growth assays. Here, we demonstrate the utility of the magic pool system to make mutant libraries in five genera of bacteria.Entities:
Keywords: genomics; transposons
Year: 2018 PMID: 29359196 PMCID: PMC5768790 DOI: 10.1128/mSystems.00143-17
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1 Overview of the magic pool strategy. (A) Basic structure of a typical transposon delivery vector (not drawn to scale). The inverted repeat (IR) for the specific transposase is indicated. We dissected the transposon delivery vector into five different parts compatible with Golden Gate assembly, and the different parts are indicated by different colors. (B) General workflow of construction and application of magic pools. In step 1, variants of the five different parts are designed, cloned into a part-holding vector, confirmed by sequencing, and archived. In step 2, the part vectors are mixed and assembled using Golden Gate assembly to produce the magic pools of transposon delivery vectors. In step 3, the magic pool vectors are characterized by DNA sequencing whereby each unique DNA barcode (random 20-nucleotide DNA barcode [N20]) is linked to a specific combination of parts. In step 4, preliminary mutant libraries of approximately 5,000 CFU are made using the magic pool, and TnSeq is performed to link the DNA barcode to the insertion site, thereby simultaneously assessing the efficacy of the vectors in the magic pool. ID, identification. In step 5, an effective vector is reassembled using the archived parts, fully barcoded with millions of random DNA barcodes, and a full RB-TnSeq transposon mutant library is constructed. oriT is the origin of transfer. AmpR is the beta-lactam resistance cassette. R6K is the conditional replication origin.
FIG 2 Mutagenesis efficiency of different mariner kanamycin vectors in magic pool. On the basis of barcode frequency associated with each vector in the mariner-kan magic pool, we show the fraction of unique insertions that map to each of the 10 transposon vector designs in the preliminary mutant library generated with the mariner-Kan magic pool. For a description of all of the parts in each vector, see Table S1 in the supplemental material. Abbreviations: Brev2, Brevundimonas sp. strain GW460-12-10-4-LB2; Sphingo3, Sphingobium sp. strain GW456-12-10-14-TSB1; Sphingo4, Sphingopyxis sp. strain GW247-27LB.
Summary statistics for full mutant libraries
| Category and characteristic | Sphingo3 | Brev2 | Cola | Ponti | Pedo557 |
|---|---|---|---|---|---|
| Summary of mutant libraries | |||||
| Mutant library name | Sphingo3_ML4 | Brev2_ML6 | Cola_ML5 | Ponti_ML7 | Pedo557_ML3 |
| Transposon | |||||
| Method of delivery | Conjugation | Conjugation | Conjugation | Conjugation | Conjugation |
| No. of strains with unique barcodes | 275,037 | 166,981 | 278,255 | 190,948 | 190,892 |
| % of barcodes with intact vectors | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 |
| Protein-coding genes | |||||
| Total no. | 4,903 | 3,210 | 4,625 | 4,546 | 5,229 |
| No. with central insertions | 3,943 | 2,584 | 4,090 | 4,010 | 4,670 |
| Median no. of strains/gene | 30 | 27 | 32 | 19 | 21 |
| Insertion bias (%) | 50.6 | 49.9 | 48.2 | 49.6 | 48.8 |
| Mean/median no. of reads/gene | 1.60 | 1.68 | 1.49 | 1.42 | 1.40 |
| BarSeq statistics | |||||
| % of reads with barcode | 98 | 98 | 98 | 98 | 98 |
| % of barcodes that map to a strain | 60 | 75 | 61 | 73 | 80 |
| No. of reads/million for the median gene[ | 79 | 128 | 73 | 79 | 90 |
The organisms are abbreviated as follows: Sphingo3, Sphingobium sp. strain GW456-12-10-14-TSB1; Brev2, Brevundimonas sp. strain GW460-12-10-14-LB2; Cola, Echinicola vietnamensis DSM17526; Ponti, Pontibacter actiniarum DSM19842; Pedo557, Pedobacter sp. strain GW460-11-11-14-LB5.
Only strains with insertions in the genome are included.
May represent integration events of the entire transposon plasmid into the genome.
When the transposon lies within the central 10 to 90% of a gene.
Includes only protein-coding genes with at least one central insertion.
Insertion bias toward coding strand of genes.
Computed with time zero samples.
Includes reads that have no multiplex or barcode in the denominator.
FIG 3 Mutant fitness data for two Proteobacteria. Fitness values for each gene as calculated using transposon insertions in the first half versus the second half (13). Each fitness value is a log2 ratio comparing the abundance of barcode abundance before and after growth selection. Genes shown in green and blue are listed in Table S4. TIGRFAM auxotrophs are predicted amino acid biosynthesis genes (20). (A) The mutant library of Sphingobium sp. GW456-12-10-14-TSB1 (Sphingo3_ML4) grown in defined medium with glucose as a carbon source. (B) The mutant library of Brevundimonas sp. GW460-12-10-14-LB2 (Brev2_ML6) grown in defined medium with glucose and a mixture of all 20 amino acids (see Materials and Methods).
FIG 4 Preference for part variants in the Erm magic pools. (A) As determined by DNA barcodes identified by TnSeq, the fraction of each of eight part2 variants in the combined mariner and Tn5 preliminary mutant libraries for Echinicola vietnamensis (Cola), Pontibacter actiniarum (Ponti), and Pedobacter sp. GW460-11-11-14-LB5 (Pedo557). (B) The fraction of each of the four part3 variants in the combined mariner and Tn5 preliminary mutant libraries for each bacterium. (C) The fraction of each of the 24 mariner part5 variants in Cola, Ponti, and Pedo557. (D) The fraction of each of the 25 Tn5 part5 variants in Cola and Ponti.
FIG 5 Genome-wide mutant fitness data for three Bacteroidetes strains. The fitness values for each gene were calculated using transposon insertions in the first half versus the second half (13). Each gene fitness value is a log2 ratio comparing the barcode abundance of mutants within that gene before and after growth selection. Genes highlighted in blue are listed in Table S4. TIGRFAM auxotrophs are predicted amino acid biosynthesis genes (20). (A) The mutant library of Echinicola vietnamensis (Cola_ML5) grown in defined minimal medium with glucose as the sole carbon source. (B) The mutant library of Pedobacter sp. GW460-11-11-14-LB5 (Pedo557_ML3) grown in defined minimal medium with glucose as the sole carbon source. (C) The mutant library of Pontibacter actiniarum (Ponti_ML7) grown in a defined minimal medium with a mixture of 20 amino acids (see Materials and Methods).
Strains used in this study
| Strain | Description | Source or reference |
|---|---|---|
| Cloning strain for plasmids carrying R6K replication origin | Lucigen | |
| Conjugation donor strain | William Metcalf, | |
| Wild-type strain; referred to in paper as “Sphingo3” | This study | |
| Wild-type strain; referred to in paper as “Sphingo4” | This study | |
| Wild-type strain; referred to in paper as “Brev2” | This study | |
| Wild-type strain; referred to in paper as “Ponti” | DSM | |
| Wild-type strain; referred to in paper as “Cola” | DSM | |
| Wild-type strain; referred to in paper as “Pedo557” | This study | |
| Sphingo3_ML4 | This study | |
| Brev2_ML6 | This study | |
| Ponti_ML7 | This study | |
| Cola_ML5 | This study | |
| Pedo557_ML3 | This study | |
| AMD583 | Tn | This study |
| AMD584 | This study | |
| AMD279 | Tn | This study |
| AMD280 | This study | |
| AMD636 | Barcoded | This study |
| AMD637 | Barcoded | This study |
| AMD691 | Barcoded | This study |
FIG 6 Golden Gate assembly of transposon delivery vectors from part vectors. (A to E) The five part vectors that are used for Golden Gate assembly of the magic pool transposon delivery vectors: part1 vector, part2 vector, part3 vector, barcoded part4 vector, and part5 vector (not drawn to scale). We show the sequences of the 4-nucleotide overhangs for Golden Gate assembly. (F) The transposon delivery vector with DNA barcode. ColE1 is the replication origin ColE1. cat is the chloramphenicol resistance gene. GFP is green fluorescent protein. oriT is the origin of transfer. AmpR is the beta-lactam resistance gene. R6K is a conditional replication origin. N20 is random 20-nucleotide DNA barcode.