| Literature DB >> 35856675 |
Emily C A Goodall1, Faye C Morris2, Samantha A McKeand2, Rudi Sullivan2, Isabel A Warner1, Emma Sheehan2, Gabriela Boelter2, Christopher Icke1, Adam F Cunningham3, Jeffrey A Cole2, Manuel Banzhaf2, Jack A Bryant2, Ian R Henderson1.
Abstract
In recent years the availability of genome sequence information has grown logarithmically resulting in the identification of a plethora of uncharacterized genes. To address this gap in functional annotation, many high-throughput screens have been devised to uncover novel gene functions. Gene-replacement libraries are one such tool that can be screened in a high-throughput way to link genotype and phenotype and are key community resources. However, for a phenotype to be attributed to a specific gene, there needs to be confidence in the genotype. Construction of large libraries can be laborious and occasionally errors will arise. Here, we present a rapid and accurate method for the validation of any ordered library where a gene has been replaced or disrupted by a uniform linear insertion (LI). We applied our method (LI-detector) to the well-known Keio library of Escherichia coli gene-deletion mutants. Our method identified 3,718 constructed mutants out of a total of 3,728 confirmed isolates, with a success rate of 99.7% for identifying the correct kanamycin cassette position. This data set provides a benchmark for the purity of the Keio mutants and a screening method for mapping the position of any linear insertion, such as an antibiotic resistance cassette in any ordered library. IMPORTANCE The construction of ordered gene replacement libraries requires significant investment of time and resources to create a valuable community resource. During construction, technical errors may result in a limited number of incorrect mutants being made. Such mutants may confound the output of subsequent experiments. Here, using the remarkable E. coli Keio knockout library, we describe a method to rapidly validate the construction of every mutant.Entities:
Keywords: E. coli; Keio; Tn-Seq; TraDIS; gene-knockout; knockout; library validation; mutant; sequencing; transposon insertion sequencing
Mesh:
Substances:
Year: 2022 PMID: 35856675 PMCID: PMC9431181 DOI: 10.1128/spectrum.00833-22
Source DB: PubMed Journal: Microbiol Spectr ISSN: 2165-0497
FIG 1Sequencing the Keio library. (A) Mutants of the Keio library are stocked in 96-well plates. Each 96-well plate was inoculated onto an LB agar plate and cultures incubated overnight. Colonies were pooled and genomic DNA was extracted from the pool of mutants. (B) The Keio library of mutants was constructed by individually targeting each gene for replacement by the kanamycin resistance cassette amplified from pKD13. The resulting mutant retained a start codon and its terminal 7 codons (inclusive of a stop codon) with the central coding sequence replaced by the resistance cassette. To sequence the Keio library, genomic DNA was extracted from the pooled mutants and fragmented. An adapter was ligated to the fragmented ends, and the cassette-gDNA junction was amplified using primers specific for the 3′ end of the kanamycin phosphotransferase gene and the adapter. (C) Output FASTQ data was mapped to a reference genome, and the first coordinate of each mapped read was counted as the kanamycin resistance cassette insertion site. FRT, Flp recognition target.
Number of reads surviving each stage of processing
| No. of barcoded reads | K tag 1 match | K tag 2 match | Reads > 20 bp and passing QC filter | Total mapped reads |
|---|---|---|---|---|
| 1,670,483 | 1,629,609 (97.5%) | 1,076,508 (66.06%) | 1,057,274 (98.21%) | 881,943 (83.42%) |
FIG 2Identification of the kanamycin resistance cassette insertion sites. (A) Schematic of the correctly located insertion site of a Keio mutant. (B) Examples of genes encoded across >1 reading frame, which were excluded from our analysis. (C) Decision chart for classifying mutants. Caveats: 1Genes encoded across >1 reading frame were discounted from our analysis (73). 2The gene could be essential (296); the gene could be a recent annotation and was not targeted for deletion (151); an insertion was detected but had ≤3 reads and was filtered from our analysis (7); or no insertion was detected for an expected mutant (54). 3Reference genome annotation is incomplete, but the mutant appears correct (23). (D) The presence and status of the corresponding Keio mutant for every annotated gene in Escherichia coli BW25113. The kanamycin resistance cassette was correctly positioned within the majority of genes (3,718 genes, gray). For a subset of genes, there were discrepancies with the identified location of the cassette and the gene annotation (148 genes, magenta). There are 296 annotated genes in the BW25113 reference genome reported to be essential during construction of the Keio library (teal) that had no insertion and 151 genes annotated in the BW25113 reference genome that were not targeted for deletion during construction of the Keio library (purple).
FIG 3Comparison of expected and constructed mutants. (A) Primers for construction of a Keio mutant; mapping of the reverse primer to the reference genome reveals the expected insertion site (red). (B) Comparison of expected insertion sites (red) with identified insertion sites (blue). (i) A correctly disrupted gene; (ii) a gene reported to be deleted but with no identified insertion; (iii) a gene reported as essential as the mutant could not be constructed; (iv) a gene reported as essential but with a correctly positioned resistance cassette.
FIG 4Identification of the mapped positions of primers used to construct specific Keio mutants (A–D). Construction of the flu, mltC, csiE, and rzoR and rzpR gene deletion mutants using the primers reported in the original Keio library paper. Incorrectly positioned primers are shown as arrows in magenta, mapped primer sites are shown as red spikes above the gene track, identified insertion sites are shown in blue below. (A) Recombination of the FRT sites within the flu mutant will result in an artifact peptide composed of the flu start codon, the 34-residue scar left by the cassette, and residues of the in-frame reading frame (magenta). (B) (i) Primers to construct the mltC mutant map to the incorrect location (magenta); construction of this mutant would maintain the C-terminal portion of the mltC CDS. (ii) Two cassette-gDNA junctions were identified within the mltC gene (blue). (C) (i) Primers to construct the csiE mutant map to the incorrect location (magenta), construction of this mutant would delete csiE and the 3′ end of the neighboring gene hcaT. (ii) A cassette-gDNA junctions corresponding with the mislocalized primer position (*) was identified within the hcaT gene (blue), in addition to a correctly position cassette within the csiE gene (blue). (D) (i) Primers to construct the rzoR and rzpR mutants map to the same, incorrect location (magenta). (ii) Sequence data for these loci reveal 2 mapped primer positions (red), and 3 identified cassette insertion sites (blue).