Literature DB >> 26794317

The TraDIS toolkit: sequencing and analysis for dense transposon mutant libraries.

Lars Barquist¹, Matthew Mayho², Carla Cummins², Amy K Cain², Christine J Boinett², Andrew J Page², Gemma C Langridge², Michael A Quail², Jacqueline A Keane², Julian Parkhill².

Abstract

UNLABELLED: Transposon insertion sequencing is a high-throughput technique for assaying large libraries of otherwise isogenic transposon mutants providing insight into gene essentiality, gene function and genetic interactions. We previously developed the Transposon Directed Insertion Sequencing (TraDIS) protocol for this purpose, which utilizes shearing of genomic DNA followed by specific PCR amplification of transposon-containing fragments and Illumina sequencing. Here we describe an optimized high-yield library preparation and sequencing protocol for TraDIS experiments and a novel software pipeline for analysis of the resulting data. The Bio-Tradis analysis pipeline is implemented as an extensible Perl library which can either be used as is, or as a basis for the development of more advanced analysis tools. This article can serve as a general reference for the application of the TraDIS methodology.
AVAILABILITY AND IMPLEMENTATION: The optimized sequencing protocol is included as supplementary information. The Bio-Tradis analysis pipeline is available under a GPL license at https://github.com/sanger-pathogens/Bio-Tradis CONTACT: parkhill@sanger.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Entities: Chemical

Mesh：

Substances：

Year: 2016 PMID： 26794317 PMCID： PMC4896371 DOI： 10.1093/bioinformatics/btw022

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

1 Introduction

Steady improvements in high-throughput sequencing technologies have resulted in an increasing number of sequenced bacterial genomes, revealing extensive genetic diversity both within and between species. Associated sequencing-based technologies, such as RNA-seq, ChIP-seq and RIP-seq provide insight into the effects of this variation on gene expression and regulation; however, none provides direct information on cell survival, and hence how this genetic variation may impact the fitness of the bacterium (Gray ). Transposon insertion sequencing (TIS) bridges this gap between sequence and fitness by allowing for direct measurement of survival dynamics within a population of single transposon mutants, by using sequencing reads flanking transposon insertions as a read-out of mutant frequency within the population (Barquist ; Van Opijnen and Camilli, 2013). We previously developed a method for this purpose, called Transposon Directed Insertion Sequencing (TraDIS; (Langridge ). TraDIS uses fragmentation of genomic DNA followed by specific PCR amplification of transposon-containing fragments to selectively enrich for transposon-flanking sequences, and can be adapted for any transposon of interest through a simple redesign of sequencing primers. TraDIS has since been applied to a variety of target organisms and transposons in a wide variety of both in vivo and in vitro growth conditions. These include Tn5-based libraries in Salmonella (Barquist ; Chaudhuri et al., 2013; Langridge ) and Escherichia (Dziva et al., 2013; Eckert ) and Mariner-based libraries in Clostridia (Dembek ) and Mycobacteria (Weerdenburg ).

2 Library preparation and sequencing

We have made a number of refinements to the TraDIS sequencing protocol since its initial publication (Langridge ), described in more detail in the supplement. We have redesigned TraDIS adapters and primers using a splinkerette approach (Devon ; Rad ; Uren ), which increases enrichment of genuine transposon-chromosome junctions by preventing hybridization of the reverse primer until the transposon-specific forward primer has generated a complementary strand. We have substituted a magnetic bead-based fragment size selection for gel-based size selection to increase yield and allow for easier automation (Bronner ). Finally, we have substituted Kapa Hifi DNA polymerase for Taq polymerase, as this enzyme has been shown to have minimal amplification biases (Quail ), and reduced the number of cycles of PCR amplification to provide a more accurate representation of input. TraDIS sequencing primers are designed to begin sequencing within the transposon sequence, so as to provide a short 8–10 base ‘transposon tag’ at the beginning of each read to verify that each read originates from a genuine transposon-chromosome junction. This poses a challenge for Illumina sequencing machines, as the base-calling algorithms assume a complex sample for the purposes of calibration. We have developed HiSeq and MiSeq recipes that use ‘dark cycles’ during which chemistry is run but no imaging is performed to read through this transposon tag, before imaged sequencing commences on the complex chromosomal DNA (see supplement). Once the first read is completed, the DNA is denatured and the transposon-specific sequencing primer is re-annealed for a separate short 10–12 cycle transposon read. This requires a PhiX (or other complex library) spike-in of 5–10% to prevent sequencing failure due to a lack of fluorescence in some channels. Using this protocol we routinely achieve results of > 90% of sequencing reads both containing an intact transposon tag and mapping uniquely to the source genome. We have applied this method to Tn5-, Tn917-, Himar1- and Mu-based mutant libraries, and it should be adaptable to any transposon of interest assuming a suitable priming site exists (see supplement for design parameter details).

3 The Bio-Tradis analysis pipeline

To support the use of this improved TraDIS protocol, we have developed a portable processing and analysis pipeline implemented in the Perl and R languages. The functionality provided is similar to that in other recently published TIS analysis pipelines (DeJesus ; Solaimanpour ), however our command-line driven approach has been designed with a production environment in mind, where many sequencing libraries may be processed simultaneously. We provide tools for each step of analysis from the raw unaligned fastq files produced by the sequencer, through to predictions of gene essentiality and fitness effects. The main pipeline script, bacteria_tradis, filters reads in fastq format for transposon tags, removes these tags, then maps the modified reads using the SMALT short read mapper (https://www.sanger.ac.uk/resources/software/smalt/), with support for multiple contigs and/or replicons, such as plasmids. Default k-mer, step size and percent identity parameters are set depending on input read length, though these can be manually specified by the user. The mapped bam file is then processed to produce plot files, containing insertion counts per nucleotide, suitable for visualization in the Artemis genome browser (Carver ) and for further analysis. The mapping, processing, and data manipulation steps are implemented as self-contained Perl modules that could be easily used as a foundation for the development of more sophisticated analyses. Additional scripts are provided to process these plot files in conjunction with genome annotations in EMBL-Bank format to produce annotated tab-delimited files containing various statistics including read counts and unique insertion sites per gene. Two basic analysis scripts for this gene-level data written in R are available. One, tradis_essentiality.R, produces predictions of gene essentiality within a high-density transposon library based on the empirically observed bimodal distribution of insertion sites over genes when normalized for gene length (Barquist ; Langridge ). The second, tradis_comparisons.R, applies the edgeR package (Robinson ) to identify significant differences in read counts, and hence mutant frequencies, between experimental conditions (Dembek ) providing insight into the relative contribution of all mutagenized genes to fitness under the assayed condition.

4 Summary

We have described recent refinements to the TraDIS method for the sequencing and analysis of dense transposon libraries. These include an optimized sequencing protocol, and processing and analysis tools that can rapidly provide insight into the contribution of genomic regions to organismal fitness. It is our hope that making these tools more accessible will accelerate their application to an ever wider variety of bacteria and experimental conditions.

19 in total

1. Retrospective application of transposon-directed insertion site sequencing to a library of signature-tagged mini-Tn5Km2 mutants of Escherichia coli O157:H7 screened in cattle.

Authors: Sabine E Eckert; Francis Dziva; Roy R Chaudhuri; Gemma C Langridge; Daniel J Turner; Derek J Pickard; Duncan J Maskell; Nicholas R Thomson; Mark P Stevens
Journal: J Bacteriol Date: 2011-01-28 Impact factor: 3.490

2. Improved Protocols for Illumina Sequencing.

Authors: Iraad F Bronner; Michael A Quail; Daniel J Turner; Harold Swerdlow
Journal: Curr Protoc Hum Genet Date: 2014-01-21

3. Splinkerettes--improved vectorettes for greater efficiency in PCR walking.

Authors: R S Devon; D J Porteous; A J Brookes
Journal: Nucleic Acids Res Date: 1995-05-11 Impact factor: 16.971

4. Sequencing and functional annotation of avian pathogenic Escherichia coli serogroup O78 strains reveal the evolution of E. coli lineages pathogenic for poultry via distinct mechanisms.

Authors: Francis Dziva; Heidi Hauser; Thomas R Connor; Pauline M van Diemen; Graham Prescott; Gemma C Langridge; Sabine Eckert; Roy R Chaudhuri; Christa Ewers; Melha Mellata; Suman Mukhopadhyay; Roy Curtiss; Gordon Dougan; Lothar H Wieler; Nicholas R Thomson; Derek J Pickard; Mark P Stevens
Journal: Infect Immun Date: 2012-12-28 Impact factor: 3.441

5. Genome-wide transposon mutagenesis indicates that Mycobacterium marinum customizes its virulence mechanisms for survival and replication in different hosts.

Authors: Eveline M Weerdenburg; Abdallah M Abdallah; Farania Rangkuti; Moataz Abd El Ghany; Thomas D Otto; Sabir A Adroub; Douwe Molenaar; Roy Ummels; Kars Ter Veen; Gunny van Stempvoort; Astrid M van der Sar; Shahjahan Ali; Gemma C Langridge; Nicholas R Thomson; Arnab Pain; Wilbert Bitter
Journal: Infect Immun Date: 2015-02-17 Impact factor: 3.441

Review 6. Transposon insertion sequencing: a new tool for systems-level analysis of microorganisms.

Authors: Tim van Opijnen; Andrew Camilli
Journal: Nat Rev Microbiol Date: 2013-05-28 Impact factor: 60.633

7. High-throughput analysis of gene essentiality and sporulation in Clostridium difficile.

Authors: Marcin Dembek; Lars Barquist; Christine J Boinett; Amy K Cain; Matthew Mayho; Trevor D Lawley; Neil F Fairweather; Robert P Fagan
Journal: MBio Date: 2015-02-24 Impact factor: 7.867

8. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors: Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal: Bioinformatics Date: 2009-11-11 Impact factor: 6.937

9. A comparison of dense transposon insertion libraries in the Salmonella serovars Typhi and Typhimurium.

Authors: Lars Barquist; Gemma C Langridge; Daniel J Turner; Minh-Duy Phan; A Keith Turner; Alex Bateman; Julian Parkhill; John Wain; Paul P Gardner
Journal: Nucleic Acids Res Date: 2013-03-06 Impact factor: 16.971

10. Comprehensive assignment of roles for Salmonella typhimurium genes in intestinal colonization of food-producing animals.

Authors: Roy R Chaudhuri; Eirwen Morgan; Sarah E Peters; Stephen J Pleasance; Debra L Hudson; Holly M Davies; Jinhong Wang; Pauline M van Diemen; Anthony M Buckley; Alison J Bowen; Gillian D Pullinger; Daniel J Turner; Gemma C Langridge; A Keith Turner; Julian Parkhill; Ian G Charles; Duncan J Maskell; Mark P Stevens
Journal: PLoS Genet Date: 2013-04-18 Impact factor: 5.917

78 in total

1. Rapid, Parallel Identification of Catabolism Pathways of Lignin-Derived Aromatic Compounds in Novosphingobium aromaticivorans.

Authors: Jacob H Cecil; David C Garcia; Richard J Giannone; Joshua K Michener
Journal: Appl Environ Microbiol Date: 2018-10-30 Impact factor: 4.792

2. The Rcs stress response inversely controls surface and CRISPR-Cas adaptive immunity to discriminate plasmids and phages.

Authors: Leah M Smith; Simon A Jackson; Lucia M Malone; James E Ussher; Paul P Gardner; Peter C Fineran
Journal: Nat Microbiol Date: 2021-01-04 Impact factor: 17.745

Review 3. Fruit crops in the era of genome editing: closing the regulatory gap.

Authors: Derry Alvarez; Pedro Cerda-Bennasser; Evan Stowe; Fabiola Ramirez-Torres; Teresa Capell; Amit Dhingra; Paul Christou
Journal: Plant Cell Rep Date: 2021-01-30 Impact factor: 4.570

4. Physical enrichment of transposon mutants from saturation mutant libraries using the TraDISort approach.

Authors: Ian T Paulsen; Amy K Cain; Karl A Hassan
Journal: Mob Genet Elements Date: 2017-03-31

5. Genome-wide Analysis of Salmonella enterica serovar Typhi in Humanized Mice Reveals Key Virulence Features.

Authors: Joyce E Karlinsey; Taylor A Stepien; Matthew Mayho; Larissa A Singletary; Lacey K Bingham-Ramos; Michael A Brehm; Dale L Greiner; Leonard D Shultz; Larry A Gallagher; Matt Bawn; Robert A Kingsley; Stephen J Libby; Ferric C Fang
Journal: Cell Host Microbe Date: 2019-08-22 Impact factor: 21.023

Review 6. Molecular phenotyping of infection-associated small non-coding RNAs.

Authors: Lars Barquist; Alexander J Westermann; Jörg Vogel
Journal: Philos Trans R Soc Lond B Biol Sci Date: 2016-11-05 Impact factor: 6.237

Review 7. SorTn-seq: a high-throughput functional genomics approach to discovering regulators of bacterial gene expression.

Authors: Leah M Smith; Simon A Jackson; Paul P Gardner; Peter C Fineran
Journal: Nat Protoc Date: 2021-08-04 Impact factor: 13.491

8. Streptococcus pyogenes genes that promote pharyngitis in primates.

Authors: Luchang Zhu; Randall J Olsen; Stephen B Beres; Matthew Ojeda Saavedra; Samantha L Kubiak; Concepcion C Cantu; Leslie Jenkins; Andrew S Waller; Zhizeng Sun; Timothy Palzkill; Adeline R Porter; Frank R DeLeo; James M Musser
Journal: JCI Insight Date: 2020-06-04

9. Transposase-Mediated Excision, Conjugative Transfer, and Diversity of ICE6013 Elements in Staphylococcus aureus.

Authors: Emily A Sansevere; Xiao Luo; Joo Youn Park; Sunghyun Yoon; Keun Seok Seo; D Ashley Robinson
Journal: J Bacteriol Date: 2017-03-28 Impact factor: 3.490

10. Genome-Wide Assessment of Streptococcus agalactiae Genes Required for Survival in Human Whole Blood and Plasma.

Authors: Luchang Zhu; Prasanti Yerramilli; Layne Pruitt; Matthew Ojeda Saavedra; Concepcion C Cantu; Randall J Olsen; Stephen B Beres; Andrew S Waller; James M Musser
Journal: Infect Immun Date: 2020-09-18 Impact factor: 3.441