| Literature DB >> 28974620 |
Guanhua Yang1, Gabriel Billings2,3, Troy P Hubbard2,3, Joseph S Park2,3, Ka Yin Leung1,4, Qin Liu1,5, Brigid M Davis2,3, Yuanxing Zhang1, Qiyao Wang6,5, Matthew K Waldor1,2,3.
Abstract
Transposon insertion sequencing (TIS) is a powerful high-throughput genetic technique that is transforming functional genomics in prokaryotes, because it enables genome-wide mapping of the determinants of fitness. However, current approaches for analyzing TIS data assume that selective pressures are constant over time and thus do not yield information regarding changes in the genetic requirements for growth in dynamic environments (e.g., during infection). Here, we describe structured analysis of TIS data collected as a time series, termed pattern analysis of conditional essentiality (PACE). From a temporal series of TIS data, PACE derives a quantitative assessment of each mutant's fitness over the course of an experiment and identifies mutants with related fitness profiles. In so doing, PACE circumvents major limitations of existing methodologies, specifically the need for artificial effect size thresholds and enumeration of bacterial population expansion. We used PACE to analyze TIS samples of Edwardsiella piscicida (a fish pathogen) collected over a 2-week infection period from a natural host (the flatfish turbot). PACE uncovered more genes that affect E. piscicida's fitness in vivo than were detected using a cutoff at a terminal sampling point, and it identified subpopulations of mutants with distinct fitness profiles, one of which informed the design of new live vaccine candidates. Overall, PACE enables efficient mining of time series TIS data and enhances the power and sensitivity of TIS-based analyses.IMPORTANCE Transposon insertion sequencing (TIS) enables genome-wide mapping of the genetic determinants of fitness, typically based on observations at a single sampling point. Here, we move beyond analysis of endpoint TIS data to create a framework for analysis of time series TIS data, termed pattern analysis of conditional essentiality (PACE). We applied PACE to identify genes that contribute to colonization of a natural host by the fish pathogen Edwardsiella piscicida. PACE uncovered more genes that affect E. piscicida's fitness in vivo than were detected using a terminal sampling point, and its clustering of mutants with related fitness profiles informed design of new live vaccine candidates. PACE yields insights into patterns of fitness dynamics and circumvents major limitations of existing methodologies. Finally, the PACE method should be applicable to additional "omic" time series data, including screens based on clustered regularly interspaced short palindromic repeats with Cas9 (CRISPR/Cas9).Entities:
Keywords: Edwardsiella piscicida; fitness dynamics and profiles; live attenuated vaccine; pattern analysis of conditional essentiality (PACE); transposon insertion sequencing
Mesh:
Substances:
Year: 2017 PMID: 28974620 PMCID: PMC5626973 DOI: 10.1128/mBio.01581-17
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1 Histogram showing the percentage of TA sites disrupted per gene in the E. piscicida transposon insertion library. Genes containing more than 10 TA sites were further classified using the EL-ARTIST analysis pipeline as either essential, regionally essential (regional), or neutral.
FIG 2 TIS time series results from turbot infection studies and validation of selected in vivo-attenuated mutants. Bacteria were recovered from fish livers at the indicated time points and analyzed via high-throughput sequencing of transposon insertion sites or barcode tags. (A) Data from one of three replicate transposon insertion libraries are depicted for each time point. P values produced for each locus, using a Mann-Whitney U test, were plotted against mean log2 fold change (FC [i.e., the log2-transformed value of the ratio of normalized output versus input reads]). Red dashed lines represent the thresholds for CE loci (P < 0.05; log2 FC, <−2). In the lower left corner, the number of genes meeting both criteria in two of three library replicates is indicated. (B) The barcoded WT, wt_ΔP, and in-frame deletion mutant strains were recovered from fish inoculated with a pool of WT and mutant strains, and competitive indices (CIs) were calculated based on the ratios of individual mutant to WT tags in output versus input. The y axis shows the log2-transformed ratio of either CI (blue and green dots) or TIS (red dots) FC values. The gray dashed line shows y = −2. *, P ≤ 0.01 for CI results, based on one-way ANOVA followed by Dunnett’s test for multiple comparisons. Mutants are ranked based on FC in TIS data at 14 dpi.
FIG 3 Analysis of time series TIS data via PACE. (A) Schematic representation of curve fitting within PACE. Relative abundance curves for each gene are fit to a series of models of increasing polynomial degree, and models are selected based on an F test of nested models, balancing goodness of fit and overfitting. (B) PACE results for E. piscicida isolated from livers of infected fish, showing the distribution of genes among the different polynomial models. Note that “nondecreasing” here includes genes with negative slope, but for which a 95% confidence interval of the slope included zero; these are excluded from the in vivo decreasing (IVD) group. (C) A Venn diagram compares genes classified as conditionally essential (CE) based on endpoint analysis (P < 0.05; log2 FC, <−2) and genes assigned to the IVD category based on assignment by PACE of a relative fitness value (b) of <0. (D) Venn diagram comparing IVD genes in the liver, spleen, and kidney. (E) Functional classification of the 68 genes classified as IVD in all 3 organs.
FIG 4 PACE enables identification of gene clusters exhibiting similar in vivo dynamics. (A) Dendrogram of IVD loci hierarchically clustered according to their fitting coefficients and colored according to cluster, with all singleton clusters colored black. The clustering cutoff was selected to identify clusters enriched for T3SS/T6SS genes. (B) Distribution of constant (x axis) and linear (y axis) coefficients of IVD genes, colored according to the clustering in panel A. (C) Relative abundance time series curves for each cluster, showing data for each gene (mean over n = 3 biological replicates) along with the cluster mean (magenta). T3SS and T6SS genes are highlighted in blue and yellow, respectively. (D) Relative abundance time series for each non-singleton cluster in panel C; at each time point, the mean of all genes in each cluster is plotted.
FIG 5 Efficacy of new live attenuated vaccine strains identified based on PACE. (A) Bacterial loads recovered from kidneys of fish inoculated i.p. with the LAV candidates at a dose of 3 × 105 CFU/fish. Each time point reflects the mean and standard error of the mean (SEM) from 5 fish. The dotted line indicates the limit of detection (LOD [200 CFU/g]). (B) Bactericidal capacities of sera from turbot inoculated with the indicated LAV candidates or controls. WT E. piscicida cells were incubated at 30°C for 8 h in serum isolated from vaccinated fish 28 dpi. Data points reflect the log10 fold change in CFU relative to input for each serum sample (n = 5). Bars show geometric means; the open circle reflects the limit of detection. ***, P < 0.001 based on one-way ANOVA and Fisher’s least significant difference (LSD) multiple comparison posttest. (C) Serum antibodies (IgM) against E. piscicida at 28 dpi were assayed by ELISA. Data reflect the mean absorbance and SEM (n = 5 for each condition). **, P < 0.01, based on one-way ANOVA and Fisher’s LSD multiple comparison posttest. (D) Bacterial load in kidneys of vaccinated fish after challenge with WT (Cmr) E. piscicida. The mean and SEM CFU per gram of tissue are shown (n = 5 fish per time point). The dotted line indicates the LOD (200 CFU/g). (E) Survival of vaccinated turbot after challenge with the WT. Fish (n = 90 per condition) were challenged 30 days after vaccination and monitored for 28 additional days. ***, P < 0.001 comparing LAV vaccine strains with the PBS control using Kaplan-Meier survival analysis with a log rank test (Mantel-Cox). (F) Relative protection index (RPS ± SEM) of each vaccine candidate, based on mortality at 28 dpi for n = 3 groups of 30 challenged fish.