| Literature DB >> 31752678 |
Siddharth Subramaniyam1, Michael A DeJesus2, Anisha Zaveri3, Clare M Smith4, Richard E Baker4, Sabine Ehrt3, Dirk Schnappinger3, Christopher M Sassetti4, Thomas R Ioerger5.
Abstract
BACKGROUND: Deep sequencing of transposon mutant libraries (or TnSeq) is a powerful method for probing essentiality of genomic loci under different environmental conditions. Various analytical methods have been described for identifying conditionally essential genes whose tolerance for insertions varies between two conditions. However, for large-scale experiments involving many conditions, a method is needed for identifying genes that exhibit significant variability in insertions across multiple conditions.Entities:
Keywords: Essentiality; Mycobacterium tuberculosis; TnSeq; Transposon insertion library; Zero-inflated negative binomial distribution
Mesh:
Substances:
Year: 2019 PMID: 31752678 PMCID: PMC6873424 DOI: 10.1186/s12859-019-3156-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Illustration of the counts vector and conditions matrix for 4 datasets, consisting of 2 conditions, each with 2 replicates. The insertion counts at the N TA sites in gene g for all 4 replicates are concatentated into a column vector . The matrix encodes the condition represented by each observation. Other covariates could be appended as columns in
Statistics of TnSeq datasets
| Dataset | Condition | Mapped reads | Densitya | NZmeanb |
|---|---|---|---|---|
| A1 | in vitro | 989413 | 0.55 | 23.9 |
| A2 | in vitro | 1376613 | 0.58 | 31.9 |
| A3 | in vitro | 1531598 | 0.58 | 35.3 |
| A4 | in vitro | 547902 | 0.47 | 15.5 |
| A5 | in vitro | 1450383 | 0.57 | 33.9 |
| A6 | in vitro | 475126 | 0.55 | 11.6 |
| B1 | in vivo | 1500646 | 0.47 | 42.5 |
| B2 | in vivo | 601683 | 0.45 | 17.7 |
| B3 | in vivo | 1245065 | 0.51 | 32.5 |
| B4 | in vivo | 1472365 | 0.49 | 39.9 |
| B5 | in vivo | 909394 | 0.42 | 29.3 |
| B6 | in vivo | 409018 | 0.34 | 16.2 |
aFraction of TA sites with insertions
bMean count at TA sites with insertions before normalization
Fig. 2Venn diagram of conditional essentials (qval<0.05) for three different methods: resampling, ANOVA, and ZINB
Fig. 3Reduction in mean insertion counts in-vivo (mice) for genes in the Mce1 locus. Genes that are detected as significant (q-value <0.05) by ZINB regression are marked with ‘*’. Genes with marginal q-values of 0.05-0.11 are marked with ’+’
Fig. 4Statistics for three genes detected to vary significantly in mice compared to in-vitro based on ZINB regression, but not by resampling. The upper panels are the Non-Zero Mean (among insertion counts at TA sites with counts >0), and the lower panels show the Saturation (percent of TA sites with counts >0). Each box represents a distribution over 6 replicates
Comparison of ZINB regression with and without saturation adjustment, for artificially-depleted samples
| Scenario | Datasets compared | Resampling | ZINB (SA +) | ZINB (SA −) |
|---|---|---|---|---|
| 1 | (A1,A2) vs (B1,B2) | 64 | 73 | 162 |
| 2 | (A1,A2) vs (B1[50%],B2[50%]) | 108 | 121 | |
| 3 | (A1[50%],A2[50%]) vs (B1,B2) | 17 | 112 | |
| 4 | (A1[50%],A2[50%]) vs (B1[50%],B2[50%]) | 8 | 30 | 37 |
TnSeq datasets in different antibiotic treatments
| Drug | Concentration ( | MIC50 ( | Number of replicates | Number of essentials vs. untreateda |
|---|---|---|---|---|
| Untreated | 3 | |||
| Isoniazid (INH) | 0.027 | 0.04 | 2 | 50 |
| Rifampicin (RIF) | 0.004 | 0.02 | 3 | 68 |
| ethambutol (EMB) | 0.5 | 0.6 | 3 | 58 |
| Meropenem (MER) | 1.2 | 30 | 3 | 106 |
| Vancomycin (VAN) | 16 | 11 | 3 | 93 |
MICs for H37Rv were obtained from [43]
aNumber of conditional essentials by comparison to the untreated condition using resampling
Identification of genes with significant variability across six conditions in antibiotic treatment data
| Method | Number of varying genes |
|---|---|
| Union of hits from 15 pairwise resampling tests | 276 |
| Genes significant in any pairwise resampling after pooled adjustment | 267 |
| ANOVA | 234 |
| ZINB | 307 |
Fig. 5Venn diagram of genes with significant variability in different antibioitic treatments of transposon insertion counts evaluated by three different methods
Fig. 6Significantly varying genes in cultures exposed to antibiotics. a Mean insertion counts in CinA. b Saturation in SigE (percent of TA sites with one or more insertions)
ZINB
is sigE (Fig. 6b). While the mean insertion counts do not vary much for this gene (ranging between 17 and 27), the saturation level varies significantly among drug exposures, from nearly fully saturated in the control and INH conditions (88–97%), to highly depleted of insertions for RIF, MER and EMB (29–52 %). This reduction suggests that sigE is required for tolerance of certain drugs. Indeed, this recapitulates the growth defects observed in a ΔsigE mutant when exposed to various drugs [47]. sigE is an alternative sigma factor that is thought to play a regulatory role in response to various stresses. This effect was only observable with a model that treats variations in saturation separately from magnitiudes of insertions.