Literature DB >> 31266871

The Symmetrical Wave Pattern of Base-Pair Substitution Rates across the Escherichia coli Chromosome Has Multiple Causes.

Brittany A Niccum¹, Heewook Lee², Wazim MohammedIsmail², Haixu Tang², Patricia L Foster³.

Abstract

Mutation accumulation experiments followed by whole-genome sequencing have revealed that, for several bacterial species, the rate of base-pair substitutions (BPSs) is not constant across the chromosome but varies in a wave-like pattern that is symmetrical about the origin of replication. The experiments reported here demonstrated that, in Escherichia coli, several interacting factors determine the wave. The origin is a major driver of BPS rates. When it is relocated, the BPS rates in a 1,000-kb region surrounding the new origin reproduce the pattern that surrounds the normal origin. However, the pattern across distant regions of the chromosome is unaltered and thus must be determined by other factors. Increasing the deoxynucleoside triphosphate (dNTP) concentration shifts the wave pattern away from the origin, supporting the hypothesis that fluctuations in dNTP pools coincident with replication firing contribute to the variations in the mutation rate. The nucleoid binding proteins (HU and Fis) and the terminus organizing protein (MatP) are also major factors. These proteins alter the three-dimensional structure of the DNA, and results suggest that mutation rates increase when highly structured DNA is replicated. Biases in error correction by proofreading and mismatch repair, both of which may be responsive to dNTP concentrations and DNA structure, also are major determinants of the wave pattern. These factors should apply to most bacterial and, possibly, eukaryotic genomes and suggest that different areas of the genome evolve at different rates.IMPORTANCE It has been found in several species of bacteria that the rate at which single base pairs are mutated is not constant across the genome but varies in a wave-like pattern that is symmetrical about the origin of replication. Using Escherichia coli as our model system, we show that this pattern is the result of several interconnected factors. First, the timing and progression of replication are important in determining the wave pattern. Second, the three-dimensional structure of the DNA is also a factor, and the results suggest that mutation rates increase when highly structured DNA is replicated. Finally, biases in error correction, which may be responsive both to the progression of DNA synthesis and to DNA structure, are major determinants of the wave pattern. These factors should apply to most bacterial and, possibly, eukaryotic genomes and suggest that different areas of the genome evolve at different rates.

Entities: Chemical Disease Gene Species

Keywords: Escherichia coli; mismatch repair; mutation accumulation; mutation rate; nucleoid-associated proteins; nucleoside triphosphate pools; proofreading; replication timing; whole-genome sequencing

Mesh：

Substances：

Year: 2019 PMID： 31266871 PMCID： PMC6606806 DOI： 10.1128/mBio.01226-19

Source DB: PubMed Journal: mBio Impact factor: 7.867

INTRODUCTION

Recent studies of mutations accumulated nonselectively across bacterial chromosomes revealed that rates of base-pair substitutions (BPSs) vary 2-fold to 4-fold in a wave-like pattern that is mirrored in the two independently replicating halves of the chromosome. These symmetrical patterns have been observed in mismatch repair (MMR)-defective strains of Escherichia coli (1), Vibrio fischeri, V. cholerae (2–4), Pseudomonas fluorescens (5), and P. aeruginosa (6). Such variations in mutation rates may affect the pace at which genes in different regions of the chromosome evolve and may exert selective pressure on gene placement. Yet the causes of this variation are not known. The fidelity of DNA replication, which, in E. coli, is about 1 mistake in 1,000 generations (7), is determined by the intrinsic accuracy of the DNA polymerase plus error correction by proofreading and MMR (8, 9). In E. coli, proofreading is performed by epsilon, a subunit of the DNA polymerase III holoenzyme. If the polymerase inserts an incorrect base, the 5′ to 3′ exonuclease activity of epsilon degrades a few bases of the new strand and polymerase then resynthesizes it, improving the accuracy of DNA synthesis about 4,000-fold (10). MMR is initiated by MutS and MutL, which recognize a mismatch and find a nearby GATC site. The As in GATC sites in E. coli are methylated by the Dam methylase; because methylation lags behind replication, unmethylated As identify the “new” and, presumably, error-containing DNA strand. MutH is recruited by MutSL and activated to nick the unmethylated strand, which is then degraded past the mismatch. Pol III then resynthesizes the strand. MMR improves the accuracy of DNA replication 100-fold to 200-fold (11). In our previous study of the BPS density pattern in MMR-defective E. coli (1), we correlated BPS rates to the chromosomal sites that are affected by two nucleoid-associated proteins (NAPs), HU and Fis. We suggested that when the replication fork encounters regions of the chromosome with high superhelical density due to the binding of these NAPs, the mutation rate increases. An alternative explanation ties mutation rates to replication timing (2, 3). An intriguing hypothesis is that mutation rates vary in concert with fluctuations in deoxynucleoside triphosphate (dNTP) pools when the replication origin fires repeatedly during rapid growth (2). In the work whose results are presented here, we investigated further the causes of the wave-like pattern of BPS rates. MMR-defective strains of E. coli additionally defective for other activities were used in mutation accumulation (MA) experiments followed by whole-genome sequencing (WGS). We also investigated the effects of different growth conditions on BPS rates. We conclude that the BPS density pattern does not have a single cause but is the result of the activity of several factors affecting DNA replication, repair, and chromosome structure. In addition, we report that a MMR-defective Bacillus subtilis strain also has a wave-like BPS rate pattern that is symmetrical about the origin of replication.

RESULTS

The base-pair substitution (BPS) pattern in mismatch repair-defective strains.

We demonstrated in a previous paper (1) that the rates at which BPSs accumulated across the chromosome during an MA experiment with a MMR-defective strain fell into a wave-like pattern symmetrical about OriC, the origin of replication. We have since performed nine additional MA experiments with MMR mutant strains, yielding a total of 30,061 BPSs (11). As shown in Fig. 1A, the wave-like pattern of BPS rates was reproducible among the 10 experiments. Also, the pattern does not exactly match that corresponding to the noninteracting chromosomal macrodomains (MDs) as defined by Valens et al. (12). In particular, the wave pattern of BPSs is symmetrical about the origin whereas the Ori MD is not. In the analyses described below, the combined data from the 10 experiments performed with MMR-defective strains are used as the standard to which the results obtained in other genetic backgrounds are compared.

FIG 1

The BPS rate pattern of MMR-defective strains. The BPS data from 10 experiments performed with MMR-defective strains were collected into 100-kb bins starting at the origin of replication shown on the left (bin 1) and continuing clockwise around the chromosome back to the origin shown on the right (bin 46). (A) Bars represent the mean BPS rate in each bin; error bars represent 95% CL of the mean. The gray line represents the Daubechies wavelet transform of the binned data. The chromosomal macrodomains (MDs) are indicated at the top of the plot as follows: green, Ori MD; blue, right and left MD; red, Ter MD. (B) Plot of the wavelet coherence between the binned BPS data from the MMR-defective strains taken in the clockwise and counterclockwise directions around the chromosome. The colors indicate the magnitude-squared coherence, a measure of the correlation between the data sets, according to the scale on the right. The dotted line gives the “cone of influence” within which the results are free of artifactual edge effects. Arrows indicate the phase lag between the two data sets; arrows pointing right indicate in phase, arrows pointing left indicate 180° out of phase, and arrows pointing in other directions indicate the various degrees between. Following the analysis of Dillon et al. (2), we computed the wavelet coherence of the MMR– data taken in the clockwise and counterclockwise directions around the chromosome. Panel B of Fig. 1 shows that, except for some asymmetry at the midpoint, the wave is symmetrical across the chromosome. The coherence is greatest between 800 and 1,600 kb per cycle (i.e., 8 to 16 bins per cycle), which is similar to that found by Dillon et al. (2). We then computed the wavelet coherence of the collected MMR– data against each of the experimental results presented below. The spectrum of BPSs in the MMR-defective strains is dominated by A:T transitions at 5′NAC3′/3′NTG5′ sites (11). To test if the mutational density pattern was simply due to the distribution of these sites, we removed from the data all A:T transitions at 5′NAC3′/3′NTG5′ sites. As given in Tables 1 and 2, the remaining BPSs fell into the same wave pattern. Thus, the variation in BPS rates must be reflective of regions of the chromosome and not of the distribution of the hot spot triplets.

TABLE 1

Pearson's correlation ρ for each strain versus MMR− strains

Strain(s)	Description	Whole chromosome (bins 1 to 46)		Right replichore (bins 1 to 23)		Left replichore (bins 46 to 24)		Origin (bins 1 to 13 and 46 to 34)		Terminus (bins 14 to 33)
Strain(s)	Description	ρ	P^a	ρ	P^a	ρ	P^a	ρ	P^a	ρ	P^a
Collective^b	MMR⁻
Collective	MMR⁻ minus A:T→G:C at 5′NAC3′/3′NTG5′ sites	0.94	<0.001	0.93	<0.001	0.96	<0.001	0.97	<0.001	0.71	0.004
PFM421	ΔrnhA ΔmutL	0.74	<0.001	0.63	0.002	0.84	<0.001	0.81	<0.001	0.47	0.075
PFM669	ΔmutL (AB1157)	0.59	<0.001	0.40	0.064	0.75	<0.001	0.71	<0.001	0.45	0.079
PFM430/431	oriC⁺ oriZ⁺ ΔmutL (AB1157)	0.75	<0.001	0.71	<0.001	0.81	<0.001	0.83	<0.001	0.46	0.075
PFM426	ΔoriC oriZ⁺ ΔmutL (AB1157)	0.39	0.008	0.21	0.34	0.62	0.003	0.40	0.044	0.41	0.11
PFM533/534	ΔseqA ΔmutL	0.49	0.001	0.71	<0.001	0.27	0.23	0.53	0.006	0.38	0.14
PFM5^b	ΔmutL on minimal medium	0.50	0.001	0.40	0.069	0.61	0.003	0.57	0.003	0.58	0.029
PFM343^b	ΔmutS on minimal medium	0.18	0.22	0.18	0.41	0.18	0.41	0.21	0.31	0.22	0.38
PFM343^b	ΔmutS on diluted LB	0.84	<0.001	0.84	<0.001	0.85	<0.001	0.87	<0.001	0.71	0.004
PFM343^b	ΔmutS on supplemented minimal medium	0.81	<0.001	0.82	<0.001	0.81	<0.001	0.83	<0.001	0.65	0.008
PFM342^b	ΔmutS at low temperature	0.57	<0.001	0.60	0.003	0.53	0.013	0.69	<0.001	−0.02	0.95
PFM799	ΔnrdR ΔmutL	0.49	0.001	0.58	0.005	0.39	0.080	0.69	<0.001	−0.31	0.24
PFM677	Δrep ΔmutL	0.48	0.001	0.49	0.022	0.48	0.027	0.62	0.001	−0.12	0.67
PFM256	Δtus ΔmutL	0.82	<0.001	0.79	<0.001	0.85	<0.001	0.87	<0.001	0.70	0.004
PFM257	ΔmatP ΔmutL	0.40	0.007	0.60	0.004	0.21	0.34	0.70	<0.001	−0.47	0.075
PFM422	ΔrecA ΔmutL	0.71	<0.001	0.70	<0.001	0.73	<0.001	0.85	<0.001	0.06	0.84
PFM424	ΔrecA ΔmutS	0.80	<0.001	0.78	<0.001	0.83	<0.001	0.91	<0.001	0.44	0.082
PFM456	ΔrecB ΔmutL	0.82	<0.001	0.87	<0.001	0.79	<0.001	0.89	<0.001	0.39	0.12
PFM118	ΔumuDC ΔdinB ΔmutL	0.59	<0.001	0.62	0.003	0.55	0.010	0.61	0.001	0.69	0.004
PFM120	lexA3 ΔsulA ΔmutL	0.82	<0.001	0.82	<0.001	0.83	<0.001	0.83	<0.001	0.80	0.001
PFM259	ΔhupB ΔmutL	0.73	<0.001	0.70	<0.001	0.77	<0.001	0.81	<0.001	0.57	0.031
PFM258	ΔhupA ΔmutL	0.49	0.001	0.43	0.053	0.58	0.005	0.67	<0.001	−0.23	0.38
PFM317/318	Δfis ΔmutL	0.51	<0.001	0.61	0.003	0.42	0.060	0.67	<0.001	−0.30	0.25
PFM482	Δfis ΔmutS	0.49	0.001	0.31	0.15	0.62	0.003	0.65	<0.001	−0.28	0.27
PFM741	Δhns ΔmutL	0.81	<0.001	0.81	<0.001	0.81	<0.001	0.88	<0.001	0.46	0.075
PFM713	Δdps ΔmutL	0.80	<0.001	0.84	<0.001	0.78	<0.001	0.89	<0.001	0.47	0.075
PFM163^c	mutD5	0.76	<0.001	0.79	<0.001	0.74	<0.001	0.82	<0.001	0.78	0.001
PFM165/397/399^c	mutD5 ΔmutL	0.64	<0.001	0.76	<0.001	0.52	0.014	0.86	<0.001	−0.46	0.075
Collective^b	Wild type	0.36	0.014	0.41	0.061	0.31	0.17	0.37	0.065	0.51	0.060
Collective^d	Bacillus subtilis mutS::Tn10	0.52	<0.001	0.63	0.003	0.40	0.068	0.74	<0.001	−0.52	0.055

P values were adjusted for multiple comparisons using the Benjamini-Hochberg method (56) with the false-discovery rate set at 25%.

Data are from reference 11.

Data are from reference 10.

Data were combined from four MA experiments performed with DK2140 (NCIB3610 mutS::Tn10), DK2141 (NCIB3610 mutS::Tn10), DK2142 (NCIB3610 mutS::Tn10 ΔcomI), and DK2143 (NCIB3610 mutS::Tn10 ΔcomI). There were no differences in mutation rates or spectra among these strains.

TABLE 2

Pearson's correlation ρ of right versus left replichore for each strain

Strain	Description	Whole chromosome (bins 1 to 23 and 46 to 24)		Origin (bins 1 to 13 and 46 to 34)		Terminus (bins 14 to 23 and 33 to 24)
Strain	Description	ρ	P^a	ρ	P^a	ρ	P^a
Collective^b	MMR⁻	0.91	<0.001	0.95	<0.001	0.66	0.23
Collective	MMR⁻ minus A:T→G:C at 5′NAC3′/3′NTG5′ sites	0.89	<0.001	0.94	<0.001	0.42	0.60
PFM421	ΔrnhA ΔmutL	0.49	0.026	0.58	0.051	0.24	0.82
PFM669	ΔmutL (AB1157)	0.52	0.018	0.64	0.026	0.29	0.77
PFM430/431	oriC⁺ oriZ⁺ ΔmutL (AB1157)	0.60	0.005	0.80	0.003	0.19	0.84
PFM426	ΔoriC oriZ⁺ ΔmutL (AB1157)	0.28	0.22	0.33	0.31	−0.06	0.91
PFM533/534	ΔseqA ΔmutL	0.41	0.062	0.53	0.079	0.32	0.71
PFM5^b	ΔmutL on minimal medium	0.27	0.22	0.19	0.53	0.05	0.91
PFM343^b	ΔmutS on minimal medium	−0.02	0.93	0.32	0.31	−0.53	0.39
PFM343^b	ΔmutS on diluted LB	0.87	<0.001	0.93	<0.001	0.66	0.23
PFM343^b	ΔmutS on supplemented minimal medium	0.74	<0.001	0.79	0.003	0.54	0.39
PFM342^b	ΔmutS at low temperature	0.84	<0.001	0.89	<0.001	0.73	0.19
PFM799	ΔnrdR ΔmutL	0.79	<0.001	0.78	0.003	0.84	0.07
PFM677	Δrep ΔmutL	0.43	0.053	0.58	0.051	−0.16	0.84
PFM256	Δtus ΔmutL	0.70	0.001	0.88	<0.001	0.26	0.81
PFM257	ΔmatP ΔmutL	0.45	0.042	0.40	0.20	0.72	0.19
PFM422	ΔrecA ΔmutL	0.66	0.001	0.81	0.003	0.11	0.88
PFM424	ΔrecA ΔmutS	0.64	0.002	0.80	0.003	0.07	0.91
PFM456	ΔrecB ΔmutL	0.67	0.001	0.78	0.003	0.03	0.94
PFM118	ΔumuDC ΔdinB ΔmutL	0.39	0.075	0.68	0.018	−0.13	0.87
PFM120	lexA3 ΔsulA ΔmutL	0.51	0.019	0.57	0.055	0.42	0.60
PFM259	ΔhupB ΔmutL	0.64	0.002	0.78	0.003	0.35	0.65
PFM258	ΔhupA ΔmutL	0.54	0.014	0.69	0.015	0.15	0.84
PFM317/318	Δfis ΔmutL	0.61	0.004	0.67	0.019	0.58	0.34
PFM482	Δfis ΔmutS	0.34	0.13	0.49	0.108	0.19	0.84
PFM741	Δhns ΔmutL	0.81	<0.001	0.85	0.001	0.60	0.33
PFM713	Δdps ΔmutL	0.52	0.017	0.77	0.004	−0.20	0.84
PFM163^c	mutD5	0.75	<0.001	0.77	0.004	0.52	0.39
PFM165/397/399^c	mutD5 ΔmutL	0.78	<0.001	0.88	<0.001	0.39	0.62
Collective^b	Wild type	0.17	0.46	0.24	0.45	0.17	0.84
Collective^d	Bacillus subtilis mutS::Tn10	0.73	<0.001	0.87	<0.001	0.37	0.65

P values were adjusted for multiple comparisons using the Benjamini-Hochberg method (56) with the false-discovery rate set at 25%.

Data are from reference 11.

Data are from reference 10.

Pearson's correlation ρ for each strain versus MMR− strains P values were adjusted for multiple comparisons using the Benjamini-Hochberg method (56) with the false-discovery rate set at 25%. Data are from reference 11. Data are from reference 10. Data were combined from four MA experiments performed with DK2140 (NCIB3610 mutS::Tn10), DK2141 (NCIB3610 mutS::Tn10), DK2142 (NCIB3610 mutS::Tn10 ΔcomI), and DK2143 (NCIB3610 mutS::Tn10 ΔcomI). There were no differences in mutation rates or spectra among these strains. Pearson's correlation ρ of right versus left replichore for each strain P values were adjusted for multiple comparisons using the Benjamini-Hochberg method (56) with the false-discovery rate set at 25%. Data are from reference 11. Data are from reference 10. Data were combined from four MA experiments performed with DK2140 (NCIB3610 mutS::Tn10), DK2141 (NCIB3610 mutS::Tn10), DK2142 (NCIB3610 mutS::Tn10 ΔcomI), and DK2143 (NCIB3610 mutS::Tn10 ΔcomI). There were no differences in mutation rates or spectra among these strains.

Transcription.

One obvious hypothesis is that the mutational pattern reflects transcriptional patterns. To test this hypothesis, we quantitated the RNA transcript levels in a ΔmutL mutant strain in the lag, exponential, and stationary phases of growth; the results should be representative of the cells growing in colonies during our MA experiments. To compare these results to the BPS rate pattern, we binned the transcriptome sequencing (RNA-Seq) results into the same bins used for mutational data. As shown in Fig. 2, there was no similarity in the two data sets. These data are further discussed in Text S1 in the supplemental material.

FIG 2

Comparison of the pattern of BPS rates to transcription levels. (A) The Daubechies wavelet transform of the binned reads from RNA-Seq samples taken during lag, log, and stationary-growth phases of ΔmutL mutant strain PFM144. The data used for the wavelet transforms represent means of results from three biological replicates. Arrows indicate the positions of the rRNA operons, but the reads from the ribosomal genes in those operons were not included in the plot (see Materials and Methods). (B) Plot of the wavelet coherence between the BPS data from the MMR-defective strains and the mean binned reads from RNA-Seq samples taken during lag, log, and stationary-growth phases of the ΔmutL mutant strain (see the legend to Fig. 1B for further explanation of the coherence plot.).

Replication initiation.

Over many experiments performed in different genetic backgrounds, the BPS pattern around OriC was consistent. In both replichores, the BPS rates declined to a minimum about 300 kb from OriC and then increased to a maximum about 1,000 kb from the origin. Mutation rates then fell again and reached a minimum about 3/5 of the distance toward the terminus. We tested whether replication initiation was responsible for this pattern by performing MA experiments with strains with errant replication start sites. The rnhA gene encodes RNase H1, which degrades RNA-DNA hybrids; in its absence, persistent R-loops initiate aberrant DNA replication that can disrupt normal fork migration (13, 14). However, the BPS pattern of a ΔrnhA ΔmutL mutant strain showed little difference from that of the ΔmutL mutant strain (Table 1), indicating that these aberrant replication initiations do not influence where BPSs occur, at least not when a powerful oriC gene is present. We also performed MA experiments with strains with oriC moved to the midpoint of the right replichore, where it is called oriZ (15). These strains are derived from E. coli K-12 strain AB1157, instead of the ancestor of our MA strains, MG1655. As a control, we tested an AB1157 ΔmutL mutant strain and found that its BPS rate pattern was similar to that of our MMR-defective strains, indicating that the pattern is intrinsic to E. coli K-12 and is not specific to substrain MG1655 (Table 1). We performed MA experiments on ΔmutL derivatives of a strain with both oriZ and oriC (strain WX320 ΔmutL) and one with only oriZ (strain WX340 ΔmutL). The strain with both origins had a BPS pattern similar to those seen with the MMR– strains (Table 1), suggesting that, under our conditions, firing of oriZ could not overcome the influence of oriC. However, the strain containing only oriZ showed a decrease in the BPS rates in the 200-kb area surrounding the new origin, similarly to that normally observed about oriC (Tables 1 and 2; see also Fig. 3).

FIG 3

The effect on the BPS rate pattern of relocating OriC. (A) Bars represent the BPS rate in each bin for the ΔoriC oriZ+ ΔmutL mutant strain (PFM426), and the pink line represents the Daubechies wavelet transform of the binned data (scale on left axis). The gray line represents the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B) Plot of the wavelet coherences between the data from the ΔoriC oriZ+ ΔmutL mutant strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and the coherence plots.) The arrow in panel A indicates the position of oriZ+. We also performed MA experiments with a strain deleted for SeqA, which controls replication initiation (Tables 1 and 2; see also Fig. S1A and B in the supplemental material). These results are further discussed in Text S2 in the supplemental material. Further discussion of the SeqA results (see Fig. S1A and B in the supplemental material). Download Text S2, DOCX file, 0.01 MB. Effect of loss of SeqA, RecA, and RecB on the BPS rate pattern. (A, C, and E) BPS data were collected into 100-kb bins starting at the origin of replication on the left (bin 1) and continuing clockwise around the chromosome back to the origin on the right (bin 46). Bars represent the BPS rate in each bin accumulated in the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from 10 experiments performed with MMR-defective strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B, D, and F) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. The colors correspond to the magnitude-squared level of coherence, a measure of the correlation between the data sets, according to the scale on the right. The dotted line indicates the “cone of influence” within which the results are free of artifactual edge effects. Arrows indicate the phase lag between the two data sets; arrows pointing right indicate in phase, arrows pointing left indicate 180° out of phase, and arrows pointing in other directions indicate the various degrees between. Strains: panels A and B, ΔseqA ΔmutL mutant strain PFM533/534; panels C and D, ΔrecA ΔmutL mutant strain PFM422; panels E and F, ΔrecB ΔmutL mutant strain PFM456. Download FIG S1, EPS file, 2.6 MB.

Replication progression.

Dillon et al. (2) hypothesized that the periodic variation in mutation rates across the chromosome was due to systematic fluctuations in dNTP levels as OriC repeatedly fires during rapid cell growth. High levels of dNTPs increase the probability of misincorporation, and thus the BPS rate, whereas low levels of dNTPs slow replication and improve fidelity (16, 17). The hypothesis of Dillon et al. can be tested by examining the wave pattern from several experiments that we have already published (11). The number of replicating chromosomes is a positive function of the cell’s growth rate (18). When a ΔmutS or a ΔmutL mutant strain was grown on glucose minimal medium, which reduced the growth rate about 2-fold relative to that on LB, the BPS rate pattern became chaotic (Tables 1 and 2; see also Fig. 4A and B). Supplementing the minimal medium with just enough LB to increase the growth rate to normal restored the wave-like BPS rate pattern (Tables 1 and 2; see also Fig. 4C and D). Growing the cells on diluted LB, on which the growth rate was the same as on minimal glucose medium, also preserved the wave-like BPS pattern (Tables 1 and 2; see also Fig. 4E and F). When the cells were grown at a low temperature that also reduced the growth rate 2-fold, the overall shape of the BPS pattern was retained, but the peaks in BPS rates that normally occurred 1,000 kb from the origin were shifted about 200 kb further from OriC (Fig. 4G and H). In addition, the magnitude of the BPS rate fluctuations across the chromosome was doubled. Thus, it appears that growth rate is not the only determinant of the BPS pattern but that other factors, such as the composition of the growth medium, are important.

FIG 4

The effects on the BPS rate pattern of growing the cells on different media and at different temperatures. (A, C, E, and G) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B, D, F, and H) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) The cells grew at the normal rate in the experiments represented in plots C and D, whereas they grew at about half the normal rate in the other plots. Strains: panels A to F, ΔmutS mutant strain PFM343; panels G and H, ΔmutS mutant strain PFM342. Loss of the NrdR repressor results in increased expression of ribonucleotide reductase (RNR) and thus in increased dNTP levels throughout the cell cycle (19, 20) (see Text S3 in the supplemental material for further discussion). We tested a ΔnrdR ΔmutL mutant strain and found that the BPS rate doubled, as expected when dNTP levels are high (Table 3). As shown Tables 1 and 2 (see also Fig. 5A and B), loss of NrdR did not change the basic BPS pattern, but the peak rate on each side was shifted about 200 kb away from the origin. This pattern was similar to that observed when cells were grown at low temperature (Fig. 4G).

TABLE 3

Experimental data

Strain	Description	No. ofBPSs	No. of MAlines	Total no. ofgenerations	BPSs pergeneration× 10³	95%CL^d
Collective^a	MMR^–	30,061	334	264,958	113	2
PFM421	ΔrnhA ΔmutL	3,198	39	22,690	141	7
PFM669	ΔmutL (AB1157)	1,334	33	18,850	71	6
PFM430/431	oriC⁺ oriZ⁺ ΔmutL (AB1157)	3,150	65	33,607	94	6
PFM426	ΔoriC oriZ⁺ ΔmutL (AB1157)	3,544	45	25,176	141	9
PFM533/534	ΔseqA ΔmutL	1,855	44	20,927	89	7
PFM5m^a	ΔmutL on minimal medium	1,435	48	28,197	51	3
PFM343m^a	ΔmutS on minimal medium	1,608	46	26,280	61	4
PFM343VBs^a	ΔmutS on supplemented minimal medium	2,476	39	23,017	108	5
PFM343dLB^a	ΔmutS on diluted LB	3,546	40	22,495	158	8
PFM342LTM^a	ΔmutS at low temperature	1,259	44	24,875	51	5
PFM799	ΔnrdR ΔmutL	5,272	42	23,581	224	12
PFM677	Δrep ΔmutL	3,258	43	23,961	136	7
PFM256	Δtus ΔmutL	3,672	28	32,967	111	10
PFM257	ΔmatP ΔmutL	3,979	33	36,725	108	7
PFM422	ΔrecA ΔmutL	6,797	40	39,713	171	11
PFM424	ΔrecA ΔmutS	6,266	36	36,340	172	11
PFM456	ΔrecB ΔmutL	3,321	46	20,598	161	9
PFM118^a	ΔumuDC ΔdinB ΔmutL	1,110	23	12,078	92	41
PFM120	lexA3 ΔsulA ΔmutL	2,909	33	19,853	147	13
PFM259	ΔhupB ΔmutL	3,910	32	36,390	107	8
PFM258	ΔhupA ΔmutL	2,075	26	27,315	76	7
PFM317/318	Δfis ΔmutL	7,616	35	37,731	202	14
PFM482	Δfis ΔmutS	4,669	40	22,687	206	12
PFM741	Δhns ΔmutL	3,367	46	25,551	132	5
PFM713	Δdps ΔmutL	2,927	40	23,377	125	8
PFM163^b	mutD5	13,625	26	3,481	3,915	344
PFM165/397/399^b	mutD5 ΔmutL	40,686	75	7,012	5,802	456
Collective^a	Wild Type	1,933	341	2,015,066	0.96	0.10
Collective^c	Bacillus subtilis mutS::Tn10	10,551	71	171,805	61	3

Data are from reference 11.

Data are from reference 10.

Data represent combined results from four MA experiments performed with DK2140 (NCIB3610 mutS::Tn10), DK2141 (NCIB3610 mutS::Tn10), DK2142 (NCIB3610 mutS::Tn10 ΔcomI), and DK2143 (NCIB3610 mutS::Tn10 ΔcomI). There were no differences in mutation rates or spectra among these strains.

95% CL, 95% confidence limits to the BPSs per generation.

FIG 5

The effects on the BPS rate pattern of dysregulation of dNTP levels and loss of Rep, the auxiliary replication helicase. (A and C) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) Strains: panels A and B, ΔnrdR ΔmutL mutant strain PFM799; panels C and D, Δrep ΔmutL mutant strain PFM677.

Experimental data Data are from reference 11. Data are from reference 10. Data represent combined results from four MA experiments performed with DK2140 (NCIB3610 mutS::Tn10), DK2141 (NCIB3610 mutS::Tn10), DK2142 (NCIB3610 mutS::Tn10 ΔcomI), and DK2143 (NCIB3610 mutS::Tn10 ΔcomI). There were no differences in mutation rates or spectra among these strains. 95% CL, 95% confidence limits to the BPSs per generation. The effects on the BPS rate pattern of dysregulation of dNTP levels and loss of Rep, the auxiliary replication helicase. (A and C) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) Strains: panels A and B, ΔnrdR ΔmutL mutant strain PFM799; panels C and D, Δrep ΔmutL mutant strain PFM677. Further discussion of the control of ribonuclease reductase by NrdR. Download Text S3, DOCX file, 0.01 MB. Replication fork progression is aided by the accessory replication helicase, Rep, which removes proteins bound to the DNA in front of the fork (21–24). Rep also aids in restarting replication forks after they stall or collapse (25, 26). As shown in Tables 1 and 2 (see also Fig. 5C and D), with the exception of a region close to the origin, loss of Rep disrupted the BPS rate pattern across the chromosome, suggesting that slowing or stalling the fork results in a random distribution of BPSs.

Replication termination.

Most MA experiments showed that the BPS rate increased in the region where chromosomal replication terminates. Usually there were two unequal peaks, as shown in Fig. 1A, but in some experiments these peaks were better defined and of equal heights and, occasionally, there was just one peak. We do not know the source of this variation, but it may simply represent random noise. Replication terminates approximately 180° from the origin in a 1,200-kb region bounded by replication pause (Ter) sites; this region extends from bin 18 to bin 31 in our figures. The antihelicase Tus protein binds to the Ter sites and allows each replication fork to enter but not to exit, creating a replication fork “trap,” within which the two forks fuse and the chromosome dimer is resolved (27). To determine if the interaction of replication forks with Tus contributes to the increased mutation rate within this region, we performed an MA experiment with a Δtus ΔmutL mutant strain. As shown in Tables 1 and 2 (see also Fig. 6A and B), loss of Tus had little effect on the BPS rate pattern.

FIG 6

The effects on the BPS rate pattern of loss of the Tus antihelicase and MatP, the terminus organizing protein. (A and C) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) The arrows in panel A mark the major Ter sites where Tus binds. The bar in panel C shows the region in which MatP binds. Strains: panels A and B, Δtus ΔmutL mutant strain PFM256; panels C and D, ΔmatP ΔmutL mutant strain PFM257. The Ter MD extends from kb 1200 to kb 2200 (12), which is roughly from bin 20 to bin 28 in our figures. The structure of the Ter MD is maintained by the MatP protein, which binds to 23 matS sites within this region (28). As shown in Tables 1 and 2 (see also Fig. 6C and D), loss of MatP caused a severe disruption of the BPS pattern. The mutation rates in the Ter MD were depressed whereas new peaks appeared on either side of the Ter MD in the right and left MDs. Interestingly, the BPS pattern near the origin was maintained in the right replichore but not in the left replichore.

Recombination.

Homologous recombination repairs and restarts the replisome after blockage (29), and the termination region is subject to hyperrecombination (30, 31). However, elimination of the major recombinase of E. coli, RecA, had only a modest effect on the BPS rate pattern in the ΔmutL or ΔmutS mutant strains (Tables 1 and 2; see also Fig. S1C and D in the supplemental material). BPS rates declined in bins 21 to 25, corresponding fairly well to the zone of hyperrecombination. However, this hyperrecombination is dependent on the RecA-RecBCD recombination pathway, and eliminating RecB did not reproduce the BPS pattern seen when RecA was absent (Tables 1 and 2; see also Fig. S1E and F in the supplemental material). Either our protocol is not sensitive enough to detect an effect of loss of RecB or another recombination pathway, e.g., RecFOR (32), is sufficient to maintain the BPS rate in the region.

The SOS response.

In addition to its role in recombination, RecA is also a master regulator of the SOS response to DNA damage. MA experiments performed with a lexA3 ΔmutL mutant strain that cannot induce the SOS response (33), and with a ΔumuDC ΔdinB ΔmutL mutant strain that is missing the error-prone DNA polymerases that are induced as part of the response (34), showed little effect on the BPS rate pattern (Tables 1 and 2; see also Fig. S2 in the supplemental material). Lack of effect of loss of SOS functions on the BPS rate pattern. (A and C) Bars represent the mean BPS rate in each bin accumulated in the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. S1 in the supplemental material for further explanation of bar charts and of the coherence plots.) The lexA3 allele encodes a noninducible repressor of the SOS genes. Deletion of sulA prevents lethal filamentation after induction of the SOS response (not relevant to this study). The dinB and umuDC genes encode the error-prone DNA polymerases (Pol IV and Pol V) that are induced as part of the SOS response. Strains: panels A and B, lexA3 ΔsulA ΔmutL mutant strain PFM120; panels C and D, ΔumuDC ΔdinB ΔmutL mutant strain PFM118. Download FIG S2, EPS file, 1.4 MB.

Nucleoid-associated proteins.

In a previous study (1), we found that the BPS pattern of a ΔmutL strain was correlated with the density of genes activated by the HU protein and repressed by the Fis protein. While both of these NAPs affect transcription, the general lack of correlation of the BPS rate with transcriptional levels (7) (see above) led us to hypothesize that mutation rates across the chromosome were correlated with areas of high DNA structure (1). To test this hypothesis, we performed MA experiments with MMR– strains also defective for each of several NAPs. HU exists as a dimer of its two subunits, HUα and HUβ, encoded by the paralogous genes hupA and hupB, respectively. While loss of both subunits confers a severe growth defect, loss of only one has little consequence during a normal growth cycle, suggesting they can substitute for each other. We performed MA experiments with both ΔhupA ΔmutL and ΔhupB ΔmutL mutant strains. Loss of hupB did not significantly affect the BPS pattern (Table 1). However, loss of hupA depressed BPS rates across the chromosome, particularly in the terminus region, while creating new peaks on either side of the Ter MD (Tables 1 and 2; see also Fig. 7A and B).

FIG 7

Role of HU and Fis in determining the BPS rate pattern. (A and C) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) Strains: panels A and B, ΔhupA ΔmutL mutant strain PFM258; panels C and D, Δfis ΔmutL mutant strain PFM317/318. Sobetzko et al. (35) reported Fis binding sites clustered in the origin region; in contrast, both ChIP-chip (chromatin immunoprecipitation with microarray technology) and ChIP-Seq studies showed the density of Fis binding to be more or less constant across the chromosome (36, 37). In both Δfis ΔmutL and Δfis ΔmutS mutant strains (Tables 1 and 2) (Fig. 7B and C; see also Fig. S3A and B in the supplemental material), loss of Fis flattened the BPS rate pattern across the chromosome except around OriC. The effect of loss of the NAPs Fis, HNS, and DPS on the BPS rate pattern. (A, C, and E) Bars represent the mean BPS rate in each bin accumulated in the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B, D, and F) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. S1 in the supplemental material for further explanation of bar charts and of the coherence plots.) Strains: panels A and B, Δfis ΔmutS mutant strain PFM482; panels C and D, Δhns ΔmutL mutant strain PFM741; panels E and F, Δdps ΔmutL mutant strain PFM713. Download FIG S3, EPS file, 2.8 MB. HNS binds to DNA at high-affinity binding sites, spreads by oligomerization, and condenses the DNA into a few clusters per chromosome (38–40). However, as shown in Tables 1 and 2 (see also Fig. S3C and D in the supplemental material), loss of HNS had little effect on the BPS pattern. The DPS protein accumulates in stationary-phase cells, condenses the nucleoid into a crystal-like state, and protects the DNA from oxidative and other damage (41). Despite this physical change, loss of DPS had little effect on the BPS rate pattern (Tables 1 and 2; see also Fig. S3E and F in the supplemental material). Of course, we do not know the degree to which cells in stationary phase contribute to the BPS rates under our experimental conditions.

Proofreading.

The mutD5 allele encodes an epsilon protein that is inactive for proofreading (42, 43). As shown in Tables 1 and 2 (see also Fig. 8A and B), when proofreading was inactive but MMR was active, the BPS rate pattern was less dramatic but basically the same as when only MMR was inactive. When both MMR and proofreading were inactive, which should reveal the mutations due to polymerase errors, the BPS pattern was nearly flat but still correlated with the MMR− pattern except in the termination region (Tables 1 and 2; see also Fig. 8C and D).

FIG 8

The role of proofreading in determining the BPS rate pattern. (A and B) Bars represent the BPS rate in each bin for the tested strain; the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray lines represent the Daubechies wavelet transform of the binned data from the 10 MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (C and D) Plots of the wavelet coherences between the data from the tested strain and the data from the 10 MMR− strains. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots.) The mutD5 allele encodes an exonuclease-deficient proofreader. Strains: panels A and B, mutD5 mutant strain PFM163; panels C and D, mutD5 ΔmutL mutant strains PFM165/397/399.

Wild type.

The wave pattern of BPSs evident in our data, and in data from other bacteria (2–6), was obtained when MMR was inactive. Thus, a question arises. Does the pattern appear in wild-type strains? This question is difficult to answer because the low mutation rates of wild-type strains would necessitate enormous experiments in order to accumulate enough mutations to approach statistical confidence. In a recent study, we compared the mutation rates and spectra of E. coli strains defective in various DNA repair activities; among those, the results from seven strains were indistinguishable from those of the wild-type parent (44). By combining the data from these strains, we achieved 1,933 BPSs (11), enough to expect to see a wave pattern if it existed. As is evident in Tables 1 and 2 (see also Fig. 9), these BPSs did not create a pattern significantly correlated to the MMR− pattern. However, the wild-type pattern is similar to both that seen in the MMR-defective strains and that seen in the mutD5 mutant strain, particularly around the terminus.

FIG 9

The BPS density pattern in the wild-type strain compared to the patterns of the MMR-defective strains and the mutD5 mutant strain. (A and C) Bars represent the BPS rate in each bin for the wild-type strains, and the pink lines represent the Daubechies wavelet transform of the binned data (scale on left axis). The gray line in panel A represents the Daubechies wavelet transform of the binned data from the 10 MMR− strains; the green line in panel C represents the Daubechies wavelet transform of the mutD5 mutant strain (pink line in Fig. 8A) (scale on right axis; adjusted to bring the wavelets together for comparison). (B and D) Plots of the wavelet coherences between the data from the wild-type strains and the data from the 10 MMR− strains (B) or the mutD5 mutant strain (D). (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots). Wild Type, eight strains with the wild-type mutational phenotypes (see the text).

Bacillus subtilis.

To date, BPS patterns that are symmetrical across the chromosome have been found only in MMR− derivatives of Gram-negative bacteria (1–6). Here we add the Gram-positive bacterium Bacillus subtilis to this list. As shown in Table 2 (see also Fig. 10), the BPS rates across the chromosome in B. subtilis mutS::Tn10 mutant strains fell into a wave-like pattern that was symmetrical about the origin. Although similar in shape, the pattern was significantly different from that of E. coli (Table 1). However, as in E. coli, the BPS rate appeared to increase in the terminus region, which in B. subtilis is not 180° from the origin and corresponds to bins 20 to 24 (Fig. 10).

FIG 10

The BPS rate pattern of MMR-defective B. subtilis. (A) Bars represent the BPS rate in each bin for the B. subtilis MMR-defective strains; the pink line represents the Daubechies wavelet transform of the binned data (scale on left axis). The gray line represents the Daubechies wavelet transform of the binned data from the 10 E. coli MMR− strains (scale on right axis; adjusted to bring the wavelets together for comparison). (B) Plot of the wavelet coherence between the binned BPS data from the B. subtilis strains taken in the clockwise and counterclockwise directions around the chromosome. (See the legend to Fig. 1 for further explanations of the bar charts and of the coherence plots). Strains: four B. subtilis mutS::Tn10 strains with the same mutational phenotype (Table 3) (see also Table S1 in the supplemental material).

DISCUSSION

In this study, we examined the role that a number of factors play in establishing and maintaining the symmetrical wave-like BPS rate pattern across the E. coli chromosome. These factors were as follows: transcription, replication initiation and progression, replication termination, recombination, the SOS response to DNA damage, nucleoid-associated proteins, and error correction by MMR and proofreading. We found that transcription and the SOS response had little effect and that the effect of recombination was modest and confined to the terminal region. Here we discuss only the more significant factors. Providing additional replication origins, either by eliminating RNase H1 or by inserting an ectopic oriC (oriZ) gene (Tables 1 and 2), did not disrupt the wave pattern. However, when oriZ was the only origin of replication, the region of depressed BPS rates that surrounds oriC was reestablished about the new origin (Fig. 3). Only 5.1 kb of DNA containing oriC was relocated (15), whereas the region of reduced BPS rate is about 200 kb; thus, the BPS rate is not determined just by the DNA sequence surrounding oriC. We hypothesize that the process of replication initiation protects the DNA from damage and/or that newly established replication forks have a low error rate. In support of the second hypothesis, replication forks have been found to frequently pause in a 200-kb zone on either side of the origin (45), which could allow more-efficient error correction. The BPS mutation rate increased for about 1,000 kb (10 bins) on either side of OriZ such that the pattern resembled that of the same region normally about OriC. An ectopic origin was found to change the MD structure of the chromosome over similar lengths, although the mechanism of this remodeling is unknown (46). In the presence of OriZ, however, the BPS pattern across the rest of the chromosome remained symmetrical about the absent OriC (Fig. 3). Thus, the nature of the BPS rate pattern appears to be due to at least two factors: one factor affecting only the origin and another factor or factors determining the pattern elsewhere on the chromosome. As mentioned above, Dillon et al. (2) identified the timing of replication as a significant determinant of the BPS rate pattern. They suggested that the pattern reflects variations in dNTP levels that occur as origins fire during rapid growth. At replication initiation, a burst of RNR activity increases the dNTP levels, but as the fork progresses, dNTP levels fall, becoming limiting for replication. During rapid growth, initiation of each additional fork again increases dNTP levels, but these fall rapidly as the forks progress. These fluctuations in dNTP levels should result in fluctuations in replication accuracy, since accuracy is inversely related to dNTP levels (2). High dNTP levels increase the probability of misincorporation, whereas limiting levels of dNTPs slow polymerization, allowing error correction by proofreader and MMR to prevail (47). Our results provide partial support for this hypothesis. Dysregulation of RNR in the ΔnrdR mutant strain shifted the BPS rate pattern away from the origin (Fig. 5A and B), which is expected if the abnormal elevation of dNTP levels delayed the point at which they became limiting. Slowing cell growth by lowering the temperature caused a similar shift in the pattern, also expected if the initiation of fewer forks kept dNTP levels elevated (Fig. 4G and H). However, our results also indicate that growth on rich medium, rather than the growth rate per se, is a significant determinant of the BPS pattern (Fig. 4A to F), possibly because of effects on the expression and DNA binding of HU and Fis (see below). Loss of the accessory replication helicase (Rep) resulted in a nearly complete loss of the BPS pattern (Fig. 5C and D), suggesting that frequent and, presumably, random replication stalling causes mutation rates to become chaotic. An alternative, intriguing possibility is that Rep is responsible for modulating the interactions of the replication fork with the NAPs, whose binding determines the BPS rate pattern across much of the chromosome. After declining to a local minimum about 3/5 of the distance along each replichore, BPS rates rise in the terminus region (Fig. 1A). Eliminating MatP, the protein that maintains the structure of the Ter MD, reduced these rates (Fig. 6C and D), suggesting that replication of highly structured DNA is error prone. Interestingly, BPS rates increased in the right and left MDs in ΔmatP mutant cells, suggesting that these adjacent regions of DNA gained structure in the absence of MatP. The mechanism by which MatP structures the Ter MD is not entirely clear (48, 49), but it probably involves supercoils, which, when unconstrained, could migrate into adjacent areas. From our previous results, we predicted that the NAPs HU and Fis, but not HNS, have a role in establishing or maintaining the BPS rate pattern (1). The results presented here (Tables 1 and 2) (Fig. 7) confirmed those predictions and additionally show that DPS had no effect on the BPS rate pattern. Because the NAPs affect gene expression in various ways, we cannot conclude that DNA binding by the NAPs themselves is responsible for the mutational pattern. And, indeed, we found no significant correlations between the BPS pattern and published binding sites of the NAPs, although the locations of the binding sites themselves differ widely among published results (see, e.g., references 37, 50, and 51). Nonetheless, we favor the hypothesis that structuring of the DNA mediated by the NAPs, directly or indirectly, contributes to the BPS rate pattern. Specifically, we hypothesize that DNA replication becomes inaccurate in regions of high superhelical density. The local effect of HU binding is to bend the DNA, but dimer-dimer interactions produce higher-order HU-DNA complexes that can constrain negative supercoils (52, 53). (See Text S4 in the supplemental material for further discussion of the HU subunits.) While loss of HUβ had little effect on the BPS rate pattern (Table 1), loss of HUα reduced the overall BPS rate by 33% (Table 3) and also changed the pattern (Fig. 7A and B). BPS rates were low in the terminus region but were elevated in the right and left MDs, results which were very similar to the pattern seen in the ΔmatP ΔmutL mutant strain (Fig. 6C and D). Fis is a major transcriptional regulator, either activating or repressing, directly or indirectly, nearly a thousand genes (36). Fis acts as a classic transcriptional regulator at some promoters but affects gene expression by altering the DNA superhelical density at other promoters (54). The loss of Fis nearly doubled the overall BPS rate (Table 3) but flattened the wave pattern outside the origin region (Fig. 7C and D). In addition to local effects, recently developed chromosome conformation capture techniques have revealed that NAPs participate in the overall structuring of the chromosome (49). HU cooperates with the MukBEF condensin to promote long-range chromosomal interactions, while Fis acts similarly, although without the cooperation of MukBEF. Within the Ter domain, MatP restricts MukBEF activity, creating a distinct domain with short-range interactions, but HU and Fis partially counteract this restriction. Although these studies did not address how long-range DNA interactions are achieved, a likely candidate for the factor responsible is DNA supercoiling. Supercoil diffusion brings distant loci together as the two DNA strands writhe about each other (55). While not all of our results can be explained by those newly determined data, the effects of NAPs and MatP on the BPS rate pattern can be explained if replication accuracy is deceased in regions of high superhelical density.

Error correction.

Assuming that the BPSs recovered from the mutD5 ΔmutL mutant strain were due to intrinsic errors made by DNA polymerase, we conclude that the polymerase is accurate close to the origin and then becomes increasingly less accurate as replication proceeds to about 1/3 of the replichore, at which point accuracy increases again (Fig. 8C and D). The patterns of BPSs in the MMR-defective strains reveal proofreader biases, whereas the patterns of BPSs in the mutD5 mutant strain reveal MMR biases. Since the BPS patterns in the mutD5 mutant strain and the MMR-defective strains are similar (Fig. 8A and B), error correction by MMR and error correction by proofreader activity apparently have the same biases, although MMR is much less powerful. Thus, error correction by each is effective close to the origin, declines for about 1/3 of the replichore, and increases in the right and left MDs but declines again in the terminus region (Fig. 1; see also Fig. 8A and B). This pattern probably reflects the combined effects of dNTP pool fluctuations and NAP-induced DNA structure, both of which would affect the error correction activities of proofreader and MMR. Given the considerations described above, we would expect the BPS rate pattern in the wild-type strain to mimic that of the MMR-defective strains. However, the nearly 2,000 BPSs accumulated in phenotypically wild-type strains appear to be randomly distributed across the chromosome (Fig. 9). One explanation for this discrepancy is that when both proofreading and MMR are active, the mutation rate is reduced to the point that other, weaker DNA repair and mutagenic activities obscure the underlying pattern. Although the data are not significantly correlated, the wild-type BPS pattern is similar to both the MMR– pattern and the mutD5 mutant pattern, particularly around the terminus, supporting this hypothesis. Alternatively, although the eight strains that we combined to produce 2,000 BPSs have the same mutation rates and spectra, they may have different wave patterns that tend to negate each other and obscure the underlying pattern.

MATERIALS AND METHODS

Bacterial strains and media.

The strains used in this study are listed in Table S1 in the supplemental material, and the methods of their construction are described in Text S5 in the supplemental material. The oligonucleotides used in this study are listed in Table S2 in the supplemental material. Standard media and antibiotics were used (see Text S5 in the supplemental material). Further discussion of the HU subunits. Download Text S4, DOCX file, 0.01 MB. Supplemental material and methods. Download Text S5, DOCX file, 0.02 MB. Oligonucleotides used in this study. Download Table S2, DOCX file, 0.01 MB.

Mutation accumulation experiments.

The MA procedure has been described (7, 11, 44). Details are given in the Text S5 in the supplemental material.

Genomic DNA preparation, library construction, sequencing, and sequence analysis.

The methods used for genomic DNA preparation, library construction, sequencing, and sequence analysis have been described previously (7, 11, 44). Details are given in the Text S5 in the supplemental material.

RNA sequencing.

Strain PFM144, which is PFM2 ΔmutL (11), was grown in LB, and aliquots, each containing the same number of cells, were collected during the lag (optical density [OD] = 0.022), log (OD = 0.3), and stationary (OD = 1.5) phases. Total RNA was extracted, DNA and rRNA were removed, and libraries were constructed and sequenced as detailed in Text S5 in the supplemental material. Three biological triplicates were prepared for each growth phase. The numbers of RNA-Seq reads for each condition were normalized to the number of reads mapped to the holD gene, which was shown by real-time PCR (rtPCR) to be expressed at the same level in all phases of growth. The means of the normalized RNA reads from the triplicates were then binned into the same bins used for the mutational analysis. A fourth-order Daubechies wavelet transform was performed on the binned RNA-Seq reads as described for the mutational data (1).

Statistical analysis.

To obtain the BPS density patterns, the numbers of BPSs were binned into 46 bins, each ∼100 kb long, as described previously (1). A fourth-order Daubechies wavelet transform was performed on the binned mutation data as described previously (1). For presentation in the figures, these results were converted into rate values by dividing the number of BPSs by the appropriate number of generations. Pearson’s product-moment correlation coefficient, ρ, was used to evaluate the correlations between the binned BPS data (Tables 1 and 2). Spearman’s nonparametric correlation coefficient was also computed for a few data sets but gave similar results. To account for multiple comparisons, P values were adjusted using the Benjamini-Hochberg method (56) with the false-discovery rate set at 25%, implemented with the MatLab R2018a “mafdr” command. Because comparisons performed using data from the same strain are not independent, this adjustment was made separately for each column in Tables 1 and 2. To further compare the BPS density patterns between two data sets, wavelet coherence was calculated and plotted using the MatLab R2018a “wcoherence” command. While the Daubechies wavelet provides a good visual representation of the binned data, it is not continuous and thus not easily adapted for wavelet coherence analysis. The MatLab program first converts the binned data to Morlet wavelets and then computes the coherence between two of these wavelets. We chose to analyze the data with wavelet coherence because it gives a measure of the correlation between the signals (displayed as colors in the figures) (57, 58). In addition, the MatLab wavelet coherence plot indicates, as a dashed curve, the “cone of influence” within which results are free of artifactual edge effects (57). The relative levels of phase lag between the two signals are indicated by small arrows: arrows pointing right indicate in phase, arrows pointing left indicate 180° out of phase, and arrows pointing in other directions indicate the various degrees between. Because the MatLab program assumes a frequency-time series, the x axis of the plot represents cycles/sample and the y axis represents time; we converted these to bins/cycle (on an inverted scale) and bins, respectively.

Data availability.

The sequences and SNPs reported in this paper were deposited with the National Center for Biotechnology Information Sequence Read Archive (https://trace.ncbi.nlm.nih.gov/Traces/sra/) (accession no. PRJNA168337 for E. coli and accession no. PRJNA542344 for B. subtilis) and in the IUScholarWorks Repository (http://hdl.handle.net/2022/23028).

55 in total

1. Gene order and chromosome dynamics coordinate spatiotemporal gene expression during the bacterial growth cycle.

Authors: Patrick Sobetzko; Andrew Travers; Georgi Muskhelishvili
Journal: Proc Natl Acad Sci U S A Date: 2011-12-19 Impact factor: 11.205

Review 2. Effects of nucleoid-associated proteins on bacterial chromosome structure and gene expression.

Authors: Douglas F Browning; David C Grainger; Stephen Jw Busby
Journal: Curr Opin Microbiol Date: 2010-10-13 Impact factor: 7.934

3. Identification of the lexA gene product of Escherichia coli K-12.

Authors: J W Little; J E Harper
Journal: Proc Natl Acad Sci U S A Date: 1979-12 Impact factor: 11.205

4. Increase in dNTP pool size during the DNA damage response plays a key role in spontaneous and induced-mutagenesis in Escherichia coli.

Authors: Stéphanie Gon; Rita Napolitano; Walter Rocha; Stéphane Coulon; Robert P Fuchs
Journal: Proc Natl Acad Sci U S A Date: 2011-11-14 Impact factor: 11.205

5. NrdR controls differential expression of the Escherichia coli ribonucleotide reductase genes.

Authors: Eduard Torrents; Inna Grinberg; Batia Gorovitz-Harris; Hanna Lundström; Ilya Borovok; Yair Aharonowitz; Britt-Marie Sjöberg; Gerald Cohen
Journal: J Bacteriol Date: 2007-05-11 Impact factor: 3.490

6. Analysis and possible role of hyperrecombination in the termination region of the Escherichia coli chromosome.

Authors: J M Louarn; J Louarn; V François; J Patte
Journal: J Bacteriol Date: 1991-08 Impact factor: 3.490

7. Hypermutability and error catastrophe due to defects in ribonucleotide reductase.

Authors: Deepti Ahluwalia; Roel M Schaaper
Journal: Proc Natl Acad Sci U S A Date: 2013-10-28 Impact factor: 11.205

8. H-NS cooperative binding to high-affinity sites in a regulatory element results in transcriptional silencing.

Authors: Emeline Bouffartigues; Malcolm Buckle; Cyril Badaut; Andrew Travers; Sylvie Rimsky
Journal: Nat Struct Mol Biol Date: 2007-04-15 Impact factor: 15.369

9. Long range chromosome organization in Escherichia coli: The position of the replication origin defines the non-structured regions and the Right and Left macrodomains.

Authors: Stéphane Duigou; Frédéric Boccard
Journal: PLoS Genet Date: 2017-05-09 Impact factor: 5.917

INTRODUCTION

RESULTS

The base-pair substitution (BPS) pattern in mismatch repair-defective strains.

Transcription.

Replication initiation.

Replication progression.

Replication termination.

Recombination.

The SOS response.

Nucleoid-associated proteins.

Proofreading.

Wild type.

Bacillus subtilis.

DISCUSSION

Error correction.

MATERIALS AND METHODS

Bacterial strains and media.

Mutation accumulation experiments.

Genomic DNA preparation, library construction, sequencing, and sequence analysis.

RNA sequencing.

Statistical analysis.

Data availability.

Review 2. Effects of nucleoid-associated proteins on bacterial chromosome structure and gene expression.

Review 2. Mutation and Recombination Rates Vary Across Bacterial Chromosome.

Review 6. Bacterial phenotypic heterogeneity in DNA repair and mutagenesis.