| Literature DB >> 25888490 |
Matthew L Aardema1, Friederike D von Loewenich2.
Abstract
BACKGROUND: The host range of a pathogenic bacterial strain likely influences its effective population size, which in turn affects the efficacy of selection. Transmission between competent hosts may occur more frequently for host generalists than for specialists. This could allow higher bacterial population densities to persist within an ecological community and increase the efficacy of selection in these populations. Conversely, specialist strains may be better adapted to their hosts and consequently achieve greater within-host population densities, with corresponding increases in selection efficacy. To assess these different hypotheses, we examined the effective population sizes of three strains of the bacterium Anaplasma phagocytophilum and categorized the varying roles of selection and demography on patterns of genetic diversity and divergence in these populations. A. phagocytophilum is a tick-transmitted, obligately intracellular pathogen. Strains of A. phagocytophilum display varying degrees of host specialization, making this a good species for exploring questions regarding host range, effective population size and selection efficacy.Entities:
Mesh:
Year: 2015 PMID: 25888490 PMCID: PMC4383208 DOI: 10.1186/s12862-015-0335-z
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Phylogeny of the three strains used in this study based on seven conserved genetic regions. Rather than display each branch tip individually we have chosen to represent each of the three clusters with a composite triangle. The vertical height of the triangle indicates relative sample sizes for each strain examined in this study. The horizontal width of the triangle indicates the extent of intra-strain synonymous genetic diversity. The numbers along each branch indicate the observed number of synonymous substitutions per synonymous site (dS). The vertical order of the taxa displayed within each cluster is arbitrary.
Figure 2Boxplots showing the median values (thick horizontal black lines) and quartiles for π (a & b) and θ (c & d) estimates at synonymous and non-synonymous sites. Open dots indicate outlying estimates. The red lines with asterisks indicate significant differences between clusters based on a paired t-test (p < 0.05). Note that for both synonymous π and θw the y-axis is on a Log10 scale.
The mean number of segregating sites and estimates of synonymous and non-synonymous genetic diversity for each cluster across the seven genetic regions
|
|
| |||||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| 1 | 95.6 (3.0) | 11.1 (4.3) | 0.0131 (0.0047) | 0.0196 (0.0079) | 315.4 (19.8) | 5.4 (1.8) | 0.0008 (0.0005) | 0.0028 (0.0008) |
| 2 | 95.6 (3.1) | 11.4 (10.8) | 0.0379 (0.0477) | 0.0354 (0.0346) | 315.4 (19.7) | 2.7 (1.6) | 0.0015 (0.0011) | 0.0025 (0.0013) |
| 3 | 96.3 (3.0) | 1.6 (1.1) | 0.0056 (0.0044) | 0.0046 (0.0034) | 314.7 (19.1) | 0.9 (1.2) | 0.0008 (0.0010) | 0.0008 (0.0011) |
1The average number of sites per genetic region that were classified as either synonymous or non-synonymous respectively.
2The average number of segregating sites across the seven genetic regions.
3The average pairwise difference per site [18,19].
4Watterson’s estimator of genetic diversity based on the number of segregating sites [20].
() Standard deviations are indicated in parentheses.
Figure 3A representation of the circular genome of based on the sequenced HZ strain (NC_007797.1, [ 23 ]). The outer circle (orange & purple) gives genome landmarks in base pairs. The two purple sections indicate the locations of two p44 pseudogene clusters [65]. The inner circle (dark blue) shows the location of the seven genetic regions used in this study (pink bars) as well as six other genetic regions that have been important for A. phagocytophilum strain characterization or may have functional importance in host interactions (light blue bars). Green arrows indicate that the coding sequence of a genetic region is in the forward direction relative to the published genome and red arrows indicate that the coding sequence of the genetic region is in the reverse direction relative to the published genome. (See Additional file 1: Table S5 for more details).
Figure 4The mean population frequency of minor alleles for synonymous and non-synonymous segregating sites. The horizontal dashed lines represent the expected mean under neutrality given the number of observed segregating sites [34]. Error bars indicate 95% confidence intervals based on 10,000 bootstrap replicates of the observed data, randomly sampled with replacement.
Average estimates for tests of selection and additional neutrality tests based on the observed number of segregating sites
|
|
|
|
|
|
|
|---|---|---|---|---|---|
| 1 | 0.078 (0.054) | −0.149 (0.181) | −1.129 (0.508) | −0.972 (0.596) | −4.400* (2.153) |
| 2 | 0.086 (0.047) | −0.009 (0.112) | −0.585 (0.990) | −0.337 (1.169) | −1.931 (4.100) |
| 3 | 0.070 (0.048) | −0.026 (0.325) | 0.337 (1.013) | 0.677 (0.328) | −1.079* (1.539) |
1The average ratio of non-synonymous substitutions per non-synonymous site (dN) to synonymous substitutions per synonymous site (dS) for the seven loci.
2The average estimate from the direction of selection (DoS) test.
*Indicates statistical significance based on 10,000 coalescent simulations with the number of observed segregating sites and no recombination (p < 0.05).
() Standard deviations are indicated in parentheses.
Pairwise divergence estimates for the three clusters based on the concatenated dataset of all seven genetic regions
|
|
|
|
|
|---|---|---|---|
| 1 vs 2 | 0.757 | 0.027 | 0.021 |
| 1 vs 3 | 0.972 | 0.100 | 0.097 |
| 2 vs 3 | 0.945 | 0.105 | 0.099 |
1Fixation index [59,60].
2The average number of nucleotide substitutions per site between each cluster [19].
3The net number of nucleotide differences per site between each cluster [19].