| Literature DB >> 28992304 |
Eric L Miller1,2, Benjamin A Evans1,3, Omar E Cornejo4, Ian S Roberts1, Daniel E Rozen2.
Abstract
Natural transformation in the Gram-positive pathogen Streptococcus pneumoniae occurs when cells become "competent," a state that is induced in response to high extracellular concentrations of a secreted peptide signal called competence stimulating peptide (CSP) encoded by the comC locus. Two main CSP signal types (pherotypes) are known to dominate the pherotype diversity across strains. Using 4,089 fully sequenced pneumococcal genomes, we confirm that pneumococcal populations are highly genetically structured and that there is significant variation among diverged populations in pherotype frequencies; most carry only a single pherotype. Moreover, we find that the relative frequencies of the two dominant pherotypes significantly vary within a small range across geographical sites. It has been variously proposed that pherotypes either promote genetic exchange among cells expressing the same pherotype, or conversely that they promote recombination between strains bearing different pherotypes. We attempt to distinguish these hypotheses using a bioinformatics approach by estimating recombination frequencies within and between pherotypes across 4,089 full genomes. Despite underlying population structure, we observe extensive recombination between populations; additionally, we found significantly higher (although marginal) rates of genetic exchange between strains expressing different pherotypes than among isolates carrying the same pherotype. Our results indicate that pherotypes do not restrict, and may even slightly facilitate, recombination between strains; however, these marginal effects suggest the more likely possibility that the cause of CSP polymorphism lies outside of its effects on transformation. Our results suggest that the CSP balanced polymorphism does not causally underlie population differentiation. Therefore, when strains carrying different pherotypes encounter one another during cocolonization, genetic exchange can occur without restriction.Entities:
Keywords: balanced polymorphism; competence; horizontal gene transfer; pneumococcus; quorum sensing
Mesh:
Substances:
Year: 2017 PMID: 28992304 PMCID: PMC5629823 DOI: 10.1093/gbe/evx188
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Frequencies of Pherotypes within Geographic Sites
| Frequency in Set | ||||||
|---|---|---|---|---|---|---|
| Pherotype | Secreted Peptide | Maela, Thailand | Atlanta, USA | Mass., USA | Rotterdam, The Netherlands | All 4,089 Genomes |
| Csp-1 | 0.736 | 0.702 | 0.643 | 0.634 | 0.720 | |
| Csp-1_Short | 0.029 | 0 | 0 | 0 | 0.021 | |
| Csp-2 | 0.213 | 0.24 | 0.333 | 0.324 | 0.235 | |
| Csp-4_R1 | 0.005 | 0 | 0.003 | 0 | 0.004 | |
| Csp-4_R2 | 0.004 | 0.008 | 0.018 | 0.035 | 0.007 | |
| Csp-4_R3 | 0.005 | 0 | 0.002 | 0 | 0.004 | |
| Other | 0.007 | 0.017 | 0.002 | 0.007 | 0.007 | |
| None | 0.002 | 0.033 | 0 | 0 | 0.003 | |
This includes genome sets of variable geographic origin, such as the set of available genomes in GenBank.
Other pherotypes each found in < 0.2% of genomes, as well as two genomes of both Csp-1 and Csp-2 pherotype.
. 1.—Associations between ComC and comD. The UPGMA clustering of ComC genes is shown next to the inferred phylogenetic relationship between comD alleles. ComC amino acid variants (which encompass both the leader residues and the mature CSP) and comD nucleotide alleles found within the same genome are show as lines connecting the two phylograms, with thicker lines showing associations found in more strains. Line thickness is on a log scale. Classification of comD alleles is based on co-occurring CSPs within genomes, with a 99.8% correlation for the Csp-1 comD group, 99.2% correlation for the Csp-2 comD group, and 95.3% correlation for the Csp-4 comD group.
. 2.—Phylogenetic relationship between Streptococcus pneumoniae genomes. The inner coloured ring shows the population grouping of strains as determined by hierBAPS, shown as a number and a colour/pattern. Genomes not classified into a population are white in the inner ring. The outer ring denotes pherotype.
. 3.—Relationship between pherotypes and populations. (A) Distribution of pherotypes within each population. (B) Observed/expected fraction of within-population recombination for each population, with values <0 indicating less observed within-population recombination than expected. Error bars show 95% confidence intervals.
. 4.—Recombination between and within pherotypes. (A) Observed/expected fraction of recombination within and between pherotypes. Empty shapes are within-pherotype comparisons, whereas filled shapes show between-pherotype comparisons. Colours correspond to pherotypes as in figure 1. Error bars show 95% confidence intervals. (B) Distribution of the proportion of recombination events between Csp-1 and Csp-2, in which all recombination events with identical breaks points in the full-genome alignment are grouped as a single event. Light gray shapes show density estimate; dark bars incorporate the 25th to 75th percentile; white circles indicate the average proportion. Averages in bold are significantly different from the null expectation.
Reconstructed Pherotypes of Recipient and Donor Strains
| Recipient Pherotype | Population | Number of Comparisons | Frequency of Pherotype | Proportion of Comparisons with Intrapherotype Presumed Donor Strain | Uncorrected | Corrected |
|---|---|---|---|---|---|---|
| Csp-1 | 1 | 194 | 0.72 | 0.639 | 0.016 | 0.275 |
| Csp-1 | 3 | 54 | 0.72 | 0.981 | 0 | |
| Csp-1 | 7 | 10 | 0.72 | 0.7 | 1 | 1 |
| Csp-1 | 11 | 175 | 0.72 | 0.794 | 0.029 | 0.428 |
| Csp-1 | 12 | 26 | 0.72 | 0.615 | 0.274 | 1 |
| Csp-1 | 14 | 199 | 0.72 | 0.678 | 0.206 | 1 |
| Csp-1 | 19 | 411 | 0.72 | 0.725 | 0.869 | 1 |
| Csp-1 | 21 | 465 | 0.72 | 0.8 | 0 | |
| Csp-1 | 23 | 692 | 0.72 | 0.76 | 0.02 | 0.317 |
| Csp-1 | 34 | 15 | 0.72 | 0.533 | 0.146 | 1 |
| Csp-1 | 35 | 225 | 0.72 | 0.702 | 0.553 | 1 |
| Csp-1 | 36 | 134 | 0.72 | 0.709 | 0.773 | 1 |
| Csp-2 | 1 | 182 | 0.235 | 0.456 | 0 | |
| Csp-2 | 3 | 175 | 0.235 | 0.091 | 0 | |
| Csp-2 | 7 | 10 | 0.235 | 0.2 | 1 | 1 |
| Csp-2 | 11 | 16 | 0.235 | 0.313 | 0.554 | 1 |
| Csp-2 | 12 | 53 | 0.235 | 0.208 | 0.747 | 1 |
| Csp-2 | 14 | 28 | 0.235 | 0.75 | 0 | |
| Csp-2 | 19 | 1475 | 0.235 | 0.397 | 0 | |
| Csp-2 | 21 | 12 | 0.235 | 0.417 | 0.168 | 1 |
| Csp-2 | 23 | 195 | 0.235 | 0.369 | 0 | |
| Csp-4_R1 | 35 | 19 | 0.004 | 0 | 1 | 1 |
| Csp-4_R2 | 34 | 40 | 0.007 | 0.025 | 0.245 | 1 |
| Csp-4_R2 | 35 | 209 | 0.007 | 0.048 | 0 | |
| Csp-4_R3 | 34 | 73 | 0.004 | 0.055 | 0 | |
| Csp-4_R3 | 35 | 94 | 0.004 | 0 | 1 | 1 |
Statistically significant corrected P values are in bold.
Population/pherotype combinations with less than ten comparisons are found in supplementary table S2 in Supplementary Material online.