| Literature DB >> 26983823 |
Eric L Miller1, Monica I Abrudan1, Ian S Roberts2, Daniel E Rozen3.
Abstract
The opportunistic pathogen Streptococcus pneumoniae is commonly carried asymptomatically in the human nasopharynx. Due to high rates of cocolonization with other pneumococcus strains, intraspecific competitive interactions partly determine the carriage duration of strains and thereby their potential to cause disease. These interactions may be mediated by bacteriocins, such as the type IIb bacteriocins encoded by the blp (bacteriocin-like peptide) locus. To understand blp diversity and evolution, we undertook a bioinformatic analysis of 4,418 pneumococcal genomes, including 168 newly sequenced genomes. We describe immense variation at all levels of genomic organization: Gene presence/absence, gene order, and allelic diversity. If we make the extreme and naive hypothesis that assumes all genes in this operon can assort randomly, this variation could lead to 10(15) distinct bacteriocin-related phenotypes, each potentially representing a unique ecological strategy; however, we provide several explanations for why this extreme is not realized. Although rarefaction analysis indicates that the number of unique strategies is not saturated, even after sampling thousands of genomes, we show that the variation is neither unbounded nor random. We delimit three bacteriocin groups, which contain group-specific bacteriocins, immunity genes, and blp operon gene order, and argue that this organization places a constraint on realized ecological strategies. We additionally show that ecological strategy diversity is significantly constrained by pneumococcal phylogeny and clonal structure. By examining patterns of association between alleles within the blp operon, we show that bacteriocin genes, which were believed to function in pairs, can be found with a broad diversity of partner alleles and immunity genes; this overall lack of allelic fidelity likely contributes to the fluid structure of this operon. Our results clarify the diversity of antagonistic ecological strategies in the global pneumococcal population and highlight the potential role of blp bacteriocins in competition within the nasopharynx.Entities:
Keywords: antagonism; bacteriocin; bioinformatics; ecological strategies; interference competition
Mesh:
Substances:
Year: 2016 PMID: 26983823 PMCID: PMC4860687 DOI: 10.1093/gbe/evw055
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FPhylogenetic relationship between 4,418 Streptococcus pneumoniae genomes. We used 43 nonpneumoniae Streptococcus sp. as an outgroup. We collapsed clades with less than 75% nonparametric bootstrap support. The colored rings show the genome set (inner ring) and bacteriocin group (outer ring) of each genome.
Genes and Gene Frequencies of blp Operon and Related Genes
| Gene | Alternative Names | Function | Frequency | Amino Acid Variants | Phylotypes over 0.5% Frequency |
|---|---|---|---|---|---|
| ABC transporter | 0.249 | 272 | 23 | ||
| ABC transporter | 0.838 | 195 | 11 | ||
| QS signal | 0.991 | 29 | 9 | ||
| Bacteriocin | 0.115 | 4 | 1 | ||
| Bacteriocin | 0.116 | 2 | 1 | ||
| Membrane protein | 0.114 | 2 | 1 | ||
| CAAX protease | 0.111 | 3 | 1 | ||
| QS receptor | 0.990 | 156 | 15 | ||
| Bacteriocin | 0.404 | 11 | 3 | ||
| Bacteriocin | 0.002 | 1 | 0 | ||
| Bacteriocin | 0.416 | 22 | 5 | ||
| Bacteriocin | 0.545 | 50 | 8 | ||
| Membrane protein | 0.575 | 59 | 9 | ||
| Bacteriocin | 0.610 | 33 | 5 | ||
| Bacteriocin | 0.757 | 26 | 6 | ||
| Bacteriocin | 0.605 | 24 | 4 | ||
| Membrane protein | 0.864 | 18 | 2 | ||
| Bacteriocin | 0.202 | 4 | 1 | ||
| Response regulator | 0.995 | 91 | 7 | ||
| Accessory protein | 0.938 | 70 | 9 | ||
| Unknown | 0.991 | 48 | 7 | ||
| Unknown | 0.059 | 7 | 2 | ||
| Unknown | 0.006 | 4 | 1 | ||
| Unknown | 0.213 | 6 | 2 | ||
| Unknown | 0.425 | 6 | 2 | ||
| Unknown | 0.422 | 7 | 1 | ||
| Unknown | 0.116 | 2 | 1 | ||
| Membrane protein | 0.692 | 28 | 4 | ||
| CAAX protease | 0.983 | 96 | 10 | ||
| Membrane protein | 0.993 | 43 | 9 | ||
| ABC transporter | 0.983 | 181 | 5 | ||
| ABC transporter | 0.992 | 121 | 2 | ||
| Membrane protein | 0.792 | 26 | 10 | ||
| Membrane protein | 0.633 | 18 | 4 | ||
| SP0547 | CAAX protease | 0.984 | 91 | 9 | |
| Bacteriocin | 0.213 | 7 | 1 | ||
| Bacteriocin | 0.302 | 20 | 4 |
aCAAX proteases and membrane proteins are both considered immunity genes.
bIn 4,096 randomly sampled genomes.
cFull length alleles only for blpA and blpB.
dPutative, based on sequence similarity.
Mature Putative Bacteriocin Amino Acid Sequences
| Gene | Frequency in Randomly Sampled Genomes | Amino Acid Sequence |
|---|---|---|
| 0.115 | ||
| 0.116 | ||
| 0.371 | ||
| 0.025 | ||
| 0.006 | ||
| 0.349 | ||
| 0.027 | ||
| 0.017 | ||
| 0.006 | ||
| 0.005 | ||
| 0.139 | ||
| 0.119 | ||
| 0.099 | ||
| 0.067 | ||
| 0.050 | ||
| 0.044 | ||
| 0.034 | ||
| 0.029 | ||
| 0.002 | ||
| 0.300 | ||
| 0.101 | ||
| 0.096 | ||
| 0.092 | ||
| 0.011 | ||
| 0.374 | ||
| 0.149 | ||
| 0.101 | ||
| 0.057 | ||
| 0.043 | ||
| 0.011 | ||
| 0.547 | ||
| 0.028 | ||
| 0.017 | ||
| 0.007 | ||
| 0.200 | ||
| 0.207 | ||
| 0.132 | ||
| 0.126 | ||
| 0.028 | ||
| 0.005 |
aFor genes present in over 0.5% of genomes, only variants found in at least 0.5% of genomes are shown.
Molecular Diversity of blp Operon and Selected Housekeeping Genes
| Gene | Category | Number of Sites | Number of Sequences | Nucleotide Diversity (π) | |
|---|---|---|---|---|---|
| Unknown | 207 | 5 | 0.012 | 2.656 | |
| Bacteriocin | 147 | 9 | 0.023 | 2.645 | |
| Bacteriocin | 226 | 18 | 0.012 | 1.814 | |
| Bacteriocin | 201 | 5 | 0.010 | 0.988 | |
| Bacteriocin | 142 | 20 | 0.027 | 0.974 | |
| Regulatory | 127 | 28 | 0.129 | 0.895 | |
| Bacteriocin | 231 | 19 | 0.056 | 0.844 | |
| Unknown | 147 | 8 | 0.012 | 0.712 | |
| Bacteriocin | 192 | 7 | 0.009 | 0.569 | |
| Unknown | 414 | 8 | 0.014 | 0.492 | |
| Immunity | 84 | 13 | 0.066 | 0.476 | |
| Bacteriocin | 126 | 19 | 0.030 | 0.462 | |
| Bacteriocin | 213 | 38 | 0.040 | 0.395 | |
| Bacteriocin | 219 | 24 | 0.020 | 0.360 | |
| Regulatory | 327 | 74 | 0.027 | 0.352 | |
| Immunity | 177 | 22 | 0.037 | 0.290 | |
| Immunity | 357 | 23 | 0.023 | 0.242 | |
| Regulatory | 1,308 | 157 | 0.086 | 0.206 | |
| Immunity | 204 | 35 | 0.055 | 0.199 | |
| Immunity | 495 | 128 | 0.018 | 0.190 | |
| Regulatory | 297 | 53 | 0.024 | 0.181 | |
| Immunity | 222 | 68 | 0.068 | 0.153 | |
| Immunity | 147 | 13 | 0.017 | 0.152 | |
| Housekeeping gene | 852 | 121 | 0.017 | 0.151 | |
| Immunity | 687 | 127 | 0.070 | 0.137 | |
| Regulatory | 1,359 | 177 | 0.027 | 0.137 | |
| Housekeeping gene | 2,559 | 190 | 0.004 | 0.117 | |
| Unknown | 114 | 5 | 0.014 | 0.092 | |
| Housekeeping gene | 609 | 108 | 0.017 | 0.090 | |
| Regulatory | 2,151 | 74 | 0.015 | 0.083 | |
| Housekeeping gene | 1,344 | 161 | 0.018 | 0.071 | |
| Regulatory | 666 | 114 | 0.049 | 0.064 | |
| Housekeeping gene | 975 | 141 | 0.020 | 0.061 | |
| Housekeeping gene | 1,041 | 189 | 0.042 | 0.053 | |
| Housekeeping gene | 1,161 | 113 | 0.009 | 0.031 | |
| Housekeeping gene | 654 | 101 | 0.020 | 0.028 | |
| Housekeeping gene | 1,620 | 166 | 0.014 | 0.028 |
ablpD, blpE, blpF, blpG, blpI2, blpU2, and blpV were removed for having less than five unique sequences after trimming.
bFull-length blpA and blpB alleles only.
FEstimated and sampled ecological strategies in the blp operon. (A) We reduced the operon to an “average” operon consisting of a histidine kinase quorum sensing receptor (blpH), a quorum sensing signals (blpC), exactly four bacteriocin genes, and exactly seven immunity genes in order to estimate the number of potential ecological strategies. This average genome was based on frequency data from our genome set, as we detected: 99.1% of genomes containing a single blpC gene, with multiple co-occurring blpC genes not detected; 99.0% of genomes containing a single blpH gene, with multiple co-occurring blpH genes not detected; an average of 4.29 putative bacteriocins detected per genome; and an average of 6.74 putative immunity genes detected per genome. We included only phylotypes that were present in at least 0.5% of randomly sampled genomes. (B) Rarefaction of ecological strategies found in randomly sampled genomes. Only strains that contained blpC and blpH, and phylotypes that were present in at least 0.5% of randomly sampled genomes, were included. Error bars indicate standard deviation of 10,000 randomizations.
FConsensus blp operon gene structure across bacteriocin groups. This gene order was compatible with 99.5% of the Q–T group; 94.9% of the I–J group; 93.3% of unclassified genomes; and 99.4% of the D–E group. The frequency of the gene in the group is shown in gray bars; gene function is shown by the color of the gene. Lighter gray bars underneath blpK indicate the proportion of genomes with blpK in an unknown location. Genes occurring in less than 0.5% of genomes in the bacteriocin group are not shown.
FDistribution of phylotypes across bacteriocin groups. For each blp operon gene, the number of genomes with each phylotype is shown, with the phylotypes colored by their overall ranked abundance within each gene across all bacteriocin groups. Phylotypes found in less than 1% of randomly sampled genomes are shown in white. Genes are ordered by gene class and Bray–Curtis dissimilarity between bacteriocin groups.
FSignificant associations between blp operon gene phylotypes. Within each pair of genes, the proportion of pairwise phylotypes that had significant associations is shown: (A) Between putative bacteriocin genes; (B) between putative bacteriocins and immunity genes; and (C) between regulatory blp operon genes, shown in consensus gene order. Comparisons between phylotypes co-occurring in less than 0.5% of genomes were not tested. Pairs of genes with fewer than three tested phylotype combinations are shown with a slash. Genes with fewer than three phylotypes each occurring in at least 0.5% of genomes are not shown.
FDistribution of pairwise strain comparisons across phylogenetic and ecological strategy distance. For strains that were placed into clonal complexes, we calculated the pairwise distance along the phylogenetic tree and the pairwise distance between ecological strategies (e.g., the number of different gene-phylotypes for blpH, blpC, putative bacteriocins, and immunity genes) of (A) strains belonging to the same clonal complex and (B) strains belonging to different clonal complexes. The axes divisions are identical for each graph, although the scales differ. Figure 6A has 14 divisions with over 5,000 pairwise comparisons, while figure 6 has 24 divisions with over 100,000 pairwise comparisons.