| Literature DB >> 31481387 |
Mogens Kilian1, Hervé Tettelin2.
Abstract
From a common ancestor, Streptococcus pneumoniae and Streptococcus mitis evolved in parallel into one of the most important pathogens and a mutualistic colonizer of humans, respectively. This evolutionary scenario provides a unique basis for studies of both infection-associated properties and properties important for harmonious coexistence with the host. We performed detailed comparisons of 60 genomes of S. pneumoniae, S. mitis, Streptococcus pseudopneumoniae, the three Streptococcus oralis subspecies oralis, tigurinus, and dentisani, and Streptococcus infantis Nonfunctional remnants of ancestral genes in both S. pneumoniae and in S. mitis support the evolutionary model and the concept that evolutionary changes on both sides were required to reach their present relationship to the host. Confirmed by screening of >7,500 genomes, we identified 224 genes associated with virulence. The striking difference to commensal streptococci was the diversity of regulatory mechanisms, including regulation of capsule production, a significantly larger arsenal of enzymes involved in carbohydrate hydrolysis, and proteins known to interfere with innate immune factors. The exclusive presence of the virulence factors in S. pneumoniae enhances their potential as vaccine components, as a direct impact on beneficial members of the commensal microbiota can be excluded. In addition to loss of these virulence-associated genes, adaptation of S. mitis to a mutualistic relationship with the host apparently required preservation or acquisition of 25 genes lost or absent from S. pneumoniae Successful adaptation of S. mitis and other commensal streptococci to a harmonious relationship with the host relied on genetic stability and properties facilitating life in biofilms.IMPORTANCE Streptococcus pneumoniae is one of the most important human pathogens but is closely related to Streptococcus mitis, with which humans live in harmony. The fact that the two species evolved from a common ancestor provides a unique basis for studies of both infection-associated properties and properties important for harmonious coexistence with the host. By detailed comparisons of genomes of the two species and other related streptococci, we identified 224 genes associated with virulence and 25 genes unique to the mutualistic species. The exclusive presence of the virulence factors in S. pneumoniae enhances their potential as vaccine components, as a direct impact on beneficial members of the commensal microbiota can be excluded. Successful adaptation of S. mitis and other commensal streptococci to a harmonious relationship with the host relied on genetic stability and properties facilitating life in biofilms.Entities:
Keywords: Streptococcus mitis; Streptococcus oralis; Streptococcus pneumoniae; bacteremia; commensal; evolution; infective endocarditis; mutualism; virulence
Mesh:
Substances:
Year: 2019 PMID: 31481387 PMCID: PMC6722419 DOI: 10.1128/mBio.01985-19
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1(A) Minimum evolution tree based on concatenated sequences of MLST loci showing the representability of the 13 S. pneumoniae indicated by colored dots. (B) Evolutionary relationships of genomes inferred from SplitsTree analysis. Phylogenetic trees of 60 genomes based on core sequence of 844,932 bp shared by all strains in the collection. The tree shows three major clusters supported by bootstrap values of 100, one consisting of S. mitis, S. pneumoniae, and S. pseudopneumoniae, a second composed of S. oralis (subsp. oralis, subsp. dentisani, subsp. tigurinus, and a genomosubspecies), and a third consisting of strains of S. infantis. The 20 S. mitis strains are segregated into two distinct clusters, one of them consisting of several minor subclusters.
FIG 2Synteny gradient display visualizing changes in synteny of seven representative genomes relative to the reference, S. mitis NCTC12261. The reference genome’s genes are colored from yellow to blue on a gradient from left to right. The figure reveals conservation of the color gradient in syntenic regions, while shared genes located in rearrangements show up as color mismatches between the reference and query genomes. Gaps in query genomes (white spaces) represent genes that are present in the reference but not the query. Query genes with paralogs are displayed in black. The figure reveals conservation of gene synteny between the three gap-free S. mitis genomes. Synteny is still mostly conserved with S. oralis Uo5, although blocks of genes breaking the color gradient appear. Given that the leftmost blocks are dark blue and the rightmost ones are light green, it is likely that these nonsyntenic blocks are the results of inversions that are symmetrical with the chromosome’s start (or terminus) of replication. Two successive nested inversions will leave the tips of the first inversion in place (breaks of color gradient), while the second inversion will restore the inner part of the gradient. Many such nested symmetric inversions then leave many nonsyntenic blocks of different colors. Similar inversions are observed between the S. mitis reference and S. pneumoniae and S. pseudopneumoniae genomes, but syntenic blocks remain larger, indicative of fewer inversion events and the close evolutionary relationship.
Genes regularly present in pathogenic S. pneumoniae in contrast to related commensal species
| Locus no. | Annotation and comment(s) | Presence (%) in: | |||
|---|---|---|---|---|---|
| SP_0041 | Bacteriocin BlpO (BlpU); downregulated in CSF ( | 100 | 100 | 0 | 0 |
| Pneumococcal adhesion and virulence protein B, Pav; | 92 | 100 | 5 | 0 | |
| Hypothetical protein; upregulated in CSF ( | 85 | 0 | 0 | 0 | |
| Glycosyl transferase | 92 | 0 | 15 | 0 | |
| SP_0109 | Putative bacteriocin | 100 | 0 | 15 | 12 |
| SP_0116 | Hypothetical protein | 100 | 67 | 0 | 0 |
| Pneumococcal surface protein A (PspA); interferes with complement activation and binds lactoferrin; significantly upregulated in blood, CSF, the heart, and epithelial contact ( | 100 | 0 | 0 | 0 | |
| SP_0124-5 | Class IIb bacteriocins, CibA and CibB | 100 | 100 | 10 | 0 |
| Glycosyl transferase, glycosyl transferase, family 2, ABC transporter, ATP-binding protein, GlcNAc-PI de- | 100 | 0 | 0 | 0 | |
| Riboflavin biosynthesis (RibD, RibE, RibA, RibC/H); essential for intracellular survival of | 100 | 100 | 5 | 6 | |
| SP_0181 | Conserved hypothetical protein | 100 | 0 | 0 | 35 |
| Putative pyruvate formate-lyase-activating enzyme, transcriptional regulator DeoR family, transcriptional regulator, PTS, putative formate acetyltransferase, transaldolase family protein, glycerol dehydrogenase (GldA) | 100 | 0 | 20 | 0 | |
| Hyaluronate lyase (Hyl) (P, H), hypothetical protein, 4-hydroxy-2-oxoglutarate aldolase, carbohydrate kinase (PfkB family), conserved domain protein, oxidoreductase (H), hyaluronate PTS transporter (L), glucuronyl hydrolase (L), preprotein translocase (YajC subunit), oligohyaluronate lyase/heparinase II/III-like protein, hypothetical protein, sugar binding transcriptional regulator (RegR) (repression of glycuronidase expression), two transposases; colonization, meningitis | 100 | 0 | 0 | 18 | |
| SP_0379 | MFS (major facilitator superfamily) permease (L); import or export of target substrates | 85 | 100 | 5 | 0 |
| SP_0389 | Hypothetical protein/cell wall-binding repeat protein | 92 | 0 | 5 | 0 |
| Choline-binding protein G (CbpG); upregulated in blood ( | 100 | 33 | 0 | 6 | |
| Mannitol PTS, transcriptional regulator | 85 | 100 | 0 | 0 | |
| SP_0448 | Hypothetical protein | 100 | 0 | 0 | 0 |
| SP_0449 | Hypothetical protein | 100 | 100 | 15 | 0 |
| SP_0455 | Hypothetical protein | 92 | 0 | 0 | 0 |
| SP_0471-2 | Conserved hypothetical proteins | 100 | 0 | 5 | 0 |
| Putative regulator, PTS, lactose specific, GH20_hexosaminidase superfamily domain protein, 6-phospho-β-galactosidase | 85 | 0 | 5 | 0 | |
| Type I restriction-modification system, M/R subunits (HsdM, HsdR); regulates capsular polysaccharide expression ( | 100 | 100 | 0 | 0 | |
| SP_0528 | Peptide pheromone BlpC | 100 | 100 | 20 | 12 |
| SP_0529 | Bacteriocin secretion accessory protein BlpC, ABC transporter | 85 | 100 | 20 | 12 |
| SP_0576 | Transcription antiterminator LicT | 92 | 100 | 5 | 0 |
| SP_0577-8 | Beta-glucoside PTS transporter (BglP), 6-phospho-beta-glucosidase (BglA-2) | 92 | 100 | 0 | 0 |
| Endonuclease; significantly downregulated in CSF ( | 92 | 100 | 0 | 0 | |
| Hypothetical protein | 92 | 0 | 0 | 0 | |
| Hypothetical protein | 85 | 67 | 0 | 0 | |
| SP_0684-5 | Bacteriocin, lactococcin 972 family protein | 92 | 100 | 10 | 12 |
| SP_0703-7 | Peptide ABC transporter; downregulated in blood ( | 100 | 0 | 15 | 6 |
| SP_0772 | Hypothetical protein | 100 | 0 | 0 | 0 |
| SP_0858-60 | Two membrane proteins, pyrrolidone-carboxylate peptidase (Pcp-1) | 100 | 100 | 0 | 0 |
| Type I restriction-modification system | 100 | 100 | 0 | 54 | |
| SP_0899 | Conserved hypothetical protein | 92 | 100 | 10 | 47 |
| SP_0950 | Acetyltransferase, GNAT family | 85 | 0 | 0 | 0 |
| SP_0994 | Hypothetical protein | 100 | 100 | 0 | 0 |
| SP_0997-8 | Hypothetical protein (SP_0997-8 constitute one ORF) | 100 | 100 | 0 | 0 |
| Iron compound ABC transporter (Pit) | 100 | 0 | 0 | 0 | |
| Phage-associated proteins, including site-specific resolvase | 100 | 0 | 0 | 6 | |
| SP_1047-8 | Hypothetical protein (part of RD6); in | 92 | 0 | 10 | 0 |
| SP_1108 | Hypothetical protein (fragment of SK142_0963) | 92 | 0 | 0 | 0 |
| SP_1210 | Putative bacteriocin immunity protein | 85 | 100 | 0 | 0 |
| SP_1223-4 | Toxin-antitoxin module | 100 | 100 | 20 | 0 |
| SP_1237 | Acetyltransferase, GNAT family | 100 | 100 | 10 | 18 |
| SP_1250 | McrBC 5-methylcytosine restriction system component | 100 | 33 | 15 | 12 |
| SP_1251 | Putative endonuclease/restriction enzyme B | 92 | 0 | 10 | 24 |
| SP_1264 | Conserved domain protein | 85 | 100 | 0 | 0 |
| SP_1427 | Peptidase, U32 family (fragment of SK142_1155) | 100 | 0 | 0 | 0 |
| Type II DNA restriction-modification system | 92 | 100 | 0 | 0 | |
| Transcriptional regulator, AraC family, drug ABC transporter | 85 | 0 | 0 | 0 | |
| SP_1493-5 | Hypothetical proteins; SP_1494 is fragment (84 aa) of 1,256- to 2,522-aa collagen binding protein (LPXTG cell wall anchor domain protein) of | 100 | 0 | 15 | 0 |
| Glutamine ABC transporter (GatB), amino acid-binding protein; adhesin upregulated in contact with lung epithelia ( | 100 | 100 | 10 | 24 | |
| Glutathione | 100 | 0 | 20 | 0 | |
| SP_1656 | Hypothetical protein | 92 | 0 | 0 | 0 |
| SP_1658 | Membrane protein | 92 | 0 | 15 | 0 |
| SP_1679 | Hypothetical protein | 100 | 100 | 10 | 0 |
| SP_1684 | PTS, IIBC components | 92 | 100 | 5 | 0 |
| SP_1694 | Acetyl xylan esterase | 100 | 100 | 0 | 0 |
| ABC transporter, unknown function | 100 | 100 | 15 | 0 | |
| SP_1795 | Beta-fructosidase; pneumonia | 100 | 100 | 5 | 0 |
| SP_1809-10 | Transcriptional regulator, hypothetical protein; tryptophan biosynthesis | 85 | 100 | 5 | 0 |
| SP_1821 | Sugar-binding transcriptional regulator, LacI family | 100 | 100 | 20 | 12 |
| SP_1822 | Hypothetical protein | 85 | 67 | 10 | 0 |
| MgtC/SapB family protein/magnesium transporter | 100 | 100 | 15 | 12 | |
| Phosphate ABC transporter, hypothetical protein, UDP-glucose 4-epimerase (GalE-2), galactose-1-phosphate uridylyltransferase (GalT-1), PhoU family transcriptional regulator, two hypothetical proteins, cell wall-anchored protein, cell wall surface anchor family protein (SP_1828-30 constitute RD11) | 100 | 100 | 20 | 0 | |
| Transcriptional regulator, MerR family, galactose operon | 100 | 100 | 15 | 24 | |
| Nicotinamide mononucleotide transporter PnuC | 100 | 100 | 20 | 71 | |
| SP_1883-5 | Putative dextran glucosidase DexS/alpha-phosphotrehalase/alpha-amylase, trehalose PTS system, IIABC components, trehalose operon transcriptional repressor (TreR) | 100 | 100 | 10 | 35 |
| SP_1894 | Sucrose phosphorylase (GtfA) | 100 | 0 | 10 | 76 |
| Raffinose ABC transporter (RafG, RafF, RafE); upregulated in contact with lung epithelia ( | 100 | 0 | 0 | 0 | |
| Alpha-galactosidase (Aga), | 100 | 0 | 10 | 76 | |
| SP_1917 | Hypothetical protein | 100 | 0 | 0 | 12 |
| SP_1918 | ABC transporter, ATP-binding protein (fragment of SK142_1598) | 100 | 0 | 0 | 0 |
| Pneumolysin (Ply), significantly upregulated in the heart ( | 100 | 100 | 15 | 0 | |
| SP_1930-3 | Hypothetical proteins | 100 | 0 | 5 | 0 |
| SP_1934-6 | Hypothetical protein, Zn-dependent peptidase ImmA, M78 family (posttranslational modification, protein turnover), type II restriction-modification system regulatory protein | 100 | 67 | 5 | 0 |
| Autolysin (LytA) significantly upregulated in the heart ( | 100 | 100 | 20 | 0 | |
| SP_1946 | Putative transcriptional regulator PlcR | 92 | 0 | 10 | 0 |
| SP_1958 | Hypothetical protein | 100 | 0 | 5 | 0 |
| SP_1989 | Transcriptional regulator PlcR | 92 | 0 | 10 | 0 |
| SP_1992 | Cell wall surface anchor family protein (“Xisco”) | 100 | 0 | 0 | 0 |
| Nicotinate-nucleotide pyrophosphorylase (NadC); downregulated in CSF ( | 100 | 0 | 0 | 0 | |
| SP_2081-3 | Conserved hypothetical protein, two-component histidine transcriptional regulator PnpR/S (TCS04); activates the expression of the genes encoding the PstI transporter and PhoU1 regulator (SP_2084-8); genetic fitness, immune evasion, controls the expression of pneumococcal surface antigen (PsaA, SP_0117) ( | 100 | 100 | 0 | 0 |
| Phosphate ABC transporter (Pst1: PstS, -C, -A, and -B proteins), phosphate system regulatory protein PhoU | 100 | 100 | 0 | 0 | |
| Choline-binding protein PcpA; significantly upregulated in the heart ( | 100 | 33 | 5 | 6 | |
| SP_2148-51 | Arginine deiminase pathway enzymes ArcA, ArgF, ArcC; orthologs in | 100 | 0 | 0 | 29 |
| SP_2152-3 | Conserved hypothetical protein, peptidase, M2i0/M25/M40 family | 100 | 0 | 0 | 0 |
| Fucose utilization operon including PTS (FucI, FucU, FucA); sepsis ( | 85 | 0 | 10 | 0 | |
| Choline-binding protein A (PspC/CbpA)/isoprenylcysteine carboxyl methyltransferase; upregulated by epithelial contact ( | 100 | 0 | 0 | 0 | |
| SP_2192-3 | Two-component histidine transcriptional regulator (TCS06); regulates CbpA (SP_2190) expression ( | 100 | 0 | 0 | 29 |
Locus numbers in boldface refer to genes identified as associated with attenuated virulence in mouse models of pneumonia and/or bacteremia in one or more of three STM studies (10–12).
CSF, cerebrospinal fluid; ORF, open reading frame.
FIG 3Comparisons of the IgA1 protease/zmpA loci in representative strains of S. mitis demonstrate that the locus is a hot spot for recombination. (A) Cluster II strains of S. mitis (represented by SK667 and SK569) show the same structure as pneumococci (represented by TIGR4). In some strains, the locus includes the paralogous zmpD gene. Strains of S. mitis cluster I lack the zmpA gene at this site, but some strains instead have a gene encoding an unrelated LPxTG-anchored protein (SK637 and B6) or a complete capsular polysaccharide biosynthesis locus (SK321). (B) Some strains of S. mitis cluster I carry the zmpA gene in the same spot as S. oralis but without the paralogous zmpC gene.
Genes present in commensal Streptococcus species but absent from S. pneumoniae
| Locus no. in | Annotation and comments | Presence (%) in: | Size of protein (aa) | |||
|---|---|---|---|---|---|---|
| SM12261_0091-2 | Gram-positive cell wall surface anchor family protein (YSIRK family); one reading frame in all other strains; N-terminal 199 nt show 87% identity, whereas remaining part shows no significant homology to | 95 | 0 | 100 | 100 | 657 |
| SM12261_0140 | MarR family transcriptional regulator (phosphoenolpyruvate-protein phosphotransferase); absent from | 100 | 0 | 65 | 0 | 81 |
| SM12261_0266 | Oligopeptide-binding protein SarA; present in four of five | 100 | 0 | 24 | 0 | 654 |
| SM12261_0268 | Amino acid permease 2 superfamily; APC (amino acid/polyamine/organocation) family permease (solute:cation symporter) involved in the uptake of a specific amino acid and/or polyamine substrate with the concomitant import of a proton | 90 | 0 | 100 | 100 | 614 |
| SM12261_0489-91 | Queuosine biosynthesis proteins QueC, QueD, and QueE; queuosine is a hypermodified guanosine analog; pseudogene of QueC consisting of first 213 bp is present in | 100 | 100 | 82 | 67 | 217, 147, 226 |
| SM12261_0495-6 | LrgA + LrgB; homologs of LrgA and LrgB/CidA and CidB in | 100 | 100 | 100 | 100 | 124–134, 231 |
| SM12261_0576-9 | Iron chelate uptake ABC transporter (FeCT family) (permease, ATP-binding protein, oligopeptide-binding protein) and putative esterase (predicted hydrolase of the alpha/beta superfamily) | 100 | 0 | 100 | 0 | 348, 251, 336, 230 |
| SM12261_0615 | Beta-glucosidase 2, glycosyl-hydrolase family 116 N-terminal domain | 100 | 100 | 100 | 83 | 102–127 |
| SM12261_0623 | Virulence factor BrkB; a homolog in | 90 | 0 | 100 | 100 | 286–294 |
| SM12261_1076 | 3-Isopropylmalate dehydratase, large subunit (LeuC); this enzyme performs the second step in the biosynthesis of leucine; the prokaryotic enzyme is a heterodimer composed of a large (LeuC) and small (LeuD) subunit; LeuD (591 nt) occurs downstream from LeuC; LeuD is present in strains of | 100 | 100 | 100 | 100 | 460 |
| SM12261_1129 | Glycerophosphoryl diester phosphodiesterase | 100 | 0 | 0 | 0 | 587 |
| SM12261_1229 | PF08002 family protein of unknown function | 95 | 0 | 59 | 50 | 153–181 |
| SM12261_1300 | Amino acid permease (LysP superfamily: | 100 | 0 | 100 | 100 | 445–470 |
| SM12261_1517 | Oligopeptide-binding protein AliD (SarA); located immediately upstream of the | 95 | 0 | 100 | 0 | 652 |
| SM12261_1764 | Hypothetical protein; contains DNA polymerase III subunits gamma and tau domain; predicted coiled-coil domain-containing protein (DUF2360) | 95 | 100 | 100 | 100 | 227–232 |
| B6:smi_1070-3 | ABC transporter, ATP-binding protein + ABC-2 transporter family protein + histidine kinase + response regulator receiver domain protein | 95 | 100 | 71 | 33 | 302, 250, 292, 225 |