| Literature DB >> 35852327 |
Irina Laczkovich1,2, Kyle Mangano3,2, Xinhao Shao3, Adam J Hockenberry4, Yu Gao3, Alexander Mankin3,2, Nora Vázquez-Laslop3,2, Michael J Federle3,2.
Abstract
Streptococcus pneumoniae, an opportunistic human pathogen, is the leading cause of community-acquired pneumonia and an agent of otitis media, septicemia, and meningitis. Although genomic and transcriptomic studies of S. pneumoniae have provided detailed perspectives on gene content and expression programs, they have lacked information pertaining to the translational landscape, particularly at a resolution that identifies commonly overlooked small open reading frames (sORFs), whose importance is increasingly realized in metabolism, regulation, and virulence. To identify protein-coding sORFs in S. pneumoniae, antibiotic-enhanced ribosome profiling was conducted. Using translation inhibitors, 114 novel sORFs were detected, and the expression of a subset of them was experimentally validated. Two loci associated with virulence and quorum sensing were examined in deeper detail. One such sORF, rio3, overlaps with the noncoding RNA srf-02 that was previously implicated in pathogenesis. Targeted mutagenesis parsing rio3 from srf-02 revealed that rio3 is responsible for the fitness defect seen in a murine nasopharyngeal colonization model. Additionally, two novel sORFs located adjacent to the quorum sensing receptor rgg1518 were found to impact regulatory activity. Our findings emphasize the importance of sORFs present in the genomes of pathogenic bacteria and underscore the utility of ribosome profiling for identifying the bacterial translatome. IMPORTANCE This work employed pleuromutilin-assisted ribosome profiling using retapamulin (Ribo-RET) to identify genome-wide translation start sites in the human pathogen Streptococcus pneumoniae. We identified 114 unannotated intergenic small open reading frames (sORFs). The described procedures and data sets provide a model for microbiologists seeking to explore the translational landscape of bacteria. The biological roles of four sORF examples are characterized: two control the regulation of a cell-cell communication (quorum sensing) system, one contributes to the ability of S. pneumoniae to colonize the upper respiratory tract of mice, and a fourth governs the translation of PrfB, a protein enabling ribosome release at stop codons. We propose that Ribo-RET is a valuable approach to identifying unstudied microproteins and difficult-to-find pheromone genes used by Gram-positive organisms, whose genomes are replete with pheromone receptors.Entities:
Keywords: Streptococcus pneumoniae D39; quorum sensing; ribosome profiling; small open reading frames; small proteins; translation inhibitors; translational control; virulence
Mesh:
Year: 2022 PMID: 35852327 PMCID: PMC9426450 DOI: 10.1128/mbio.01247-22
Source DB: PubMed Journal: mBio Impact factor: 7.786
FIG 1Retapamulin and lefamulin trap the ribosome near the start codon. (A) Ribosome footprint density of S. pneumoniae treated with and without retapamulin and lefamulin. (B) Example of a ribosome footprint stalled prior to the annotated start codon for spv_0776. (C) Metagene analysis of ribosome density reads (27 to 35 nt) distributed relative to the annotated start codon. (D) Distribution of the ribosome footprint length.
List of sORFs
| Ribo-seq-identified | Coordinates | Upstream flanking | Downstream flanking | Strand | Expression | Start codon | Nucleotide sequence | Peptide sequence | Small protein | Theoretic | Presence of secretion | Found within/overlapping |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 24644–24742 |
|
| + | 48 | CTG |
|
| 32 | 3.6 | No | scRNA |
|
| 29719–29778 |
|
| + | 6.25 | ATG |
|
| 19 | 2.2 | No |
|
|
| 39961–39996 |
|
| + | 53 | ATG |
|
| 11 | 1.4 | No |
|
|
| 39885–39971 |
|
| + | 24.4 | ATG |
|
| 28 | 3.3 | Yes | |
|
| 78879–78992 |
|
| + | 2.7 | ATG |
|
| 37 | 4.3 | No | spd sr8 |
|
| 86107–86193 |
|
| + | 10 | ATG |
|
| 28 | 3 | No | |
|
| 113831–113908 |
|
| + | 13 | TTG |
|
| 25 | 2.9 | No | |
|
| 113958–113987 |
|
| + | 5.4 | TTG |
|
| 9 | 1 | No | |
|
| 113770–113814 |
|
| + | 4.2 | TTG |
|
| 14 | 1.6 | No | |
|
| 118625–118672 |
|
| + | 3.8 | ATG |
|
| 15 | 1.9 | No | |
|
| 118721–118750 |
|
| + | 2.5 | TTG |
|
| 9 | 1.1 | No | |
|
| 119733–119777 |
|
| + | 40 | TTG |
|
| 14 | 1.6 | No | |
|
| 122716–122766 |
|
| + | 15 | ATG |
|
| 16 | 1.8 | No | |
|
| 122780–122845 |
|
| + | 8 | ATG |
|
| 21 | 2.2 | No | |
|
| 127035–127055 |
|
| + | 57.5 | ATG |
|
| 7 | 0.64 | No | |
|
| 132259–132294 |
|
| + | 25 | ATG |
|
| 11 | 1.2 | No |
|
|
| 149686–149748 |
|
| − | 2.1 | ATG |
|
| 20 | 2.3 | No |
|
|
| 165545–165586 |
|
| + | 12.5 | TTG |
|
| 13 | 1.5 | No | |
|
| 174277–174363 |
|
| − | 2.2 | ATG |
|
| 28 | 3.2 | No | |
|
| 186024–186104 |
|
| + | 105 | CTG |
|
| 26 | 2.7 | No | |
|
| 186133–186174 |
|
| + | 33 | ATG |
|
| 13 | 1.5 | No | |
|
| 185918–185992 |
|
| + | 12.5 | CTG |
|
| 24 | 2.6 | No | |
|
| 188045–188092 |
|
| + | 5.75 | ATG |
|
| 15 | 1.7 | No | |
|
| 195914–195985 |
|
| + | 5.5 | TTG |
|
| 23 | 2.6 | Yes | |
|
| 195901–195972 |
|
| + | 20 | TTG |
|
| 23 | 2.5 | No | |
|
| 202954–203037 |
|
| + | 40 | TTG |
|
| 27 | 3.1 | Yes | |
|
| 203084–203146 |
|
| + | 8.8 | TTG |
|
| 20 | 2.3 | No | |
|
| 249098–249157 |
|
| + | 1.68 | ATG |
|
| 19 | 2.1 | No | |
|
| 272815–272874 |
|
| + | 11.3 | TTG |
|
| 19 | 2 | No | |
|
| 306391–306480 |
|
| + | 18 | ATG |
|
| 29 | 3.4 | No | |
|
| 400274–400312 |
|
| + | 28 | ATG |
|
| 13 | 1.3 | No | |
|
| 402069–402098 |
|
| + | 11.9 | ATG |
|
| 9 | 1.1 | No | |
|
| 401926–401958 |
|
| + | 11 | TTG |
|
| 10 | 1.1 | No | |
|
| 408631–408684 |
|
| + | 18 | ATG |
|
| 17 | 1.9 | Yes | |
|
| 411179–411250 |
|
| + | 6.1 | ATG |
|
| 23 | 2.8 | No | |
|
| 444912–444977 |
|
| + | 12.5 | CTG |
|
| 21 | 2.3 | No | |
|
| 485231–485263 |
|
| + | 3.5 | TTG |
|
| 10 | 1.2 | No | |
|
| 497485–497523 |
|
| + | 10 | ATG |
|
| 12 | 1.5 | No | |
|
| 508756–508830 |
|
| + | 30 | ATG |
|
| 24 | 2.9 | No |
|
|
| 565754–565780 |
|
| + | 17 | ATG |
|
| 8 | 1 | No | |
|
| 569211–569276 |
|
| + | 24 | ATG |
|
| 21 | 2.6 | No | |
|
| 569735–569764 |
|
| + | 9.4 | TTG |
|
| 9 | 1 | No | |
|
| 616529–616573 |
|
| + | 4.4 | ATG |
|
| 14 | 1.7 | No | |
|
| 640777–640818 |
|
| + | 2.3 | ATG |
|
| 13 | 1.5 | No | |
|
| 653885–653935 |
|
| − | 11.8 | ATG |
|
| 16 | 1.7 | No | |
|
| 657363–657389 |
|
| + | 275 | TTG |
|
| 8 | 0.9 | No | |
|
| 669265–669354 |
|
| + | 1.3 | ATG |
|
| 29 | 3.5 | No | |
|
| 676352–676426 |
|
| + | 182 | ATG |
|
| 24 | 2.8 | No | |
|
| 684026–684079 |
|
| + | 750 | ATG |
|
| 17 | 2.1 | No | |
|
| 738388–738480 |
|
| + | 1.4 | CTG |
|
| 30 | 3.7 | No | |
|
| 764625–764684 |
|
| + | 2 | ATG |
|
| 19 | 2.3 | No | |
|
| 767622–767705 |
|
| + | 20 | TTG |
|
| 27 | 3.1 | Yes | |
|
| 773033–773068 |
|
| + | 57.5 | TTG |
|
| 11 | 1.4 | No | |
|
| 863206–863247 |
|
| + | 9.6 | CTG |
|
| 13 | 1.5 | No | |
|
| 865083–865118 |
|
| + | 14 | TTG |
|
| 11 | 1.2 | No | |
|
| 1051996–1052016 |
|
| − | 200 | TTG |
|
| 6 | 0.7 | No |
|
|
| 1036860–1036916 |
|
| − | 9.9 | CTG |
|
| 18 | 2.1 | No | |
|
| 962247–962309 |
|
| − | 14.35 | TTG |
|
| 20 | 2.2 | No | |
|
| 958487–958513 |
|
| − | 6.7 | TTG |
|
| 8 | 0.95 | No | |
|
| 921151–921201 |
|
| − | 13.8 | ATG |
|
| 16 | 2 | No | |
|
| 917345–917374 |
|
| − | 25.5 | ATG |
|
| 9 | 1 | No | |
|
| 1085315–1085347 |
|
| − | 1.6 | ATG |
|
| 10 | 1.2 | No | |
|
| 1153159–1153251 |
|
| − | 5.5 | ATG |
|
| 30 | 3.5 | No | |
|
| 1189201–1189269 |
|
| − | 20 | ATG |
|
| 22 | 2.5 | No | |
|
| 1190190–1190255 |
|
| − | 1.83 | ATG |
|
| 21 | 2.6 | No | |
|
| 1242216–1242290 |
|
| + | 2 | ATG |
|
| 26 | 2.8 | No | |
|
| 1242341–1242406 |
|
| + | 0.9 | TTG |
|
| 21 | 2.5 | No | |
|
| 1250407–1250430 |
|
| − | 1.75 | ATG |
|
| 7 | 0.8 | No | |
|
| 1250035–1250073 |
|
| − | 2 | TTG |
|
| 12 | 1.2 | No | |
|
| 1264626–1264679 |
|
| − | 18 | ATG |
|
| 17 | 1.9 | No | |
|
| 1274211–1274282 |
|
| − | 5.4 | ATG |
|
| 23 | 2.8 | No | |
|
| 1277771–1277803 |
|
| − | 1.4 | ATG |
|
| 10 | 1.1 | No | |
|
| 1287975–1288013 |
|
| + | 32 | ATG |
|
| 12 | 1.5 | No | |
|
| 1297923–1297994 |
|
| + | 9 | ATG |
|
| 23 | 2.7 | No | |
|
| 1357604–1357621 |
|
| − | 5 | ATG |
|
| 5 | 0.6 | No | |
|
| 1357729–1357752 |
|
| − | 30 | ATG |
|
| 7 | 0.9 | No | |
|
| 1404509–1404604 |
|
| − | 27 | CTG |
|
| 31 | 3.4 | No | |
|
| 1433395–1433523 |
|
| − | 5.4 | TTG |
|
| 42 | 5.2 | No | |
|
| 1456875–1456949 |
|
| − | 117 | ATG |
|
| 24 | 2.7 | Yes | |
|
| 1457861–1457905 |
|
| − | 14.9 | TTG |
|
| 14 | 1.5 | No | |
|
| 1480252–1480278 |
|
| + | 1.6 | TTG |
|
| 8 | 1.1 | No | |
|
| 1528556–1528618 |
|
| − | 120 | ATG |
|
| 20 | 2.3 | No |
|
|
| 1539897–1539920 |
|
| − | 775 | ATG |
|
| 7 | 0.9 | No | |
|
| 1539963–1540064 |
|
| − | 358 | ATG |
|
| 33 | 4.1 | Yes | |
|
| 1579688–1579789 |
|
| − | 140 | CTG |
|
| 33 | 3.8 | No | |
|
| 1673721–1673771 |
|
| − | 16 | TTG |
|
| 16 | 1.9 | No | |
|
| 1721860–1721904 |
|
| − | 5.6 | ATG |
|
| 14 | 1.5 | No | |
|
| 1731142–1731255 |
|
| − | 46 | ATG |
|
| 37 | 4.3 | Yes | |
|
| 1786300–1786332 |
|
| − | 5 | TTG |
|
| 10 | 1.1 | No | |
|
| 1796868–1796975 |
|
| − | 9.3 | TTG |
|
| 35 | 4.2 | No | |
|
| 1814148–1814210 |
|
| − | 2.8 | ATG |
|
| 21 | 2.5 | No | |
|
| 1814385–1814408 |
|
| − | 1.6 | ATG |
|
| 7 | 0.8 | No | |
|
| 1858646–1858711 |
|
| − | 41.08 | TTG |
|
| 21 | 2.8 | No | |
|
| 1907589–1907618 |
|
| − | 29.8 | ATG |
|
| 9 | 1.2 | No | |
|
| 2006759–2006857 |
|
| − | 37 | CTG |
|
| 32 | 3.7 | No | |
|
| 2012589–2012618 |
|
| − | 5 | ATG |
|
| 9 | 1 | No | |
|
| 105596–105616 |
|
| + | 425 | ATG |
|
| 6 | 0.7 | No | |
|
| 120796–120822 |
|
| + | 15 | ATG |
|
| 8 | 0.9 | No | |
|
| 120952–120984 |
|
| + | 5.5 | ATG |
|
| 10 | 1.2 | No | |
|
| 121373–121429 |
|
| + | 8.5 | ATG |
|
| 18 | 1.9 | Yes | |
|
| 141806–141937 |
|
| + | 38 | ATG |
|
| 43 | 5.2 | No | |
|
| 420072–420098 |
|
| + | 2 | CTG |
|
| 8 | 0.9 | No | |
|
| 742170–742214 |
|
| + | 55 | ATG |
|
| 14 | 1.6 | No | |
|
| 811459–811533 |
|
| + | 27 | ATG |
|
| 24 | 2.7 | No | |
|
| 849735–849788 |
|
| + | 35 | CTG |
|
| 17 | 2 | No | |
|
| 1032903–1032932 |
|
| − | 12 | ATG |
|
| 9 | 1 | No | |
|
| 1037861–1037899 |
|
| − | 9.2 | TTG |
|
| 12 | 1.3 | No | |
|
| 1037928–1037963 |
|
| − | 7.9 | ATG |
|
| 11 | 1.3 | No | |
|
| 1038016–1038045 |
|
| − | 18 | ATG |
|
| 9 | 1 | No | |
|
| 1278108–1278194 |
|
| − | 9.1 | ATG |
|
| 28 | 3.3 | No | |
|
| 1469164–1469196 |
|
| − | 6.8 | ATG |
|
| 10 | 1.1 | No | |
|
| 1619475–1619567 |
|
| − | 4.9 | TTG |
|
| 30 | 3.7 | Yes | |
|
| 1619912–1619941 |
|
| − | 12.5 | ATG |
|
| 9 | 0.9 | No | |
|
| 1751386–1751403 |
|
| + | 1.2 | ATG |
|
| 5 | 0.7 | No |
See reference 65.
See reference 66.
See references 2 and 3. scRNA, small cytoplasmic RNA.
sRNAs involved in virulence
| sRNA | Tigr4 flanking gene | Tigr4 flanking gene | sRNA D39 homolog | Host | Fitness (<1, fitness defect) | sORF | Expression (rpm) |
|---|---|---|---|---|---|---|---|
| F38 |
|
|
| Nasopharynx | 0 |
| 115 |
| SN39 |
|
| Nasopharynx | 0 |
| 550 | |
| F52 |
|
|
| Nasopharynx | 0 |
| 208 |
| trn0760 |
|
| Nasopharynx | 0 |
| 73 |
See references 1 and 67.
FIG 2Identification and validation of unannotated sORFs. (A) Violin plot showing the distribution of protein lengths (amino acids [aa]) encoded by the 114 sORFs identified. (B) Start codon identity distribution of the sORFs. (C) Western blotting of C-terminally sfGFP-tagged sORFs expressed from their native locus.
sORFs identified by Ribo-seq are conserved among other Streptococcus pneumoniae serotypes
| sORF length (nt) | sORF | Presence of sORF in strain | |||||
|---|---|---|---|---|---|---|---|
| P1031 (serotype 1; GenBank accession no. | TIGR4 (serotype 4; GenBank accession no. | JJA (serotype 14; GenBank accession no. | Hungary 19A-6 (serotype 19A; GenBank accession no. | ATCC 700669 (serotype 23F; GenBank accession no. | Taiwan 19F-14 (serotype 19F; GenBank accession no. | ||
| 32 |
| X | X | X | |||
| 19 |
| X | X | X | X | X | |
| 11 |
| X | X | X | X | X | X |
| 28 |
| X | X | X | X | X | X |
| 37 |
| X | X | X | X | X | X |
| 28 |
| X | X | X | X | X | X |
| 25 |
| X | X | ||||
| 9 |
| X | |||||
| 14 |
| X | X | ||||
| 15 |
| X | X | ||||
| 9 |
| X | X | ||||
| 14 |
| X | X | X | X | ||
| 16 |
| X | X | ||||
| 21 |
| X | X | X | |||
| 7 |
| X | X | X | |||
| 11 |
| X | X | ||||
| 20 |
| X | X | X | X | X | X |
| 13 |
| X | X | X | X | X | X |
| 28 |
| X | X | X | X | X | X |
| 26 |
| X | X | ||||
| 13 |
| X | X | X | |||
| 24 |
| X | |||||
| 15 |
| X | X | X | X | X | X |
| 23 |
| X | X | X | X | X | X |
| 23 |
| X | X | X | X | X | X |
| 27 |
| X | X | X | X | X | X |
| 20 |
| X | X | X | X | X | X |
| 19 |
| X | X | X | X | X | X |
| 19 |
| X | X | X | X | X | X |
| 29 |
| X | X | X | |||
| 13 |
| X | X | X | X | X | X |
| 9 |
| X | X | X | X | ||
| 10 |
| X | X | X | X | X | |
| 17 |
| ||||||
| 23 |
| X | X | X | X | X | X |
| 21 |
| X | X | X | X | X | X |
| 10 |
| X | X | X | X | X | |
| 12 |
| X | X | ||||
| 24 |
| X | |||||
| 8 |
| X | X | X | X | ||
| 21 |
| X | |||||
| 9 |
| X | X | ||||
| 14 |
| X | X | X | X | X | X |
| 13 |
| X | X | X | X | X | X |
| 16 |
| X | X | X | X | X | X |
| 8 |
| X | X | X | |||
| 29 |
| X | X | X | X | X | X |
| 24 |
| X | X | X | X | X | X |
| 17 |
| X | X | X | X | ||
| 30 |
| X | X | X | |||
| 19 |
| X | X | X | X | X | X |
| 27 |
| X | X | X | X | X | |
| 11 |
| X | X | ||||
| 13 |
| X | X | X | X | X | X |
| 11 |
| X | X | ||||
| 6 |
| X | X | X | X | ||
| 18 |
| X | X | X | X | ||
| 20 |
| X | X | X | X | X | X |
| 8 |
| X | |||||
| 16 |
| X | X | X | |||
| 9 |
| X | X | X | |||
| 10 |
| X | X | X | X | ||
| 30 |
| X | X | X | X | X | X |
| 22 |
| X | X | X | X | X | |
| 21 |
| X | |||||
| 26 |
| X | X | X | X | X | X |
| 21 |
| X | X | X | X | X | X |
| 7 |
| X | X | X | |||
| 12 |
| X | X | X | X | X | X |
| 17 |
| X | X | X | X | X | X |
| 23 |
| X | X | X | X | X | X |
| 10 |
| ||||||
| 12 |
| X | X | X | X | X | X |
| 23 |
| X | X | X | X | X | |
| 5 |
| X | X | X | X | ||
| 7 |
| X | X | X | X | ||
| 31 |
| X | X | X | X | X | X |
| 42 |
| X | X | X | X | X | X |
| 24 |
| X | X | X | X | X | X |
| 14 |
| X | X | X | X | X | X |
| 8 |
| X | X | ||||
| 20 |
| X | X | X | X | X | X |
| 7 |
| X | X | X | |||
| 33 |
| X | X | X | |||
| 33 |
| X | |||||
| 16 |
| X | X | X | X | X | X |
| 14 |
| X | X | X | X | X | X |
| 37 |
| X | X | X | X | X | X |
| 10 |
| ||||||
| 35 |
| X | X | X | X | X | X |
| 21 |
| X | X | X | X | ||
| 7 |
| X | X | X | X | ||
| 21 |
| X | X | X | X | X | X |
| 9 |
| X | X | X | |||
| 32 |
| X | X | X | X | X | X |
| 9 |
| X | X | ||||
| 6 |
| X | X | X | X | ||
| 8 |
| X | |||||
| 10 |
| X | |||||
| 18 |
| X | X | X | |||
| 43 |
| X | X | X | X | X | X |
| 8 |
| X | X | X | |||
| 14 |
| X | X | X | X | X | X |
| 24 |
| X | X | X | X | X | X |
| 17 |
| X | X | X | X | ||
| 9 |
| X | X | X | X | X | |
| 12 |
| X | X | X | X | X | X |
| 11 |
| X | X | X | X | ||
| 9 |
| X | X | X | X | ||
| 28 |
| X | X | X | X | X | X |
| 10 |
| X | |||||
| 30 |
| X | X | X | X | X | X |
| 9 |
| X | X | X | X | X | |
| 5 |
| X | X | X | X | ||
sORFs highlighted in boldface type were too short for tBLASTn analysis, so we assessed their conservation by looking for conserved nucleotide sequences.
Nucleotide sequence not conserved in the 6 serotypes but found in other strains.
FIG 3Identification of two novel sORFs found near the uncharacterized transcriptional regulator Rgg1518. (A) Ribosome footprint density profiles of rio83 and rio84 found near spv_1518 (Rgg1518). Blue arrows represent sORFs identified by Ribo-RET, and gray arrows represent previously annotated ORFs. (B) Volcano plot of wild-type D39 versus Δrgg1518 transcript fold changes. Genes of interest with the highest fold change differences are indicated on the graph. (C) qRT-PCR validation of spv_1517 expression in wild-type (WT) D39 versus the Δrgg1518 mutant.
FIG 4rio84 encodes the signaling peptide for the Rgg1518 quorum sensing system. (A) Schematic of the luciferase reporter integrated into the bgaA locus of S. pneumoniae D39. The black arrows indicate the promoter. (B) P is induced when grown in CDM and upon the constitutive expression of rio84 in the background of the Δrio83 Δrio84 strain. (C) Induction of P upon the addition of 10 μM synthetic C6, C8, and C12 Rio84 peptides. The data shown are representative of results from three independent experiments.
FIG 5Expression of rio83 in the absence of rio84 represses luciferase activity. (A) P induction in the presence of 10 μM full-length synthetic Rio83. (B) P induction in different knockout strains in the presence of 10 μM synthetic C12. The data shown are representative of results from three independent experiments.
FIG 6rio3 is important for nasopharyngeal colonization in a pneumonia mouse model. (A) Ribosome footprint of the blpK operon. Arrows in blue represent sORFs identified by Ribo-RET, and arrows in gray represent ORFs annotated previously. (B) Growth curve of wild-type and mutant strains in CDM over a span of 6 h. (C) Six-week-old BALB/c mice were inoculated with 1 × 107 CFU/25 μL of either the wild type, the rio03 mutant, the rio03 complemented strain, or the Δsrf-02 mutant. The nasal passages were collected at 24 h postinfection, homogenized, and plated to determine the bacterial burden. Statistical significance was determined using Kruskal-Wallis analysis. **** denotes a P value of <0.0001.