| Literature DB >> 25526982 |
Dake Zhang1, Nan Du1, Sufang Ma2, Qingtao Hu3, Guangwen Lu2, Wei Chen1, Changqing Zeng4.
Abstract
The Streptococcus suis serotype 2 (S. suis 2) isolates 05ZYH33 and 98HAH33 have caused severe human infections in China. Using a strand-specific RNA-seq analysis, we compared the in vitro transcriptomes of these two Chinese isolates with that of a reference strain (P1/7). In the 89K genomic island that is specific to these Chinese isolates, a toxin-antitoxin system showed relatively high levels of transcription among the S. suis. The known virulence factors with high transcriptional activity in these two highly-pathogenic strains are mainly involved in adhesion, biofilm formation, hemolysis and the synthesis and transport of the outer membrane protein. Furthermore, our analysis of novel transcripts identified over 50 protein-coding genes with one of them encoding a toxin protein. We also predicted over 30 small RNAs (sRNAs) in each strain, and most of them are involved in riboswitches. We found that six sRNA candidates that are related to bacterial virulence, including cspA and rli38, are specific to Chinese isolates. These results provide insight into the factors responsible for the difference in virulence among the different S. suis 2 isolates.Entities:
Keywords: Novel transcripts; Streptococcus suis serotype 2; Transcriptome; Virulence factor; sRNA
Mesh:
Substances:
Year: 2014 PMID: 25526982 PMCID: PMC4411499 DOI: 10.1016/j.gpb.2014.11.001
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Transcriptome sequencing and mapping statistics for the three S. suis 2 strains
| 05ZYH33 | 50,554,946 | 50,553,160 (99.999%) | 47,527,320 (94.01%) | 39,020,066 (82.1%) | 1285 | 1216 |
| 98HAH33 | 33,429,474 | 33,406,466 (99.93%) | 31,581,564 (94.54%) | 27,182,870 (86.07%) | 1332 | 1211 |
| P1/7 | 47,162,906 | 47,139,616 (99.95%) | 44,877,096 (95.2%) | 40,313,416 (92.43%) | 1222 | 1216 |
Note: Different coverage thresholds were used for identification of TARs (>70 bp in length) in the three strains, which are 9 × for 05ZYH33, 7 × for 98HAH33 and 16 × for P1/7, respectively. All the percentages in the parenthesis beside the absolute read No. were obtained by dividing the target read number by the one listed in the preceding column on its left. For example, percentage of “No. of reads after rRNA removal” = “No. of reads after rRNA removal” (50,553,160)/“No. of raw reads” (50,554,946) = 99.99%. TAR, transcriptionally-active region.
Figure 1The pipeline for
Sequencing reads were first mapped to reference genome. Sequencing reads in genic region are used to calculate the expression levels of each gene, and then DEGs are identified among three isolates. Sequencing reads found in intergenic regions are used to define novel TARs, which are annotated based on homology analysis. DEG, differentially-expressed gene; TAR, transcriptionally-active region; GI, genomic island.
Figure 2DEGs in the Chinese isolates shared a similar expression pattern
In total 120 DEGs are shared by both Chinese isolates when compared with P1/7. The hierarchical heatmap (R package: pheatmap) illustrates transcriptional levels (log2 ratios of RPKM, scale is shown at the bottom) for all the DEGs in three strains. The expression levels of DEGs in two Chinese isolates are more similar compared to P1/7.
Figure 3COG analysis of homologous DEGs
The pie chart showed the COG analysis for the classification of homologous DEGs in the 05ZYH33 and 98HAH33 isolates, compared to the P1/7 reference strain. The general categories are indicated on the outer circle and the subgroups are indicated in the inner circle with the number of genes belonging to each group listed.
DEGs in PTS and glycolysis/gluconeogenesis pathways
| 98HAH33 | SSU98_1051 | 3.05 | PTS cellobiose-specific component IIA | |
| SSU98_0384 | 1.92 | PTS IIC component, glucose | ||
| SSU98_1233 | −1.37 | PTS, mannose | ||
| SSU98_0191 | −1.38 | Hypothetical protein | ||
| SSU98_0439 | −1.43 | PTS, mannose | ||
| SSU98_0210 | −1.49 | PTS cellobiose-specific component IIC | ||
| SSU98_2074 | −1.62 | PTS cellobiose-specific component IIC | ||
| SSU98_0712 | −1.66 | PTS cellobiose-specific component IIC | ||
| SSU98_1232 | −1.71 | PTS, mannose | ||
| SSU98_0440 | −1.80 | PTS, mannose | ||
| SSU98_1501 | −6.74 | PTS IIC component, glucose | ||
| 05ZYH33 | SSU05_1038 | 1.11 | PTS cellobiose-specific component IIA | |
| SSU05_0398 | 1.56 | PTS IIC component, glucose/maltose/ | ||
| SSU05_1218 | −2.59 | PTS, mannose/fructose/ | ||
| SSU05_0189 | −1.77 | PTS system ascorbate-specific transporter subunit IIC | ||
| SSU05_0451 | −2.19 | PTS, mannose/fructose/ | ||
| SSU05_0212 | −3.43 | PTS cellobiose-specific component IIC | ||
| SSU05_2071 | −1.67 | PTS cellobiose-specific component IIC | ||
| SSU05_0713 | −3.26 | PTS cellobiose-specific component IIC | ||
| SSU05_1217 | −3.05 | PTS, mannose/fructose/ | ||
| SSU05_0452 | −2.33 | PTS, mannose/fructose/ | ||
| SSU05_1490 | −5.40 | PTS IIC component, glucose/maltose/ | ||
| 98HAH33 | 2.7.1.69 | SSU98_0384 | 1.92 | PTS IIC component, glucose |
| 3.2.1.86 | SSU98_2083 | −1.02 | Alpha-galactosidase | |
| 1.1.1.1 | SSU98_0274 | −1.63 | AdhA | |
| 1.1.1.1 | SSU98_0275 | −1.92 | Bifunctional acetaldehyde-CoA | |
| 3.2.1.86 | SSU98_1500 | −6.14 | Beta-glucosidase | |
| 05ZYH33 | 2.7.1.69 | SSU05_0398 | 1.56 | PTS IIC component, glucose/maltose/ |
| 3.2.1.86 | SSU05_2079 | −1.69 | Alpha-galactosidase/6-phospho-beta-glucosidase | |
| 1.1.1.1 | SSU05_0279 | −2.29 | Alcohol dehydrogenase | |
| 1.1.1.1 | SSU05_0280 | −3.01 | Bifunctional acetaldehyde-CoA/alcohol dehydrogenase | |
| 3.2.1.86 | SSU05_1489 | −5.05 | Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase | |
Note: FC stands for fold change, which was calculated by log2 expression of the respective Chinese isolate/expression of P1/7. PTS, phosphotransferase system. EC ID refers to the enzyme commission number in KEGG.
Differentially-expressed virulence factors
| Upregulated | VFG1497 | SSU05_0111/SSU98_0114/SSU0114 | 152.6/292.4/79.3 | Putative ABC transporter membrane protein | |
| Downregulated | VFG1855 | SSU05_0148/SSU98_0152/SSU0147 | 2680.4/2435.8/3243.0 | Hsp60, 60 K heat shock protein HtpB | |
| VFG0869 | SSU05_1381/SSU98_1396/SSU0466 | 147.1/212.3/679.6 | AatC ATB binding protein of ABC transporter | ||
| VFG0594 | SSU05_1387/SSU98_1401/SSU1215 | 135.3/359.7/912.5 | Pathogenicity island encoded protein: SPI3 | ||
| VFG0317 | SSU05_1281/SSU98_1295 | 119.3/90.4 | Lipopolysaccharide 1,2-glucosyltransferase | ||
| VFG2194 | SSU05_2090/SSU98_2093 | 64.5/81.1 | Unknown | ||
| VFG1462 | SSU05_0588/SSU98_0592 | 23.4/33.5 | l0015 | Putative IS protein | |
| VFG1513 | SSU05_0587/SSU98_0591 | 15.2/16.1 | IS66-like transposase | ||
| VFG0284 | SSU05_0973/SSU98_0987 | 0.8/1.9 | Cag pathogenicity island protein, DNA transfer protein | ||
| VFG2173 | SSU05_0965/SSU98_0978 | 0.6/1.0 | Aggregation substance Asa1 | ||
| VFG1453 | SSU05_0979/SSU98_0993 | 0.2/0.7 | Putative DNA methylase | ||
| Upregulated | – | SSU05_0753/SSU98_0756/SSU0706 | 4575.6/6524.3/2045.4 | Muramidase release protein | |
| – | SSU05_0155/SSU98_0158/SSU0153 | 19380.7/20694.6/16530.14 | Glyceraldehyde-3-phosphate dehydrogenase | ||
| Downregulated | – | SSU05_0841/SSU98_0841/SSU0785 | 423.3/339.3/506.326 | Elongation factor | |
| Mixed | – | SSU05_1403/SSU98_1416/SSU1231 | 137.9/677.642/440.004 | Suilysin | |
| – | SSU05_0564/SSU98_0567/SSU0515 | 1026.3/441.9/704.4 | Capsular polysaccharide | ||
Note: ID for the genes, which essentially encode the same virulence factor in the strains and their corresponding RPKM, were listed in the order of 05ZYH33, 98HAH33 and P1/7. For example, Gene ID SSU05_0111/SSU98_0114/SSU0114 represent SSU05_0111 in 05ZYH33, SSU98_0114 in 98HAH33 and SSU0114 in P1/7, which all encode ORF46, and their expression level in terms of RPKM is 152.6 in 05ZYH33, 292.4 in 98HAH33 and 79.3 in P1/7, respectively. Mixed means that the expression of genes was upregulated in one Chinese isolate but downregulated in the other isolate.
Figure 4Transcriptional activity of genes located in 89K GI of the Chinese isolates
The histogram demonstrates the distribution of genes at different transcription level (RPKM values) for 98HAH33 (A) and 05ZYH33 (B). The group “RPKM <1” contains the highest number of genes, indicating the low transcriptional activity of 89K GI region during in vitro incubation of these isolates.
Top 10 most highly-expressed genes specific to the Chinese isolates
| 05ZYH33 | 1 | SSU05_0582 | 2067.935 | COG3039L | |
| 2 | SSU05_1895 | 1461.981 | – | Hypothetical protein | |
| 3 | SSU05_1896 | 1428.723 | COG3871R | Hypothetical protein | |
| 4 | SSU05_0177 | 1284.939 | – | Extracellular protein | |
| 5 | SSU05_0178 | 1190.736 | – | Epf-like protein | |
| 6 | SSU05_0179 | 1149.505 | – | ||
| 7 | SSU05_0373 | 1073.18 | COG1335Q | Nicotinamidase-like amidase | |
| 8 | SSU05_0372 | 782.191 | COG1670J | Histone acetyltransferase HPA2-like acetyltransferase | |
| 9 | SSU05_0553 | 664.709 | – | Hypothetical protein | |
| 10 | SSU05_1990 | 660.201 | COG3481R | HD-superfamily hydrolase | |
| 98HAH33 | 1 | SSU98_0181 | 2124.663 | – | |
| 2 | SSU98_0586 | 1986.35 | COG3039L | ||
| 3 | SSU98_0179 | 1799.871 | – | Extracellular protein | |
| 4 | SSU98_0180 | 1684.485 | – | Epf-like protein | |
| 5 | SSU98_1014 | 1076.934 | – | Hypothetical protein | |
| 6 | SSU98_0364 | 1049.139 | COG1335Q | Nicotinamidase-like amidase | |
| 7 | SSU98_1026 | 986.816 | COG0448G | ADP-glucose pyrophosphorylase | |
| 8 | SSU98_1897 | 972.083 | – | Hypothetical protein | |
| 9 | SSU98_0363 | 876.425 | COG1670J | Histone acetyltransferase HPA2-like acetyltransferase | |
| 10 | SSU98_1898 | 796.214 | COG3871R | Hypothetical protein | |
Note: The toxin–antitoxin system components with high transcription level in both virulent isolates are highlighted in bold. RPKM, reads per kilo bases per million reads; COG, clusters of orthologous groups.
Novel TARs with homologous genes having known functions
| 05ZYH33 | 05zyh33_new1 | + | 24,422 | 24,911 | 490 | GI: 386587281 | NC_017622.1 | Rod shape-determining protein MreD |
| 05zyh33_new6 | + | 84,830 | 84,947 | 118 | GI: 403060703 | NC_018526.1 | 50S ribosomal protein L36 | |
| 05zyh33_new7 | + | 208,869 | 209,049 | 181 | GI: 403060703 | NC_018526.1 | Copper-transporting ATPase | |
| 05zyh33_new8 | − | 464,948 | 465,239 | 292 | GI: 386587281 | NC_017622.1 | Transposase | |
| 05zyh33_new16 | + | 566,368 | 566,536 | 169 | GI: 403060703 | NC_018526.1 | ||
| 05zyh33_new22 | − | 933,091 | 933,265 | 175 | GI: 386585202 | NC_017621.1 | Lantibiotic protein | |
| 05zyh33_new23 | − | 999,593 | 999,674 | 82 | GI: 403060703 | NC_018526.1 | Helicase subunit of the DNA excision repair complex | |
| 05zyh33_new34 | − | 1,578,674 | 1,578,830 | 157 | GI: 403060703 | NC_018526.1 | Serum opacity factor | |
| 05zyh33_new48 | − | 1,915,147 | 1,915,405 | 259 | GI: 403060703 | NC_018526.1 | Toxin–antitoxin system, toxin protein | |
| 98HAH33 | 98hah33_new5 | + | 84,826 | 84,943 | 118 | GI: 403060703 | NC_018526.1 | 50S ribosomal protein L36 |
| 98hah33_new7 | + | 208,714 | 208,894 | 181 | GI: 403060703 | NC_018526.1 | Copper-transporting ATPase | |
| 98hah33_new17 | + | 566,231 | 566,399 | 169 | GI: 403060703 | NC_018526.1 | ||
| 98hah33_new24 | − | 932,796 | 932,970 | 175 | GI: 386585202 | NC_017621.1 | Lantibiotic protein | |
| 98hah33_new25 | − | 999,285 | 999,366 | 82 | GI: 403060703 | NC_018526.1 | Helicase subunit of the DNA excision repair complex | |
| 98hah33_new27 | − | 1,116,314 | 1,116,593 | 280 | GI: 403060703 | NC_018526.1 | Membrane associated protein | |
| 98hah33_new45 | − | 1,892,560 | 1,892,725 | 166 | GI: 403060703 | NC_018526.1 | 50S ribosomal protein L33 | |
| 98hah33_new48 | − | 1,914,521 | 1,914,779 | 259 | GI: 403060703 | NC_018526.1 | Toxin–antitoxin system, toxin protein | |
Note: GI number is listed for each homologous gene. TAR, transcriptionally-active region.
Novel sRNAs potentially regulating bacterial virulence
| 05ZYH33 | 05seq336 | − | 389,484 | 392,352 | 389,489 | 389,850 | 362 | 1 | 428 | |
| 05seq972 | + | 1,825,119 | 1,830,751 | 1,830,583 | 1,830,751 | 169 | 1 | 169 | ||
| 05seq1101 | + | 1,993,916 | 1,994,224 | 1,993,916 | 1,993,992 | 77 | 10 | 91 | ||
| 05seq710 | + | 961,698 | 961,817 | 961,698 | 961,817 | 120 | 15 | 134 | ||
| 98HAH33 | 98seq331 | − | 391,826 | 392,204 | 391,827 | 392,185 | 359 | 1 | 428 | |
| 98seq1086 | − | 2,057,227 | 2,057,829 | 2,057,231 | 2,057,296 | 66 | 1 | 71 | ||
| 98seq1021 | + | 1,824,549 | 1,830,052 | 1,829,956 | 1,830,052 | 97 | 1 | 97 | ||
| 98seq1137 | + | 1,993,284 | 1,993,592 | 1,993,284 | 1,993,360 | 77 | 10 | 91 | ||
| 98seq696 | + | 961,400 | 961,519 | 961,400 | 961,519 | 120 | 15 | 134 | ||
| P1/7 | 17seq501 | − | 635,186 | 635,918 | 635,187 | 635,263 | 77 | 1 | 77 | |
| 17seq1095 | − | 1,905,111 | 1,905,186 | 1,905,111 | 1,905,186 | 76 | 1 | 91 | ||
| 17seq916 | + | 1,736,325 | 1,741,929 | 1,741,790 | 1,741,929 | 140 | 1 | 140 | ||
Note: TAR, transcriptionally-active region. Mapped starts and ends indicate the region of TARs that is aligned with homologous genes. Starts and ends of sRNA indicate the corresponding part of sRNA. TARs that are shared in all three strains are highlighted in bold in description.