| Literature DB >> 20184757 |
Antonio Frandi1, Alessio Mengoni, Matteo Brilli.
Abstract
BACKGROUND: Clostridium perfringens is a Gram-positive anaerobic bacterium causing severe diseases such as gas gangrene and pseudomembranosus colitis, that are generally due to the secretion of powerful extracellular toxins. The expression of toxin genes is mainly regulated by VirR, the response regulator of a two-component system. Up to now few targets only are known for this regulator and mainly in one strain (Strain 13). Due to the high genomic and phenotypic variability in toxin production by different strains, the development of effective strategies to counteract C. perfringens infections requires methodologies to reconstruct the VirR regulon from genome sequences.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20184757 PMCID: PMC2838878 DOI: 10.1186/1471-2180-10-65
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Figure 1Biological system and scheme of the strategy. a) The two component system VirR/VirS and its experimentally validated targets are here schematically represented. Information mainly come from studies performed in Str. 13; modified from [7]. b) Scheme illustrating the two step strategy that allowed to use information coming mainly from a single strain to do the predictions on all other strains in a less biased way. In the first step a position weight matrix (PWM) calculated from a limited number of experimentally validated motifs is used to scan the genomes and to make a list of possible targets. Within that list we looked for sequences corresponding to known targets using clustering, we retrieved their motifs and we obtained a second PWM. This includes the variability of the motif in several strains and was used for the final scan of the genomes.
Genomes and plasmids analyzed
| C.p. Strain | Type (name) | Sequencing Status | N Genes | Length (nt) |
|---|---|---|---|---|
| G | Finished | 2905 | 3085740 | |
| G | Finished | 3066 | 3256683 | |
| G | Draft | 3427 | 3896305 | |
| G | Draft | 3254 | 3661329 | |
| G | Draft | 3118 | 3510272 | |
| G | Draft | 3485 | 4045016 | |
| G | Draft | 3729 | 4127102 | |
| G | Finished | 2748 | 2921996 | |
| P (pBCNF5603) | Finished | 36 | 36695 | |
| P (pCP8533etx) | Finished | 63 | 64753 | |
| P (pCPF5603) | Finished | 73 | 75268 | |
| P (pCW3) | Finished | 51 | 47263 | |
| P (pCPF4969) | Finished | 62 | 70480 | |
| P (1) | Finished | 10 | 12397 | |
| P (2) | Finished | 11 | 12206 | |
| P (pCP13) | Finished | 63 | 54310 |
List of genomes and plasmids used in this study. The Type column indicates if a sequence is a genome (G) or a plasmid (P) in that case we also indicate the name of the plasmid within round parentheses. C.p. stands for Clostridium perfringens.
Figure 2Distribution of distances from gene. The distance of the motifs with respect to the translation start site (x-axis) is shown. Motifs are grouped by homology of the downstream gene (cluster identifier is on the y-axis). Most of the targets are located in the first 200 nt from the start of the gene, but some of them (and notably several corresponding to characterized ones) are located at larger distances. Red circles correspond to orthologous groups from Table 2.
Conserved VirR regulon
| Product | Genomes | REF | |||||||
|---|---|---|---|---|---|---|---|---|---|
| ATCC13124 | Str.13 | SM101 | F4969† | JGS1721† | JGS1495† | JGS1987† | ATCC3626† | ||
| CPF_0840 | CPE0846 | CPR_0833 | AC5_0918 | CJD_0991 | CPC_0878 | AC3_1028 | AC1_0991 | [ | |
| 1.52 | 1.52 | 1.52 | 1.52 | 1.52 | 1.52 | 1.52 | 1.52 | ||
| CPF_1204 | CPE0957 | AC5_1228 | CJD_1316 | CPC_1202 | AC3_1326 | AC1_1388 | " | ||
| 1.31 | 1.31 | 1.31 | 1.21 | 1.31 | 1.31 | 1.31 | |||
| CPF_0156 | CPE0163 | AC5_0210 | CJD_0196 | CPC_0186 | AC3_0278 | AC1_0175 | " | ||
| 1.18 | 1.18 | 1.18 | 1.18 | 1.18 | 1.18 | 1.18 | |||
| CPF_0925 | CPE0920 | * | CJD_1073 | * | AC3_1102 | AC1_1131 | " | ||
| 1.20 | 1.20 | 1.26 | 1.26 | 1.20 | 1.20 | 1.20 | |||
| CPF_1074 | CPR_0937 | ** | ** | ** | [ | ||||
| 0.88 | 1.11 | 1.03 | 1.03 | 1.03 | |||||
| CPF_0461 | CPR_0762 | AC1_0537 | " | ||||||
| 1.28 | 1.38 | 1.28 | |||||||
| AC5_0209 | CPC_0185 | ||||||||
| 1.18 | 1.18 | ||||||||
| CPE0845 | AC1_0990 | [ | |||||||
| 1.2977 | 1.29 | ||||||||
Predicted VirR regulons, only genes present in at least two genomes are shown. Numbers below each gene name correspond to the score calculated as described in Methods (on a maximum attainable score of 1.52). As described in the text, most of the known VirR targets belongs to this group. * no open reading frame identified in this region but DNA sequence identical to CPE0920; ** no open reading frame identified in this region but DNA sequence identical to CPF_1074, †: draft genomes.
Strain specific VirR targets
| Product | Gene | Score | Strain | |
|---|---|---|---|---|
| 2-keto-3-deoxygluconate kinase | AC3_0259 | 1.26 | 124 | JGS1987 |
| hypothetical protein AC3_0622 | AC3_0622 | 1.16 | 70 | JGS1987 |
| hypothetical protein AC3_A0724 | AC3_A0724 | 1.04 | 393 | JGS1987 |
| hypothetical protein AC3_A0725 | AC3_A0725 | 1.04 | 119 | JGS1987 |
| conserved hypothetical protein | AC3_A0081 | 1.11 | 180 | JGS1987 |
| resolvase/recombinase | AC3_0180 | 1.15 | 264 | JGS1987 |
| put. lipid A export ATP-binding/permease (MsbA) | AC3_0181 | 1.15 | 124 | JGS1987 |
| hypothetical protein AC3_A0587 | AC3_A0587 | 1.34 | 227 | JGS1987 |
| hypothetical protein AC3_0277 | AC3_0277 | 1.18 | 112 | JGS1987 |
| hypothetical protein AC3_A0194 | AC3_A0194 | 1.25 | 284 | JGS1987 |
| hypothetical protein AC1_A0478 | AC1_A0478 | 0.80 | 75 | ATCC 3626 |
| hypothetical protein AC5_A0236 | AC5_A0236 | 1.04 | 110 | F4969 |
| put. metal-dependent hydrolase | CPR_1028 | 1.34 | 499 | SM101 |
| hypothetical protein CJD_0545 | CJD_0545 | 0.95 | 153 | JGS1721 |
| hypothetical protein CJD_1387 | CJD_1387 | 1.30 | 75 | JGS1721 |
Genes identified as VirR targets that are present in a single strain. Strain JGS1987 suggests an expansion of the VirR regulon. †: draft genomes.
Figure 3Plasmid comparison. Here we applied one of the analysis available in the Blast2Network package [13] consisting in a comparison of plasmid gene content (see Methods for a concise description of the methodology). Values connecting nodes (plasmids) correspond to the percentage of shared genes with respect to the total number of genes of the two plasmids (Jaccard coefficient) and can be considered as a measure of relatedness in terms of evolutionary history (common ancestor) and horizontal transfers and recombination. After this analysis, two groups of plasmids emerged, that are connected through the edge between plasmids pBCNF5603 and pCP13. In the right group of plasmids we identified some VirR targets. The high similar gene content of some of the plasmids in that group may suggest a high rate of horizontal transfer/recombination between different strains, so raising the possibility of the transfer of the VirR targets. Moreover, the connection between the two groups can also suggest that transfers between the two groups of plasmids can happen.