| Literature DB >> 23691061 |
David W Eyre1, A Sarah Walker, Jane Freeman, Simon D Baines, Warren N Fawley, Caroline H Chilton, David Griffiths, Alison Vaughan, Derrick W Crook, Tim E A Peto, Mark H Wilcox.
Abstract
BACKGROUND: Clostridium difficile whole genome sequencing has the potential to identify related isolates, even among otherwise indistinguishable strains, but interpretation depends on understanding genomic variation within isolates and individuals.Entities:
Mesh:
Year: 2013 PMID: 23691061 PMCID: PMC3655144 DOI: 10.1371/journal.pone.0063540
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Gut model samples.
A schematic diagram of the gut model experiments to induce and treat Clostridium difficile infection. CD, Clostridium difficile. CDI, C. difficile infection. Cfu, colony forming units.
Figure 2Co-amoxiclav induction gut model samples and variable site base counts.
Panel A shows the total viable counts, spore counts and cytotoxin titres. Sampling points are shown as circles. Sampling points where 12 colonies were sub-cultured and sequenced separately are shown with an *. Panel B shows the base counts obtained at the variable sites identified between sequences. Each sample is represented in a different column. Samples taken at different timepoints are numbered sequentially. Where 12 colonies were sub-cultured at a given timepoint these are indicated by different letters following the sample number and yellow background shading. Replicate samples sequenced to assess the accuracy of the sequencing are shown with a‘, e.g. 3’ and blue background shading. The proportion of high quality (base quality PHRED scaled score ≥30, mapping quality ≥30) base counts supporting the SNV identified is shown for each variable site as a separate row. Median (range) calls per site 49 (23–99). No site with >2 different base calls.
Figure 3Clindamycin-induction, vancomycin-treatment gut model samples and variable site base counts.
Panel A shows the total viable counts, spore counts and cytotoxin titres. Sampling points are shown as circles. Sampling points where 12 colonies were sub-cultured and sequenced separately are shown with an *. Panel B shows the base counts obtained at the variable sites identified between sequences. Each sample is represented in a different column. Samples taken at different timepoints are numbered sequentially. Where 12 colonies were sub-cultured at a given timepoint these are indicated by different letters following the sample number and yellow background shading (pre-treatment) and purple background shading (post-treatment). Replicate samples sequenced to assess the accuracy of the sequencing are shown with a‘, e.g. 6c’ and blue background shading. The proportion of high quality (base quality ≥30, mapping quality ≥30) base counts supporting the SNV identified is shown for each variable site as a separate row. Median (range) calls per site 71 (28–198). No site with >2 different base calls.
Figure 4Clindamycin-induction, cadazolid-treatment gut model samples and variable site base counts.
Panel A shows the total viable counts, spore counts and cytotoxin titres. Sampling points are shown as circles. Panel B shows the base counts obtained at the variable sites identified between sequences. Each sample is represented in a different column. Samples taken at different timepoints are numbered sequentially. Replicate samples sequenced to assess the accuracy of the sequencing are shown with a‘, e.g. 6’ and blue shading. The proportion of high quality (base quality ≥30, mapping quality ≥30) base counts supporting the SNV identified is shown for each variable site as a separate row. Median (range) calls per site 57 (28–89). No site with >2 different base calls.
Figure 5Observed single nucleotide variants between serially sampled patients with ST1/ribotype 027 CDI.
Panel A shows each patient as a separate horizontal line. Samples were obtained from each patient at the start and end of each line, any additional intermediate samples obtained are shown with a ‘x’. Twenty-eight patients with no SNVs at any timepoint are shown at the top of the plot. The 16 patients with at least 1 SNV observed at ≥1 timepoint are lettered A to P. SNVs are shown as black circles, the number corresponding to table 1. ‡ indicates a patient with 13 transient SNVs, observed after 93 days, and with a sample indistinguishable from baseline at 94 days. § indicates a patient with 20 SNVs presumed to be a re-infection, and excluded from panel B. Panel B shows the time between first and last samples from 43 serially sampled patients. The line indicates a coalescent-based model for the rate of C. difficile evolution fitted by maximum likelihood assuming a Poisson distribution for the accumulation of mutations. 95% confidence intervals obtained by parametric bootstrap are shown shaded.
Identified SNVs in gut model and patient C. difficile isolates.
| Gut Model | COG Function | SNV site | Locus | Wildtype | SNV | Impact | Annotation | Fig | ||
| Carbohydrate transport and metabolism | 2914411 | CD196_2507 | C | T | Truncated | PTS system, glucose-specific IIa component | 4 | |||
| Carbohydrate transport and metabolism | 3391317 | CD196_2881 | G | T | Non-Synonymous | PTS system, IIabc component | 2,3 | |||
| Transcription | 9921 | CD196_0011 | A | G | Non-Synonymous | RNA polymerase sigma-B factor | 2 | |||
| Transcription | 10179 | CD196_0011 | C | T | Non-Synonymous | RNA polymerase sigma-B factor | 2 | |||
| Poorly characterized | 977041 | CD196_0822 | C | T | Non-Synonymous | ABC transporter, substrate-binding lipoprotein | 3 | |||
| Poorly characterized | 2380323 | CD196_2054 | C | T | Non-Synonymous | putative phosphoesterase | 3 | |||
| Poorly characterized | 3493190 | CD196_2947 | A | G | Non-Synonymous | crispr-associated helicase cas3 | 3 | |||
| Poorly characterized | 1592378 | CD196_1367 | G | T | Truncated | putative transcriptional regulator | 2,3 | |||
|
| 2206108 | CD196_1916 | T | C | Non-Synonymous | putative lipoprotein | 3 | |||
| There was 1 SNV, not within an annotated coding region: 2916358 | 2,3 | |||||||||
|
|
|
|
|
|
|
|
|
| ||
| Amino acid transport and metabolism | 444759 | CD196_0382 | C | A | Non-Synonymous | subunit of oxygen-sensitive 2-hydroxyisocaproyl-CoA dehydratase | 15 | |||
| Amino acid transport and metabolism | 584702 | CD196_0499 | G | T | Non-Synonymous | glutaminase | 18 | |||
| Amino acid transport and metabolism | 2546482 | CD196_2190 | G | T | Non-Synonymous | putative Xaa-Pro dipeptidase | 3 | |||
| Cell envelope biogenesis, outer membrane | 3348892 | CD196_2850 | C | T | Non-Synonymous | hypothetical protein | 4 | |||
| Inorganic ion transport and metabolism | 2268756 | CD196_1966 | C | T | Non-Synonymous | putative Na(+)/H(+) antiporter | 8 | |||
| Inorganic ion transport and metabolism | 3645413 | CD196_3074 | G | T | Truncated | putative phosphateABC transporter, permease protein | 10 | |||
| Posttranslational modification, protein turnover, chaperones | 1878408 | CD196_1614 | C | T | Non-Synonymous | thioredoxin reductase | 21 | |||
| Posttranslational modification, protein turnover, chaperones | 3073073 | CD196_2629 | C | A | Non-Synonymous | cell surface protein (putative cell surface-associated cysteine protease) | 17 | |||
| Signal transduction mechanisms | 1989705 | CD196_1719 | G | T | Truncated | tellurium resistance protein | 9 | |||
| Signal transduction mechanisms | 2023590 | CD196_1748 | C | A | Non-Synonymous | two-component sensor histidine kinase | 2 | |||
| Signal transduction mechanisms | 3037826 | CD196_2601 | C | A | Truncated | putative signaling protein | 14 | |||
| Signal transduction mechanisms | 4091272 | CD196_3472 | G | T | Non-Synonymous | putative RNA/single-stranded DNA exonuclease | 11 | |||
| Transcription | 1378004 | CD196_1170 | A | G | Synonymous | transcription elongation protein | 16 | |||
| Transcription | 2886178 | CD196_2484 | G | A | Non-Synonymous | sporulation sigma factor SigE | 1 | |||
| Transcription | 2970564 | CD196_2552 | C | T | Truncated | LysR-family transcriptional regulator | 6 | |||
| Poorly characterized | 3269840 | CD196_2784 | G | T | Non-Synonymous | hypothetical protein | 13 | |||
| Poorly characterized | 3244418 | CD196_2762 | C | T | Synonymous | putative acetyltransferase | 12 | |||
| – | 2263760 | CD196_1962 | A | C | Non-Synonymous | hypothetical protein | 19 | |||
| – | 3561141 | CD196_3004 | G | A | Non-Synonymous | abc transporter, permease associated with salivaricin lantibiotic | 7 | |||
| There were 2 SNVs, not within an annotated coding region: 783919, 797285 | 20,5 | |||||||||
COG, Clusters of Orthologous Groups of proteins. Fig refers to the figure displaying the SNV. Legend refers to the SNV identifiers used in figure 5.