| Literature DB >> 25636331 |
Erika van Eijk1, Seyed Yahya Anvar2,3, Hilary P Browne4, Wai Yi Leung5, Jeroen Frank6, Arnoud M Schmitz7, Adam P Roberts8, Wiep Klaas Smits9.
Abstract
BACKGROUND: Clostridium difficile strain 630Δerm is a spontaneous erythromycin sensitive derivative of the reference strain 630 obtained by serial passaging in antibiotic-free media. It is widely used as a defined and tractable C. difficile strain. Though largely similar to the ancestral strain, it demonstrates phenotypic differences that might be the result of underlying genetic changes. Here, we performed a de novo assembly based on single-molecule real-time sequencing and an analysis of major methylation patterns.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25636331 PMCID: PMC4320837 DOI: 10.1186/s12864-015-1252-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Results of short read next generation sequencing of 630Δ . A. Coverage of the region of Tn5398 harboring the two erythromycin resistance genes (CD2007 and CD2010). Bars underneath the graph indicate a greater than 100-fold (orange) and lesser than 50-fold (blue) coverage, respectively. Reference assembly was performed using Geneious 7.1 software (Biomatters, http://www.geneious.com). B. Schematic representation of the breakpoint analysis (for details see Methods). Segments between breakpoints are indicated with different colors. The putative transposed element is indicated in black.
Figure 2The complete genome of 630Δ . A. Overview of genomic features. Indicated are (from outside to inside); Short Tandem Repeats <500 bp (dots); rRNA (red), tRNA (blue), mobile genetic elements (green) and the PaLoc (purple); GC content per 1 kb window; GC skew (orange line) in a 5 kb sliding window; grey links represent repeats (193 repeats identified with Blast2Seq) having >95% identity and an alignment length of >500 bp; red links indicate an alignment length >2 kb. B. Confirmation of the 18 bp duplication in spo0A resulting in a 6 amino acid direct repeat [23] C. Confirmation of the 2.4 kb Δerm deletion [18]. Open reading frames are indicated as yellow arrows, repeat elements in blue.
Figure 3Methylation patterns in 630Δ . A. Genome wheel showing motif-associated m6A methylation (red), motif associated m4C methylation (blue), m6A methylation not associated with a motif (black), and m4C methylation events not associated with a motif (orange) in relation to GC content (per 1 kb window), rRNA (red), tRNA (blue) and mobile genetic elements (green). B. Sequence logos for the m6A methylated sequences. C. Sequence logos for the m4C methylated sequences.
Structural variants associated with coding sequences
|
|
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| 84143 | 89438 | Substitution | C > T | CD630_00580 |
| Elongation factor EFTu/EF1A | Synonymous | ||
| 84227 | 89522 | substitution | C > T | CD630_00580 |
| Elongation factor EFTu/EF1A | Synonymous | ||
| 103225 | 108520 | substitution | G > T | CD630_00730 |
| 50S ribosomal protein L3 | Synonymous | ||
| 610336 | 615631 | substitution | G > A | CD630_05140 |
| Cell surface protein | Val > Ile | ||
| 610480 | 615775 | substitution | C > T | CD630_05140 |
| Cell surface protein | Synonymous | ||
| 610563 | 610564 | 615859 | 615861 | insertion | 610563_610564ins | CD630_05140 |
| Cell surface protein | In frame Ala insertion |
| 610570 | 615868 | substitution | A > G | CD630_05140 |
| Cell surface protein | Ile > Val | ||
| 610638 | 615936 | substitution | C > T | CD630_05140 |
| Cell surface protein | Synonymous | ||
| 610752 | 616050 | substitution | G > A | CD630_05140 |
| Cell surface protein | Synonymous | ||
| 610840 | 616138 | substitution | C > T | CD630_05140 |
| Cell surface protein | Synonymous | ||
| 610875 | 616173 | substitution | C > T | CD630_05140 |
| Cell surface protein | Synonymous | ||
| 755776 | 755800 | 760995 | 760996 | deletion | 755776_755800del | CD630_06320 | Conserved hypothetical protein | In frame 8aa deletion in repeat region | |
| 1000995 | 1006274 | substitution | A > G | CD630_08260 | Putative ferric-uptake regulator | Thr > Ala | |||
| 1391850 | 1397129 | substitution | T > C | CD630_11900 | Putative acyl-CoA N-acyltransferase | Phe > Leu | |||
| 1413060 | 1413077 | 1418339 | 1418354 | duplication | 1413060_1413077dup | CD630_12140 |
| Stage 0 sporulation protein A | 6aa (NVGNIE) duplication |
| 1607458 | 1607459 | 1612756 | insertion | 1607458_1607459insT | CD630_13880 | Putative transcriptional regulator | Restores transcriptional regulator | ||
| 2044514 | 2049813 | substitution | C > G | CD630_17670 |
| Glyceraldehyde-3-phosphate dehydrogenase GAPDH | Pro > Ala | ||
| 2137467 | 2183040 | 2142764 | 2142765 | deletion | 2137467_2183040del | CD630_18440 | Putative adhesin | Translocation of CTn | |
| 2209236 | 2168961 | substitution | G > A | CD630_19070 |
| Ethanolamine iron-dependent Alcohol dehydrogenase | Gly > Glu | ||
| 2924655 | 2881973 | substitution | C > T | CD630_25320 | Aminotransferase, alanine--glyoxylate transaminase | Synonymous | |||
| 3034953 | 2992271 | substitution | C > A | CD630_26270 | Conserved hypothetical protein | Gly > Cys | |||
| 3080703 | 3038021 | substitution | C > T | CD630_26670 |
| PTS system, glucose-specific IIBC component | Val > Ile | ||
| 3686534 | 3686535 | 3643756 | 3643756 | insertion | 3686534_3686535insA | CD630_31561 | Conserved hypothetical protein | Restores conserved hypothetical protein | |
| 3967522 | 3967523 | 3924743 | 3970315 | insertion | 3967522_3967523insAM180355:g.2137467_2183040 | CD630_33930 |
| 23S rRNA (uracil-5-)-methyltransferase | Translocation of CTn |
| 4166495 | 4169292 | substitution | G > A | CD630_35650 | Transcriptional regulator, GntR family | Ala > Val | |||
| 12347 | 12348 | 12348 | 17642 | insertion | 12347_12348ins | multiple | rRNA/tRNA cluster | ||
| 2317627 | 2320033 | 2277358 | 2277359 | deletion | 2317633_2320041del | multiple | Loss of erythromycin resistance (Δ |
Figure 4Schematic representation of the rRNA operons and associated tRNA clusters of 630Δ . Operons are numbered from 1–12 in the order they appear in the genome sequence. 16S rRNA, 23S rRNA and 5S rRNA genes are indicated with white, grey and black arrow shapes, respectively. tRNAs are indicated by green arrow shapes. SNPs between the rRNA clusters of strains 630Δerm and 630 are indicated in red (for details see Additional file 1). Brackets indicate that operon 1 is unique to strain 630Δerm. A cluster of tRNAs that is found multiple times associated with rRNAs (tRNAAsn-tRNALeu-tRNAMet-tRNAGlu-tRNAGly-tRNAVal-tRNAAsp) is indicated in dark green. A = tRNAAla, L = tRNALeu, M = tRNAMet, G = tRNAGly, T = tRNAThr. Figure is approximately to scale.
Figure 5CTn is present in in 630Δ but not 630 or 630E. A. Dotplot of the reference sequence for C. difficile 630 (x-axis) versus the de novo assembled 630Δerm sequence (y-axis), indicating the location of CD1844 and rumA (boxed in blue), and the CTn5 element (boxed in red). Note the inverted orientation of the mobile element. B. Schematic representation of the rumA-CD1844A hybrid protein (CD3393A). C. PCR confirmation of the transposition event. For primers used see Methods and Table 2.
Oligonucleotides used in this study
|
|
|
|
|---|---|---|
| oWKS-1467 | CGCACCAGAATGGAAAGAAG | Left junction CTn |
| oWKS-1468 | AGGCGTACACTGTGGGATAG | Left junction CTn |
| oWKS-1469 | TAGATGATGCCGTTGCTGAG | Right junction CTn |
| oWKS-1470 | AAGGTTTGGGTCTGCTGTAG | Right junction CTn |
| oWKS-1471 | CCGTTACCGTCTGTAATG |
|
| oWKS-1472 | AGGGCCTATAAGGTAAGC |
|
aThe repaired junction (CTn5 excised from CD1844) is detected with oWKS-1467 and oWKS-1470. bThe insertion of CTn5 into rumA is detected by primer combination oWKS-1468/oWKS-1472 and/or oWKS-1469/oWKS-1471.