| Literature DB >> 26689587 |
Andreas E Zautner1, Anne-Marie Goldschmidt2, Andrea Thürmer3, Jörg Schuldes3, Oliver Bader2, Raimond Lugert2, Uwe Groß2, Kerstin Stingl4, Gabriela Salinas5, Thomas Lingner5.
Abstract
BACKGROUND: Campylobacter species are the most prevalent bacterial pathogen causing acute enteritis worldwide. In contrast to Campylobacter jejuni, about 5 % of Campylobacter coli strains exhibit susceptibility to restriction endonuclease digestion by DpnI cutting specifically 5'-G(m)ATC-3' motifs. This indicates significant differences in DNA methylation between both microbial species. The goal of the study was to analyze the methylome of a C. coli strain susceptible to DpnI digestion, to identify its methylation motifs and restriction modification systems (RM-systems), and compare them to related organisms like C. jejuni and Helicobacter pylori.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26689587 PMCID: PMC4687069 DOI: 10.1186/s12864-015-2317-3
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Pie chart representing RAST subsystems identified in the BFR-CA-9557 genome. The 20 most abundant subsystems on the “category” level as identified by RAST are represented by a particular color indicated at the right-hand side of the figure
Fig. 2Comparison of C. coli BFR-CA-9557 genome to those of C. coli RM4661 and 76339 a as well as C. jejuni NCTC 11168 b using the Artemis Comparison Tool (ACT). Comparison of the BFR-CA-9577 genome to those of C. coli clade 1 strain RM4661 and C. coli clade 3 strain 76339 as well as to the genome of C. jejuni NCTC 11168 indicates that the clade 1 strain BFR-CA-9577 genome organization is more related to C. coli clade 1 and C. jejuni strains than to clade 3 strains. The CJIE1-homologue prophage in the BFR-CA-9557 genome is indicated in green
Positions of poly-G tracts in the BFR-CA-9557 genome
| Poly G on forward strand | Pos. rel. to | ORF location | |||||
|---|---|---|---|---|---|---|---|
| Position | Length | RAST annotation of closest ORF | ATG | STOP | Start | Stop | Strand |
| 46933 | 9 | hypothetical protein | −52 | −2292 | 46985 | 49225 | + |
| 258726 | 10 | FIG 00470070: hypothetical protein | −15 | −1924 | 258741 | 260650 | + |
| 689956 | 10 | UDP-N-acetylmuramoylalanine--D-glutamate ligase (EC 6.3.2.9) | −6 | −1232 | 691179 | 689971 | - |
| 851085 | 10 | FIG 00470965: hypothetical protein | 556 | −671 | 850529 | 851756 | + |
| 1562460 | 10 | Putative transferase | −48 | −590 | 1562508 | 1563050 | + |
| 1565336 | 9 | FIG 00469667: hypothetical protein | 251 | −423 | 1565085 | 1565759 | + |
| 1575332 | 10 | Motility accessory factor | 167 | −1780 | 1575165 | 1577112 | + |
| 1645063 | 9 | Ferrous iron transport protein B | 1576 | −269 | 1643487 | 1645332 | + |
Poly-G/C tracts were searched in the BFR-CA-9557 genome using regular expression describing at least 8 consecutive G/C. The first two columns denote the genome location and length of the expression found. Column 3 shows the RAST annotation of the ORF closest to the homopolymeric stretch. Column 4 and 5 represent the relative position of the stretch to the ORF’s start (4) and Stop codon (5), with negative numbers representing upstream locations. Columns 6 to 8 denote the location and orientation of the ORF
Positions of poly-C tracts in the BFR-CA-9557 genome
| Poly C on forward strand | pos. rel. to | ORF location | |||||
|---|---|---|---|---|---|---|---|
| Position | Length | RAST annotation of closest ORF | ATG | STOP | Start | Stop | Strand |
| 442261 | 11 | Putative lipoprotein of ferric iron transporter system | 139 | −3 | 442254 | 442132 | - |
| 1099189 | 10 | hypothetical protein | 739 | −7 | 1099187 | 1098459 | - |
| 1257487 | 9 | Phosphoglycerol transferase | 1728 | −256 | 1257735 | 1255767 | - |
| 1427910 | 9 | Filamentous haemagglutinin domain protein | 1554 | −9 | 1427911 | 1426364 | - |
| 1547848 | 9 | FIG 00469527: hypothetical protein | 637 | −596 | 1548436 | 1547219 | - |
| 1587049 | 8 | hypothetical protein | 679 | −575 | 1587617 | 1586377 | - |
| 1607917 | 9 | FIG 00470049: hypothetical protein | 2305 | 34 | 1607875 | 1605620 | - |
| 1660758 | 9 | CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase (EC 2.4.99.-) | 64 | −620 | 1661370 | 1660702 | - |
| 1668862 | 8 | CMP-N-acetylneuraminate-beta-galactosamide-alpha-2,3-sialyltransferase (EC 2.4.99.-) | 40 | −1587 | 1670442 | 1668829 | - |
Poly-G/C tracts were searched in the BFR-CA-9557 genome using regular expression describing at least 8 consecutive G/C. The first two columns denote the genome location and length of the expression found. Column 3 shows the RAST annotation of the ORF closest to the homopolymeric stretch. Column 4 and 5 represent the relative position of the stretch to the ORF’s start (4) and Stop codon (5), with negative numbers representing upstream locations. Columns 6 to 8 denote the location and orientation of the ORF
Methylation motifs of C. coli BFR-CA-9557
| No. | Motif | Modified Position | Modification Type | % Motifs Detected | # of Motifs Detected | # of Motifs in Genome | Mean Modi-fication QV1 | Mean Motif Coverage | Partner Motif |
|---|---|---|---|---|---|---|---|---|---|
| A | RAATTY | 3 | m6A | 98.67 | 27795 | 28170 | 309.40 | 235.73 | RAATTY |
| B | GATC | 2 | m6A | 99.66 | 7512 | 7538 | 344.25 | 257.15 | GATC |
| C | RCATC | 3 | m6A | 99.43 | 4381 | 4406 | 274.48 | 256.54 | |
| D | CAAGAA | 6 | m6A | 99.71 | 2069 | 2075 | 290.35 | 249.45 | |
| E | GGGTDA | 6 | m6A | 99.44 | 1607 | 1616 | 327.70 | 254.15 | |
| F | DACATTGB | 4 | m6A | 69.69 | 223 | 320 | 80.60 | 255.05 | |
| G1 | TAAANNNNNGTG | 3 | m6A | 99.75 | 392 | 393 | 271.96 | 259.91 | CACNNNNNTTTA |
| G2 | CACNNNNNTTTA | 2 | m6A | 99.75 | 392 | 393 | 331.81 | 263.02 | TAAANNNNNGTG |
| H1 | CAAYNNNNNNNTTYG | 3 | m6A | 99.58 | 237 | 238 | 316.89 | 246.49 | CRAANNNNNNNRTTG |
| H2 | CRAANNNNNNNRTTG | 4 | m6A | 99.58 | 237 | 238 | 276.97 | 247.15 | CAAYNNNNNNNTTYG |
1 QV = quality value
Methylation motifs were identified using the PacBio SMRT Analysis software (see Methods). Column 2 shows the sequence consensus of the motif, whereby non-uniform positions are represented by IUPAC ambiguity codes. Column 3 and 4 denote the position of the modified base within the motif and the type of methylation. Column 5 represents the fraction of a motif’s occurrences in the genome (column 7) for which a methylation has been detected (column 6). Column 8 and 9 denote the average modification quality (in Phred Q-scores) and average coverage of motifs detected as modified. The last column shows the partner motif, i.e. the reverse complement of the motif
Fig. 3Sequence logos of eight methylation motifs. The consensus sequences of all eight motifs are depicted as sequence logo as obtained by the WebLogo 3 server (weblogo.threeplusone.com/create.cgi). The height of each stack indicates the degree of conservation (bits). The height of the letters represents the relative frequency of the base. The asterisk under a particular letter indicates the modified/methylated base. The two motifs in G and in H are partner motifs that are methylated at both strands. All motifs are recognized by N-6 adenine-specific methyltransferases
Putative C. coli BFR-CA-9557 restriction modification systems
| ORF #1 | Strand | Position in genome | Description | Type/subunit | Predicted rec. seq. |
|---|---|---|---|---|---|
| 465 | + | 86828-89146 |
| I/R | - |
| 485 | + | 92205-93437 |
| I/S | - |
| 495 | + | 94709-96196 | Type I restriction-modification system2C DNA-methyltransferase subunit M (EC 2.1.1.72) | I/M | - |
| 6540 | - | 1246404-1250480 | Type I restriction-modification system2C DNA-methyltransferase subunit M (EC 2.1.1.72) / Type I restriction-modification system2C specificity subunit S (EC 3.1.21.3) | I/MS | (GAGNNNNNGT)3,4 |
| 720 | + | 139123-140043 | fokIM_1, Modification methylase FokI, (EC 2.1.1.72), homologue to ulcer associated adenine specific DNA methyltransferase | II/M | (GGATG)3 |
| 1470 | - | 275768-271959 | Type IIS restriction enzyme Eco57I | II/S | - |
| 2605 | + | 488272-489375 | DNA modification methylase (Adenine-specific methyltransferase), fokIM_2, Modification methylase FokI, (EC 2.1.1.72) | II/M | GAATTC |
| 2895 | + | 541751-542602 | DNA modification methyltransferase, | II/M | GATC |
| 6520 | - | 1240865-1243951 |
| II/M | - |
| 7730 | + | 1464470-1465285 | DNA adenine methylase, dpnM, Modification methylase DpnIIA, EC 2.1.1.72 | II/M | - |
| 8910 | + | 1701685-1702374 | 16S rRNA (guanine(966)-N(2))-methyltransferase (EC 2.1.1.171) | II/M | GATC5 |
| 2225 | + | 409683-409970 | McrBC 5-methylcytosine-specific restriction endonuclease system2C McrB subunit2C putative | IV/R | - |
| 22302 | + | 409982-411148 | McrBC 5-methylcytosine-specific restriction endonuclease system2C McrB_1 subunit2C putative | IV/R | - |
| 22302 | + | 411208-412158 | McrBC 5-methylcytosine-specific restriction endonuclease system2C McrB_2 subunit2C putative | IV/R | - |
| 2235 | + | 412124-413494 | McrBC 5-methylcytosine-specific restriction enzyme subunit McrC | IV/R | - |
1ORF # according to the NCBI annotation pipeline for RAST and Prodigal ORF # see Additional files 3 and 4
2disrupted ORF
3predicted sequence does not correspond to any motif detected by SMRT sequencing
4According to REBASE ORF #6540 encodes a Type II RM-system
5In contrast to RAST and Prodigal REBASE predicts a second candidate for a GATC-specific DNA modification methyltransferase for this ORF
Restriction modification systems have been identified as outlined in section “Methods”. Column 1 to 3 denote the number, strand direction and genome position of the ORF as identified by the NCBI annotation pipeline. Column 4 contains the description of the ORF in terms of aggregated annotations from NCBI, RAST and Prodigal. The type and predicted recognition sequence of the motif are shown in columns 5 and 6