| Literature DB >> 23034806 |
Iain A Murray1, Tyson A Clark, Richard D Morgan, Matthew Boitano, Brian P Anton, Khai Luong, Alexey Fomenkov, Stephen W Turner, Jonas Korlach, Richard J Roberts.
Abstract
Six bacterial genomes, Geobacter metallireducens GS-15, Chromohalobacter salexigens, Vibrio breoganii 1C-10, Bacillus cereus ATCC 10987, Campylobacter jejuni subsp. jejuni 81-176 and C. jejuni NCTC 11168, all of which had previously been sequenced using other platforms were re-sequenced using single-molecule, real-time (SMRT) sequencing specifically to analyze their methylomes. In every case a number of new N(6)-methyladenine ((m6)A) and N(4)-methylcytosine ((m4)C) methylation patterns were discovered and the DNA methyltransferases (MTases) responsible for those methylation patterns were assigned. In 15 cases, it was possible to match MTase genes with MTase recognition sequences without further sub-cloning. Two Type I restriction systems required sub-cloning to differentiate their recognition sequences, while four MTase genes that were not expressed in the native organism were sub-cloned to test for viability and recognition sequences. Two of these proved active. No attempt was made to detect 5-methylcytosine ((m5)C) recognition motifs from the SMRT® sequencing data because this modification produces weaker signals using current methods. However, all predicted (m6)A and (m4)C MTases were detected unambiguously. This study shows that the addition of SMRT sequencing to traditional sequencing approaches gives a wealth of useful functional information about a genome showing not only which MTase genes are active but also revealing their recognition sequences.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23034806 PMCID: PMC3526280 DOI: 10.1093/nar/gks891
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Methylome determination of G. metallireducens GS-15. (a) Example trace of kinetic variation, showing three instances of methylated sequence regions. (b) Scatter plot of sequencing coverage and kinetic score for all genomic positions. The colors indicate the bases as shown in the upper left of the panel. The cutoff for detected genomic positions is indicated by the dashed line. (c) MTase specificities determined from the genomic positions detected as methylated. They are highlighted as gray boxes in the example trace (a). (d) Summary of detected methylated positions across the genome.
Figure 2.Methylome determination of C. salexigens. (a and b) Example traces of kinetic variation, showing two instances of methylated positions. (c) MTase specificities determined from the genomic positions detected as methylated. (d) Summary of detected methylated positions across the genome.
Figure 3.Methylome determination of V. breoganii 1C-10. (a–c) Example traces of kinetic variation, showing instances of the detected methylated motifs. (d) MTase specificities determined from the genomic positions detected as methylated. (e) Summary of detected methylated positions across the genome.
Figure 4.Methylome determination of C. jejuni 81-176. (a–e) Example traces of kinetic variation, showing instances of the detected methylated motifs. (f) MTase specificities determined from the genomic positions detected as methylated. (g) Summary of detected methylated positions across the genome.
Figure 5.Methylome determination of C. jejuni NCTC 11168. (a–d) Example traces of kinetic variation, showing instances of the detected methylated motifs. (e) MTase specificities determined from the genomic positions detected as methylated. (f) Summary of detected methylated positions across the genome.
Bioinformatic predictions and experimental results for all MTase genes
Italicized genes characterized previously; red text indicates new information or revision. Recognition sequences representations use the standard abbreviations. (Eur. J. Biochem., 150, 1–5, 1985) to represent ambiguity: R = G or A, Y = C or T, M = A or C, K = G or T, S = G or C, W = A or T, = not A (C or G or T), D = not C (A or G or T), H = not G (A or C or T), V = not T (A or C or G), N = A or C or G or T.
N/D = not detected (m5C assignments were not attempted).
aindicates incorrect result obtained previously.
b5′-GGCC-3′/5′-GCNGC-3′/5′-CCGG-3′/5′-GGNCC-3′ are all recognized.
Figure 6.Methylome determination of B. cereus ATCC 10987. (a–c) Example traces of kinetic variation, showing instances of the detected methylated motifs. (d) MTase specificities determined from the genomic positions detected as methylated. (e) Summary of detected methylated positions across the genome.