| Literature DB >> 26578678 |
Brian M Forde1, Minh-Duy Phan1, Jayde A Gawthorne1, Melinda M Ashcroft1, Mitchell Stanton-Cook1, Sohinee Sarkar1, Kate M Peters1, Kok-Gan Chan2, Teik Min Chong2, Wai-Fong Yin2, Mathew Upton3, Mark A Schembri4, Scott A Beatson4.
Abstract
UNLABELLED: Escherichia coli sequence type 131 (ST131) is a clone of uropathogenic E. coli that has emerged rapidly and disseminated globally in both clinical and community settings. Members of the ST131 lineage from across the globe have been comprehensively characterized in terms of antibiotic resistance, virulence potential, and pathogenicity, but to date nothing is known about the methylome of these important human pathogens. Here we used single-molecule real-time (SMRT) PacBio sequencing to determine the methylome of E. coli EC958, the most-well-characterized completely sequenced ST131 strain. Our analysis of 52,081 methylated adenines in the genome of EC958 discovered three (m6)A methylation motifs that have not been described previously. Subsequent SMRT sequencing of isogenic knockout mutants identified the two type I methyltransferases (MTases) and one type IIG MTase responsible for (m6)A methylation of novel recognition sites. Although both type I sites were rare, the type IIG sites accounted for more than 12% of all methylated adenines in EC958. Analysis of the distribution of MTase genes across 95 ST131 genomes revealed their prevalence is highly conserved within the ST131 lineage, with most variation due to the presence or absence of mobile genetic elements on which individual MTase genes are located. IMPORTANCE: DNA modification plays a crucial role in bacterial regulation. Despite several examples demonstrating the role of methyltransferase (MTase) enzymes in bacterial virulence, investigation of this phenomenon on a whole-genome scale has remained elusive until now. Here we used single-molecule real-time (SMRT) sequencing to determine the first complete methylome of a strain from the multidrug-resistant E. coli sequence type 131 (ST131) lineage. By interrogating the methylome computationally and with further SMRT sequencing of isogenic mutants representing previously uncharacterized MTase genes, we defined the target sequences of three novel ST131-specific MTases and determined the genomic distribution of all MTase target sequences. Using a large collection of 95 previously sequenced ST131 genomes, we identified mobile genetic elements as a major factor driving diversity in DNA methylation patterns. Overall, our analysis highlights the potential for DNA methylation to dramatically influence gene regulation at the transcriptional level within a well-defined E. coli clone.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26578678 PMCID: PMC4659465 DOI: 10.1128/mBio.01602-15
Source DB: PubMed Journal: mBio Impact factor: 7.867
FIG 1 Detailed summary of R-M systems from across the EC958 genome. A schematic representation showing the structure and genomic context of EC958 R-M systems and orphan MTases is presented. Genes are shaded according to their functional classification.
FIG 2 Relative expression levels of MTase genes in E. coli EC958. The graph shows the fold difference in expression levels of each MTase gene relative to the gene coding for M.EcoMDam (EC958_3778). MTases with expression levels similar to or higher than those of M.EcoMDam were presumed to be active in EC958. MTases with significant differences are indicated by asterisks. Measurements were performed in at least quadruplicates.
FIG 3 Circos plots displaying the distribution of methylated bases in the E. coli EC958 chromosome (A) and large plasmid pEC958 (B). The locations of MGEs on the chromosome (A) and plasmid antibiotic-resistance regions (B) are indicated on the outermost track in yellow. The relative positions of the MTases are indicated on the second outermost track. MTase expression levels are based on a scale from red to green, where red represents high expression relative to Dam and green represents low expression relative to Dam. The remaining colored tracks display the location of methylated sites for each motif. From outer to inner: GATC, purple (M.EcoMDam); CANCATC; red (RM.EcoMVII), AACN4CTTT, orange (RM.EcoMII); RTACN4GTG, green (M.EcoMIII); GAGACC, blue (RM.EcoMI). Tick marks display the genomic positions in megabases (A) and kilobases (B).
FIG 4 Distribution of GATC motifs in the core and accessory genome of E. coli EC958. The graph displays a linear representation of the EC958 chromosome showing the position of methylated GATC sites (x axis) and the distance between methylated GATC sites (y axis). Each GATC motif is represented by a single circle that has been colored based on its genomic context: genomic islands (GI-thrW, HPI, GI-pheV, GI-selC, and GI-leuX), blue; prophage (Phi1 to -7 and cryptic prophage), pink; core, gray. The dashed line denotes the boundary for outliers and is calculated as the mean distance between methylated GATC sites plus 3× the standard deviation.
FIG 5 Distribution of MTases in ST131. MTases conserved in EC958 (tan) and those not encoded in EC958 (purple) are shown along the x axis with strain identifiers listed on the y axis in order of phylogenetic relatedness (6). Gene presence (black shading) is indicated by BLASTn comparison (≥95% nucleotide identity) of EC958 MTases and MTases from the REBASE database (15) to the draft assemblies of 95 ST131 strains and/or mapped reads for each ST131 strain (http://github.com/BeatsonLab-MicrobialGenomics/ST131_99/), as implemented in Seqfindr (http://github.com/mscook/seqfindr).