| Literature DB >> 26870957 |
Matthew J Blow1,2, Tyson A Clark3, Chris G Daum1,2, Adam M Deutschbauer4, Alexey Fomenkov5, Roxanne Fries1,2, Jeff Froula1,2, Dongwan D Kang1,2, Rex R Malmstrom1,2, Richard D Morgan5, Janos Posfai5, Kanwar Singh1,2, Axel Visel1,2, Kelly Wetmore4, Zhiying Zhao1,2, Edward M Rubin1,2, Jonas Korlach3, Len A Pennacchio1,2, Richard J Roberts5.
Abstract
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active 'orphan' MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26870957 PMCID: PMC4752239 DOI: 10.1371/journal.pgen.1005854
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 6Identification of putative novel orphan MTase regulators of DNA replication.
Four orphan MTases were found to be associated with enriched clusters of motifs in non-coding regions of the genome (). Plots show density of motifs across a 50kb region of the genome flanking the motif cluster. Data is shown for the organism in which the pattern was originally identified, along with related organisms from the same taxonomic group. For comparison, plots were also generated from closely related organism lacking the orphan MTase. In each panel, a reference MTase sequence is selected (from ). The similarity score of the best scoring orthologs in each genome is represented in the MTase column using a white-green scale. A) Density of GC motifs flanking the gidA gene (origin of replication) in Enterobacteria and other Gammaproteobacteria. B) Density of CTCGG motifs flanking the dnaA gene terminus in Nocardiaceae and other Actinobacteria species. C) Density of TTA motifs flanking the dnaA gene terminus in Arthrobacter and other Actinobacteria species. D) Density of TAG motifs flanking the orc1/cdc6 gene start in Haloarchaea and other Euryarchaeota species (Bold and underlined characters indicate methylated bases. Underlined characters indicate reverse complement of methylated bases). In each example, the presence of motifs clusters correlates with the presence of the respective MTase in the same genome.
Evolutionarily conserved unmethylated motifs.
Conserved unmethylated sites were identified based on reciprocal best blast hits of flanking genes sequences between the respective genomes. Bold and underlined characters indicate methylated bases. Underlined characters indicate reverse complement of methylated bases.
| MTase family | Location of conserved unmethylated site | Number of organisms with conserved unmethylated motif |
|---|---|---|
| Upstream of PTS system D-sorbitol-specific IIC component | 6 | |
| Upstream of homoprotocatechuate degradation operon regulator, HpaR | 4 | |
| Upstream of Sugar kinases, ribokinase family | 3 | |
| Upstream of putative diguanylate cyclase | 2 | |
| Upstream of putative TetR-family regulatory protein | 2 | |
| Upstream of phage repressor protein cI | 2 | |
| Upstream of transcriptional regulator, GntR family | 2 | |
| Upstream of putative zinc-type alcohol dehydrogenase | 2 | |
| Upstream of hemin receptor protein | 2 | |
| Upstream of transcriptional regulator, LysR family | 2 | |
| Upstream of transcriptional regulator, GntR family | 2 | |
| Upstream of transcriptional regulator, PadR family | 2 | |
| Upstream of transcriptional regulator, ArsR family | 2 | |
| Upstream of transcriptional regulator, MarR family | 2 |
Clusters of multiple adjacent unmethylated motifs.
(* = Unmethylated motif clusters and putative target genes each occur in two copies in the respective genomes). Bold and underlined characters indicate methylated bases. Underlined characters indicate reverse complement of methylated bases.
| Orphan MTase | Location of unmethylated motif cluster | Number of adjacent unmethylated sites |
|---|---|---|
| Throughout non-ribosomal peptide synthase domain TIGR01720/amino acid adenylation domain | 6 | |
| Upstream of transcriptional regulator, GntR family | 4 | |
| Upstream of Antigen 43 precursor | 3* | |
| Upstream of TonB dependent receptor | 3* | |
| Upstream of MobA/MobL family | 3* | |
| Upstream of Peptidase M60-like family / Protein-tyrosine-phosphatase | 3 | |
| Upstream of Urease, gamma subunit / hypothetical protein | 3 |