| Literature DB >> 29495512 |
Matthew Ouellette1, J Peter Gogarten2, Jessica Lajoie3, Andrea M Makkay4, R Thane Papke5.
Abstract
DNA methyltransferases (MTases), which catalyze the methylation of adenine and cytosine bases in DNA, can occur in bacteria and archaea alongside cognate restriction endonucleases (REases) in restriction-modification (RM) systems or independently as orphan MTases. Although DNA methylation and MTases have been well-characterized in bacteria, research into archaeal MTases has been limited. A previous study examined the genomic DNA methylation patterns (methylome) of the halophilic archaeon Haloferax volcanii, a model archaeal system which can be easily manipulated in laboratory settings, via single-molecule real-time (SMRT) sequencing and deletion of a putative MTase gene (HVO_A0006). In this follow-up study, we deleted other putative MTase genes in H. volcanii and sequenced the methylomes of the resulting deletion mutants via SMRT sequencing to characterize the genes responsible for DNA methylation. The results indicate that deletion of putative RM genes HVO_0794, HVO_A0006, and HVO_A0237 in a single strain abolished methylation of the sole cytosine motif in the genome (Cm4TAG). Amino acid alignments demonstrated that HVO_0794 shares homology with characterized cytosine CTAG MTases in other organisms, indicating that this MTase is responsible for Cm4TAG methylation in H. volcanii. The CTAG motif has high density at only one of the origins of replication, and there is no relative increase in CTAG motif frequency in the genome of H. volcanii, indicating that CTAG methylation might not have effectively taken over the role of regulating DNA replication and mismatch repair in the organism as previously predicted. Deletion of the putative Type I RM operon rmeRMS (HVO_2269-2271) resulted in abolished methylation of the adenine motif in the genome (GCAm6BN₆VTGC). Alignments of the MTase (HVO_2270) and site specificity subunit (HVO_2271) demonstrate homology with other characterized Type I MTases and site specificity subunits, indicating that the rmeRMS operon is responsible for adenine methylation in H. volcanii. Together with HVO_0794, these genes appear to be responsible for all detected methylation in H. volcanii, even though other putative MTases (HVO_C0040, HVO_A0079) share homology with characterized MTases in other organisms. We also report the construction of a multi-RM deletion mutant (ΔRM), with multiple RM genes deleted and with no methylation detected via SMRT sequencing, which we anticipate will be useful for future studies on DNA methylation in H. volcanii.Entities:
Keywords: CTAG methylation; DNA methylation; GATC methylation; Halobacteria; Haloferax volcanii; haloarchaea; methylation; methylome; restriction-modification system
Year: 2018 PMID: 29495512 PMCID: PMC5867850 DOI: 10.3390/genes9030129
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Strains and plasmids used in this study.
|
|
|
|
|---|---|---|
| Cloning strain of | Clontech, Cat. # 636763 | |
| Wild-type strain | Mullakhanbhai and Larsen [ | |
| ∆ | Bitan-Banin et al. [ | |
| ∆ | Allers et al. [ | |
| This study | ||
| Deletion strain of | This study | |
| Deletion strain of | This study | |
| pTA131 | Vector used for gene deletion. Contains | Allers, Ngo, Mevarech and Lloyd [ |
| p∆ | Derivative of pTA131 with flanking regions of | Ouellette, Jackson, Chimileski and Papke [ |
| p∆ | Derivative of pTA131 with flanking regions of | This study |
| p∆ | Derivative of pTA131 with flanking regions of | This study |
| p∆ | Derivative of pTA131 with flanking regions of | This study |
| p∆ | Derivative of pTA131 with flanking regions of | This study |
| p∆ | Derivative of pTA131 with flanking regions of | This study |
List of restriction-modification (RM) genes annotated in Haloferax volcanii DS2.
| Gene Locus Tag | Gene Symbol | Putative RM Classification | Gene Size (bp) | Location in the Genome | Notes |
|---|---|---|---|---|---|
|
|
| Type IV | 1005 | Chromosome | Type IV restriction endonuclease |
| Type II | 1095 | Chromosome | Putative 4mC CTAG methyltransferase | ||
|
|
| Type I | 2223, | Chromosome | Operon which contains a putative Type I RM system with 6mA methyltransferase |
| - | Type II | 1221 | pHV1 | Putative 5mC GTCGAC methyltransferase | |
|
| - | Type IIG | 660 | pHV4 | Putative restriction endonuclease fragment of HVO_A0237 [ |
| - | Type IV | 3315 | pHV4 | Putative Type IV restriction endonuclease | |
|
| - | Type IIG | 3267 | pHV4 | Putative 6mA Type IIG RM protein |
|
| - | Type IIG | 2199 | pHV4 | Putative 6mA methyltransferase and target recognition protein |
List of primers used in this study.
| Primer Name | Primer Sequence | Primer Description |
|---|---|---|
|
| 5’- CGG GCC CCC CCT CGA GTC AAG CAG TAC CTC AAC ACG GAA CA -3’ | Used to amplify the flanking regions of |
|
| 5’- ATT CGA TAT CAA GCT GTC CTC AAG GAC GGC CTG CA -3’ | |
|
| 5’- GAC GCG TTG ATA TCC CGA AGA ATC CAG TTG CTG TCT GTT G -3’ | |
|
| 5’- GGA TAT CAA CGC GTC GGC ATT ATG CAA TTC -3’ | |
| 5’- GCT TGA TAT CGA ATT CCC CGC GAG AAA GAC GAG AAG -3’ | Used to amplify the flanking regions of | |
|
| 5’- GCC TGG TAG AAT TCC CGT ACG GAC GTA TTT CCC CCG A -3’ | |
|
| 5’- GGA ATT CTA CCA GGC CGA CGA CGA CCG ACT GAG GTC -3’ | |
|
| 5’- TAG AAC TAG TGG ATC CGA ACG GCA GCA CCC GCG A -3’ | |
| 5’- CGG GCC CCC CCT CGA GTC GGT GTT TCG CAG GTC ATT C -3’ | Used to amplify the flanking regions of the | |
|
| 5’- GGG CGC CAT CCA GGC TAC TCA CTA TAT TTC ACT CGG GGT A -3’ | |
|
| 5’- GCC TGG ATG GCG CCC CTC ACC TAT TCA CAA AGA GAG GAA -3’ | |
|
| 5’- ATA TCA AGC TTA TCG ATT GCC GGG TTT CCT GTT ATT TT CT -3’ | |
| 5’ GCT TGA TAT CGA ATT CTG CTC GTC GTG GTA CTT GTC -3’ | Used to amplify the flanking regions of | |
|
| 5’- CGG TAC CGA CAT GTT ATC TCA ATG CAG CGC TTC TC -3’ | |
|
| 5’- AAC ATG TCG GTA CCG TTG AGG ACT GGG AGC GTA TC -3’ | |
|
| 5’- TGG CGG CCG CTC TAG TTG AAG GTC TGT GTC GCA TC -3’ | |
| 5’- GCG AAT TGG GTA CCG GCC CCG ACC TGC CTT GG -3’ | Used to amplify the flanking regions of | |
|
| 5’- GCC TGG TAG AAT TCC CCG TGT TCG GTT AAG CGG A -3’ | |
|
| 5’- GGA ATT CTA CCA GGC AAT GGG ATC TGA CGA AGG AGG -3’ | |
|
| 5’- CTG CAG GAA TTC GAT CAT AAA GGT CTT CTC AGC GGT T -3’ | |
| 5’- CGG GCC CCC CCT CGA GGT TCG CGC TCT TGC TCA GGT -3’ | Used to amplify the flanking regions of | |
|
| 5’- GGG ATC CAA AGC TTG AGG CGT TGC TGA CAT TAT ATC GAA G -3’ | |
|
| 5’- CAA GCT TTG GAT CCC GCC TTT CTG CTG GCG AGT TTC C -3’ | |
|
| 5’- TGG CGG CCG CTC TAG AAT ATC GCG CAG CTC TAT CGG G -3’ | |
| 5’- GTA AAA CGA CGG CCA GT -3’ | Used for amplifying the multiple cloning site of pTA131 for screening | |
|
| 5’- AGG AAA CAG CTA TGA CCA T -3’ |
Figure 1Amino acid alignment of HVO_0794 homologs. The multiple sequence alignment includes HVO_0794 (Haloferax volcanii DS2; GenBank ADE02643), M.HsaR1I (Halobacterium salinarum R1; GenBank CAP14114), M.MthZI (Methanothermobacter thermautotrophicus; GenBank CAA48447), M.MjaI (Methanocaldococcus jannaschii DSM 2661; GenBank AAB98988) and M.BfaI (Bacteroides fragilis; GenBank ADQ20483). Identified N4-cytosine methyltransferase motifs I-X [9] are highlighted in blue (representing S-adenosyl methionine (AdoMet) binding motifs), purple (representing DNA binding motif), and yellow (representing catalytic motifs). Red boxes are used to identify the signature DPPY and FGG motifs. The SCOP superfamily domain S-adenosyl-l-methionine-dependent methyltransferase domain SSF53335 is highlighted in green throughout the alignment. Clustal X2 shading and marking of amino acids is included in the alignment.
Figure 2Maximum likelihood phylogeny of homologs of the Haloferax volcanii DS2 CTAG MTase identified in completely sequenced haloarchaeal genomes. Numbers give non-parametric bootstrap support values. The phylogeny was rooted using more divergent haloarchaeal and methanomicrobial homologs. Genomes with a chromosome wide GATC to CTAG ratio below five are given in red, those with a GATC to CTAG ratio between 5 and 14 are given in orange. In addition, those with a ratio above 20 are given in black. Note that only few groups are well supported, including the Haloferax and Haloarcula genera.
Figure 3Cumulative occurrence (y-axis) of GATC (orange) and CTAG (blue) motifs along the Haloferax volcanii DS2 genome (x-axis). The location of the origins of replication identified in Hawkins et al. [54] are indicated on top.
CTAG and GATC motif frequencies in completely sequenced halobacterial chromosomes. Data for Escherichia coli K12 are given for comparison.
| Accession Number, Organism and Chromosome Number | Total CTAG | Total GATC | CTAG/kb | GATC/kb | GATC/CTAG | Match to |
|---|---|---|---|---|---|---|
| NC 013967.1 | 671 | 1851 | 0.24 | 0.65 | 2.8 | |
| NZ CP007551.1 | 1130 | 1472 | 0.38 | 0.50 | 1.3 | |
| NZ CP011947.1 | 556 | 1510 | 0.19 | 0.51 | 2.7 | |
| NC 017941.2 | 1142 | 1500 | 0.39 | 0.51 | 1.3 | |
| NC 023013.1 | 1497 | 7523 | 0.50 | 2.50 | 5.0 | |
| NC 023010.2 | 340 | 1675 | 0.94 | 4.61 | 4.9 | |
| NZ CP010529.1 | 1849 | 9333 | 0.54 | 2.73 | 5.0 | |
| NC 006396.1 | 1816 | 6564 | 0.58 | 2.10 | 3.6 | |
| NC 006397.1 | 274 | 1011 | 0.95 | 3.51 | 3.7 | |
| NC 015948.1 | 1493 | 7462 | 0.50 | 2.49 | 5.0 | |
| NC 015943.1 | 479 | 2210 | 0.98 | 4.52 | 4.6 | |
| NZ LN831302.1 | 795 | 1820 | 0.32 | 0.72 | 2.3 | |
| NZ CP007060.1 | 1168 | 4634 | 0.41 | 1.63 | 4.0 | |
| NC 002607.1 | 551 | 11047 | 0.27 | 5.48 | 20.0 | |
| NC 010364.1 | 537 | 10991 | 0.27 | 5.49 | 20.5 | |
| NC 012029.1 | 756 | 25016 | 0.28 | 9.15 | 33.1 | + |
| NC 012028.1 | 389 | 3306 | 0.74 | 6.29 | 8.5 | + |
| NC 007426.1 | 1016 | 1839 | 0.39 | 0.71 | 1.8 | |
| NC 008212.1 | 2290 | 14449 | 0.73 | 4.61 | 6.3 | |
| NC 017459.1 | 2281 | 14681 | 0.72 | 4.66 | 6.4 | |
| NC 014729.1 | 1085 | 8407 | 0.38 | 2.98 | 7.7 | |
| CP024845.1 Halophilic archaeon True-ADL | 1786 | 23542 | 0.54 | 7.07 | 13.2 | |
| NC 021921.1 | 892 | 27010 | 0.32 | 9.59 | 30.3 | + |
| NC 013158.1 | 964 | 32101 | 0.31 | 10.30 | 33.3 | |
| NC 013202.1 | 918 | 27978 | 0.30 | 8.99 | 30.5 | |
| NC 013743.1 | 1347 | 37472 | 0.35 | 9.64 | 27.8 | |
| NC 013922.1 | 1592 | 25139 | 0.42 | 6.70 | 15.8 | |
| NC 014297.1 | 1106 | 29489 | 0.39 | 10.50 | 26.7 | |
| NC 015666.1 | 1090 | 33560 | 0.30 | 9.15 | 30.8 | |
| NC 018224.1 | 1393 | 33801 | 0.38 | 9.14 | 24.3 | |
| NC 019792.1 | 2330 | 31628 | 0.62 | 8.35 | 13.6 | |
| NC 019962.1 | 1384 | 36667 | 0.37 | 9.67 | 26.5 | |
| NC 019964.1 | 1256 | 30664 | 0.39 | 9.51 | 24.4 | |
| NC 019974.1 | 1534 | 42563 | 0.38 | 10.61 | 27.7 | |
| NC 020388.1 | 1088 | 23003 | 0.37 | 7.90 | 21.1 | |
| NC 021313.1 | 948 | 33056 | 0.29 | 10.15 | 34.9 | |
| NZ AP017558.1 | 695 | 31995 | 0.23 | 10.78 | 46.0 | + |
| NZ AP017569.1 | 426 | 19948 | 0.15 | 7.03 | 46.8 | + |
| NZ CP007055.1 | 1094 | 24861 | 0.39 | 8.91 | 22.7 | |
| NZ CP008874.1 | 596 | 15696 | 0.29 | 7.53 | 26.3 | |
| NZ CP011564.1 | 637 | 16067 | 0.30 | 7.55 | 25.2 | |
| NZ CP016070.1 | 639 | 18453 | 0.32 | 9.36 | 28.9 | |
| NZ CP016804.1 | 696 | 19038 | 0.33 | 9.13 | 27.4 | |
| NZ CP019067.1 | 1046 | 25987 | 0.31 | 7.68 | 24.8 | + |
| NZ CP019285.1 | 1235 | 40138 | 0.30 | 9.64 | 32.5 | |
| NZ CP019327.1 | 1124 | 25935 | 0.33 | 7.63 | 23.1 | |
| NZ CP019893.1 Natrialbaceae archaeon JW/NM-HA 15 | 1177 | 35113 | 0.30 | 8.93 | 29.8 | + |
| Mean value per chromosome | 0.42 | 6.34 | 18.4 | |||
| Standard Deviation | 0.19 | 3.35 | 12.7 | |||
| NC 000913.3 | 885 | 19124 | 0.19 | 4.12 | 21.6 | + |
$: Presence of a match to the E. coli DNA adenine methyltransferase in a translated nucleotide BLAST (tblastn) search with an E-value < 10−25 are indicated by +.
Figure 4Amino acid alignment of RmeM homologs. The multiple sequence alignment includes RmeM (Haloferax volcanii DS2; GenBank ADE02452), M.HspNI (Halobacterium salinarum NRC-1; GenBank AAG18733), Mboo_1031 (Methanoregula boonei 6A8; GenBank ABS55549), Tte_1547 (Caldanaerobacter subterraneus subsp. tengcongensis MB4; AAM24756), M.BceSVI (Bacillus cereus ATCC 10987; GenBank AAS39772), and M.EcoKI (Escherichia coli K-12; GenBank P08957). The PFAM N6 adenine-specific DNA methyltransferase N-terminal domain PF12161 is highlighted in yellow, and the PFAM DNA methylase, adenine specific domain PF02384 is highlighted in blue. Red boxes are used to identify the signature DPPY and FGG motifs. The SCOP superfamily domain S-adenosyl-l-methionine-dependent methyltransferase domain SSF53335 is highlighted in green throughout the alignment. Clustal X2 shading and marking of amino acids is included in the alignment.
Figure 5Amino acid alignment of RmeS homologs. The multiple sequence alignment includes RmeS (Haloferax volcanii DS2; GenBank ADE04051), S.HspNI (Halobacterium salinarum NRC-1; GenBank AAG18734), Mboo_1032 (Methanoregula boonei 6A8; ABS55550), Tte1545 (Caldanaerobacter subterraneus subsp. tengcongensis MB4; GenBank AAM24754), S.BceSVI (Bacillus cereus ATCC 10987; GenBank AAS39773), and S.EcoKI (Escherichia coli K-12; GenBank AAG18734). The first SCOP superfamily domain DNA methylase specificity domain SSF116734 is highlighted in yellow, and the second one is highlighted in green. Clustal X2 shading and marking of amino acids is included in the alignment.
Figure 6Amino acid alignment of HVO_C0040 homologs. The multiple sequence alignment includes HVO_C0040 (Haloferax volcanii DS2; GenBank ADE05226), M.Hti4BORF752P (Halorhabdus tiamatea SARL4B; GenBank CCQ33914), M.AbaMAR002ORF10745P (Acinetobacter baumannii MAR002; GenBank KGF60346), M.BbrUII (Bifidobacterium breve UCC2003; GenBank ABE95799), and M.HgiDII (Herpetosiphon aurantiacus; GenBank CAA38941). The protein motif database PRINTS cytosine-specific DNA methyltransferase signature domains PR00105 are highlighted in yellow. Red boxes are used to identify signature FGG, PC, ENV, QRR, and YGN motifs described in Posfai et al. [11]. The SCOP superfamily domain S-adenosyl-l-methionine-dependent methyltransferase domain SSF53335 is highlighted in green throughout the alignment. Clustal X2 shading and marking of amino acids is included in the alignment.
Figure 7Amino acid alignment of HVO_A0079 homologs. The multiple sequence alignment includes HVO_A0079 (Haloferax volcanii DS2; GenBank ADE01706), C463_0072 (Halorubrum californiensis DSM 19288; GenBank ELZ48543), RM.HarDL31ORF105P (Halophilic archaeon DL31; GenBank AEN07377), RM.Aco12261II (Aminobacterium colombiense DSM 12261; GenBank ADE57453), and RM.Fla104114II (Filimonas lacunae 104114; GenBank BAV07385). The PRINTS adenine-specific DNA methyltransferase signature domains PR00507 are highlighted in yellow. Red boxes signify signature FGG and DPPY motifs. The SCOP superfamily domain S-adenosyl-l-methionine-dependent methyltransferase domain SSF53335 is highlighted throughout the alignment. Clustal X2 shading and marking of amino acids is included in the alignment.
DNA methylation patterns detected in H. volcanii RM deletion mutants
|
|
|
| ||||
|---|---|---|---|---|---|---|
| Motif | GCAm6BNNNNNNVTGC | Cm4TAG | GCAm6BNNNNNNVTGC | Cm4TAG | GCAm6BNNNNNNVTGC | Cm4TAG |
| Methylated position | 3 | 1 | 3 | 1 | 3 | 1 |
| Methylation type | 6mA | 4mC | 6mA | 4mC | 6mA | 4mC |
| Number of methylated motifs | 410 | 0 | 0 | 1199 | 0 | 0 |
| Number of motifs in genome | 410 | 1342 | 410 | 1342 | 410 | 1342 |
| Percent of methylated motifs | 100 | 0 | 0 | 89 | 0 | 0 |
| Mean modification QV score | 213.0 | - | - | 104.1 | - | - |
| Mean motif coverage | 130.4 | - | - | 113.0 | - | - |
Figure 8Cell density of H26 and ΔRM at stationary phase when grown on Hv-YPC, represented by the average optical density (OD620) reading of 24 cell culture replicates after 72 h of growth. Error bars indicate the standard error of the mean. Analysis of variance (ANOVA) single factor, p = 0.98.