| Literature DB >> 31427571 |
Torbjørn Ølshøj Jensen1, Christian Tellgren-Roth2, Stephanie Redl1,3, Jérôme Maury1, Simo Abdessamad Baallal Jacobsen1, Lasse Ebdrup Pedersen1, Alex Toftgaard Nielsen4.
Abstract
Genome-wide analysis of DNA methylation patterns using single molecule real-time DNA sequencing has boosted the number of publicly available methylomes. However, there is a lack of tools coupling methylation patterns and the corresponding methyltransferase genes. Here we demonstrate a high-throughput method for coupling methyltransferases with their respective motifs, using automated cloning and analysing the methyltransferases in vectors carrying a strain-specific cassette containing all potential target sites. To validate the method, we analyse the genomes of the thermophile Moorella thermoacetica and the mesophile Acetobacterium woodii, two acetogenic bacteria having substantially modified genomes with 12 methylation motifs and a total of 23 methyltransferase genes. Using our method, we characterize the 23 methyltransferases, assign motifs to the respective enzymes and verify activity for 11 of the 12 motifs.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31427571 PMCID: PMC6700114 DOI: 10.1038/s41467-019-11179-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of methylome data of the two different strains
| Motifs | Modified position | Type | % motifs detected | # of motifs in genome |
|---|---|---|---|---|
| WATCNNNNNCTC | 2 | m6A | 100.0% | 461 |
| GAGNNNNNGAT | 2 | m6A | 99.4% | 950 |
| SATC | 2 | m6A | 99.9% | 37470 |
| GAWTC | 2 | m6A | 99.7% | 4600 |
| AACCA | 5 | m6A | 99.7% | 5547 |
| GGGCCC | 5 | m4C | 97.4% | 2352 |
| CTCCG | 4 | m4C | 76.4% | 5439 |
|
| ||||
| GCCRAG | 5 | m6A | 99.6% | 1989 |
| CCWGG | 1 | m5C | – | 9414 |
| TAAGNNNNNTCC | 3 | m6A | 99.8% | 304 |
| GATGNNNNNNTGC | 2 | m6A | 99.8% | 375 |
| CAAAAAR | 6 | m6A | 98.9% | 5576 |
The data for A. woodii was from Blow et al.[30]
Fig. 1MetMap workflow. The box designated “automated” indicates the steps that were automated. The variance in the occurrence of motifs in the motif-cassette depends on the nature of the motifs; A motif including unspecific regions of 2–10 n’s (typical for Type I motifs) is present 12 times in the cassette, whereas motifs not including such regions is repeated 10 times
Fig. 2Handling of the sequences. a mapping based selection of the sequences based upon the motif cassette; b 2nd round of mapping based selection based upon both the motif cassette and the gene coding for the methyltransferases; c analysing the grouped sequences though the ‘Modification and Motif_Analysis’ pipeline
Summary of the methyltransferases from M. thermoacetica
| Gene name ATCC 39073-HH | Gene name ATCC 39073 | Gene product | Predicted specificity | Motif detected | Mean mod. QV | Coverage |
|---|---|---|---|---|---|---|
|
| ||||||
MothHH_01798 MothHH_01799 | Moth_1671 Moth_1672 | DNA met. Specificity N-6 DNA methylase | N.A. | W | 55.40 53.80 | 1643.71 |
| GNNBNA(m6A)*TCTNNNWNTC | ||||||
|
| ||||||
| MothHH_00029 | Moth_0026 | DNA adenine methylase | GASTC | GA(m6A)WTC | 49.58 | 50.29 |
| MothHH_00692 | Moth_0639 | DNA methylase N-4/N-6 | N.A. | BGN | 35.40 | 224.54 |
| MothHH_00927 | Moth_0871 | conserved hypothetical protein | N.A. | CA(m6A)TC | 88.75 | 95.23 |
| MothHH_01869 | Moth_1737 | DNA adenine methylase | GGATC | GA(m6A)TC | 53.14 | 60.88 |
| MothHH_02421 | Moth_2238 | hypothetical protein | N.A. | N.D. | N.D. | 6388 |
| MothHH_02424 | Moth_2241 | hypothetical protein | N.A. | N.D | N.D. | 1531 |
| MothHH_02467 | Moth_2281 | Adenine-specific DNA methylase | AGGCCT | GGGCC(m4C)C | 151.5 | 343.27 |
|
| ||||||
| MothHH_01297 | Moth_1213 | Site specific subunit | N.A. | AACCA(m6A) | 69.93 | 83.43 |
| Neg-control | bgaB (beta-galatosidase) | None | N.D. | N.D. | 116 | |
Methyltransferases from M. thermoacetica cloned individually and their corresponding motifs
N. A. not available, N. D. not detected, U. K. unknown
*indicates that the methylation type was identified by the modification position
Summary of the methyltransferases from A. woodii
| Gene name (DSMZ 1030) | Gene product | Predicted specificity | Motif detected | Mean mod. QV | Coverage |
|---|---|---|---|---|---|
|
| |||||
Awo_c04420 Awo_c04450 | R-M subunit HsdM1 R-M subunit HsdS1 | N.A. | N.D. | N.D. | 2075 |
Awo_c08800 Awo_c08810 | R-M subunit HsdM2 R-M subunit HsdS2 | TAAGNNNNNTCC | TAA(m6A)GNNNNNTCC GGA(m6A)NNNNNCNTA | 1914 1567 | 3446 |
Awo_c17130 Awo_c17140 | R-M subunit HsdS3 R-M subunit HsdM3 | GATGNNNNNNTGC | GA(m6A)TNNNNNNNTGC GCA(m6A)NNNNNNCAT | 1136 932 | 1768 |
| Awo_c17840 | R-M subunit HsdM4 | N.A. | N.D. | N.D. | 8535 |
|
| |||||
| Awo_c06460 | DNA (cytosine-5-)-methyltransferase Dcm | CCGG | N.D. | N.D. | 8569 |
| Awo_c14460 | DNA modification methyltransferase | N.A. | CAAAAA(m6A)R | 2185 | 6372 |
| Awo_c18570 | Putative DNA methylase | N.A. | N.D. | N.D. | 9517 |
| Awo_c18590 | Putative DNA methylase | CCWGG | TNNN MNNBSNTNNCCNG(U.K)*G | 48 32 | 8514 |
| Awo_c19740 | Potential Type II | GCCRAG | GCCRAG | 1986 | 6223 |
| Awo_c30860 | Putative DNA methylase | N.A. | TA(m6A)GNHNNNNV BCANA(m6A)NNGNNNNNM VNTNNNNNNTTNTA(m6A) | 608 172 147 | 5539 |
| Awo_c31130 | DNA (Cytosine-5-)-methyltransferase YdiP | N.A. | N.D. | N.D. | 11339 |
| Awo_c34920 | N6 adenine-specific DNA methylase | N.A. | WBNNMDVA(m6A)GCNH | 70 | 937 |
| Awo_c35180 | phage N6 methylase | TGGCCA | WNNNNNMGNC(m4C)NGGNM | 237 | 708 |
| Awo_c35200 | Lambda C methyl | N.A. | N.D. | N.D. | 909 |
| Neg-control | bgaB (beta-galatosidase) | None | N.D. | N.D. | 8051 |
Methyltransferases from A. woodii cloned individually and their corresponding motifs
*indicates the modification position is misleading, as the modification on a guanine is not likely, in addition m5C has typical characteristic kinetic signals two and six bases downstream of the methylated position
N. A. not available, N. D. not detected, U. K unknown