| Literature DB >> 30820249 |
Binit Lamichhane1, Eng-Guan Chua1, Michael J Wise1,2, Connor Laming1, Barry J Marshall1,3, Chin-Yen Tay1,3.
Abstract
BACKGROUND: Helicobacter pylori is a Gram-negative bacterium which mainly causes peptic ulcer disease in human, but is also the predominant cause of stomach cancer. It has been coevolving with human since 120,000 years and, according to Multi-locus sequence typing (MLST), H. pylori can be classified into seven major population types, namely, hpAfrica1, hpAfrica2, hpNEAfrica, hpEastAsia, hpAsia2, hpEurope and hpSahul. Helicobacter pylori harbours a large number of restriction-modification (R-M) systems. The methyltransferase (MTase) unit plays a significant role in gene regulation and also possibly modulates pathogenicity. The diversity in MTase can act as geomarkers to correlate strains with the phylogeographic origins. This paper describes the complete genome sequence and methylome of gastric pathogen H. pylori belonging to the population hpNEAfrica.Entities:
Keywords: Complete genome; Helicobacter pylori; Methylome; hpNEAfrica
Year: 2019 PMID: 30820249 PMCID: PMC6381644 DOI: 10.1186/s13099-019-0284-y
Source DB: PubMed Journal: Gut Pathog ISSN: 1757-4749 Impact factor: 4.181
Presence and absence of major virulence factors in HP14039 complete genome and other hpNEAfrica draft genomes (“+” means presence and “–” means absence)
| Virulence factors | HP14039 | HP01234 | HP07036 | HP08058 | HP08061 | HP08074 | HP11049 | HP13005 | HP13050 | HP13068 | HP15005 | HP98490 | HP99255 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| + |
| + |
|
| +a |
| + |
| + |
| |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + |
|
|
| +b |
|
|
|
|
|
|
|
|
| + | + | + |
|
| + | + | + |
| + | + | + | + |
|
| + | + | + | + |
| +b | + | +b | +b | +b |
| + |
|
|
| + |
| + | + | +b | +b |
|
|
| +b | + |
|
|
|
|
| + |
|
| +b |
|
|
|
|
|
|
| + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
| + | + | + | + | + | + | + | + | + | + | + | + | + |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
aAll cag-PAI elements are present, except cagA
bTruncated gene
Fig. 1Circos plot displaying the density of methylated sites in HP14039 complete genome (5000 bp window). Open reading frames are highlighted in red in the outer ring. The inner histogram displays the abundance of methylated bases within every 5000 bp region over the chromosome, with an average of 367.9 ± 49.7. The hypermethylated and hypomethylated regions with methylated sites exceeding two standard deviations (> 466.7 and < 267.9, respectively) are highlighted in yellow and dark blue, respectively. The three outer green tracks within the histogram represent density values ranging from 600 to 481 (from outer to inner). The three inner orange tracks within the histogram represent density values ranging from 320 to 201 (from outer to inner)
Methylated motifs detected in HP14039
| Recognition sitea | Modification type | No. detected | No. in genome | Detected (%) | Restriction component(s) | Methylation component(s) | Refs | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Locus tag | Ortholog | % identity | Locus tag | Ortholog | % identity | ||||||
| 5′-A | m4c | 976 | 1112 | 87.8 | HP14039_01407-01406 | K747_11000 | 94.1, 93.9 | HP14039_01408 | K747_10995 | 95.5 | [ |
| 5′-ATTA | m6a | 949 | 952 | 99.7 | HP14039_01410 | jhp0431 | 94.1 | HP14039_01411 | jhp0430 | 94.9 | [ |
| 5′-C | m6a | 14,872 | 14,874 | 100 | HP14039_00716 | hp1209 | 93.3 | HP14039_00717 | hp1208 | 93.8 | [ |
| 5′-CC | m6a | 2190 | 2199 | 99.6 | HP14039_00375-00376 | K747_12645 | 95.7, 92.7 | HP14039_00372-00373 | K747_03690 | 96.2, 97.7 | [ |
| 5′- | m4c | 3535 | 3542 | 99.8 | HP14039_01604 | hp0262 | 96.1 | HP14039_01603 | hp0263 | 94.2 | [ |
| 5′- | m4c | 6164 | 6172 | 99.9 | HP14039_01555 | HpyHI | 96 | HP14039_01554 | M.HpyHI | 94.9 | [ |
| 5′-G | m6a | 4672 | 4718 | 99 | HP14039_00203b | – | – | HP14039_00204 | hp0050 | 95.2 | [ |
| 5′-G | m6a | 10,541 | 10,548 | 99.9 | HP14039_00158 | hp0091 | 93.7 | HP14039_00157 | hp0092 | 92.8 | [ |
| 5′-GT | m6a | 300 | 304 | 98.7 | HP14039_01382-01381 | jhp0455 | 93.4, 93.7 | HP14039_01383 | jhp0454 | 95.8 | [ |
| 5′-GTNN | m6a | 820 | 826 | 99.3 | HP14039_00987 | hp0909 | 91.7 | HP14039_00986 | hp0910 | 96.1 | [ |
| 5′-TCG | m6a | 610 | 612 | 99.7 | – | – | – | HP14039_01606-01607 | hp0260 | 96.1, 97.6 | [ |
| 5′-TCNNG | m6a | 3795 | 3808 | 99.7 | HP14039_00847-00846 | jhp1013c | 96.8, 97.5 | HP14039_00848 | jhp1012 | 96.1 | [ |
| 5′-TGC | m6a | 11,060 | 11,072 | 99.9 | HP14039_00305 | HpyCH4 V | 96.6 | HP14039_00306 | M.HpyCH4 V | 93.4 | [ |
| 5′-CT | m6a | 191 | 192 | 99.5 | – | – | – | HP14039_00325d | K747_03505 | 93 | [ |
| 5′-WC | m6a | 1732 | 4440 | 39 | – | – | – | HP14039_01429e | K747_10905 | 92 | [ |
aThe methylated base within the motif is underlined while the modified base in the complementary strand is highlighted in red
bThe 1–563 bp and 550–918 bp regions of HP14039_00203 demonstrated significant nucleotide sequence homology to the N-terminus of hp0052 and the C-terminus of hp0051, respectively
cPutative type II restriction enzyme probably recognising TCNNGA
dPredicted type I DNA methylase probably recognising CTANNNNNNNTAYG based on high sequence similarity with K747_03505 that recognises a closely related sequence motif, which is GANNNNNNNTAYG
ePredicted type I DNA methylase probably recognising WCANHNNNNTG. This gene is highly similar to K747_10905, which encodes a type I DNA methylase that recognises CCANNNNNNTC sequence motif
Fig. 2Core genome based phylogenetic tree of HP14039 with 47 complete genomes and 12 draft genomes. HP14039 is highlighted in red