| Literature DB >> 22701658 |
Yuanhai You1, Lihua He, Maojun Zhang, Jianying Fu, Yixin Gu, Binghua Zhang, Xiaoxia Tao, Jianzhong Zhang.
Abstract
In this study, a whole-genome CombiMatrix Custom oligonucleotide tiling microarray with 90,000 probes covering six sequenced Helicobacter pylori (H. pylori) genomes was designed. This microarray was used to compare the genomic profiles of eight unsequenced strains isolated from patients with different gastroduodenal diseases in Heilongjiang province of China. Since significant genomic variation was found among these strains, an additional 76 H. pylori strains associated with different clinical outcomes were isolated from various provinces of China. These strains were tested by polymerase chain reaction to demonstrate this distinction. We identified several highly variable regions in strains associated with gastritis, gastric ulceration, and gastric cancer. These regions are associated with genes involved in the bacterial type I, type II, and type III R-M systems. They were also associated with the virB gene, which lies on the well-studied cag pathogenic island. While previous studies have reported on the diverse genetic characterization of this pathogenic island, in this study, we find that it is conserved in all strains tested by microarray. Moreover, a number of genes involved in the type IV secretion system, which is related to horizontal DNA transfer between H. pylori strains, were identified in the comparative analysis of the strain-specific genes. These findings may provide insight into new biomarkers for the prediction of gastric diseases.Entities:
Mesh:
Year: 2012 PMID: 22701658 PMCID: PMC3368837 DOI: 10.1371/journal.pone.0038528
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Characteristics of the 84 H. pylori strains studied.
| Origin | Clinical Diagnosis | No. of strains |
| Heilongjiang (n = 23) | GC | 12 |
| DU | 3 | |
| GU | 3 | |
| AG | 3 | |
| SG | 2 | |
| Shandong (n = 12) | GC | 12 |
| Hubei (n = 10) | FD | 6 |
| GU | 2 | |
| DU | 2 | |
| Xi’an (n = 11) | SG | 7 |
| DU | 1 | |
| GDU | 3 | |
| Yunnan (n = 10) | SG | 5 |
| GU | 3 | |
| DU | 2 | |
| Jiangxi (n = 18) | SG | 9 |
| GC | 9 |
Note: GC, gastric cancer. GU, gastric ulcer. AG, atrophic gastritis, DU, duodenal ulcer, GDU, gastroduodenal ulcer. FD, functional dyspepsia. SG, non-atrophic gastritis.
General features of the sequenced H. pylori genomes selected for microarray probe design.
| strain | ACCESSION | length | origin | Clinical diagnosis |
| 26695 | AE000511 | 1667867 | UK | Gastritis |
| J99 | AE001439 | 1643831 | USA | Duodenal ulcer |
| HPAG1 | CP000241 | 1596366 | Sweden | Atrophic gastritis |
| P12 | CP001217 | 1673813 | German | Duodenal ulcer |
| G27 | CP001173 | 1652982 | Italy, Tuscany | No known disease |
| Shi470 | CP001072 | 1608548 | Peru | Gastritis |
Figure 1Comparison of HLJ005, HLJ038, and 26695 based on strain specific genes of the six sequenced genomes.
A. Comparison between HLJ005 and 26695. B. Comparison between HLJ038 and 26695. C. Comparison between HLJ038 and HLJ005. Green spots show inferred loss while orange dots show no changes in the original data. Red spots show inferred increase. Segment 1 on the X-axis: probes of 26695 strain specific genes. Segment 2: probes of G27 strain specific genes. Segment 3: probes of HPAG1 strain specific genes. Segment 4: probes of J99 strain specific genes. Segment 5: probes of P12 strain specific genes. Segment 6: probes of Shi470 strain specific genes.
Figure 2Comparison of HLJ005, HLJ038, and 26695 based on 0–100 kb of p12 genome.
A. Comparison between HLJ005 and 26695. B. Comparison between HLJ038 and 26695. C. Comparison between HLJ038 and HLJ005. Green spots show inferred loss while orange dots show no changes in the original data. Red spots show inferred increase. The red rectangle indicates the predicted 5.2 kb absent region both in gastric cancer strain HLJ038 and HLJ005 based on the 0–100 kb of the reference P12 chromosome.
Figure 3Genome comparison of eight H. pylori strains based on P12 reference sequence.
Large variable regions were labeled with colored rectangles. Predicted gastric cancer strain specific regions were labeled with a red rectangle. General genomic variation among all strains was labeled with a green rectangle. A blue rectangle represents diverse regions obtained from comparison of gastric cancer strains and superficial gastritis strains. Corresponding coding proteins were briefly labeled at the bottom of each rectangle. GC vs SG denotes comparison between the two strains of gastric cancer and the two strains of superficial gastritis. DR, different region.
Predicted variable genomic region based on reference genome P12.
| DR | Gene | start | end | Gene description |
| DR1 | 49177 | 49869 | adenine specific DNA methyltransferase | |
| 49866 | 50933 | cytosine specific DNA methyltransferase | ||
| 50930 | 52153 | restriction endonuclease | ||
| 52297 | 53568 | type II R-M system restriction endonuclease | ||
| 53568 | 56036 | type II R-M system methyltransferase | ||
| DR2 | 295040 | 303760 | vacuolating cytotoxin VacA-like protein | |
| DR4 | hsdS-1 | 448002 | 449183 | type I R-M system S protein |
| hsdM-1 | 449176 | 450807 | type I R-M system M protein | |
| 452423 | 453496 | integrase/recombinase XercD family | ||
| 453578 | 454261 | hypothetical protein | ||
| virb6 | 454334 | 455632 | VirB6 type IV secretion protein | |
| 455598 | 455876 | hypothetical protein | ||
| 455928 | 457328 | hypothetical protein | ||
| 457332 | 457577 | hypothetical protein | ||
| 457791 | 459272 | hypothetical protein | ||
| 459405 | 460469 | hypothetical protein | ||
| 460312 | 460803 | hypothetical protein | ||
| 460810 | 461832 | hypothetical protein | ||
| 462096 | 470522 | DNA methylase | ||
| parA | 470785 | 471441 | chromosome partitioning protein | |
| 471523 | 471807 | hypothetical protein | ||
| 471839 | 473017 | hypothetical protein | ||
| virD2-1 | 473042 | 474955 | relaxase | |
| 475253 | 475567 | hypothetical protein | ||
| 475560 | 475841 | hypothetical protein | ||
| virD4-1 | 476105 | 477832 | VirD4 coupling protein | |
| 477879 | 478391 | hypothetical protein | ||
| 478392 | 478682 | hypothetical protein | ||
| 478679 | 479134 | hypothetical protein | ||
| virB11-1 | 479131 | 480072 | VirB11 type IV secretion ATPase | |
| 480072 | 480371 | hypothetical protein | ||
| 480368 | 480631 | hypothetical protein | ||
| 480624 | 480917 | hypothetical protein | ||
| virB10-1 | 480987 | 482252 | VirB10 type IV secretion protein | |
| virB9-1 | 482252 | 483784 | VirB9 type IV secretion protein | |
| virB8-1 | 483784 | 484953 | VirB8 type IV secretion protein | |
| virB7-1 | 484957 | 485073 | VirB7 type IV secretion protein | |
| virB3-1 | 489522 | 489788 | VirB3 type IV secretion protein | |
| virB2-1 | 489789 | 490091 | VirB2 type IV secretion protein | |
| 490088 | 490372 | hypothetical protein | ||
| 490433 | 491611 | hypothetical protein | ||
| 491618 | 491911 | hypothetical protein | ||
| 491931 | 492710 | hypothetical protein | ||
| 494743 | 496638 | hypothetical protein | ||
| 496664 | 497431 | hypothetical protein | ||
| 497441 | 497791 | integral membrane protein | ||
| virb6 | 454334 | 455632 | VirB6 type IV secretion protein | |
| 455598 | 455876 | hypothetical protein | ||
| 455928 | 457328 | hypothetical protein | ||
| 457332 | 457577 | hypothetical protein | ||
| DR6 | res-1 | 628179 | 631121 | type III R-M system restriction enzyme |
| DR8 | 1050977 | 1051564 | serine/threonine kinase C-like protein | |
| 1051604 | 1052053 | serine/threonine kinase C-like protein | ||
| 1052213 | 1052728 | serine/threonine phosphatase 2C-like | ||
| protein | ||||
| DR11 | virB2-2 | 1396323 | 1396607 | VirB2 type IV secretion protein |
| virB3-2 | 1396619 | 1396882 | VirB3 type IV secretion protein | |
| 1396894 | 1397130 | hypothetical protein | ||
| virB4-2 | 1397130 | 1399706 | VirB4 type IV secretion ATPase | |
| virB7-2 | 1399703 | 1399843 | VirB7 type IV secretion protein | |
| virB8-2 | 1399836 | 1400972 | VirB8 type IV secretion protein | |
| virB9-2 | 1400969 | 1402624 | VirB9 type IV secretion protein | |
| virB10-2 | 1402591 | 1403829 | VirB10 type IV secretion protein | |
| 1403813 | 1405597 | hypothetical protein | ||
| 1405501 | 1406055 | hypothetical protein | ||
| 1406068 | 1407030 | hypothetical protein | ||
| 1407047 | 1407322 | hypothetical protein | ||
| virB11-3 | 1407327 | 1408271 | VirB11 type IV secretion ATPase | |
| 1408268 | 1408786 | hypothetical protein | ||
| virD4-2 | 1408783 | 1411026 | VirD4 coupling protein | |
| 1413163 | 1413633 | hypothetical protein | ||
| 1413603 | 1414406 | hypothetical protein | ||
| 1414480 | 1415145 | hypothetical protein | ||
| 1415123 | 1415401 | hypothetical protein | ||
| 1415334 | 1415765 | hypothetical protein | ||
| 1415750 | 1416160 | hypothetical protein | ||
| 1416165 | 1416794 | hypothetical protein | ||
| 1416897 | 1417178 | hypothetical protein | ||
| 1417399 | 1417788 | hypothetical protein | ||
| 1417796 | 1418086 | hypothetical protein | ||
| 1418215 | 1418466 | hypothetical protein | ||
| 1418394 | 1419248 | hypothetical protein | ||
| 1421131 | 1421259 | hypothetical protein | ||
| virD2-2 | 1421785 | 1423818 | relaxase | |
| DR12 | 1575820 | 1579728 | type IIS R-M system | |
| restriction/modification enzyme | ||||
| 1580020 | 1582023 | hypothetical protein | ||
| res-4 | 1582157 | 1585066 | type III R-M system restriction enzyme | |
| mod-5 | 1585069 | 1587111 | type III R-M system methyltransferase |
Note: DR, different region
Figure 4Comparison of eight H. pylori strains based on strain specific genes of six sequenced strains.
Large variable regions were labeled with colored rectangles. Predicted gastric cancer strain specific regions were labeled with a red rectangle. General genomic variations among all strains were labeled with a green rectangle. A blue rectangle represents diverse regions obtained from comparison of gastric cancer strains and superficial gastritis strains. Corresponding coding proteins were briefly labeled at the bottom of each rectangle. GC vs SG denotes comparison between the two strains of gastric cancer and the two strains of superficial gastritis.
Predicted variable genes based on strain specific genes of the six sequenced strains.
| Strain | DR | Gene | start | end | Gene description |
| DR1 | HP0440 | 457297 | 459330 | DNA topoisomerase I (topA) | |
| HP0441 | 459333 | 459333 | VirB4 homolog | ||
| 26695 | HP0442 | 461749 | 462015 | hypothetical protein | |
| HP0443 | 462016 | 462318 | hypothetical protein | ||
| strain | HP0444 | 462315 | 461756 | hypothetical protein | |
| HP0445 | 463954 | 464139 | hypothetical protein | ||
| specific | DR2 | HP0456 | 475056 | 475508 | hypothetical protein |
| HP0457 | 475826 | 476089 | hypothetical protein | ||
| genes | HP0458 | 476101 | 476337 | hypothetical protein | |
| HP0459 | 476337 | 478913 | virB4 homolog (virB4) | ||
| HP0460 | 479043 | 479531 | hypothetical protein | ||
| HP0461 | 479557 | 479649 | hypothetical protein | ||
| HP0462 | 480062 | 481159 | type I restriction enzyme S protein (hsdS) | ||
| HP1366 | 1427688 | 1428959 | type IIS restriction enzyme R protein (MBOIIR) | ||
| DR3 | HP1367 | 1428975 | 1429757 | type IIS restriction enzyme M1 protein (mod) | |
| HP1368 | 1429744 | 1430607 | type IIS restriction enzyme M2 protein (mod) | ||
| G27 | DR4 | 1046540 | 1048216 | hypothetical protein | |
| 1048432 | 1048734 | hypothetical protein | |||
| strain | 1053564 | 1054550 | competence protein | ||
| 1076345 | 1077382 | hypothetical protein | |||
| specific | 1081055 | 1082440 | hypothetical protein | ||
| 1082440 | 1083642 | hypothetical protein | |||
| genes | 1083705 | 1084781 | integrase-recombinase | ||
| protein | |||||
| DR5 | 1351624 | 1352001 | adenine-specific DNA methylase | ||
| 1419432 | 1420331 | adenine-specific DNA methylase | |||
| HPAG1 | DR6 | HrgA | 94738 | 95736 | HrgA |
| strain | DR7 | 1410157 | 1412718 | hypothetical protein | |
| Specific genes | |||||
| J99 | DR8 | jhp0164 | 178219 | 179565 | putative restriction enzyme |
| strain | jhp0165 | 179558 | 180778 | hypothetical protein | |
| Specific genes | DR9 | jhp0929 | 1032025 | 1032477 | hypothetical protein |
| jhp0930 | 1032591 | 1032833 | hypothetical protein | ||
| topA_3 | 1032846 | 1034906 | topoisomerase I | ||
| jhp0932 | 1034961 | 1035431 | hypothetical protein | ||
| jhp0933 | 1035401 | 1036204 | hypothetical protein | ||
| jhp0934 | 1036277 | 1037296 | hypothetical protein | ||
| jhp0935 | 1037343 | 1037885 | hypothetical protein | ||
| jhp0936 | 1038083 | 1038616 | hypothetical protein | ||
| jhp0937 | 1038613 | 1039878 | hypothetical protein | ||
| P12 | DR10 | virB2-2 | 1396323 | 1396607 | VirB2 type IV secretion protein |
| strain | virB3-2 | 1396619 | 1396882 | VirB3 type IV secretion protein | |
| Specific genes | 1396894 | 1397130 | hypothetical protein | ||
| virB4-2 | 1397130 | 1399706 | VirB4 type IV secretion ATPase | ||
| virB7-2 | 1399703 | 1399843 | VirB7 type IV secretion protein | ||
| virB8-2 | 1399836 | 1400972 | VirB8 type IV secretion protein | ||
| virB9-2 | 1400969 | 1402624 | VirB9 type IV secretion protein | ||
| virB10-2 | 1402591 | 1403829 | VirB10 type IV secretion protein | ||
| 1403813 | 1405597 | hypothetical protein | |||
| 1405501 | 1406055 | hypothetical protein | |||
| 1406068 | 1407030 | hypothetical protein | |||
| 1407047 | 1407322 | hypothetical protein | |||
| virB11-3 | 1407327 | 1408271 | VirB11 type IV secretion ATPase | ||
| 1408268 | 1408786 | hypothetical protein | |||
| virD4-2 | 1408783 | 1411026 | VirD4 coupling protein | ||
| 1413163 | 1413633 | hypothetical protein | |||
| 1413603 | 1414406 | hypothetical protein | |||
| 1414480 | 1415145 | hypothetical protein | |||
| 1415123 | 1415401 | hypothetical protein | |||
| 1415334 | 1415765 | hypothetical protein | |||
| 1415750 | 1416160 | hypothetical protein | |||
| 1416165 | 1416794 | hypothetical protein | |||
| 1416897 | 1417178 | hypothetical protein | |||
| 1417399 | 1417788 | hypothetical protein | |||
| 1417796 | 1418086 | hypothetical protein | |||
| 1418215 | 1418466 | hypothetical protein | |||
| 1418394 | 1419248 | hypothetical protein | |||
| 1421131 | 1421259 | hypothetical protein | |||
| virD2-2 | 1421785 | 1423818 | relaxase | ||
| DR11 | 1466136 | 1466765 | hypothetical protein | ||
| 1470038 | 1471000 | type II R-M system restriction endonuclease | |||
| 1470981 | 1471751 | type II R-M system restriction endonuclease | |||
| 1479129 | 1479917 | hypothetical protein | |||
| 1486667 | 1487452 | hypothetical protein | |||
| 1487452 | 1489119 | hypothetical protein | |||
| 1492159 | 1492494 | hypothetical protein | |||
| 1507434 | 1507625 | hypothetical protein | |||
| 1524027 | 1526738 | DNA polymerase I | |||
| 1528042 | 1530078 | type IIS R-M system methyltransferase | |||
| 1555145 | 1556065 | hypothetical protein | |||
| Shi470 | DR12 | 874998 | 876074 | integrase/recombinase (xerD) | |
| strain | 876139 | 877341 | hypothetical protein | ||
| Specific genes | 877341 | 878726 | hypothetical protein | ||
| 880915 | 882183 | hypothetical protein | |||
| 882184 | 883221 | hypothetical protein | |||
| 883375 | 891786 | hypothetical protein | |||
| 902243 | 903478 | ComB3 protein | |||
| 906132 | 908066 | topoisomerase I | |||
| 908059 | 910557 | DNA transfer protein | |||
| 910570 | 910830 | hypothetical protein | |||
| 910831 | 911133 | hypothetical protein | |||
| 911758 | 913038 | hypothetical protein | |||
| 913049 | 913813 | hypothetical protein | |||
| 913835 | 915454 | hypothetical protein |
Note: DR, different region.
Figure 5Gene content analysis of TFS4 and TFS3 in eighty-four H. pylori isolates by PCR.
The presence of individual genes is indicated in red, and their absence in green. Non-specific amplicon is shown in black.
Distribution of hrgA in different diseases.
| Diseases | hrgA | Total | |
| + | − | ||
| GC | 25 | 8 | 33 |
| GU | 5 | 2 | 7 |
| DU | 4 | 4 | 8 |
| GDU | 0 | 4 | 4 |
| SG | 9 | 14 | 23 |
| AG | 1 | 2 | 3 |
| FD | 3 | 3 | 6 |
| Total | 47 | 37 | 84 |
Note: GC, gastric cancer. GU, gastric ulcer. AG, atrophic gastritis, DU, duodenal ulcer GDU, gastroduodenal ulcer, FD, functional dyspepsia, SG, non-atrophic gastritis.