| Literature DB >> 25503415 |
Selva Perumal Gunaletchumy1, Indran Seevasant1, Mun Hua Tan2, Laurence J Croft2, Hazel M Mitchell3, Khean Lee Goh4, Mun Fai Loke1, Jamuna Vadivelu1.
Abstract
Helicobacter pylori infection results in diverse clinical conditions ranging from chronic gastritis and ulceration to gastric adenocarcinoma. Among the multiethnic population of Malaysia, Indians consistently have a higher H. pylori prevalence as compared with Chinese and Malays. Despite the high prevalence of H. pylori, Indians have a relatively low incidence of peptic ulcer disease and gastric cancer. In contrast, gastric cancer and peptic ulcer disease incidence is high in Chinese. H. pylori strains from Chinese strains predominantly belong to the hspEAsia subpopulation while Indian/Malay strains mainly belong to the hspIndia subpopulation. By comparing the genome of 27 Asian strains from different subpopulations, we identified six genes associated with risk of H. pylori-induced peptic ulcer disease and gastric cancer. This study serves as an important foundation for future studies aiming to understand the role of bacterial factors in H. pylori-induced gastro-duodenal diseases.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25503415 PMCID: PMC5377019 DOI: 10.1038/srep07431
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Phylogenetic tree constructed on a whole genome of 33 H. pylori strains.
Asian strains used in this study mostly clustered together with hspIndia and hspEAsia strains according to host ethnicity (except UM037).
Details of strains used in this study
| Strain | Ethnicity | Age | Gender | Year of Isolation | Geographical | Disease | Reference |
|---|---|---|---|---|---|---|---|
| GC26 | Chinese | 70 | Female | 2005 | Malaysia | Gastric cancer | |
| F57 | Japanese | - | - | - | Japan | Gastric cancer | |
| F32 | Japanese | - | - | - | Japan | Gastric cancer | |
| XZ274 | Tibetan | - | - | - | China | Gastric cancer | |
| UM023 | Chinese | 71 | Female | 2011 | Malaysia | Peptic ulcer disease | |
| UM065 | Chinese | 57 | Female | 2011 | Malaysia | Peptic ulcer disease | |
| UM066 | Chinese | 30 | Female | 2011 | Malaysia | Peptic ulcer disease | |
| UM077 | Chinese | 59 | Female | 2011 | Malaysia | Peptic ulcer disease | |
| UM067 | Indian | 64 | Male | 2011 | Malaysia | Peptic ulcer disease | |
| UM084 | Malay | 60 | Male | 2011 | Malaysia | Peptic ulcer disease | |
| UM114 | Indian | 42 | Female | 2012 | Malaysia | Peptic ulcer disease | |
| F30 | Japanese | - | - | - | Japan | Peptic ulcer disease | |
| HP51 | Korean | - | - | - | Korea | Peptic ulcer disease | - |
| HLJ271 | - | - | - | - | China | Peptic ulcer disease | |
| FD506 | Chinese | 56 | Female | 2005 | Malaysia | Non-ulcer dyspepsia | |
| FD568 | Chinese | 50 | Female | 2005 | Malaysia | Non-ulcer dyspepsia | |
| FD577 | Chinese | 67 | Female | 2005 | Malaysia | Non-ulcer dyspepsia | |
| UM038 | Chinese | 73 | Male | 2011 | Malaysia | Non-ulcer dyspepsia | |
| UM085 | Chinese | 68 | Female | 2011 | Malaysia | Non-ulcer dyspepsia | |
| UM111 | Chinese | 76 | Male | 2012 | Malaysia | Non-ulcer dyspepsia | |
| FD662 | Malay | 47 | Female | 2006 | Malaysia | Non-ulcer dyspepsia | |
| FD719 | Malay | 60 | Female | 2006 | Malaysia | Non-ulcer dyspepsia | |
| FD703 | Malay | 51 | Male | 2006 | Malaysia | Non-ulcer dyspepsia | |
| FD423 | Indian | 39 | Male | 2005 | Malaysia | Non-ulcer dyspepsia | |
| FD430 | Indian | 34 | Male | 2005 | Malaysia | Non-ulcer dyspepsia | |
| FD535 | Indian | 44 | Male | 2005 | Malaysia | Non-ulcer dyspepsia | |
| UM037 | Indian | 67 | Female | 2011 | Malaysia | Non-ulcer dyspepsia |
Candidate genes identified as associated with GC, PUD and NUD with percentage of identity. Percentage of identity generated by RAST >80% was considered similar
| Gene/Function | GC | PUD | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Chinese | Japanese | Tibetan | Chinese | ||||||
| GC26 | F57 | F32 | XZ274 | UM023 | UM065 | UM066 | UM077 | HLJ271 | |
| Membrane protein GC26_77 | 100 | 78.7 | 86.6 | 88.67 | 0 | 0 | 0 | 0 | 0 |
| Hypothetical ATPase protein GC26_69 | 100 | 93.8 | 98.6 | 95.65 | 0 | 0 | 0 | 0 | 76.88 |
| Hypothetical protein GC26_73 | 100 | 92.1 | 97.4 | 88.24 | 0 | 0 | 74.1 | 0 | 0 |
| Outer membrane protein GC26_66 (type 1) | 100 | 96.2 | 98.0 | 97.6 | 0 | 98.08 | 96.63 | 96.46 | 98.02 |
| Outer membrane protein GC26_66 (type 2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Phospho-2-Dehydro-3-Deoxyheptonate Aldolase | 100 | 99.6 | 99.0 | 98.66 | 99.33 | 98.44 | 99.33 | 98.66 | 99.33 |
| Hypothetical protein GC26_33 | 100 | 84.1 | 100 | 98.18 | 100 | 96.36 | 90.91 | 96.36 | 0 |
| Membrane Protein GC26_77 | 0 | 0 | 0 | 0 | 0 | 89.3 | 95.2 | 100 | 96.0 |
| Hypothetical ATPase Protein GC26_69 | 82.55 | 92.22 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hypothetical protein GC26_73 | 89.19 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Outer membrane protein GC26_66 (type 1) | 97.12 | 98.56 | 0 | 0 | 0 | 0 | |||
| Outer membrane protein GC26_66 (type 2) | 0 | 0 | 95.43 | 93.91 | 94.42 | 0 | 0 | 0 | 0 |
| Phospho-2-Dehydro-3-Deoxyheptonate Aldolase | 98.66 | 97.09 | 97.55 | 0 | 98.22 | 0 | 0 | 0 | 0 |
| Hypothetical protein GC26_33 | 74.55 | 0 | 0 | 0 | 81.82 | 0 | 0 | 0 | 0 |
| Membrane Protein GC26_77 | 87.3 | 92.0 | 93.0 | 96.7 | 90.0 | 92.2 | 94.0 | 93.2 | 89.3 |
| Hypothetical ATPase Protein GC26_69 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hypothetical protein GC26_73 | 0 | 94.7 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Outer membrane protein GC26_66 (type 1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Outer membrane protein GC26_66 (type 2) | 0 | 0 | 94.92 | 95.94 | 96.45 | 95.43 | 93.40 | 96.45 | 94.44 |
| Phospho-2-Dehydro-3-Deoxyheptonate Aldolase | 99.33 | 99.0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hypothetical protein GC26_33 | 100 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Percentage of similarity >80% is considered significa.
Significance of association of candidate genes with disease was examined by Student's t-test
| Student's t-test (P-value) | ||
|---|---|---|
| Genes/Function | GC vs. PUD | GC vs. NUD |
| 0.000 | 0.390 | |
| 0.000 | 0.000 | |
| 0.000 | 0.000 | |
| 0.282 | 0.005 | |
| 0.308 | 0.000 | |
| 0.022 | 0.000 | |
*. P-value <0.05 is considered significant.
**. P-value <0.01 is considered highly significant.
Structural and domain prediction of candidate genes by Blast2Go
| Gene/Function | Accession No. | Gene Length (bp) | Hits | Min. e-value | Mean Similarity | Go ID | InterPro |
|---|---|---|---|---|---|---|---|
| 456 | 20 | 1.11E-70 | 92.2% | - | i. Transmembrane protein | ||
| 1683 | 20 | 0 | 86.35% | Fatty acid biosynthetic process: GTP catabolic process; GTPase activity; GTP binding | i. Dynamin, GTPase domainii.P-loop containing nucleoside triphosphate hydrolase | ||
| 150 | 6 | 5.58E-17 | 93.5% | - | i. Signal peptide | ||
| 2394 | 20 | 5.99E-140 | 98.65% | - | i. | ||
| 1353 | 20 | 0 | 99.05% | 3-deoxy-7-phosphoheptulonate synthase activity; aromatic amino acid family biosynthetic process | DAHP synthetase, class II | ||
| 168 | 20 | 4.57E-28 | 93.55% | - | i. Transmembrane protein |
Pearson's correlation coefficient test on relationship between candidate gene and disease and between different genes
| GC | PUD | NUD | Race | Membrane protein GC26_77 | Hypothetical ATPase protein GC26_69 | Hypothetical protein GC26_73 | Outer membrane protein GC26_66 (type 1) | Phospho-2-dehydro-3-deoxyheptonate aldolase | Hypothetical protein GC26_33 | ||
|---|---|---|---|---|---|---|---|---|---|---|---|
| GC | Pearson Correlation | 1 | −0.320 | −0.402 | −0.346 | 0.287 | 0.760 | 0.739 | 0.546 | 0.377 | 0.523 |
| Sig. (2-tailed) | 0.104 | 0.038 | 0.077 | 0.147 | 0.000 | 0.000 | 0.003 | 0.052 | 0.005 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| PUD | Pearson Correlation | −0.320 | 1 | −0.739 | −0.168 | −0.996 | 0.028 | −0.138 | 0.363 | 0.527 | 0.269 |
| Sig. (2-tailed) | 0.104 | 0.000 | 0.403 | 0.000 | 0.890 | 0.491 | 0.063 | 0.005 | 0.175 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| NUD | Pearson Correlation | −0.402 | −0.739 | 1 | 0.408 | 0.759 | −0.567 | −0.392 | −0.739 | −0.778 | −0.632 |
| Sig. (2-tailed) | 0.038 | 0.000 | 0.035 | 0.000 | .002 | 0.043 | 0.000 | 0.000 | 0.000 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Race | Pearson Correlation | −0.346 | −0.168 | 0.408 | 1 | 0.176 | −0.488 | −0.310 | −0.636 | −0.474 | −0.549 |
| Sig. (2-tailed) | 0.077 | 0.403 | 0.035 | 0.380 | 0.010 | 0.116 | 0.000 | 0.012 | 0.003 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Membrane protein GC26_77 | Pearson Correlation | 0.287 | −0.996 | 0.759 | 0.176 | 1 | −0.054 | 0.113 | −0.384 | −0.556 | −0.297 |
| Sig. (2-tailed) | 0.147 | 0.000 | 0.000 | 0.380 | 0.789 | 0.575 | 0.048 | 0.003 | 0.132 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Hypothetical ATPase protein GC26_69 | Pearson Correlation | 0.760 | 0.028 | −0.567 | −0.488 | −0.054 | 1 | 0.667 | 0.771 | 0.528 | 0.382 |
| Sig. (2-tailed) | 0.000 | 0.890 | 0.002 | 0.010 | 0.789 | 0.000 | 0.000 | 0.005 | 0.049 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Hypothetical protein GC26_73 | Pearson Correlation | 0.739 | −0.138 | −0.392 | −0.310 | 0.113 | 0.667 | 1 | 0.586 | 0.531 | 0.513 |
| Sig. (2-tailed) | 0.000 | 0.491 | 0.043 | 0.116 | 0.575 | 0.000 | 0.001 | 0.004 | 0.006 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Outer membrane protein GC26_66 | Pearson Correlation | 0.546 | 0.363 | −0.739 | −0.636 | −0.384 | 0.771 | 0.586 | 1 | 0.687 | 0.602 |
| Sig. (2-tailed) | 0.003 | 0.063 | 0.000 | 0.000 | 0.048 | 0.000 | 0.001 | 0.000 | 0.001 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Phospho-2-dehydro-3-deoxyheptonate aldolase | Pearson Correlation | 0.377 | 0.527 | −0.778 | −0.474 | −0.556 | 0.528 | 0.531 | 0.687 | 1 | 0.740 |
| Sig. (2-tailed) | 0.052 | 0.005 | 0.000 | 0.012 | 0.003 | 0.005 | 0.004 | 0.000 | 0.000 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
| Hypothetical protein GC26_33 | Pearson Correlation | 0.523 | 0.269 | −0.632 | −0.549 | −0.297 | 0.382 | 0.513 | 0.602 | 0.740 | 1 |
| Sig. (2-tailed) | 0.005 | 0.175 | 0.000 | 0.003 | 0.132 | 0.049 | 0.006 | 0.001 | 0.000 | ||
| N | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | 27 | |
*. Correlation is significant at the 0.05 level (2-tailed).
**. Correlation is significant at the 0.01 level (2-tailed).
Figure 2Sequence alignment of translated outer membrane protein GC26_66 gene.
According to the translated peptide, sequences were grouped as type 1 and type 2. Type 1 strains were GC26, F32, F57, XZ274, UM065, UM066, UM077, F30, HP51 and HLJ271. Type 2 strains were UM067, UM084, UM114, FD662, FD703, FD719, FD423, FD430, FD535 and UM037. There was close correlation with host ethnicity.
Figure 3Correlation map of candidate genes with disease states and between genes.
The correlation was determined by Pearson's correlation (Table 5). Dotted line represents negative correlation and full line represents positive correlation. Bold line represents strong correlation with significance <0.01 and fine line represents correlation with significance <0.05. Genes outside the boxes represent genes present in the respective disease strains (Student's t-test) but no significant correlation with respective disease status (Pearson's).