| Literature DB >> 25452339 |
Narender Kumar1, Vanitha Mariappan2, Ramani Baddam1, Aditya K Lankapalli1, Sabiha Shaik1, Khean-Lee Goh3, Mun Fai Loke2, Tim Perkins4, Mohammed Benghezal4, Seyed E Hasnain5, Jamuna Vadivelu2, Barry J Marshall4, Niyaz Ahmed6.
Abstract
The discordant prevalence of Helicobacter pylori and its related diseases, for a long time, fostered certain enigmatic situations observed in the countries of the southern world. Variation in H. pylori infection rates and disease outcomes among different populations in multi-ethnic Malaysia provides a unique opportunity to understand dynamics of host-pathogen interaction and genome evolution. In this study, we extensively analyzed and compared genomes of 27 Malaysian H. pylori isolates and identified three major phylogeographic lineages: hspEastAsia, hpEurope and hpSouthIndia. The analysis of the virulence genes within the core genome, however, revealed a comparable pathogenic potential of the strains. In addition, we identified four genes limited to strains of East-Asian lineage. Our analyses identified a few strain-specific genes encoding restriction modification systems and outlined 311 core genes possibly under differential evolutionary constraints, among the strains representing different ethnic groups. The cagA and vacA genes also showed variations in accordance with the host genetic background of the strains. Moreover, restriction modification genes were found to be significantly enriched in East-Asian strains. An understanding of these variations in the genome content would provide significant insights into various adaptive and host modulation strategies harnessed by H. pylori to effectively persist in a host-specific manner.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25452339 PMCID: PMC4288169 DOI: 10.1093/nar/gku1271
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.A circular representation of the genomes of Malaysian isolates: the draft genomes of 27 strains were aligned against the genome of reference strain H. pylori G27. Each genome is represented by a ring. The yellow rings represent H. pylori genomes from Malaysian-Chinese, purple represents those from Malays and light blue represents genomes of Malaysian-Indian strains. The G+C content (%) of the reference genome (strain G27) is represented by a ragged inner circle in black (GC). The variable regions such as plasticity zones (PZ) and cagPAI are compared across all the genomes using BRIG image generator (http://brig.sourceforge.net).
Genome statistics of the sequenced Malaysian H. pylori isolates
| UM018 | UM054 | UM007 | UM034 | UM045 | |
|---|---|---|---|---|---|
| Origin | Malaysian-Indian | Malaysian-Indian | Chinese | Chinese | Malay |
| Avg. genome coverage | 170X | 170X | 150X | 180X | 200X |
| No. of contigs | 72 | 89 | 80 | 27 | 28 |
| Genome size | 1 617 433 | 1 603 218 | 1 568 678 | 1 714 278 | 1 623 876 |
| G+C | 39.05 | 39.12 | 38.84 | 38.59 | 38.96 |
| CDS | 1579 | 1585 | 1557 | 1669 | 1595 |
| Avg. length | 937 | 926 | 924 | 940 | 933 |
| Coding% | 91.5 | 91.6 | 91.7 | 91.5 | 91.7 |
| rRNA | 3 | 4 | 3 | 4 | 3 |
| tRNA | 36 | 36 | 36 | 36 | 36 |
| cagA (EPIYA-motif) | AB-C | AB-C | AB-D | AB-D | AB-C |
| vacA | s1m1 | s2m2 | s1m2 | s1m2 | s1m2 |
| Prophage | Absent | Present (incomplete) | Absent | Absent | Present (intact) |
Figure 2.The whole genome phylogenetic analysis: the figure represents a whole genome comparison-based phylogenetic tree of various complete and draft H. pylori genomes from different geographical regions. The tree was constructed based on neighbor joining algorithm using SplitsTree. The Malaysian strains used in the analysis are labeled in black whereas the other genomes are colored to represent their genotypes.
Figure 3.The analysis of 27 Malaysian genomes: the Neighbor-Joining phylogenetic tree constructed after the alignment of 27 Malaysian H. pylori isolates representing various ethnic groups. The heat plot shows average similarity values among the strains.
Figure 4.The functional COG classification of genes: (A) the COG functional classification representing core genome of the Malaysian H. pylori isolates. (B) Core and specific gene content observed among various strains compared in the study.
Genes differentially present among the core of East-Asian (EA) and non-East-Asian (Non-EA) H. pylori
| Status in | ||||
|---|---|---|---|---|
| Cluster ID | EA ( | Non-EA ( | Predicted functions | Orthologs in 26695 |
| 2426 | 14 | 0 | Lysozyme family protein | HP0339 |
| 2425 | 14 | 0 | Hypothetical protein | HP0344 |
| 2424 | 14 | 0 | Hypothetical protein | HP0346 |
| 2423 | 14 | 0 | Hypothetical protein | Absent |
| 2403 | 14 | 2 | Lipopolysaccharide biosynthesis protein | Absent |
| 2376 | 14 | 4 | Type II restriction endonuclease | HP1537 |
| 2375 | 14 | 4 | Type II methylase | Absent |
| 2370 | 14 | 5 | Glycosyltransferase | Absent |
| 2364 | 14 | 4 | DNA methyltransferase | HP0051 |
| 2402 | 1 | 13 | Hypothetical protein | Absent |
Figure 5.The distribution of RM genes in various strains: the graph shows the number of genes annotated to encode putative RM functions in each strain. The Y-axis represents the number of genes and the X-axis denotes strain names.
Figure 6.The functional classification of differentially evolving genes: the graph shows the COG functional classification of various core genes with signs of differential evolution among East-Asian and non-East-Asian strains. The Y-axis denotes the functional category and the X-axis represents the number of genes in a particular functional category.