| Literature DB >> 33904371 |
Feng Yang1,2,3, Jinghao Zhang3, Su Wang3, Zhaoyang Sun3, Jun Zhou4, Feng Li4, Yue Liu3, Li Ding3, Yixin Liu3, Wenjing Chi3, Tao Liu3, Yongqun He5, Ping Xiang4, Zhijun Bao1,2,6, Michal A Olszewski7, Hu Zhao1,2,3, Yanmei Zhang1,2,3.
Abstract
Severe Helicobacter pylori-linked gastric disorders are especially prevalent in the East Asia region. The ability of H. pylori to cause different clinical outcomes is thought to be associated with unique sets of its genetic features. However, only few genetic features have been definitively linked to specific gastrointestinal pathologies. Genome heterogeneity of clinical H. pylori strains from patients with four different gastric disorders was studied to explore the population structure and molecular genomic features and their association with pathogenicity. Population analysis showed that 92.9% of the Shanghai H. pylori isolates were clustered in the East Asia group. Among 2,866 genes detected in all genomes, 1,146 genes formed the core genome, whereas 209 unique genes were detected in individual disease groups. The unique genes of peptic ulcer and gastric cancer groups represented the inorganic ion transport and metabolism function gene clusters. Sixteen virulence genes were detected with statistically different detection rates among the four disease groups. Furthermore, 127 clustered regularly interspaced short palindromic repeats were found with significantly different rates in the four disease groups. A total of 337 putative genomic islands were identified, and three genomic islands were individually found in more than 10% of strains. The genomic islands included several metabolism-associated genes and many genes with unknown function. In total, 88 sequence types were detected among the 112 Shanghai H. pylori isolates. Our study provides an essential milestone in the mapping of specific genomic features and their functions to identify factors needed to induce specific gastric disorders in H. pylori.Entities:
Keywords: Helicobacter pylori; crispr; gastric diseases; genomic features; genomic island; pathogenicity; population structure
Mesh:
Substances:
Year: 2021 PMID: 33904371 PMCID: PMC8081043 DOI: 10.1080/21505594.2021.1920762
Source DB: PubMed Journal: Virulence ISSN: 2150-5594 Impact factor: 5.882
Figure 1.Contemporary population structure of Shanghai H. pylori isolates (H. pylori-Shi) largely overlaps with that of the entire East Asia (EAsia) cluster with minimal infusion of broader Eurasian clusters
Figure 2.Many identified H. pylori genes exclusive of the H. pylori Core Genome include subsets of genes that appear to be linked with specific pathological outcomes
Figure 3.Functional classification of core genes in 122 H. pylori strains reveals predominant basic structure and metabolism genes as well as a group of genes with unknown functions
The functions of the clinical outcome specific gene clusters
| Different groups | Specific genes | Annotated genes | COG function classifications | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| C | D | E | F | G | H | I | J | L | M | N | O | P | S | T | U | V | |||
| CSG | 15 | 8 | 1 | 1 | 1 | - | - | - | - | - | 3 | - | - | - | - | 2 | - | - | - |
| CAG | 98 | 35 | 3 | 2 | - | 2 | - | 3 | 1 | - | 3 | 4 | 3 | 2 | 4 | 4 | - | 2 | 2 |
| GU | 17 | 3 | - | - | - | - | - | - | - | - | - | - | - | - | - | 3 | - | - | - |
| GC | 79 | 50 | 2 | - | 1 | - | 1 | - | 2 | 3 | 5 | 5 | 2 | 4 | 6 | 16 | 1 | 1 | 1 |
| GC+GU | 17 | 2 | - | - | - | - | - | - | - | - | - | - | - | - | 2 | - | - | - | - |
Note that the GC/GU specific genes are enriched for class P (inorganic ion transport and metabolism), consistent with the role of microbial ion transport pathways in driving GC/GU pathological processes. The annotations of each COG functional classes are as per Figure 3, (leaving out classes without a single hit).
Sixteen virulence genes demonstrate that statistically different frequency of detection among the clinical outcome groups
| Genes | Number and % gene positive isolates in each clinical category | ||||
|---|---|---|---|---|---|
| CAG (n = 66) | CSG (n = 19) | GU (n = 26) | GC (n = 11) | ||
| 61 (92.4%) | 18 (94.7%) | 24 (92.3%) | 6 (54.5%) | 0.008 | |
| 9 (13.6%) | 3 (15.8%) | 4 (15.4%) | 6 (54.5%) | 0.027 | |
| 44 (66.7%) | 11 (57.9%) | 17 (65.4%) | 1 (9.1%) | 0.004 | |
| 61 (92.4%) | 19 (100.0%) | 24 (92.3%) | 6 (54.5%) | 0.003 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 4 (36.4%) | 0.000 | |
| 66 (100.0%) | 17 (89.5%) | 26 (100.0%) | 8 (72.7%) | 0.000 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 8 (72.7%) | 0.001 | |
| 66 (100.0%) | 17 (89.5%) | 26 (100.0%) | 11 (100.0%) | 0.031 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 7 (63.6%) | 0.000 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 9 (81.8%) | 0.007 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 1 (9.1%) | 0.000 | |
| 8 (12.1%) | 4 (21.1%) | 7 (26.9%) | 9 (81.8%) | 0.000 | |
| 8 (12.1%) | 1 (5.3%) | 6 (23.1%) | 9 (81.8%) | 0.000 | |
| 8 (12.1%) | 1 (5.3%) | 6 (23.1%) | 9 (81.8%) | 0.000 | |
| 66 (100.0%) | 18 (94.7%) | 25 (96.2%) | 9 (81.8%) | 0.015 | |
| 66 (100.0%) | 19 (100.0%) | 26 (100.0%) | 9 (81.8%) | 0.007 | |
Among known virulence-associated genes registered in VFDB database, 16 show significantly different distribution in four clinical outcome groups (Note: These and the additional virulence genes that show no significant difference in frequencies among four clinical groups are all shown in Supplemental Table S5).
Figure 4.Three virulence-associated genes with significant different copy numbers among four clinical outcome groups
Figure 5.Structure analysis of three most common putative GIs in H. pylori strains identifies known genes and multiple unknown hypothetical protein genes that could be related to pathogenicity
Figure 6.Distribution and constitution of CRISPRs and the three most predominant and previously unknown CRISPRs in H. pylori strains