| Literature DB >> 34521966 |
Dingxue Hu1,2, Yueqi Lu1, Daoming Wang1, Chao Nie3, Yan Li4.
Abstract
Helicobacter pylori is a causative pathogen of many gastric and extra-gastric diseases. It has infected about half of the global population. There were no genome-wide association studies (GWAS) for H. pylori infection conducted in Chinese population, who carried different and relatively homogenous strain of H. pylori. In this work, we performed SNP (single nucleotide polymorphism)-based, gene-based and pathway-based genome-wide association analyses to investigate the genetic basis of host susceptibility to H. pylori infection in 480 Chinese individuals. We also profiled the composition and function of the gut microbiota between H. pylori infection cases and controls. We found several genes and pathways associated with H. pylori infection (P < 0.05), replicated one previously reported SNP rs10004195 in TLR1 gene region (P = 0.02). We also found that glycosaminoglycan biosynthesis related pathway was associated with both onset and progression of H. pylori infection. In the gut microbiome association study, we identified 2 species, 3 genera and several pathways had differential abundance between H. pylori infected cases and controls. This paper is the first GWAS for H. pylori infection in Chinese population, and we combined the genetic and microbial data to comprehensively discuss the basis of host susceptibility to H. pylori infection.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34521966 PMCID: PMC8440747 DOI: 10.1038/s41598-021-97790-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristic statistics of the study population.
| Number (N) | Sex | Age | BMI | ||||
|---|---|---|---|---|---|---|---|
| female count (percent) | Male count (percent) | Mean (SD) | Range | Mean (SD) | Range | ||
| Case | 224 | 127 (56.70%) | 97 (43.30%) | 30.78 (5.46) | 22–58 | 22.01 (3.00) | 15.20–30.60 |
| Control | 256 | 136 (53.13%) | 120 (46.88%) | 31.18 (5.40) | 22–69 | 22.25 (3.22) | 15.70–37.80 |
| Total | 480 | 263 (54.79%) | 217 (45.21%) | 30.99 (5.46) | 22–69 | 22.14 (3.12) | 15.20–37.80 |
After Wilcoxon test, there was no significance in age difference between case and control. In the logistic regression model, H. pylori infection status ~ sex, we found there was no significant correlation between sex and H. pylori infection. BMI had no significant difference in the cases and controls.
Figure 1Manhattan plots of genome-wide association analyses. (a) Manhattan plots presenting the − log10 P values from the GWAS adjusted for the top two principal components of the H. pylori infection cases versus all controls and (b) the DOB values in H. pylori infection cases. The blue line indicates the threshold for suggestive significant association (P < 1e−5). The genes that suggestive significant variants encoded are marked in the plot.
Figure 2Regional association plots for the regions containing associated genes. Regional association plots for the LINC02011 (a), CACNA1B (b), FTO (c) and LARGE1 (d). LINC02011 (a) and CACNA1B (b) were identified based on the GWAS results from the H. pylori infection cases versus controls, FTO (c) and LARGE1 (d) were identified based on the DOB values association study of H. pylori infection cases. The color of the variants is based on the linkage disequilibrium with rs1687312 (LINC02011) (a), rs7875160 (CACNA1B) (b), rs8046411 (FTO) (c) and rs5749679 (LARGE1) (d). For all plots, each point represents a SNP, where the x axis represents the position of the SNPs and the y axis represents the − log10 P values of the GWAS results. Each point is color-coded with the r2 value as calculated with the
source of LD information retrieved from hg38/1000 Genomes Nov 2014 EAS (Asian). Plots showed the most significant SNPs flank 300 kb.
Candidate pathways associated with H. pylori infection in logistic regression model.
| Pathway set | No.SNPs | Chisq(Obs) | TopSNP.Pvalue | TopSNP | |
|---|---|---|---|---|---|
| KEGG_GLYCOSPHINGOLIPID_BIOSYNTHESIS_LACTO_AND_NEOLACTO_SERIES | 18 | 58.7025 | 2.03E−02 | 9.41E−03 | rs872451 |
| KEGG_GLYCOSAMINOGLYCAN_BIOSYNTHESIS_HEPARAN_SULFATE | 80 | 149.922 | 2.50E−02 | 3.82E−03 | rs7760512 |
| KEGG_CYTOSOLIC_DNA_SENSING_PATHWAY | 14 | 37.2182 | 3.88E−02 | 7.01E−03 | rs118156647 |
| KEGG_AXON_GUIDANCE | 120 | 207.521 | 4.33E−02 | 1.90E−03 | rs12572571 |
| KEGG_NATURAL_KILLER_CELL_MEDIATED_CYTOTOXICITY | 25 | 58.616 | 4.68E−02 | 6.96E−03 | rs13420683 |
| KEGG_PRIMARY_IMMUNODEFICIENCY | 25 | 58.616 | 4.68E−02 | 6.96E−03 | rs13420683 |
| KEGG_FC_GAMMA_R_MEDIATED_PHAGOCYTOSIS | 4 | 10.6895 | 4.90E−02 | 3.88E−02 | rs57919378 |
| KEGG_RNA_DEGRADATION | 16 | 37.3961 | 4.96E−02 | 6.68E−03 | rs34113220 |
No.SNPs: number of SNPs in the pathway; Chisq(Obs): sum of chi-squared test-statistics of all SNPs in the set; P value: pathway set-based test P value; TopSNP.Pvalue: smallest single-SNP GWAS P value in the pathway set; TopSNP: the top associated GWAS SNP. The rsID is based on dbSNP build 138.
Candidate pathways associated with H. pylori infection in linear regression model.
| Pathway set | No.SNPs | Chisq(Obs) | TopSNP.Pvalue | TopSNP | |
|---|---|---|---|---|---|
| KEGG_ANTIGEN_PROCESSING_AND_PRESENTATION | 46 | 126.12 | 9.74E−03 | 1.03E−03 | rs12204338 |
| KEGG_AUTOIMMUNE_THYROID_DISEASE | 46 | 126.12 | 9.74E−03 | 1.03E−03 | rs12204338 |
| KEGG_HEDGEHOG_SIGNALING_PATHWAY | 6 | 23.2668 | 1.60E−02 | 5.35E−03 | rs9576171 |
| KEGG_TERPENOID_BACKBONE_BIOSYNTHESIS | 9 | 32.6463 | 1.91E−02 | 8.99E−03 | rs61836189 |
| KEGG_NEUROACTIVE_LIGAND_RECEPTOR_INTERACTION | 29 | 70.052 | 2.28E−02 | 1.56E−03 | rs2146490 |
| KEGG_ADIPOCYTOKINE_SIGNALING_PATHWAY | 19 | 48.4956 | 2.97E−02 | 2.09E−02 | rs12602449 |
| KEGG_JAK_STAT_SIGNALING_PATHWAY | 19 | 48.4956 | 2.97E−02 | 2.09E−02 | rs12602449 |
| KEGG_FOLATE_BIOSYNTHESIS | 21 | 46.2834 | 4.66E−02 | 1.16E−02 | rs13131273 |
| KEGG_GLYCOSAMINOGLYCAN_BIOSYNTHESIS_HEPARAN_SULFATE | 80 | 135.496 | 4.77E−02 | 5.04E−03 | rs12195272 |
No.SNPs: number of SNPs in the pathway; Chisq(Obs): sum of chi-squared test-statistics of all SNPs in the set; P value: pathway set-based test P value; TopSNP.Pvalue: smallest single-SNP GWAS P value in the pathway set; TopSNP: the top associated GWAS SNP. The rsID is based on dbSNP build 138.
Bacteria associated with H. pylori infection in Wilcoxon test.
| Taxo | Mean difference | FDR-adjusted P | Presence ratio | ||
|---|---|---|---|---|---|
| HPI negative | HPI positive | ||||
| 0.0008 | 0.0004 | 0.0579 | 0.2156 | 0.3838 | |
| 0.0002 | 0.0010 | 0.0785 | 0.0596 | 0.1622 | |
| 0.0008 | 0.0003 | 0.0212 | 0.4266 | 0.5946 | |
| 0.0002 | 0.0010 | 0.0349 | 0.0596 | 0.1622 | |
| 0.0648 | 0.0037 | 0.0861 | 0.8716 | 0.8973 | |
Taxo: identified bacteria names at last taxonomic level by MetaPhlAn2; Mean difference: difference of bacteria abundance between H. pylori infection positive group and H. pylori infection negative group; P value: P value of Wilcoxon test; FDR: taxonomic level-based FDR adjusted P value; Presence ratio: ratio of non-zero abundance individuals to all people, HPI negative: H. pylori infection negative; HPI positive: H. pylori infection positive.