Minjuan Li1, Dantong Shao1, Jiachen Zhou2, Jianhua Gu1, Junjie Qin3, Xinqing Li1, Changqing Hao4, Wenqiang Wei1. 1. National Central Cancer Registry, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China. 2. Department of Epidemiology and Biostatistics, School of Public Health, Xi'an Jiaotong University Health Science Center, Xi'an, China. 3. Promegene Institute, Shenzhen, China. 4. Department of Endoscopy, Cancer Institute/Hospital of Linzhou, Linzhou, China.
Abstract
The objective of this study was to describe and compare the dynamic microbiota characteristics in the gastrointestinal (GI) tract in Chinese participants via high-throughput sequencing techniques. The study collected saliva, esophageal swab, cardia biopsy, noncardia biopsy, gastric juice, and fecal specimens from 40 participants who underwent upper GI tract cancer screening in Linzhou (Henan, China) in August 2019. The V4 region of 16S rRNA genes was amplified and sequenced using the Illumina MiniSeq platform. The observed amplicon sequence variants (ASVs) gradually decreased from saliva to esophageal swab, cardia biopsy, noncardia biopsy, and gastric juice specimens and then increased from gastric juice to fecal specimens (P < 0.05). Each GI site had its own microbial characteristics that overlapped those of adjacent sites. Characteristic genera for each site were as follows: Neisseria and Prevotella in saliva, Streptococcus and Haemophilus in the esophagus, Helicobacter in the noncardia, Pseudomonas in gastric juice, Faecalibacterium, Roseburia, and Blautia in feces, and Weissella in the cardia. Helicobacter pylori-positive participants had decreased observed ASVs (cardia, P < 0.01; noncardia, P < 0.001) and Shannon index values (cardia, P < 0.001; noncardia, P < 0.001) compared with H. pylori-negative participants both in cardia and noncardia specimens. H. pylori infection played a more important role in the microbial composition of noncardia than of cardia specimens. In gastric juice, the gastric pH and H. pylori infection had similar additive effects on the microbial diversity and composition. These results show that each GI site has its own microbial characteristics that overlap those of adjacent sites and that differences and commonalities between and within microbial compositions coexist, providing essential foundations for the continuing exploration of disease-associated microbiota. IMPORTANCE Upper gastrointestinal (UGI) tract cancer is one of the most common cancers worldwide, while limited attention has been paid to the UGI microbiota. Microbial biomarkers, such as Fusobacteria nucleatum and Helicobacter pylori, bring new ideas for early detection of UGI tract cancer, which may be a highly feasible method to reduce its disease burden. This study revealed that each gastrointestinal site had its own microbial characteristics that overlapped those of adjacent sites. There were significant differences between the microbial compositions of the UGI sites and feces. Helicobacter pylori played a more significant role in the microbial composition of the noncardia stomach than in that of the cardia. Gastric pH and Helicobacter pylori had similar additive effects on the microbial diversity of gastric juice. These findings played a key role in delineating the microbiology spectrum of the gastrointestinal tract and provided baseline information for future microbial exploration covering etiology, primary screening, treatment, outcome, and health care products.
The objective of this study was to describe and compare the dynamic microbiota characteristics in the gastrointestinal (GI) tract in Chinese participants via high-throughput sequencing techniques. The study collected saliva, esophageal swab, cardia biopsy, noncardia biopsy, gastric juice, and fecal specimens from 40 participants who underwent upper GI tract cancer screening in Linzhou (Henan, China) in August 2019. The V4 region of 16S rRNA genes was amplified and sequenced using the Illumina MiniSeq platform. The observed amplicon sequence variants (ASVs) gradually decreased from saliva to esophageal swab, cardia biopsy, noncardia biopsy, and gastric juice specimens and then increased from gastric juice to fecal specimens (P < 0.05). Each GI site had its own microbial characteristics that overlapped those of adjacent sites. Characteristic genera for each site were as follows: Neisseria and Prevotella in saliva, Streptococcus and Haemophilus in the esophagus, Helicobacter in the noncardia, Pseudomonas in gastric juice, Faecalibacterium, Roseburia, and Blautia in feces, and Weissella in the cardia. Helicobacter pylori-positive participants had decreased observed ASVs (cardia, P < 0.01; noncardia, P < 0.001) and Shannon index values (cardia, P < 0.001; noncardia, P < 0.001) compared with H. pylori-negative participants both in cardia and noncardia specimens. H. pylori infection played a more important role in the microbial composition of noncardia than of cardia specimens. In gastric juice, the gastric pH and H. pylori infection had similar additive effects on the microbial diversity and composition. These results show that each GI site has its own microbial characteristics that overlap those of adjacent sites and that differences and commonalities between and within microbial compositions coexist, providing essential foundations for the continuing exploration of disease-associated microbiota. IMPORTANCE Upper gastrointestinal (UGI) tract cancer is one of the most common cancers worldwide, while limited attention has been paid to the UGI microbiota. Microbial biomarkers, such as Fusobacteria nucleatum and Helicobacter pylori, bring new ideas for early detection of UGI tract cancer, which may be a highly feasible method to reduce its disease burden. This study revealed that each gastrointestinal site had its own microbial characteristics that overlapped those of adjacent sites. There were significant differences between the microbial compositions of the UGI sites and feces. Helicobacter pylori played a more significant role in the microbial composition of the noncardia stomach than in that of the cardia. Gastric pH and Helicobacter pylori had similar additive effects on the microbial diversity of gastric juice. These findings played a key role in delineating the microbiology spectrum of the gastrointestinal tract and provided baseline information for future microbial exploration covering etiology, primary screening, treatment, outcome, and health care products.
Gastrointestinal (GI) microbiota refers to the resident and transient bacteria, viruses, fungi, protozoa, and archaea in the human GI tract and can be considered an organ that coevolved with the host to maintain physiological homeostasis. Microbial homeostasis is vital to the physiological balance of the host, playing key roles in development, acquisition of nutrients, immunity, and multiple biological processes (1–3). Once the microbiota becomes unbalanced, the microenvironment may change, leading to disease (4). To date, scientific clues about the relationship between dysbacteriosis and diseases, including diabetes, obesity, cancer, and mental illness, are emerging (5–8). The development of microbial studies is gradually uncovering the significance of microbiota in the early detection and early diagnosis of disease by microbial biomarkers like Fusobacteria nucleatum (9, 10) and Helicobacter pylori. Additionally, increasing evidence indicates that targeted reconstitution of the gut microbiota, such as by fecal microbiota transplantation (FMT), is an ideal therapeutic strategy against gastrointestinal disorders (11). Apparently, the GI microbiota is closely related to health and disease.Currently, there is more emphasis on single GI sites or multiple specimens from one site, such as feces (12), saliva, the tongue dorsum, and supragingival plaque from the oral cavity (13), or on paired esophageal biopsy and swab specimens from the esophagus (14). In multisite microbial studies (15, 16), there is limited evidence regarding the upper gastrointestinal (UGI) sites, especially the esophagus and cardia, and meanwhile, there is a heavy global burden of UGI cancer (17). To decrease the morbidity and mortality of UGI cancer, early detection is an effective method (18). However, it is still unclear how feasible it may be to replace difficult-to-collect esophageal or gastric specimens with easy-to-collect saliva or fecal specimens for early detection of UGI cancer and precancerous lesions by detection of microbial biomarkers during primary screening. Additionally, H. pylori infection plays an important role in the progression of gastric cancer. The alpha diversity in the stomach is lower in patients with H. pylori-positive status than in patients with H. pylori-negative status (19, 20). H. pylori infection has no significant effect on the microbiota diversity and composition in gastric juice (20). Nevertheless, it is still unclear whether H. pylori infection and gastric pH have an influence on the microbial characteristics in other sites. Moreover, individual differences are a nonnegligible barrier to microbial studies, since they influence the required intensive study, data mining and integration, and translational application.Considering the status of microbial research described above, we conducted a gastrointestinal microbiota study in China to describe and compare the microbial diversities, microbial compositions, and dynamic changes of bacterial populations existing in the oral cavity, esophagus, cardia, noncardia stomach, gastric juice, and feces. This is a crucial step to map the microbial composition, which is of great value to demonstrate and underpin the bacterium-associated diseases affecting human health.
RESULTS
Baseline information for participants.
A total of 40 participants were enrolled in this study, including 22 healthy participants and 18 participants with esophagitis. Half of the participants in both the healthy and the esophagitis group had H. pylori infection. All participants were negative for fecal occult blood testing. Baseline information is summarized in Table 1.
TABLE 1
Demographic information of enrolled participants
Epidemiological factor
No. (%) of participants or value as indicated in the following groupa:
Healthy (n = 22)
Esophagitis (n = 18)
Total (n = 40)
Male
9 (40.91)
9 (50.00)
18 (45.00)
≥56 yrs of age
7 (31.82)
11 (61.11)
18 (45.00)
Median age ± SD (yr)
53 ± 7.49
58 ± 6.72
54 ± 7.20
Married
20 (90.91)
18 (100.00)
38 (95.00)
Agriculture and farming
10 (45.45)
14 (77.78)
24 (60.00)
Primary school or below
11 (50.00)
6 (33.33)
17 (42.50)
Smoker
2 (9.09)
3 (16.67)
5 (12.50)
Never or rarely drinks tea
21 (95.45)
18 (100.00)
39 (97.50)
Eats rice at least 1 to 3 days/wk
18 (81.82)
14 (77.78)
32 (80.00)
Eats pasta every day
16 (72.73)
8 (44.44)
24 (60.00)
Eats harder staple food
8 (36.36)
2 (11.11)
10 (25.00)
Eats livestock meat at least 1 to 3 days/mo
11 (50.00)
4 (22.22)
15 (37.50)
Eats eggs at least 1 to 3 days/mo
11 (50.00)
7 (38.89)
18 (45.00)
Never or rarely drinks milk
20 (90.91)
16 (88.89)
36 (90.00)
Eats vegetables every day
22 (100.00)
17 (94.44)
39 (97.50)
Eats fruits at least 1 to 3 days/mo
15 (68.18)
6 (33.33)
21 (52.50)
Never or rarely eats spicy food
21 (95.45)
16 (88.89)
37 (92.50)
Never or rarely eats hot food
22 (100.00)
18 (100.00)
40 (100.00)
Eats faster
6 (27.27)
4 (22.22)
10 (25.00)
Eats leftovers at least once/wk
9 (40.91)
12 (66.67)
21 (52.50)
Has lost 1 to 3 teeth
8 (36.36)
7 (38.89)
15 (37.50)
H. pylori infection
11 (50.00)
9 (50.00)
20 (50.00)
pH of gastric juice of <2
16 (72.73)
12 (66.67)
28 (70.00)
Median pH ± SD
1 ± 1.47
1 ± 1.70
1 ± 1.57
Family history of cancer
19 (86.36)
11 (61.11)
30 (75.00)
None of the participants drank alcohol. All participants ate regular meals, three meals a day. There was no statistical difference in each factor between the healthy group and the group with esophagitis.
Demographic information of enrolled participantsNone of the participants drank alcohol. All participants ate regular meals, three meals a day. There was no statistical difference in each factor between the healthy group and the group with esophagitis.
Microbial characteristics in six GI sites.
The observed amplicon sequence variants (ASVs) (Fig. 1A) gradually decreased from saliva to esophageal swab, cardia biopsy, noncardia biopsy, and gastric juice specimens and then increased from gastric juice to fecal specimens, which was not obvious in the Shannon index values (Fig. 1B). Taking into consideration multiple-comparison results both in observed ASVs (Fig. 1A) and Shannon index values (Fig. 1B), the alpha diversity was predominantly higher in saliva specimens than in other sites. For beta diversity, the constrained principal coordinate analysis (CPCoA) results presented in Fig. 1C showed there were differences in microbial composition among different GI sites and the microbial composition of feces was distinctly different from those of UGI sites. To be precise, around 0.13%, 0.12%, 0.16%, 0.07%, and 0.21% of the fecal microbiota come from saliva, esophageal swab, cardia biopsy, noncardia biopsy and gastric juice respectively (Fig. S1 in the supplemental material). The phyla Actinobacteria, Bacteroidetes, Fusobacteria, Firmicutes, and Proteobacteria were dominant in all GI sites. For each phylum, the dynamic changes in relative abundance are shown in Fig. 1D. Only Fusobacteria decreased from saliva to esophageal swab, cardia biopsy, noncardia biopsy, gastric juice, and fecal specimens. The relative abundances were 2.90%, 1.93%, 1.87%, 1.38%, 1.32%, and less than 0.01%, respectively. At the genus level, Alloprevotella, Bacteroides, Blautia, Faecalibacterium, Haemophilus, Helicobacter, Neisseria, Porphyromonas, Prevotella, Pseudomonas, Roseburia, Rothia, Streptococcus, and Veillonella were the predominant genera in all GI sites (Fig. 1E). Different GI sites had distinctive microbial compositions due to having different proportions of these genera (Table S1).
FIG 1
Microbial diversity and microbial composition in different GI sites. (A) Observed ASVs in different GI sites. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. ** and *** above the lines indicate P values after Bonferroni correction of less than 0.01 and 0.001, respectively. (B) Shannon index values in different GI sites. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. *, **, and *** above the lines indicate P values after Bonferroni correction of less than 0.05, 0.01, and 0.001, respectively. (C) CPCoA based on weighted UniFrac distance in different GI sites. (D) Microbial compositions including the top 5 phyla in different GI sites. (E) Microbial compositions including the top 15 genera in different GI sites. Top, microbial composition of the top 15 genera in different GI sites; bottom, source composition indicated the composition of different GI sites for each genus.
Microbial diversity and microbial composition in different GI sites. (A) Observed ASVs in different GI sites. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. ** and *** above the lines indicate P values after Bonferroni correction of less than 0.01 and 0.001, respectively. (B) Shannon index values in different GI sites. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. *, **, and *** above the lines indicate P values after Bonferroni correction of less than 0.05, 0.01, and 0.001, respectively. (C) CPCoA based on weighted UniFrac distance in different GI sites. (D) Microbial compositions including the top 5 phyla in different GI sites. (E) Microbial compositions including the top 15 genera in different GI sites. Top, microbial composition of the top 15 genera in different GI sites; bottom, source composition indicated the composition of different GI sites for each genus.Combining the results from distance-based redundancy analysis (db-RDA) (Fig. 2A) and linear discriminant analysis effect size (LEfSe) (Fig. 2B, Fig. S2), we found the characteristic genera among the top 15 genera for each GI site to be as follows: Neisseria and Prevotella in saliva, Streptococcus and Haemophilus in the esophagus, Helicobacter in the noncardia stomach, Pseudomonas in gastric juice, Faecalibacterium, Roseburia, and Blautia in feces, and none in the cardia. For the GI sites with two or more characteristic genera, correlation analysis was performed. Highly positive and statistically different correlations were observed in the characteristic genera of saliva (Neisseria and Prevotella, coefficient = 0.64), the esophagus (Streptococcus and Haemophilus, coefficient = 0.75), and feces (Faecalibacterium and Roseburia, coefficient = 0.78; Faecalibacterium and Blautia, coefficient = 0.76; and Roseburia and Blautia, coefficient = 0.79) (Fig. 2C). Helicobacter was strongly positively correlated with Pseudomonas, and these two genera were negatively correlated with Bacteroides, which was a characteristic genus of feces. Expanding the limitation from the top 15 genera to all genera, the most representative genus of the cardia was Weissella, as analyzed by LEfSe (Fig. 2B, Fig. S2). Additionally, characteristic metabolism pathways for each specimen type are summarized in Table S2, with a comparison to those of the saliva group.
FIG 2
Characteristic genera in different GI sites. (A) db-RDA plot of characteristic genera in each GI site. (B) LEfSe results for characteristic genera in different GI sites. (C) Correlation plot of top 15 genera in six GI sites. CAP, constrained analysis of proximities.
Characteristic genera in different GI sites. (A) db-RDA plot of characteristic genera in each GI site. (B) LEfSe results for characteristic genera in different GI sites. (C) Correlation plot of top 15 genera in six GI sites. CAP, constrained analysis of proximities.
H. pylori infection affected the microbial characteristics of cardia and noncardia biopsy specimens.
The observed ASVs and Shannon index values were lower in H. pylori-positive participants than in H. pylori-negative participants both in cardia (Fig. 3A and B) and noncardia (Fig. 3C and D) specimens. Both in H. pylori-positive (observed ASVs, P < 0.01; Shannon index, P < 0.01) and H. pylori-negative (observed ASVs, P = 0.05; Shannon index, P > 0.05) participants, the observed ASVs and Shannon index values were higher in cardia than in noncardia specimens. For beta diversity, the microbial community was significantly affected by H. pylori infection status, especially for noncardia specimens (Fig. 3E and F). Furthermore, in comparison to the results for cardia and noncardia specimens from H. pylori-positive participants (Fig. 3G), a large-scale overlap was shown between cardia and noncardia specimens of participants without H. pylori infection (Fig. 3H).
FIG 3
Effects of H. pylori infection on microbial diversity in cardia and noncardia biopsy specimens. (A) Observed ASVs between H. pylori-positive and H. pylori-negative cardia biopsy specimens. (B) Shannon index values between H. pylori-positive and H. pylori-negative cardia biopsy specimens. (C) Observed ASVs between H. pylori-positive and H. pylori-negative noncardia biopsy specimens. (D) Shannon index values between H. pylori-positive and H. pylori-negative noncardia biopsy specimens. *, **, and *** indicate P values of less than 0.05, 0.01, and 0.001, respectively. (E) PCoA of H. pylori infection in cardia biopsy specimens. (F) PCoA of H. pylori infection in noncardia biopsy specimens. (G) PCoA of H. pylori-positive status between cardia and noncardia biopsy specimens. (H) PCoA of H. pylori-negative status between cardia and noncardia biopsy specimens.
Effects of H. pylori infection on microbial diversity in cardia and noncardia biopsy specimens. (A) Observed ASVs between H. pylori-positive and H. pylori-negative cardia biopsy specimens. (B) Shannon index values between H. pylori-positive and H. pylori-negative cardia biopsy specimens. (C) Observed ASVs between H. pylori-positive and H. pylori-negative noncardia biopsy specimens. (D) Shannon index values between H. pylori-positive and H. pylori-negative noncardia biopsy specimens. *, **, and *** indicate P values of less than 0.05, 0.01, and 0.001, respectively. (E) PCoA of H. pylori infection in cardia biopsy specimens. (F) PCoA of H. pylori infection in noncardia biopsy specimens. (G) PCoA of H. pylori-positive status between cardia and noncardia biopsy specimens. (H) PCoA of H. pylori-negative status between cardia and noncardia biopsy specimens.The top 5 phyla in cardia and noncardia biopsy specimens were Proteobacteria, Bacteroidetes, Firmicutes, Actinobacteria, and Fusobacteria. The relative abundances of all phyla except Proteobacteria were significantly reduced in noncardia biopsy specimens from the H. pylori-positive group compared with their abundances in cardia biopsy specimens (Table 2). The top 15 genera in cardia biopsy specimens were Helicobacter, Prevotella, Veillonella, Neisseria, Haemophilus, Streptococcus, Pseudomonas, Brevundimonas, Alloprevotella, Fusobacterium, Porphyromonas, Bacteroides, Selenomonas, Stenotrophomonas, and Delftia. However, the genus Stenotrophomonas in cardia biopsy specimens was replaced by Megasphaera in the top 15 genera in noncardia biopsy specimens. Stenotrophomonas and Megasphaera were statistically reduced in the H. pylori-positive group compared to their levels in the H. pylori-negative group both in cardia and noncardia biopsy specimens.
TABLE 2
Comparison of dominant phyla and genera in cardia and noncardia biopsy specimens
Taxon
Avg relative abundance (%) in indicated biopsy specimen type from participants with indicated H. pylori status
P value for indicated pairwise comparisona
Cardia
Noncardia
Positive
Negative
Positive
Negative
PC/NC
PN/NN
PC/PN
NC/NN
Phyla
Proteobacteria
60.56
42.25
86.39
46.35
**
***
***
0.53
Bacteroidetes
15.35
30.54
6.79
28.73
***
***
**
0.74
Firmicutes
20.02
20.91
5.56
18.30
0.44
***
***
0.24
Actinobacteria
1.49
2.48
0.45
1.82
*
***
**
0.56
Fusobacteria
1.78
1.95
0.40
2.36
0.38
***
*
0.41
Genera
Helicobacter
42.56
10.17
79.99
13.91
***
***
***
0.50
Prevotella
11.67
22.77
5.71
22.30
**
**
*
0.80
Veillonella
9.22
7.90
2.83
9.65
0.78
**
0.05
0.21
Neisseria
7.64
7.19
2.55
8.69
0.51
***
0.20
0.82
Haemophilus
5.07
8.72
1.28
9.23
0.11
***
*
0.58
Streptococcus
6.02
4.71
1.49
7.09
0.97
*
**
0.19
Pseudomonas
1.78
6.77
1.35
3.10
*
*
0.84
0.72
Brevundimonas
0.81
2.33
0.23
1.96
0.19
*
0.52
0.97
Alloprevotella
0.52
2.25
0.27
1.71
**
***
*
0.58
Fusobacterium
1.09
1.36
0.22
1.47
0.34
***
*
0.35
Porphyromonas
0.91
1.48
0.19
1.41
0.40
***
*
0.86
Bacteroides
0.93
1.36
0.23
0.93
0.34
*
***
0.25
Selenomonas
0.39
1.13
0.10
0.88
*
**
0.09
0.21
Delftia
0.46
0.83
0.17
0.73
0.20
**
0.11
0.60
PC, H. pylori-positive cardia group; PN, H. pylori-positive noncardia group; NC, H. pylori-negative cardia group; NN, H. pylori-negative noncardia group; *, P < 0.05; **, P < 0.01; ***, P < 0.001.
Comparison of dominant phyla and genera in cardia and noncardia biopsy specimensPC, H. pylori-positive cardia group; PN, H. pylori-positive noncardia group; NC, H. pylori-negative cardia group; NN, H. pylori-negative noncardia group; *, P < 0.05; **, P < 0.01; ***, P < 0.001.For the above-described dominant genera, Table 2 shows the relative abundances in cardia and noncardia biopsy specimens in H. pylori-positive and H. pylori-negative participants. H. pylori infection statistically increased the relative abundances of Helicobacter both in cardia (increased 3.18 times) and noncardia (increased 4.75 times) biopsy specimens. Apart from Helicobacter, the relative abundances of all other genera were statistically decreased in noncardia biopsy specimens from H. pylori-positive participants. The relative abundances of Pseudomonas and Selenomonas were significantly lower in the H. pylori-positive group than in the H. pylori-negative group in both cardia and noncardia biopsy specimens and were similar between cardia and noncardia biopsy specimens in both the H. pylori-positive and the H. pylori-negative group. H. pylori infection expressly reduced the relative abundances of Prevotella (74.40% reduction) and Alloprevotella (84.59% reduction) in noncardia biopsy specimens. In addition, besides its effects on the microbial composition, H. pylori infection had an influence on the human disease (cancer and cardiovascular disease) pathways in noncardia biopsy specimens (Table S3).
H. pylori infection and pH value had an influence on the microbial characteristics of gastric juice.
When both H. pylori infection and gastric pH were considered, the observed ASVs (Fig. 4A) and Shannon index (Fig. 4B) decreased from the higher-pH and H. pylori-positive (HP) group to the higher-pH (H) group, the higher-pH and H. pylori-negative (HN) group, the H. pylori-positive (P) group, the lower-pH and H. pylori-positive (LP) group, the lower-pH (L) group, the H. pylori-negative (N) group, and last, the lower-pH and H. pylori-negative (LN) group. The alpha diversity of the HP group was surprisingly higher than those of the other groups, and there was an additional effect of H. pylori infection and gastric pH on the alpha diversity of gastric juice. However, there was no statistical difference between the alpha diversities of the HP group (observed ASVs = 177.33, Shannon index = 5.23) and the H group (observed ASVs = 164.75, Shannon index = 5.14). For beta diversity, both H. pylori infection (Fig. 4C) and pH value (Fig. 4D) influenced the microbial community in gastric juice, although pH value played a more prominent role.
FIG 4
Microbial diversity and microbial composition in gastric juice. Definitions of groups are as follows: HP group, higher pH and H. pylori positive; H group, higher pH; HN group, higher pH and H. pylori negative; P group, H. pylori positive; LP group, lower pH and H. pylori positive; L group, lower pH; N group, H. pylori negative; LN group, lower pH and H. pylori negative. (A) Observed ASVs in gastric juice. Horizonal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. ** above the lines indicates P values after Bonferroni correction of less than 0.01. (B) Shannon index values in gastric juice. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. * and ** above the lines indicate P values after Bonferroni correction of less than 0.05 and 0.01, respectively. (C) PCoA of H. pylori infection status in gastric juice. (D) PCoA of pH values in gastric juice. (E) Microbial composition including the top 15 genera in gastric juice.
Microbial diversity and microbial composition in gastric juice. Definitions of groups are as follows: HP group, higher pH and H. pylori positive; H group, higher pH; HN group, higher pH and H. pylori negative; P group, H. pylori positive; LP group, lower pH and H. pylori positive; L group, lower pH; N group, H. pylori negative; LN group, lower pH and H. pylori negative. (A) Observed ASVs in gastric juice. Horizonal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. ** above the lines indicates P values after Bonferroni correction of less than 0.01. (B) Shannon index values in gastric juice. Horizontal lines link groups that are statistically different by multiple testing after the Kruskal-Wallis test. * and ** above the lines indicate P values after Bonferroni correction of less than 0.05 and 0.01, respectively. (C) PCoA of H. pylori infection status in gastric juice. (D) PCoA of pH values in gastric juice. (E) Microbial composition including the top 15 genera in gastric juice.Regarding microbial composition, the phyla Proteobacteria (P < 0.001), Firmicutes (P < 0.01), Bacteroidetes (P < 0.01), Actinobacteria (P < 0.05), and Fusobacteria (P < 0.01) were dominant in gastric juice. Among them, only the relative abundances of Proteobacteria, increased gradually from the HP group to the H group, the P group, the LP group, the HN group, the L group, the N group, and finally, the LN group. At the genus level, the top 15 genera were Prevotella, Streptococcus, Neisseria, Veillonella, Haemophilus, Pseudomonas, Rothia, Delftia, Brevundimonas, Bacillus, Pedobacter, Acinetobacter, Herbaspirillum, Amycolatopsis, and Sphingomonas (Fig. 4E). The genera Pseudomonas, Acinetobacter, and Rothia were influenced by H. pylori infection and pH value (Table S4). Additionally, the LEfSe results showed that Rothia was the characteristic genus in the P group, while Pseudomonas and Acinetobacter were the characteristic genera in the N group. The relative abundances of Prevotella, Neisseria, Streptococcus, Veillonella, Haemophilus, Herbaspirillum, and Amycolatopsis were significantly different between the H group and the L group. Pseudomonas, Acinetobacter, Herbaspirillum, and Amycolatopsis were the characteristic genera in the L group by LEfSe. Taking all comparisons based on various grouping methods into consideration, only Pseudomonas, Acinetobacter, and Rothia were significantly different between groups (Table S4).
Microbial characteristics between and within participants.
Between participants, there was microbial similarity both in observed ASVs (P > 0.05) and Shannon index values (P > 0.05). CPCoA analysis shows that the six specimens from each participant had large overlaps (P > 0.05), but only 14% of the variance could be explained. Based on Bray-Curtis distances, there was no interparticipant or intraparticipant difference (P < 0.05). The microbial compositions of the dominant phyla, including Actinobacteria, Bacteroidetes, Firmicutes, Fusobacteria, and Proteobacteria, were similar between participants (Fig. 5A). At the genus level, the average relative abundances of the top 15 genera from the highest to the lowest were as follows: Prevotella (15.87%, P = 0.16), Helicobacter (12.36%, P = 0.56), Pseudomonas (8.06%, P = 0.97), Neisseria (7.89%, P = 0.81), Streptococcus (7.57%, P = 0.90), Veillonella (6.65%, P = 0.57), Haemophilus (6.42%, P = 0.87), Faecalibacterium (2.07%, P = 0.98), Roseburia (1.97%, P = 1.00), Blautia (1.57%, P = 1.00), Rothia (1.24%, P = 0.93), Porphyromonas (1.08%, P = 0.68), Bacteroides (1.07%, P = 0.18), and Alloprevotella (1.02%, P = 0.60) (Fig. 5B). Generally, the microbial characteristics between participants were consistent with the microbial characteristics within participants, which were diverse and dynamic.
FIG 5
Interindividual average relative abundances at phylum and genus levels. (A) Interindividual average relative abundances (%) at the phylum level. For each phylum in each participant, the average relative abundance (%) was the average of this phylum in six sites. (B) Interindividual average relative abundances (%) at the genus level. For each genus in each participant, the average relative abundance (%) was the average of this genus in six GI sites.
Interindividual average relative abundances at phylum and genus levels. (A) Interindividual average relative abundances (%) at the phylum level. For each phylum in each participant, the average relative abundance (%) was the average of this phylum in six sites. (B) Interindividual average relative abundances (%) at the genus level. For each genus in each participant, the average relative abundance (%) was the average of this genus in six GI sites.
DISCUSSION
The microbial characteristics in each GI site are closely related to the anatomical structure, and the microbial compositions overlap in the descending GI tract (15, 21, 22). In this study, there were great differences in the microbial compositions of the upper and lower GI tract. Each GI site had its own microbial characteristics that overlapped those of adjacent sites, and differences and commonalities coexisted in the microbial compositions of different sites. This study provides an opportunity to understand the microbial diversity and similarity in different GI sites in Chinese individuals.The oral cavity, as the initial gateway of the GI tract, is constantly exposed to both inhaled and ingested microbes and has more than 700 species or phylotypes in its microbiota (23), which explains the highest observed ASVs in saliva in this study. With almost 1011 bacterial cells moving from the mouth into the stomach per day (15, 24), the observed ASVs decreased accordingly. They dropped to their lowest point in gastric juice, which was contrary to the findings of Sung et al. (16). Sung et al. (16) et al. found that the mean number of operational taxonomic units (OTUs) was higher in gastric fluid than in gastric mucosa. The disease status of enrolled participants and the sampling sites in the noncardia stomach in the two studies are probable influencing factors. Surprisingly, this study found that the alpha diversity in feces was lower than expected. A previous study showed that the diversity was lower in feces than in saliva (25). The distinct discrepancy in microbial communities between the UGI tract and feces and the dominant role of the characteristic genera in the fecal microbiota may be a reasonable explanation for the lower diversity in feces than saliva. Microbial DNA in different GI sites was extracted by the same protocol in this study, which may be another explanation and needs to be further explored in the future. Additionally, the acid environment influenced the microbial diversity in gastric juice very strongly, which may also explain the surprising upward tendency of alpha diversity from gastric juice to feces in this study (15, 25–27).In this study, compared to the microbial communities of the cardia biopsy, noncardia biopsy, gastric juice, and fecal specimens, the microbial community in saliva specimens was comparatively similar to that of esophageal swab specimens, which is consistent with the findings of Fillon et al. (28) but inconsistent with those of Dong et al. (13). To answer the inconsistencies between this study and Dong’s, the various sampling sites in this study were important explanations for the relevant microbial similarities between saliva and esophageal swab specimens. Neisseria, Prevotella, Streptococcus, and Haemophilus have been characterized in both the oral cavity and the esophagus (13, 29–31). This study found that Neisseria and Prevotella were saliva-specific genera, which was similar to the findings of Segata et al. (15). However, a study (13) comparing three sites in the oral cavity only indicated Neisseria as a saliva-specific genus, and the relative abundances of Neisseria in saliva and tongue dorsum specimens were higher than its abundance in supragingival plaque specimens. Streptococcus and Haemophilus were esophagus-specific genera in this study. The average relative abundances of Streptococcus decreased from esophagus to cardia to noncardia specimens, which is consistent with the findings of Wurm et al. (32). Previous studies have indicated that decreased levels of Streptococcus in the esophagus are associated with gastroesophageal reflux disease (GERD) (33), Barrett’s esophagus (BE) (22), esophageal adenocarcinoma (EAC) (22), and esophageal squamous cell carcinoma (ESCC) (34, 35) and an enrichment of Haemophilus is relevant to eosinophilic esophagitis (EoE) (33). More significant, the combination of Streptococcus and Prevotella (34, 36) or Neisseria (37) is emerging in the exploration of the relationship between microbiota and esophageal health or disease.The gastric microbiota, one of the important components of the gut microbiota, can be shaped by the host physiology as well as by external factors, such as H. pylori infection, proton pump inhibitors (PPIs), and antibiotics (38). In this study, the microbial diversity was lower in the H. pylori-positive group than in the H. pylori-negative group in both cardia and noncardia biopsy specimens. However, there have been different conclusions. Bik et al., Khosravi et al., and Maldonado-Contreras et al. found that the presence of H. pylori did not significantly modify the diversity of gastric microbiota, while Andersson et al. found higher diversity in participants without H. pylori infection than in participants with H. pylori infection (19, 39–41). All participants in the Andersson et al. study were healthy, while all or some participants in the other studies had GI disease. In detail, there was a vicious circle from H. pylori infection to disease. With the progression of disease, the initial influence of H. pylori on other, non-H. pylori growth gradually weakened, and the microbial composition gradually became balanced. One supposition is that the inclusion of cardia biopsy specimens may play an important role in the explanation. As we all know, H. pylori normally inhabits the mucus-lined surface of the gastric corpus and antrum. H. pylori infection is identified as the strongest risk factor for gastric cancer and has a higher risk ratio for noncardia cancer than for cardia cancer (42). Consequently, it is reasonable that the impact of H. pylori on the noncardia microbiota is more significant than its impact on the cardia microbiota. Besides, the results of this study indicated that the microbial diversity was always higher in cardia than in noncardia biopsy specimens, no matter the H. pylori infection status; the acid environment in the stomach may be a reasonable explanation, but there are limited relevant studies. Additionally, the relative abundance of Helicobacter in cardia specimens was second only to its abundance in noncardia specimens, and Streptococcus and Haemophilus were richer in cardia than in noncardia specimens but lower than in esophagus specimens, which reflected the boundary (esophagogastric junction) microbial characteristics of cardia.After H. pylori colonizes the stomach mucosa, it becomes the predominant species of the gastric microbiota. H. pylori is a spiral-shaped, microaerophilic, Gram-negative bacterium with flagella that has urease, catalase, and oxidase activity (43). Every H. pylori carrier will ultimately develop chronic gastritis. In turn, the inflammatory response damages the gastric epithelial lining and allows H. pylori to flourish. Then, H. pylori infection worsens, gastritis results, and acid-secreting glands decrease, leading to an increase in the pH of gastric juice (44). However, the diversities in the HP group, the H group, and the P group were similar to each other, and the HP group was relevantly higher. This suggests that H. pylori infection might not affect stomach acid secretion as much, since the participants in this study did not have gastritis. The relative abundances of Pseudomonas and Acinetobacter increased from abnormal (in the HP group) to normal (in the LN group), while the change in the relative abundance of Rothia was the converse, which was similar to the change of alpha diversity in gastric juice specimens. Besides, Pseudomonas was the characteristic genus of gastric juice in this study. There is a study from Venezuela reporting that Pseudomonas strains may interfere with the identification of H. pylori (45). However, less attention has been paid to Pseudomonas until now, especially in gastric juice.Apart from anatomical sites, individual differences play an important role in the microbial composition. Gall et al. found that microbial communities in UGI sites displayed higher interindividual variation but overlapping community membership between sites (22), and in the upper and lower gut, the intra- and interindividual mucosa-associated microbiota profiles were significantly different in healthy Japanese (27). Hisada et al. found that interindividual variations were significantly larger than intraindividual variations for most of the dominant genera in fecal samples (46). Individual difference is an important and nonnegligible factor for microbial studies. In this study, inter- and intraparticipant microbial similarities were shown based on the Bray-Curtis distances. The variances in study design among the above-mentioned studies are potential reasons for the opposite conclusions described, especially the collection sites, sample types, and sampling methods. Specifically, all samples in the studies of Gall et al. (22) and Kashiwagi et al. (27) were collected via brush sampling, while this study included saliva, tissue biopsy, mucosal swab, gastric juice, and fecal specimens. Additionally, gastrointestinal microbiota can be influenced by various factors, including geographic origin, host genotype, and diet (47). Two microbial studies (48, 49) based in several provinces in China found that the microbial composition in feces was mainly influenced by geography. Previous studies (50–52) based in one or two provinces in China have shown similar patterns in the bacterial, fungal, and viral components. In this study, all participants were from the same village in Linzhou County and had no statistical differences in diet and lifestyle, which effectively reduced the heterogeneity.This study plays a fundamental role in mapping the microbial composition in Chinese participants, which provides the scientific basis for exploring the gastrointestinal-host microbiota relationship in the future. However, there were several limitations in this study. First, due to the strict inclusion criteria and complicated collection procedures, this exploratory study was based on the UGI screening project in China, which ensured the feasibility and reliability of study implementation, while the sample size was small. Second, H. pylori infection was one of the main factors discussed in this study. However, only one biopsy specimen was collected from the noncardia stomach, which made it hard to testify as to the initial infection site and colonization status of H. pylori. Third, we used a fecal occult blood test to determine lower GI disease status in this study, but there was no solid pathological diagnosis, and the fecal occult blood tests of all enrolled participants in this study were negative. Finally, due to the similarity of epidemiological factors and the small sample size, this study did not analyze the interaction between microbiota and epidemiological factors.
Conclusion.
Simply put, differences and commonalities coexist in the microbial composition of different GI sites. Within the individual, each phylum or genus may exist in more than one site, while each site has its own relatively characteristic microbiota. This study provides baseline microbial information, especially for the UGI microbiota, which will help to deepen our understanding of microbial characteristics in the GI tract and then gradually cover the fields of etiology, primary screening, and treatment. However, the relationship between microbiota and epidemiological factors is still unclear. For this, multidisciplinary studies with multicenter cooperation, larger sample sizes, multiple types of specimens, and prospective studies need to be carried out in the future.
MATERIALS AND METHODS
Study participants.
We recruited 40 participants aged 40 to 69 years old at the Linzhou Cancer Hospital in Henan Province in China (Fig. 6). The healthy group included 22 participants that did not have esophageal, cardia, or noncardia stomach lesions, and the esophagitis group included 18 participants that had mild or moderate esophagitis and no lesions in the cardia and noncardia stomach. All participants were local residents who were screened for UGI cancer in August in 2019. All participants had no major metabolic or systemic disease and had negative results in fecal occult blood testing. None of the participants had taken antibiotics or proton pump inhibitors in the past month. Epidemiologic information was collected by trained staff. All participants were clearly informed and signed written informed consent. This study was approved and overseen by the Institutional Review Board of the Cancer Hospital of Chinese Academy of Medical Sciences.
FIG 6
Flowchart of participant enrollment and sample collection.
Flowchart of participant enrollment and sample collection.
Sample collection.
Saliva, esophageal swab, cardia biopsy, noncardia biopsy, gastric juice, and fecal specimens were collected from each participant during the UGI cancer screening. Fresh feces were collected before endoscopic examination by disposable plastic box within 5 min after emission and then smeared on FOBT cards (fecal occult blood test cards; Baso-Diagnostics, Inc., Zhuhai, China) by the same trained staff. After mouth rinsing, saliva was collected in the salivary DNA collection device (saliva DNA sample collection kit; Zeesan Biotech Co., Xiamen, China). Other specimens were collected during the endoscopy examination (no defoaming agent was used before endoscopy examination). First, 15 mL gastric juice was absorbed into the collection device (Polyp Trap; Micro-Tech Co., Ltd., Nanjing, China). Next, cardia and noncardia biopsy specimens were dissected by an experienced endoscopy physician with sterile forceps under strict procedures and then placed into sterile tubes (2.0-mL cryogenic vials; Corning Incorporated, New York, NY, USA). Esophageal swab specimens were collected using sterile swabs (disposable cell brush; Jiuhong Medical Instrument Co., Changzhou, China). If there were lesions, 5 looping brushes were taken at the lesion, and if not, 5 looping brushes were taken at the middle esophagus. After swab collection, the swab head was cut off into a sterile tube (3.0-mL cryogenic tube; Simport Scientific, Beloeil, Canada) containing 1.5 mL of cell-preserving fluid (ThinPrep PreservCyt solution; Hologic, San Diego, CA, USA). All specimens were stored at −80°C immediately after sampling. After all specimens were obtained, we transferred the specimens to the laboratory with dry ice by express delivery. Before DNA extraction, no other processes were performed on specimens.
Gastric pH test, H. pylori test, and fecal occult blood test.
After the absorption of gastric juice, pH test strips (pH-fix 0.0–6.0; Macherey-Nagel GmbH & Co. KG, Düren, Germany) were used for pH testing. All pH tests were performed by the same staff to reduce bias in the color comparisons. The noncardia biopsy specimen from each participant was subjected to Helicobacter pylori testing by rapid urease test based on the UGI screening project. The fresh biopsy specimen was placed into the substrate reaction solution with a bamboo stick, which was fully oscillated and placed for around 5 min at room temperature. H. pylori testing and result determination were conducted by the same staff. Additionally, fecal occult blood testing was performed by the pyramidon semiquantitative method (fecal occult blood reagent; Baso Diagnostics, Inc., Zhuhai, China) or a home self-testing device (Hangzhou New Horizon Health Technology Co., Ltd., Hangzhou, China).
Bacterial DNA extraction, amplification, and sequencing.
Bacterial DNA in all specimens was extracted based on the protocol in the study of Manichanh et al. (53) and stored in Tris-EDTA buffer solution at −80°C prior to other processes. DNA amplicons covering the V4 region were generated using the following primers: 515F (5′-GTGYCAGCMGCCGCGGTAA-3′) and 806R (5′-GGACTACNVGGGTWTCTAAT-3′) (54, 55). The PCR mixtures contained 1 μL of forward and reverse primer (10 μmol/L), 1 μL of template DNA, 4 μL of deoxyribonucleoside triphosphates (dNTPs) (2.5 mmol/L), 5 μL of 10× EasyPfu buffer, 1 μL of EasyPfu DNA polymerase (2.5 U/μL), and 1 μL of double-distilled water in a 50-μL reaction mixture volume. The PCR thermal cycling consisted of an initial denaturation step at 95°C for 5 min, followed by cycles of denaturation at 94°C for 30 s, annealing at 60°C for 30 s, and extension at 72°C for 40 s, with a final extension step at 72°C for 4 min. The numbers of cycles were as follows: 25 cycles for saliva and fecal specimens, 30 cycles for swab and biopsy specimens, and 35 cycles for gastric juice specimens (Fig. 6) Amplicons from each specimen were run on an agarose gel to ensure consistent sequencing length. Amplicons were quantified with the Qubit double-stranded DNA (dsDNA) high-sensitivity (HS) assay kit (catalog no. Q32854; Thermo Fisher Scientific/Invitrogen, Waltham, MA, USA). Then, the amplicon libraries were pooled at an equal mass of 100 ng per sample and were sequenced on the Illumina MiniSeq platform. In the DNA extraction, PCR amplification, and sequencing steps, positive and negative quality controls were included along with the study samples in each batch. The positive quality control was a fecal sample that had been deeply sequenced previously, and the negative control was a sample of water without DNA or contamination.
Sequence processing and taxonomic classification.
In this study, 240 specimens were successfully sequenced. Fecal samples were sequenced with more than 30,000 reads, while other samples depended on the actual situation. The common quality control process included removal of low-quality, artifactual, and chimeric reads, sequencing error corrections, and read-length trimming. The DADA2 algorithm was used to remove possible phiX reads and chimeric sequences, correct sequencing errors, and generate amplicon sequence variants (ASVs) (56). To avoid sequencing errors at the ends, the remaining reads were truncated from 0 to 150 bp. Paired-end reads were connected at the minimum overlap length of 20 bp between the forward and reverse reads, and maximum 2-bp mismatches were allowed inside the overlap zone. Sequence data were processed using the Quantitative Insights into Microbial Ecology (QIIME2) platform (57). Taxonomic assignment of the ASVs was determined based on a pretrained naive Bayes classifier (trained on the SILVA database [58, 59]) via the q2-feature-classifier plugin. To avoid sampling depth bias, 4,000 reads were randomly selected from each sample when calculating the relative abundances of ASVs and taxa.
Group definition.
Several grouping methods were used in this study, as follows. (i) For GI site, based on the specimen collection site, specimens were assigned to the saliva group, esophagus group, cardia group, noncardia group, gastric juice group, or feces group. (ii) For H. pylori infection, participants were divided into the positive (P) group or the negative (N) group. (iii) For gastric pH, depending on whether the pH value of gastric juice was lower than or higher than 2, participants were divided into the lower (L) group or the higher (H) group. (iv) For H. pylori infection and gastric pH, the groups included the HP group (higher pH and H. pylori positive), the LP group (lower pH and H. pylori positive), the HN group (higher pH and H. pylori negative), and the LN group (lower pH and H. pylori negative).
Statistical analysis.
Epidemiological factors (Table 1) were compared between healthy participants and participants with esophagitis by the chi-square test or Fisher’s test. Alpha diversities (observed ASVs and Shannon index) were compared between two groups by the Wilcoxon test and among more than two groups by the Kruskal-Wallis test. After the Kruskal-Wallis test, Dunn’s test was used for multiple testing (Pairwise Multiple Comparisons of Mean Rank Sums [PMCMR] package). Beta diversity analysis was based on Bray-Curtis distance or weighted UniFrac distance. If there were more than three groups, constrained principal coordinate analysis (CPCoA) was performed for beta diversity, or else principal coordinate analysis (PCoA) was performed (Analyses of Phylogenetics and Evolution [APE] package). The adonis test was performed after PCoA (Vegan package). Microbial compositions at the phylum and genus levels were described and compared using the Kruskal-Wallis test or the Wilcoxon test. Additionally, SourceTracker was performed as described in Knights et al., where the fecal specimens were sink specimens (60). Then, distance-based redundancy analysis (db-RDA) and linear discriminant analysis effect size (LEfSe) were conducted by using the Vegan package and Galaxy website to explore the characteristic genus. In addition, correlation analysis was performed for the dominant genera (Corrplot package). Moreover, functional profiling of the GI microbiota was performed using the PICRUSt2 algorithm (61). For each sample, the composition of KEGG orthologs (KOs) was predicted based on the functional information on the reference ASVs (62). KEGG pathway compositions were generated according to the assignment of KOs. Between interindividual and intraindividual data, the Wilcoxon test was used to compare the microbial diversities based on Bray-Curtis distance. A conventional level of significance (P < 0.05) was used for rejecting the null hypothesis. The Bonferroni correction was used to adjust P values in multiple testing. All statistical analysis was performed with R Studio (version 1.1.456).
Ethics declaration.
This study was approved and overseen by the Institutional Review Board of the Cancer Hospital of Chinese Academy of Medical Sciences.
Data availability.
The sequencing data are available in the Genome Sequence Archive (GSA) database under accession number CRA004966.
Authors: Dan Knights; Justin Kuczynski; Emily S Charlson; Jesse Zaneveld; Michael C Mozer; Ronald G Collman; Frederic D Bushman; Rob Knight; Scott T Kelley Journal: Nat Methods Date: 2011-07-17 Impact factor: 28.547
Authors: Christian Quast; Elmar Pruesse; Pelin Yilmaz; Jan Gerken; Timmy Schweer; Pablo Yarza; Jörg Peplies; Frank Oliver Glöckner Journal: Nucleic Acids Res Date: 2012-11-28 Impact factor: 16.971