Literature DB >> 35720267

Helicobacter pylori genomes reveal Paleolithic human migration to the east end of Asia.

Rumiko Suzuki1, Naruya Saitou2, Osamu Matsuari3, Seiji Shiota4, Takashi Matsumoto1, Junko Akada1, Nagisa Kinjo5, Fukunori Kinjo6, Kuniko Teruya7, Makiko Shimoji7, Akino Shiroma7, Mototsugu Kato8, Kazuhito Satou7, Takashi Hirano7, Masahiro Asaka9, Kirill Kryukov2, Yoshan Moodley10, Yoshio Yamaoka1,11.   

Abstract

A virulence bacterium, Helicobacter pylori, evolved parallel to its host human, therefore, can work as a marker for tracing the human migration. We found H. pylori strains indigenous in the southernmost islands of Japanese Archipelago, Okinawa, and defined them as hspOkinawa and hpRyukyu. Genome data of the strains revealed that hspOkinawa diverged from other East Asian strains about 20,000 years ago, and that hpRyukyu diverged about 45,000 years ago. The closest strains of hpRyukyu were found from Afghanistan, Punjab, and Nepal, which suggest this strain originated in the central Asia and traveled across the Eurasian continent during Paleolithic era. The divergence date of hpRyukyu corresponds with human fossil records in Okinawa. Although it is controversial from human DNA analyses whether descendants of the Paleolithic migrants remain in the modern Japanese population, this study reveals that the bacterium of Paleolithic origin remains in the stomachs of current Japanese.
© 2022 The Authors.

Entities:  

Keywords:  Bacteriology; Evolutionary history; Gastroenterology; Medical Microbiology; Microbial genetics; Phylogeny

Year:  2022        PMID: 35720267      PMCID: PMC9204748          DOI: 10.1016/j.isci.2022.104477

Source DB:  PubMed          Journal:  iScience        ISSN: 2589-0042


Introduction

H. pylori adapted to the human stomach before the Out-of-Africa movement of anatomically modern humans around 70,000 years ago, and now approximately half of the global population carries H. pylori (Covacci et al., 1999). This bacterium transmits mainly from family members in childhood, and the infection continues throughout life unless eradicated (Kuipers et al., 2000). Because the evolution of H. pylori parallels that of humans, we can infer human migrations through genetic analyses of H. pylori (Linz et al., 2007; Moodley et al., 2009). Conventionally, population and phylogenetic analyses of H. pylori have been conducted based on a multi-locus sequence typing (MLST) method utilizing concatenated sequences of seven housekeeping genes (atpA, efp, mutY, ppa, trpC, ureI, and yphC) (Maiden et al., 1998). Seven major bacterial populations of H. pylori (hpAfrica1, hpAfrica2, hpNEAfrica, hpEurope, hpAsia2, hpEastAsia, and hpSahul) (Falush et al., 2003b; Linz et al., 2007; Moodley et al., 2009) were identified, and the phylogenetic relationship of these seven populations illuminated the modern human dispersal from Africa to the Eurasian continent (Linz et al., 2007), to the Americas (Falush et al., 2003b; Moodley and Linz, 2009), and to Oceania (Moodley et al., 2009). HpEastAsia is further divided into three subpopulations: hspEAsia that distributes in the eastern area of the Eurasian continent, hspMaori that expanded from Taiwan to Oceania, and hspIndigenousAmericas (Moodley et al., 2021) that migrated to North and South America via the Bering Land Bridge. In our previous study based on the MLST method, we found two H. pylori populations specific to Okinawa: one of the populations diverged earlier than any of the East Asian strains, and the other one diverged more recently but makes a distinct cluster within hpEastAsian strains (Matsunari et al., 2012). Okinawa area consists of the southmost islands of the Japanese Archipelago and ruled by Ryukyu dynasty before they were merged by the main island government in 1879. In this study, we name the group with the older divergence as hpRyukyu and the group with more recent divergence as hspOkinawa, regarding this group as a sub-population of hpEastAsia. HpRyukyu and hspOkinawa strains account for 16% and 14% of current Okinawa strains, respectively, and the rest belong to hspEAsia, the same group of strains on the main island of Japan. Our previous study also revealed that hpRyukyu and hspOkinawa strains are less virulent than hspEAsia strains because of the difference of CagA, one of the major virulence genes of H. pylori (Matsunari et al., 2012). HpRyukyu strains possess CagA whose amino acid motif is similar to that of hpEurope strains. However, the CagA motif of hpRyukyu was originally found only from Okinawa but no other area. Therefore, it was named as J-Western CagA as “J” stands for “Japan.” HspOkinawa strains lack CagA, so the least virulent. Mostly, all of strains in the main island of Japan belong to hspEAsia, which are also common in China and Korea. It was probably brought by people from the continent with rice cultivation. In contrast, the origin of hpRyukyu and hspOkinawa was enigmatic. We investigated virulence genes and seven housekeeping genes only in our previous study but advancement of DNA sequencers enabled us to obtain the whole genomes data of H. pylori. We investigated not only the whole genomes of Okinawa strains but also the genomes of global strains to speculate when and from where the host human population arrived at Japan.

Results

Population classification and genome tree

First, we defined hpRyukyu and hspOkinawa by the MLST method using STRUCTURE (Falush et al., 2003a) and DAPC (Jombart et al., 2010). The results are shown in Figures S1–S5 and Tables S1 and S2. We obtained genome data of H. pylori from Okinawa (hpRyukyu and hspOkinawa), Alaska, Colombia, Thailand, Bhutan, and Nepal sequenced by a PacBio, and reconstructed the complete whole genome totaling 21 strains (Table S3). Alaskan and Colombian strains were those collected from aboriginal Americans. We integrated our genome data with the data obtained from the public database. We concatenated a sequence of 619 orthologous genes totaling 88 strains (495,102 bp) and constructed a phylogenetic tree (Figure 1).
Figure 1

Phylogenetic tree using genome data of 88 strains

NJ-tree (Kimura 2-parameter) based on concatenated sequences of 619 genes (alignment length 495,102 bp. Encircled strains are those we sequenced by PacBio for this study. The squares on the right side of the tree represent population classification by STRUCTURE analysis of MLST data. The marks on the right of the boxes represent cagA genotype: ▵ cagA (−), ∗ J-Western cagA, a: Amerind cagA, e: East Asian cagA, w: Western cagA.

Phylogenetic tree using genome data of 88 strains NJ-tree (Kimura 2-parameter) based on concatenated sequences of 619 genes (alignment length 495,102 bp. Encircled strains are those we sequenced by PacBio for this study. The squares on the right side of the tree represent population classification by STRUCTURE analysis of MLST data. The marks on the right of the boxes represent cagA genotype: ▵ cagA (−), ∗ J-Western cagA, a: Amerind cagA, e: East Asian cagA, w: Western cagA. Of the 88-sequence data, HP14065, UM211, UM139, and HP15060 were not complete genomes but contig sequences (Chua et al., 2019). First, we could find only one strain that is close to hpRyukyu, a Nepalian strain NP05-124, in the complete genome data, then we explored H. pylori contig sequences deposited in GenBank and found the above four strains. We asked the author of the paper about the origin of these strains and were informed that HP14065, UM211, UM139, and HP15060 were from Afghanistan, unknown origin, Punjab, and Nepal, respectively. The hspOkinawa sub-branch was located between hspEAsia and hspIndigenousAmericas strains, as displayed by the colored squares on the right of the branches. The hspIndigenousAmericas strains were further divided into North (Alaska and Canada) and South (Peru, Colombia, and Venezuela) American sub-branches. Hokkaido strains, which were isolated from northern aboriginal Japanese, Ainu, belonged to the North American branch. Figure 1 also shows the genotypes of cagA, one of the well-known H. pylori virulence genes. CagA is one of the member genes of cag pathogenicity island (cagPAI) and classified as Western cagA, East Asian cagA, Amerind cagA, and J-Western cagA (Figure S6). Because cagPAI is not essential for survival, some strains lack cagPAI (cagA (−)). HspOkinawa strains are cagA (−) and so are the North American and Hokkaido strains. HpRyukyu has J-Western cagA whose amino acid motifs are similar to the Western type but discriminated by other features (Matsunari et al., 2012). The “J” of J-Western came from Japan because this genotype was first observed only in Okinawa, Japan. However, the Afghanistan and Nepal strains also have J-Western cagA, as well as some European and African strains. Our previous study on Nepalese strains reported that there were three bacterial populations: one was typical Asia2 strains, the second was recombinants between hpAsia2 and hpEurope strains, and the third was recombinants between hpAsia2 and Nepal-specific strains (Miftahussurur et al., 2015). The Nepal-specific strains may have relation with hpRyukyu. In the third population, strains containing high ratio of the Nepal-specific component was rare, which coincident with the fast that we investigated a lot of Nepalese strains but only a few of them were close to hpRyukyu.

Scarce exogenous influence on Okinawa strains

H. pylori imports exogenous DNA frequently, and therefore its genome contains many recombined regions (Yahara et al., 2012, 2013). We estimated the intensity of DNA exchange from one strain to another using ChromoPainterv2 (Falush et al., 2003a) and fineSTRUCTURE (Falush et al., 2003a), in which the genome of one H. pylori cell is regarded as a series of chunks imported from other H. pylori cells through recombination. The result suggests that hpRyukyu and hspOkinawa had very little influence from other populations, even between the two Okinawa populations (Figure S7). This observation suggests that hpRyukyu and hspOkinawa had different origins and co-existed in Okinawa without hybridization.

Estimation of the divergence dates

Based on the populations defined by fineSTRUCTURE, we selected representative strains to estimate divergence dates. We detected recombination signals by using the 4-gamete test and eliminated recombinant regions. Then, we constructed a Bayesian tree using BEAST2 (Bouckaert et al., 2014) and estimated the divergence dates. The divergence date of hspOkinawa was estimated around 20 thousand years ago (kya), and that of the cluster including hpRyukyu and Afghanistan-Nepal strains was estimated around 45 kya (Figure 2).
Figure 2

Bayesian tree of representative 24 strains

The colored circles on the map indicate geographic regions of Japan and correspond to the colored circles on the H. pylori strains. Bacterial populations determined by STRUCTURE are shown by red letters. Calibration point C-1 was set at the divergence of non-African strains as lower 50 kya and upper 100 kya according to archaeological information (Bae et al., 2017; Sikora et al., 2017).

Bayesian tree of representative 24 strains The colored circles on the map indicate geographic regions of Japan and correspond to the colored circles on the H. pylori strains. Bacterial populations determined by STRUCTURE are shown by red letters. Calibration point C-1 was set at the divergence of non-African strains as lower 50 kya and upper 100 kya according to archaeological information (Bae et al., 2017; Sikora et al., 2017). We chose oki422 as the representative strain of hpRyukyu. In Figure 2, oki422 branched out after the Afghanistan strain, although Figure 1 shows that the four hpRyukyu strains formed a sub-branch outside the Afghanistan-Nepal cluster. In either case, hpRyukyu diverged earlier than Punjab and Nepal. This suggests that hpRyukyu did not come through the extension of the Punjab-Nepal line.

Gene flow from hpRyukyu to others

The area from Afghanistan to Nepal is far away from Japan and the path from the area to Okinawa is unknown. We considered a north path that runs the north of Himalayas and a south path that runs southern coastal line. To evaluate which path is more probable, we analyzed the gene flow from hpRyukyu to others using f-statistics (fhom) (Martin et al., 2015) (Table S1). The f-statistic is a similar index to the ABBA-BABA-test (d-statistics). These statistics compare four taxa that represent three populations and one outgroup, for example, ((H1, H2). H3), H4) as H4 is the outgroup. The basic idea is that alleles shared only between H1 and H4 should be as equally frequent as that of between H2 and H4, if the allele sharing arose from incomplete lineage sorting without gene flow. Then, a skew of the sharing frequency between H1-H4 and H2-H4 is indicative of gene flow. Though the formulation of f-statistics and d-statistics differs slightly, the basic idea is the same. We used SouthAfrica20 as the base (H4). H3 to H1 were set by branching order in Figure 2. H3 is hpRyukyu and Afghanistan-Nepal strains. SNT49 and PNG84A in H2 are strains of hpAsia2 and hpSahul, respectively. If the gene flow from H3 to H2 is stronger than that from H3 to H1, the value is positive, and vice versa. EA-Am in H1 represents 13 strains in East Asia and the Americas (strains from 51 to shi470 in Figure 2), and the f-statistics is the average of the values calculated on each EA-Am strain. The f-statistic values were positive when SNT49 (hpAsia2 strain from India) was set to H2. This means the gene flow from the H3 strains (hpRyukyu and Afghanistan-Nepal strains) to the Indian strain is stronger than the gene flow to H1 strains (PNG84A, hpSahul strain from Papua New Guinea, and East Asian strains). This implies that the gene flow was [H3] -> [India] -> [Oceania, East Asia]. In contrast, f-statistics values were all negative when the hpSahul strain was set to H2. This result suggests that gene flow from hpRyukyu to EA-Am was stronger than that from hpRyukyu to hpSahul, i.e., the gene flow skipped hpSahul. HpSahul is considered to have expanded via coastal line to reach Oceania (Moodley et al., 2009). If the ancestral hosts of hpRyukyu moved toward the south, there should be gene flow between hpRyukyu and hpSahul before hpRyukyu reached East Asia. The weaker gene flow from hpRyukyu to hpSahul than that from hpRyukyu to East Asia implies that the ancestral hosts of hpRyukyu might not take a southern course. Rather, hpRyukyu might go to the north circumventing the Himalayas.

Discussion

Paleolithic migration to the east end

Integrating the results obtained, we assumed H. pylori migration to Japan (Figure 3). According to the results of f-statistics, hpRyukyu may have reached Okinawa via an inland path rather than the coastal route that hpSahul might have taken (Figure 3A). One candidate pathway is the Wakhan Corridor, which connects northeast Afghanistan and southwest China.
Figure 3

Putative migration path to Japan

Putative migration path of hpRyukyu (A) and hspOkinawa (B). The line from Afghanistan to Nepal was colored green and gray since those strains belong to hpEurope or hpAsia2. Because hpSahul strains are not observed anywhere in the Eurasian continent, its putative path is indicated by a dotted light blue line. During the last ice age, Hokkaido was connected to the Eurasian continent via Sakhalin (A). The Okinawa islands were isolated all the time (B). HspEAsia and hspMaori spread with rice agriculture and advanced navigation systems, respectively (C).

Putative migration path to Japan Putative migration path of hpRyukyu (A) and hspOkinawa (B). The line from Afghanistan to Nepal was colored green and gray since those strains belong to hpEurope or hpAsia2. Because hpSahul strains are not observed anywhere in the Eurasian continent, its putative path is indicated by a dotted light blue line. During the last ice age, Hokkaido was connected to the Eurasian continent via Sakhalin (A). The Okinawa islands were isolated all the time (B). HspEAsia and hspMaori spread with rice agriculture and advanced navigation systems, respectively (C). The oldest Homo sapiens fossil in Okinawa, the Yamashita Cave Man, was estimated to be around 32,000 B.P. through radiocarbon dating (Kobayashi et al., 1971). The complete skeletons of 27,000 B.P. (Shiraho-Saonetabaru) (Shinoda and Adachi, 2017) and 2,2000 B.P. (Minatogawa) (Baba and Narasaki, 1991) were also discovered from Okinawa. This suggests that some Paleolithic human populations resided in Okinawa. The divergence dates of hpRyukyu and hspOkinawa correspond with these fossil records. However, human genome analyses indicate that the modern Okinawa population was more closely associated with the Japan main island than the Ainu people (Jinam et al., 2015; Kanzawa-Kiriyama et al., 2017). One of the possible reasons for the discrepancy between human and H. pylori analyses is that ancient traces in the autosomal DNA are attenuated by admixture with new migrants. Actually, mitochondria DNA haplogroup M7a, which was found in 3 of 5 Paleolithic Shiraho-Saonetabal individuals (Shinoda and Adachi, 2017), is still observed in about 20% of the current Okinawa population. Another possibility is that hpRyukyu in Okinawa was transmitted from the Paleolithic migrants to the later migrants without human admixture. Then, the old migrants went extinct and only the Paleolithic H. pylori remained in the new migrants. It is controversial whether the Paleolithic migrants contributed to the modern Japanese population, although at least the Paleolithic H. pylori survived up to now.

H. pylori of hunter-gatherers and farmers

The Japanese population is thought to contain at least two layers of migrations: early hunter-gatherers (Jomon people) and later farmers (Yayoi people) (Hanihara, 1991). As the farmer population expanded in the main island of Japan, a trace of the Jomon people remained in the north (Hokkaido) and south (Okinawa) of the Japanese Archipelago. The divergence date of hspOkinawa and Ainu strains corresponded to the Jomon era. Both hspOkinawa and Ainu strains lack cagA (designated as cagA (−)). CagA (−) strains are often observed in Africa and Europe but are rare in Asia. As the farmer population expanded on the main island of Japan, the trace of Jomon people remained in the north (Hokkaido) and south (Okinawa) of the Japanese Archipelago. However, hspOkinawa and Ainu strains belong to different sub-branches (Figures 1 and 2). In addition, hspOkinawa has similarity with the southern strains (hspMaori) in the STRUCTURE analysis (Figures S1 and S2). We assume that the hosts of hspOkinawa were in contact with people from Southeast Asia who might carry ancestral strains of hspMaori. From these observations, we assume that ancestral hspIndigenousAmericas from the north route and ancestral hspMaori from a southern route reached the east end of the Eurasian continent and hybridized to form hspOkinawa, so hspOkinawa has both characteristics (Figure 3B).

Current H. pylori in Japan

Most of the current patients with H. pylori in the main island of Japan are infected with hspEAsia strains. HspEAsia distributes broadly in the countries such as China, Korea, Vietnam, Thailand, and Bhutan. HspEAsia evolved in the Eurasian continent and migrated to Japan with rice farmers, while the hosts of hspMaori developed an advanced navigation system and expanded via the ocean from Taiwan to New Zealand (Figure 3C). If hpRyukyu or hspOkinawa strains existed in East Eurasia or in the main island of Japan among hunter-gatherers, it might be taken over by hpEAsia strains carried by farmers, like hspIndigenousAmericas was taken over by hpEurope in the Americas and is observed only among aboriginal people in remote areas. HpRyukyu and hspOkinawa strains might have remained in Okinawa because there are many isolated islands. The scarce admixture between hpRyukyu and hspOkinawa suggests that they co-existed independently for a long time. This independence may reflect geographic isolation because the Okinawa area consists of many islands.

Probability of influence from recent human migration

When we found Okinawa-specific strains (Matsunari et al., 2012), we considered influence from recent human migration, especially from people in the U.S. army bases located in Okinawa. However, we think the probability is low that hpRyukyu and hspOkinawa derive from recent migrants as described below. There were hpEurope strains that might be transmitted from Western people but its ratio was less than 5% of the strains obtained from Okinawa. On the contrary, the hpRyukyu and hspOkinawa strains accounted for 16% and 14%, respectively. H. pylori does not transmit easily between adults but mostly transmit from parents to children by close contacts while the immune system is not fully developed, therefore lineage expansion takes generations. Considering the substantial inflow of foreign population to Okinawa started only after 1945, it is unlikely that hpRyukyu and hspOkinawa strains derive from recent foreign sources. Furthermore, strains that are close to hpRyukyu were found only from Afghanistan to Nepal, although we investigated a wide variety of Asian strains and public genome data of Western and African strains. Because current human flow from Afghanistan or Nepal to Okinawa is scarce, the similarity between Okinawa strains and Afghanistan or Nepal strains is likely due to common ancestry of the host human populations.

The utility of the H. pylori genome data

Currently, evolution of modern humans is investigated using genome-wide human SNP data (Japanese Archipelago Human Population Genetics Consortium et al., 2012; Jinam et al., 2015) or whole human genome sequence data (Pagani, 2017; Pagani et al., 2016). Genome sequences of H. pylori are much smaller than those human genome data, and one may think that these small data are no longer useful for elucidating the history of human evolution. However, genome analysis of H. pylori answered three questions about human evolution in the Japanese Archipelago: (1) Are there traces of Paleolithic migrants who reached Okinawa around 30,000 years ago observable in the modern Okinawa population? (2) Are there common features between Okinawa and Ainu people originating in the Jomon era? (3) Did the ancestors of modern Ainu people receive some influence from people in North Asia? We demonstrated the effectiveness of H. pylori genome data for the study of modern human evolution in East Eurasia.

Limitations of the study

The Okinawan H. pylori in this study were all isolated from patients of the University of the Ryukyus Hospital located in the south of Okinawa main island. Okinawa area covers many islands but there was no information about the birth places of the patients. We therefore cannot clarify the geographic distribution of hpRyukyu and hspOkinawa. The host human DNA was not available because of the lack of the informed consent, therefore we cannot analyze the genetics of the hosts. These are limitations of our study and we hope to clarify these points in a future study.

STAR★Methods

Key resources table

Resource availability

Lead contact

Further information and requests for resources and reagents should be directed to and will be fulfilled by the lead contact, Yoshio Yamaoka (yyamaoka@oita-u.ac.jp).

Materials availability

This study did not generate new unique reagents.

Experimental model and subject details

Biopsy specimens of gastric mucosa were collected from H. pylori-infected patients and the bacterial cells were isolated from the biopsy specimens using standard culture methods. From each sample, a single colony was expanded on confluent plates and cultured under microaerobic conditions (12% CO2) at 37°C. Bacterial DNA was extracted from the plates using a commercially kit (QIAGEN Inc., Valencia, CA, USA).

Method details

Population structure analysis of the MLST data

Conventional MLST analysis uses 3,406 bp sequences concatenated from seven housekeeping genes. We integrated our original MLST sequences (51, 64, 21, 20, 51, 72, and 119 strains from Bangladesh, Bhutan, Laos, Myanmar, Nepal, Okinawa Japan, and Thailand, respectively) with 691 sequences from pubMLST (https://pubmlst.org/). We analyzed population structure using the model-based clustering software STRUCTURE (Falush et al., 2003a) (v2.3.4) on the dataset of total strains by changing the population number (K) from 2 to 10 and executing 10 runs for each K. We tried both no-admixture and linkage models. The results of the 10 runs for each K were integrated by CLUMPAK server (http://clumpak.tau.ac.il/distruct.html) (Kopelman et al., 2015). The DAPC (Jombart et al., 2010) package of statistics software R (v3.4.3) was applied on the same dataset.

Genome sequencing

Biopsy specimens of gastric mucosa were collected from H. pylori-infected patients in Okinawa, Japan (oki102, oki112, oki128, oki154, oki422, oki673, oki828, oki898), Hokkaido, Japan (HK711, Hk721, Hk840), Alaska, U.S.A. (AL02, AL03, AL04, AL05), Colombia (CO1681, CO1766, CO1768), Thailand (UBN18), and Bhutan (BT302), as previously described (Abe et al., 2011; Matsunari et al., 2012; Moodley et al., 2009). H. pylori were isolated from the biopsy specimens using standard culture methods, as previously described (Yamaoka et al., 1998). Genomic DNA was extracted from confluent plate cultures expanded from a single colony using a commercially available kit (QIAGEN Inc., Valencia, CA, USA). DNA samples of the 20 strains were sequenced by PacBio. Genome assembly was performed by the HGAP Pipeline to obtain the complete genome data.

Phylogenetic tree and population structure analysis of the genome data

For genomic analyses, we used the complete genome sequences of our original 20 strains and 64 strains downloaded from GenBank. Additionally, we explored Genome Neighbour report of NCBI and downloaded contig sequences of four strains (HP14065, HP15060, UM139, and UM211) that were close to hpRyukyu. We used the prokaryotic genome annotation tool Prokka (Seemann, 2014) and the pan genome pipeline Roary (Page et al., 2015) to identify orthologous gene groups of the above 88 strains. If a multiple copy of an ortholog exists in one strain (paralogs), we did not use the gene for analysis because paralogs evolves differently from a single copy of a gene and are not suitable for phylogenetic analysis. As for gene loss, we allowed lack of genes at most 5% of the strains and used pairwise deletion option when we constructed a NJ-tree (Kimura 2-parameter) by MEGA (v.10.1.8) (Tamura et al., 2013). We obtained 619 sets of orthologous genes and aligned the DNA sequences of each gene. Then we concatenated the aligned sequences into the a sequence of totally 495,102 bp. This concatenated sequence was also used for population structure analysis by Chromopainter (v.2) (Lawson et al., 2012) and fineSTRUCTURE (v.0.1.0) (Lawson et al., 2012).

Estimation of the divergence dates

The result of fineSTRUCTURE classified the 88 strains into 25 populations. We chose one strain from each population that has the least influence from other populations. Because hpEurope strains were known to be a hybrid between hpNEAfrica and hpAsia2, we excluded seven European strains. To estimate the divergence time of hpRyukyu, we included Afghanistan-Nepal strains (HP14065, HP15060, UM139) but excluded UM211, whose origin was unclear. We used the 619 orthologous gene sets of the selected 24 strains for further analysis. We used ClonalFrameML (v.1.12) (Didelot and Wilson, 2015) to eliminated recombinant regions and applied jModeltest (v.2.1.10) (Posada, 2008) to select the substitution model (GTR). The clock model (Relaxed clock lognormal) was selected by a preliminary run of BEAST2 (v2.5.0) (Bouckaert et al., 2014) executing 10,000,000 MCMC. After this process, the divergence date was estimated executing 100,000,000 MCMC. Calibration points were set between African and non-African using Log normal model, M = 2, S = 0.1, offset = 0 in BEAUti (Lower 50 kya, upper 100 kya) referring to archaeological information (Bae et al., 2017; Sikora, 2017).

Gene flow analysis

We used an improved ABBA-BABA test, fhom, to evaluate gene flow. We set SouthAfrica20 as H4 (outlier), hpRyukyu and Afghanistan-Nepal strains as H3, Indian strain (SNT49) or Papua New Guinea strain (PNG84A) as H2, and PNG84A (SNT49 as H2) or other East Asian strains as H1. We calculated fhom and took an average when the East Asian strains were set as H1. A positive value of fhom suggests that gene flow H3 to H2 is stronger than H3 to H1, and a negative value suggests the opposite.

Data access

Genome data sequenced by us are available from GenBank under the accession numbers CP006820 - CP006827, CP058279 - CP058288, and CP058250 - CP058252.
REAGENT or RESOURCESOURCEIDENTIFIER
Deposited data

Genome sequences of H. pylori (Genbank accession number)This paperCP006820 - CP006827, CP058279 - CP058288, and CP058250 - CP058252. See Table S3 for details.

Software and algorithms

MEGA (v.10.1.8)Tamura et al. (2013)https://www.megasoftware.net/
ClonalFrameML (v.1.12)Didelot and Wilson (2015)https://github.com/xavierdidelot/ClonalFrameML
jModeltest (v.2.1.10)Posada (2008)http://evomics.org/learning/phylogenetics/jmodeltest/
BEAST2 (v2.5.0)Bouckaert et al. (2014)https://www.beast2.org/
Chromopainter (v.2)Lawson et al. (2012)https://people.maths.bris.ac.uk/%7Emadjl/finestructure-old/chromopainter_info.html
fineSTRUCTURE (v0.1.0)Lawson et al. (2012)http://www.paintmychromosomes.com/
  33 in total

1.  Traces of human migrations in Helicobacter pylori populations.

Authors:  Daniel Falush; Thierry Wirth; Bodo Linz; Jonathan K Pritchard; Matthew Stephens; Mark Kidd; Martin J Blaser; David Y Graham; Sylvie Vacher; Guillermo I Perez-Perez; Yoshio Yamaoka; Francis Mégraud; Kristina Otto; Ulrike Reichard; Elena Katzowitsch; Xiaoyan Wang; Mark Achtman; Sebastian Suerbaum
Journal:  Science       Date:  2003-03-07       Impact factor: 47.728

2.  jModelTest: phylogenetic model averaging.

Authors:  David Posada
Journal:  Mol Biol Evol       Date:  2008-04-08       Impact factor: 16.240

3.  A Cover Story for a Nature cover: genetic signature of human expansions into Eurasia revealed by a panel of worldwide high coverage genomes.

Authors:  Luca Pagani
Journal:  J Anthropol Sci       Date:  2017-08-07

4.  Impact of Helicobacter pylori CagA diversity on gastric mucosal damage: an immunohistochemical study of East-Asian-type CagA.

Authors:  Takashi Abe; Masaaki Kodama; Kazunari Murakami; Osamu Matsunari; Kazuhiro Mizukami; Kunimitsu Inoue; Masahiro Uchida; Tadayoshi Okimoto; Toshio Fujioka; Tomohisa Uchida; Masatsugu Moriyama; Yoshio Yamaoka
Journal:  J Gastroenterol Hepatol       Date:  2011-04       Impact factor: 4.029

5.  Association between Helicobacter pylori virulence factors and gastroduodenal diseases in Okinawa, Japan.

Authors:  Osamu Matsunari; Seiji Shiota; Rumiko Suzuki; Masahide Watada; Nagisa Kinjo; Kazunari Murakami; Toshio Fujioka; Fukunori Kinjo; Yoshio Yamaoka
Journal:  J Clin Microbiol       Date:  2011-12-21       Impact factor: 5.948

6.  Relationship of vacA genotypes of Helicobacter pylori to cagA status, cytotoxin production, and clinical outcome.

Authors:  Y Yamaoka; T Kodama; M Kita; J Imanishi; K Kashima; D Y Graham
Journal:  Helicobacter       Date:  1998-12       Impact factor: 5.753

7.  Inference of population structure using dense haplotype data.

Authors:  Daniel John Lawson; Garrett Hellenthal; Simon Myers; Daniel Falush
Journal:  PLoS Genet       Date:  2012-01-26       Impact factor: 5.917

8.  ClonalFrameML: efficient inference of recombination in whole bacterial genomes.

Authors:  Xavier Didelot; Daniel J Wilson
Journal:  PLoS Comput Biol       Date:  2015-02-12       Impact factor: 4.475

9.  Roary: rapid large-scale prokaryote pan genome analysis.

Authors:  Andrew J Page; Carla A Cummins; Martin Hunt; Vanessa K Wong; Sandra Reuter; Matthew T G Holden; Maria Fookes; Daniel Falush; Jacqueline A Keane; Julian Parkhill
Journal:  Bioinformatics       Date:  2015-07-20       Impact factor: 6.937

10.  The peopling of the Pacific from a bacterial perspective.

Authors:  Yoshan Moodley; Bodo Linz; Yoshio Yamaoka; Helen M Windsor; Sebastien Breurec; Jeng-Yih Wu; Ayas Maady; Steffie Bernhöft; Jean-Michel Thiberge; Suparat Phuanukoonnon; Gangolf Jobb; Peter Siba; David Y Graham; Barry J Marshall; Mark Achtman
Journal:  Science       Date:  2009-01-23       Impact factor: 63.714

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.