| Literature DB >> 25799515 |
Yuanhai You1, Lihua He1, Maojun Zhang1, Jianzhong Zhang1.
Abstract
Helicobacter pylori is a common pathogen correlated with several severe digestive diseases. It has been reported that isolates associated with different geographic areas, different diseases and different individuals might have variable genomic features. Here, we describe draft genomic sequences of H. pylori strains YN4-84 and YN1-91 isolated from patients with gastritis from the Naxi and Han populations of Yunnan, China, respectively. The draft sequences were compared to 45 other publically available genomes, and a total of 1059 core genes were identified. Genes involved in restriction modification systems, type four secretion system three (TFS3) and type four secretion system four (TFS4), were identified as highly divergent. Both YN4-84 and YN1-91 harbor intact cag pathogenicity island (cagPAI) and have EPIYA-A/B/D type at the carboxyl terminal of cagA. The vacA gene type is s1m2i1. Another major finding was a 32.5-kb prophage integrated in the YN4-84 genome. The prophage shares most of its genes (30/33) with Helicobacter pylori prophage KHP30. Moreover, a 1,886 bp transposable sequence (IS605) was found in the prophage. Our results imply that the Naxi ethnic minority isolate YN4-84 and Han isolate YN1-91 belong to the hspEAsia subgroup and have diverse genome structure. The genome has been extensively modified in several regions involved in horizontal DNA transfer. The important roles played by phages in the ecology and microevolution of H. pylori were further emphasized. The current data will provide valuable information regarding the H. pylori genome based on historic human migrations and population structure.Entities:
Mesh:
Year: 2015 PMID: 25799515 PMCID: PMC4370579 DOI: 10.1371/journal.pone.0120659
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1A. Global distribution of sequenced H. pylori strains from different populations B. phylogenetic comparison based on seven house-keeping genes (Left) and core genomes (Right).
Genomic features and backgrounds of the 47 H. pylori strains used in this study
| strain | ACCESSION | length | origin | Clinical diagnosis |
|---|---|---|---|---|
| 26695 | AE000511 | 1667892 | UK | Gastritis |
| J99 | AE001439 | 1643831 | USA | Duodenal ulcer |
| HPAG1 | CP000241 | 1596366 | Sweden | Atrophic gastritis |
| P12 | CP001217 | 1673813 | German | Duodenal ulcer |
| G27 | CP001173 | 1652982 | Italy | NA |
| Shi470 | CP001072 | 1608548 | Peru | Gastritis |
| Shi169 | NC_017740 | 1616909 | Peru | NA |
| Shi417 | NC_017739 | 1665719 | Peru | NA |
| Shi112 | NC_017741 | 1663456 | Peru | NA |
| 35A | CP002096 | 1566655 | Japan | NA |
| 51 | CP000012 | 1589954 | Korea | Duodenal ulcer |
| 52 | CP001680 | 1568826 | Korea | NA |
| 908 | CP002184 | 1549666 | France | Duodenal ulcer |
| B38 | FM991728 | 1576758 | France | MALT lymphoma |
| B8 | FN598874 | 1673997 | unknown | Gastric ulcer |
| Cuz20 | CP002076 | 1635449 | Peru | NA |
| Gambia94/24 | CP002332 | 1709911 | The Gambia | NA |
| India7 | CP002331 | 1675918 | India | Peptic ulcer |
| Lithuania75 | CP002334 | 1624644 | Lithuania | NA |
| PeCan4 | CP002074 | 1629557 | Peru | Peruvian gastric cancer |
| PeCan18 | NC_017742 | 1660685 | Peru | Peruvian gastric cancer |
| SJM180 | CP002073 | 1658051 | Peru | Gastritis |
| Sat464 | CP002071 | 1560342 | Peru | NA |
| SouthAfrica7 | CP002336 | 1501960 | South Africa | NA |
| v225d | CP001582 | 1588278 | Venezuela | Gastritis |
| 2017 | CP002571 | 1548238 | France | Duodenal ulcer |
| 2018 | CP002572 | 1562832 | France | Duodenal ulcer |
| F16 | AP011940 | 1575399 | Japan | Gastritis |
| F30 | AP011941 | 1570564 | Japan | Duodenal ulcer |
| F32 | AP011943 | 1578824 | Japan | Gastric cancer |
| F57 | AP011945 | 1609006 | Japan | Gastric cancer |
| 83 | CP002605 | 1617426 | unknown | NA |
| XZ274 | NC_017926 | 1634138 | China | Gastric cancer |
| Rif1 | CP003905 | 1667883 | German | NA |
| Rif2 | CP003906 | 1667890 | German | NA |
| Puno120 | NC_017378 | 1624979 | Peru | Gastritis |
| Puno135 | NC_017379 | 1646139 | Peru | Gastritis |
| HUP-B14 | NC_017733 | 1599280 | Spain | NA |
| ELS37 | NC_017063 | 1664587 | El Salvador | Gastric cancer |
| SNT49 | NC_017376 | 1607577 | India | Asymptomatic |
| Aklavik86 | CP003476 | 1494183 | Canada | NA |
| Aklavik117 | CP003483 | 1614447 | Canada | NA |
| HLJ193 | ALJI00000000 | 1552322 | China | Atrophic gastritis |
| HLJ256 | ALKA00000000 | 1576324 | China | Atrophic gastritis |
| HLJ271 | ALKB00000000 | 1588141 | China | Gastric ulcer |
| YN1–91 | JPXC01000000 | 1609835 | China | Gastritis |
| YN4–84 | JPXD00000000 | 1633405 | China | Gastritis |
Fig 2a. Subsystem distribution statistics of Helicobacter pylori strain YN4–84 generated by the rapid annotation using a subsystem technology server. b. Subsystem distribution statistics of Helicobacter pylori strain YN1–91 generated by the rapid annotation using a subsystem technology server.
Fig 3Global overview of genomic diversity of YN4–84 and YN1–91 with other 46 H. pylori genome sequences.
The four regions with most divergence were labeled with brown arcs along the chromosome.
Fig 4Global alignment of YN4–84 and YN1–91.
The black arrows above YN4–84 show the insertion of YN4–84P and its flanking gene fragments.
Gene contents of the predicted phage YN4–84P.
| code | CDS_POSITION | BLAST_HIT | E-VALUE |
|---|---|---|---|
| 1 | 901213..901485 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00960; phage(gi431810541) | 1.00E-25 |
| 2 | 901416..901427 | attL TCAAAAAACCAC | 0 |
| 3 | 901478..902599 | PHAGE_Helico_KHP30_NC_019928: putative phage integrase; PP_00961; phage(gi431810542) | 7.00E-148 |
| 4 | 902626..902811 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00962; phage(gi431810543) | 3.00E-22 |
| 5 | 902813..903121 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00963; phage(gi431810544) | 1.00E-46 |
| 6 | 903118..904113 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00964; phage(gi431810545) | 2.00E-167 |
| 7 | 904311..904577 | PHAGE_Helico_KHP30_NC_019928: DNA helicase; PP_00965; phage(gi431810547) | 2.00E-38 |
| 8 | 904588..905757 | PHAGE_Helico_KHP30_NC_019928: putative DNA helicase, putative DNA repair protein;PP_00966; phage(gi431810548) | |
| 9 | 905768..907327 | PHAGE_Helico_KHP30_NC_019928: putative primase; PP_00967; phage(gi431810549) | 0 |
| 10 | 907324..908997 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00968; phage(gi431810550) | 0 |
| 11 | 909058..914190 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00969; phage(gi431810551) | 0 |
| 12 | 914563..915150 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00970; phage(gi431810552) | 1.00E-97 |
| 13 | 915385..915945 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00971; phage(gi431810553) | 5.00E-98 |
| 14 | 915956..917101 | PHAGE_Helico_KHP30_NC_019928: structural protein; PP_00972; phage(gi431810554) | 0 |
| 15 | 917115..917477 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00973; phage(gi431810555) | 9.00E-43 |
| 16 | 917528..917959 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00974; phage(gi431810556) | 6.00E-70 |
| 17 | 918030..919418 | PHAGE_Helico_KHP30_NC_019928: putative portal protein; PP_00975; phage(gi431810557) | 0 |
| 18 | complement(919375..920658) | PHAGE_Clostr_c_st_NC_007581: putative IS transposase (OrfB); PP_00976; phage(gi80159731) | 9.00E-57 |
| 19 | 920727..921155 | PHAGE_Clostr_c_st_NC_007581: putative transposase; PP_00977; phage(gi80159828) | 3.00E-18 |
| 20 | 921347..921733 | PHAGE_Helico_KHP30_NC_019928: putative portal protein; PP_00978; phage(gi431810557) | 3.00E-60 |
| 21 | 921694..921840 | hypothetical; PP_00979 | 0 |
| 22 | 921833..922228 | PHAGE_Helico_KHP30_NC_019928: putative terminase; PP_00980; phage(gi431810558) | 2.00E-67 |
| 23 | 922234..923388 | PHAGE_Helico_KHP30_NC_019928: putative terminase; PP_00981; phage(gi431810558) | 0 |
| 24 | 923443..923652 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00982; phage(gi431810559) | 4.00E-27 |
| 25 | 923664..923879 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00983; phage(gi431810560) | 4.00E-08 |
| 26 | 923879..924205 | PHAGE_Helico_KHP30_NC_019928: putative holin; PP_00984; phage(gi431810561) | 7.00E-42 |
| 27 | 924272..924592 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00985; phage(gi431810563) | 4.00E-33 |
| 28 | 924640..925185 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00986; phage(gi431810564) | 2.00E-97 |
| 29 | 925187..925984 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00987; phage(gi431810565) | 4.00E-134 |
| 30 | 925984..926556 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00988; phage(gi431810566) | 3.00E-88 |
| 31 | 926595..926882 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00989; phage(gi431810567) | 6.00E-39 |
| 32 | 926935..927267 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00990; phage(gi431810568) | 2.00E-12 |
| 33 | 927288..928121 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00991; phage(gi431810569) | 7.00E-125 |
| 34 | 928121..928903 | PHAGE_Helico_KHP30_NC_019928: hypothetical protein; PP_00992; phage(gi431810570) | 4.00E-133 |
| 35 | 933718..933729 | attR TCAAAAAACCAC | 0 |
Fig 5Gene content of YN4–84P.
Fig 6Neighbor-joining tree based on the sequence diversity of cagA (Left) and cagPAI (Right).
YN4–84 is highlighted in red.
Fig 7EPIYA motif analysis of the CagA C-terminal region.