| Literature DB >> 27703135 |
Vanessa K Wong1,2, Stephen Baker3,4,5, Thomas R Connor6, Derek Pickard1, Andrew J Page1, Jayshree Dave7, Niamh Murphy7, Richard Holliman7, Armine Sefton8, Michael Millar8, Zoe A Dyson9,10, Gordon Dougan1, Kathryn E Holt9,10.
Abstract
The population of Salmonella enterica serovar Typhi (S. Typhi), the causative agent of typhoid fever, exhibits limited DNA sequence variation, which complicates efforts to rationally discriminate individual isolates. Here we utilize data from whole-genome sequences (WGS) of nearly 2,000 isolates sourced from over 60 countries to generate a robust genotyping scheme that is phylogenetically informative and compatible with a range of assays. These data show that, with the exception of the rapidly disseminating H58 subclade (now designated genotype 4.3.1), the global S. Typhi population is highly structured and includes dozens of subclades that display geographical restriction. The genotyping approach presented here can be used to interrogate local S. Typhi populations and help identify recent introductions of S. Typhi into new or previously endemic locations, providing information on their likely geographical source. This approach can be used to classify clinical isolates and provides a universal framework for further experimental investigations.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27703135 PMCID: PMC5059462 DOI: 10.1038/ncomms12827
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Summary of 1,831 S. Typhi.
| Southeast Asia | 1976–2012 | 719 | |
| Cambodia | 2007–2012 | 210 | |
| Indonesia | 1976–2012 | 129 | |
| Laos | 2000–2010 | 138 | |
| Vietnam | 1972–2011 | 221 | |
| Malaysia | 2005–2011 | 6 | |
| Other (Philippines, Thailand, East Timor, Myanmar) | 2002–2012 | 15 | |
| South Asia | 1977–2012 | 321 | |
| Bangladesh | 1998–2012 | 51 | |
| India | 1977–2012 | 174 | |
| Nepal | 1999–2012 | 47 | |
| Pakistan | 2003–2012 | 45 | |
| Other (Sri Lanka, Afghanistan) | 2001–2012 | 4 | |
| Western Asia | 1997–2011 | 25 | |
| Iraq | 2006–2011 | 11 | |
| Lebanon | 2001–2011 | 7 | |
| Other (Armenia, Palestine, Turkey, Western Asia) | 1997–2011 | 7 | |
| Eastern Asia | 2002–2011 | 3 | |
| Other (China) | 2002–2011 | 3 | |
| North Africa | 1961–2009 | 24 | |
| Algeria | 1999–2009 | 7 | |
| Morocco | 1999–2000 | 9 | |
| Other (Sudan, Egypt, Tunisia) | 1961–2008 | 8 | |
| East Africa | 1980–2010 | 115 | |
| Kenya | 1998–2009 | 56 | |
| Tanzania | 2006–2010 | 52 | |
| Other (Comoros, Madagascar) | 1980–2002 | 7 | |
| Central Africa | 1958–2011 | 49 | |
| Cameroon | 1958–2009 | 27 | |
| DRC | 1976–2011 | 17 | |
| Other (Angola, Central African Republic) | 2001–2009 | 5 | |
| West Africa | 1998–2009 | 30 | |
| Other (Burkina Faso, Cape Verde, Benin, Guinea, Ivory Coast, Gabon, Liberia, Mali, Niger, Nigeria, Mauritania, Senegal, Togo) | 1998–2009 | 30 | |
| Southern Africa | 2004–2013 | 153 | |
| Malawi | 2004–2013 | 112 | |
| South Africa | 2004–2012 | 41 | |
| Africa | 2009–2012 | 3 | |
| Unknown | 2009–2012 | 3 | |
| Eastern Europe | 1916–1996 | 2 | |
| Other (Russia) | 1916–1996 | 2 | |
| Western Europe | 2009 | 4 | |
| Other (France (suspected African origin of infection)) | 2009 | 4 | |
| Southern Europe | 2009 | 1 | |
| Other (Malta) | 2009 | 1 | |
| Australia | 2010–2012 | 3 | |
| Australia | 2010–2012 | 3 | |
| Oceania | 1980–2012 | 339 | |
| Fiji | 1981–2012 | 170 | |
| Samoa | 1992–2012 | 117 | |
| Papua New Guinea | 1980–2012 | 47 | |
| Other (Tonga, Vanuatu) | 1980–2003 | 5 | |
| North America | 1958–2011 | 5 (0.3%) | |
| Other (USA, Mexico) | 1958–2011 | 5 | |
| Central America | 2012 | 1 (0.05%) | |
| Other (El Salvador) | 2012 | 1 | |
| South America | 1905–2012 | 17 (0.9%) | |
| Argentina | 1905–2006 | 10 | |
| Other (French Guiana, Peru, South America ) | 2002–2012 | 6 | |
| Unknown origin | 1939–2012 | 19 (1.0%) |
S. Typhi, Salmonella enterica serovar Typhi.
Typhi isolates from the global collection, which were used to define genotypes. Countries with fewer than five isolates were grouped into the category ‘Other'; n indicates the number of such countries in each region.
Figure 1Population structure of S. Typhi based on genome-wide SNPs.
(a) Whole-genome tree of 1,831 global S. Typhi isolates. Primary clusters 1–4 are indicated in the outer coloured ring; branches defining these groups are coloured in the tree. These groups are further divided into clades, which are shaded and labelled. The location of S. Typhi reference genomes CT18 (accession number AL513382) and Ty2 (accession number AE014613) are indicated on the tree. Subclade 4.3.1 (H58, marked in red), which comprises half of the global collection, is represented by just 50 (6%) randomly selected isolates out of the total 852 belonging to this subclade, so that the relationships between other clades can be visualized. (b) Tree backbone showing further division of 16 S. Typhi clades (shaded) into 49 subclades (labelled; note 12 undifferentiated clade groups shown in brackets). Branches are coloured by primary cluster. (c) Map of the world showing subclade diversity of S. Typhi isolates in the global collection, by region22. Where groups of isolates from the same country and year belonged to the same subclade, this was classified as an ‘outbreak' and the group is only represented once in the pie graphs. Pies are sized to indicate number of isolates; slices are coloured by clade; multiple slices of the same colour indicate multiple subclades belonging to the same clade.
Figure 2Geographical clustering of S. Typhi subclades.
Heatmap shows, for each subclade, the percentage of unique isolates originating from each of the geographical regions. Where groups of isolates from the same country and year belonged to the same subclade, this was classified as an ‘outbreak' and the group is only represented once. The same data are represented as a scaled bar graph to the right. The full list of isolates by country and subclade is provided in Supplementary Data 1.
Summary of 99 East London travel-associated S. Typhi isolates used in the study.
| Bangladesh | 2006–2012 | 38 |
| India | 2006–2012 | 22 |
| Pakistan | 2006–2012 | 13 |
| Nepal | 2007 | 1 |
| India/Pakistan | 2008 | 1 |
| India/Kuwait | 2008 | 2 |
| Bangladesh/India | 2010 | 2 |
| Nigeria | 2009 | 1 |
| Ghana | 2007 | 1 |
| No known travel | 2005–2011 | 18 |
| Total | 2005–2012 | 99 |
S. Typhi, Salmonella enterica serovar Typhi.
The country of origin and range of isolation dates (years) for the isolates are described.
*For five patients, multiple countries of travel were recorded, and it was not possible to confirm in which country the S. Typhi infection originated.
Summary of genotyping and SNP results for travel-associated S. Typhi isolates with known country of travel.
| Bangladesh | 1 | 1 | 9 | ||
| India | 2 | 2 | 49 | Oceania (92%) | |
| Pakistan | 1 | 1 | 17 | Southern Africa (24%)East Africa (24%)South Asia (18%) | |
| India | 1 | 1 | 9 | South Asia (33%)Western Asia (33%) | |
| Bangladesh | 2 | 2 | 9 | South America (56%)South Asia (33%) | |
| Nigeria, Ghana | 2 | 0 | 25 | ||
| Bangladesh, Pakistan | 3 | 2 | 12 | ||
| Bangladesh, Pakistan, India | 14 | 13 | 30 | ||
| Pakistan | 1 | 1 | 32 | East Africa (44%)South Asia (19%) | |
| India | 1 | 1 | 78 | Southeast Asia (71%) | |
| Bangladesh, India, Pakistan, Nepal, Kuwait | 53 | — | 853 | Southeast Asia (50%)South Asia (26%) |
Closest genome country match, number of travel-associated isolates whose country of travel matched that of the closest genome in the global collection (based on lowest number of SNPs); N (global), number of isolates in the global collection that were assigned to this subclade; N (travel), number of travel-associated isolates that were assigned to the subclade; Region frequencies, frequency of each geographic region among isolates of this subclade from the global collection (note groups of isolates from the same subclade, country and year were classified as outbreaks and represented only once per group in the frequency calculations); SNP, single-nucleotide polymorphism; S. Typhi, Salmonella enterica serovar Typhi.
*Highlights the most frequent region for this subclade among the global collection, where this matches the region of travel.
Figure 3Phylogeny of 99 travel-associated S. Typhi in comparison with the global genomic framework containing 1,831 isolates.
Whole-genome SNP tree is shown in the centre and branches are coloured by clade. Rings indicate region of origin: inner ring, global collection; outer ring, travel-associated isolates. Subclades that contain travel-associated isolates are highlighted within the tree (shaded in alternating colours) and labelled around the outside; intrasubclade phylogenies are provided in Supplementary Fig. 4.
Summary of genotyping and SNP results for travel-associated S. Typhi isolates of unknown origin.
| H06434426 | Mexico | 146 | 2.0.2 | North America (50%)North Africa (50%) |
| H06156550 | Pakistan | 92 | 3.0.1 | South Asia (50%)North Africa (50%) |
| H05272442 | Ghana | 17 | 3.1.1 | |
| H09176223 | Bangladesh | 11 | 3.3.0 | |
| H10182335 | India | 18 | 3.3.1 | East Africa (44%)South Asia (19%) |
| H10046338 | Bangladesh | 13 | 4.3.1 | Southeast Asia (50%)South Asia (26%) |
| H05406403 | Bangladesh | 4 | ||
| H06136379 | Bangladesh | 19 | ||
| H06136380 | Bangladesh | 17 | ||
| H05118260 | Bangladesh/India | 19 | ||
| H10382491 | India | 9 | ||
| H10394694 | India | 9 | ||
| H06016481 | India | 9 | ||
| H05196407 | India | 14 | ||
| H05196408 | India | 12 | ||
| H05212226 | India | 17 | ||
| H11372598 | Pakistan | 10 | ||
| H09266336 | Pakistan | 9 |
SNP, single-nucleotide polymorphism; S. Typhi, Salmonella enterica serovar Typhi.
For each London isolate, the closest isolate in the global collection was determined (closest=smallest SNP distance, that is, smallest number of core genome SNPs); the country and SNP distance are recorded.
*Highlights the most frequent region for this subclade among the global collection, where this matches the region of the closest isolate in the global collection.
†Location of closest travel-associated isolates from London (unresolvable beyond ‘South Asia' based on the global collection alone).