Literature DB >> 23023330

Intracontinental spread of human invasive Salmonella Typhimurium pathovariants in sub-Saharan Africa.

Chinyere K Okoro1, Robert A Kingsley, Thomas R Connor, Simon R Harris, Christopher M Parry, Manar N Al-Mashhadani, Samuel Kariuki, Chisomo L Msefula, Melita A Gordon, Elizabeth de Pinna, John Wain, Robert S Heyderman, Stephen Obaro, Pedro L Alonso, Inacio Mandomando, Calman A MacLennan, Milagritos D Tapia, Myron M Levine, Sharon M Tennant, Julian Parkhill, Gordon Dougan.   

Abstract

A highly invasive form of non-typhoidal Salmonella (iNTS) disease has recently been documented in many countries in sub-Saharan Africa. The most common Salmonella enterica serovar causing this disease is Typhimurium (Salmonella Typhimurium). We applied whole-genome sequence-based phylogenetic methods to define the population structure of sub-Saharan African invasive Salmonella Typhimurium isolates and compared these to global Salmonella Typhimurium populations. Notably, the vast majority of sub-Saharan invasive Salmonella Typhimurium isolates fell within two closely related, highly clustered phylogenetic lineages that we estimate emerged independently ∼52 and ∼35 years ago in close temporal association with the current HIV pandemic. Clonal replacement of isolates from lineage I by those from lineage II was potentially influenced by the use of chloramphenicol for the treatment of iNTS disease. Our analysis suggests that iNTS disease is in part an epidemic in sub-Saharan Africa caused by highly related Salmonella Typhimurium lineages that may have occupied new niches associated with a compromised human population and antibiotic treatment.

Entities:  

Mesh:

Substances:

Year:  2012        PMID: 23023330      PMCID: PMC3491877          DOI: 10.1038/ng.2423

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


INTRODUCTION

S. enterica is a diverse bacterial species that remains a common cause of infectious disease in humans and animals throughout the world[1]. Human Salmonella infections are classically divided into diseases caused by typhoidal or non-typhoidal salmonella (NTS). The former category includes the human restricted S. enterica serovars Typhi and Paratyphi that cause the systemic disease typhoid, while NTS is comprised of the majority of other serovars that predominantly cause self-limiting gastroenteritis in humans[2]. Salmonella Typhi is a human restricted pathogen that is transmitted from human to human, whereas NTS disease is normally associated with zoonotic reservoirs, typically domesticated animals, with little or no sustained human-to-human transmission. In contrast to this classical view, NTS are a frequent cause of invasive bacterial disease in many countries in sub-Saharan Africa[3,4]. This invasive form of NTS disease (iNTS) is common both in children with malnutrition, severe anemia, malaria or HIV[4,5] and HIV-infected adults[6], frequently surpassing Salmonella Typhi in many parts of the region as the dominant cause of invasive salmonellosis. The clinical presentation of this disease is distinct from both gastroenteritis and typhoid fever, characterized by a non-specific fever that can be indistinguishable from malaria and in rare cases is accompanied by diarrhea[7]. The frequency of NTS-associated case fatalities can be extremely high in both adults and children (22% – 45%)[6,8-10]. S. enterica serovar Typhimurium (Salmonella Typhimurium) is one of the serovars most frequently associated with iNTS in the sub-Saharan region, although other serovars including S. enterica serovar Enteritidis have also been implicated [3,4,8]. We previously reported that Salmonella Typhimurium isolates from Kenya and Malawi were predominantly of a novel multi-locus sequence type (ST) designated ST313[7], rarely isolated from outside sub-Saharan Africa. The DNA sequence of representative multi-drug-resistant (MDR) ST313 isolates, D23580 and A130, identified genomic features distinct from previously characterized gastroenteritis-associated strains[7]. These features included evidence of partial genome degradation, with some parallels to that observed in the S. enterica serovars Typhi and Paratyphi A that has been linked to niche adaptation[11,12]. Here we use single nucleotide polymorphism (SNP)-based phylogenetic methods based on whole genome sequences to determine the population structure of a geographically-diverse collection of invasive Salmonella Typhimurium isolates from different sub-Saharan African countries. These data are placed in the phylogenetic context of Salmonella Typhimurium isolates from other parts of the world. We provide evidence that two tightly clustered genetic lineages have emerged within the last sixty years to be a dominant cause of an epidemic invasive Salmonella Typhimurium disease in the region. We highlight the potential role of antibiotic-resistance acquisition in driving the epidemic and the temporal association with an increased prevalence of HIV.

RESULTS

Phylogenetic analysis of Salmonella Typhimurium

Salmonella Typhimurium represents an unstratified serologically-defined group within the broader species S. enterica[13]. Therefore, in order to place the invasive Salmonella Typhimurium isolates from sub-Saharan Africa into an evolutionary and phylogenetic context we exploited whole genome sequencing to discover potentially informative SNPs within a collection of 179 Salmonella Typhimurium spanning the period 1938 and 2010, collected from different parts of the world. Our collection included 129 invasive Salmonella Typhimurium from Malawi, Kenya, Mozambique, Uganda, The Democratic Republic of Congo (DRC), Nigeria and Mali (Supplementary Table 1). Data were available for 10,623 high-quality SNPs, corresponding to approximately 1 SNP for every 407 bp, that were distributed relatively uniformly across the genome of the reference Salmonella Typhimurium SL1344. To refine phylogenetic analysis, SNPs associated with repetitive sequences, mobile elements and phage sequences, representing ~4% of the genome, were excluded. We detected no evidence of extensive recombination within the remaining genomic sequences and consequently SNPs mapping to these regions were used to reconstruct a maximum-likelihood phylogenetic tree[14] (Fig. 1).
Figure 1

Population structure of Salmonella Typhimurium isolates

Unrooted maximum likelihood tree showing the relationship between isolates associated with invasive disease and gastroenteritis. Lineages of human invasive Salmonella Typhimurium are shown in red and labeled I–3IV.

The Phylogenetic positions of invasive strains A130 (ref. 7) and D23580 (ref. 7) and gastroenteritis-associated strains DT104, LT2 and SL1344 are indicated. Branch lengths are indicative of the estimated substitution rate per variable site. Scale bar, 0.009 substitutions per variable site.

The numbers of isolates (in parentheses) and MLST groups are indicated in boxes. Top left, unrooted maximum-likelihood tree of plasmid sequences showing congruence with the chromosomal tree. Asterisks indicate nodes with 100% bootstrap support.

Notably, invasive Salmonella Typhimurium isolates from sub-Saharan Africa fall predominantly into two distinct ST313 phylogenetic lineages designated as lineages I and II. Furthermore, these lineages form distinct and extremely tight clusters on separate branches from other Salmonella Typhimurium that were isolated elsewhere in the world. The tight clustering is illustrated by the fact that isolates within either lineage I or lineage II are separated by mean differences as few as 33 and 21 SNPs respectively. The two invasive Salmonella Typhimurium lineages are joined to the main tree by relatively long branches, but there is divergence at the branch tips suggesting recent clonal or population expansion. Both lineages are thus more closely related to each other than to any other Salmonella Typhimurium within the tree. The two invasive Salmonella Typhimurium lineages are joined to the main tree by relatively long branches, but there has been divergence at the branch tips suggesting recent clonal or population expansion. Multi-Locus-Sequence-Typing (MLST) analysis confirmed lineages I and II as ST313, although a single isolate, 5580 from lineage I is ST394, a single locus variant of ST313 (Supplementary Fig. 1). All eight invasive Salmonella Typhimurium isolates from sub-Saharan Africa which fall outside of lineages I and II are ST19, a common ST to which 82% (41/50) of the 50 non-African Salmonella Typhimurium that we sequenced belong. Other STs represented in the non sub-Saharan Salmonella Typhimurium lineages include ST34 (5/50), ST98 (1/50), ST128 (2/50) and ST568 (2/50) (Supplementary Fig. 1).

Temporal and geographic distribution relative to phylogeny

We performed BEAST[15] analysis on 129 sub-Saharan invasive Salmonella Typhimurium isolates from seven sub-Saharan African countries covering a 22 year-period from 1988 to 2010. BEAST is designed to reconstruct evolutionary history within the context of geographical distribution over time from sampled DNA sequences[16], and has been used extensively in bacterial[17-20], viral[21,22] and eukaryotic population studies[23]. From this analysis, a single maximum clade credibility tree was produced for each lineage (Fig. 2a & 2b). The mean evolutionary rates, assuming a Bayesian skyline model of population size change and a relaxed molecular clock, were estimated to be 1.9×10−7 and 3.9×10−7substitutions−1site−1year−1, for lineage II and I respectively. This corresponds to an accumulation of approximately 1-2 SNPs genome−1year−1, which is similar to that calculated for the enteric pathogen Vibrio cholerae (8 × 10−7 site−1 year−1)[24] and lies between the rates estimated for Yersinia pestis (2 × 10−8)[25] and Staphylococcus aureus (3 × 10−6)[26]. The topologies of the BEAST and maximum-likelihood trees were congruent, and the recovered nodes were supported with high posterior probabilities and bootstrap values, respectively.
Figure 2

Bayesian-based analyses of spatial and temporal distribution of Sub-Saharan African lineages of invasive Salmonella Typhimurium

(a,b) MCC trees from BEAST showing phylogeographic reconstruction of lineage I (a) and lineage II (b) with estimated sampling intervals of 43.0 years (1960.6-2003.6) and 32.3 years (1977.1-2009.4), respectively. Estimated ages of nodes where transmissions occurred (black circles) are reported as the median values, with 95% HPD given in parentheses. Asterisks indicates the second introduction of invasive Salmonella Typhimurium in Mali. Posterior probability values for all geographical locations at the ancestral nodes other than the second spread into Uganda (0.56) >0.9 for lineage I in a and >0.7 for lineage II in b were recovered for all the geographic locations at the ancestral nodes. Branches and nodes are colored according to the location that had the highest posterior probability value. Arrows indicate the estimated points of insertion of independently acquired Tn21 and cat loci within the plasmids in both lineages. (c) Percentage of HIV prevalence in sampled countries from 1960 to the present. HIV prevalence is defined as the percentage of men and women between the ages of 15 and 49 who are HIV positive (UNAIDS Report on the Global AIDS Epidemic 2010; see URLs). Dashed lines show predicted HIV prevalence values before monitoring and reportage for the different countries extrapolated backwards in time to 1960. The block outlined by dashed lines indicates the time when HIV prevalence monitoring was temporally associated with the expansion of invasive Salmonella Typhimurium clones across sub-Saharan Africa.

A time-dependent phylogeographic reconstruction of lineage 1, which is estimated to have emerged ~52 years ago (95% Highest posterior density HPD, 1920.4 -1979.5; Fig. 2a), indicated that in our collection, isolates from Malawi diverged earliest from the last common ancestor for this lineage. Although we cannot completely eliminate the potential bias due the number of Malawian isolates analyzed within this lineage, 25 permutation datasets using 10 randomly selected Malawi isolates (a different set of 10 isolates, an equivalent sample size to those from other countries, was used for each permutation) returned similar results to the complete data set. Thus, we are confident of our estimates of the age and geographical origin of the ancestral node of this lineage (Supplementary Fig. 2a & 2b). Analyses of the distribution of isolates from each country and the tree topology of lineage I are consistent with at least four independent transmission events or movements across Southeastern Africa, with Malawi having served as a potentially important early hub (Fig. 3a, Supplementary Fig. 3a). The earliest identifiable waves or transmissions were from Malawi to Kenya dated ~1982 (95% HPD, 1967.6 – 1990.2) and between Malawi and DRC dated ~1983 (95% HPD, 1974.8 – 1988.3). This same phylogenetically-linked wave was identified in Uganda ~1989 (95% HPD, 1980.0 – 1994.6) and a further onward wave was identifiable in Mozambique in ~1990 (95% HPD, 1981.0 – 1994.4) and this manifested as a second introduction into Uganda in ~2001 (95% HPD, 1981.0 – 1994.4). We cannot identify the specific geographical route that these bacterial lineages followed but the phylogenetic evidence clearly temporally links these outbreaks as a single epidemic. Our results also show evidence of geographical clustering after a transmission event introduced the lineage into a country. This suggests that the epidemic clone was introduced a limited number of times into each country, giving rise to localized epidemics or outbreaks.
Figure 3

Geospatial transmission of invasive Salmonella Typhimurium isolates in sub-Saharan Africa

(a,b) Phylogeographical diffusion of lineages I (a) and II (b) across sub-Saharan Africa over time based on a discrete geospatial model with associated geographical coordinates. Countries shown here represent discrete locations annotated at the tree nodes taken from the BEAST analyses, and branches that indicate location changes are represented on the map as the transmission lines. The color gradient shows the ages of transmission lines.

Invasive Salmonella Typhimurium isolates of lineage I disappeared from our collection between 2003–2005 and were replaced by lineage II and isolates after 2006 were found exclusively in this cluster. Lineage II is estimated to have emerged ~35 years ago (95% HPD, 1957.1 – 1986.8) making it genetically younger than lineage I (Fig. 2b,). The spread of lineage II also appears to have occurred in several waves (Fig. 3b, Supplementary Fig. 3b). Our deepest-rooted isolates are from DRC with evidence for transmission onward to Uganda dated ~1985 (95% HPD 1972.6 – 1990.6). This wave was detected in Kenya and Malawi between 1994 and 1996. Malawi likely represents a more recent hub for further dispersal of invasive Salmonella Typhimurium lineage II isolates between 1995 and 1998 to several countries, including neighboring Mozambique and further westwards, across the sub-Saharan region, to Mali and Nigeria. A more recent wave of this lineage appears to have spread from Kenya arriving back in Malawi around 2002. We also see evidence for localized epidemics associated with the lineage II clones, as highlighted by clustering based on geography. Indeed, local epidemiology and molecular typing in Malawi and Kenya[7,8] of invasive Salmonella Typhimurium isolates from 1997 to 2006 describes a local clonal replacement event of lineage I by lineage II associated with the emergence of chloramphenicol resistance in an 18-month period from 2001 to 2003.

Evolution of MDR and potential role of cat gene in clonal replacement

Previously, we characterized two distinct composite Tn21-like transposition elements encoding multiple antibiotic resistance determinants located on pSLT, the so-called virulence-associated plasmid, in two representative invasive Salmonella Typhimurium isolates, A130 (lineage I) and D23580 (lineage II)[7]. These Tn21 elements are inserted at different sites in the pSLT virulence plasmid in each isolate. Notably, in our phylogenetic analysis, we found these insertion sites to be identical within each lineage but different between lineages, suggesting that Tn21 element acquisition was an independent and early event in each lineage (Fig. 4, Supplementary Fig. 4). Interestingly, only one isolate from lineage I (A24924) and one from lineage II (254DRC) do not have a Tn21-like element (Fig. 4). Comparative analyses of these two isolates, which significantly are the most deeply rooted isolates in each lineage, revealed that although the relevant variant of Tn21 element is absent in both isolates (Fig. 2a & 2b), they share the pSLT plasmid backbone with other isolates of the same lineage. This suggests that in each case they share a common ancestor with other isolates within the same lineage, but importantly the ancestor existed before the acquisition of the composite Tn21-like elements (See supplementary data). With the exception of a deletion in istA, a transposase of insertion sequence IS1326, in A16083, the lineage 1-specific Tn21 locus is relatively highly conserved in most isolates of lineage I (Fig 4b). In contrast, the Tn21-like locus encoded by lineage II appears somewhat unstable since isolates in different parts of the tree have lost subsets of genes (14DRC, 5582, J17 and A32751) (Fig. 4a; Supplementary fig. 5).
Figure 4

Distribution of MDR loci in invasive Salmonella Typhimurium in relation to phylogeny

A Tn21 elements in lineage I (panel 1) and lineage II (panel 2) mapped to Tn21 element from strain D23580 (on top). B lineage I elements mapped to assembled Tn21 sequence in strain A130. Sequence reads mapping to the complete sequence length is represented as a heatmap with dark green color indicating > 90% (high) coverage, light green indicating >30<90% coverage and white indicating < 30% (low) coverage.

One striking feature of the dataset is the absence of a chloramphenicol resistance (cat) gene from all isolates in lineage I. In contrast, the gene was present in > 97% of lineage II, being absent in two isolates (Fig. 4a). These isolates are 254DRC, which does not possess a Tn21 element and 5582, a 2005 Kenyan isolate where the loss of the cat gene is due to a simple deletion event (Fig. 4a). This strongly suggests an independent acquisition of the cat gene borne on a lineage II-specific Tn21 element early on in the genealogy, most likely about the time of expansion from DRC, as shown in Figure 2b (median node date 1984; 95% HPD, 1972.6 – 1990.6; state posterior probability = 0.78). The analysis of MDR acquisition is consistent with the antibiotic resistance profiles obtained for the isolates. In some of our sampling sites, such as Malawi, the acquisition of resistance to chloramphenicol was observed in invasive Salmonella Typhimurium isolates from around 2001-2004, consistent with the arrival of lineage II[7]. At this time, chloramphenicol was the drug of choice for treatment for suspected severe bacterial infections and cases of iNTS infection confirmed by blood culture. The acquisition of chloramphenicol resistance may have afforded lineage II clones a greater opportunity to survive treatment and transmit, which could have in turn contributed to the clonal replacement of lineage I isolates observed between 2003 and 2005 and the subsequent expansion of lineage II clones thereafter.

Transmission is temporally associated with HIV and the HIV pandemic

Time-dependent phylogeographic analysis revealed the clonal expansion of two distinct invasive Salmonella Typhimurium lineages within the last 40-50 years that was accompanied by spread across multiple countries of sub-Saharan Africa. Intriguingly, this emergence temporally coincides with the HIV epidemic in sub-Saharan Africa. Molecular clock analysis of HIV-1 genome sequences suggested that the pandemic began at the start of the twentieth century[27-29], with prevalence peaking in the 1990’s in many countries, including those represented within our strain collection (2% in Mali to over 15% in Malawi) (Fig. 2c; Supplementary Fig. 6). An association with the HIV status of the patients is also reflected in terms of the samples analyzed in this study. For example, where a test was conducted for HIV, all adult samples were positive. One of the first reported case of HIV infection in Africa was from an adult in DRC[30] and interestingly, the earliest geographic localization of epidemic clones from lineage II is within this country. Thus, the Congo basin provides a potential origin of invasive Salmonella Typhimurium lineage II[31]. It therefore appears possible that the epidemic of invasive Salmonella Typhimurium and transmission across the sub-Saharan region was potentiated by an increase in the critical population of susceptible and immunocompromised individuals, particularly more mobile adults.

DISCUSSION

The recent reporting of a remarkably high incidence of invasive Salmonella Typhimurium in various parts of the sub-Saharan African region makes it increasingly important to understand the evolutionary origins and spatio-temporal spread of these isolates. Recently, whole-genome sequencing methods have been used to trace intercontinental transmission of different recently emerged and closely related bacterial pathogens[18,24,26,32], and we have therefore applied this high resolution analysis to determine the phylogenetic structure of invasive Salmonella Typhimurium. Here we revealed that the vast majority of Salmonella Typhimurium isolates associated with invasive disease from sub-Saharan Africa comprised just two highly conserved lineages of MLST type ST313 that are more closely related to each other than any other known Salmonella Typhimurium lineage. This is in contrast to the considerable phylogenetic variation of Salmonella Typhimurium isolates associated with gastroenteritis or invasive disease from outside sub-Saharan Africa. Thus, invasive Salmonella Typhimurium disease in this region is in part a previously unrecognized epidemic caused by the spread of the clones from these two lineages. We show how invasive Salmonella Typhimurium transmission into a particular country or geographic area occurs as a discrete, temporally defined introduction that is followed by subsequent spread within that particular location (Fig. 2), although some local regions have experienced multiple introduction events. For example, it is evident that two independent introductions occurred in Mali between 1995 and 2000 (Fig. 2b). Considerable clonal expansion has occurred independently in each of these two lineages beginning around 1960. Independent acquisition of a Tn21 element encoding multiple drug resistance genes by both lineages may have facilitated their successful transmission across the sub-continent, within the susceptible host population. A later acquisition of a cat gene on the composite element within lineage II have contributed to a clonal replacement event, which occurred from 2003 – 2005, and resulted in greater spatial dispersion of clones from this lineage over sub-Saharan Africa. An association between acquisition of chloramphenicol resistance and increased transmission has been observed in early epidemiological studies on chloramphenicol-resistant Salmonella Typhi in Mexico[33] and is also confirmed by observations reported in Kenya[7] and Malawi[8]. HIV increases susceptibility to iNTS infections[34] and this form of bacteraemia is an AIDS-defining opportunistic infection in adults[35,36]. Further, animal models of co-infection between iNTS and simian immunodeficiency virus (SIV)[37] or malaria[38] indicate that host immune status plays a critical role in determining the outcome of Salmonella infections. Indeed, sporadic human invasive disease is a feature of the non-ST313 lineages of Salmonella Typhimurium. Thus, although ST313 is the dominant form of invasive Salmonella disease in sub-Saharan Africa[3,39], it is not unexpected that other S. enterica or indeed Salmonella Typhimurium lineages can also cause sporadic disease. Notably, supporting epidemiological evidence indicates that the ST313 Salmonella Typhimurium lineages may not have reached some parts of Africa, including the Gambia[40,41] and Ethiopia[42,43] where iNTS has been reported. It is particularly noteworthy that we see a temporal association of clonal expansion of invasive Salmonella Typhimurium with the peaks in HIV prevalence, particularly in adults in the countries included in our study. The rapid expansion and spread of these clones may have been facilitated by the dramatic expansion of a mobile susceptible host population. Previous analysis has shown that HIV-I arrived in East and Central Africa around the 1950’s, followed by an eastwards expansion in the 1970’s and early 1980’s[44]. We find temporal parallels in this estimated HIV-I expansion time frame and our estimate of the earliest detectable transmissions in lineage 1 around the early 1980’s (95% HPD, 1967.6 – 1990.2). The continued expansion of the HIV-susceptible population until the peaks of prevalence in the 1990’s (Fig. 2c), together with the acquisition of additional chloramphenicol resistance, is likely contributory to the greater dispersal of lineage II clones. The association of iNTS disease with malaria, anemia and malnourishment in children is well-documented[4,5,45-47] and we have isolates within our collection from children with these underlying conditions (Supplementary Table 1). Malnourished and malarial children thus present an additional ecological niche that co-exists with as well as precedes the HIV-positive population. Notably, we found no evidence of phylogenetic segregation between such isolates and those from HIV-positive children or adults within the two epidemic lineages. This is consistent with immunosuppression as a key predisposing factor in iNTS disease. However, the emergence of a large cohort of HIV infected adults may also have facilitated the spread of the invasive Salmonella Typhimurium lineages, as adults are inevitably more mobile. This is especially pertinent since failure of immunological control of iNTS infections in HIV-positive African adults has been well-documented[34,48]. The resulting large pool of immunosuppressed individuals may also facilitate an unusual human-to-human transmission (anthroponotic) component in invasive Salmonella Typhimurium disease, in contrast to most disease caused by NTS outside of Africa, where transmission is predominantly zoonotic [49]. There is a dearth of information on the specifics of NTS transmission in sub-Saharan Africa although independent, country-based studies have shown evidence of non-zoonotic transmissions patterns[39,49,50]. It is perhaps noteworthy that we detected a similar pattern of genomic degradation in the form of gene loss and pseudogene formation to the human-adapted serovars S. Typhi[12] and S. Paratyphi[51] in the two fully sequenced African invasive Salmonella Typhimurium isolates, D23580 and A130 representative of lineages I and II respectively[7]. Taken together, these results suggest that the invasive clones may have adapted to facilitate direct person-to-person transmission within the human population. Further comparative studies on the virulence, and transmission potential of different Salmonella Typhimurium lineages will be instrumental in closing this critical knowledge gap and are the focus of on-going investigations These results provide the first whole-genome based transmission study of this kind on iNTS from sub-Saharan Africa, and they highlight the power of these approaches to monitor emergence and spread of clonal bacterial populations associated with epidemics locally or globally over time. The transmission pathways hypothesized here suggest potential routes to the implementation of appropriate clinical intervention strategies.

ONLINE METHODS

Isolate Selection and genomic DNA preparation

129 isolates associated with invasive disease from Malawi, Mali, Kenya, and Nigeria were cultured from venous blood, cerebrospinal fluid or stool of febrile adults and children between 1988 and 2010. Gastrointestinal isolates were obtained from collections at the Salmonella Genetic Stock centre (SGSC)[52], Calgary Canada, the Health Protection Agency, London or as indicated in the Supplementary Table 1[13,53-57]. Invasive Salmonella Typhimurium isolates were identified by standard serotyping methods, using the O- and H-antigen agglutination, based on the Kauffmann-White Scheme[1]. DNA samples were provided for invasive Salmonella Typhimurium isolates from the DRC, Mozambique and Uganda. Isolates were grown on Luria-Bertani (LB) medium; single colonies were incubated in LB Broth overnight at 37°C. Bacteria cells were pelleted by centrifugation (3, 700 × g or 4,300 rpm, 5 minutes) and DNA extracted using either the Wizard® Genomic DNA kit (Promega) according to manufacturers’ instructions or by phenol/chloroform extraction protocol[18]. DNA quality and quantity were checked by gel electrophoresis and Qubit® quantitation platform (Invitogen). 20-50 ng/μL of DNA from each isolate was submitted for Illumina sequencing.

Genomic library preparation and sequencing

Multiplex libraries with a 200bp insert size were prepared using 12 unique index-tags, and sequenced to generate 54 or 76 base-pair (bp) paired-end reads. Cluster formation, primer hybridisation and sequencing reactions were based on reversible terminator chemistry using the Illumina Genome Analyser II System according to standard protocol[26,58]. Sequence data were submitted to the European Nucleotide Archive. Accession numbers are given in Supplementary Table 1.

Read alignment and SNP detection

Paired-end Illumina sequence data from each isolate was mapped to the reference genome Salmonella Typhimurium strain SL1344[57] using SSAHA2[59]. Sequence reads mapped to an average of 97.7% of the reference with a mean depth of 56.5-fold in mapped regions across all isolates (Supplementary Table 1). SNPs were identified using samtools mpileup and filtered with a minimum mapping quality of 30 and quality ratio cut-off of 0.75[18,24,26,59,60]. SNPs called in phage sequences and repetitive regions of the Salmonella Typhimurium reference were excluded. Repetitive regions were defined as exact repetitive sequences of ≥ 20 bp, identified using repeat finding programs nucmer[61], REPeuter[62] and repeat-match[12,17]. Recombinant segments of the genome were removed from the whole genome alignment as described previously[18]. Following the removal of recombinant segments, mobile elements and repetitive sequences of the genome, a concatenated alignment composed of 10,623 SNP sites from each sequenced isolates was produced. Small insertions and deletions (indels) were also identified from the SSAHA result output, but were not used for subsequent phylogenetic analyses.

Phylogenetic analyses

A Maximum-likelihood phylogenetic tree (Fig. 1) was constructed from SNP alignment with RAxML v7.0.4[14] using a general time-reversible (GTR) substitution model with γ correction for among-site rate variation. Support for nodes on the trees were assessed using 100 bootstrap replicates. For the identified lineages 1 and II, 487 and 422 chromosomal SNP loci were identified, respectively. These within-cluster SNP alignments were then used to recalculate individual maximum likelihood trees for each cluster, using the same parameters described above. These were used as input trees for subsequent analyses. The methods described above were also applied to obtain a maximum-likelihood phylogenetic reconstruction of plasmids from our isolate collection using 1,251 concatenated SNP sites, with the virulence plasmid pSLT-SL1344 from SL1344 as the reference.

MLST Analyses

Allele coordinates were obtained for the seven housekeeping genes used for the S. enterica MLST typing scheme (aroC, dnaN, hemD, hisD, purE, sucA and thrA) by manually marking the coordinates in the whole-genome alignment of the isolates. The marked up regions were extracted and a multi-sequence alignment of each gene produced for all the isolates. These were used to determine the sequence type of each isolates using the Salmonella enterica MLST database

Bayesian phylogeny, estimating dates of divergence and phylogeographical analyses of lineages

Estimation of rates of evolution, divergence times and phylogeography for our isolate collection as well as for each of the identified lineages was performed using the Bayesian MCMC framework, BEAST[15], on SNP alignments. Various combinations of population size change model and molecular clock model were compared to find the model that best fit the data. In all cases Bayes Factors showed strong support (BF << 200) for the use of skyline[63] model of population size change and a relaxed uncorrelated lognormal clock[64] which allows the evolutionary rates to change among the branches of the tree[24] and a GTR substitution model with gamma correction for amongst site rate variation. Using the same parameters, the geographical locations of ancestral nodes were estimated using the discrete geospatial model implemented in BEAST (Supplementary Table 1)[16]. In all cases, three independent chains were run for 250 million steps each, and sampled every 10,000 steps. The three chains were combined with LogCombiner[15] with the initial 25 million steps removed from each as a burn-in. Maximum clade credibility (MCC) trees were created and annotated using TreeAnnotator and viewed in FigTree[15]. We report estimates as median values within 95% highest posterior density (HPD) and report posterior probability values as support for identified ancestral node age and geographical location. For the latter, we report values greater than 0.7. Spatial reconstruction of MCC trees was carried out using the SPREAD[65] software and visualized with Google Earth (Supplementary figure 3).

HIV prevalence data extrapolation

HIV prevalence data for the sampled countries was modelled with a generalized logistic (or Richards’)[66] curve using the grofit R package[67]. Curves were fit to all data points from the beginning of monitoring until stabilization or decline of the HIV-positive population. We then used these fitted models to extrapolate possible past population sizes.

Validation tests for the origin of lineage I

25 permutation datasets made up of 10 randomly selected Malawian isolates, together with the 7 DRC, 8 Kenyan, 8 Mozambican and 7 Ugandan isolates, were used to reconstruct Bayesian maximum clade credibility (MCC) phylogenetic trees. Each of the 25 datasets included a different set of 10 randomly selected Malawian isolates. The same parameters described above were applied in making the trees. Malawi was the ancestral state of all resulting 25 MCC trees with posterior probability values ranging from 0.58 - 0.92. The resulting phylogenetic trees and their root location state probability distributions are shown in supplementary figure 2b.

Plasmid sequence analyses

Paired-end sequence reads of each isolates were mapped to the multi-fasta sequence features including the Tn21 locus of pSLT-BT, the reference plasmid from invasive strain, D23580 using the Burrows-Wheeler Aligner software BWA[68], with minimum base call quality of 50, minimum mapping quality of 30, and minimum read depth of 4. Isolates from each of the three clusters were analyzed separately by cluster. Isolates with <30% of reads mapping to the length of the feature were interpreted as not having the feature, and those with >70% of reads mapping to the feature were interpreted as having the region of interest. A heatmap of the analysis based on the selected cut-off values was generated and aligned to the BEAST MCC tree of each cluster.

De novo sequence assembly and plasmid genome comparisons

Paired-end Illumina sequence data was assembled de novo using Velvet[69] and the parameters optimized to give the highest N50 value. The multi-contig draft genomes generated for each isolate were ordered using either pSLT or pSLT-BT to confirm plasmid structure using Abacas[70]. The draft plasmid genomes were used to query pSLT and/or pSLT-BT sequences using BLASTN[71] and comparison files generated and viewed using Artemis Comparison Tool (ACT)[72]
  70 in total

1.  Genetic exchange in Salmonella.

Authors:  N D ZINDER; J LEDERBERG
Journal:  J Bacteriol       Date:  1952-11       Impact factor: 3.490

2.  Bayesian coalescent inference of past population dynamics from molecular sequences.

Authors:  A J Drummond; A Rambaut; B Shapiro; O G Pybus
Journal:  Mol Biol Evol       Date:  2005-02-09       Impact factor: 16.240

3.  Pigeon-associated strains of Salmonella enterica serovar Typhimurium phage type DT2 have genomic rearrangements at rRNA operons.

Authors:  R Allen Helm; Steffen Porwollik; April E Stanley; Stanley Maloy; Michael McClelland; Wolfgang Rabsch; Abraham Eisenstark
Journal:  Infect Immun       Date:  2004-12       Impact factor: 3.441

4.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

5.  Invasive multidrug-resistant non-typhoidal Salmonella infections in Africa: zoonotic or anthroponotic transmission?

Authors:  Samuel Kariuki; Gunturu Revathi; Nyambura Kariuki; John Kiiru; Joyce Mwituria; Jane Muyodi; Jane W Githinji; Dorothy Kagendo; Agnes Munyalo; C Anthony Hart
Journal:  J Med Microbiol       Date:  2006-05       Impact factor: 2.472

6.  Molecular epidemiology of community-acquired invasive non-typhoidal Salmonella among children aged 2 29 months in rural Gambia and discovery of a new serovar, Salmonella enterica Dingiri.

Authors:  Usman N Ikumapayi; Martin Antonio; Jacob Sonne-Hansen; Ekow Biney; Godwin Enwere; Brown Okoko; Claire Oluwalana; Adeola Vaughan; Syed M A Zaman; Brian M Greenwood; Felicity T Cutts; Richard A Adegbola
Journal:  J Med Microbiol       Date:  2007-11       Impact factor: 2.472

7.  Salmonella bacteremia in Kenyan children.

Authors:  Andrew J Brent; Joe O Oundo; Isaiah Mwangi; Lucy Ochola; Brett Lowe; James A Berkley
Journal:  Pediatr Infect Dis J       Date:  2006-03       Impact factor: 2.129

8.  Simian immunodeficiency virus-induced mucosal interleukin-17 deficiency promotes Salmonella dissemination from the gut.

Authors:  Manuela Raffatellu; Renato L Santos; David E Verhoeven; Michael D George; R Paul Wilson; Sebastian E Winter; Ivan Godinez; Sumathi Sankaran; Tatiane A Paixao; Melita A Gordon; Jay K Kolls; Satya Dandekar; Andreas J Bäumler
Journal:  Nat Med       Date:  2008-03-23       Impact factor: 53.440

9.  Relaxed phylogenetics and dating with confidence.

Authors:  Alexei J Drummond; Simon Y W Ho; Matthew J Phillips; Andrew Rambaut
Journal:  PLoS Biol       Date:  2006-03-14       Impact factor: 8.029

10.  BEAST: Bayesian evolutionary analysis by sampling trees.

Authors:  Alexei J Drummond; Andrew Rambaut
Journal:  BMC Evol Biol       Date:  2007-11-08       Impact factor: 3.260

View more
  194 in total

1.  Genomic signature of multidrug-resistant Salmonella enterica serovar typhi isolates related to a massive outbreak in Zambia between 2010 and 2012.

Authors:  Rene S Hendriksen; Pimlapas Leekitcharoenphon; Oksana Lukjancenko; Chileshe Lukwesa-Musyani; Bushimbwa Tambatamba; John Mwaba; Annie Kalonda; Ruth Nakazwe; Geoffrey Kwenda; Jacob Dyring Jensen; Christina A Svendsen; Karen K Dittmann; Rolf S Kaas; Lina M Cavaco; Frank M Aarestrup; Henrik Hasman; James C L Mwansa
Journal:  J Clin Microbiol       Date:  2014-11-12       Impact factor: 5.948

2.  Genomic Variability of Serial Human Isolates of Salmonella enterica Serovar Typhimurium Associated with Prolonged Carriage.

Authors:  Sophie Octavia; Qinning Wang; Mark M Tanaka; Vitali Sintchenko; Ruiting Lan
Journal:  J Clin Microbiol       Date:  2015-08-26       Impact factor: 5.948

3.  Comparing the roles of antibodies to nontyphoidal Salmonella enterica in high- and low-income countries and implications for vaccine development.

Authors:  Calman A MacLennan; Sharon M Tennant
Journal:  Clin Vaccine Immunol       Date:  2013-07-31

4.  Salmonella Typhimurium Co-Opts the Host Type I IFN System To Restrict Macrophage Innate Immune Transcriptional Responses Selectively.

Authors:  Darren J Perkins; Rajesh Rajaiah; Sharon M Tennant; Girish Ramachandran; Ellen E Higginson; Tristan N Dyson; Stefanie N Vogel
Journal:  J Immunol       Date:  2015-07-22       Impact factor: 5.422

5.  Extremely drug-resistant Salmonella enterica serovar Senftenberg infections in patients in Zambia.

Authors:  Rene S Hendriksen; Katrine Grimstrup Joensen; Chileshe Lukwesa-Musyani; Annie Kalondaa; Pimlapas Leekitcharoenphon; Ruth Nakazwe; Frank M Aarestrup; Henrik Hasman; James C L Mwansa
Journal:  J Clin Microbiol       Date:  2012-10-17       Impact factor: 5.948

6.  Role of antilipopolysaccharide antibodies in serum bactericidal activity against Salmonella enterica serovar Typhimurium in healthy adults and children in the United States.

Authors:  Estela Trebicka; Susan Jacob; Waheed Pirzai; Bryan P Hurley; Bobby J Cherayil
Journal:  Clin Vaccine Immunol       Date:  2013-06-26

7.  Hypoferremia of infection: a double-edged sword?

Authors:  Kristen L Lokken; Renée M Tsolis; Andreas J Bäumler
Journal:  Nat Med       Date:  2014-04       Impact factor: 53.440

Review 8.  Antimicrobial resistance and management of invasive Salmonella disease.

Authors:  Samuel Kariuki; Melita A Gordon; Nicholas Feasey; Christopher M Parry
Journal:  Vaccine       Date:  2015-04-23       Impact factor: 3.641

9.  Pseudogenization of the Secreted Effector Gene sseI Confers Rapid Systemic Dissemination of S. Typhimurium ST313 within Migratory Dendritic Cells.

Authors:  Sarah E Carden; Gregory T Walker; Jared Honeycutt; Kyler Lugo; Trung Pham; Amanda Jacobson; Donna Bouley; Juliana Idoyaga; Renee M Tsolis; Denise Monack
Journal:  Cell Host Microbe       Date:  2017-02-08       Impact factor: 21.023

10.  Emergence of host-adapted Salmonella Enteritidis through rapid evolution in an immunocompromised host.

Authors:  Gordon Dougan; Robert A Kingsley; Elizabeth J Klemm; Effrossyni Gkrania-Klotsas; James Hadfield; Jessica L Forbester; Simon R Harris; Christine Hale; Jennifer N Heath; Thomas Wileman; Simon Clare; Leanne Kane; David Goulding; Thomas D Otto; Sally Kay; Rainer Doffinger; Fiona J Cooke; Andrew Carmichael; Andrew Ml Lever; Julian Parkhill; Calman A MacLennan; Dinakantha Kumararatne
Journal:  Nat Microbiol       Date:  2016-01-25       Impact factor: 17.745

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.