| Literature DB >> 22359605 |
Henry S Gibbons1, Michael D Krepps, Gary Ouellette, Mark Karavis, Lisa Onischuk, Pascale Leonard, Stacey Broomall, Todd Sickler, Janet L Betters, Paul McGregor, Greg Donarum, Alvin Liem, Ed Fochler, Lauren McNew, C Nicole Rosenzweig, Evan Skowronski.
Abstract
Plague disease caused by the gram-negative bacterium Yersinia pestis routinely affects animals and occasionally humans, in the western United States. The strains native to the North American continent are thought to be derived from a single introduction in the late 19(th) century. The degree to which these isolates have diverged genetically since their introduction is not clear, and new genomic markers to assay the diversity of North American plague are highly desired. To assay genetic diversity of plague isolates within confined geographic areas, draft genome sequences were generated by 454 pyrosequencing from nine environmental and clinical plague isolates. In silico assemblies of Variable Number Tandem Repeat (VNTR) loci were compared to laboratory-generated profiles for seven markers. High-confidence SNPs and small Insertion/Deletions (Indels) were compared to previously sequenced Y. pestis isolates. The resulting panel of mutations allowed clustering of the strains and tracing of the most likely evolutionary trajectory of the plague strains. The sequences also allowed the identification of new putative SNPs that differentiate the 2009 isolates from previously sequenced plague strains and from each other. In addition, new insertion points for the abundant insertion sequences (IS) of Y. pestis are present that allow additional discrimination of strains; several of these new insertions potentially inactivate genes implicated in virulence. These sequences enable whole-genome phylogenetic analysis and allow the unbiased comparison of closely related isolates of a genetically monomorphic pathogen.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22359605 PMCID: PMC3281092 DOI: 10.1371/journal.pone.0031604
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Strains and origins.
|
| |||||||
| Map | Strain ID | DateIsolated | Host | Tissue | Location | Reference | Accession Number |
| a | CO92 | 1992 | Human | Unknown | Colorado |
| |
| 3 | BA200901799 | 6/17/2009 | Human female | Blood | Santa Fe, NM | This Study | AGJS00000000 |
| 7 | AS200901539 | 7/9/2009 |
| Liver/spleen | Las Vegas, NM | This Study | AGJT00000000 |
| 1 | AS200901156 | 6/10/2009 | Feline Female, Age 12 | Blood | Santa Fe, NM | This Study | AGJU00000000 |
| 2 | BA200901703 | 6/10/2009 | Human male | Groin aspirate | Santa Fe, NM | This Study | AGJV00000000 |
| 6 | BA200901990 | 7/8/2009 | Human female | Blood | Edgewood, NM | This Study | AGJW00000000 |
| 8 | BA200902009 | 7/10/2009 | Human Male | - | Edgewood, NM | This Study | AGJX00000000 |
| 9 | AS200902147 | 8/14/2009 |
| Liver/spleen | Santa Fe Airport | This Study | AGJY00000000 |
| 4 | AS200901434 | 6/30/2009 |
| Liver/spleen | Santa Fe, NM | This Study | AGJZ00000000 |
| 5 | AS200901509 | 7/7/2009 |
| Liver/spleen | Santa Fe, NM | This Study | AGKA00000000 |
Figure 1New Mexico is a Portion of the Enduring North American Plague Focus.
Highly virulent Yersinia pestis can be found among numerous rodent and animal hosts, occasionally infecting humans and their pets when they come into contact with infected animals or fleas. Samples were chosen for sequencing to represent examples coming from each broad geographic region. Common hosts include squirrels, cats, prairie dogs (Cynomys gunnissoni) and rabbits (Sylvilagus audubonii) [3]. Sequenced isolates are shown with red markers; other cases with green markers (Location map on right). Locations of isolation of strains were mapped using Google Earth®. Source: New Mexico State Dept. of Public Health [32].
Figure 2Genetic Diversity of the 2009 Plague Isolates.
Distribution of mutations through the 2009 isolates. Mutations relative to the parent strain (CO92) are indicated by black squares. Grey squares indicate that the mutation was not called automatically but was evident by manual inspection of the assembly. Mutation 31 from Table 2 is not shown nor incorporated into the phylogeny as it is an expansion of a 10 bp repeating sequence.
Mutations Identified in New Mexico Y. pestis Strains.
| Replicon | Muta-tion ID # | CO92 Coor-dinates | Base(CO92/NM) | Gene(s) Affected | Annotation | Effect on Protein Sequence |
| pMT1 | 1 | 87122 | -/G | Integrase | ||
| Chrom | 2 | 195074 | G/A | Intergenic | YPO0174- | |
| Chrom | 3 | 284552 | G/- | YPO0283 |
| Frameshift (157/657) |
| Chrom | 4 | 341011 | C/T | Intergenic | YPO0331- | |
| Chrom | 5 | 666671 | C/A | Intergenic | ||
| Chrom | 6 | 669885 | C/T | YPO0610 | Hypothetical protein, weak homology to glycosylhydrolases | H606Y |
| Chrom | 7 | 681456 | T/A | YPO0618 | Hypothetical transmembrane protein | S84C |
| Chrom | 8 | 707246 | G/T | Intergenic | ||
| Chrom | 9 | 961274 | 4 bp del. | Intergenic | ||
| Chrom | 10 | 1072960 | G/A | YPO0969 | Hypothetical protein | A551V |
| Chrom | 11 | 1288717 | C/A | YPO1142 |
| Truncation (160/496) |
| Chrom | 12 | 1336811 | C/T | YPO1187 | Putative substrate binding periplasmic protein | Synonymous |
| Chrom | 13 | 1391153 | C/T | Intergenic | ||
| Chrom | 14 | 1558606 | C/T | YPO1383 |
| A578T |
| Chrom | 15 | 1564799 | C/T | YPO1386 |
| A313V |
| Chrom | 16 | 1738860 | T/A | YPO1528 |
| Synonymous |
| Chrom | 17 | 1878012 | C/A | YPO1652 |
| Q113H |
| Chrom | 18 | 1885569 | 4 bp ins. | YPO1657 |
| |
| Chrom | 19 | 1971625 | C/T | Intergenic | ||
| Chrom | 20 | 2060795 | C/A | YPO1813 | Putative sugar binding periplasmic protein | A318E |
| Chrom | 21 | 2184694 | C/A | YPO1926 | Putative 4-hydroxybutyrate coenzyme A transferase | Synonymous |
| Chrom | 22 | 2238568 | A/C | Intergenic | ||
| Chrom | 23 | 2285006 | C/T | YPO2013 |
| G255E |
| Chrom | 24 | 2418892 | A/G | YPO2149 | Hyopthetical | Synonymous |
| Chrom | 25 | 2450171 | T/C | Intergenic | Near IS element | |
| Chrom | 26 | 2594121 | C/A | YPO2306 | Putative amino acid transporter | P267T |
| Chrom | 27 | 2635664 | G/A | YPO2341 | Putative mandelate racemase/muconate | Synonymous |
| Chrom | 28 | 2645036 | A/G | Intergenic | ||
| Chrom | 29 | 2842886 | G/A | YPO2553 |
| R402C |
| Chrom | 30 | 2871000 | G/A | Intergenic | ||
| Chrom | 31 | 2916182 | 12 bp ins | Intergenic | Repeat expansion | |
| Chrom | 32 | 3224054 | G/A | YPO2886 |
| Synonymous |
| Chrom | 33 | 3229957 | G/C | YPO2887 |
| Synonymous |
| Chrom | 34 | 3348820 | A/- | YPO2998 | Putative 2-component system response regulator | Frameshift (24/227) |
| Chrom | 35 | 3406528 | G/A | YPO3049 | Putative binding protein-dependent transport system | A564V |
| Chrom | 36 | 3473912 | G/A | Intergenic | ||
| Chrom | 37 | 3479165 | C/T | YPO3112 |
| D589N |
| Chrom | 38 | 3588660 | G/A | YPO3224 |
| Synonymous |
| Chrom | 39 | 4081854 | G/T | YPO3661 | Putative sulfite oxidase subunit YedZ | H164N |
Figure 3Phylogenetic Analysis of 2009 Plague Isolates.
The phylogenies of the 2009 isolates were inferred from MLVA data (Panels A and B) or the SNP/Indel data (panels C and D). In all cases, the Pestoides F strain was utilized as an outgroup. Panels A and B: Reconstruction of relationships from MLVA data. (A) Neighbor Joining Analysis of MLVA data. Numbers above branches represent NJ analysis bootstrap proportions, greater than 50%, based on 1000 replications; numbers below branches represent MP analysis bootstrap support. (B) Maximum Parsimony analysis. Numbers above branches represent bootstrap proportions, greater than 50%, based on 1000 replications; numbers below branches represent Majority Rule consensus values. Panels C and D: Phylogenetic analysis using newly identified SNPs/Indels using the (C) Maximum Likelihood method. The tree with the highest log likelihood (−411.0996) is shown. The percentage of trees in which the associated taxa clustered together is shown next to the branches. (D) Phylogenetic analysis using the Maximum Parsimony method. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (1000 replicates) are shown next to the branches. Additional details for the methods employed in phylogenetic reconstructions can be found in Materials and Methods.
Figure 4SNP Discovery in New North American Y. pestis Genome Sequences.
The number of strain-specific newly identified SNPs relative to CO92 is plotted for each isolate. Previously sequenced strains are indicated by shaded bars with the location of origin and the number of strains showing identical genotypes in parentheses. * Number of newly identified SNPs described in references [10], [11].
Figure 5Identification of New IS Element Insertion Points in 2009 Strains.
A) Locations of new IS element insertions. IS element insertions were identified in templated assembly experiments using CO92 as a reference. New insertion points were identified using Newbler's 454HCStructVars.txt file and the identity of the newly inserted element was determined by BLAST analysis of sequence reads containing novel junctions. Shaded square indicates the presence of an IS element beginning at the indicated nucleotide position. B) Phylogenetic analysis of 2009 strains using Maximum Likelihood method. Each insertion was treated as a single character. C) Phylogenetic analysis using Maximum Parsimony method.
IS Element Variation Between Strains.
| Location (CO92) | IS Element | Gene | Effect on Gene | Truncation Position (Amino Acids) |
| 184993 | IS100 | YPO0166 | Truncation | 207/438 |
| 1133115 | IS1541 | Intergenic | ||
| 1442414 | IS100 | Intergenic | ||
| 1534254 | IS1541 | Intergenic | ||
| 2190946 | IS100 | YPO1934 | Truncation | 31/320 |
| 2372658 | IS1541 | Intergenic, near YPO2100 putative phage regulatory protein | ||
| 2546052 | IS100 |
| Truncation | 275/466 |
| 2726564 | IS100 |
| Truncation | 64/336 |
| 3304214 | IS1541 | Intergenic | ||
| 3346949 | IS100 | Putative 2-component system sensor kinase YPO2997 | Truncation | 411/450 |
| 3757798 | IS100 |
| Truncation | |
| 4302749 | IS100 |
| Truncation | 211/293 |
| 4612617 | IS100 | Sugar phosphatase YPO4093 | Truncation | 237/270 |
| pMT172878 | IS1541 | YpMT1.72c/Hypothetical protein | Truncation | 88/91 |
| pCD1 14991 | IS285 | YPCD1.23/Orf60 pCD1 hypothetical protein | Truncation | 103/139 |
| 190049 | IS100 |
| Intact | |
| 300409 | IS1541 | Intergenic | ||
| 1992559 | IS100 | YPO1752 | Intact |
*pldA/IS100 chimeric reads are present in a subset of 25% of reads that map to this position (See Figure S2). The remainder of the reads corresponded to an intact pldA gene.