| Literature DB >> 27677796 |
Jason W Sahl1, Talima Pearson1, Richard Okinaka1, James M Schupp2, John D Gillece2, Hannah Heaton2, Dawn Birdsell1, Crystal Hepp3, Viacheslav Fofanov3, Ramón Noseda4, Antonio Fasanella5, Alex Hoffmaster6, David M Wagner1, Paul Keim7.
Abstract
Anthrax is a zoonotic disease that occurs naturally in wild and domestic animals but has been used by both state-sponsored programs and terrorists as a biological weapon. A Soviet industrial production facility in Sverdlovsk, USSR, proved deficient in 1979 when a plume of spores was accidentally released and resulted in one of the largest known human anthrax outbreaks. In order to understand this outbreak and others, we generated a Bacillus anthracis population genetic database based upon whole-genome analysis to identify all single-nucleotide polymorphisms (SNPs) across a reference genome. Phylogenetic analysis has defined three major clades (A, B, and C), B and C being relatively rare compared to A. The A clade has numerous subclades, including a major polytomy named the trans-Eurasian (TEA) group. The TEA radiation is a dominant evolutionary feature of B. anthracis, with many contemporary populations having resulted from a large spatial dispersal of spores from a single source. Two autopsy specimens from the Sverdlovsk outbreak were deep sequenced to produce draft B. anthracis genomes. This allowed the phylogenetic placement of the Sverdlovsk strain into a clade with two Asian live vaccine strains, including the Russian Tsiankovskii strain. The genome was examined for evidence of drug resistance manipulation or other genetic engineering, but none was found. The Soviet Sverdlovsk strain genome is consistent with a wild-type strain from Russia that had no evidence of genetic manipulation during its industrial production. This work provides insights into the world's largest biological weapons program and provides an extensive B. anthracis phylogenetic reference. IMPORTANCE: The 1979 Russian anthrax outbreak resulted from an industrial accident at the Soviet anthrax spore production facility in the city of Sverdlovsk. Deep genomic sequencing of two autopsy specimens generated a draft genome and phylogenetic placement of the Soviet Sverdlovsk anthrax strain. While it is known that Soviet scientists had genetically manipulated Bacillus anthracis with the potential to evade vaccine prophylaxis and antibiotic therapeutics, there was no genomic evidence of this from the Sverdlovsk production strain genome. The whole-genome SNP genotype of the Sverdlovsk strain was used to precisely identify it and its close relatives in the context of an extensive global B. anthracis strain collection. This genomic identity can now be used for forensic tracking of this weapons material on a global scale and for future anthrax investigations.Entities:
Year: 2016 PMID: 27677796 PMCID: PMC5050339 DOI: 10.1128/mBio.01501-16
Source DB: PubMed Journal: MBio Impact factor: 7.867
FIG 1 Phylogenetic structure of B. anthracis. Orthologous SNPs (11,989 total; 5,663 parsimony informative) from whole-genome sequences were analyzed by maximum parsimony to generate a phylogenetic tree. The major clades are collapsed in this figure, but the complete tree is available in the supplemental material (see Fig. S1). The overall consistency index is 0.98.
Alignment status for Sverdlovsk B. anthracis genomes
| Library | Type | Length (bp) | No. of reads (pairs) | Coverage (×) | Chromosome breadth at 1× (%) | ||||
|---|---|---|---|---|---|---|---|---|---|
| Total | Trimmed | Mapped | Chromosome | pXO1 | pXO2 | ||||
| Svd-1 | HiSeq | 93 | 6.4E07 | 4.5E07 | 5.4E05 | 8 | 32 | 55 | 96 |
| Svd-1 | MiSeq | 300 | 1.0E07 | 3.4E06 | 1.5E05 | 1 | 3 | 27 | 30 |
| Svd-2 | HiSeq | 93 | 7.5E07 | 6.6E07 | 1.0E06 | 15 | 85 | 104 | 99 |
| Svd-2 | MiSeq | 300 | 1.2E07 | 3.6E06 | 1.7E05 | 1 | 6 | 28 | 36 |
| Combined data | 1.6E08 | 1.2E08 | 1.9E06 | 25 | 126 | 214 | 99 | ||
Read mixtures at Sverdlovsk genome-specific SNPs
| SNP site | SNP | Read depth (×) | % of consensus | Genomic element |
|---|---|---|---|---|
| 28576 | G to A | 78 | 97.2 | pX02 |
| 59151 | C to T | 49 | 100 | Chromosome |
| 112138 | A to G | 36 | 100 | Chromosome |
| 602999 | C to A | 34 | 100 | Chromosome |
| 1359179 | C to T | 27 | 100 | Chromosome |
| 2138718 | A to T | 20 | 100 | Chromosome |
| 2979549 | A to G | 27 | 100 | Chromosome |
| 3593664 | T to C | 31 | 89.9 | Chromosome |
| 4034596 | G to A | 17 | 93.7 | Chromosome |
| 4236707 | T to C | 30 | 100 | Chromosome |
| 4504222 | C to T | 22 | 100 | Chromosome |
| 4896833 | T to C | 17 | 100 | Chromosome |
| 5186004 | C to A | 46 | 100 | Chromosome |
FIG 2 Sverdlovsk contigs aligned to the Ames ancestor genome. The reads from both autopsy specimens were combined for de novo assembly, which resulted in 128 contigs. These are aligned against the Ames ancestor chromosome and two plasmids, and the synteny was visualized with mummerplot. Greater than 99% of the Ames ancestor genome is represented by these 128 contigs.
FIG 3 The Tsiankovskii clade. A phylogenetic tree of the closest relatives to the Sverdlovsk genomes is shown. One SNP was discovered between Svd-1 and Svd-2. The Sverd_combined genotype is identical to Svd-2.