| Literature DB >> 21477306 |
Andrzej K Brodzik1, Joe Francoeur.
Abstract
BACKGROUND: Bacillus anthracis is one of the most monomorphic pathogens known. Identification of polymorphisms in its genome is essential for taxonomic classification, for determination of recent evolutionary changes, and for evaluation of pathogenic potency.Entities:
Year: 2011 PMID: 21477306 PMCID: PMC3094368 DOI: 10.1186/1756-0500-4-114
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Abundance and taxonomy of SNPs in Ames Ancestor, Ames and Sterne genomes reported in [13] and computed using the DS approach.
| sequence | Read | DS | coding | ns |
|---|---|---|---|---|
| - | 131 | 90 | 62 | |
| 2 | 19 | 11 | 10 | |
| - | 150 | 101 | 78 | |
| - | 150 | 101 | 78 | |
| 15* | 14 | 7 | 6 | |
| 21 | 21 | 16 | 9 |
Hyphens denote that results for a relevant strain comparison were not published. Asterisk denotes that adjacent SNPs, not considered here, were reported (see the discussion of SNPs in Section 3).
Distribution of SNPs in Ames Ancestor, Ames, and Sterne genomes.
| sequence | strain homology | SNP spacing (average) | SNP spacing (adjusted for indels) |
|---|---|---|---|
| 99.96% | 40.3 | 40.3 | |
| 100.00% | 277.8 | 277.8 | |
| 99.94% | 34.5 | 34.5 | |
| 72.38% | 13.0 | 9.4 | |
| 98.49% | 4.5 | 4.4 |
The average SNP spacing, given in Kbp, is computed by dividing the sequence length by the number of SNPs. Non-indel SNP spacing is computed similarly, except that the lengths of all indels and polymorphic regions (SNP clusters, i.e. regions where average SNP spacing is greater than one in every twenty bases) are subtracted from the total sequence length.
Figure 1Distribution of SNPs in chromosomal sequences of the . Small blue dots mark AA-S SNPs, large red dots mark AA-A SNPs.
Figure 2Distribution of nsSNPs in chromosomal sequences of the . Small blue dots mark AA-S SNPs, large red dots mark AA-A SNPs.
Figure 3Histogram of distances between subsequent SNPs in the . The minimum, average and maximum distance between subsequent SNPs is 2, 34499 and 163349 bp, respectively, however many SNPs are less than 2000 bp apart.
DNA sequence fingerprinting scheme choices for three strains of the B. anthracis chromosomal sequence ordered in terms of increasing sequence resolution.
| marker | # of markers | detectable strains | data quality |
|---|---|---|---|
| 2 | known | perfect | |
| 150+15 | some unknown | moderate | |
| ~10,000 | many unknown | poor | |
| ~5,300,000 | arbitrary | arbitrary |