| Literature DB >> 15642093 |
Michael E Zwick1, Farrell Mcafee, David J Cutler, Timothy D Read, Jacques Ravel, Gregory R Bowman, Darrell R Galloway, Alfred Mateczun.
Abstract
We used custom-designed resequencing arrays to generate 3.1 Mb of genomic sequence from a panel of 56 Bacillus anthracis strains. Sequence quality was shown to be very high by replication (discrepancy rate of 7.4 x 10(-7)) and by comparison to independently generated shotgun sequence (discrepancy rate < 2.5 x 10(-6)). Population genomics studies of microbial pathogens using rapid resequencing technologies such as resequencing arrays are critical for recognizing newly emerging or genetically engineered strains.Entities:
Mesh:
Year: 2004 PMID: 15642093 PMCID: PMC549062 DOI: 10.1186/gb-2004-6-1-r10
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1ABACUS quality scores for base calls in B. anthracis. A quality score measures the difference, in log10 units, between the likelihood support level for the best base-call model minus that for the second-best model [32]. Of the bases, 92.6% possess quality scores that exceeded the threshold (31) used for this study.
Assessing microarray resequencing data quality
| Total number of bases called in replicate 1 | 1,383,229 |
| Total number of bases called in replicate 2 | 1,373,905 |
| Total number of bases called in both replicates | 1,349,177 |
| Total number of bases called differently | 1 |
| Replication experiment discrepancy rate | 7.4E-07 |
| Total number of bases called identically | 398,452 |
| Total number of bases called differently | 15 |
| Accuracy experiment discrepancy rate | 3.8E-05 |
Observed genetic variation in B. anthracis
| Observed number of SNPs | Total amount resequenced (bp) | Nucleotide diversity (× 104) ± 2 SEs | Tajima's D | |
| 37 | 1,544,913 | 2.9 ± 1.3 | -0.93 | |
| Chromosome | 18 | 874,564 | 2.5 ± 1.4 | -0.95 |
| pXO1 | 9 | 325,397 | 3.3 ± 2.4 | -0.54 |
| pXO2 | 10 | 344,952 | 3.5 ± 2.5 | -0.73 |
| Silent | 15 | 243,481 | 7.5 ± 4.3 | -0.55 |
| Replacement | 9 | 898,837 | 1.2 ± 0.80 | -0.64 |
| Intergenic | 13 | 402,595 | 3.8 ± 2.3 | -1.09 |
Figure 2B. anthracis SNP frequency spectrum. An excess of rare SNPs are observed in our sample. Ninety-two percent of the SNPs that we discovered have a minor allele frequency less than or equal to 0.25. This finding (92%) is significantly different from the neutral theory expectation (60%). This excess can arise as a consequence of rapid, population expansion from a small founder population and/or the action of natural selection.
Observed patterns of polymorphism/divergence between B. anthracis (Ames) and B. cereus (ATCC 14579, ATCC 10987)
| Silent sites | Replacement sites | |
| Polymorphic sites within | 660 | 136 |
| Divergent sites between | 646 | 125 |
| Polymorphic sites within | 11 | 3 |
Figure 3Radial tree showing inferred phylogenetic relationships of B. anthracis strains from this study. The 37 variable positions identified in this study were concatenated together to create artificial sequence types. Groups of strains with identical sequence types were A0488 and ASC006; A0039, ASC025, ASC031, ASC070, ASC074 and ASC394; ASC074 and ASC054; A0328, ASC061 and ASC073; A0034, ASC159, ASC165 and ASC398. A DNA distance matrix was created using DNADIST, plotted as a UPGMA tree using NEIGHBOR and the tree plotted using DRAWGRAM [56]. The B1 strain A0465 was used as an outgroup.