| Literature DB >> 24039948 |
Nicholas A Be1, James B Thissen, Shea N Gardner, Kevin S McLoughlin, Viacheslav Y Fofanov, Heather Koshinsky, Sally R Ellingson, Thomas S Brettin, Paul J Jackson, Crystal J Jaing.
Abstract
Bacillus anthracis is the potentially lethal etiologic agent of anthrax disease, and is a significant concern in the realm of biodefense. One of the cornerstones of an effective biodefense strategy is the ability to detect infectious agents with a high degree of sensitivity and specificity in the context of a complex sample background. The nature of the B. anthracis genome, however, renders specific detection difficult, due to close homology with B. cereus and B. thuringiensis. We therefore elected to determine the efficacy of next-generation sequencing analysis and microarrays for detection of B. anthracis in an environmental background. We applied next-generation sequencing to titrated genome copy numbers of B. anthracis in the presence of background nucleic acid extracted from aerosol and soil samples. We found next-generation sequencing to be capable of detecting as few as 10 genomic equivalents of B. anthracis DNA per nanogram of background nucleic acid. Detection was accomplished by mapping reads to either a defined subset of reference genomes or to the full GenBank database. Moreover, sequence data obtained from B. anthracis could be reliably distinguished from sequence data mapping to either B. cereus or B. thuringiensis. We also demonstrated the efficacy of a microbial census microarray in detecting B. anthracis in the same samples, representing a cost-effective and high-throughput approach, complementary to next-generation sequencing. Our results, in combination with the capacity of sequencing for providing insights into the genomic characteristics of complex and novel organisms, suggest that these platforms should be considered important components of a biosurveillance strategy.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24039948 PMCID: PMC3767809 DOI: 10.1371/journal.pone.0073455
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genome equivalents of DNA added to environmental background DNA for detection assessment.
|
| ||||||
|---|---|---|---|---|---|---|
|
| 100,000 | 10,000 | 1,000 | 100 | 10 | 1 |
|
| 560 pg | 56 pg | 5.6 pg | 560 fg | 56 fg | 5.6 fg |
|
| 100 pg | 100 pg | 100 pg | 100 pg | 100 pg | 100 pg |
|
| 84.85% | 35.90% | 5.30% | 0.56% | 0.060% | 0.006% |
|
| ||||||
|
| 100,000 | 10,000 | 1,000 | 100 | 10 | 1 |
|
| 560 pg | 56 pg | 5.6 pg | 560 fg | 56 fg | 5.6 fg |
|
| 1 ng | 1 ng | 1 ng | 1 ng | 1 ng | 1 ng |
|
| 35.90% | 5.30% | 0.56% | 0.060% | 0.006% | 0.001% |
The number of genome equivalents was calculated as 5.6 femtograms based on a chromosome and the two plasmids pXO1 and pXO2 as one genome equivalent. Calculations were based on the published genome and plasmid sizes in base pairs using accession numbers NC_007530 (chromosome), NC_007322 (pXO1), and NC_007323 (pXO2).
Bacterial reference genomes used for mapping of Illumina and 454 sequencing reads.
| Target Reference Genomes | Accession No. |
|---|---|
|
| NC_007530 |
|
| NC_007322 |
|
| NC_007323 |
|
| NC_005945 |
|
| NC_007322 |
|
| NC_007323 |
|
| NC_008600 |
|
| NC_008598 |
|
| NC_014335 |
|
| NC_014331 |
|
| NC_014332 |
|
| NC_014333 |
|
|
|
|
| NC_006350 |
|
| NC_002655 |
|
| NC_006570 |
|
| NC_002516 |
|
| NC_005296 |
|
| NC_003047 |
|
| NC_002745 |
|
| NC_003888 |
|
| NC_003143 |
|
| NC_000964 |
|
| NC_009698 |
* Used as reference for 454 sequencing reads only; ** Used as reference for Illumina sequencing reads only.
target reference genomes were used for identification of reads corresponding to in each sample. and target reference genomes were used for assessment of specificity provided by the read mapping strategy.
Figure 1Mapping of sequencing reads obtained from -spiked environmental samples to specified reference genomes.
Ames genomic DNA was combined with background nucleic acid extracted from either aerosol or soil-based material. Increasing genome copy numbers were spiked into samples at 10-fold concentration intervals. Samples were then subjected to whole genome amplification and next-generation sequencing. The resultant reads were mapped to either a target set ( and plasmids) or a background set of DNA sequences, intended to assess non-specific alignment of -derived sequence reads to other genomes. The numbers of reads mapped were normalized to total reads obtained for each sample to standardize results. Shown are the percentage of reads mapped for A. Illumina reads from an aerosol background spiked with genomic DNA, B. Illumina reads from a soil background spiked with DNA, C. 454 reads from an aerosol background spiked with DNA, and D. 454 reads from a soil background spiked with DNA.
Figure 2Comparison of detection sensitivity via Illumina and 454 sequencing.
Sequence reads obtained from environmentally derived DNA spiked with DNA were mapped to a target sequence set () or a background set of sequences. Ratios shown were calculated by dividing the number of reads mapped to target reference genomes by the number of reads mapping to background reference genomes.
Figure 3Mapping of Illumina reads to closely related species.
Following sequencing of -spiked environmental samples, mapping specificity was examined by determining the percent of total Illumina reads mapping to the closely related species Al Hakam and biovar CI. Illumina reads were obtained from A. aerosol background DNA and B. soil background DNA samples spiked with increasing amounts of DNA.
Figure 4Alignment of uniquely mapped Illumina reads to genomes from and two closely-related species.
Due to the high degree of sequence similarity among the three examined species, a unique mapping approach was used. DNA sequencing reads mapping to only or one of the two near neighbor species were identified; reads mapping to multiple reference genomes were ignored. This approach facilitated distinction among the three closely related species. The number of uniquely mapped reads for is given compared to A. in aerosol samples, B. in soil samples, C. in aerosol samples, and D. in soil samples. This analysis was performed separately for each species – unique reads between and were identified, followed by identification of unique reads between and – thus the results of each comparison are shown in separate charts.
Figure 5Alignment of unique 454 reads to and near-neighbor species.
454 reads mapping only to or a close relative, discounting reads mapping to multiple reference genomes, were identified. The number of sequencing reads mapping uniquely to Ames DNA are shown compared to A. in aerosol samples, B. in soil samples, C. in aerosol samples, and D. in soil samples. As in Figure 4, this analysis was performed separately for each set of species.
Detection of DNA in environmental background by census microarray.
|
| ||||||
|---|---|---|---|---|---|---|
|
| 100,000 | 10,000 | 1,000 | 100 | 10 | 1 |
|
|
|
|
|
|
| N/D |
|
| ||||||
|
| 100,000 | 10,000 | 1,000 | 100 | 10 | 1 |
|
|
|
|
|
| N/D | N/D |
Detection of by only one array replicate. The second replicate yielded detection of sp. in aerosol background and in soil background. N/D:B. anthracis not detected. Detection of DNA was examined in the context of 100 pg aerosol DNA or 1 ng soil DNA.
Summary of detection in environmental samples via genomic technologies.
| Assay | Minimum genome copies required for detection per ng environmental DNA | Assay properties | ||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
|
| 100 | 100 | ++++ | ++++ | ++ | +++ |
|
| 100 | 10 | ++++ | ++++ | ++ | +++ |
|
| 1 | 1 | ++++ | ++++ | ++ | +++ |
|
| 1000 | 1000 | ++ | + | +++ | ++ |
For microarray, detection occurs at 100 copies when using the genus as a threshold, instead of the species.