Literature DB >> 35575735

Rapid Identification of Bacillus anthracis In Silico and On-Site Using Novel Single-Nucleotide Polymorphisms.

Yufei Lyu1, Dongshu Wang1, Lu Yuan1,2, Erling Feng1, Li Zhu1, Chao Pan1, Yan Guo1, Xiankai Liu1, Hengliang Wang1,2.   

Abstract

Bacillus anthracis is a spore-forming bacterium that causes life-threatening infections in animals and humans and has been used as a bioterror agent. Rapid and reliable detection and identification of B. anthracis are of primary interest for both medical and biological threat-surveillance purposes. Few chromosomal sequences provide enough polymorphisms to clearly distinguish B. anthracis from closely related species. We analyzed 18 loci of the chromosome of B. anthracis and discovered eight novel single-nucleotide polymorphism (SNP) sites that can be used for the specific identification of B. anthracis. Using these SNP sites, we developed software-named AGILE V1.1 (anthracis genome-based identification with high-fidelity E-probe)-for easy, user-friendly identification of B. anthracis from whole-genome sequences. We also developed a recombinase polymerase amplification-Cas12a-based method that uses nucleic acid extracts for the specific, rapid, in-the-field identification of B. anthracis based on these SNPs. Via this method and B. anthracis-specific CRISPR RNAs for the target CR5_2, CR5_1, and Ba813 SNPs, we clearly detected 5 aM genomic DNA. This study provides two simple and reliable methods suitable for use in local hospitals and public health programs for the detection of B. anthracis. IMPORTANCE Bacillus anthracis is the etiologic agent of anthrax, a fatal disease and a potential biothreat. A specific, accurate, and rapid method is urgently required for the identification of B. anthracis. We demonstrate the potential of using eight novel SNPs for the rapid and accurate detection of B. anthracis via in silico and laboratory-based testing methods. Our findings have important implications for public health responses to disease outbreaks and bioterrorism threats.

Entities:  

Keywords:  Bacillus anthracis; Bacillus cereus group; Cas12a; SNP; detection

Mesh:

Substances:

Year:  2022        PMID: 35575735      PMCID: PMC9241702          DOI: 10.1128/spectrum.02285-21

Source DB:  PubMed          Journal:  Microbiol Spectr        ISSN: 2165-0497


INTRODUCTION

The soil bacteria Bacillus cereus, Bacillus thuringiensis, and Bacillus anthracis, members of the B. cereus group of strains, have similar spore formation and genetic characteristics to the extent that they have been considered members of the same species (1). B. cereus is known mainly as a food poisoning bacterium that causes diarrhea and vomiting but may also cause more severe infections, and B. thuringiensis is an insect pathogen. B. anthracis is the etiological agent of anthrax, a disease with high lethality in many animal species, including humans. B. anthracis was considered for use as a potential biowarfare agent during the age of the Cold War and, in later years, as a bioterror agent. Therefore, it is important that we are able to unambiguously distinguish B. anthracis isolates from other members of the B. cereus group at the genetic level. Wild-type B. anthracis normally harbors two virulence plasmids, pXO1 and pXO2. Some genes on the plasmids have been used as targets to detect B. anthracis in PCR assays (2, 3). However, data gathered in the past decade have shown that these plasmids may be lost from some B. anthracis strains, and some B. cereus strains can acquire pXO1/pXO2-like plasmids (4, 5). This indicates that we cannot reliably identify B. anthracis using only the plasmid sequences. The mobile nature of the plasmids in B. cereus group strains emphasizes the importance of applying chromosomal markers to differentiate B. anthracis from the rest of the B. cereus group. Compared with targets on the two virulent plasmids, targets on chromosomes are more stable and more appropriate for identifying B. anthracis. Previously, a few targets on the chromosome of B. anthracis have been considered PCR markers. Target genes have included Ba813 (6), gyrA (7), gyrB (8), rpoB (9), SG850 (10), ptsI (11), purA (12), and plcR (13). The multilocus sequencing typing (MLST) loci (glpF, gmk, ilvD, pta, pur, pyc, and tpi) used for molecular typing of the B. cereus group also possess the potential to be identifying markers for B. anthracis (14). The CRISPR system is a widespread immune system in bacteria that holds a record of bacterial evolution (15, 16), and CRISPR loci also have the potential to distinguish B. anthracis. We found three CRISPR sites (CRISPR 2, 3, and 5) in B. anthracis strain Ames (GI: 30260195) using the CRISPR finder program (https://crispr.i2bc.paris-saclay.fr/Server/). The recent availability of numerous whole-genome sequences of B. cereus group strains provided a basis for the reassessment of these loci with regard to whether they can be used as markers to identify B. anthracis. It is worth analyzing these loci and exploring the use of single-nucleotide polymorphisms (SNPs) specific to B. anthracis because accurate and stable SNPs can be used to develop detection methods for the strain. B. anthracis represents a significant public health and veterinary threat and can be used in bioterror attacks, and therefore, rapid in-the-field detection of this pathogen is important. The synthetic biologist Collins first introduced the use of CRISPR/Cas technology for nucleic acid molecular diagnosis (17), and CRISPR/Cas shows excellent sequence recognition capacity. DNA endonuclease-targeted CRISPR trans-reporter (DETECTR) (18) and one-hour low-cost multipurpose highly efficient system (HOLMES) (19) can be used to detect DNA sequences from human papillomaviruses in patient samples and both the pseudorabies virus and Japanese encephalitis virus with attomolar sensitivity and high specificity via CRISPR-Cas12a, respectively. Recombinase polymerase amplification (RPA) does not require template denaturation; can be run at a low, constant temperature; and has been successfully integrated into different detection platforms (20). Integration of the CRISPR/Cas system with RPA has huge application prospects in the field of on-site testing that does not require precision instruments. We aimed to develop reliable methods for the identification of B. anthracis in our study. First, computer-based comparative analysis of target loci was conducted aiming to identify SNP markers specific for B. anthracis. We uncovered eight novel SNPs from these loci and developed publicly available software for the rapid identification of B. anthracis based on these SNPs. Then, we used these SNPs to develop a practical and rapid on-site B. anthracis detection method that employs the Cas12a detection system combined with RPA.

RESULTS

Initial screening of B. anthracis genome database.

As of 10 August 2020, there were 1,992 genomes of common B. cereus sensu lato group members in the NCBI database, including 252 whole-genome sequences assigned as B. anthracis, 1,118 as B. cereus, and 622 as B. thuringiensis (see Table S6 in the supplemental material), and they were all adopted as the genomic databases for screening for specific SNPs. Among them, the genomic sequences of the other 251 B. anthracis strains were compared with the sequence of the reference strain B. anthracis Ames Ancestor (accession no. NC_007530) (Fig. 1). These strains have different genome sequencing completion statuses; some have been completely assembled, some still consist of many contigs, and some even have >1,500 contigs. To facilitate identification, the strains were renumbered (Table S7), and the 16S rRNA gene sequence, average nucleotide identity (ANI) and digital DNA-DNA hybridization (dDDH) values, and a “T” as base 640 of the plcR open reading frame (ORF) (13) were used in the preliminary screening of the B. anthracis strains. The results showed that all 16S rRNA gene similarities between the 251 strains in the B. anthracis database and B. anthracis Ames Ancestor were >99%. Four strains (BA168_AFS095574, BA169_AFS081271, BA171_AFS057383, and BA172_AFS029941) were removed from our database because the ANI and dDDH values were below 96% and 70%, respectively (Fig. 2A). In 12 strains, base 640 in the plcR ORF was not “T,” and three of the strains had a base deletion at this position. The genomes of 11 of the 12 strains possessed neither pXO1 nor pXO2 plasmids, nor any virulence factors of B. anthracis (including pagA, lef, cya, and capBCAD), while one strain (BA145_N1ZF-2; identifier [ID], GCA_001883885.1) carried capBCAD genes on a plasmid. Thus, we temporarily eliminated these 12 doubtful B. anthracis strains from the strain set, leaving 240 B. anthracis strains (Fig. 1 and 2A).
FIG 1

Development of Bacillus anthracis identification methods in silico and for field testing. Ba, B. anthracis; Bc, B. cereus; Bt, B. thuringiensis.

FIG 2

The 12 strains assigned as “B. anthracis” in the NCBI genome database were misidentified using discovered novel single-nucleotide polymorphisms (SNPs) specific to B. anthracis and software developed using these SNPs. (A) After 16S rRNA gene sequence analysis, calculation of average nucleotide identity and digital DNA-DNA hybridization values, and detection of the base at position 640 in the open reading frame of the plcR gene, 12 doubtful B. anthracis strains were eliminated from the strain set. –, no base could be detected at this site. (B) Display of anthracis genome-based identification with high-fidelity E-probe (AGILE) V1.1 software analysis for identification of B. anthracis. The 12 doubtful B. anthracis strains, BC307, and NC7401 were revealed to be B. cereus by AGILE V1.1 scanning. The corresponding table of the assembly identifiers (IDs) and naming information for B. cereus, B. thuringiensis, and B. anthracis in this study is shown as Table S7 in the supplemental material. (C and D) The 12 doubtful B. anthracis strains were proven to be B. cereus via multilocus sequencing typing (C) and whole-genome SNP (D) analysis.

Development of Bacillus anthracis identification methods in silico and for field testing. Ba, B. anthracis; Bc, B. cereus; Bt, B. thuringiensis. The 12 strains assigned as “B. anthracis” in the NCBI genome database were misidentified using discovered novel single-nucleotide polymorphisms (SNPs) specific to B. anthracis and software developed using these SNPs. (A) After 16S rRNA gene sequence analysis, calculation of average nucleotide identity and digital DNA-DNA hybridization values, and detection of the base at position 640 in the open reading frame of the plcR gene, 12 doubtful B. anthracis strains were eliminated from the strain set. –, no base could be detected at this site. (B) Display of anthracis genome-based identification with high-fidelity E-probe (AGILE) V1.1 software analysis for identification of B. anthracis. The 12 doubtful B. anthracis strains, BC307, and NC7401 were revealed to be B. cereus by AGILE V1.1 scanning. The corresponding table of the assembly identifiers (IDs) and naming information for B. cereus, B. thuringiensis, and B. anthracis in this study is shown as Table S7 in the supplemental material. (C and D) The 12 doubtful B. anthracis strains were proven to be B. cereus via multilocus sequencing typing (C) and whole-genome SNP (D) analysis.

Evaluation and validation of B. anthracis-specific SNPs.

To find SNPs that can be used to identify B. anthracis, we evaluated 18 loci on the chromosome, including seven MLST loci, three CRISPR loci, and eight loci previously suggested in the literature (Fig. 1; Table S1). After the 12 doubtful B. anthracis strains were eliminated, the nucleotide sequences of these loci in the 240 putative B. anthracis strains (n = 1,980) were analyzed by local BLAST searches using blast-2.7.1+. After alignment, the four loci (gmk, gyrB, pta, and pur) were found to be completely homologous among some B. anthracis, B. cereus, and B. thuringiensis strains and were eliminated from the locus set (n = 14, Fig. S1). Further analysis found six loci (crispR2, crispR3, glpF, gyrA, ilvD, and ptsI) that had no specific SNP in B. anthracis that could be distinguished from those of the B. cereus-B. thuringiensis strain and thus were also eliminated (n = 8, Fig. S2). Specific SNP sites that can be used to distinguish B. anthracis from B. cereus-B. thuringiensis strains were identified in eight loci (one in each of Ba813, plcR, purA, pyc, rpoB, SG850, and tpiA and two in CRISPR5) (Table 1; Fig. S3). Because the SNP in plcR was the same as that previously reported (13), it was used only to prove the accuracy of the eight novel SNP sites (n = 7) identified in this study. Nine specific probes based on the nine SNPs, namely, eProbes_Ba for B. anthracis and eProbes_Bct for B. cereus and B. thuringiensis (see Materials and Methods), were constructed for the convenience of description and utilization (Table 1 and Table S2).
TABLE 1

Information on SNPs and eProbes used in this study

LocusBase changeSNP positioneProbe starteProbe endNucleotide sequence of eProbe
Ba813A-G4564265(−)45642404564289CCATTGCTAATGTATGCGAATTTCAATTTGCCAAATGACAATTTAGGTTT (SEQ ID no. 1)
crispR5_1 (CR5_1)C-T5015039(−)50150145015063AGGTGCTGCAAAGGCAACGAATGCCTCTGGTAAAAAAGCTGATGACAAAT (SEQ ID no. 2)
crispR5_2 (CR5_2)C-A5014970(−)50149455014994AACGAGCACGACACCAGAAAGTAACACAGAAGATAAGTCTCAAAAAGAAG (SEQ ID no. 3)
pyc T-C3810008(−)38100033810052AATGCCAGGCGGACAGTACAGTAATTTACAACAACAAGCGAAAGTGGTTG (SEQ ID no. 4)
rpoB T-C109410(+)109399109448GTGGTAGAAGGTGATGTTGAGCTGCAATCTATTAAGATTTATGCTCCTGA (SEQ ID no. 5)
SG850A-C1491472(+)14914581491507AGCGGGTGTAGGAAACGGGAAAACAATTGTATATCTTCTATATGCAATTT (SEQ ID no. 6)
tpiA G-T/A4861968(−)48619434861992TCCAGCTCTATTCTTAGAGCGCCTGGTAGCAGCGACTGAAGGAACTGATT (SEQ ID no. 7)
purA G-A5207983(−)52079585208007ATATGTATGTGATACGTCTGTTGTGTTAAATGATGCATTAGATAACAATC (SEQ ID no. 8)
plcR T-G/C5081303(−)50812785081327TTATACTTGGACAATCAATACGAATAAGCGCTTTGTCATGCAAATAAAGC (SEQ ID no. 9)

Bold and underlined letters indicate SNP sites. SNP position is based on the chromosome of B. anthracis Ames Ancestor strain (NC_007530). SEQ ID, sequence identifier.

Information on SNPs and eProbes used in this study Bold and underlined letters indicate SNP sites. SNP position is based on the chromosome of B. anthracis Ames Ancestor strain (NC_007530). SEQ ID, sequence identifier. We retrospectively analyzed the genomes of 1,118 B. cereus and 622 B. thuringiensis strains to determine the specificity and effectiveness of the two electronic probe (eProbe) sets (Fig. 1); none of the SNPs included in eProbes_Ba were detected in silico in these B. cereus and B. thuringiensis strains. We next analyzed the genomes of 4,707 Bacillus strains from 87 species in local and NCBI genome databases, not including B. anthracis, as a test set (Table S6) to assess the applicability of the two eProbe sets. None of the nine SNPs in eProbes_Ba were detected in these strains (data not shown). The above results suggested that the nine SNPs in eProbes_Ba can be used to distinguish B. anthracis from its genetic near-neighbors B. cereus and B. thuringiensis.

Development of identification software based on eProbes.

Encouraged by the specificity and effectiveness of the two eProbe sets, we compiled software for the rapid identification of B. anthracis from genome sequence data using eProbes_Ba and eProbes_Bct. The software was named anthracis genome-based identification with high-fidelity E-probe (AGILE V1.1). As long as one of the eProbes_Ba is matched exactly by AGILE, the strain is identified as B. anthracis, and the more matched probes there are, the more reliable the positive result. This software can be downloaded by anyone for free from https://github.com/844844/AGILE for Windows and https://github.com/844844/Identify_B.anthracis for Python users. The identification of strains can be completed with one click by inputting sequence data (a complete sequence assembly) on a Windows-based PC. AGILE is a convenient, user-friendly tool for microbiologists, clinicians, and industry professionals. After successful completion of the software development, the 12 doubtful B. anthracis strains (genome sequences available at https://www.github.com/844844/AGILE) were confirmed to be non-B. anthracis strains using AGILE V1.1 scanning (Fig. S4). This result is the same as that of our preliminary screening with 16S RNA, ANI, and plcR ORF described previously. The genomes of these 12 strains did not harbor any of the SNPs in eProbes_Ba but harbored all of those in eProbes_Bct (Fig. 2B). The SNP sites in Ba813, CR52, and plcR were also not harbored in several strains. The 12 strains also clustered with B. cereus and B. thuringiensis after minimum evolution method phylogenetic analysis. To verify these results, we combined these 12 strains with 10 strains each of B. anthracis, B. cereus, and B. thuringiensis as a validation set for whole-genome phylogenetic analysis. In the resulting unweighted pair group method with arithmetic means (UPGMA) tree, the 10 B. anthracis strains clustered into one branch, and the 12 doubtful strains clustered with the 10 B. cereus strains and 10 B. thuringiensis strains in MLST analysis (seven loci: https://pubmlst.org/bigsdb?db=pubmlst_bcereus_seqdef&page=downloadAlleles&tree=1) (Fig. 2C) and whole-genome SNP (wgSNP) phylogenetic analysis (Fig. 2D). Thus, these 12 strains were not B. anthracis but may be B. cereus or B. thuringiensis. Hence, using AGILE V1.1, we further confirmed that the 12 strains assigned as “B. anthracis” in the NCBI genome database had been misidentified, even though their genome sequences were complete.

Application of B. anthracis rapid identification software.

We investigated the nucleic acid information on a historical collection of 412 B. anthracis strains in previously published literature (21); these strains were used as a test set to further evaluate our software. All the strains were identified as B. anthracis using AGILE V1.1 (Fig. 3A). The 412 strains (the remaining four strains and History-N408) are marked in red in Fig. 3A. All nine SNP sites in our set were identical in 408 of the strains (History-N408). The eProbes of Ba813, purA and pyc, were not detected in the remaining four strains, which may be related to the quality of the sequencing.
FIG 3

Strains with genomic information collected from the literature were used as a test set to evaluate the AGILE software. We used these genomes and representative publicly available genomes of B. cereus sensu lato strains (from the NCBI database) to generate a maximum-likelihood tree. Tree visualizations were performed using iTOL. (A) Historical strains previously assigned as B. anthracis (n = 412) were placed in the same cluster as B. anthracis, consistent with AGILE analysis. “History-N408” means that all nine SNP sites in our set were identical in 408 of the strains. (B) Eleven isolates from the International Space Station were identified as B. cereus and/or B. thuringiensis, not B. anthracis, by AGILE analysis. These 11 strains clustered with B. cereus and B. thuringiensis according to SNP type. “ISS-N10” means that all nine SNP sites in our set were identical in 10 of the strains. The corresponding table of the assembly identifiers and naming information for the representative strains in B. cereus sensu lato in this study is shown as Table S8 in the supplemental material.

Strains with genomic information collected from the literature were used as a test set to evaluate the AGILE software. We used these genomes and representative publicly available genomes of B. cereus sensu lato strains (from the NCBI database) to generate a maximum-likelihood tree. Tree visualizations were performed using iTOL. (A) Historical strains previously assigned as B. anthracis (n = 412) were placed in the same cluster as B. anthracis, consistent with AGILE analysis. “History-N408” means that all nine SNP sites in our set were identical in 408 of the strains. (B) Eleven isolates from the International Space Station were identified as B. cereus and/or B. thuringiensis, not B. anthracis, by AGILE analysis. These 11 strains clustered with B. cereus and B. thuringiensis according to SNP type. “ISS-N10” means that all nine SNP sites in our set were identical in 10 of the strains. The corresponding table of the assembly identifiers and naming information for the representative strains in B. cereus sensu lato in this study is shown as Table S8 in the supplemental material. In addition to B. cereus, B. thuringiensis, and B. anthracis, the B. cereus sensu lato group includes less-common members such as Bacillus mycoides, Bacillus pseudomycoides, Bacillus weihenstephanensis, Bacillus cytotoxicus, and Bacillus toyonensis (22, 23). We used the 412 genomes mentioned above and publicly available genomes of representative strains (Table S8) of B. cereus sensu lato to generate a maximum-likelihood tree. The 412 strains were placed in the same cluster as B. anthracis, which is consistent with the AGILE analysis. Non-B. anthracis strains of B. cereus sensu lato formed a separate cluster in the maximum-likelihood tree, in agreement with the results of AGILE. Furthermore, we collected genomic information on 11 Bacillus strains isolated on the International Space Station (ISS; from the experimental modules of three different countries) (24). Previous research (24) indicated that the 11 ISS isolates could be B. anthracis based on ANI and dDDH values and gyrB sequences, but these strains were confirmed as not being B. anthracis when later phenotypic trait and whole-genome analyses were used. After the genome sequences were entered into the AGILE software, these 11 ISS strains were identified as B. cereus or B. thuringiensis (Fig. 3B). The 11 ISS strains are marked in red in Fig. 3B. All nine SNP sites in our set were identical in 10 of the strains; the eProbe sequence in purA was absent from the genomic information for one strain (ISS-GCA-002151595). The 11 ISS strains clustered together with B. cereus and B. thuringiensis, among others, in a maximum-likelihood tree. Our findings indicate that AGILE is an accurate and powerful method for identifying B. anthracis.

Detection of B. anthracis based on novel SNP sites by RPA combined with Cas12a.

Having confirmed that the nine SNPs in eProbes_Ba efficiently differentiated B. anthracis from B. cereus and B. thuringiensis in silico, we developed a CRISPR/Cas12a-based-detection method with a naked-eye readout under blue light (Fig. 4A). Genomic DNA was extracted from each bacterium and used as the detection substrate. The target gene fragments were amplified within 30 min at 37°C, followed by a CRISPR/Cas12a reaction at 37°C (Fig. 4A). For the specific detection of B. anthracis genomic DNA, CRISPR RNAs (crRNAs) corresponding to the eight novel SNPs on the chromosome of B. anthracis Ames were designed and evaluated in the laboratory (Fig. 4B).
FIG 4

CRISPR/Cas12a-based detection workflow and schematics for B. anthracis detection with a naked-eye readout under blue light. (A) Schematics of the Cas12a-based assay for rapid visual nucleic acid detection. After recombinase polymerase amplification (RPA) for 30 min, the Cas12a enzyme cleavage takes an additional 60 min. The green fluorescent signal of a positive detection of B. anthracis can be observed by the naked eye under blue light with an orange filter. (B) Schematics of the crRNA design for the five novel SNP sites. SNP sites are generally located at bases 1 to 6 downstream of the PAM sequence. The crRNA can target both the coding strand and the noncoding strand.

CRISPR/Cas12a-based detection workflow and schematics for B. anthracis detection with a naked-eye readout under blue light. (A) Schematics of the Cas12a-based assay for rapid visual nucleic acid detection. After recombinase polymerase amplification (RPA) for 30 min, the Cas12a enzyme cleavage takes an additional 60 min. The green fluorescent signal of a positive detection of B. anthracis can be observed by the naked eye under blue light with an orange filter. (B) Schematics of the crRNA design for the five novel SNP sites. SNP sites are generally located at bases 1 to 6 downstream of the PAM sequence. The crRNA can target both the coding strand and the noncoding strand. The genomic DNA concentrations of each bacterium was 106 aM (Table S9). Cas12a-mediated detection of each of the five target SNPs produced a robust signal within 60 min when the detection substrate was the product of B. anthracis vaccine strain A16R(pXO1+, pXO2−) genomic DNA amplification by RPA, which is derived from A16 by exposure to UV radiation (25) (Fig. 5A). Cas12a-mediated detection produced a weak signal within 60 min when the detection substrate was the product of another strain (B. cereus, BC307 and NC7401; B. thuringiensis, HD73; Bacillus subtilis, Bs168) genomic DNA amplifications by RPA. Cas12a-mediated detection allowed B. anthracis to be distinguished from its near-neighbor bacteria by a fluorescent signal. The five target SNPs that can be used to distinguish B. anthracis via RPA are located in the tpiA, SG850, CR5_1/2, and Ba813 loci (Fig. 5A). The sensitivity of the CRISPR/Cas12a assay was also determined based on the resulting fluorescence intensity. To determine the detection threshold, genomic DNA of B. anthracis (A16R) was diluted from 5.14 × 105 aM to 5.14 × 10−3 aM (Fig. 5B; Table S9). The crRNAs for the target SNPs CR5_2, CR5_1, and Ba813 were the most sensitive and can be used to detect 5 aM of genomic DNA (Fig. 5B). Thus, we chose CR5_2, CR5_1, and Ba813-crRNA for a B. anthracis screening assay. The remaining target loci can be used for confirmatory diagnostic assays.
FIG 5

CRISPR/Cas12a-based detection of Bacillus spp. with naked-eye readout. (A) These B. anthracis-specific CRISPR RNAs (crRNAs) corresponding to five SNP sites can distinguish B. anthracis (strain A16R) from B. cereus (strains BC307 and NC7401), B. thuringiensis (strain HD73), and B. subtilis (strain Bs168) within 90 min (one-way repeated-measure analysis of variance). Genomic DNA concentrations are shown in Table S9 and other files in the supplemental material. (B) Sensitivity assay using the five crRNAs to detect the B. anthracis strain A16R genomic DNA over a dilution gradient from 5.14 × 105 to 5.14 × 10−3 aM (one-way analysis of variance). ****, P ≤ 0.0001; ***, P ≤ 0.001; *, P ≤ 0.05; ns, not significant.

CRISPR/Cas12a-based detection of Bacillus spp. with naked-eye readout. (A) These B. anthracis-specific CRISPR RNAs (crRNAs) corresponding to five SNP sites can distinguish B. anthracis (strain A16R) from B. cereus (strains BC307 and NC7401), B. thuringiensis (strain HD73), and B. subtilis (strain Bs168) within 90 min (one-way repeated-measure analysis of variance). Genomic DNA concentrations are shown in Table S9 and other files in the supplemental material. (B) Sensitivity assay using the five crRNAs to detect the B. anthracis strain A16R genomic DNA over a dilution gradient from 5.14 × 105 to 5.14 × 10−3 aM (one-way analysis of variance). ****, P ≤ 0.0001; ***, P ≤ 0.001; *, P ≤ 0.05; ns, not significant. The other three target SNPs (in purA, pyc, and ropB) can be used to identify B. anthracis via PCR amplification but not via RPA amplification (Fig. S5). In the future, we aim to simultaneously detect multiple SNPs, which would avoid misidentifications due to single SNP mutations, thereby improving the accuracy of the B. anthracis identification.

DISCUSSION

In previous reports, clinically similar cases of cutaneous anthrax lesions and inhalation anthrax were described as being caused by Bacillus pumilus (26), B. cereus containing B. anthracis toxin genes (27), and B. cereus (28). The characterization of members of the B. cereus group is crucial for decision-making by clinicians and staff at the Centers for Disease Control and Prevention. As described in the introduction, SNPs adopted previously for the identification of B. anthracis have been invalidated as increasingly more genome sequence data have become available (6, 29). Constructing a UPGMA tree is a good way to distinguish strains, but this requires work by skilled bioinformatics professionals. In this study, we analyzed the genomes of nearly 2,000 B. cereus group strains downloaded from NCBI, identified eight novel SNPs that are characteristic of B. anthracis, and developed software named AGILE that can identify strains based on eProbes (eProbes_Ba and eProbes_Bct). AGILE is a powerful tool for distinguishing B. anthracis from other members of the B. cereus group, which are often not distinguishable by conventional 16S rRNA gene sequencing and BLAST searching. By applying AGILE to validation and test data sets, we discovered 12 non-B. anthracis strains that had been assigned as B. anthracis in the NCBI database. The 12 strains have also been found to not belong to the B. anthracis lineage by other international research institutions (30, 31), verifying our results. With the wider application of whole-genome sequencing in the field of public health, our AGILE tool will become increasingly important: using genome sequence data, AGILE can be used to identify B. anthracis from near-neighbor bacteria in silico based on eProbes in seconds. As Fig. 3 shows, 11 isolates from the ISS were unambiguously identified as B. cereus or B. thuringiensis, not B. anthracis, using AGILE. Venkateswaran and colleagues (24) stated that the gyrB sequence, DDH and ANI values, and dDDH analysis supported that the 11 ISS isolates were similar to B. anthracis but distant from B. cereus and B. thuringiensis. Phenotypic (motility, positive hemolysis, lack of a capsule, and resistance to gamma phage/penicillin) and genomic (lack of pXO1 and pXO2 plasmids) data were collected, and MLST and whole-genome SNP analyses were performed by them. A large number of the experimental results performed by Venkateswaran et al. provided reasons to exclude the ISS isolates from B. anthracis, which is consistent with the results of the AGILE analysis. AGILE uses multiple SNP sites; therefore, strains are not missed during the detection process due to poor-quality genomic sequencing. For example, although the nonsense mutation at position 640 in plcR proved to be truly unique to B. anthracis, this locus is shortened in several strains of B. cereus and the bases cannot be matched at position 640, which is related to the quality of the sequencing data (Fig. 2A and B). The SNP sites identified in this study are not on virulence plasmids and so are relatively stable. The use of chromosomal molecular markers can identify B. anthracis (pXO1− and/or pXO2−) strains, which provides more information for bacterial traceability analysis. Compared with BTyper (32), a command-line tool for classifying B. cereus group isolates via gene detection, AGILE does not involve complicated calculations, processes, or coding operations. It is easy for users to understand the analytical results of AGILE, share data with colleagues, and make definite judgments on the suspected isolates. Using the SNPs on the chromosome, we have also developed an RPA-Cas12a-based method for the rapid, on-site detection of B. anthracis. The CRISPR5_2, CRISPR5_1, and Ba813 crRNAs can be used to detect B. anthracis with as little as 5 aM genomic DNA. The detection sensitivity is as low as that of HOLMES for detecting the two viruses (19). No precision instruments are required, only a blue light is needed, and detection can be completed in 1.5 h at 37°C. AGILE based on eProbes_Ba and eProbes_Bct can be used only to identify B. anthracis and distinguish the B. cereus group from other Bacillus spp., and other species in the B. cereus group cannot be classified in more detail. However, B. anthracis and B. cereus are common pathogens in this group, and their early and unambiguous diagnostic detection is essential for panic elimination, successful treatment, and disease prevention. We have provided software for use after nucleic acid sequencing and RPA-Cas12a on-site detection for improving the rapid identification of B. anthracis. This paves the way for the development of microbiological diagnostic kits or chips for distinguishing B. anthracis from other B. cereus group members. This technology may aid in rapid responses to anthrax terrorist attacks and help ensure public health security.

MATERIALS AND METHODS

Data resources and initial screening of genomes.

Genome sequences of strains were obtained from the GenBank database (https://www.ncbi.nlm.nih.gov/genome/). As shown in Fig. 1, we performed preliminary screening of the downloaded B. cereus group genomic data in local genome database by 16S rRNA gene sequence analysis (33), calculated the average nucleotide identity (ANI) (34, 35) and digital DNA-DNA hybridization (dDDH) (35–37) values, and detected the base at position 640 in the open reading frame (ORF) of the gene plcR (13). The dDDH value was attained and analyzed via https://ggdc.dsmz.de/ggdc.php. The other 251 B. anthracis strains from the NCBI database were compared with the genomic sequence of B. anthracis strain Ames Ancestor (GenBank accession no. NC_007530), which is the reference genome for B. anthracis.

Specific SNP analysis.

As shown in Fig. 1, MLST loci, CRISPR loci, and other loci (rpoB, gyrA, gyrB, Ba813, plcR, ptsI, purA, and SG850) were used to identify species-specific SNPs and thus distinguish among species (see Table S1 in the supplemental material). The sequences of the seven MLST genes can be obtained from the B. cereus MLST website, http://pubmlst.org/bigsdb?db=pubmlst_bcereus_seqdef&page=downloadAlleles&tree=1 (14). We found three CRISPR sites (CRISPR 2, 3, and 5) in B. anthracis strain Ames (GI: 30260195) using the CRISPR finder program (https://crispr.i2bc.paris-saclay.fr/). The sequences of a selected locus in B. anthracis, B. cereus, and B. thuringiensis were compared. Using local BLAST (blast-2.7.1+), the nucleotide sequence of the selected locus in B. anthracis Ames Ancestor was used as reference for comparison with the sequences of the B. cereus-B. thuringiensis strains. These sequences were imported into MEGA-X software for comparison and analysis to find specific SNP sites that distinguish B. anthracis from its genetic near-neighbors B. cereus and B. thuringiensis (Fig. 1).

Construction of eProbes.

As shown in Fig. 1, taking the identified SNP in each locus as the base point, 24 bp upstream and 25 bp downstream of the SNP were used to create an electronic probe (eProbe) of 50 bp. A FASTA-format file composed of each eProbe (containing the sequences from all strains) was imported into MEGA software for comparison and analysis. If this tag was identical in all B. anthracis strains, and at least one base in the tag was different from that in all B. cereus and B. thuringiensis strains, we selected it as an eProbe. By adjusting the positions of the upstream and downstream sequences, we identified several B. anthracis-specific eProbes (eProbes_Ba) at different loci (Table S2). At the same time, we analyzed the sequences of all B. cereus-B. thuringiensis strains corresponding to the eProbes of B. anthracis and constructed a “degenerate eProbe” for the B. cereus-B. thuringiensis strains (Table S2, eProbes_Bct). The software was created with Python version 3.7 and employs an in silico combination of eProbes_Ba and eProbes_Bct using nucleotide sequencing data. Whether the data to be tested match the probe is used to judge whether the strain is B. anthracis. This does not rely on comparison scoring, and the exact answer is provided directly as output.

Detection of B. anthracis based on Cas12a-RPA.

The CRISPR/Cas12a system contained the Cas12a protein, the target DNA, B. anthracis-specific CRISPR RNAs (crRNAs), and a single-stranded DNA (ssDNA) reporter. To enable on-site detection, the ssDNA (a 12-base probe) was labeled with 6-carboxyfluorescein (FAM) and the quenching group BHQ1: 5′-FAM–GAGACCGACCTG-3′-BHQ1. When the target DNA of B. anthracis was found by the detection system, the Cas12a/crRNA binary complex formed a ternary complex with the target DNA, and the ssDNA probe was cleaved by Cas12a; the quencher BHQ1 was thus separated from the fluorescent probe FAM, and the resulting green fluorescence could be seen with the naked eye under blue light in the wavelength range of 450 to 480 nm (Fig. 4A). The Cas12a reaction was conducted at 37°C for 60 min (Fig. 4A) in a 20-μL volume (Table S5). Fluorescence intensities were detected at 60 min using a Bio-Rad real-time PCR CFX96 instrument in FAM mode (Life Science, Hercules, CA, USA) or with the naked eye under blue light (Fig. 4A). The formation of a Cas12a/crRNA/DNA ternary complex requires the target DNA to contain a protospacer-adjacent motif sequence (PAM; a TTTN sequence), which can be added to the primers and introduced during amplification (Fig. 4B). The sequences of the RPA primers used are shown in Table S3, and RPA was performed using the TwistAmp liquid exo kit (TwistDX, Maidenhead, UK). The G+C mol% content of the RPA primers (Table S3) was between 20% and 70%, the melting temperature (T) was between 50°C and 100°C, and the RPA reaction was performed at 37°C for 30 min (Table S4). To shorten the reaction time, the length of the DNA product was kept between 100 bp and 150 bp. The formation of a Cas12a/crRNA/DNA ternary complex requires crRNA. The 17-base cRNA sequence of the target DNA and the universal sequence (5′-AAUUUCUACUGUUGUAGAU-3′) formed a complete crRNA (Table S3) (38). The detection efficiencies of crRNAs for SNP sites located at six different base positions downstream of the PAM were inconsistent. The crRNA targeting the noncoding strand was designed in the same way (Fig. 4B). The sequences of the chosen RPA oligonucleotide primers and crRNAs after preexperimental analysis using PCR (Vazyme catalog no. P510-01; Nanjing, China) are shown in Table S3. The RPA oligonucleotide primers, crRNAs, and single-stranded DNA probes were synthesized by General Biosystems Co. Ltd. (Anhui, China).
  36 in total

1.  DNA-DNA hybridization values and their relationship to whole-genome sequence similarities.

Authors:  Johan Goris; Konstantinos T Konstantinidis; Joel A Klappenbach; Tom Coenye; Peter Vandamme; James M Tiedje
Journal:  Int J Syst Evol Microbiol       Date:  2007-01       Impact factor: 2.747

2.  Bacillus anthracis pXO1 plasmid sequence conservation among closely related bacterial species.

Authors:  James Pannucci; Richard T Okinaka; Robert Sabin; Cheryl R Kuske
Journal:  J Bacteriol       Date:  2002-01       Impact factor: 3.490

3.  Description of Bacillus toyonensis sp. nov., a novel species of the Bacillus cereus group, and pairwise genome comparisons of the species of the group by means of ANI calculations.

Authors:  Guillermo Jiménez; Mercedes Urdiain; Ana Cifuentes; Aránzazu López-López; Anicet R Blanch; Javier Tamames; Peter Kämpfer; Anne-Brit Kolstø; Daniel Ramón; Juan F Martínez; Francisco M Codoñer; Ramon Rosselló-Móra
Journal:  Syst Appl Microbiol       Date:  2013-06-19       Impact factor: 4.022

4.  The Ba813 chromosomal DNA sequence effectively traces the whole Bacillus anthracis community.

Authors:  V Ramisse; G Patra; J Vaissaire; M Mock
Journal:  J Appl Microbiol       Date:  1999-08       Impact factor: 3.772

5.  A randomly amplified polymorphic DNA marker specific for the Bacillus cereus group is diagnostic for Bacillus anthracis.

Authors:  D Daffonchio; S Borin; G Frova; R Gallo; E Mori; R Fani; C Sorlini
Journal:  Appl Environ Microbiol       Date:  1999-03       Impact factor: 4.792

6.  Bacillus cytotoxicus sp. nov. is a novel thermotolerant species of the Bacillus cereus Group occasionally associated with food poisoning.

Authors:  Marie-Hélène Guinebretière; Sandrine Auger; Nathalie Galleron; Matthias Contzen; Benoit De Sarrau; Marie-Laure De Buyser; Gilles Lamberet; Annette Fagerlund; Per Einar Granum; Didier Lereclus; Paul De Vos; Christophe Nguyen-The; Alexei Sorokin
Journal:  Int J Syst Evol Microbiol       Date:  2012-02-10       Impact factor: 2.747

7.  A Whole-Genome-Based Gene-by-Gene Typing System for Standardized High-Resolution Strain Typing of Bacillus anthracis.

Authors:  Mostafa Y Abdel-Glil; Alexandra Chiaverini; Giuliano Garofolo; Antonio Fasanella; Antonio Parisi; Dag Harmsen; Keith A Jolley; Mandy C Elschner; Herbert Tomaso; Jörg Linde; Domenico Galante
Journal:  J Clin Microbiol       Date:  2021-06-18       Impact factor: 5.948

8.  Non-Toxin-Producing Bacillus cereus Strains Belonging to the B. anthracis Clade Isolated from the International Space Station.

Authors:  Kasthuri Venkateswaran; Nitin K Singh; Aleksandra Checinska Sielaff; Robert K Pope; Nicholas H Bergman; Sandra P van Tongeren; Nisha B Patel; Paul A Lawson; Masataka Satomi; Charles H D Williamson; Jason W Sahl; Paul Keim; Duane Pierson; Jay Perry
Journal:  mSystems       Date:  2017-06-27       Impact factor: 6.496

9.  CRISPR-Cas12a target binding unleashes indiscriminate single-stranded DNase activity.

Authors:  Janice S Chen; Enbo Ma; Lucas B Harrington; Maria Da Costa; Xinran Tian; Joel M Palefsky; Jennifer A Doudna
Journal:  Science       Date:  2018-02-15       Impact factor: 47.728

10.  Identification of the immunogenic spore and vegetative proteins of Bacillus anthracis vaccine strain A16R.

Authors:  Xiankai Liu; Dongshu Wang; Jingxiao Ren; Chao Tong; Erling Feng; Xuefang Wang; Li Zhu; Hengliang Wang
Journal:  PLoS One       Date:  2013-03-13       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.