Literature DB >> 30533713

Draft Genome Sequence of Bacillus anthracis Strain Sterne 09RA8929.

Anne Busch1, Mandy Carolina Elschner1, Daniela Jacob2, Roland Grunow2, Herbert Tomaso1.   

Abstract

A Bacillus anthracis vaccine strain (Sterne), used as an attenuated laboratory comparative strain, was sequenced and analyzed. A comparison to assemblies of B. anthracis strain Sterne (NZ_CP009541 and NZ_CP009540) was performed. The lack of the pX02 plasmid and pX01 in approximately five copies was confirmed.

Entities:  

Year:  2018        PMID: 30533713      PMCID: PMC6256644          DOI: 10.1128/MRA.00972-18

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Bacillus anthracis, a Gram-positive, spore-forming bacterium, is the etiological agent of anthrax (1, 2), a zoonotic disease that can infect humans directly or through products. To prevent anxtrax infections in livestock, vaccines comprising live B. anthracis strains (pX01+ [pX01 positive]; pX02− [pX02 negative]) for animals are available (3). We sequenced and analyzed the B. anthracis strain Sterne 09RA8929 at the Friedrich-Loeffler-Institut (FLI) to use it as a laboratory comparative reference strain. The strain was provided from a repository located at the Robert Koch Institute within the framework of the European Union–funded projects EQADeBa and EMERGE (grant agreement 677066) after two passages on Columbia sheep blood agar plates (Oxoid, Wesel, Germany). At the FLI, the isolate was cultivated for diagnostic purposes and added to the repository. The culture for DNA extraction was initiated from a single colony. DNA was extracted from bacterial cells that were grown in cell culture flasks for 24 h. DNA purification was performed using a Genomic-tip 100/G instrument and a genomic DNA buffer kit (Qiagen, Hilden, Germany) following the manufacturer’s instructions. Genome sequencing was carried out using Illumina DNA sequencing at GATC Biotech (Germany) with the library protocol adapted and validated for use with the HiSeq 2500 platform. A paired-end library was constructed with a target insert size of 300 bp, which resulted in 6 million reads. A total of 12 million (151-bp) sequences, with an average Phred score of >38, were generated. The sequences were assembled using SPAdes version 3.9.1 in Bayes Hammer mode, to minimize mismatches and short indels (--careful) (4) and filtered for contamination with Kraken (5). Subsequently, annotation was performed with Prokka using the standard settings (6). The assembly resulted in 33 contigs with an N50 value of 1,162,008 bp and a total sequence length containing 5,366,275 bp. The annotation predicted 12 RNAs, 5,639 coding DNA sequences (CDSs), 79 tRNAs, and 1 tmRNA. A phylogenetic analysis of the 16S rRNA sequences with Mole-BLAST and PhyloPhlAn (7) produced consistent results. From the phylogeny determined with PhyloPhlAn, a differentiation within the Bacillus genus was possible, although the coding sequences of the plasmids and highly variable sequences are excluded with this method. This bioinformatics approach complements the methods of single-nucleotide polymorphism typing and multilocus variable-number tandem-repeat analysis. After quality checking, trimming, and merging with BBDuk, and removal of duplicates, the reads were mapped to other genome sequences in GenBank (accession numbers NZ_CP009540, NZ_CP009541, and NZ_CP010794) with the Bowtie2 module included in Geneious (7), and variant-calling files were generated. Variants are reported with a minimum variant frequency of 0.9, a minimum P value of 10e-60, and a minimum strand bias P value of 10e-5, and supported by at least five reads. A 37-fold mean coverage of the chromosome was reported with 5 million bases showing 99.99% identity with the reference sequence (5,227,150 of 5,227,496). For the chromosome, 88 variants were called, and 36 variants were predicted to affect known proteins. The plasmid pX01 yielded a 161-fold mean coverage of 181,803 bases with 99.8% identity to the reference sequence (181,233 of 181,624). For the plasmid, 13 variants were called; seven of these affected known proteins (Table 1). With this described method and data, future B. anthracis strains can be monitored. The average chromosome/plasmid coverage ratio suggests that the plasmid pX01 is unusually represented with approximately five copies per cell. It has been reported that sequencing coverage provides exact estimates for the plasmid copy number (8). Since no mapping occurred on one of the pX02 reference sequences (GenBank accession number NZ_CP010794), the existence of pX02 could be excluded.
TABLE 1

Features of the genome of B. anthracis Sterne 09RA8929, including variants to the chromosome and plasmid sequences

RefSeq no.Nucleotide positionReferenceAlterationVariation typeCDS typebTranslational effect
NZ_CP009541345217CTSNPc (transition)Hypothetical proteinNone
NZ_CP009541345220CGGTGCSubstitutionHypothetical proteinSubstitution
NZ_CP009541345226TGSNP (transversion)Hypothetical proteinNone
NZ_CP009541959629GTSNP (transversion)TransposaseSubstitution
NZ_CP009541959936CTSNP (transition)TransposaseSubstitution
NZ_CP0095411273892TTAInsertion (tandem repeat)His BFrameshift
NZ_CP0095411291383AGSNP (transition)xx
NZ_CP0095411291518CASNP (transversion)xx
NZ_CP0095411291524TCSNP (transition)xx
NZ_CP0095411291568TGSNP (transversion)xx
NZ_CP0095411291570AGSNP (transition)xx
NZ_CP0095411291573CTSNP (transition)xx
NZ_CP0095411291606CTSNP (transition)xx
NZ_CP0095411291608AGSNP (transition)xx
NZ_CP0095411291642CTCDeletion (tandem repeat)xx
NZ_CP0095411442326TTGInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095411442340AGSNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442346AGSNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442349AGSNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442358GASNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442367GASNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442379TTGInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095411442465GASNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411442471GASNP (transition)Hypothetical proteinSubstitution
NZ_CP0095411518305GGTInsertion (tandem repeat)IsocitratelyaseFrameshift
NZ_CP0095411602248GGTInsertion (tandem repeat)xx
NZ_CP0095411644562TTAInsertion (tandem repeat)xx
NZ_CP0095411820884GGAInsertion (tandem repeat)Quinolone resistance proteinFrameshift
NZ_CP0095411835869AATInsertion (tandem repeat)xx
NZ_CP0095411911501AATInsertion (tandem repeat)Amino acid permeaseFrameshift
NZ_CP0095412082647AATInsertion (tandem repeat)ThioredoxinFrameshift
NZ_CP0095412218818CCTInsertion (tandem repeat)xx
NZ_CP0095412270478GGAInsertion (tandem repeat)Alkyl-hydroperoxide reductase subunit FFrameshift
NZ_CP0095412340581GGAInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095412442080TATDeletion (tandem repeat)Bcr/CflA family drug resistance efflux transporterFrameshift
NZ_CP0095412474920TCTDeletion (tandem repeat)xx
NZ_CP0095412474929TATDeletionxx
NZ_CP0095412611141TTAInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095412737890AATInsertion (tandem repeat)xx
NZ_CP0095412782850GGAInsertion (tandem repeat)Acyl-CoA-dehydrogenaseFrameshift
NZ_CP0095412823935GGTInsertion (tandem repeat)NADH-quinoneoxido reductase subunit HFrameshift
NZ_CP0095412906141AGSNP (transition)Transposase CDSSubstitution
NZ_CP0095412906932GASNP (transition)Transposase CDSSubstitution
NZ_CP0095412923257TTAInsertion (tandem repeat)Chitooligosaccharide deacetylaseFrameshift
NZ_CP0095412923451CCTInsertion (tandem repeat)xx
NZ_CP0095412946626TTAInsertion (tandem repeat)xx
NZ_CP0095413004271TTAInsertion (tandem repeat)xx
NZ_CP0095413011141GGTInsertion (tandem repeat)xx
NZ_CP0095413128843GGAInsertion (tandem repeat)LipoylsynthaseFrameshift
NZ_CP0095413169582AATInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095413169588AATInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095413201261AATInsertion (tandem repeat)xx
NZ_CP0095413201734TATDeletionxx
NZ_CP0095413201748GASNP (transition)xx
NZ_CP0095413330387TCCTSubstitutionSpore germination protein GerIASubstitution
NZ_CP0095413330410CTSNP (transition)Spore germination protein GerIASubstitution
NZ_CP0095413330416TTCGAASubstitutionSpore germination protein GerIASubstitution
NZ_CP0095413330421GTSNP (transversion)Spore germination protein GerIASubstitution
NZ_CP0095413330424GCATSubstitutionSpore germination protein GerIASubstitution
NZ_CP0095413330430CTSNP (transition)Spore germination protein GerIANone
NZ_CP0095413361291CCTInsertion (tandem repeat)xx
NZ_CP0095413364114AGSNP (transition)xx
NZ_CP0095413389585GGAInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP0095413491916CCAInsertion (tandem repeat)DNA polymerase/3'-5'-exonuclease PolXFrameshift
NZ_CP0095413537995TTAInsertion (tandem repeat)ABC transporter ATP-binding proteinFrameshift
NZ_CP0095413647307GGTInsertion (tandem repeat)Rsf A family transcriptional regulatorFrameshift
NZ_CP0095413647932GGTInsertion (tandem repeat)Recombinase Rar AFrameshift
NZ_CP0095413695170AATInsertion (tandem repeat)GNAT family N-acetyltransferaseFrameshift
NZ_CP0095413726079TTAInsertion (tandem repeat)Molecular chaperone DnaJFrameshift
NZ_CP0095413744057GGAInsertion (tandem repeat)RNA polymerase sigma factor RpoDFrameshift
NZ_CP0095413873967CCTInsertion (tandem repeat)Pyrroline-5-carboxylatereductaseFrameshift
NZ_CP0095413951268GAGDeletionphospho-enol-pyruvate protein phosphotransferaseFrameshift
NZ_CP0095414137011CCTInsertion (tandem repeat)xx
NZ_CP0095414328378CCTInsertion (tandem repeat)xx
NZ_CP0095414391405CCAInsertion (tandem repeat)HolinFrameshift
NZ_CP0095414422132CASNP (transversion)xx
NZ_CP0095414766848CCTInsertion (tandem repeat)xx
NZ_CP0095414828439TTAInsertion (tandem repeat)xx
NZ_CP0095414994978TTAInsertion (tandem repeat)Multidrug MFSd transporterFrameshift
NZ_CP0095415006878GCSNP (transversion)TransposaseSubstitution
NZ_CP0095415007511TCSNP (transition)TransposaseExtension
NZ_CP0095415007520ACSNP (transversion)xx
NZ_CP0095415007525GASNP (transition)xx
NZ_CP0095415007546ACSNP (transversion)xx
NZ_CP0095415007607ACSNP (transversion)xx
NZ_CP0095415037843TGSNP (transversion)ABC transporter permeaseSubstitution
NZ_CP0095415116913CGTASubstitutionEnterotoxin CDSSubstitution
NZ_CP0095415116917AGSNP (transition)Enterotoxin CDSNone
NZ_CP0095401GASNP (transition)Hypothetical proteinSubstitution
NZ_CP0095402AGSNP (transition)Hypothetical proteinNone
NZ_CP0095404AAACAInsertionHypothetical proteinInsertion
NZ_CP00954013115TTAInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP00954017095CCTInsertion (tandem repeat)xx
NZ_CP00954020289AATInsertion (tandem repeat)xx
NZ_CP00954065111GGAInsertion (tandem repeat)Hypothetical proteinFrameshift
NZ_CP00954087185TTAInsertion (tandem repeat)PrgI family proteinFrameshift
NZ_CP009540126302AATInsertion (tandem repeat)IS4 family transposaseFrameshift
NZ_CP009540126914GGTInsertion (tandem repeat)Xx
NZ_CP009540128396CCTInsertion (tandem repeat)IS4 family transposaseFrameshift
NZ_CP009540162415CCTInsertion (tandem repeat)xx
NZ_CP009540181623GASNP (transition)Hypothetical proteinNone

The GenBank accession number for the chromosome sequence is NZ_CP009541 and for the plasmid sequence is NZ_CP009540.

CoA, coenzyme A; x, no CDS, and thus no translational effect, was predicted.

SNP, single nucleotide polymorphism.

MFS, major facilitator superfamily.

Features of the genome of B. anthracis Sterne 09RA8929, including variants to the chromosome and plasmid sequences The GenBank accession number for the chromosome sequence is NZ_CP009541 and for the plasmid sequence is NZ_CP009540. CoA, coenzyme A; x, no CDS, and thus no translational effect, was predicted. SNP, single nucleotide polymorphism. MFS, major facilitator superfamily.

Data availability.

The genome sequence of B. anthracis strain Sterne 09RA8929 has been deposited in NCBI GenBank under BioSample number SAMN09635715 and BioProject number PRJNA422985. Raw data have been submitted to the Sequence Read Archive under accession number SRP159486.
  8 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  On Anthrax and Anthracaemia in Wool-Sorters, Heifers, and Sheep.

Authors:  J H Bell
Journal:  Br Med J       Date:  1880-10-23

3.  Prokka: rapid prokaryotic genome annotation.

Authors:  Torsten Seemann
Journal:  Bioinformatics       Date:  2014-03-18       Impact factor: 6.937

Review 4.  Anthrax Pathogenesis.

Authors:  Mahtab Moayeri; Stephen H Leppla; Catherine Vrentas; Andrei P Pomerantsev; Shihui Liu
Journal:  Annu Rev Microbiol       Date:  2015-07-16       Impact factor: 15.500

5.  Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data.

Authors:  Matthew Kearse; Richard Moir; Amy Wilson; Steven Stones-Havas; Matthew Cheung; Shane Sturrock; Simon Buxton; Alex Cooper; Sidney Markowitz; Chris Duran; Tobias Thierer; Bruce Ashton; Peter Meintjes; Alexei Drummond
Journal:  Bioinformatics       Date:  2012-04-27       Impact factor: 6.937

6.  Development of a Sterne-Based Complement Fixation Test to Monitor the Humoral Response Induced by Anthrax Vaccines.

Authors:  Rosanna Adone; Michela Sali; Massimiliano Francia; Michela Iatarola; Adelia Donatiello; Antonio Fasanella
Journal:  Front Microbiol       Date:  2016-01-28       Impact factor: 5.640

7.  Kraken: ultrafast metagenomic sequence classification using exact alignments.

Authors:  Derrick E Wood; Steven L Salzberg
Journal:  Genome Biol       Date:  2014-03-03       Impact factor: 13.583

8.  Comparison of six commercial kits to extract bacterial chromosome and plasmid DNA for MiSeq sequencing.

Authors:  Laura Becker; Matthias Steglich; Stephan Fuchs; Guido Werner; Ulrich Nübel
Journal:  Sci Rep       Date:  2016-06-17       Impact factor: 4.379

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.