| Literature DB >> 29876371 |
Rashmi Rathy1, Sayan Paul1, Vasanthakumar Ponesakki1, Paulkumar Kanniah1, Suriya Prabha Muthu1, Arun Arumugaperumal1, Emmanuel Joshua Jebasingh Sathiya Balasingh Thangapandi1, Subburathinam Balakrishnan1, Rajendhran Jeyaprakash2, Sudhakar Sivasubramaniam1.
Abstract
Bacillus species 062011 msu is a harmful pathogenic strain responsible for causing abscessation in sheep and goat population studied by Mariappan et al. (2012) [1]. The organism specifically targets the female sheep and goat population and results in the reduction of milk and meat production. In the present study, we have performed the whole genome sequencing of the pathogenic isolate using the Ion Torrent sequencing platform and generated 458,944 raw reads with an average length of 198.2 bp. The genome sequence was assembled, annotated and analysed for the genetic islands, metabolic pathways, orthologous groups, virulence factors and antibiotic resistance genes associated with the pathogen. Simultaneously the 16S rRNA sequencing study and genome sequence comparison data confirmed that the strain belongs to the species Bacillus cereus and exhibits 99% sequence homo;logy with the genomes of B. cereus ATCC 10987 and B. cereus FRI-35. Hence, we have renamed the organism as Bacillus cereus 062011msu. The Whole Genome Shotgun (WGS) project has been deposited at DDBJ/ENA/GenBank under the accession NTMF00000000 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA404036(SAMN07629099)).Entities:
Keywords: Abscessation; Bacillus cereus; Genome sequencing; Virulence factors
Year: 2018 PMID: 29876371 PMCID: PMC5988026 DOI: 10.1016/j.dib.2017.12.054
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Summary statistics of Bacillus cereus 062011msu genome assembly.
| average quality PHRED-score Q20 (before trimming) | 26,143 (abs) |
| average quality PHRED-score Q25 (before trimming) | 45,447 (abs) |
| average quality PHRED-score Q30 (before trimming) | 21,587 (abs) |
| No. of clean reads | 432,619 |
| Average length of clean reads | 178.08 |
| average quality PHRED-score Q20 (after trimming) | 5096 (abs) |
| average quality PHRED-score Q25 (after trimming) | 54,520 (abs) |
| average quality PHRED-score Q30 (after trimming) | 29,972 (abs) |
| Percentage of clean reads | 94.27% |
| Total no. of assembled contigs | 3200 |
| Mean contig length | 1611 |
| N50 length | 2922 |
| N25 length | 5365 |
| Sequencing depth | 15X |
| GC% | 35.3% |
| Size of genome | 5,154,790 |
Abs: number of sequences observed at that quality score.
Fig. 1Genetic islands in the Bacillus cereus 062011msu. Total 10 genetic islands were predicted by using the Island Viewer 4.
Fig. 2Subsystem distribution of Bacillus cereus 062011msu genome based on RAST annotation server.
Fig. 3Pie chart representing the distribution of KEGG pathways associated with the genome of Bacillus cereus 062011msu. The pathways were obtained by annotating the protein coding sequences against the KEGG database.
Fig. 4Clusters of Orthologous Groups (COG) distribution of the protein coding genes in Bacillus cereus 062011msu obtained from EggNog database.
List of virulence factors identified in the annotated genome dataset of Bacillus cereus 062011msu.
| SeqName | Start | Stop | Strand | Description | Length | e-Value | Sim mean |
|---|---|---|---|---|---|---|---|
| contig_1017 | 240 | 1 | − | Flagellum-specific ATP synthase | 240 | 6.88E−53 | 100 |
| contig_1017 | 1466 | 1305 | − | Flagellar assembly protein H | 162 | 3.05E−15 | 100 |
| contig_1017 | 2984 | 1980 | − | Flagellar motor switch protein G | 1005 | 0 | 100 |
| contig_1017 | 3453 | 2998 | − | Flagellar MS-ring protein | 456 | 1.33E−40 | 100 |
| contig_1026 | 114 | 1106 | + | UDP-glucose 4-epimerase | 993 | 0 | 100 |
| contig_1050 | 1366 | 557 | − | UTP-glucose-1-phosphate uridylyltransferase | 810 | 0 | 100 |
| contig_1052 | 162 | 16 | − | Flagellar biosynthesis protein FliR | 147 | 1.96E−12 | 100 |
| contig_1052 | 305 | 192 | − | Flagellar biosynthesis protein FliQ | 114 | 4.26E−20 | 100 |
| contig_1052 | 1144 | 503 | − | Flagellar biosynthesis protein FliP | 642 | 1.91E−130 | 100 |
| contig_1052 | 1792 | 1601 | − | Flagellar motor switch protein FliN | 192 | 7.43E−39 | 100 |
| contig_1108 | 28 | 837 | + | Flagellar motor protein MotB | 810 | 6.97E−171 | 100 |
| contig_1140 | 301 | 1116 | + | Flagellar biosynthesis regulator FlhF | 816 | 0 | 100 |
| contig_1140 | 1159 | 1497 | + | Flagellar basal body rod protein FlgG | 339 | 6.74E−76 | 100 |
| contig_1208 | 569 | 387 | − | Flagellar hook protein FlgE | 183 | 5.15E−35 | 100 |
| contig_1208 | 1752 | 1633 | − | Flagellar basal body rod modification protein | 120 | 1.19E−10 | 100 |
| contig_1235 | 13 | 726 | + | Immune inhibitor A metalloprotease | 714 | 9.71E−152 | 100 |
| contig_1277 | 720 | 535 | − | Glycosyl transferase, group 2 family protein | 186 | 8.07E−19 | 100 |
| contig_1321 | 1747 | 1574 | − | Chemotaxis protein CheV | 237 | 1.09E−50 | 100 |
| contig_1339 | 805 | 488 | − | Nonhemolytic enterotoxin NHE | 144 | 2.77E−25 | 100 |
| contig_1358 | 522 | 379 | − | Thiol-activated cytolysin | 837 | 0 | 100 |
| contig_1613 | 547 | 1092 | + | Caspsular polysaccharide biosynthesis protein | 264 | 4.12E−52 | 100 |
| contig_1634 | 912 | 76 | − | Hypothetical protein NEAT-type hemophore-mediated heme uptake system | 585 | 4.82E−92 | 100 |
| contig_1673 | 392 | 93 | − | Flagellar hook-basal body protein FliE | 300 | 1.15E−69 | 100 |
| contig_1673 | 834 | 538 | − | Flagellar basal body rod protein FlgC | 297 | 1.71E−68 | 100 |
| contig_1912 | 261 | 37 | − | Non-hemolytic enterotoxin A | 225 | 2.03E−42 | 100 |
| contig_1926 | 852 | 496 | − | Flagellar protein FliS, putative | 357 | 2.32E−74 | 100 |
| contig_1926 | 1621 | 977 | − | Flagellar capping protein | 645 | 4.44E−148 | 100 |
| contig_2028 | 167 | 21 | − | Flagellar motor protein MotS | 147 | 1.05E−29 | 100 |
| contig_2028 | 1158 | 886 | − | Flagellar motor protein MotP | 273 | 1.14E−33 | 100 |
| contig_2037 | 1066 | 857 | − | Chemotaxis protein methyltransferase CheR | 210 | 3.60E−45 | 100 |
| contig_2173 | 701 | 471 | − | Flagellar biosynthesis protein FliR | 231 | 7.44E−40 | 100 |
| contig_2221 | 38 | 262 | + | Hemolysin III | 225 | 8.80E−48 | 100 |
| contig_2339 | 364 | 29 | − | Glycosyl transferase, group 1 family protein | 336 | 5.30E−77 | 100 |
| contig_2555 | 475 | 242 | − | Flagellar hook-associated protein FlgL | 234 | 2.66E−48 | 100 |
| contig_2723 | 583 | 191 | − | O-antigen polymerase wzy | 393 | 5.21E−71 | 100 |
| contig_35 | 1687 | 995 | − | UDP-galactose phosphate transferase | 693 | 1.77E−168 | 100 |
| contig_35 | 2379 | 1687 | − | Aminotransferase family protein | 693 | 6.51E−174 | 100 |
| contig_439 | 4672 | 5061 | + | Transcriptional regulator PlcR, putative | 390 | 3.32E−90 | 100 |
| contig_572 | 306 | 88 | − | NAD dependent epimerase/dehydratase family protein | 219 | 2.36E−44 | 100 |
| contig_572 | 1657 | 284 | − | UDP-glucose 6-dehydrogenase | 1374 | 0 | 100 |
| contig_572 | 2422 | 1751 | − | Polysaccharide transport protein, putative | 672 | 9.86E−147 | 100 |
| contig_582 | 743 | 630 | − | Flagellar hook-associated protein FlgK | 114 | 2.91E−19 | 100 |
| contig_681 | 545 | 45 | − | Internalin, putative | 501 | 1.98E−116 | 100 |
| contig_694 | 5767 | 5450 | − | Channel protein, hemolysin III family | 318 | 5.60E−57 | 100 |
| contig_716 | 152 | 9 | − | Cytotoxin K | 144 | 4.42E−24 | 100 |
| contig_734 | 1083 | 232 | − | Phospholipase C | 852 | 0 | 100 |
| contig_875 | 1156 | 1413 | + | Iron compound ABC transporter iron compound-binding protein | 258 | 3.82E−40 | 100 |
| contig_875 | 2446 | 2847 | + | Iron compound ABC transporter permease protein | 402 | 8.95E−50 | 100 |
| contig_876 | 733 | 329 | − | Capsular exopolysaccharide family protein | 405 | 1.42E−75 | 100 |
| contig_882 | 3022 | 2846 | − | Flagellar motor switch protein | 177 | 1.30E−33 | 100 |
| contig_882 | 4973 | 4479 | − | Chemotaxis histidine kinase | 495 | 5.15E−94 | 100 |
| contig_899 | 782 | 63 | − | Capsular polysaccharide biosynthesis protein | 720 | 6.46E−178 | 100 |
| contig_899 | 1383 | 1264 | − | Tyrosine-protein kinase | 120 | 9.58E−21 | 100 |
| contig_903 | 2407 | 1010 | − | Flagellin | 1398 | 0 | 100 |
| contig_965 | 15 | 983 | + | Non-hemolytic enterotoxin C | 969 | 0 | 100 |
| contig_986 | 1110 | 1253 | + | Membrane-bound transcriptional regulator LytR | 144 | 2.26E−10 | 100 |
Fig. 5Phylogenetic tree based on 16S rRNA comparison of Bacillus cereus 062011msu with its closely related homologs using the MEGA7 software.
Fig. 6Circular genome map of Bacillus cereus 062011msu generated by DNAPlotter tool.
| Subject area | Biology |
| More specific subject area | Bioinformatics (Genomics) |
| Type of data | Table, figure |
| How data was acquired | Genome sequencing: Ion Torrent personal genome machine (PGM) (Life Technologies, Carlsbad, CA), |
| Denovo sequence assembly: CLC genomics workbench version 9.0.1, | |
| Bioinformatics approaches: NCBI Prokaryotic Genomes Automatic Annotation Pipeline (PGAAP), RAST genome annotation server ( | |
| Data format | Analyzed |
| Experimental factors | Genome sequencing, genome annotation, KEGG pathway analysis, orthologous group analysis, 16S rRNA and 23S rRNA based phylogeny. |
| Experimental features | The whole genome sequencing of |
| Data source location | Maruthamputhur village, Alangulam, Tirunelveli District, Tamil Nadu, India. (latitude: 8.8646N and longitude: 77.4960 E). |
| Data accessibility | Genome analysis and annotation data are given within this article and the raw data along with NCBI PGAAP annotation were deposited at NCBI repository: |
| Bioproject ID: 404036, BioSample: SAMN07629099 | |
| The Whole Genome Shotgun (WGS) project has been deposited at DDBJ/ENA/GenBank under the accession NTMF00000000 ( | |
| The genome annotation data obtained from the RAST server are given in this article. | |
| Related research article | “Bacillus sp. causing abscessation in sheep and goat population” by Mariappan et al. (2012) |