| Literature DB >> 28852435 |
Jacqueline Hollensteiner1, Anja Poehlein1, Cathrin Spröer2, Boyke Bunk2, Anna E Sheppard3,4, Philip Rosentstiel5, Hinrich Schulenburg4, Heiko Liesegang1.
Abstract
10.1601/nm.5000 is a rod-shaped facultative anaerobic spore forming bacterium of the genus 10.1601/nm.4857. The defining feature of the species is the ability to produce parasporal crystal inclusion bodies, consisting of δ-endotoxins, encoded by cry-genes. Here we present the complete annotated genome sequence of the nematicidal 10.1601/nm.5000 strain MYBT18246. The genome comprises one 5,867,749 bp chromosome and 11 plasmids which vary in size from 6330 bp to 150,790 bp. The chromosome contains 6092 protein-coding and 150 RNA genes, including 36 rRNA genes. The plasmids encode 997 proteins and 4 t-RNA's. Analysis of the genome revealed a large number of mobile elements involved in genome plasticity including 11 plasmids and 16 chromosomal prophages. Three different nematicidal toxin genes were identified and classified according to the Cry toxin naming committee as cry13Aa2, cry13Ba1, and cry13Ab1. Strikingly, these genes are located on the chromosome in close proximity to three separate prophages. Moreover, four putative toxin genes of different toxin classes were identified on the plasmids p120510 (Vip-like toxin), p120416 (Cry-like toxin) and p109822 (two Bin-like toxins). A comparative genome analysis of 10.1601/nm.5000 MYBT18246 with three closely related 10.1601/nm.5000 strains enabled determination of the pan-genome of 10.1601/nm.5000 MYBT18246, revealing a large number of singletons, mostly represented by phage genes, morons and cryptic genes.Entities:
Keywords: Bacillus cereus sensu lato; Bacillus thuringiensis; Pan-Core-genome; Parasporal crystal protein; Prophages
Year: 2017 PMID: 28852435 PMCID: PMC5569534 DOI: 10.1186/s40793-017-0259-x
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Microscopic characteristics of 10.1601/nm.5000 MYBT18246. a Light microscope analysis of Gram stained 10.1601/nm.5000 MYBT18246 cells (40×). b Phase contrast microscope analysis of sporulated and Cry-toxin producing cells of 10.1601/nm.5000 MYBT18246 (40×)
Classification and general features of 10.1601/nm.5000 MYBT18246 [54]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum 10.1601/nm.3874 | TAS [ | ||
| Class 10.1601/nm.4854 | TAS [ | ||
| Order 10.1601/nm.4855 | TAS [ | ||
| Family 10.1601/nm.4856 | TAS [ | ||
| Genus 10.1601/nm.4857 | TAS [ | ||
| Species 10.1601/nm.5000 | TAS [ | ||
| Strain MYBT18246 | IDA | ||
| Gram stain | positive | IDA | |
| Cell shape | rod-shaped | IDA | |
| Motility | Motile | TAS [ | |
| Sporulation | Spore-forming | IDA | |
| Temperature range | 10–48 °C | TAS [ | |
| Optimum temperature | 28–37 °C | TAS [ | |
| pH range; Optimum | 4.9–8.0; 7.0 | TAS [ | |
| Carbon source | Organic carbon source | NAS | |
| MIGS-6 | Habitat | Worldwide | TAS [ |
| MIGS-6.3 | Salinity | Salt tolerant | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic, facultative anaerobic | TAS [ |
| MIGS-15 | Biotic relationship | Free-living, microparasite of | TAS [ |
| MIGS-14 | Pathogenicity | Nematode pathogen | TAS [ |
| MIGS-4 | Geographic location | not reported | |
| MIGS-5 | Sample collection | not reported | |
| MIGS-4.1 | Latitude | unreported | |
| MIGS-4.2 | Longitude | unreported | |
| MIGS-4.4 | Altitude | unreported |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project
Fig. 2Phylogenetic tree highlighting the taxonomic relation of 10.1601/nm.5000 MYBT18246 (red) based on a) 16rDNA amplicon within the 10.1601/nm.4857 clade b) Multi-locus sequence typing within the 10.1601/nm.4885 sensu lato species group. GenBank accession numbers are given in parentheses. Comparison includes strains of the 10.1601/nm.4854 clade or Bcsl group members (blue). 10.1601/nm.5141 10.1601/strainfinder?urlappend=%3Fid%3DDSM+25430 or 10.1601/nm.4858 str. 168 has been used as outlier to root the tree. Sequences were aligned using ClustalW 1.6 [91, 92]. The phylogenetic tree was constructed by using the Neighbor-Joining method [50] and evolutionary distances were computed by the Maximum Composite Likelihood method [51] within MEGA7.0 [52]. Numbers at the nodes are bootstrap values calculated from 1000 replicates
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Complete |
| MIGS-28 | Libraries used | Two genomic libraries: 454 pyrosequencing shotgun library, PacBio library |
| MIGS 29 | Sequencing platforms | 454 GS FLX system, PacBioRSII |
| MIGS 31.2 | Fold coverage | 18 × 454; 50 × PacBio |
| MIGS 30 | Assemblers | Newbler 2.8; HGAP v2.3.0 |
| MIGS 32 | Gene calling method | Prodigal 2.6 |
| Locus Tag | BT246 | |
| Genbank ID |
| |
| GenBank Date of Release | 2016–07-15 | |
| GOLD ID | Gp0020852 | |
| BIOPROJECT |
| |
| MIGS 13 | Source Material Identifier | Department of Evolutionary Ecology and Genetics, CAU, Kiel |
| Project relevance | Evolution |
Summary of genome: one chromosome and 11 plasmids
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 5.8 | Circular |
| NZ_CP015350.1 |
| Plasmid 1 | 0.151 | Circular |
| NZ_CP015351.1 |
| Plasmid 2 | 0.142 | Circular |
| NZ_CP015352.1 |
| Plasmid 3 | 0.121 | Circular |
| NZ_CP015353.1 |
| Plasmid 4 | 0.120 | Circular |
| NZ_CP015354.1 |
| Plasmid 5 | 0.110 | Circular |
| NZ_CP015355.1 |
| Plasmid 6 | 0.101 | Circular |
| NZ_CP015356.1 |
| Plasmid 7 | 0.055 | Linear |
| NZ_CP015357.1 |
| Plasmid 8 | 0.047 | Circular |
| NZ_CP015358.1 |
| Plasmid 9 | 0.017 | Linear |
| NZ_CP015359.1 |
| Plasmid 10 | 0.014 | Linear |
| NZ_CP015360.1 |
| Plasmid 11 | 0.006 | Circular |
| NZ_CP015361.1 |
Genome statistics
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 6,752,488 | 100 |
| DNA coding (bp) | 5,623,665 | 83.28 |
| DNA G + C (bp) | 2,389,665 | 35.39 |
| DNA scaffolds | 12 | 100 |
| Total genes | 7239 | 100 |
| Protein coding genes | 7089 | 97.9 |
| RNA genes | 151 | 2.09 |
| Genes in internal clusters | 2694 | 37.22 |
| Genes with function prediction | 5274 | 72.86 |
| Genes assigned to COGs | 4662 | 64.40 |
| Genes with Pfam domains | 5503 | 76.02 |
| Genes with signal peptides | 500 | 6.91 |
| Genes with transmembrane helices | 1863 | 25.74 |
| CRISPR repeats | 0 | 0 |
Number of protein encoding genes associated with general COG functional categories
| Code | Value | % | Description |
|---|---|---|---|
| J | 226 | 3.19 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 487 | 6.87 | Transcription |
| L | 625 | 8.82 | Replication, recombination and repair |
| B | 1 | 0.01 | Chromatin structure and dynamics |
| D | 59 | 0.83 | Cell cycle control, Cell division, chromosome partitioning |
| V | 141 | 1.99 | Defense mechanisms |
| T | 218 | 3.08 | Signal transduction mechanisms |
| M | 270 | 3.81 | Cell wall/membrane biogenesis |
| N | 64 | 0.90 | Cell motility |
| U | 79 | 1.12 | Intracellular trafficking and secretion |
| O | 121 | 1.71 | Posttranslational modification, protein turnover, chaperones |
| C | 214 | 3.02 | Energy production and conversion |
| G | 263 | 3.71 | Carbohydrate transport and metabolism |
| E | 420 | 5.93 | Amino acid transport and metabolism |
| F | 128 | 1.81 | Nucleotide transport and metabolism |
| H | 177 | 2.50 | Coenzyme transport and metabolism |
| I | 129 | 1.82 | Lipid transport and metabolism |
| P | 243 | 3.43 | Inorganic ion transport and metabolism |
| Q | 83 | 1.17 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 653 | 9.22 | General function prediction only |
| S | 505 | 7.13 | Function unknown |
| - | 1978 | 27.9 | Not in COGs |
aThe total number is based on the total number of protein coding genes in the genome
Fig. 3Circular visualization of the genome comparison of 10.1601/nm.5000 MYBT18246 with 3 other sequenced 10.1601/nm.5000 strains. The tracks from the outside represent: (track 1–2) Genes encoded by the leading and lagging strand of 10.1601/nm.5000 MYBT18246 marked in COG colors [93]; (track 3) putative prophage regions, identified with PHAST in blue [76], (track 4) identified cry toxin genes in red; (track 5–7) orthologs for the genomes of 10.1601/nm.5000 YBT-1518 (CP005935.1), 10.1601/nm.5000 CT-43 (CP001907.1), 10.1601/nm.5000 Bt407 (CP003889.1) illustrated in red to light yellow, singletons in grey (grey: <1e−20; light yellow: 1e−21–1e−50; gold: 1e−51–1e−90; light orange: 1e−91–1e−100; orange: 1e−101–1e−120; red: >1e−121 (track 7) % GC plot (track 8), GC skew [(GC)/(G + C)]. Visualization was done with DNAPlotter [94]
Fig. 4Venn diagram of the genome comparison of 10.1601/nm.5000 MYBT18246 with other 10.1601/nm.5000 strains. Venn diagram displays the orthologous genes between 10.1601/nm.5000 MYBT18246 (CP015350-CP015361), 10.1601/nm.5000 YBT-1518 (CP005935-CP002486), 10.1601/nm.5000 serovar chinensis CT-43 (CP001907-CP001917) and 10.1601/nm.5000 Bt407 (CP003889-CP003898). Ortholog detection was performed with Proteinortho [84] including protein blast with a similarity cut-off of (50%) and an E-value of 1e−10. The total number of genes and paralogs are depicted under the corresponding species name. Open reading frames that were classified as pseudogenes were not included in this analysis
General genome features of 10.1601/nm.5000 MYBT18246 and close relatives
| Genome features | Genome name | |||
|---|---|---|---|---|
| 10.1601/nm.5000 MYBT18246a | 10.1601/nm.5000 407b | 10.1601/nm.5000 YBT-1518c | 10.1601/nm.5000 CT-43d | |
| Sequencing status | Finished | Finished | Finished | Finished |
| Genome size (Mbp) | 6.75 | 6.13 | 6.67 | 6.15 |
| DNA coding (bp) | 5,623,665 | 5,133,026 | 5,421,574 | 5,079,667 |
| GC (%) | 35.4 | 35.02 | 35.29 | 35.12 |
| DNA scaffolds | 12 | 10 | 7 | 11 |
| Total gene count | 7239 | 6442 | 6738 | 6252 |
| Protein coding genes (%) | 97.9 | 95.9 | 98.0 | 95.1 |
| RNA genes | 151 | 180 | 139 | 124 |
| Genes in internal clusters | 2694 | 489 | 370 | 334 |
| Genes with function prediction | 5274 | 4615 | 5193 | 4211 |
| Genes assigned to COGs | 4662 | 3634 | 3746 | 3505 |
| Genes with Pfam domains | 5503 | 4991 | 5333 | 4809 |
| Genes with signal peptides | 500 | 447 | 471 | 418 |
| Genes with transmembrane helices | 1863 | 1750 | 1854 | 1698 |
| CRISPR repeats | 0 | 2 | 0 | 2 |
Accession numbers: aCP015350, bCP003889, cCP005935, dCP001907