| Literature DB >> 26819653 |
Beiwen Zheng1, Fan Zhang2, Hao Dong3, Lujun Chai2, Fuchang Shu3, Shaojin Yi3, Zhengliang Wang3, Qingfeng Cui4, Hanping Dong4, Zhongzhi Zhang5, Dujie Hou2, Jinshui Yang6, Yuehui She3.
Abstract
Paenibacillus sp. strain A2 is a Gram-negative rod-shaped bacterium isolated from a mixture of formation water and petroleum in Daqing oilfield, China. This facultative aerobic bacterium was found to have a broad capacity for metabolizing hydrocarbon and organosulfur compounds, which are the main reasons for the interest in sequencing its genome. Here we describe the features of Paenibacillus sp. strain A2, together with the genome sequence and its annotation. The 7,650,246 bp long genome (1 chromosome but no plasmid) exhibits a G+C content of 54.2 % and contains 7575 protein-coding and 49 RNA genes, including 3 rRNA genes. One putative alkane monooxygenase, one putative alkanesulfonate monooxygenase, one putative alkanesulfonate transporter and four putative sulfate transporters were found in the draft genome.Entities:
Keywords: Genome; Hiseq2000; Paenibacillus sp. strain A2; Sulfonate biodegradation
Year: 2016 PMID: 26819653 PMCID: PMC4728784 DOI: 10.1186/s40793-015-0125-7
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Scanning electron micrograph of cells of strain A2. Bar: 5.0 μm
Classification and general features of Paenibacillus sp. strain A2
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain: | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | IDA | ||
| Strain: A2 | IDA | ||
| Gram stain | Positive | IDA | |
| Cell shape | Rod-shaped | IDA | |
| Motility | Motile | IDA | |
| Sporulation | Spore-forming | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 37rb | IDA | |
| pH range; Optimum | 5.0–9.0; 6.0–8.0 | IDA | |
| Carbon source | Glucose, xylose, mannitol, arabinose | IDA | |
| Energy source | Glucose, xylose, mannitol, arabinose | IDA | |
| Terminal electron receptor | Not reported | IDA | |
| MIGS-6 | Habitat | Environment | IDA |
| MIGS-6.3 | Salinity | Tolerates 5 % NaCl | IDA |
| MIGS-22 | Oxygen | Not reported | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Non pathogenic, BSL1 | NAS |
| MIGS-4 | Geographic location | Daqing, China | IDA |
| MIGS-5 | Sample collection time | March 2012 | IDA |
| MIGS-4.1 | Latitude | 45°92′N | IDA |
| MIGS-4.2 | Longitude | 124°68′E | IDA |
| MIGS-4.4 | Altitude | Not reported | IDA |
aEvidence codes - IDA inferred from direct assay, TAS traceable author statement (i.e., a direct report exists in the literature), NAS non-traceable author statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are the Gene Ontology project [43]
Fig. 2Phylogenetic tree depicting the relationship between Paenibacillus sp. strain A2 and other members of the genus Paenibacillus. The strains and their corresponding Genbank accession numbers are shown following the organism name and indicated in parentheses. The phylogenetic tree uses 16S rRNA gene sequences aligned by the CLUSTALW [7], and phylogenetic inferences were made using Neighbor-joining method based on Kimura 2-parameter model within the MEGA5 software [8] and rooted with Bacillus subtilis strain DSM10T (AJ276351). Bootstrap consensus trees were inferred from 100 replicates, only bootstrap values >50 % were indicated. The scale bar represents 0.01 nucleotide change per nucleotide position
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One pair-end 450 bp library |
| MIGS-29 | Sequencing platforms | Illumina HiSeq 2000 |
| MIGS-31.2 | Fold coverage | 180.0 × (based on 450 bp library) |
| MIGS-30 | Assemblers | Velvet 1.2.07 |
| MIGS-32 | Gene calling method | Glimmer 3.0 |
| Locus Tag | AA76 | |
| Genbank ID | JFHX00000000 | |
| Genbank Date of Release | April 2, 2014 | |
| GOLD ID | Gi0070607 | |
| BIOPROJECT | PRJNA233560 | |
| MIGS-13 | Source Material Identifier | CGMCC No. 5647 |
| Project relevance | Biotechnology |
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 7,650,246 | 100.00 |
| DNA coding region (bp) | 6,699,198 | 87.57 |
| DNA G+C (bp) | 4,144,410 | 54.2 |
| DNA scafflods | 180 | |
| Total genes | 7578 | 100.00 |
| Protein coding genes | 7575 | 99.96 |
| RNA genes | 49 | 0.65 |
| Pseudo genes | 211 | 2.78 |
| Genes in internal clusters | 203 | 2.68 |
| Genes with function prediction | 5756 | 76 |
| Genes with Pfam domains | 6300 | 83.16 |
| Genes assigned to COGs | 4710 | 62.15 |
| Genes with signal peptides | 405 | 5.34 |
| Genes with transmembrane helices | 1962 | 25.89 |
| CRISPR repeats | 1 | – |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 209 | 2.76 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 746 | 9.85 | Transcription |
| L | 195 | 2.57 | Replication, recombination and repair |
| B | 4 | 0.053 | Chromatin structure and dynamics |
| D | 85 | 1.12 | Cell cycle control, mitosis and meiosis |
| V | 258 | 3.41 | Defense mechanisms |
| T | 474 | 6.26 | Signal transduction mechanisms |
| M | 282 | 3.72 | Cell wall/membrane biogenesis |
| N | 128 | 1.69 | Cell motility |
| Z | 2 | 0.026 | Cytoskeleton |
| U | 57 | 0.75 | Intracellular trafficking and secretion |
| O | 191 | 2.52 | Posttranslational modification, protein turnover, chaperones |
| C | 351 | 4.63 | Energy production and conversion |
| G | 787 | 10.39 | Carbohydrate transport and metabolism |
| E | 796 | 10.51 | Amino acid transport and metabolism |
| F | 179 | 2.36 | Nucleotide transport and metabolism |
| H | 265 | 3.50 | Coenzyme transport and metabolism |
| I | 196 | 2.59 | Lipid transport and metabolism |
| P | 549 | 7.25 | Inorganic ion transport and metabolism |
| Q | 261 | 3.45 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 1046 | 13.81 | General function prediction only |
| S | 407 | 5.37 | Function unknown |
| - | 468 | 6.18 | Not in COGs |
The total is based on the total number of protein coding genes in the annotated genome
Fig. 3Genomic view of prophage regions identified in the genome of Paenibacillus sp. A2
Summary of prophage regions in Paenibacillus sp. A2
| Region | Region length | Completeness | CDS | Specific keyword |
|---|---|---|---|---|
| 1 | 13.2 kb | incomplete | 17 | tail |
| 2 | 43.5 kb | intact | 55 | tail, plate, capsid, protease, portal, terminase, integrase, transposase |
| 3 | 16.9 kb | incomplete | 16 | tail |
| 4 | 23.6 kb | questionable | 23 | tail, capsid, head, portal, terminase |
| 5 | 20.2 kb | incomplete | 21 | terminase, portal, head, capsid, tail |
| 6 | 19.3 kb | incomplete | 21 | tail |
| 7 | 37.7 kb | incomplete | 24 | integrase, tail |
| 8 | 30.5 kb | incomplete | 25 | integrase |
| 9 | 15.9 kb | questionable | 22 | tail,lysin, plate |
Summary of proteins involved in hydrocarbon and sulfur metabolisms
| Protein | Start | Stop | Protein product | Length | Description |
|---|---|---|---|---|---|
| 1 | 3628875 | 3629657 | WP_025849555.1 | 260 | alkanesulfonate transporter permease subunit |
| 2 | 3629626 | 3630852 | WP_025849556.1 | 408 | alkanesulfonate monooxygenase |
| 3 | 6287814 | 6288827 | WP_025846226.1 | 337 | alkane 1-monooxygenase |
| 4 | 1957755 | 1958930 | WP_025851077.1 | 391 | sulfate adenylyltransferase |
| 5 | 2493097 | 2493636 | WP_025850577.1 | 179 | adenylylsulfate kinase |
| 6 | 3634266 | 3635159 | WP_025849561.1 | 297 | sulfate/thiosulfate transporter permease subunit |
| 7 | 3635181 | 3636017 | WP_025849562.1 | 278 | sulfate transporter |
| 8 | 3636039 | 3637142 | WP_025849563.1 | 367 | sulfate transporter subunit |
| 9 | 4328289 | 4330016 | WP_025848942.1 | 575 | sulfate transporter |
| 10 | 5127629 | 5128231 | WP_025847883.1 | 200 | adenylylsulfate kinase |
Fig. 4Circular representation of seven draft Paenibacillus genomes compared against Paenibaciluus sp. A2. The inner rings show GC content (black) and GC skew (purple/green). The remaining rings show BLASTn results of each genome against P. ehimensis A2 (JFHX01000001.1) using the BRIG program. The strains used in the BLASTn were P. elgii B69 (AFHW01000001.1), P. chitinolyticus NBRC15660 (BBJT01000001.1), P. polymyxa ATCC842 (AFOX01000001.1), P. vortex V453 (ADHJ01000001.1), P. curdlanolyticus YK9 (AEDD01000001.1), P. larvae subsp. larvae BRL-230010 (AARF01000001.1) and P. mucilaginosus KNP414 (CP002869.1)