| Literature DB >> 26516406 |
Madan Junghare1, Yogita Patil2, Bernhard Schink2.
Abstract
Azoarcus sp. strain PA01(T) belongs to the genus Azoarcus, of the family Rhodocyclaceae within the class Betaproteobacteria. It is a facultatively anaerobic, mesophilic, non-motile, Gram-stain negative, non-spore-forming, short rod-shaped bacterium that was isolated from a wastewater treatment plant in Constance, Germany. It is of interest because of its ability to degrade o-phthalate and a wide variety of aromatic compounds with nitrate as an electron acceptor. Elucidation of the o-phthalate degradation pathway may help to improve the treatment of phthalate-containing wastes in the future. Here, we describe the features of this organism, together with the draft genome sequence information and annotation. The draft genome consists of 4 contigs with 3,908,301 bp and an overall G + C content of 66.08 %. Out of 3,712 total genes predicted, 3,625 genes code for proteins and 87 genes for RNAs. The majority of the protein-encoding genes (83.51 %) were assigned a putative function while those remaining were annotated as hypothetical proteins.Entities:
Keywords: Azoarcus sp. strain PA01T; Betaproteobacteria; Rhodocyclaceae; anaerobic degradation; o-phthalate degradation; pollutant; wastewater treatment plant
Year: 2015 PMID: 26516406 PMCID: PMC4625480 DOI: 10.1186/s40793-015-0079-9
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1General characteristics of Azoarcus sp. strain PA01T. a Phase contrast micrograph of strain PA01T, b Scanning electron micrograph of strain PA01T, c Agarose gel (1 %) electrophoresis of isolated genomic DNA (gDNA) of PA01T. Lane 1, 1 kb DNA marker; lane 2, gDNA before RNase treatment; lane 3, high quality gDNA after RNase treatment
Classification and general features of Azoarcus sp. strain PA01T according to the MIGS recommendations [28]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class Betaproteobacteria | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Strain: PA01T | IDA | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Short rods | IDA | |
| Motility | Non-motile | IDA | |
| Sporulation | Not-reported | IDA | |
| Temperature range | 25–37 °C | IDA | |
| Optimum temperature | 30 °C | IDA | |
| pH range; Optimum | 6–8; 7.3 ± 0.2 | TAS [ | |
| Carbon source | o-phthalate, benzoate, 3 hydroxy-benzoate, 3,4 di-hydroxy-benzoate,, sugars, fatty acids, alcohols, amino acids etc. | IDA | |
| MIGS-6 | Habitat | Freshwater, sewage sludge | TAS [ |
| MIGS-6.3 | Salinity | Not reported | |
| MIGS-22 | Oxygen requirement | anaerobic/aerotolerant | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | None | IDA |
| MIGS-4 | Geographic location | Constance, Germany | IDA |
| MIGS-5 | Sample collection | 2012 | IDA |
| MIGS-4.1 | Latitude | 47.67° N | IDA |
| MIGS-4.2 | Longitude | 9.14° E | IDA |
| MIGS-4.4 | Altitude | 397 m | IDA |
aEvidence codes
IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [50]. If the evidence code is IDA, the property was directly observed by one of the authors or an expert mentioned in the acknowledgments
Fig. 2Neighbor-joining phylogenetic tree generated using MEGA4 software package based on 16S rRNA gene sequences. The phylogenetic tree shows the exact position of strain PA01T and the three Azoarcus spp. (in bold) whose genome sequence are published, along with other representatives of the genus Azoarcus. The corresponding 16S rRNA gene accession numbers are given in parenthesis. Bootstrap values are calculated from 1000 repeats; bar, 0.02 substitutions per nucleotide position
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | High quality draft |
| MIGS-28 | Libraries used | 8–12 kb PacBio library |
| MIGS 29 | Sequencing platforms | PacBio RS |
| MIGS 31.2 | Fold coverage (sequencing depth) | 97.42 |
| MIGS 30 | Assemblers | HGPA3 |
| MIGS 32 | Gene calling method | Prodigal |
| Locus Tag | PA01_ | |
| GenBank ID | LARU00000000.1 | |
| GenBank Date of Release | August 26, 2015 | |
| GOLD project ID | Gp0109270 | |
| IMG taxon ID | 2596583641 | |
| BIOPROJECT | PRJNA279928 | |
| MIGS 13 | Source material identifier | KCTC 15483T |
| Project relevance | Degradation of aromatic compounds |
Fig. 3Graphical representation of different scaffolds of the genome of Azoarcus sp. strain PA01T. a Graphical map of Azoarcus sp. PA01T genome_PA01_unitig_3_quiver.1. From bottom to top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black) and GC skew (purple). b Graphical map of Azoarcus sp. PA01T genome_PA01_unitig_0_quiver.2. From bottom to top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black) and GC skew (purple). c Graphical map of Azoarcus sp. PA01T genome_PA01_unitig_2_quiver.3. From bottom to top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black) and GC skew (purple). d Graphical map of Azoarcus sp. PA01T genome_PA01_unitig_1_quiver.4. From bottom to top: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content (black) and GC skew (purple)
Genome statistics
| Attribute | Value | % of Totala |
|---|---|---|
| Genome size (bp) | 3,908,237 | 100.00 % |
| DNA coding (bp) | 3,511,692 | 89.95 % |
| DNA G + C (bp) | 2,582,614 | 66.08 % |
| DNA scaffolds | 4 | |
| Total genes | 3,712 | 100.00 % |
| Protein coding genes | 3,625 | 97.66 % |
| RNA genes | 87 | 2.43 % |
| Pseudo genes | 13 | 0.35 % |
| Genes with function predictions | 3,100 | 83.51 % |
| Genes without function prediction | 525 | 14.14 % |
| Genes assigned to COGs | 2,579 | 69.48 % |
| Genes with Pfam domains | 3,178 | 85.61 |
| Genes with signal peptides | 311 | 8.38 % |
| Genes with transmembrane helices | 829 | 22.33 % |
| CRISPR repeats | 1 |
aThe total is based on either the size of the genome in the base pairs or the total number of protein coding genes in the annonated genome
Number of genes associated with general COG functional categories
| Code | Value | % age | Description |
|---|---|---|---|
| J | 201 | 6.93 | Translation, ribosomal structure and biogenesis |
| A | 1 | 0.03 | RNA processing and modification |
| K | 141 | 4.86 | Transcription |
| L | 111 | 3.83 | Replication, recombination and repair |
| B | 1 | 0.03 | Chromatin structure and dynamics |
| D | 35 | 1.21 | Cell cycle control, Cell division, chromosome partitioning |
| V | 55 | 1.90 | Defense mechanisms |
| T | 159 | 5.48 | Signal transduction mechanisms |
| M | 195 | 6.73 | Cell wall/membrane biogenesis |
| N | 87 | 3.00 | Cell motility |
| U | 67 | 2.31 | Intracellular trafficking and secretion |
| O | 154 | 5.24 | Posttranslational modification, protein turnover, chaperones |
| C | 250 | 8.62 | Energy production and conversion |
| G | 111 | 3.83 | Carbohydrate transport and metabolism |
| E | 230 | 7.93 | Amino acid transport and metabolism |
| F | 66 | 2.28 | Nucleotide transport and metabolism |
| H | 165 | 5.69 | Coenzyme transport and metabolism |
| I | 187 | 6.45 | Lipid transport and metabolism |
| P | 166 | 5.73 | Inorganic ion transport and metabolism |
| Q | 79 | 2.73 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 210 | 7.24 | General function prediction only |
| S | 146 | 5.04 | Function unknown |
| - | 1113 | 30.52 | Not in COGs |
The total is based on the total number of protein coding genes predicted in the genome