| Literature DB >> 23407456 |
Perrine Hugon1, Ajay Kumar Mishra, Catherine Robert, Didier Raoult, Pierre-Edouard Fournier.
Abstract
We report the properties of a draft genome sequence of the bacterium Anaerococcus vaginalis strain PH9, a species within the Anaerococcus genus. This strain, whose genome is described here, was isolated from the fecal flora of a 26-year-old woman suffering from morbid obesity. A. vaginalis is an obligate anaerobic coccus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,048,125-bp long (one chromosome but no plasmid) and contains 2,095 protein-coding and 38 RNA genes, including three rRNA genes.Entities:
Keywords: Anaerococcus vaginalis; genome
Year: 2012 PMID: 23407456 PMCID: PMC3558966 DOI: 10.4056/sigs.2716452
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain PH9
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Type strain PH9 | |||
| Gram stain | positive | IDA | |
| Cell shape | coccoid | IDA | |
| Motility | nonmotile | IDA | |
| Sporulation | nonsporulating | IDA | |
| Temperature range | mesophile | IDA | |
| Optimum temperature | 37°C | IDA | |
| MIGS-6.3 | Salinity | growth in BHI medium + 5% NaCl | IDA |
| MIGS-22 | Oxygen requirement | anaerobic | IDA |
| Carbon source | unknown | ||
| Energy source | peptones | NAS | |
| MIGS-6 | Habitat | human gut | IDA |
| MIGS-15 | Biotic relationship | free living | IDA |
| MIGS-14 | Pathogenicity | unknown | |
| MIGS-4 | Geographic location | France | IDA |
| MIGS-5 | Sample collection time | January 2011 | IDA |
| MIGS-4.1 | Latitude | 43.296482 | IDA |
| MIGS-4.1 | Longitude | 5.36978 | IDA |
| MIGS-4.3 | Depth | surface | IDA |
| MIGS-4.4 | Altitude | 0 m above sea level | IDA |
Evidence codes – IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [17]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree highlighting the position of strain PH9 relative to other type strains within the genus. GenBank accession numbers are indicated in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences obtained using the maximum-likelihood method and the MEGA software. Numbers at the nodes are bootstrap values obtained by repeating 500 times the analysis to generate a majority consensus tree. The scale bar represents a 1% nucleotide sequence divergence.
Figure 2Gram staining of strain PH9
Figure 3Transmission electron microscopy of strain PH9, using a Morgani 268D (Philips) at an operating voltage of 60kV.The scale bar represents 900 nm.
Figure 4Reference mass spectrum from strain PH9. Spectra from 12 individual colonies were compared and a reference spectrum was generated.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One 454 paired end 3-kb library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 35 |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal |
| INSDC ID | 2000019206 | |
| NCBI project ID | CAGU00000000 | |
| Genbank Date of Release | 31-05-2012 | |
| Gold ID | Gi13719 | |
| MIGS-13 | Project relevance | Study of the human gut microbiome |
Figure 5Graphical circular map of the chromosome. From outside to the center: Genes on the forward strand (colored by COG categories), genes on the reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red), GC content, and GC skew.
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 2,048,125 | |
| DNA coding region (bp) | 1,825,389 | 89.12 |
| DNA G+C content (bp) | 593,956 | 29.60 |
| Total genes | 2,133 | 100 |
| RNA genes | 38 | 1.78 |
| Protein-coding genes | 2,095 | 98.21 |
| Genes with function prediction | 1,556 | 72.94 |
| Genes assigned to COGs | 1,546 | 72.48 |
| Genes with peptide signals | 83 | 3.89 |
| Genes with transmembrane helices | 476 | 22.31 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 144 | 6.87 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 138 | 6.59 | Transcription |
| L | 146 | 6.97 | Replication, recombination and repair |
| B | 1 | 0.05 | Chromatin structure and dynamics |
| D | 19 | 0.91 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 74 | 3.53 | Defense mechanisms |
| T | 47 | 2.24 | Signal transduction mechanisms |
| M | 66 | 3.15 | Cell wall/membrane biogenesis |
| N | 5 | 0.24 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 26 | 1.24 | Intracellular trafficking and secretion |
| O | 63 | 3.01 | Posttranslational modification, protein turnover, chaperones |
| C | 90 | 4.30 | Energy production and conversion |
| G | 123 | 5.87 | Carbohydrate transport and metabolism |
| E | 136 | 6.49 | Amino acid transport and metabolism |
| F | 60 | 2.86 | Nucleotide transport and metabolism |
| H | 59 | 2.82 | Coenzyme transport and metabolism |
| I | 33 | 1.58 | Lipid transport and metabolism |
| P | 96 | 4.58 | Inorganic ion transport and metabolism |
| Q | 24 | 1.15 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 206 | 9.83 | General function prediction only |
| S | 123 | 5.87 | Function unknown |
| - | 542 | 25.41 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome.