| Literature DB >> 22180809 |
Amrita Pati, Sabine Gronow, Megan Lu, Alla Lapidus, Matt Nolan, Susan Lucas, Nancy Hammon, Shweta Deshpande, Jan-Fang Cheng, Roxanne Tapia, Cliff Han, Lynne Goodwin, Sam Pitluck, Konstantinos Liolios, Ioanna Pagani, Konstantinos Mavromatis, Natalia Mikhailova, Marcel Huntemann, Amy Chen, Krishna Palaniappan, Miriam Land, Loren Hauser, John C Detter, Evelyne-Marie Brambilla, Manfred Rohde, Markus Göker, Tanja Woyke, James Bristow, Jonathan A Eisen, Victor Markowitz, Philip Hugenholtz, Nikos C Kyrpides, Hans-Peter Klenk, Natalia Ivanova.
Abstract
Prevotella multisaccharivorax Sakamoto et al. 2005 is a species of the large genus Prevotella, which belongs to the family Prevotellaceae. The species is of medical interest because its members are able to cause diseases in the human oral cavity such as periodontitis, root caries and others. Although 77 Prevotella genomes have already been sequenced or are targeted for sequencing, this is only the second completed genome sequence of a type strain of a species within the genus Prevotella to be published. The 3,388,644 bp long genome is assembled in three non-contiguous contigs, harbors 2,876 protein-coding and 75 RNA genes and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.Entities:
Keywords: GEBA; Gram-negative; Prevotellaceae; chemoorganotrophic; mesophilic; non-motile; obligately anaerobic; opportunistic pathogen
Year: 2011 PMID: 22180809 PMCID: PMC3236051 DOI: 10.4056/sigs.2164949
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic tree highlighting the position of P. multisaccharivorax relative to the type strains of the other species within the family. The tree was inferred from 1,425 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [11]. Rooting was done initially using the midpoint method [12] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 600 ML bootstrap replicates [13] (left) and from 1,000 maximum parsimony bootstrap replicates [14] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [15] are labeled with one asterisk, those also listed as 'Complete and Published' should be labeled with two asterisks: P. ruminicola [16] and P. melaninogenica (CP002122/CP002123).
Classification and general features of P. multisaccharivorax PPPA20T according the MIGS recommendations [17] and the NamesforLife database [1].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum “ | TAS [ | ||
| Class “ | TAS [ | ||
| Order “ | TAS [ | ||
| Family “ | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain PPPA20 | TAS [ | ||
| Gram stain | negative | TAS [ | |
| Cell shape | rod-shaped | TAS [ | |
| Motility | non-motile | TAS [ | |
| Sporulation | none | TAS [ | |
| Temperature range | mesophilic | TAS [ | |
| Optimum temperature | 37°C | TAS [ | |
| Salinity | physiological | TAS [ | |
| MIGS-22 | Oxygen requirement | obligately anaerobic | TAS [ |
| Carbon source | carbohydrates | TAS [ | |
| Energy metabolism | chemoorganotrophic | TAS [ | |
| MIGS-6 | Habitat | host, human oral microflora | TAS [ |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | opportunistic pathogen | TAS [ |
| Biosafety level | 2 | TAS [ | |
| Isolation | subgingival plaque, chronic periodontitis | TAS [ | |
| MIGS-4 | Geographic location | Japan | TAS [ |
| MIGS-5 | Sample collection time | December 9, 2002 | IDA |
| MIGS-4.1 | Latitude | not reported | |
| MIGS-4.2 | Longitude | not reported | |
| MIGS-4.3 | Depth | not reported | |
| MIGS-4.4 | Altitude | not reported |
Evidence codes - IDA: Inferred from Direct Assay (first time in publication); TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from of the Gene Ontology project [25]. If the evidence code is IDA, the property was directly observed by one of the authors or an expert mentioned in the acknowledgements.
Figure 2Scanning electron micrograph of P. multisaccharivorax PPPA20T
Genome sequencing project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Non-contiguous finished |
| MIGS-28 | Libraries used | Three genomic libraries: one 454 pyrosequence standard library, |
| MIGS-29 | Sequencing platforms | Illumina GAii, 454 GS FLX Titanium |
| MIGS-31.2 | Sequencing coverage | 290.0 × Illumina; 48.0 × pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.3, Velvet 0.7.63, phrap SPS 4.24 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| INSDC ID | AFJE00000000 GL945015-GL945017 | |
| Genbank Date of Release | June 20, 2011 | |
| GOLD ID | Gi05358 | |
| NCBI project ID | 41513 | |
| Database: IMG-GEBA | 2503754046 | |
| MIGS-13 | Source material identifier | DSM 17128 |
| Project relevance | Tree of Life, GEBA |
Genome Statistics
| | | |
|---|---|---|
| Genome size (bp) | 3,388,644 | 100.00% |
| DNA coding region (bp) | 2,970,483 | 87.66% |
| DNA G+C content (bp) | 1,636,375 | 48.31% |
| Number of scaffolds | 3 | |
| Total genes | 2,951 | 100.00% |
| RNA genes | 75 | 2.54% |
| rRNA operons | 4-6 | |
| Protein-coding genes | 2,876 | 97.46% |
| Pseudo genes | 166 | 5.63% |
| Genes in paralog clusters | 438 | 14.84% |
| Genes assigned to COGs | 1,659 | 56.22% |
| Genes assigned Pfam domains | 1,864 | 63.17% |
| Genes with signal peptides | 782 | 26.50% |
| Genes with transmembrane helices | 588 | 19.93% |
| CRISPR repeats | 3 |
Figure 3Graphical map of the largest scaffold. From outside to the center: Genes on forward strand (color by COG categories), Genes on reverse strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
Number of genes associated with the general COG functional categories
| Code | value | %age | Description |
|---|---|---|---|
| J | 138 | 7.7 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.0 | RNA processing and modification |
| K | 102 | 5.7 | Transcription |
| L | 183 | 10.1 | Replication, recombination and repair |
| B | 0 | 0.0 | Chromatin structure and dynamics |
| D | 26 | 1.4 | Cell cycle control, cell division, chromosome partitioning |
| Y | 0 | 0.0 | Nuclear structure |
| V | 46 | 2.6 | Defense mechanisms |
| T | 63 | 3.5 | Signal transduction mechanisms |
| M | 155 | 8.6 | Cell wall/membrane/envelope biogenesis |
| N | 4 | 0.2 | Cell motility |
| Z | 0 | 0.0 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 31 | 1.7 | Intracellular trafficking, secretion, and vesicular transport |
| O | 69 | 3.8 | Posttranslational modification, protein turnover, chaperones |
| C | 90 | 5.0 | Energy production and conversion |
| G | 145 | 8.0 | Carbohydrate transport and metabolism |
| E | 132 | 7.3 | Amino acid transport and metabolism |
| F | 59 | 3.3 | Nucleotide transport and metabolism |
| H | 74 | 4.1 | Coenzyme transport and metabolism |
| I | 56 | 3.1 | Lipid transport and metabolism |
| P | 120 | 6.7 | Inorganic ion transport and metabolism |
| Q | 27 | 1.5 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 202 | 11.2 | General function prediction only |
| S | 82 | 4.6 | Function unknown |
| - | 1,292 | 43.8 | Not in COGs |