| Literature DB >> 24501636 |
Roshan Padmanabhan1, Jean-Christophe Lagier1, Nicole Prisca Makaya Dangui1, Caroline Michelle1, Carine Couderc1, Didier Raoult1, Pierre-Edouard Fournier1.
Abstract
Megasphaera massiliensis strain NP3(T) sp. nov. is the type strain of Megasphaera massiliensis sp. nov., a new species within the genus Megasphaera. This strain, whose genome is described here, was isolated from the fecal flora of an HIV-infected patient. M. massiliensis is a Gram-negative, obligate anaerobic coccobacillus. Here we describe the features of this organism, together with the complete genome sequence and annotation. The 2,661,757 bp long genome (1 chromosome but no plasmid) contains 2,577 protein-coding and 61 RNA genes, including 5 rRNA genes.Entities:
Keywords: Megasphaera massiliensis; culturomics; genome; taxonogenomics
Year: 2013 PMID: 24501636 PMCID: PMC3910696 DOI: 10.4056/sigs.4077819
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of strain NP3T according to the MIGS recommendations [36]
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | IDA | ||
| Type strain NP3T | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | coccobacilli | IDA | |
| Motility | non motile | IDA | |
| Sporulation | nonsporulating | IDA | |
| Temperature range | mesophilic | IDA | |
| Optimum temperature | 37°C | IDA | |
| MIGS-6.3 | Salinity | unknown | |
| MIGS-22 | Oxygen requirement | anaerobic | IDA |
| Carbon source | unknown | ||
| Energy source | unknown | ||
| MIGS-6 | Habitat | human gut | IDA |
| MIGS-15 | Biotic relationship | free living | IDA |
| MIGS-14 | Pathogenicity | unknown | |
| MIGS-4 | Geographic location | France | IDA |
| MIGS-5 | Sample collection time | January 2012 | IDA |
| MIGS-4.1 | Latitude | 43.296482 | IDA |
| MIGS-4.1 | Longitude | 5.36978 | IDA |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 0 m above sea level | IDA |
Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [45]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements.
Figure 1Phylogenetic tree highlighting the position of strain NP3T relative to other type strains within the genus and other members of the family . GenBank accession numbers are indicated in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences obtained using the maximum-likelihood method within the MEGA software. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 500 times to generate a majority consensus tree. was used as outgroup. The scale bar indicates a 1% nucleotide sequence divergence.
Figure 2Gram staining of strain NP3T
Figure 3Transmission electron microscopy of M. massiliensis strain NP3T, using a Morgani 268D (Philips) at an operating voltage of 60kV. The scale bar represents 200 nm.
Differential characteristics of M. massiliensis strain NP3T, strain DSM 20460 and strain AIP 412-00T.†
| | | | |
|---|---|---|---|
| Cell diameter (µm) | 0.87 | 1.5-3.0 | 0.4-0.6 |
| Oxygen requirement | anaerobic | anaerobic | Anaerobic |
| Pigment production | + | + | – |
| Gram stain | – | – | – |
| Motility | – | – | – |
| Endospore formation | – | – | – |
| Indole production | – | na | – |
| Catalase | – | – | – |
| Oxidase | + | + | na |
| Nitrate reductase | na | – | – |
| Urease | – | – | na |
| β-galactosidase | – | – | na |
| N-acetyl-glucosamine | na | – | na |
| Arabinose | w | – | – |
| Ribose | + | – | na |
| Mannose | – | – | – |
| Mannitol | + | + | – |
| Raffinose | – | – | – |
| Sucrose | – | – | – |
| Glycerol | w | – | – |
| Sorbitol | + | – | na |
| Arabitol | + | – | na |
| Galactose | + | + | – |
| D-glucose | + | + | – |
| D-fructose | + | + | – |
| D-maltose | + | + | – |
| D-lactose | + | + | – |
| + | + | – | |
| Habitat | Human gut | Sheep rumen | Liver abscess, whitlow |
na = data not available; w = weak
Figure 4Reference mass spectrum from M. massiliensis strain NP3T. Spectra from 12 individual colonies were compared and a reference spectrum was generated.
Figure 5Gel view comparing the M. massiliensis NP3T spectrum with those of and species. The Gel View displays the raw spectra of all loaded spectrum files arranged in a pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running spectrum number originating from subsequent spectra loading. The peak intensity is expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the relation between the color a peak is displayed and the peak intensity in arbitrary units.
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Paired-end 3kb library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 19 × |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal |
| INSDC ID | PRJEB645 | |
| Genbank ID | CAVO00000000 | |
| Genbank Date of Release | June 4, 2013 | |
| Project relevance | Study of the human gut microbiome |
Figure 6Graphical circular map of the M. massiliensis strain NP3T chromosome. From the outside in: the outer two circles show open reading frames oriented in the forward and reverse (colored by COG categories) directions, respectively. The third circle displays the rRNA gene operon (red) and tRNA genes (green). The fourth circle shows the G+C% content plot. The inner-most circle shows the GC skew, purple and olive indicating negative and positive values, respectively.
Nucleotide content and gene count levels of the genome
| Value | % of totala | |
|---|---|---|
| Genome size (bp) | 2,661,757 | |
| DNA coding region (bp) | 1,479,861 | 93.98 |
| DNA G+C content (bp) | 1,337,412 | 50.2 |
| Coding region (bp) | 1,479,861 | 93.98 |
| Number of replicons | 1 | |
| Extrachromosomal elements | 0 | |
| Total genes | 2,577 | 100 |
| RNA genes | 61 | 2.39 |
| rRNA operons | 2 | |
| Protein-coding genes | 2,516 | 97.63 |
| Genes with function prediction | 1,697 | 65.8 |
| Genes assigned to COGs | 1,892 | 73.41 |
| Genes with peptide signals | 60 | 2.38 |
| Genes with transmembrane helices | 530 | 21.0 |
| CRISPR repeats | 7 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 138 | 5.48 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 120 | 4.77 | Transcription |
| L | 118 | 4.69 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 23 | 0.91 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 28 | 1.11 | Defense mechanisms |
| T | 27 | 1.07 | Signal transduction mechanisms |
| M | 103 | 4.09 | Cell wall/membrane biogenesis |
| N | 0 | 0 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 24 | 0.95 | Intracellular trafficking and secretion |
| O | 52 | 2.07 | Post-translational modification, protein turnover, chaperones |
| C | 147 | 5.84 | Energy production and conversion |
| G | 118 | 4.69 | Carbohydrate transport and metabolism |
| E | 163 | 6.48 | Amino acid transport and metabolism |
| F | 52 | 2.07 | Nucleotide transport and metabolism |
| H | 87 | 3.46 | Coenzyme transport and metabolism |
| I | 46 | 1.83 | Lipid transport and metabolism |
| P | 79 | 3.14 | Inorganic ion transport and metabolism |
| Q | 14 | 0.56 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 217 | 8.62 | General function prediction only |
| S | 141 | 5.60 | Function unknown |
| - | 195 | 7.75 | Not in COGs |
a The total is based on the total number of protein coding genes in the annotated genome.
Orthologous gene comparison and average nucleotide identity of M. massiliensis with other compared genomes †
| Species (GenBank accession number) | | | | | | |
|---|---|---|---|---|---|---|
| | 1,289 | 1,189 | 987 | 999 | 1,159 | |
| 81.84 | | 1,175 | 980 | 989 | 1,145 | |
| 69.44 | 69.01 | | 933 | 939 | 1,167 | |
| 63.68 | 63.08 | 64.92 | | 1,081 | 893 | |
| 62.92 | 62.01 | 64.43 | 67.62 | | 899 | |
| 70.27 | 70.50 | 74.22 | 63.87 | 62.99 | |
†Upper right, numbers of orthologous genes; lower left, mean nucleotide identities of orthologous genes. Bold numbers indicate the numbers of genes or each genome.