| Literature DB >> 30128156 |
M Dahmani1, G Diatta2, N Labas1, A Diop1, H Bassene2, D Raoult1, L Granjon3, F Fenollar4, O Mediannikov1.
Abstract
Bartonella mastomydis sp. nov. strain 008 is the type strain of B. mastomydis sp. nov., a new species within the genus Bartonella. This strain was isolated from Mastomys erythroleucus rodents trapped in the Sine-Saloum region of Senegal. Here we describe the features of this organism, together with the complete genome sequence and its annotation. The 2 044 960 bp long genomes with 38.44% G + C content contains 1674 protein-coding and 42 RNA genes, including three rRNA genes.Entities:
Keywords: Bartonella mastomydis sp. nov.; Mastomys erythroleucus; Senegal; complete genome; rodents
Year: 2018 PMID: 30128156 PMCID: PMC6098214 DOI: 10.1016/j.nmni.2018.03.005
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Classification and general features of Bartonella massiliensis strain 008.
| MIGS ID | Property | Term | Evidence code |
|---|---|---|---|
| Current classification | Domain | TAS | |
| Phylum | TAS | ||
| Class | TAS | ||
| Order | TAS | ||
| Family | TAS | ||
| Genus | TAS | ||
| Species | IDA | ||
| Type strain 008 | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Nonmotile | IDA | |
| Sporulation | Nonsporulating | IDA | |
| Temperature range | Mesophilic | IDA | |
| Optimum temperature | 37°C | IDA | |
| MIGS-22 | Oxygen requirement | Aerobic | IDA |
| Carbon source | Unknown | IDA | |
| Energy source | Unknown | IDA | |
| MIGS-6 | Habitat | IDA | |
| MIGS-15 | Biotic relationship | Facultative intracellular | IDA |
| Pathogenicity | Unknown | IDA | |
| Biosafety level | 3 | IDA | |
| MIGS-14 | Isolation | IDA | |
| MIGS-4 | Geographic location | Senegal | IDA |
| MIGS-5 | Sample collection | February 2013 | IDA |
| MIGS-4.2 | Latitude | 14°03′N | IDA |
| MIGS-4.3 | Longitude | 15°31′W | IDA |
| MIGS-4.4 | Altitude | 8 m | IDA |
MIGS, Minimum Information About a Genome Sequence.
Evidence codes are as follows: IDA, inferred from direct assay; TAS, traceable author statement (i.e. a direct report exists in the literature); NAS, nontraceable author statement (i.e. not directly observed for the living, isolated sample, but based on a generally accepted property for the species or anecdotal evidence). These evidence codes are from the Gene Ontology project (http://www.geneontology.org/GO.evidence.shtml). If the evidence code is IDA, then the property should have been directly observed, for the purpose of this specific publication, for a live isolate by one of the authors, or by an expert or reputable institution mentioned in the acknowledgements.
Fig. 1Evolutionary history of sequenced samples was inferred using maximum likelihood method implemented in MEGA7 [40] and based on concatenated gltA, rpoB, 16S RNA and ftsZ (total length of 2731 bp) sequences. Sequences of gltA, rpoB,16S RNA and ftsZ genes used for comparison were obtained from the GenBank database (https://www.ncbi.nlm.nih.gov/genbank/). Sequences were aligned using BioEdit [41]. Firstly, for each gene individually, sequences we used for comparison were first aligned using CLUSTALW. All positions containing gaps and missing data were eliminated manually; then each alignment was concatenated and second alignment was performed. Evolutionary history was inferred by maximum likelihood method based on Hasegawa-Kishino-Yano model. Percentage of trees in which associated taxa clustered together is shown next to branches. Initial tree for heuristic search was obtained automatically by applying the Neighbor Joining and BioNJ algorithms to matrix of pairwise distances estimated using maximum composite likelihood (MCL) approach and then selecting topology with superior log likelihood value. Discrete gamma distribution was used to model evolutionary rate differences among sites (two categories (+G, parameter = 0.2144)). Tree is drawn to scale, with branch lengths measured in number of substitutions per site. Statistical support for internal branches of trees was evaluated by bootstrapping with 1000 iterations. Analysis involved 39 nucleotide sequences.
Fig. 2Reference mass spectrum from Bartonella mastomydis strain 008. Spectra from 12 individual colonies were compared and reference spectrum was generated.
Fig. 3Gel view comparing Bartonella mastomydis strain 008 spectra with other members of Bartonella genus. Gel view displays raw spectra of loaded spectrum files arranged in pseudo–gel-like look. X-axis records m/z value. Left y-axis displays running spectrum number originating from subsequent spectra loading. Peak intensity is expressed by greyscale scheme code. Colour bar and right y-axis indicate relation between colour in which peak is displayed and peak intensity in arbitrary units. Displayed species are indicated at left.
Fig. 4Transmission electron microscopy of Bartonella mastomydis strain 008 using Tecnai G20 device (FEI Company, Limeil-Brévannes, France) at operating voltage of 200 keV. Scale bar = 200 nm.
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | One paired-end 3 kb library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 30× |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-12 | Gene calling method | Prodigal |
| GenBank ID | GCA_900185775 | |
| MIGS-13 | Project relevance | Biodiversity of |
MIGS, Minimum Information About a Genome Sequence.
Fig. 5Graphical circular map of chromosome. From outside to centre: genes on forward strand coloured by COGs categories (only genes assigned to COGs), genes on reverse strand coloured by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), G + C content and G + C skew. COGs, Clusters of Orthologous Groups database.
Number of gene associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 157 | 9.38 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 58 | 3.46 | Transcription |
| L | 73 | 4.36 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 17 | 1.02 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 21 | 1.25 | Defense mechanisms |
| T | 37 | 2.21 | Signal transduction mechanisms |
| M | 74 | 4.42 | Cell wall/membrane biogenesis |
| N | 4 | 0.24 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 42 | 2.51 | Intracellular trafficking and secretion |
| O | 74 | 4.42 | Posttranslational modification, protein turnover, chaperones |
| X | 25 | 1.49 | Mobilome: prophages, transposons |
| C | 74 | 4.42 | Energy production and conversion |
| G | 56 | 3.35 | Carbohydrate transport and metabolism |
| E | 106 | 6.33 | Amino acid transport and metabolism |
| F | 47 | 2.81 | Nucleotide transport and metabolism |
| H | 63 | 3.76 | Coenzyme transport and metabolism |
| I | 44 | 2.63 | Lipid transport and metabolism |
| P | 57 | 3.41 | Inorganic ion transport and metabolism |
| Q | 15 | 0.89 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 74 | 4.42 | General function prediction only |
| S | 68 | 4.06 | Function unknown |
| — | 603 | 36.02 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of Total | |
| Size (bp) | 2 044 960 | 100 |
| G + C content (bp) | 785 960 | 38.44 |
| Coding region | 1 555 569 | 76.07 |
| Total gene | 1716 | 100 |
| RNA genes | 42 | 2.45 |
| Protein-coding genes | 1674 | 100 |
| Protein assigned to COGs | 1071 | 63.99 |
| Protein with peptide signals | 263 | 15.71 |
| Genes with transmembrane helices | 372 | 22.22 |
COGs, Clusters of Orthologous Groups database.
Total is based on either the size of the genome in base pairs of the total of protein-coding genes in the annotated genome.
Fig. 6Distribution of functional classes of predicted genes according to clusters of orthologous groups of proteins.
Numbers of orthologous protein shared between Bartonella genomes (upper right)a
| 1115 | 1121 | 1154 | 1043 | 1143 | 1144 | ||
| 0.66 | 1134 | 1164 | 1057 | 1148 | 1154 | ||
| 0.67 | 0.83 | 1210 | 1081 | 1201 | 1201 | ||
| 0.80 | 0.66 | 0.66 | 1136 | 1257 | 1258 | ||
| 0.66 | 0.82 | 0.83 | 0.70 | 1114 | 1115 | ||
| 0.66 | 0.82 | 0.84 | 0.70 | 0.90 | 1264 | ||
| 0.66 | 0.82 | 0.84 | 0.70 | 0.90 | 0.96 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Bartonella mastomydis with six other species using GGDC, formula 2 (DDH estimates based on identities/HSP length)a
| 100% ± 00 | 25.8% ± 2.45 | 27.1% ± 2.45 | 25.8% ± 2.4 | 25.9% ± 2.4 | 25.6% ± 2.4 | 25.5% ± 2.4 | |
| 100% ± 00 | 25.5% ± 2.4 | 25.1% ± 2.4 | 27.5% ± 2.45 | 24.4% ± 2.4 | 24.2% ± 2.4 | ||
| 100% ± 00 | 26.7% ± 2.4 | 26.3% ± 2.45 | 26.8% ± 2.4 | 26.7% ± 2.4 | |||
| 100% ± 00 | 42% ± 2.55 | 37.3% ± 2.45 | 36.8% ± 2.5 | ||||
| 100% ± 00 | 37.6% ± 2.45 | 37.3% ± 2.5 | |||||
| 100% ± 00 | 60.3% ± 2.8 | ||||||
| 100% ± 00 |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with phylogenomic analyses as well as the GGDC results.