| Literature DB >> 27081435 |
Cheikh Ibrahima Lo1, Senthil Alias Sankar2, Bécaye Fall3, Bissoume Sambe-Ba3, Silman Diawara3, Mamadou Wague Gueye3, Oleg Mediannikov1, Caroline Blanc-Tailleur2, Boubacar Wade3, Didier Raoult4, Pierre-Edouard Fournier2, Florence Fenollar1.
Abstract
Strain FF7(T) was isolated from the peritoneal fluid of a 44-year-old woman who suffered from pelvic peritonitis. This strain exhibited a 16S rRNA sequence similarity of 94.8 % 16S rRNA sequence identity with Haemophilus parasuis, the phylogenetically closest species with a name with standing in nomenclature and a poor MALDI-TOF MS score (1.32 to 1.56) that does not allow any reliable identification. Using a polyphasic study made of phenotypic and genomic analyses, strain FF7(T) was a Gram-negative, facultatively anaerobic rod and member of the family Pasteurellaceae. It exhibited a genome of 2,442,548 bp long genome (one chromosome but no plasmid) contains 2,319 protein-coding and 67 RNA genes, including 6 rRNA operons. On the basis of these data, we propose the creation of Haemophilus massiliensis sp. nov. with strain FF7(T) (= CSUR P859 = DSM 28247) as the type strain.Entities:
Keywords: Culturomics; Genome; Haemophilus massiliensis; Taxono-genomics
Year: 2016 PMID: 27081435 PMCID: PMC4831113 DOI: 10.1186/s40793-016-0150-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Classification and general features of Haemophilus massiliensis strain FF7T [13]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain: | TAS [ | |
| Phylum: | TAS [ | ||
| Class: | TAS [ | ||
| Order: | TAS [ | ||
| Family: | TAS [ | ||
| Genus: | TAS [ | ||
| Species: | IDA | ||
| Type strain: FF7T | IDA | ||
| Gram stain | Negative | IDA | |
| Cell shape | Rod | IDA | |
| Motility | Not motile | IDA | |
| Sporulation | Non-spore forming | IDA | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | 37 °C | IDA | |
| pH range; Optimum | 7.2-7.4; 7.3 | ||
| Carbon source | Unknown | NAS | |
| Energy source | Unknown | NAS | |
| MIGS-6 | Habitat | Human peritoneal fluid | IDA |
| MIGS-6.3 | Salinity | Unknown | |
| MIGS-22 | Oxygen requirement | Facultatively anaerobic | IDA |
| MIGS-15 | Biotic relationship | Free living | IDA |
| MIGS-14 | Pathogenicity | Unknown | |
| Biosafety level | 2 | ||
| Isolation | Human | IDA | |
| MIGS-4 | Geographic location | Senegal | IDA |
| MIGS-5 | Sample collection time | June 2013 | IDA |
| MIGS-4.1 | Latitude | 14° 40' N | IDA |
| MIGS-4.1 | Longitude | 17° 26' W | IDA |
| MIGS-4.3 | Depth | Surface | IDA |
| MIGS-4.4 | Altitude | 12 m above sea level | IDA |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [34]. If the evidence is IDA, then the property was directly observed for a live isolate by one of the authors or an expert mentioned in the acknowledgements
Fig. 1Phylogenetic tree showing the position of Haemophilus massiliensis strain FF7T relative to the most closely related type strains other type strains (type = T) within the genus Haemophilus. The GenBank accession numbers for 16S rRNA genes are indicated in parentheses. An asterisk marks strains that have a genome sequence in the NCBI database. Sequences were aligned using MUSCLE [35], and a phylogenetic tree inferred using the Maximum Likelihood method with Kimura 2-parameter model using the MEGA software. Numbers at the nodes are percentages of bootstrap values obtained by repeating the analysis 1,000 times to generate a majority consensus tree. Only bootstrap values equal to or greater than 70 % are displayed. The scale bar represents a rate of substitution per site of 1 %. Escherichia coli strain ATCC 11775T was used as outgroup
Fig. 2Morphology of Haemophilus massiliensis sp. nov. strain FF7T. a: Gram staining. b: Transmission electron microscopy. The scale bar represents 500 nm
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Mate-Pair 3.1 kb library |
| MIGS-29 | Sequencing platforms | Illumina Miseq |
| MIGS-31.2 | Fold coverage | 42.54 |
| MIGS-30 | Assemblers | CLC GENOMICSWB4 |
| MIGS-32 | Gene calling method | Prodigal |
| Locus Tag | Not indicated | |
| Genbank ID | CCFL00000000 | |
| Genbank Date of Release | August 22, 2014 | |
| GOLD ID | Ga0059233 | |
| BIOPROJECT | PRJEB5521 | |
| MIGS-13 | Source material identifier | CSUR P859, DSM 28247 |
| Project relevance | MALDI-TOF-MS implementation in Dakar |
Genome statistics
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 2,442,548 | 100 |
| DNA coding (bp) | 2,181,795 | 89.35 |
| DNA G + C (bp) | 1,123,572 | 46.0 |
| DNA scaffolds | 46 | - |
| Total genes | 2,386 | 100 |
| Protein coding genes | 2,319 | 97.19 |
| RNA genes | 67 | 2.80 |
| Pseudo genes | N/Db | - |
| Gens in internal clusters | N/Db | - |
| Genes with function prediction | 1,885 | 79.00 |
| Genes assigned to COGs | 2,093 | 87.72 |
| Genes with Pfam domains | 1,419 | 59.47 |
| Genes with signal peptides | 188 | 7.87 |
| Genes with transmembrane helices | 445 | 18.65 |
| ORFan genes | 36 | 1.50 |
| CRISPR repeats | 2 | 0.08 |
aThe total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
bN/D = not determined
Fig. 3Graphical circular map of the Haemophilus massiliensis strain FF7T chromosome. From the outside in, the outer two circles show open reading frames oriented in the forward (colored by COG categories) and reverse (colored by COG categories) directions, respectively. The third circle marks the tRNA genes (green). The fourth circle shows the G + C% content plot. The inner-most circle shows GC skew, purple indicating negative values whereas olive for positive values
Number of genes associated with general COG functional categories
| Code | Value | % of totala | Description |
|---|---|---|---|
| J | 160 | 6.90 | Translation |
| A | 1 | 0.04 | RNA processing and modification |
| K | 150 | 6.47 | Transcription |
| L | 131 | 5.65 | Replication, recombination and repair |
| B | 0 | 0 | Chromatin structure and dynamics |
| D | 28 | 1.21 | Cell cycle control, mitosis and meiosis |
| V | 37 | 1.60 | Defense mechanisms |
| T | 46 | 1.98 | Signal transduction mechanisms |
| M | 135 | 5.82 | Cell wall/membrane biogenesis |
| N | 6 | 0.26 | Cell motility |
| W | 9 | 0.39 | Extracellular structures |
| U | 58 | 2.50 | Intracellular trafficking and secretion |
| O | 109 | 4.70 | Posttranslational modification, protein turnover, chaperones |
| C | 163 | 7.03 | Energy production and conversion |
| G | 228 | 9.83 | Carbohydrate transport and metabolism |
| E | 234 | 10.09 | Amino acid transport and metabolism |
| F | 63 | 2.72 | Nucleotide transport and metabolism |
| H | 117 | 5.05 | Coenzyme transport and metabolism |
| I | 62 | 2.67 | Lipid transport and metabolism |
| P | 152 | 6.55 | Inorganic ion transport and metabolism |
| Q | 34 | 1.47 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 258 | 11.13 | General function prediction only |
| S | 174 | 7.50 | Function unknown |
| - | 226 | 9.53 | Not in COGs |
aThe total is based on the total number of protein coding genes in the annotated genome
Fig. 4Distribution of functional classes of predicted genes in the genomes from Haemophilus massiliensis (HM) strain FF7T, H. parasuis (HPA) strain ATCC 19417T, Aggregatibacter segnis (AE) strain ATCC 33393T, H. aegyptius (HA) strain ATCC 11116T, H. ducreyi (HD) strain CIP 54.2, H. haemolyticus (HH) strain ATCC 33390T, H. influenzae (HI) strain ATCC 33391T, H. parahaemolyticus (HP), H. parainfluenzae (HPI) strain ATCC 10014T, H. pittmaniae (HPT) strain HK 85T, and H. sputorum (HS) strain CCUG 13788T chromosomes according to the clusters of orthologous groups of proteins
dDDH values (upper right) and AGIOS values obtained (lower left)
| HMa | HPAb | HIc | HAd | AEe | HHf | HPg | HDh | HSi | HPTj | HPHk | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| HM |
| 0.248 | 0.222 | 0.203 | 0.204 | 0.202 | 0.243 | 0.285 | 0.232 | 0.235 | 0.201 |
| HPA | 70.64 |
| 0.296 | 0.292 | 0.244 | 0.278 | 0.236 | 0.237 | 0.262 | 0.283 | 0.251 |
| HI | 72.76 | 72.75 |
| 0.777 | 0.234 | 0.433 | 0.259 | 0.252 | 0.279 | 0.252 | 0.239 |
| HA | 72.80 | 72.67 | 97.31 |
| 0.231 | 0.434 | 0.237 | 0.244 | 0.271 | 0.242 | 0.235 |
| AE | 74.19 | 71.93 | 75.72 | 75.69 |
| 0.235 | 0.243 | 0.267 | 0.255 | 0.246 | 0.232 |
| HH | 72.72 | 72.64 | 91.85 | 91.80 | 75.72 |
| 0.240 | 0.250 | 0.284 | 0.247 | 0.246 |
| HP | 70.16 | 75.40 | 74.07 | 73.98 | 71.97 | 73.79 |
| 0.239 | 0.218 | 0.293 | 0.227 |
| HD | 70.00 | 74.81 | 72.47 | 72.36 | 71.34 | 72.33 | 75.34 |
| 0.228 | 0.270 | 0.251 |
| HS | 70.23 | 75.06 | 73.46 | 73.39 | 72.33 | 73.54 | 78.00 | 75.68 |
| 0.319 | 0.280 |
| HPT | 72.55 | 71.51 | 76.38 | 76.48 | 74.95 | 76.64 | 71.88 | 71.19 | 72.88 |
| 0.269 |
| HPH | 72.67 | 72.71 | 79.69 | 79.70 | 76.20 | 79.96 | 73.36 | 72.30 | 74.14 | 78.94 |
|
The values printed in bold are gene numbers. Digital DDH similarities between the genomes were calculated using GGDC web server version 2.0. under recommend setting [36, 37]; formula 2 is recommended, particularly for draft genomes. a Haemophilus massiliensis, b Haemophilus parasuis, c Haemophilus influenzae, d Haemophilus aegyptius, e Aggregatibacter segnis, f Haemophilus haemolyticus, g Haemophilus parainfluenzae, h Haemophilus ducreyi, i Haemophilus sputorum, j Haemophilus pittmaniae and k Haemophilus parahaemolyticus