| Literature DB >> 28116105 |
M Tidjani Alou1, J Rathored1, T T Nguyen1, C Andrieu1, C Couderc1, S Brah2, B A Diallo3, P-E Fournier1, D Raoult4, G Dubourg1.
Abstract
Paenibacillus phocaensis sp. nov. strain mt24T (= CSUR P2238 = DSM 101777) is a Gram-negative, facultative anaerobic, spore-forming and motile bacilli. This strain was isolated from the stool sample of a healthy infant from Niger. Its genome was estimated to a size of 5 521 415 bp with a 53.54% GC content. It contains 4835 protein-coding genes and 89 RNAs, among which two were 16S rRNA genes. There were also 101 genes (2.09%) identified as ORFans.Entities:
Keywords: Culturomics; Paenibacillus phocaensis; genome; gut microbiota; taxonogenomics
Year: 2016 PMID: 28116105 PMCID: PMC5233791 DOI: 10.1016/j.nmni.2016.12.001
Source DB: PubMed Journal: New Microbes New Infect ISSN: 2052-2975
Fig. 1Phylogenetic tree highlighting position of Paenibacillus phocaensis strain mt24T (= CSUR P2238 = DSM 101777) relative to other close strains. Respective GenBank accession numbers for 16S rRNA genes are indicated in parentheses. Sequences were aligned using CLUSTALW, and phylogenetic inferences were obtained using maximum-likelihood method within MEGA6 software. Numbers at nodes are percentages of bootstrap values obtained by repeating analysis 1000 times to generate majority consensus tree. Clostridium felsineum strain NCIMB 10690 was used as outgroup. Scale bar represents 2% nucleotide sequence divergence.
Classification and general features of Paenibacillus phocaensis strain mt24T
| Property | Term |
|---|---|
| Current classification | Domain: |
| Phylum: | |
| Class: | |
| Order: | |
| Family: | |
| Genus: | |
| Species: | |
| Type strain: mt24 | |
| Gram stain | Negative |
| Cell shape | Bacilli |
| Motility | Motile |
| Sporulation | Sporulating |
| Temperature range | Mesophilic |
| Optimum temperature | 37°C |
Fig. 2Reference mass spectrum from Paenibacillus phocaensis strain mt24T (= CSUR P2238 = DSM 101777). Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 3Reference mass spectrum from Paenibacillus phocaensis strain mt24T (= CSUR P2238 = DSM 101777). Spectra from 12 individual colonies were compared and reference spectrum generated.
Fig. 4Gram staining of Paenibacillus phocaensis strain mt24T (= CSUR P2238 = DSM 101777).
Fig. 5Transmission electron microscopy of Paenibacillus phocaensis strain mt24T (= CSUR P2238 = DSM 101777) using Tecnai G20 transmission electron microscope (FEI Company) at operating voltage of 60 kV. Scale bar = 500 nm.
Cellular fatty acid composition (%) of Paenibacillus phocaensis strain mt24T
| Fatty acid | IUPAC name | |
|---|---|---|
| 15:0 anteiso | 12-methyl-Tetradecanoic acid | 52.0 ± 1.1 |
| 16:00 | Hexadecanoic acid | 10.6 ± 0.4 |
| 16:0 iso | 14-methyl-Pentadecanoic acid | 8.7 ± 0.5 |
| 15:0 iso | 13-methyl-Tetradecanoic acid | 8.5 ± 0.1 |
| 17:0 anteiso | 14-methyl-Hexadecanoic acid | 7.5 ± 0.3 |
| 15:00 | Pentadecanoic acid | 3.9 ± 0.3 |
| 17:0 iso | 15-methyl-Hexadecanoic acid | 3.0 ± 0.1 |
| 14:0 iso | 12-methyl-Tridecanoic acid | 1.8 ± 0.1 |
| 17:00 | Heptadecanoic acid | 1.0 ± 1.1 |
| 18:1n9 | 9-Octadecenoic acid | TR |
| 14:00 | Tetradecanoic acid | TR |
| 18:00 | Octadecanoic acid | TR |
| 18:2n6 | 9,12-Octadecadienoic acid | TR |
| 5:0 anteiso | 2-methyl-Butanoic acid | TR |
| 18:1n6 | 12-Octadecenoic acid | TR |
| 13:00 | Tridecanoic acid | TR |
IUPAC, International Union of Pure and Applied Chemistry; TR, trace amounts <1%.
Mean peak area percentage ± standard deviation.
Differential characteristics of Penicillus phoceensis strain mt24T DSM 101777, Paenibacillus timonensis strain 2301032T CCUG 48216, Paenibacillus sanguinis strain 2301083T CCUG 48214, Paenibacillus barengoltzii strain SAFN-016T ATCC BAA-1209, Paenibacillus xylanilyticus strain XIL14T LMG 21957, Paenibacillus phoenicis strain 3PO2SAT NRRL B-59348, Paenibacillus sabinae strain T27T DSM 17841, Paenibacillus puldeungensis CAU 9324T CCUG 59189, Paenibacillus terrae strain AM141T JCM 11466 and Paenibacillus jamilae strain B.3T DSM 13815 [41], [42], [43], [44], [45], [46], [47]
| Property | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cell diameter (μm) | 0.5–0.4 | 0.5 | 0.5 | 0.5–0.8 | 1.5–1.55 | 1.0–1.5 | 0.7–0.8 | 0.3–0.4 | 1.3–1.8 | 0.5–1.2 |
| Oxygen requirement | + | + | + | + | + | + | + | + | + | + |
| Gram stain | − | + | + | + | + | + | + | + | +/− | +/− |
| Salt requirement | − | − | − | − | + | + | + | + | + | − |
| Motility | + | + | + | + | + | + | + | − | + | + |
| Endospore formation | + | + | + | + | + | + | + | + | + | + |
| Indole | − | NA | NA | − | − | + | − | NA | NA | − |
| Production of: | ||||||||||
| Catalase | + | + | − | + | + | + | + | − | + | + |
| Oxidase | + | − | − | + | − | NA | − | + | − | − |
| Nitrate reductase | − | w | + | − | + | + | + | + | + | + |
| Urease | − | NA | NA | + | − | − | NA | NA | − | NA |
| β-Galactosidase | + | NA | NA | + | + | + | NA | NA | NA | + |
| N-Acetyl-glucosamine | + | + | + | + | + | + | NA | − | w | − |
| Acid from: | ||||||||||
| | − | + | + | + | + | + | NA | + | + | + |
| Ribose | + | − | − | + | NA | + | NA | + | + | + |
| Mannose | + | + | − | + | + | + | NA | − | + | + |
| Mannitol | + | − | + | + | + | + | NA | − | + | + |
| Sucrose | + | − | + | − | + | + | − | + | + | + |
| | + | + | − | − | + | + | − | + | + | + |
| | + | + | + | + | NA | + | − | − | + | + |
| | + | + | + | + | + | + | − | + | + | + |
| | + | + | + | NA | NA | + | − | + | + | + |
| Habitat | Human gut | Blood | Blood | Clean room floor | Air | Floor | Soil | Soil | Soil | Olive oil |
+, positive result; −, negative result; w, weakly positive result; NA, data not available.
Fig. 6Graphical circular map of chromosome. From outside to centre: genes on forward strain colored by COGs categories (only gene assigned to COGs), RNA genes (tRNAs green, rRNAs red), G+C content and G+C skew. COGs, Clusters of Orthologous Groups database.
Nucleotide content and gene count levels of genome
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of total | |
| Size (bp) | 5 521 412 | 100 |
| G+C content (%) | 2 952 929 | 53.54 |
| Coding region (bp) | 4 699 807 | 85.11 |
| Total genes | 4924 | 100 |
| RNA genes | 89 | 1.80 |
| Protein-coding genes | 4835 | 98.19 |
| Genes with function prediction | 3755 | 76.25 |
| Genes assigned to COGs | 3338 | 67.79 |
| Genes with peptide signals | 707 | 14.35 |
| CRISPR repeats | 04 | 0.08 |
| ORFans genes | 101 | 2.05 |
| Genes with transmembrane helices | 1222 | 24.81 |
| Genes associated with PKS or NRPS | 09 | 0.10 |
| No of antibiotic resistance genes | 0 | 0 |
| No. of genes associated Pfam-A domains | 4484 | 91.06 |
COGs, Clusters of Orthologous Groups database.
Total is based on either size of the genome in base pairs or total number of protein-coding genes in annotated genome.
Number of genes associated with 25 general COGs functional categories
| Code | Value | % of total | Description |
|---|---|---|---|
| J | 174 | 2.98 | Translation |
| A | 0 | 0.0 | RNA processing and modification |
| K | 476 | 8.16 | Transcription |
| L | 178 | 3.05 | Replication, recombination and repair |
| B | 1 | 0.02 | Chromatin structure and dynamics |
| D | 33 | 0.57 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.0 | Nuclear structure |
| V | 113 | 1.94 | Defense mechanisms |
| T | 324 | 5.55 | Signal transduction mechanisms |
| M | 207 | 3.55 | Cell wall/membrane biogenesis |
| N | 74 | 1.27 | Cell motility |
| Z | 3 | 0.05 | Cytoskeleton |
| W | 0 | 0.0 | Extracellular structures |
| U | 56 | 0.96 | Intracellular trafficking and secretion |
| O | 106 | 1.82 | Posttranslational modification, protein turnover, chaperones |
| C | 161 | 2.76 | Energy production and conversion |
| G | 619 | 10.61 | Carbohydrate transport and metabolism |
| E | 309 | 5.30 | Amino acid transport and metabolism |
| F | 81 | 1.39 | Nucleotide transport and metabolism |
| H | 116 | 1.99 | Coenzyme transport and metabolism |
| I | 90 | 1.54 | Lipid transport and metabolism |
| P | 251 | 4.30 | Inorganic ion transport and metabolism |
| Q | 83 | 1.42 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 550 | 9.43 | General function prediction only |
| S | 316 | 5.42 | Function unknown |
| — | 417 | 7.14 | Not in COGs |
COGs, Clusters of Orthologous Groups database.
Total is based on total number of protein-coding genes in annotated genome.
Genome comparison of closely related species to Paenibacillus phocaensis strain mt24T
| Organism | INSDC | Size | G+C | Protein-coding genes | Total genes |
|---|---|---|---|---|---|
| FCOQ00000000 | 5.52 | 53.54 | 4835 | 4924 | |
| 4.78 | 51.9 | 4145 | 4284 | ||
| JMQA00000000.1 | 7.34 | 52.6 | 6166 | 6474 | |
| 5.76 | 45.5 | 4715 | 4928 | ||
| 4.8 | 49.3 | 4238 | 4388 | ||
| 5.27 | 52.6 | 4634 | 4857 | ||
| 8.61 | 51.4 | 6698 | 6945 |
INSDC, International Nucleotide Sequence Database Collaboration.
Fig. 7Distribution of functional classes of predicted genes according to COGs of protein. COGs, Clusters of Orthologous Groups database.
Numbers of orthologous protein shared between genomes (upper right)a
| 2922 | 2651 | 2946 | 2217 | 2526 | 2192 | ||
| 83.07 | 2514 | 2793 | 2170 | 2421 | 2139 | ||
| 68.73 | 68.48 | 2747 | 2453 | 2394 | 2481 | ||
| 77.60 | 76.40 | 68.68 | 2414 | 2568 | 2254 | ||
| 67.18 | 67.44 | 67.79 | 67.09 | 2093 | 2167 | ||
| 75.32 | 74.88 | 68.44 | 74.05 | 67.62 | 1981 | ||
| 69.32 | 68.85 | 74.05 | 69.49 | 67.57 | 68.31 |
Average percentage similarity of nucleotides corresponding to orthologous protein shared between genomes (lower left) and numbers of proteins per genome (bold).
Pairwise comparison of Paenibacillus phocaensis strain mt24T with other species using GGDC, formula 2 (DDH estimates based on identities/HSP length),a upper right
| 100 ± 00 | 25.6 ± 2.41 | 22.3 ± 2.36 | 22.7 ± 2.37 | 25.8 ± 2.41 | 20.3 ± 2.23 | 22.2 ± 2.36 | |
| 100 ± 00 | 22.6 ± 2.36 | 21.1 ± 2.33 | 26.5 ± 2.42 | 19.3 ± 2.29 | 22.8 ± 2.37 | ||
| 100 ± 00 | 22.8 ± 2.37 | 25.7 ± 2.41 | 19.6 ± 2.30 | 20.4 ± 2.32 | |||
| 100 ± 00 | 27.4 ± 2.43 | 19.9 ± 2.31 | 23.1 ± 2.37 | ||||
| 100 ± 00 | 19.6 ± 2.30 | 30.8 ± 2.45 | |||||
| 100 ± 00 | 19.9 ± 2.30 | ||||||
| 100 ± 00 |
DDH, DNA-DNA hybridization; GGDC, Genome-to-Genome Distance Calculator; HSP, high-scoring segment pairs.
Confidence intervals indicate inherent uncertainty in estimating DDH values from intergenomic distances based on models derived from empirical test data sets (which are always limited in size). These results are in accordance with the 16S rRNA (Fig. 1) and phylogenomic analyses as well as GGDC results.