| Literature DB >> 32010063 |
Radhey S Gupta1, Sudip Patel1.
Abstract
The family Caryophanaceae/Planococcaceae is a taxonomically heterogeneous assemblage of >100 species classified within 13 genera, many of which are polyphyletic. Exhibiting considerable phylogenetic overlap with other families, primarily Bacillaceae, the evolutionary history of this family, containing the potent mosquitocidal species Lysinibacillus sphaericus, remains incoherent. To develop a reliable phylogenetic and taxonomic framework for the family Caryophanaceae/Planococcaceae and its genera, we report comprehensive phylogenetic and comparative genomic analyses on 124 genome sequences from all available Caryophanaceae/Planococcaceae and representative Bacillaceae species. Phylogenetic trees were constructed based on multiple datasets of proteins including 819 core proteins for this group and 87 conserved Firmicutes proteins. Using the core proteins, pairwise average amino acid identity was also determined. In parallel, comparative analyses on protein sequences from these species have identified 92 unique molecular markers (synapomorphies) consisting of conserved signature indels that are specifically shared by either the entire family Caryophanaceae/Planococcaceae or different monophyletic clades present within this family, enabling their reliable demarcation in molecular terms. Based on multiple lines of investigations, 18 monophyletic clades can be reliably distinguished within the family Caryophanaceae/Planococcaceae based on their phylogenetic affinities and identified molecular signatures. Some of these clades are comprised of species from several polyphyletic genera within this family as well as other families. Based on our results, we are proposing the creation of three novel genera within the family Caryophanaceae/Planococcaceae, namely Metalysinibacillus gen. nov., Metasolibacillus gen. nov., and Metaplanococcus gen. nov., as well as the transfer of 25 misclassified species from the families Caryophanaceae/Planococcaceae and Bacillaceae into these three genera and in Planococcus, Solibacillus, Sporosarcina, and Ureibacillus genera. These amendments establish a coherent taxonomy and evolutionary history for the family Caryophanaceae/Planococcaceae, and the described molecular markers provide novel means for diagnostic, genetic, and biochemical studies. Lastly, we are also proposing a consolidation of the family Planococcaceae within the emended family Caryophanaceae.Entities:
Keywords: Lysinibacillus; Metalysinibacillus gen. nov.; Metaplanococcus gen. nov.; Metasolibacillus gen. nov.; Planococcaceae and Caryophanaceae families; conserved signature indels for different clades; emended descriptions of the Caryophanaceae/Planococcaceae genera; phylogenomic and comparative genomic analyses
Year: 2020 PMID: 32010063 PMCID: PMC6971209 DOI: 10.3389/fmicb.2019.02821
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Maximum-likelihood phylogenetic trees for 124 genome sequenced members of the family Caryophanaceae/Planococcaceae and some representative members of the family “Bacillaceae” based on (A) 819 core proteins for this group of species, and (B) a set of 87 conserved proteins that are part of the phyloeco marker set for the phylum Firmicutes (Wang and Wu, 2013). Both trees were rooted using genome sequences of Streptococcus pyogenes, Streptococcus mitis, Lactococcus piscium, and Lactococcus lactis (labeled as Lactobacillales). SH-like statistical support values are indicated at each branch node. All clades observed in this study are labeled and presented with square brackets. The specific clades of interest are indicated in bold. A superscript “T” indicates the type species of a specific genus and the asterisk (∗) indicates the genome of Psychrobacillus quisquiliarum seems to be contaminated. The scale bars at the bottom represent 0.05 changes per amino acid position for each tree.
FIGURE 2Maximum-likelihood phylogenetic tree for 109 Caryophanaceae/Planococcaceae and Bacillaceae species based on 16S rRNA gene sequences retrieved from the All-Species Living Tree Project (Yilmaz et al., 2014). The evolutionary history was inferred based on the Kimura 2-parameter model (Kimura, 1980). The tree is drawn to scale, with branch lengths measured in the number of substitutions per site. The scale bar at the bottom represents 0.05 changes per nucleotide position. All species which had their 16S rRNA gene sequence of their type strains are indicated by (T) followed by the name of the strains. The proposed clades seen in this tree are labeled with square brackets and all non-genome-sequenced species which are part of these clades are highlighted in red with the species which we have proposed be transferred labeled with an asterisk (∗). This tree is unrooted.
FIGURE 3A matrix indicating the percentage of average amino acid identities shared by members of the family Caryophanaceae/Planococcaceae analyzed in this study based on 819 core proteins for this set of species. Higher amino acid identity shared by a pair of species are colored more darkly (red). The specific clades observed based on higher interspecies similarity are boxed with blue borders and labeled with arrows. The mean and standard deviation in AAI values for different clades of interest are indicated. The “Caryophanon” and “Flavidum” clades have only 1 AAI similarity values and hence the mean and standard deviation values for these clades could not be determined. The family Bacillaceae and the genus Jeotgalibacillus are indicated with boxes with black borders. A detailed amino acid matrix with the numerical values underlying this amino acid matrix and the species names is provided in Supplementary Figure 2.
FIGURE 4Partial sequence alignment of the phenylalanine–tRNA ligase subunit alpha protein showing a one amino acid deletion (boxed) that is exclusively shared by all members of the emended family Caryophanaceae. Sequence information for a limited number of Caryophanaceae species and other bacteria are shown here, but unless otherwise indicated, similar CSIs were detected in all members of the indicated group and not detected in any other species in the top 1000 BLASTp hits. The dashes (-) in this alignment and all other alignments presented in this paper indicate identity with the residue in their respective top sequences. Accession numbers for each sequence are indicated in the second column. Detailed sequence alignments for this CSI as well as additional CSIs specific for the family Caryophanaceae are presented in Supplementary Figures 3–15 and some of their characteristics are summarized in Table 1.
Summary of conserved signature indels specific for the family Caryophanaceae.
| Phenylalanine– tRNA ligase subunit alpha | WP_121176350 | 1 aa del | 159–209 | Family | |
| Chaperonin GroELa | WP_036075467 | 2 aa ins | 401–454 | ||
| Ribosome maturation factor RimP | WP_057986972 | 1 aa ins | 21–65 | ||
| BrxA/BrxB family bacilliredoxin | WP_057987839 | 1 aa ins | 29–82 | ||
| RNA methyltransferase | WP_057989758 | 1 aa ins | 4–41 | ||
| Rhomboid family intramembrane serine protease | WP_029500435 | 1 aa del | 123–171 | ||
| ATP-dependent Clp protease ATP-binding subunit | WP_057986366 | 1 aa ins | 40–99 | ||
| DNA-directed RNA polymerase subunit beta | WP_057986323 | 27 aa ins | 583–625 | ||
| Chorismate synthasea | WP_057982230 | 2 aa ins | 148–213 | ||
| Stage IV sporulation protein A | WP_057982892 | 2 aa ins | 53–101 | ||
| Peptidase | WP_057987802 | 6 aa del | 221–272 | ||
| KinB-signaling pathway activation proteina | WP_057986232 | 3 aa ins | 153–196 | ||
| DUF423 domain-containing protein | WP_042478071 | 2–4 aa ins | 39–92 |
FIGURE 5Partial sequence alignment of (A) the bacillithiol biosynthesis deacetylase BshB2 protein showing a one amino acid insertion (boxed) that is exclusively shared by all members of the Lysinibacillus sensu stricto clade, and (B) the arginine-binding extracellular protein ArtP precursor containing an 11 amino acid insertion that is exclusively shared by all members of the Jejuensis clade. Detailed sequence alignments for these CSIs as well as additional CSIs specific for these clades are presented in Supplementary Figures 16–21 for the Lysinibacillus sensu stricto clade and Supplementary Figures 22–38 for the Jejuensis clade and some of their characteristics are summarized in Table 2.
Summary of conserved signature indels specific for the members of the Lysinibacillus sensu stricto clade and the Jejuensis clade.
| Bacillithiol biosynthesis deacetylase BshB2 | WP_049667946 | 1 aa ins | 12–56 | ||
| PIN/TRAM domain-containing protein | WP_036077775 | 2 aa ins | 225–285 | ||
| Flagellar assembly protein FliH | WP_016994577 | 1 aa ins | 205–254 | ||
| PDZ domain-containing protein | WP_049668118 | 3 aa ins | 412–467 | ||
| TrkH family potassium uptake protein | WP_108029457 | 2 aa ins | 408–451 | ||
| WP_036080147 | 21–22 aa ins | 197–262 | |||
| Arginine-binding extracellular protein ArtP precursor | CEA00092 | 11 aa ins | 128–172 | Jejuensis clade | |
| oxygen-independent coproporphyrinogen III oxidase | WP_108307498 | 3 aa del | 218-268 | ||
| Putative hydrolase MhqD | CEA00796 | 1 aa del | 110-160 | ||
| helix-turn-helix transcriptional regulator | WP_108306939 | 1 aa del | 82-136 | ||
| tRNA preQ1(34) S-adenosylmethionine ribosyltransferase-isomerase QueA | WP_108306792 | 1 aa del | 91-136 | ||
| DNA primase | WP_108306840 | 1 aa del | 497-535 | ||
| FMN reductase (NADPH) | CEA04024 | 1 aa del | 162–210 | ||
| UvrABC system protein C | CEA00739 | 1 aa del | 57–101 | ||
| Sensor histidine kinase YycG | CDZ99298 | 2 aa ins | 298–344 | ||
| Hypothetical protein BN1050_02162 | CEA04824 | 1 aa del | 68–114 | ||
| Ribonuclease Y | CEA04602 | 4 aa ins | 20–69 | ||
| Hypothetical protein BN1050_01309 | CEA02597 | 2 aa del | 55–111 | ||
| Cell division protein FtsAa | WP_108305708 | 2 aa del | 64–128 | ||
| ABC transporter ATP-binding protein YtrBa | CEA04564 | 1 aa ins | 153–206 | ||
| Cysteine–tRNA ligasea | CDZ99411 | 1 aa del | 318–365 | ||
| Coproporphyrinogen III oxidasea | WP_036144197 | 1 aa del | 435–478 | ||
| PBP1A family penicillin-binding proteina | WP_108306067 | 1 aa ins | 529–571 |
FIGURE 6Partial sequence alignments of (A) the MFS transporter protein showing a one amino acid insertion (boxed) that is exclusively shared by all members of the Ureibacillus clade and (B) the DUF456 domain-containing protein containing a one amino acid deletion (boxed) that is exclusively shared by all members within the Meyeri clade. Detailed sequence alignments for these CSIs as well as additional CSIs specific for these clades are presented in Supplementary Figures 39–41 for the Ureibacillus clade and Supplementary Figures 42–53 for the Meyeri clade and some of their characteristics are summarized in Table 3.
Summary of conserved signature indels specific for the Ureibacillus clade, the Meyeri clade, the Solibacillus clade, and the Sporosarcina clade.
| MFS transporter | WP_121213073 | 1 aa ins | 179–225 | Ureibacillus clade | |
| EamA family transporter | WP_126296406 | 1 aa del | 177–229 | ||
| DNA internalization-related competence protein ComEC/Rec2a | WP_121213400 | 1 aa del | 260–320 | ||
| DUF456 domain-containing protein | WP_107841432 | 1 aa del | 23–62 | Meyeri clade | |
| Toxic anion resistance protein | WP_107942781 | 6 aa del | 178–230 | ||
| Undecaprenyldiphospho-muramoylpentapeptide beta- | WP_107839309 | 1 aa ins | 164–228 | ||
| c-Type cytochrome biogenesis protein CcsB | WP_107841857 | 15 aa ins | 215–275 | ||
| Thiol-disulfide oxidoreductase ResA | WP_042477869 | 1 aa ins | 123–165 | ||
| Hypothetical protein | WP_066164326 | 1 aa del | 5–46 | ||
| Hypothetical protein | WP_107942795 | 1 aa del | 232–273 | ||
| Arginase | WP_107840234 | 1 aa del | 92–130 | ||
| Preprotein translocase subunit SecY | WP_066168906 | 3 aa ins | 291–339 | ||
| ATP-binding cassette domain-containing protein | WP_107942022 | 2 aa del | 336–373 | ||
| Purine permease | WP_042470344 | 2 aa ins | 368–416 | ||
| Thiol-disulfide oxidoreductase ResA | WP_042477869 | 1 aa ins | 123–154 | ||
| Flagellar hook–basal body proteinb | WP_099422418 | 1 aa ins | 176–213 | Solibacillus clade | |
| Aminodeoxychorismate lyaseb | WP_057989468 | 1 aa del | 50–93 | ||
| VOC family protein | WP_057987106 | 1 aa ins | 67–116 | ||
| DNA topoisomerase IV subunit A | WP_057989369 | 1 aa ins | 266–297 | ||
| DegV family protein | WP_099423520 | 1 aa ins | 242–297 | ||
| Flagellar hook–basal body protein | WP_099422418 | 1 aa del | 206–248 | ||
| Helicase-exonuclease AddAB subunit AddB | WP_099422539 | 6 aa ins | 233–286 | ||
| Multidrug resistance efflux transporter family protein | WP_057988067 | 2 aa ins | 36–80 | ||
| Heme-dependent peroxidase | WP_057986646 | 4 aa del | 32–66 | ||
| Methionine ABC transporter ATP-binding protein | WP_099424991 | 1 aa ins | 220–265 | ||
| tRNA 4-thiouridine(8) synthase ThiI | WP_099424859 | 1 aa ins | 71–177 | ||
| AAA family ATPase | WP_099422549 | 1 aa ins | 315–361 | ||
| Aspartate–tRNA ligase | WP_083031738 | 2 aa del | 435–479 | Sporosarcina clade | |
| A/G-specific adenine glycosylase | WP_029055238 | 1 aa del | 183–222 | ||
| Thymidylate synthase | WP_099690866 | 1 aa del | 138–176 | ||
| RDD family protein | WP_083032299 | 2 aa ins | 31–90 | ||
| DEAD/DEAH box helicase | WP_029055098 | 2 aa del | 332–368 | ||
| Membrane protein insase YidC | WP_099632462 | 3 aa ins | 77–124 | ||
| Cytochrome b6 | WP_009765631 | 5–6 aa ins | 25–78 | ||
| Hypothetical protein | WP_083035866 | 2 aa ins | 18–53 |
FIGURE 7Partial sequence alignments of (A) the flagellar hook–basal body protein showing a one amino acid insertion (boxed) that is exclusively shared by all members of the Solibacillus clade, and (B) the aspartate–tRNA ligase protein containing a two amino acid deletion (boxed) that is exclusively shared by all members of the Sporosarcina clade. Detailed sequence alignments for these CSIs as well as additional CSIs specific for these clades are presented in Supplementary Figures 54–65 for the Solibacillus clade, and Supplementary Figures 66–73 for the Sporosarcina clade and some of their characteristics are summarized in Table 3.
FIGURE 8Partial sequence alignments of (A) the penicillin-binding protein 2 protein showing a two amino acid insertion (boxed) that is exclusively shared by all members of the Planococcus/Planomicrobium clade, (B) the ABC transporter substrate-binding protein containing a three amino acid insertion (boxed) that is exclusively shared by all members within the Flavidum clade, and (C) the DNA-directed RNA polymerase subunit beta protein containing an eight amino acid insertion that is exclusively shared by all members of the genus Caryophanon. Detailed sequence alignments for these CSIs as well as additional CSIs specific for these clades are presented in Supplementary Figures 74–78 for the Planococcus/Planomicrobium clade, Supplementary Figures 79–86 for the Flavidum clade, and Supplementary Figures 87–94 for the genus Caryophanon and some of their characteristics are summarized in Table 4.
Summary of conserved signature indels specific for the Planococcus/Planomicrobium clade, the Flavidum clade, and the genus Caryophanon.
| Penicillin-binding protein 2 | WP_078080352 | 2 aa ins | 215–252 | Planococcus/Planomicrobium clade | |
| Hypothetical protein | WP_065528121 | 3 aa ins | 56–96 | ||
| NADPH-dependent 7-cyano-7-deazaguanine reductase QueF | WP_071153391 | 1 aa del | 14–65 | ||
| ACT domain-containing protein | WP_112232475 | 1 aa del | 11–64 | ||
| Methylmalonyl-CoA mutase | WP_065525509 | 2 aa del | 1009–1053 | ||
| ABC transporter substrate-binding protein | WP_088008171 | 3 aa ins | 72–111 | Flavidum clade | |
| Methionine–tRNA ligase | WP_088008409 | 1 aa ins | 125–165 | ||
| MetQ/NlpA family ABC transporter substrate-binding protein | WP_088008611 | 1 aa ins | 70–129 | ||
| ABC transporter permease | WP_088005980 | 5 aa del | 122–182 | ||
| Spore protease YyaC | WP_088009144 | 1 aa del | 98–139 | ||
| WP_088007583 | 1 aa ins | 92–132 | |||
| Orotidine-5′-phosphate decarboxylasea | WP_088005873 | 1 aa del | 149–205 | ||
| Phospho- | WP_088005773 | 1 aa del | 105–153 | ||
| DNA-directed RNA polymerase subunit beta | WP_066462903 | 8 aa ins | 282–331 | Genus | |
| Peroxide-responsive transcriptional repressor PerR | WP_083998246 | 2 aa ins | 63–113 | ||
| ADP-forming succinate-CoA ligase subunit beta | WP_066543353 | 1 aa ins | 76–131 | ||
| tRNA ( | WP_066542314 | 1 aa ins | 324–369 | ||
| Magnesium transporter | WP_066461515 | 1 aa ins | 325–374 | ||
| Dephospho-CoA kinase | WP_066542157 | 1 aa del | 82–120 | ||
| ATP synthase subunit I | WP_066466362 | 1 aa ins | 1–53 | ||
| Bifunctional DNA-formamidopyrimidine glycosylase/DNA-(apurinic or apyrimidinic site) lyasea | WP_066461928 | 1 aa del | 240–288 |
FIGURE 9A conceptual diagram based on results obtained from phylogenetic studies, AAI similarity analysis, and several identified CSIs indicating the evolutionary relationships among the Caryophanaceae species. The total numbers of identified CSIs that are specifically shared by species from each of these clades are indicated at their respective nodes. All clades analyzed in this study are shown in boxes containing the species they comprise. Of these species, the ones in bold are genome-sequenced while the rest are placed within these clades based on our 16S rRNA analysis.
Descriptions of the new combinations in the emended genus Ureibacillus.
| The description of this taxon is as given by | KCTC 13605T (=NBRC 105754 = CCUG 57911 = DSM 23394 = JC23) | ||
| The description of this taxon is as given by | OMN17T (=DSM 28019 = CCTCC AB 2014073) | ||
| The description of this taxon is as given by | NBRC 108948T (=KACC 16626 = 2RL3-2) | ||
| The description of this taxon is as given by | DSM 24785T (=KCTC 13796 = NCCP-36 = JCM 18777) | ||
| The description of this taxon is as given by | C9T (=CGMCC 1.15291 = DSM 100506) | ||
| The description of this taxon is as given by | ACCC 00718T (=JCM 19611 = LAM612) | ||
| The description of this taxon is as given by | CCTCC AB 2012916T (=Mn1-7 = DSM 26584) | ||
| The description of this taxon is as given by | CCUG 49529T (=CIP 108446 = 4400831) | ||
| The description of this taxon is as given by | KCTC 13296T (=BLB-1 = JCM 15800) | ||
| The description of this taxon is as given by | S5H2222T (=KACC 18714 = LMG 29294 = MCC 3065) |
Descriptions of the new combinations in Metaplanococcus gen. nov., Metasolibacillus gen. nov, emended genus Solibacillus, emended genus Sporosarcina, and emended genus Planococcus.
| The description of this taxon is as given by | KCTC 13261T (=ISL-41 = CCUG 56756 = DSM 27642) | ||
| The description of this taxon is as given by | WS 4626T (=LMG 26643 = DSM 25057) | ||
| The description of this taxon is as given by | DSM 25583T (=NM73 = LMG 26837) | ||
| The description of this taxon is as given by | FF3T (=CSUR P3025 = DSM 27837) | ||
| The description of this taxon is as given by | PN5T (=MTCC 9127 = LMG 23935 = JCM 15113 = DSM 21993) | ||
| The description of this taxon is as given by | 34hs-1T (=ATCC PTA-4993 = NBRC 100172 = NRRL B-59274) | ||
| The description of this taxon is as given by | 1SS1O1T (= DSM 13886 = NCIMB 11923 = ATCC 43646) | ||
| The description of this taxon is as given by | MAE2T (=CIP 107718 = NCIMB 13489) | ||
| The description of this taxon is as given by | AS 1.3454T (=DX3-12 = JCM 12466 = DSM 17276) | ||
| The description of this taxon is as given by | JCM 15088T (=0423 = CGMCC 1.6846 = DSM 24857) | ||
| The description of this taxon is as given by | JG07T (=KCTC 3684 = JCM 10704) | ||
| The description of this taxon is as given by | S23F2T (=ATCC 700539 = CIP 105673 = DSM 13963) | ||
| The description of this taxon is as given by | DSM 14507T (=MTCC 3812 = CMS 53or) | ||
| The description of this taxon is as given by | CGMCC 1.12259T (=XN13 = KCTC 33047) | ||
| The description of this taxon is as given by | JCM 12481T (=K22-03 = DSM 16419 = MTCC 6226) |