| Literature DB >> 31500174 |
Michal A Kaminski1, Adam Sobczak1,2, Andrzej Dziembowski1,2, Leszek Lipinski3,4.
Abstract
Sphingopyxis inhabit diverse environmental niches, including marine, freshwater, oceans, soil and anthropogenic sites. The genus includes 20 phylogenetically distinct, valid species, but only a few with a sequenced genome. In this work, we analyzed the nearly complete genome of the newly described species, Sphingopyxis lindanitolerans, and compared it to the other available Sphingopyxis genomes. The genome included 4.3 Mbp in total and consists of a circular chromosome, and two putative plasmids. Among the identified set of lin genes responsible for γ-hexachlorocyclohexane pesticide degradation, we discovered a gene coding for a new isoform of the LinA protein. The significant potential of this species in the remediation of contaminated soil is also correlated with the fact that its genome encodes a higher number of enzymes potentially involved in aromatic compound degradation than for most other Sphingopyxis strains. Additional analysis of 44 Sphingopyxis representatives provides insights into the pangenome of Sphingopyxis and revealed a core of 734 protein clusters and between four and 1667 unique proteins per genome.Entities:
Keywords: Sphingopyxis lindanitolerans; complete genome sequence; lin genes; pangenome; pesticide; γ-HCH degradation
Mesh:
Substances:
Year: 2019 PMID: 31500174 PMCID: PMC6771000 DOI: 10.3390/genes10090688
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Gene counts associated with the general COGs (clusters of orthologous groups) functional categories for the genome of Sphingopyxis lindanitolerans WS5A3p, obtained from COGNIZER software [30].
| Code | Description | WS5A3p | |
|---|---|---|---|
| Value | % | ||
| J | Translation, ribosomal structure and biogenesis | 241 | 6.8% |
| K | Transcription | 287 | 8.0% |
| L | Replication, recombination and repair | 319 | 8.9% |
| D | Cell cycle control, cell division, chromosome partitioning | 38 | 1.1% |
| V | Defense mechanisms | 81 | 2.3% |
| T | Signal transduction mechanisms | 179 | 5.0% |
| M | Cell wall/membrane/envelope biogenesis | 232 | 6.5% |
| N | Cell motility | 57 | 1.6% |
| W | Extracellular structures | 8 | 0.2% |
| U | Intracellular trafficking, secretion, and vesicular transport | 209 | 5.9% |
| O | Posttranslational modification, protein turnover, chaperones | 189 | 5.3% |
| C | Energy production and conversion | 381 | 10.7% |
| G | Carbohydrate transport and metabolism | 218 | 6.1% |
| E | Amino acid transport and metabolism | 512 | 14.3% |
| F | Nucleotide transport and metabolism | 91 | 2.5% |
| H | Coenzyme transport and metabolism | 175 | 4.9% |
| I | Lipid transport and metabolism | 481 | 13.5% |
| P | Inorganic ion transport and metabolism | 481 | 13.5% |
| Q | Secondary metabolites biosynthesis, transport and catabolism | 302 | 8.5% |
| R | General function prediction only | 651 | 18.2% |
| S | Function unknown | 277 | 7.8% |
Figure 1Phylogenetic analyses of Sphingopyxis genomes. (A) Phylogenetic relationship of Sphingopyxis genomes based on a maximum likelihood analysis of 400 conserved protein sequences. Branches are colored according to bootstrap values, which are presented on the nodes. The bar represents 0.8 substitutions per site. The WS5A3p branch is indicated with a blue diamond. The blue shaded box contains outgroup genomes included in the analysis. (B) Sequence similarity of analyzed Sphingopyxis genomes comparing to WS5A3p strain based on ANIb (blue triangle), ANIm (red diamond) and AAI (green circle).
Figure 2Genome-scale analysis of protein functions for Sphingopyxis lindanitolerans WS5A3p in context of Sphingopyxis pangenome. Bar chart illustrating protein clusters participation in different COGs functional categories for Sphingopyxis core pangenome (orange) and WS5A3p unique clusters (purple). (A) Cellular processes and signaling; (B) information storage and processing functions; (C) metabolism process; (D) unknown functions.
Figure 3Comparative map of plasmid pSPMK1 and plasmid pSPMK2. Sequence similarity between pSPMK1 and pSPMK2 compared to selected plasmids derived from representative Sphingomonadaceace. The first ring represents GC content, rings 2–9 (for pSPMK1) and 2–7 (for pSPMK2) represents the similarity between chosen reference sequences. The last ring for both plasmids indicates open reading frames identified in the sequence, colored according to their function: black—transposases, integrases; purple—lin genes; orange—transfer, conjugation, partition genes; red—other catabolic genes; dark green—toxin/anti-toxin; light green—heavy metal resistance; blue—other genes, grey—hypothetical proteins.
Figure 4Multiple sequence alignment of LinA protein variants. The amino acid sequence of LinA type-1a from WS5A3p strain (1) compared to previously known variants of this protein (LinA type-1 from Sphingobium japonicum UT26 (2), LinA1 from Sphingobium indicum B90A (4), LinA3 from Sphingobium sp. HDIPO4 (6), LinA type-2 (5) and LinA type-3 from soil metagenomes (3)). Sequences were aligned and visualized using Geneious [27]. Dots correspond to the identical amino acids, dashes indicate the gaps. Phylogenetic relationships are indicated next to the sequence names and accession numbers.