| Literature DB >> 32300337 |
Anming Xu1,2, Di Wang1, Yichen Ding3, Yaqian Zheng1,2, Bo Wang1,2, Qing Wei1, Shiwei Wang1, Liang Yang4, Luyan Z Ma1,2.
Abstract
Pseudomonas aeruginosa is an environmental microorganism that can thrive in diverse ecological niches including plants, animals, water, soil, and crude oil. It also one of the microorganism widely used in tertiary recovery of crude oil and bioremediation. However, the genomic information regarding the mechanisms of survival and adapation of this bacterium in crude oil is still limited. In this study, three Pseudomonads strains (named as IMP66, IMP67, and IMP68) isolated from crude oil were taken for whole-genome sequencing by using a hybridized PacBio and Illumina approach. The phylogeny analysis showed that the three strains were all P. aeruginosa species and clustered in clade 1, the group with PAO1 as a representitive. Subsequent comparative genomic analysis revealed a high degree of individual genomic plasticity, with a probable alkane degradation genomic island, one type I-F CRISPR-Cas system and several prophages integrated into their genomes. Nine genes encoding alkane hydroxylases (AHs) homologs were found in each strain, which might enable these strains to degrade alkane in crude oil. P. aeruginosa can produce rhamnolipids (RLs) biosurfactant to emulsify oil, which enables their survival in crude oil enviroments. Our previous report showed that IMP67 and IMP68 were high RLs producers, while IMP66 produced little RLs. Genomic analysis suggested that their RLs yield was not likely due to differences at genetic level. We then further analyzed the quorum sensing (QS) signal molecules that regulate RLs synthesis. IMP67 and IMP68 produced more N-acyl-homoserine lactones (AHLs) signal molecules than that of PAO1 and IMP66, which could explain their high RLs yield. This study provides evidence for adaptation of P. aeruginosa in crude oil and proposes the potential application of IMP67 and IMP68 in microbial-enhanced oil recovery and bioremediation.Entities:
Keywords: N-acyl-homoserine lactones; Pseudomonas aeruginosa; alkane hydroxylase; crude oil; rhamnolipids
Year: 2020 PMID: 32300337 PMCID: PMC7145413 DOI: 10.3389/fmicb.2020.00519
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
FIGURE 1Phylogenetic analysis for depicting the relationships of these three newly sequenced strains with Pseudomonas spp. The tree is based on concatenated alignments of ten core housekeeping genes: acsA, aroE, dnaE, guaA, gyrB, mutL, ppsA, pyrC, recA, and rpoB, and was generated using the RAxML package. Five sequence clusters (SCs) are identified and labeled on the edge of the phylogeny. Clades corresponding to 4 dominant different monophyletic SCs (SC1 to SC3 and SC5) are shown in different colors, while clade for all the strains in the polyphyletic clade (SC4) comprising of the “unclustered” sequences is colored in black. Bootstrap supporting over 90% are labeled for the major nodes. Strains sequenced in this study are shown by yellow shading, which fall into SC5 with P. aeruginosa. The Escherichia coli K-12 genome was used as an out-group control.
FIGURE 2Phylogeny of P. aeruginosa population structure. Unrooted (A) and circular (B) maximum likelihood trees of three newly sequenced crude oil isolates combined with 87 representive P. aeruginosa isolates from diverse niches are generated from multiple-sequence alignment with 51,389 SNPs from 820 universally conserved (core) genes. The phylogenetic trees are constructed using RAxML with GTR + γ nucleotide substitution model and 100 bootstrap replicates. Strains are divided into three major clades. Clade 2 is contracted for visualization purpose. A framed miniature of the true appearance of this tree is presented. Representive crude oil sourced P. aeruginosa strains are highlighted in yellow.
General genome properties comparison of the completely sequenced crude oil isolates with laboratory strains.
| Size (bp) | 6,264,404 | 6,486,336 | 6,500,677 | 6,481,180 | 6,401,521 | 6,368,297 | 6,349,902 | 6,499,441 | 6,243,825 |
| CDS (total) | 5697 | 6077 | 6096 | 6070 | 5,823 | 5,803 | 5,766 | 5,913 | 5,808 |
| CDS (coding) | 5572 | 5996 | 6007 | 5991 | 5,684 | 5,687 | 5,671 | 5,772 | 5,761 |
| G + C (%) | 66.6 | 66.4 | 66.4 | 66.4 | 66.2 | 66.4 | 66.4 | 66.1 | 66.5 |
FIGURE 3(A) Circular chromosome map of P. aeruginosa IMP66, IMP67, and IMP68. Pan-genome analysis of three crude-oil sourced P. aeruginosa isolates and PAO1. (B) The unique and shared genes among P. aeruginosa IMP66, IMP67, IMP68 and PAO1 are performed using the EDGAR software platform based on the orthology analysis. The results are visualized in four degrees. Degree one represents the distinct singletons harbored by each single strain; degree two represents the genes shared by every two strains; degree three represents the genes shared by every three strains; degree four is the number of core genes shared by all the four strains. (C) COG annotation of all 592 exclusive CDSs shared by these three P. aeruginosa isolates.
FIGURE 4Whole genome comparison of IMP66, IMP67, IMP68, and PAO1. (A) Analysis of regions of genomic plasticity among IMP66, IMP67 and IMP68 (IMP66 was used as a representative of the three strains). The innermost ring represents the G + C skew of IMP66, followed by its G + C content. Light green, dodger blue and light blue rings that follow depict BLASTN comparisons between the genome of IMP66 and those of strains PAO1, IMP67, and IMP68, respectively. The red ring represents the core genome of the four strains. Then the two black rings are the CDSs of IMP66. The outermost interspaced ring (in dark green) represents the localization of the predicted regions of genomic plasticity (RGP) in the IMP66 genome, and the labels of each region follow the ones in Table 2. (B) Regions with different colors represent potentially horizontally transferred genes (HGT) which are gathered in genomic regions (Region of Genomic Plasticity, RGP). All potential alkane hydroxylases were marked at the chromosome with their corresponding colors.
Regions of Genomic Plasticity (RGPs) identified in the chromosome of IMP66, IMP67, and IMP68.
| RGP1 | 290261–302956 | 12696 | 22 | 57.9 | tRNA | |
| RGP2 | 631096–652201 | 21106 | 35 | 57.8 | tRNA, transposase, integrase | |
| RGP3 | 1730348–1738782 | 8435 | 15 | 56 | tRNA | |
| RGP4 | 1975456–1982437 | 6982 | 14 | 49.9 | unknown function | |
| RGP5 | 2339727–2346716 | 6990 | 22 | 65.5 | tRNA | |
| RGP6 | 2366073–2406282 | 40210 | 64 | 63.5 | integrase, phage | |
| RGP7 | 2461950–2475724 | 13775 | 18 | 57.3 | tRNA, transposase | |
| RGP8 | 2798834–2807221 | 8388 | 28 | 49.5 | Transposase | |
| RGP9 | 2858761–2870683 | 11923 | 20 | 62.5 | transposase,type I-F CRISPR | |
| RGP10 | 3122707–3171699 | 48993 | 56 | 61.9 | transposase, | |
| RGP11 | 4391909–4404142 | 12234 | 21 | 61.8 | ||
| RGP12 | 5104668–5141335 | 36668 | 65 | 64.5 | transposase, phage | |
| RGP13 | 5178764–5291282 | 112519 | 148 | 60.8 | tRNA, transposase, type II toxin-antitoxin system, integrating conjugative element protein, methyl-accepting chemotaxis protein | |
| RGP14 | 6456744–6463635 | 6892 | 15 | 60 | ||
| SIGIVOM1 | 280000–283595 | 3596 | 11 | 60.4 | transposase | |
| SIGIVOM2 | 731236–769656 | 38421 | 62 | 58.9 | tRNA | |
| SIGIVOM3 | 1287864–1292500 | 4637 | 13 | 58.2 | unknown function | |
| SIGIVOM4 | 1869178–1874512 | 5335 | 16 | 47.3 | diguanylate phosphodiesterase | |
| SIGIVOM5 | 3479225–3484687 | 5463 | 14 | 45.5 | unknown function | |
| SIGIVOM6 | 4501618–4515000 | 13383 | 23 | 59.6 | tRNA, type 1 fimbrial protein | |
| SIGIVOM7 | 4522500–4525211 | 2712 | 13 | 63.1 | unknow function | |
| SIGIVOM8 | 4669092–4686046 | 16955 | 39 | 57.9 | integrase, phage | |
| SIGIVOM9 | 5590764–5595000 | 4237 | 19 | 57.1 | phage |
Detail information of the probable alkane degradation cluster in RGP10.
| DBX28_14665 | 3119086 | 3120522 | CDS | undecaprenyl-phosphate glucose phosphotransferase | − | − | − |
| DBX28_14670 | 3120517 | 3120645 | CDS | Glycosyltransferase | −2SD | − | − |
| DBX28_14675 | 3121054 | 3121725 | CDS | hypothetical protein | − | − | − |
| DBX28_14680 | 3121772 | 3122491 | CDS | MOSC domain-containing protein | +1SD | − | − |
| DBX28_14685 | 3122707 | 3123447 | CDS | SDR family oxidoreductase | − | − | + |
| DBX28_14690 | 3123497 | 3123859 | CDS | hypothetical protein | − | + | + |
| DBX28_14695 | 3124233 | 3125303 | CDS | Hydrolase | − | + | + |
| DBX28_14700 | 3125694 | 3129635 | CDS | hypothetical protein | −2SD | + | + |
| DBX28_14705 | 3129746 | 3130042 | fCDS | IS3 family transposase | −1SD | + | + |
| DBX28_14705 | 3130043 | 3130573 | fCDS | IS3 family transposase | − | + | + |
| DBX28_14705 | 3130600 | 3130725 | fCDS | IS3 family transposase | − | + | + |
| DBX28AM_2945 | 3130645 | 3130875 | CDS | protein of unknown function | −2SD | + | + |
| DBX28_14710 | 3130846 | 3131091 | CDS | hypothetical protein | −2SD | + | + |
| DBX28_14715 | 3131276 | 3131785 | CDS | hypothetical protein | −1SD | + | + |
| DBX28_14720 | 3132014 | 3132589 | CDS | hypothetical protein | −2SD | + | + |
| DBX28_14725 | 3132719 | 3132985 | fCDS | IS3 family transposase | −1SD | + | + |
| DBX28_14725 | 3132988 | 3133407 | fCDS | IS3 family transposase | − | + | + |
| DBX28_14725 | 3133431 | 3133694 | fCDS | IS3 family transposase | −2SD | + | + |
| DBX28AM_2951 | 3133851 | 3133934 | CDS | protein of unknown function | −2SD | + | + |
| DBX28AM_2952 | 3133941 | 3134135 | CDS | protein of unknown function | −1SD | + | + |
| DBX28_14730 | 3134510 | 3135052 | CDS | hypothetical protein | −2SD | + | + |
| DBX28_14735 | 3135075 | 3136274 | CDS | hypothetical protein | − | + | + |
| DBX28_14740 | 3136568 | 3137434 | CDS | phytanoyl-CoA dioxygenase family protein | −1SD | + | + |
| DBX28_14745 | 3137519 | 3138481 | CDS | glycosyl hydrolase | − | + | − |
| DBX28_14750 | 3138492 | 3140894 | CDS | RND transporter | − | − | + |
| DBX28_14755 | 3140903 | 3142489 | CDS | DUF1302 domain-containing protein | − | − | + |
| DBX28_14760 | 3142573 | 3143997 | CDS | DUF1329 domain-containing protein | − | − | + |
| DBX28AM_2960 | 3144149 | 3144388 | CDS | protein of unknown function | −2SD | + | + |
| DBX28_14765 | 3144552 | 3145199 | CDS | hypothetical protein | −2SD | + | + |
| DBX28_14770 | 3145310 | 3146677 | CDS | FAD-binding oxidoreductase | − | + | + |
| DBX28_14775 | 3146739 | 3147626 | CDS | phytanoyl-CoA dioxygenase family protein | −1SD | + | + |
| DBX28_14780 | 3148075 | 3148656 | CDS | TetR/AcrR family transcriptional regulator | −1SD | + | + |
| DBX28_14785 | 3149040 | 3150566 | CDS | NAD(P)/FAD-dependent oxidoreductase, almA like | −1SD | + | + |
| DBX28_14790 | 3150563 | 3151438 | CDS | alpha/beta hydrolase | − | + | + |
| DBX28_14795 | 3151669 | 3152826 | CDS | NAD−dependent alcohol dehydrogenase | − | − | + |
| DBX28_14800 | 3152839 | 3154170 | CDS | APC family permease | − | − | − |
| DBX28_14805 | 3154249 | 3155673 | CDS | gamma-aminobutyraldehyde dehydrogenase | − | − | − |
| DBX28_14810 | 3155719 | 3156972 | CDS | aspartate aminotransferase family protein | − | − | − |
| DBX28_14815 | 3157060 | 3157980 | CDS | NAD(P)-dependent oxidoreductase | − | − | − |
| DBX28_14820 | 3158054 | 3158206 | CDS | amino acid permease | −1SD | − | − |
| DBX28_14825 | 3158284 | 3159942 | CDS | cation acetate symporter | − | − | − |
| DBX28_14830 | 3159939 | 3160262 | CDS | DUF485 domain-containing protein | − | − | − |
| DBX28_14835 | 3160319 | 3161968 | CDS | acyl-CoA synthetase | − | − | − |
| DBX28_14840 | 3162052 | 3162819 | CDS | SDR family NAD(P)-dependent oxidoreductase | + 1SD | − | − |
| DBX28_14845 | 3162819 | 3163991 | CDS | acyl-CoA dehydrogenase | − | − | − |
| DBX28_14850 | 3164042 | 3164398 | CDS | acyl-CoA dehydrogenase | − | − | − |
| DBX28_14855 | 3164395 | 3165450 | CDS | phosphotransferase family protein | − | − | − |
| DBX28_14860 | 3165647 | 3167599 | CDS | sigma-54-dependent Fis family transcriptional regulator | − | − | − |
| DBX28_14865 | 3167665 | 3168168 | CDS | transcriptional regulator | −1SD | − | − |
| DBX28_14870 | 3168227 | 3170530 | CDS | MCE family protein | − | − | − |
| DBX28_14875 | 3170523 | 3171143 | CDS | paraquat-inducible protein A | − | − | − |
| DBX28AM_2983 | 3171472 | 3171699 | CDS | conserved protein of unknown function | −1SD | − | − |
| DBX28_14880 | 3171792 | 3173375 | CDS | aldehyde dehydrogenase [NADP(+)] | + 1SD | − | − |
| DBX28_14885 | 3173504 | 3174499 | CDS | FAH family protein | − | − | − |
| DBX28_14890 | 3174527 | 3175702 | CDS | L-rhamnonate dehydratase | − | − | − |
| DBX28_14895 | 3175733 | 3177055 | CDS | MFS transporter | − | − | − |
FIGURE 5rhlAB transcription and AHLs quantification. (A) The concentration of C4-HSL (right panel) and 3-oxo-C12-HSL (left panel) of these three newly sequenced strains and laboratory strain PAO1. The concentration of C4-HSL is directly displayed using the original peak area due to no standard C4-HSL. (B) The rhlAB transcription level of PAO1, IMP66, IMP67 and IMP68 are indicated by the green fluorescent intensity of plasmid PrhlAB:gfp after 24 h incubation in LBNS or NB broth. Significance determined using Student’s t-test, *p < 0.05; **p < 0.01; ***p < 0.001, error bars represent the SD.
FIGURE 6rhlAB transcription of IMP66, IMP67, and IMP68 when grown with n-alkane as the sole carbon source. C20 (A) or C24 (B) was used as the sole carbon source for the growth of IMP66, IMP67, and IMP68 carrying the reporter plasmid PrhlAB:gfp in BSM broth. The expression level of PrhlAB:gfp is monitored and recorded as the flourescent intensity of GFP. The growth curve presented by bacterial total protein concentration was shown as the inset.