Xian Xia1, Jiahong Li1, Shuijiao Liao2, Gaoting Zhou2, Hui Wang1, Liqiong Li1, Biao Xu1, Gejiao Wang1. 1. State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, 430070 People's Republic of China. 2. State Key Laboratory of Agricultural Microbiology, Huazhong Agricultural University, Wuhan, 430070 People's Republic of China ; College of Basic Sciences, Huazhong Agricultural University, Wuhan, 430070 People's Republic of China.
Abstract
Alishewanella sp. WH16-1 (= CCTCC M201507) is a facultatively anaerobic, motile, Gram-negative, rod-shaped bacterium isolated from the soil of a copper and iron mine. This strain efficiently reduces chromate (Cr(6+)) to the much less toxic Cr(3+). In addition, it reduces sulfate (SO4(2-)) to S(2-), which can react with Cd(2+) to form precipitated CdS. Thus, strain WH16-1 shows great potential for the bioremediation of Cr and Cd contamination. Here we describe the features of this organism, together with the draft genome and a genomic comparison between strain WH16-1 and other Alishewanella strains. The genome comprises 3,488,867 bp with a G + C content of 50.4 %, 3,132 protein-coding genes and 80 RNA genes. Putative chromate- and sulfate-reducing genes are both identified.
The genus Alishewanella was established by Vogel et al. in 2000, with Alishewanella fetalis as the type species. It belongs to the family Alteromonadaceae of the class Gammaproteobacteria [1]. So far, Alishewanella contains six species: A. fetalis, A. aestuarii, A. jeotgali, A. agri, A. tabrizica and A. solinquinati [1-6]. Members of the genus are Gram-negative, rod-shaped and positive for oxidase and catalase [1-6]. Some Alishewanella strains can degrade pectin, which is applicable to the bioremediation of food industry wastes [7-11]. Three strains (A. aestuarii B11T, A. jeotgali KCTC 22429 and A. agri BL06T) have been sequenced, and the pectin degradation pathway was found in their genomes [8-11]. Some strains of Alishewanella were reported to tolerate arsenic [12, 13], but the ability of Alishewanella strains to resist or transform other heavy metal(loid)s has not been reported.
Alishewanella sp. WH16-1 was isolated from mining soil in 2009. This strain is resistant to multiple heavy metals. During cultivation, it efficiently reduces the toxic chromate (Cr6+) to the much less toxic and less bioavailable Cr3+. It also reduces sulfate (SO42−) to S2−. When Cd2+ is present, the S2− reacts with Cd2+ and precipitates as CdS. These characteristics give strain WH16-1 great potential for the bioremediation of Cr and Cd contamination. In pot experiments with rice, tobacco and Chinese cabbage, adding the bacterial culture significantly decreased the amount of Cr and Cd in the plants [14]. Sequencing the genome of Alishewanella sp. WH16-1 and comparing it with the other Alishewanella genomes provides a means of establishing the molecular determinants required for chromate/sulfate reduction, heavy metal resistance and pectin degradation, and of improving the application of these strains. Here we report the high-quality draft genome of strain WH16-1 and compare it with the three sequenced Alishewanella genomes.
Organism information
Classification and features
Phylogenetic analysis was performed by the neighbor-joining method based on 16S rRNA gene sequences. Strain WH16-1 is closely related to A. agri BL06T (99.7 %) and A. fetalis CCUG 30811 (99.1 %) (Fig. 1). A similar result was obtained based on gyrase B gene (gyrB) sequences (Additional file 1: Figure S1). gyrB sequences have been successfully used to establish phylogenetic relatedness in Alishewanella [1] and in several other genera [15-19].
Fig. 1
Phylogenetic tree highlighting the phylogenetic position of Alishewanella sp. WH16-1. The tree was constructed from 16S rRNA gene sequences with MEGA 6.0 [45] using the NJ algorithm, and 1,000 bootstrap replicates were computed to estimate its reliability
Strain WH16-1 is Gram-negative, facultatively anaerobic, motile and rod-shaped (0.3–0.5 × 1.2–2.0 μm) (Fig. 2). Colonies are white, circular and raised on LB agar plates. Growth occurs at 4–40 °C, in 0–8 % (w/v) NaCl and at pH 4–11. Optimal growth occurs at 37 °C, 1 % (w/v) NaCl and pH 6.0–8.0 (Table 1). It can grow in LB, trypticase soy broth and R2A medium. The API 20NE test (bioMérieux), in combination with traditional classification methods, was used to analyze the physiological and biochemical characteristics. Strain WH16-1 is positive for oxidase and catalase activities and is able to reduce nitrate to nitrite. It is positive for aesculinase, gelatinase, arginine dihydrolase and urease but negative for indole and β-galactosidase. It can use D-sucrose and maltose as sole carbon sources. It cannot assimilate D-glucose, L-arabinose, D-mannose, D-mannitol, N-acetylglucosamine, gluconate, capric acid, adipic acid, malic acid, trisodium citrate or phenylacetic acid. Most of these biochemical characteristics are similar to those of the other Alishewanella strains [1-6]. However, unlike some Alishewanella strains [8-11], strain WH16-1 cannot degrade pectin.
Fig. 2
Scanning electron microscope (SEM) image of Alishewanella sp. WH16-1 cells. The scale bar represents 1 μm
Table 1
Classification and general features of Alishewanella sp. WH16-1 [47]
| MIGS ID | Property | Term | Evidence code(a) |
|---|---|---|---|
| | Classification | Domain Bacteria | TAS [48] |
| | | Phylum Proteobacteria | TAS [49, 50] |
| | | Class Gammaproteobacteria | TAS [51–53] |
| | | Order Alteromonadales | TAS [52–54] |
| | | Family Alteromonadaceae | TAS [55] |
| | | Genus Alishewanella | TAS [1] |
| | | Species Alishewanella sp. | |
| | | Strain WH16-1 | |
| | Gram stain | negative | IDA |
| | Cell shape | rod | IDA |
| | Motility | motile | IDA |
| | Sporulation | non-sporulating | NAS |
| | Temperature range | 4–40 °C | IDA |
| | Optimum temperature | 37 °C | IDA |
| | pH range; Optimum | 4–11; 6–8 | IDA |
| | Carbon source | maltose, D-sucrose | IDA |
| MIGS-6 | Habitat | soil | IDA |
| MIGS-6.3 | Salinity | 0–8 % NaCl (w/v), optimal at 1 % | IDA |
| MIGS-22 | Oxygen requirement | facultative anaerobic | IDA |
| MIGS-15 | Biotic relationship | free-living | IDA |
| MIGS-14 | Pathogenicity | non-pathogen | NAS |
| MIGS-4 | Geographic location | Huangshi city, Hubei province, China | IDA |
| MIGS-5 | Sample collection | 2009 | IDA |
| MIGS-4.1 | Latitude | N29°40′–30°15′ | IDA |
| MIGS-4.2 | Longitude | E114°31′–115°20′ | IDA |
| MIGS-4.4 | Altitude | not reported | |
(a) Evidence codes: IDA, Inferred from Direct Assay; TAS, Traceable Author Statement (i.e., a direct report exists in the literature); NAS, Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [56]
Interestingly, in LB liquid medium the strain reduced 1 mmol/L Cr6+ (added as K2CrO4) in 36 h and removed 60 μmol/L Cd2+ (added as CdCl2) in 60 h through the production of precipitated CdS [20] (Fig. 3). In addition, this strain tolerates multiple metal(loid)s. Minimal inhibitory concentration (MIC) tests for different heavy metals were carried out on LB agar plates incubated at 37 °C for 2 days. The MICs for K2CrO4, CdCl2, PbCl2, CuCl2 and Na3AsO3 are 45, 0.08, 10, 1 and 1 mmol/L, respectively.
Fig. 3
Cr6+ and Cd2+ removal by Alishewanella sp. WH16-1. Control stands for uninoculated LB medium. Strain WH16-1 was incubated until the OD600 reached 1.0 and was then amended with K2CrO4 (1 mmol/L) or CdCl2 (0.06 mmol/L). Culture samples were taken at 12 h intervals. After centrifugation at 12,000 rpm for 2 min, the supernatant was used to determine the residual concentrations of Cr6+ and Cd2+. Cr6+ was measured with a UV spectrophotometer (DU800, Beckman, CA, USA) using the colorimetric diphenylcarbazide (DPC) method [46], and Cd2+ was measured by atomic absorption spectrometry (AAS)
Genome sequencing information
Genome project history
Strain WH16-1 was selected for genome sequencing based on its ability to reduce Cr6+ and SO42− and preliminary application for soil Cr and Cd bioremediation. Since 2009, this strain has been used in both basic and bioremediation studies and the results are very promising. It was sequenced by Majorbio Bio-pharm Technology Co., Ltd, Shanghai, China. The genome sequencing and assembly information of the project is given in Table 2. The final genome consists of 133 scaffolds with approximately 345.3 × coverage. The draft genome sequence was annotated by NCBI PGAP. The genome sequence is available in DDBJ/EMBL/GenBank under accession number LCWL00000000.
Table 2
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | High-quality draft |
| MIGS-28 | Libraries used | Illumina Paired-End library (300 bp insert size) |
| MIGS-29 | Sequencing platforms | Illumina Hiseq 2000 |
| MIGS-31.2 | Fold coverage | 345.3 × |
| MIGS-30 | Assemblers | SOAPdenovo v1.05 |
| MIGS-32 | Gene calling method | GeneMarkS+ |
| | Locus TAG | AAY72 |
| | Genbank ID | LCWL00000000 |
| | Genbank Date of Release | 2015.11.12 |
| | Bioproject | PRJNA283029 |
| MIGS-13 | Source material identifier | Strain CCTCC M201507 |
| | Project relevance | Bioremediation |
Growth conditions and genomic DNA preparation
A single colony of strain WH16-1 was inoculated into 50 ml LB medium and grown aerobically at 37 °C for 36 h with shaking at 150 rpm. The cells were collected by centrifugation. The DNA was extracted, concentrated and purified using the QIAamp kit (Qiagen, Germany). A NanoDrop 2000 spectrophotometer was used to determine the quality and quantity of the DNA. Six micrograms of DNA were sent to Majorbio Bio-pharm Technology Co., Ltd (Shanghai, China) for sequencing.
Genome sequencing and assembly
The genome of strain WH16-1 was sequenced on an Illumina HiSeq 2000 [21] and assembled by Majorbio Bio-pharm Technology Co., Ltd, Shanghai, China. An Illumina standard shotgun library was constructed and sequenced, generating 12,683,662 reads totaling 1,281,049,862 bp. All original sequence data can be found at the NCBI Sequence Read Archive [22]. Low-quality reads were removed in five steps: (1) adapter sequences were removed from the reads; (2) 5′-end bases other than A, T, G or C were trimmed; (3) reads with a quality score lower than 20 were filtered out; (4) reads containing more than 10 % N were filtered out; and (5) reads shorter than 25 bp after the previous four steps were removed. The reads were assembled into 156 contigs using SOAPdenovo v1.05 [23]. A total of 149 contigs remained after contigs < 200 bp were removed. The total size of the genome is 3,488,867 bp, and the final assembly is based on 1,205 Mbp of Illumina data, which provides a coverage of 345.3 ×.
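The read-cleaning steps and the coverage figure above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline used by the sequencing provider: the function names are hypothetical, adapter removal (step 1) is left out, and "quality score lower than 20" is assumed to mean the mean Phred score of a read, which the text does not specify.

```python
from statistics import mean

VALID_BASES = set("ACGT")

def trim_5prime(seq, quals):
    """Step 2: drop leading bases that are not A, T, G or C."""
    start = 0
    while start < len(seq) and seq[start] not in VALID_BASES:
        start += 1
    return seq[start:], quals[start:]

def passes_filters(seq, quals, min_qual=20, max_n_frac=0.10, min_len=25):
    """Steps 3-5: quality, N-content and length filters.

    Assumption: the quality threshold is applied to the mean Phred
    score of the read; the source does not say which statistic was used.
    """
    if len(seq) < min_len:                      # step 5: too short
        return False
    if mean(quals) < min_qual:                  # step 3: low quality
        return False
    if seq.count("N") / len(seq) > max_n_frac:  # step 4: too many N
        return False
    return True

def clean_read(seq, quals):
    """Return the trimmed read if it survives all filters, else None."""
    seq, quals = trim_5prime(seq.upper(), quals)
    return (seq, quals) if passes_filters(seq, quals) else None

# Fold coverage = bases used in the assembly / genome size.
genome_size_bp = 3_488_867
assembled_data_bp = 1_205_000_000  # "1,205 Mbp", already rounded in the text
coverage = assembled_data_bp / genome_size_bp
print(f"approximate coverage: {coverage:.1f}x")
```

Because the 1,205 Mbp input is itself rounded, the computed value agrees with the reported 345.3 × only to within about 0.1 ×.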
Genome annotation
The draft genome of WH16-1 was annotated through the NCBI Prokaryotic Genome Annotation Pipeline (PGAP), which combines the gene caller GeneMarkS+ [24] with a similarity-based gene detection approach. Protein function classification was performed by WebMGA [25] with an E-value cutoff of 1e-10. Transmembrane helices were predicted by TMHMM v. 2.0 [26]. Signal peptides were predicted by SignalP 4.1 [27]. The translations of the predicted CDSs were also searched against the Pfam protein family database with an E-value cutoff of 1e-5 [28] and against the KEGG database [29]. Internal gene clustering was performed by OrthoMCL using a match cutoff of 50 % and an E-value cutoff of 1e-5 [30, 31].
Genome properties
The whole genome of strain WH16-1 is 3,488,867 bp in length, with an average G + C content of 50.4 %, and is distributed in 149 contigs (>200 bp). The genome properties and statistics are summarized in Table 3. There are 80 predicted RNA genes, including 73 tRNAs, 5 rRNAs and 2 ncRNAs. In addition, a total of 3,132 protein-coding genes are identified. The distribution of genes into COG functional categories is presented in Table 4.
Table 3
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of total(a) |
|---|---|---|
| Genome size (bp) | 3,488,867 | 100.00 |
| DNA coding (bp) | 3,117,033 | 89.34 |
| DNA G + C (bp) | 1,759,785 | 50.44 |
| DNA scaffolds | 133 | 100.00 |
| Contigs | 149 | 100.00 |
| Total genes(b) | 3,282 | |
| RNA genes | 80 | |
| Pseudo genes | 73 | |
| Protein-coding genes | 3,132 | 100.00 |
| Genes in internal clusters | 1,190 | 37.99 |
| Genes with function prediction | 2,388 | 76.25 |
| Genes assigned to COGs | 2,249 | 71.81 |
| Genes with Pfam domains | 2,710 | 86.53 |
| Genes with signal peptides | 367 | 11.72 |
| Genes with transmembrane helices | 1,101 | 35.15 |
| CRISPR repeats | 1 | |
(a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
(b) Also includes 73 pseudogenes, 73 tRNA genes, 5 rRNAs and 2 ncRNA
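The "% of total" column of Table 3 follows directly from the counts and the two denominators named in footnote a. The short Python check below recomputes it. The dictionary layout and the helper name `pct` are illustrative choices, not part of the original analysis.

```python
GENOME_SIZE_BP = 3_488_867   # denominator for base-pair attributes
PROTEIN_CODING = 3_132       # denominator for gene-count attributes

def pct(part, whole):
    """Percentage of `whole` represented by `part`, to two decimals."""
    return round(100 * part / whole, 2)

# attribute -> (count, denominator, percentage reported in Table 3)
rows = {
    "DNA coding (bp)": (3_117_033, GENOME_SIZE_BP, 89.34),
    "DNA G + C (bp)": (1_759_785, GENOME_SIZE_BP, 50.44),
    "Genes in internal clusters": (1_190, PROTEIN_CODING, 37.99),
    "Genes with function prediction": (2_388, PROTEIN_CODING, 76.25),
    "Genes assigned to COGs": (2_249, PROTEIN_CODING, 71.81),
    "Genes with Pfam domains": (2_710, PROTEIN_CODING, 86.53),
    "Genes with signal peptides": (367, PROTEIN_CODING, 11.72),
    "Genes with transmembrane helices": (1_101, PROTEIN_CODING, 35.15),
}

for name, (count, denominator, reported) in rows.items():
    computed = pct(count, denominator)
    print(f"{name}: computed {computed:.2f} %, reported {reported:.2f} %")
```

The same helper applies to Table 4, where every percentage uses the 3,132 protein-coding genes as the denominator.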
Table 4
Number of genes associated with the 25 general COG functional categories
| Code | Value | % of total(a) | Description |
|---|---|---|---|
| J | 175 | 5.59 | Translation |
| A | 1 | 0.03 | RNA processing and modification |
| K | 153 | 4.89 | Transcription |
| L | 141 | 4.50 | Replication, recombination and repair |
| B | 2 | 0.06 | Chromatin structure and dynamics |
| D | 34 | 1.09 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0.00 | Nuclear structure |
| V | 56 | 1.79 | Defense mechanisms |
| T | 216 | 6.90 | Signal transduction mechanisms |
| M | 156 | 4.98 | Cell wall/membrane biogenesis |
| N | 87 | 2.78 | Cell motility |
| Z | 0 | 0.00 | Cytoskeleton |
| W | 0 | 0.00 | Extracellular structures |
| U | 77 | 2.46 | Intracellular trafficking and secretion |
| O | 116 | 3.70 | Posttranslational modification, protein turnover, chaperones |
| C | 157 | 5.01 | Energy production and conversion |
| G | 90 | 2.87 | Carbohydrate transport and metabolism |
| E | 207 | 6.61 | Amino acid transport and metabolism |
| F | 62 | 1.98 | Nucleotide transport and metabolism |
| H | 132 | 4.21 | Coenzyme transport and metabolism |
| I | 85 | 2.71 | Lipid transport and metabolism |
| P | 148 | 4.73 | Inorganic ion transport and metabolism |
| Q | 41 | 1.31 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 244 | 7.79 | General function prediction only |
| S | 202 | 6.45 | Function unknown |
| - | 885 | 28.26 | Not in COGs |
(a) The total is based on the total number of protein coding genes in the annotated genome
Insights from the genome sequence
Strain WH16-1 has the genes for a complete SO42− reduction pathway according to the KEGG analysis, including CysPUWA, CysN, CysD, CysC, CysH and CysIJ (Additional file 1: Figure S2; Additional file 2: Table S1). This pathway comprises five steps: 1) SO42− is taken up into the cell by the putative CysPUWA transporter [32]; 2) the intracellular SO42− is adenylylated to adenylylsulphate (APS) by the sulfate adenylyltransferase subunits CysN and CysD [33]; 3) APS is phosphorylated to phosphoadenylylsulphate (PAPS) by the APS kinase CysC; 4) PAPS is reduced to sulfite (SO32−) by the PAPS reductase CysH [33]; and 5) SO32− is finally reduced to sulfide (S2−) by the sulfite reductase CysIJ [33]. Strain WH16-1 most probably removes Cd2+ through the reaction between S2− and Cd2+, which forms precipitated CdS [20]. For Cr6+ reduction, a putative chromate reductase YieF was found (Additional file 2: Table S1). YieF was reported to be responsible for the reduction of Cr6+ in the cytoplasm [34]. An individual chromate transport gene chrA and a chromate resistance cluster including chrBAC, hp1, chrF, lppy/lpqo, hp2 and an ABC transport permease gene are found in the genome (Additional file 2: Table S2) [35, 36]. We have disrupted chrA (AAY72_02075) and the ABC transport permease gene separately, and chromate resistance decreased significantly in both mutant strains (data not shown).
In addition, various heavy metal transformation and resistance determinants are identified in the genome of strain WH16-1. Several transporters (MntH, CzcA and ZntA) that might be involved in the efflux of Cd2+, Pb2+ and Zn2+ are found [37-39]. Cu2+, As3+ and Hg2+ resistance determinants are also present, such as a Cu transporter ATPase [40], the Cu2+ resistance system CopABCD [41], the Ars [42] and Pst [43] systems for arsenic resistance and the MerTPADE system for mercury resistance [44] (Additional file 2: Table S2).
Strain WH16-1 has a genome size (3.49 Mbp) similar to those of A. jeotgali KCTC 22429 (3.84 Mbp), A. aestuarii B11T (3.59 Mbp) and A. agri BL06T (3.49 Mbp) [8-10] (Fig. 4). The G + C content of strain WH16-1 (50.4 %) is also consistent with that of the other strains (KCTC 22429, 50.7 %; B11T, 51 %; BL06T, 50.6 %). Strain WH16-1 shares 2,474 proteins with the other three genomes and has 217 strain-specific proteins (Fig. 5). The 2,474 core genes include yieF, chrA, the ten genes of the whole sulfate reduction pathway and most of the heavy metal resistance genes (Additional file 2: Table S1-S2). Strain WH16-1 possesses a higher number of chromate resistance genes than the other three strains.
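The core and strain-specific counts reported here come from ortholog clustering across the four genomes. As a rough illustration of how such counts are derived once clusters are available, the Python sketch below classifies clusters by which genomes contribute a member. The cluster identifiers and memberships are invented toy data, and the function name is hypothetical; it does not reproduce the OrthoMCL run described in the annotation section.

```python
def core_and_specific(clusters, focal, genomes):
    """Split ortholog clusters into core and focal-strain-specific sets.

    clusters: dict mapping a cluster id to the set of genome names that
              have at least one member protein in that cluster.
    focal:    the genome of interest (here, strain WH16-1).
    genomes:  all genomes included in the comparison.
    """
    all_genomes = set(genomes)
    core, specific = set(), set()
    for cluster_id, members in clusters.items():
        if all_genomes <= members:      # present in every genome
            core.add(cluster_id)
        elif members == {focal}:        # present only in the focal genome
            specific.add(cluster_id)
    return core, specific

genomes = ["WH16-1", "BL06T", "KCTC 22429", "B11T"]

# Toy data for illustration only; real input would be the clustering output.
toy_clusters = {
    "c1": {"WH16-1", "BL06T", "KCTC 22429", "B11T"},
    "c2": {"WH16-1"},
    "c3": {"BL06T", "B11T"},
    "c4": {"WH16-1", "BL06T"},
}

core, specific = core_and_specific(toy_clusters, "WH16-1", genomes)
print("core:", sorted(core), "specific:", sorted(specific))
```

Clusters shared by only some genomes (such as `c3` and `c4` in the toy data) fall into neither set; these correspond to the intermediate regions of a Venn diagram like the one in Fig. 5.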
Fig. 4
A graphical circular map of the comparison between reference strain Alishewanella sp. WH16-1 and three sequenced strains of the Alishewanella species. From outside to center, rings 1, 4 show protein-coding genes colored by COG categories on forward/reverse strand; rings 2, 3 denote genes on forward/reverse strand; rings 5, 6, 7 show the CDS vs CDS BLAST results of strain WH16-1 with those of A. agri BL06T, A. jeotgali KCTC 22429T and A. aestuarii B11T, respectively; ring 8 shows G + C % content plot and the innermost ring shows GC skew
Fig. 5
Venn diagram depicting the core and unique genes of Alishewanella sp. WH16-1 and three other Alishewanella species (A. agri BL06T, A. jeotgali KCTC 22429T and A. aestuarii B11T)
In addition, A. agri BL06T, A. jeotgali KCTC 22429 and A. aestuarii B11T were all reported to degrade pectin and to possess pectin degradation genes [8-11]. However, unlike these three strains, strain WH16-1 is unable to degrade pectin, and no pectin degradation genes are found in its genome. Since strain WH16-1 was isolated from a heavy metal-rich environment, it may be more relevant for the bioremediation of heavy metal contamination. The pectin degradation genes may have been lost during evolution.
Conclusions
The genomic results for Alishewanella sp. WH16-1 reveal a correlation between gene content and some phenotypes. The strain harbors various genes responsible for sulfate transport and reduction, chromate reduction and resistance to multiple heavy metals. These observations provide insights into the molecular mechanisms of heavy metal resistance and transformation. In addition, all of the analyzed Alishewanella genomes have putative sulfate and chromate reduction genes, which indicates that sulfate and chromate reduction may be important characteristics of Alishewanella strains. Thus, these strains have great potential for application in the bioremediation of heavy metals or other industrial wastes.
Authors: M Ashburner; C A Ball; J A Blake; D Botstein; H Butler; J M Cherry; A P Davis; K Dolinski; S S Dwight; J T Eppig; M A Harris; D P Hill; L Issel-Tarver; A Kasarskis; S Lewis; J C Matese; J E Richardson; M Ringwald; G M Rubin; G Sherlock. Journal: Nat Genet. Date: 2000-05.
Authors: B F Vogel; K Venkateswaran; H Christensen; E Falsen; G Christiansen; L Gram. Journal: Int J Syst Evol Microbiol. Date: 2000-05.
Authors: K Venkateswaran; D P Moser; M E Dollhopf; D P Lies; D A Saffarini; B J MacGregor; D B Ringelberg; D C White; M Nishijima; H Sano; J Burghardt; E Stackebrandt; K H Nealson. Journal: Int J Syst Bacteriol. Date: 1999-04.