Literature DB >> 26516404

Genomic information of the arsenic-resistant bacterium Lysobacter arseniciresistens type strain ZS79(T) and comparison of Lysobacter draft genomes.

Lin Liu¹, Shengzhe Zhang¹, Meizhong Luo¹, Gejiao Wang¹.

Abstract

Lysobacter arseniciresistens ZS79(T) is a highly arsenic-resistant,rod-shaped, motile, non-spore-forming, aerobic, Gram-negative bacterium. In this study, four Lysobacter type strains were sequenced and the genomic information of L. arseniciresistens ZS79(T) and the comparative genomics results of the Lysobacter strains were described. The draft genome sequence of the strain ZS79(T) consists of 3,086,721 bp and is distributed in 109 contigs. It has a G+C content of 69.5 % and contains 2,363 protein-coding genes including eight arsenic resistant genes.

Entities: Chemical Disease Species

Keywords: Comparative genomics; Genome sequence; Lysobacter; Lysobacter arseniciresistens; Xanthomonadaceae

Year: 2015 PMID： 26516404 PMCID： PMC4624666 DOI： 10.1186/s40793-015-0070-5

Source DB: PubMed Journal: Stand Genomic Sci ISSN： 1944-3277

Introduction

type strain ZS79T (=CGMCC 1.10752T = KCTC 23365 T) belongs to family [1]. It is an arsenic-resistant bacterium isolated from subsurface soil of Tieshan iron mine, Daye City, P. R. China [1]. So far, there are 32 validly published species of [2]. Most of these strains were isolated from soil except that [3] and [4] were isolated from water, and [5], daecheongensis [6] [7] were isolated from sludge, sediment and deep-sea sponge, respectively. So far, the genomic sequences of two strains have been published ( AZ78 [8, 9] and 13-6 [10]), but the annotation of 13-6 was not completed. In order to provide genome information of genus , we performed whole genome sequencing of four strains of ( ZS79T, conceretionis Ko07T [5], GH1-9T [11], and IMMIB APB-9T [12]). In this study, the genome features of ZS79T is provided and the comparative results of five genomes of are presented.

Organism information

Classification and features

Members of genus are rod-shaped, aerobic, Gram-negative bacteria [3]. Their G+C contents are 65.4–70.1 %. They use NO−, NH+, glutamate, asparaginate as sole nitrogen sources, Q-8 as the major respiratory quinone, and diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylglycerol, phosphatidyl-N-methylethanolamine as the major polar lipids [3, 8]. In addition, they could lyse cells of many creatures including bacteria, filamentous fungi, yeasts, algae and nematodes [3]. Phylogenetic analyses of ZS79T and its related strains of family were performed based on 16S rRNA genes (Fig. 1a) and 831 conserved proteins (Fig. 1b). In both trees, strain ZS79T is clustered with the other four strains of genus . The phylogenies of the two trees are similar but genomic based tree is more stable than the 16S rRNA gene one (Fig. 1b vs 1a).

Fig. 1

Phylogenetic analyses indicating the position of L. arseniciresistens (in bold) in family Xanthomonadaceae. a The NJ tree based on aligned sequences of 16S rRNA of ten strains of family Xanthomonadaceae. b The NJ tree based on 831 conserved proteins among the ten Xanthomonadaceae strains. Phylogenetic analyses were performed using MEGA version 6 [33]. The trees were built using p-distance model and a bootstrap analysis of 1000 replicates. The GenBank numbers are listed after each strain ZS79T is aerobic, motile, and Gram-negative bacterium with a Minimum Inhibitory Concentration of 14 mM arsenite in R2A medium (Table 1). The cells are rod-shaped with one flagellum and non-spore-forming (Fig. 2). Colonies of this strain are yellow, nontransparent, convex, circular, and, smooth [1].

Table 1

Classification and general features of L. arseniciresistens ZS79T according to the MIGS recommendations [27]

MIGS ID	Property	Term	Evidence code^a
	Classification	Domain Bacteria	TAS [28]
		Phylum Proteobacteria	TAS [29]
		Class Gammaproteobacteria	TAS [29, 30]
		Order Xanthomonadales	TAS [30, 31]
		Family Xanthomonadaceae	TAS [30, 31]
		Genus Lysobacter	TAS [3]
		Species Lysobacter arseniciresistens	TAS [1]
		Type strain: ZS79^T (=CGMCC 1.10752^T = KCTC 23365^T).	TAS [1]
	Gram stain	negative	TAS [1]
	Cell shape	rod-shaped	TAS [1]
	Motility	motile	TAS [1]
	Sporulation	non-spore-forming	TAS [1]
	Temperature range	4–37 °C	TAS [1]
	Optimum temperature	28 °C	TAS [1]
	pH range; Optimum	5.0–9.0; 7.0	TAS [1]
	Carbon source	tyrosine, hippurate, gelatin, 3-hydroxybutyric acid	TAS [1]
MIGS-6	Habitat subsurface soil		TAS [1]
MIGS-6.3	Salinity	0–4 % NaCl (w/v)	TAS [1]
MIGS-22	Oxygen requirement	aerobic	TAS [1]
MIGS-15	Biotic relationship	free-living	NAS
MIGS-14	Pathogenicity	non-pathogen	NAS
MIGS-4	Geographic location	Daye City, Hubei province, China	TAS [1]
MIGS-5	Sample collection	2011	TAS [1]
MIGS-4.1	Latitude	30.207178 N	TAS [1]
MIGS-4.2	Longitude	114.901092 E	TAS [1]
MIGS-4.4	Altitude	not reported

a: Evidence codes – TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]

Fig. 2

Transmission electron microscopy of L. arseniciresistens ZS79T

Classification and general features of L. arseniciresistens ZS79T according to the MIGS recommendations [27] a: Evidence codes – TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32] Transmission electron microscopy of L. arseniciresistens ZS79T The major ubiquinone is Q-8, the major cellular fatty acids (>10 %) are iso-C: 0, iso-C:1ω9ϲ, iso-C:0, iso-C:0 and iso-C:0 3-OH. The polar lipids are diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylglycerol and a kind of unknown phospholipid The C + G content is was 70.7 mol% (HPLC) [1].

Genome sequencing and annotation

Genome project history

The genome of ZS79T was sequenced in April, 2013 and finished within two months. The high-quality draft genome sequence is available in GenBank database under accession number AVPT00000000. The genome sequencing project information is summarized in Table 2.

Table 2

Project information

MIGS ID	Property	Term
MIGS 31	Finishing quality	High-quality draft
MIGS-28	Libraries used	Illumina Paired-End library (300 bp insert size)
MIGS 29	Sequencing platforms	Illumina Hiseq2000
MIGS 31.2	Fold coverage	272.6×
MIGS 30	Assemblers	SOAPdenovo v1.05
MIGS 32	Gene calling method	GeneMarkS+
	Locus Tag	N799
	GenBank ID	AVPT00000000
	GenBank Date of Release	2014/10/24
	GOLD ID	Gi0055236
	BIOPROJECT	PRJNA214588
MIGS 31	Source Material Identifier	ZS79^T
	Project relevance	Genome comparison

Project information

Growth conditions and genomic DNA preparation

ZS79T was cultured in 50 ml of LB (Luria–Bertani) medium at 28 °C for 3 days with 160 160 r/min shaking. About 10 mg cells were harvested by centrifugation and suspended in normal saline, and then lysed using lysozyme. DNA was isolated using cells were harvested by centrifugation and suspended in normal saline, and then lysed using lysozyme. The DNA was extracted and purified using the QiAamp kit according to the manufacturer’s instruction (Qiagen, Germany).

Genome sequencing and assembly

The whole genome sequencing of ZS79T was performed on Illumina Hiseq2000 with Paired-End library strategy (300 bp insert size) at Majorbio Biomedical Science and Technology Co. Ltd. DNA libraries with insert sizes from 300 to 500 bp was constructed using the established protocol [13]. The obtained high quality data contains 4,528,542 × 2 pared reads and 194,996 single reads with an average read length of 91 bp. The sequencing depth was 272.6×. Using SOAPdenovo v1.05 [14] the reads were assembled into 109 contigs with a cumulative genome size of 3,086,721 bp.

Genome annotation

The draft sequence of ZS79T was annotated using the National Center for Biotechnology Information Prokaryotic Genomes Annotation Pipeline [15]. The functions of the predicted genes were determined through blast alignment against the NCBI protein database. Genes were identified using the gene caller GeneMarkS+ with the similarity-based gene detection approach [16]. The different features were predicted by WebMGA [17], TMHMM [18] and SignalP [19].

Genome properties

The whole genome sequence of ZS79T is 3,086,721 bp long with a G+C content of 69.6 % and is distributed into 109 contigs. It has 2,422 predicted genes including 2,363 (97.6 %) protein coding genes, 50 (2.1 %) RNA genes, and 9 (0.4 %) pseudo genes. A total of 1633 (67.4 %) genes have functional prediction, and 1,858 (76.7 %) genes could be assigned to Clusters of Orthologous Groups [20]. More detailed information of the genome statistics is showed in Table 3. The protein functional classification according to COGs is showed in Table 4. The genome map is showed in Fig. 3.

Table 3

Genome statistics

Attribute	Value	% of Total
Genome size (bp)	3,086,721	100.00
DNA coding (bp)	2,284,152	74.00
DNA G+C (bp)	2,147,191	69.56
DNA scaffolds	109
Total genes	2,422	100.00
Protein coding genes	2,363	97.56
RNA genes	50	2.06
Pseudo genes	9	0.37
Genes in internal clusters	811	34.32
Genes with function prediction	1633	67.42
Genes assigned to COGs	1858	76.71
Genes with Pfam domains	2038	84.14
Genes with signal peptides	539	22.81
Genes with transmembrane helices	527	22.25
CRISPR repeats	1	0.41

Table 4

Number of genes associated with general COG functional categories

Code	Value	%age	Description
J	157	6.48	Translation, ribosomal structure and biogenesis
A	1	0.04	RNA processing and modification
K	116	4.79	Transcription
L	127	5.24	Replication, recombination and repair
B	2	0.08	Chromatin structure and dynamics
D	27	1.11	Cell cycle control, Cell division, chromosome partitioning
V	37	1.53	Defense mechanisms
T	104	4.29	Signal transduction mechanisms
M	125	5.16	Cell wall/membrane biogenesis
N	73	3.01	Cell motility
U	89	3.67	Intracellular trafficking and secretion
O	108	4.46	Posttranslational modification, protein turnover, chaperones
C	128	5.28	Energy production and conversion
G	70	2.89	Carbohydrate transport and metabolism
E	148	6.11	Amino acid transport and metabolism
F	50	2.06	Nucleotide transport and metabolism
H	91	3.76	Coenzyme transport and metabolism
I	90	3.72	Lipid transport and metabolism
P	107	4.42	Inorganic ion transport and metabolism
Q	53	2.19	Secondary metabolites biosynthesis, transport and catabolism
R	233	9.62	General function prediction only
S	185	7.64	Function unknown
-	564	23.29	Not in COGs

The total is based on the total number of protein coding genes in the genome

Fig. 3

Graphical circular map of L. arseniciresistens ZS79T genome. From outer to inner, ring 1 shows the genomic islands (red bars) that were predicted by IslandViewer [34]; ring 3,4 show the predicted genes on forward/reverse strand; ring 2,5 show the genes assigned to COGs; ring 6-9 show the ORFs similarity between the genome of L. arseniciresistens ZS79T and the genomes of L. conceretionis Ko07T, L. daejeonensis GH1-9T, L. capsici AZ78 and L. defluvii IMMIB APB-9T; ring 10 shows the G+C% content plot

Genome statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome Graphical circular map of L. arseniciresistens ZS79T genome. From outer to inner, ring 1 shows the genomic islands (red bars) that were predicted by IslandViewer [34]; ring 3,4 show the predicted genes on forward/reverse strand; ring 2,5 show the genes assigned to COGs; ring 6-9 show the ORFs similarity between the genome of L. arseniciresistens ZS79T and the genomes of L. conceretionis Ko07T, L. daejeonensis GH1-9T, L. capsici AZ78 and L. defluvii IMMIB APB-9T; ring 10 shows the G+C% content plot

Insights from the genome sequences

To obtain features of genomes, we sequenced four genomes of genus and performed comparative genomic analysis among the five available genomes of this genus. The general features of these five genomes are summarized in Table 5. To calculate the pan-genome and core-genome of these five genomes, we performed orthologs clustering analysis using OrthoMCL [21]. The pan-genome has 6,409 orthologs families and the core-genome has 1,207 orthologs. The numbers of unique genes of each genome are showed in Fig. 4. To evaluate the genome variation of these five genomes, we first performed multiple alignments among these genome sequences using MAUVE [22] and then calculated the nucleotide diversity using DnaSP v5 [23]. These five genomes shared 0.73 Mb co-linear sequences. The π value of these sequences among these five genomes is 0.173 which means that the approximate nucleotide sequence homology is 83 % among genomes of [23].

Table 5

General features of the five Lysobacter genomesa

Strains	Source	Size (Mb)	G+C content	CDSs	rRNA clusters	tRNAs	Genome status			GenBank No.
Strains	Source	Size (Mb)	G+C content	CDSs	rRNA clusters	tRNAs	Draft/finished	Contigs	Contigs N50 (bp)	GenBank No.
L. arseniciresistens ZS79^T	Iron-mined soil	3.1	69.58 %	2,363	3	46	Draft	109	101,761	AVPT00000000
L. conceretionis Ko07^T	Anaerobic granules	3.0	67.25 %	2,232	3	46	Draft	26	386,139	AVPS00000000
L. daejeonensis GH1-9^T	Green house soils	3.3	67.29 %	2,570	4	48	Draft	99	101,460	AVPU00000000
L. defluvii IMMIB APB-9^T	Municipal solid waste	2.7	70.22 %	2,443	13	44	Draft	578	16,113	AVBH00000000
L. capsici AZ78	Tobacco & tomato rhizosphere	6.3	66.43 %	5,139	8	65	Draft	174	101,988	JAJA00000000

aThe genome of L. arseniciresistens ZS79T, L. conceretionis Ko07T, L. daejeonensis GH1-9T and L. defluvii IMMIB APB-9T are sequenced in this study. The genome of L. capsici AZ78 was sequenced by Puoplo et al. [9]

Fig. 4

The core-genome and the unique genes of the five Lysobacter genomes. The Venn diagram shows the number of orthologous gene families of the core-genome (in the center) and the numbers of unique genes of each genome

General features of the five Lysobacter genomesa aThe genome of L. arseniciresistens ZS79T, L. conceretionis Ko07T, L. daejeonensis GH1-9T and L. defluvii IMMIB APB-9T are sequenced in this study. The genome of L. capsici AZ78 was sequenced by Puoplo et al. [9] The core-genome and the unique genes of the five Lysobacter genomes. The Venn diagram shows the number of orthologous gene families of the core-genome (in the center) and the numbers of unique genes of each genome In the genome of ZS79T, we found that the genomic island distributions are consistent with the genome C + G content anomaly areas (Fig. 3). In addition, few gene sequences from the other four genomes could be aligned with these genomic island regions (Fig. 3, ring 6 to ring 9). These results indicated that the genes within the genomic islands were most probably acquired by horizontal transfer [24] and these regions are unique in the genome of ZS79T. According to Kyoto Encyclopedia of Genes and Genomes [25] annotation result, all of the five genomes have a nearly complete type II secretion system which could secret cell wall degrading enzymes [26]. This result may correspond to the behavior of members that were able to lyse cells of many microorganisms [3]. In addition, the genomes of ZS79T, Ko07T and IMMIB APB-9T contain genes for flagellar assembly, whereas the genome of GH1-9T does not contain any genes for flagellar assembly and AZ78 does not contain genes for flagellar filament (Additional file 1: Table S2). These genotypes correspond to the phenotype descriptions that and are non-motile [8, 11]. Genomic analysis showed eight genes corresponding to arsenic resistance in the genomes of ZS79T (Additional file 1: Table S3). This result well explained the arsenite resistance of this strain [1]. By contrast, fewer arsenic resistance were found in the genomes of Ko07T, IMMIB APB-9T, AZ78, and GH1-9T compared to strain ZS79T.

Conclusions

The genomic information of ZS79T and the comparative genomics analysis of the five strains are obtained. The genomic based phylogeny is in agreement with the 16S rRNA gene based one indicating the usefulness of genomic information for bacterial taxonomic classification. Analysis of the genomes show certain correlation between the genotypes and the phenotypes.

28 in total

1. Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors: A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal: J Mol Biol Date: 2001-01-19 Impact factor: 5.469

2. Improved prediction of signal peptides: SignalP 3.0.

Authors: Jannick Dyrløv Bendtsen; Henrik Nielsen; Gunnar von Heijne; Søren Brunak
Journal: J Mol Biol Date: 2004-07-16 Impact factor: 5.469

Review 3. Type II secretion: a protein secretion system for all seasons.

Authors: Nicholas P Cianciotto
Journal: Trends Microbiol Date: 2005-10-10 Impact factor: 17.079

4. Lysobacter defluvii sp. nov., isolated from municipal solid waste.

Authors: A F Yassin; Wen-Ming Chen; H Hupfer; C Siering; R M Kroppenstedt; A B Arun; Wei-An Lai; Fo-Ting Shen; P D Rekha; Chiu Chung Young
Journal: Int J Syst Evol Microbiol Date: 2007-05 Impact factor: 2.747

5. Lysobacter spongiicola sp. nov., isolated from a deep-sea sponge.

Authors: Lyudmila A Romanenko; Masataka Uchino; Naoto Tanaka; Galina M Frolova; Valery V Mikhailov
Journal: Int J Syst Evol Microbiol Date: 2008-02 Impact factor: 2.747

6. Lysobacter daecheongensis sp. nov., isolated from sediment of stream near the Daechung dam in South Korea.

Authors: Leonid N Ten; Hae-Min Jung; Wan-Taek Im; Soon-Ae Yoo; Sung-Taik Lee
Journal: J Microbiol Date: 2008-10-31 Impact factor: 3.422

7. OrthoMCL: identification of ortholog groups for eukaryotic genomes.

Authors: Li Li; Christian J Stoeckert; David S Roos
Journal: Genome Res Date: 2003-09 Impact factor: 9.043

8. IslandViewer: an integrated interface for computational identification and visualization of genomic islands.

Authors: Morgan G I Langille; Fiona S L Brinkman
Journal: Bioinformatics Date: 2009-01-16 Impact factor: 6.937

9. SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors: Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal: Gigascience Date: 2012-12-27 Impact factor: 6.524

10. The COG database: an updated version includes eukaryotes.

Authors: Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal: BMC Bioinformatics Date: 2003-09-11 Impact factor: 3.169

1 in total

1. Pea Broth Enhances the Biocontrol Efficacy of Lysobacter capsici AZ78 by Triggering Cell Motility Associated with Biogenesis of Type IV Pilus.

Authors: Selena Tomada; Gerardo Puopolo; Michele Perazzolli; Rita Musetti; Nazia Loi; Ilaria Pertot
Journal: Front Microbiol Date: 2016-07-26 Impact factor: 5.640

1 in total