Literature DB >> 26516404

Genomic information of the arsenic-resistant bacterium Lysobacter arseniciresistens type strain ZS79(T) and comparison of Lysobacter draft genomes.

Lin Liu1, Shengzhe Zhang1, Meizhong Luo1, Gejiao Wang1.   

Abstract

Lysobacter arseniciresistens ZS79(T) is a highly arsenic-resistant,rod-shaped, motile, non-spore-forming, aerobic, Gram-negative bacterium. In this study, four Lysobacter type strains were sequenced and the genomic information of L. arseniciresistens ZS79(T) and the comparative genomics results of the Lysobacter strains were described. The draft genome sequence of the strain ZS79(T) consists of 3,086,721 bp and is distributed in 109 contigs. It has a G+C content of 69.5 % and contains 2,363 protein-coding genes including eight arsenic resistant genes.

Entities:  

Keywords:  Comparative genomics; Genome sequence; Lysobacter; Lysobacter arseniciresistens; Xanthomonadaceae

Year:  2015        PMID: 26516404      PMCID: PMC4624666          DOI: 10.1186/s40793-015-0070-5

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

type strain ZS79T (=CGMCC 1.10752T = KCTC 23365 T) belongs to family [1]. It is an arsenic-resistant bacterium isolated from subsurface soil of Tieshan iron mine, Daye City, P. R. China [1]. So far, there are 32 validly published species of [2]. Most of these strains were isolated from soil except that [3] and [4] were isolated from water, and [5], daecheongensis [6] [7] were isolated from sludge, sediment and deep-sea sponge, respectively. So far, the genomic sequences of two strains have been published ( AZ78 [8, 9] and 13-6 [10]), but the annotation of 13-6 was not completed. In order to provide genome information of genus , we performed whole genome sequencing of four strains of ( ZS79T, conceretionis Ko07T [5], GH1-9T [11], and IMMIB APB-9T [12]). In this study, the genome features of ZS79T is provided and the comparative results of five genomes of are presented.

Organism information

Classification and features

Members of genus are rod-shaped, aerobic, Gram-negative bacteria [3]. Their G+C contents are 65.4–70.1 %. They use NO−, NH+, glutamate, asparaginate as sole nitrogen sources, Q-8 as the major respiratory quinone, and diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylglycerol, phosphatidyl-N-methylethanolamine as the major polar lipids [3, 8]. In addition, they could lyse cells of many creatures including bacteria, filamentous fungi, yeasts, algae and nematodes [3]. Phylogenetic analyses of ZS79T and its related strains of family were performed based on 16S rRNA genes (Fig. 1a) and 831 conserved proteins (Fig. 1b). In both trees, strain ZS79T is clustered with the other four strains of genus . The phylogenies of the two trees are similar but genomic based tree is more stable than the 16S rRNA gene one (Fig. 1b vs 1a).
Fig. 1

Phylogenetic analyses indicating the position of L. arseniciresistens (in bold) in family Xanthomonadaceae. a The NJ tree based on aligned sequences of 16S rRNA of ten strains of family Xanthomonadaceae. b The NJ tree based on 831 conserved proteins among the ten Xanthomonadaceae strains. Phylogenetic analyses were performed using MEGA version 6 [33]. The trees were built using p-distance model and a bootstrap analysis of 1000 replicates. The GenBank numbers are listed after each strain

Phylogenetic analyses indicating the position of L. arseniciresistens (in bold) in family Xanthomonadaceae. a The NJ tree based on aligned sequences of 16S rRNA of ten strains of family Xanthomonadaceae. b The NJ tree based on 831 conserved proteins among the ten Xanthomonadaceae strains. Phylogenetic analyses were performed using MEGA version 6 [33]. The trees were built using p-distance model and a bootstrap analysis of 1000 replicates. The GenBank numbers are listed after each strain ZS79T is aerobic, motile, and Gram-negative bacterium with a Minimum Inhibitory Concentration of 14 mM arsenite in R2A medium (Table 1). The cells are rod-shaped with one flagellum and non-spore-forming (Fig. 2). Colonies of this strain are yellow, nontransparent, convex, circular, and, smooth [1].
Table 1

Classification and general features of L. arseniciresistens ZS79T according to the MIGS recommendations [27]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [28]
Phylum Proteobacteria TAS [29]
Class Gammaproteobacteria TAS [29, 30]
Order Xanthomonadales TAS [30, 31]
Family Xanthomonadaceae TAS [30, 31]
Genus Lysobacter TAS [3]
Species Lysobacter arseniciresistens TAS [1]
Type strain: ZS79T (=CGMCC 1.10752T = KCTC 23365T).
Gram stainnegativeTAS [1]
Cell shaperod-shapedTAS [1]
MotilitymotileTAS [1]
Sporulationnon-spore-formingTAS [1]
Temperature range4–37 °CTAS [1]
Optimum temperature28 °CTAS [1]
pH range; Optimum5.0–9.0; 7.0TAS [1]
Carbon sourcetyrosine, hippurate, gelatin, 3-hydroxybutyric acidTAS [1]
MIGS-6Habitat subsurface soilTAS [1]
MIGS-6.3Salinity0–4 % NaCl (w/v)TAS [1]
MIGS-22Oxygen requirementaerobicTAS [1]
MIGS-15Biotic relationshipfree-livingNAS
MIGS-14Pathogenicitynon-pathogenNAS
MIGS-4Geographic locationDaye City, Hubei province, ChinaTAS [1]
MIGS-5Sample collection2011TAS [1]
MIGS-4.1Latitude30.207178 NTAS [1]
MIGS-4.2Longitude114.901092 ETAS [1]
MIGS-4.4Altitudenot reported

a: Evidence codes – TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32]

Fig. 2

Transmission electron microscopy of L. arseniciresistens ZS79T

Classification and general features of L. arseniciresistens ZS79T according to the MIGS recommendations [27] a: Evidence codes – TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [32] Transmission electron microscopy of L. arseniciresistens ZS79T The major ubiquinone is Q-8, the major cellular fatty acids (>10 %) are iso-C: 0, iso-C:1ω9ϲ, iso-C:0, iso-C:0 and iso-C:0 3-OH. The polar lipids are diphosphatidylglycerol, phosphatidylethanolamine, phosphatidylglycerol and a kind of unknown phospholipid The C + G content is was 70.7 mol% (HPLC) [1].

Genome sequencing and annotation

Genome project history

The genome of ZS79T was sequenced in April, 2013 and finished within two months. The high-quality draft genome sequence is available in GenBank database under accession number AVPT00000000. The genome sequencing project information is summarized in Table 2.
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityHigh-quality draft
MIGS-28Libraries usedIllumina Paired-End library (300 bp insert size)
MIGS 29Sequencing platformsIllumina Hiseq2000
MIGS 31.2Fold coverage272.6×
MIGS 30AssemblersSOAPdenovo v1.05
MIGS 32Gene calling methodGeneMarkS+
Locus TagN799
GenBank IDAVPT00000000
GenBank Date of Release2014/10/24
GOLD IDGi0055236
BIOPROJECTPRJNA214588
MIGS 31Source Material IdentifierZS79T
Project relevanceGenome comparison
Project information

Growth conditions and genomic DNA preparation

ZS79T was cultured in 50 ml of LB (Luria–Bertani) medium at 28 °C for 3 days with 160 160 r/min shaking. About 10 mg cells were harvested by centrifugation and suspended in normal saline, and then lysed using lysozyme. DNA was isolated using cells were harvested by centrifugation and suspended in normal saline, and then lysed using lysozyme. The DNA was extracted and purified using the QiAamp kit according to the manufacturer’s instruction (Qiagen, Germany).

Genome sequencing and assembly

The whole genome sequencing of ZS79T was performed on Illumina Hiseq2000 with Paired-End library strategy (300 bp insert size) at Majorbio Biomedical Science and Technology Co. Ltd. DNA libraries with insert sizes from 300 to 500 bp was constructed using the established protocol [13]. The obtained high quality data contains 4,528,542 × 2 pared reads and 194,996 single reads with an average read length of 91 bp. The sequencing depth was 272.6×. Using SOAPdenovo v1.05 [14] the reads were assembled into 109 contigs with a cumulative genome size of 3,086,721 bp.

Genome annotation

The draft sequence of ZS79T was annotated using the National Center for Biotechnology Information Prokaryotic Genomes Annotation Pipeline [15]. The functions of the predicted genes were determined through blast alignment against the NCBI protein database. Genes were identified using the gene caller GeneMarkS+ with the similarity-based gene detection approach [16]. The different features were predicted by WebMGA [17], TMHMM [18] and SignalP [19].

Genome properties

The whole genome sequence of ZS79T is 3,086,721 bp long with a G+C content of 69.6 % and is distributed into 109 contigs. It has 2,422 predicted genes including 2,363 (97.6 %) protein coding genes, 50 (2.1 %) RNA genes, and 9 (0.4 %) pseudo genes. A total of 1633 (67.4 %) genes have functional prediction, and 1,858 (76.7 %) genes could be assigned to Clusters of Orthologous Groups [20]. More detailed information of the genome statistics is showed in Table 3. The protein functional classification according to COGs is showed in Table 4. The genome map is showed in Fig. 3.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)3,086,721100.00
DNA coding (bp)2,284,15274.00
DNA G+C (bp)2,147,19169.56
DNA scaffolds109
Total genes2,422100.00
Protein coding genes2,36397.56
RNA genes502.06
Pseudo genes90.37
Genes in internal clusters81134.32
Genes with function prediction163367.42
Genes assigned to COGs185876.71
Genes with Pfam domains203884.14
Genes with signal peptides53922.81
Genes with transmembrane helices52722.25
CRISPR repeats10.41
Table 4

Number of genes associated with general COG functional categories

CodeValue%ageDescription
J1576.48Translation, ribosomal structure and biogenesis
A10.04RNA processing and modification
K1164.79Transcription
L1275.24Replication, recombination and repair
B20.08Chromatin structure and dynamics
D271.11Cell cycle control, Cell division, chromosome partitioning
V371.53Defense mechanisms
T1044.29Signal transduction mechanisms
M1255.16Cell wall/membrane biogenesis
N733.01Cell motility
U893.67Intracellular trafficking and secretion
O1084.46Posttranslational modification, protein turnover, chaperones
C1285.28Energy production and conversion
G702.89Carbohydrate transport and metabolism
E1486.11Amino acid transport and metabolism
F502.06Nucleotide transport and metabolism
H913.76Coenzyme transport and metabolism
I903.72Lipid transport and metabolism
P1074.42Inorganic ion transport and metabolism
Q532.19Secondary metabolites biosynthesis, transport and catabolism
R2339.62General function prediction only
S1857.64Function unknown
-56423.29Not in COGs

The total is based on the total number of protein coding genes in the genome

Fig. 3

Graphical circular map of L. arseniciresistens ZS79T genome. From outer to inner, ring 1 shows the genomic islands (red bars) that were predicted by IslandViewer [34]; ring 3,4 show the predicted genes on forward/reverse strand; ring 2,5 show the genes assigned to COGs; ring 6-9 show the ORFs similarity between the genome of L. arseniciresistens ZS79T and the genomes of L. conceretionis Ko07T, L. daejeonensis GH1-9T, L. capsici AZ78 and L. defluvii IMMIB APB-9T; ring 10 shows the G+C% content plot

Genome statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome Graphical circular map of L. arseniciresistens ZS79T genome. From outer to inner, ring 1 shows the genomic islands (red bars) that were predicted by IslandViewer [34]; ring 3,4 show the predicted genes on forward/reverse strand; ring 2,5 show the genes assigned to COGs; ring 6-9 show the ORFs similarity between the genome of L. arseniciresistens ZS79T and the genomes of L. conceretionis Ko07T, L. daejeonensis GH1-9T, L. capsici AZ78 and L. defluvii IMMIB APB-9T; ring 10 shows the G+C% content plot

Insights from the genome sequences

To obtain features of genomes, we sequenced four genomes of genus and performed comparative genomic analysis among the five available genomes of this genus. The general features of these five genomes are summarized in Table 5. To calculate the pan-genome and core-genome of these five genomes, we performed orthologs clustering analysis using OrthoMCL [21]. The pan-genome has 6,409 orthologs families and the core-genome has 1,207 orthologs. The numbers of unique genes of each genome are showed in Fig. 4. To evaluate the genome variation of these five genomes, we first performed multiple alignments among these genome sequences using MAUVE [22] and then calculated the nucleotide diversity using DnaSP v5 [23]. These five genomes shared 0.73 Mb co-linear sequences. The π value of these sequences among these five genomes is 0.173 which means that the approximate nucleotide sequence homology is 83 % among genomes of [23].
Table 5

General features of the five Lysobacter genomesa

StrainsSourceSize (Mb)G+C contentCDSsrRNA clusterstRNAsGenome statusGenBank No.
Draft/finishedContigsContigs N50 (bp)
L. arseniciresistens ZS79T Iron-mined soil3.169.58 %2,363346Draft109101,761AVPT00000000
L. conceretionis Ko07T Anaerobic granules3.067.25 %2,232346Draft26386,139AVPS00000000
L. daejeonensis GH1-9T Green house soils3.367.29 %2,570448Draft99101,460AVPU00000000
L. defluvii IMMIB APB-9T Municipal solid waste2.770.22 %2,4431344Draft57816,113AVBH00000000
L. capsici AZ78Tobacco & tomato rhizosphere6.366.43 %5,139865Draft174101,988JAJA00000000

aThe genome of L. arseniciresistens ZS79T, L. conceretionis Ko07T, L. daejeonensis GH1-9T and L. defluvii IMMIB APB-9T are sequenced in this study. The genome of L. capsici AZ78 was sequenced by Puoplo et al. [9]

Fig. 4

The core-genome and the unique genes of the five Lysobacter genomes. The Venn diagram shows the number of orthologous gene families of the core-genome (in the center) and the numbers of unique genes of each genome

General features of the five Lysobacter genomesa aThe genome of L. arseniciresistens ZS79T, L. conceretionis Ko07T, L. daejeonensis GH1-9T and L. defluvii IMMIB APB-9T are sequenced in this study. The genome of L. capsici AZ78 was sequenced by Puoplo et al. [9] The core-genome and the unique genes of the five Lysobacter genomes. The Venn diagram shows the number of orthologous gene families of the core-genome (in the center) and the numbers of unique genes of each genome In the genome of ZS79T, we found that the genomic island distributions are consistent with the genome C + G content anomaly areas (Fig. 3). In addition, few gene sequences from the other four genomes could be aligned with these genomic island regions (Fig. 3, ring 6 to ring 9). These results indicated that the genes within the genomic islands were most probably acquired by horizontal transfer [24] and these regions are unique in the genome of ZS79T. According to Kyoto Encyclopedia of Genes and Genomes [25] annotation result, all of the five genomes have a nearly complete type II secretion system which could secret cell wall degrading enzymes [26]. This result may correspond to the behavior of members that were able to lyse cells of many microorganisms [3]. In addition, the genomes of ZS79T, Ko07T and IMMIB APB-9T contain genes for flagellar assembly, whereas the genome of GH1-9T does not contain any genes for flagellar assembly and AZ78 does not contain genes for flagellar filament (Additional file 1: Table S2). These genotypes correspond to the phenotype descriptions that and are non-motile [8, 11]. Genomic analysis showed eight genes corresponding to arsenic resistance in the genomes of ZS79T (Additional file 1: Table S3). This result well explained the arsenite resistance of this strain [1]. By contrast, fewer arsenic resistance were found in the genomes of Ko07T, IMMIB APB-9T, AZ78, and GH1-9T compared to strain ZS79T.

Conclusions

The genomic information of ZS79T and the comparative genomics analysis of the five strains are obtained. The genomic based phylogeny is in agreement with the 16S rRNA gene based one indicating the usefulness of genomic information for bacterial taxonomic classification. Analysis of the genomes show certain correlation between the genotypes and the phenotypes.
  28 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Improved prediction of signal peptides: SignalP 3.0.

Authors:  Jannick Dyrløv Bendtsen; Henrik Nielsen; Gunnar von Heijne; Søren Brunak
Journal:  J Mol Biol       Date:  2004-07-16       Impact factor: 5.469

Review 3.  Type II secretion: a protein secretion system for all seasons.

Authors:  Nicholas P Cianciotto
Journal:  Trends Microbiol       Date:  2005-10-10       Impact factor: 17.079

4.  Lysobacter defluvii sp. nov., isolated from municipal solid waste.

Authors:  A F Yassin; Wen-Ming Chen; H Hupfer; C Siering; R M Kroppenstedt; A B Arun; Wei-An Lai; Fo-Ting Shen; P D Rekha; Chiu Chung Young
Journal:  Int J Syst Evol Microbiol       Date:  2007-05       Impact factor: 2.747

5.  Lysobacter spongiicola sp. nov., isolated from a deep-sea sponge.

Authors:  Lyudmila A Romanenko; Masataka Uchino; Naoto Tanaka; Galina M Frolova; Valery V Mikhailov
Journal:  Int J Syst Evol Microbiol       Date:  2008-02       Impact factor: 2.747

6.  Lysobacter daecheongensis sp. nov., isolated from sediment of stream near the Daechung dam in South Korea.

Authors:  Leonid N Ten; Hae-Min Jung; Wan-Taek Im; Soon-Ae Yoo; Sung-Taik Lee
Journal:  J Microbiol       Date:  2008-10-31       Impact factor: 3.422

7.  OrthoMCL: identification of ortholog groups for eukaryotic genomes.

Authors:  Li Li; Christian J Stoeckert; David S Roos
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

8.  IslandViewer: an integrated interface for computational identification and visualization of genomic islands.

Authors:  Morgan G I Langille; Fiona S L Brinkman
Journal:  Bioinformatics       Date:  2009-01-16       Impact factor: 6.937

9.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

10.  The COG database: an updated version includes eukaryotes.

Authors:  Roman L Tatusov; Natalie D Fedorova; John D Jackson; Aviva R Jacobs; Boris Kiryutin; Eugene V Koonin; Dmitri M Krylov; Raja Mazumder; Sergei L Mekhedov; Anastasia N Nikolskaya; B Sridhar Rao; Sergei Smirnov; Alexander V Sverdlov; Sona Vasudevan; Yuri I Wolf; Jodie J Yin; Darren A Natale
Journal:  BMC Bioinformatics       Date:  2003-09-11       Impact factor: 3.169

View more
  1 in total

1.  Pea Broth Enhances the Biocontrol Efficacy of Lysobacter capsici AZ78 by Triggering Cell Motility Associated with Biogenesis of Type IV Pilus.

Authors:  Selena Tomada; Gerardo Puopolo; Michele Perazzolli; Rita Musetti; Nazia Loi; Ilaria Pertot
Journal:  Front Microbiol       Date:  2016-07-26       Impact factor: 5.640

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.