Literature DB >> 28138355

Draft genome sequence of Arthrobacter sp. strain B6 isolated from the high-arsenic sediments in Datong Basin, China.

Linghua Xu1,2, Wanxia Shi1, Xian-Chun Zeng1, Ye Yang1, Lingli Zhou1, Yao Mu1, Yichen Liu1.   

Abstract

Arthrobacter sp. B6 is a Gram-positive, non-motile, facultative aerobic bacterium, isolated from the arsenic-contaminated aquifer sediment in the Datong basin, China. This strain displays high resistance to arsenic, and can dynamically transform arsenic under aerobic condition. Here, we described the high quality draft genome sequence, annotations and the features of Arthrobacter sp. B6. The G + C content of the genome is 64.67%. This strain has a genome size of 4,663,437 bp; the genome is arranged in 8 scaffolds that contain 25 contigs. From the sequences, 3956 protein-coding genes, 264 pseudo genes and 89 tRNA/rRNA-encoding genes were identified. The genome analysis of this strain helps to better understand the mechanism by which the microbe efficiently tolerates arsenic in the arsenic-contaminated environment.

Entities:  

Keywords:  Arsenate reduction; Arthrobacter sp. B6; Datong basin; Genome; High-arsenic sediment

Year:  2017        PMID: 28138355      PMCID: PMC5259909          DOI: 10.1186/s40793-017-0231-9

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

The genus was first proposed in 1947 by Conn and Dimmick [1], belongs to the family of in the class of . Recently, based on the intrageneric phylogeny and chemotaxonomic characteristics, the description of the genus sensu lato was emended by Busse, and the genus sensu stricto was restricted to , , and [2]. Due to their nutritional versatility and tolerance to various environmental stressors [3-7], Arthrobacter species are widely present in soils and the environments contaminated with chemicals and heavy metal [8-13], as well as extreme environments, such as Antarctic and radioactive sediments [14, 15]. sp. B6 was isolated from an arsenic-contaminated sediment sample collected from the Datong Basin, China, where the uses of high arsenic groundwater for drinking and irrigation have resulted in endemic arsenic poisoning among tens of thousands of residents [16]. Strain B6 is of particular interest because it showed high level of resistance to arsenic and can dynamically transform arsenic under aerobic condition. Here, we presented a summary of the taxonomic characterization of sp. B6 and its main genomic features. These data help to better understand the microbial detoxification mechanism for arsenic, and are useful for the comparisons of the genomic and physiological features between this isolate and other species.

Organism information

Classification and features

sp. B6 is a Gram-positive, non-motile, facultative aerobic bacterium. Cells are straight or slightly curved rods during log phase of bacterial growth (Fig. 1) and become coccoid in stationary phase. The bacteria cells formed white colonies on 0.1× Trypticase Soy Broth agar plate. Colonies are convex and circular with entire margin. The strain can grow at a wide range of temperatures from 4 to 37 °C; the optimum is 30 °C. It can proliferate in a pH range of 6.0–8.5; the optimum is 7.0. The strain tolerates high concentrations of NaCl up to approximately 7% (Table 1). It is catalase- and oxidase-positive. It hydrolyzes starch and tyrosine, but not o-nitrophenyl-β-d-galactoside, gelatin, aesculin, chitin, casein or cellulose. It is negative for nitrate reduction, H2S production, citrate utilization, indole production, arginine dihydrolase and urease activity.
Fig. 1

Images of Arthrobacter sp. B6 using scanning electron microscopy (Left) and the appearance of colony morphology on 0.1× Trypticase Soy Broth solid media (Right)

Table 1

Classification and general features of Arthrobacter sp. B6 [19]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Bacteria TAS [24]
Phylum Actinobacteria TAS [25]
Class Actinobacteria TAS [26]
Order Actinomycetales TAS [27, 28]
Family Micrococcaceae TAS [27, 29]
Genus Arthrobacter TAS [1, 2]
Species undetermined-
Strain: B6IDA
Gram stainPositiveIDA
Cell shapePolymorphic: rod to coccus shapedIDA
MotilityNon-motileIDA
SporulationNon-sporulatingIDA
Temperature range4–37 °CIDA
Optimum temperature30 °CIDA
pH range; Optimum6.0–8.5; 7IDA
Carbon sourceDextrin, Tween 40, D-fructose, Gentiobiose, α-D-glucose, Lactulose, Maltotriose, D-mannose, D-mannitol, D-melezitose, Palatinose, D-psicose, D-raffinose, L-rhamnose, D-ribose, D-sorbitol, Sucrose, Turanose, α- hydroxybutyric acid, α-ketoglutaric acid, L-malic acid, Pyruvic acid, D-alanine, L-alanine, L-serine, Glycerol, Adenosine, 2-deoxy adenosine, Inosine.IDA
MIGS-6HabitatSoil, sedimentIDA
MIGS-6.3Salinity1–7% NaCl (w/v)IDA
MIGS-22Oxygen requirementAerobicIDA
MIGS-15Biotic relationshipfree-livingIDA
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationDatong basin, Shanxi, ChinaIDA
MIGS-5Sample collectionAugust 2011IDA
MIGS-4.1Latitude39.4899IDA
MIGS-4.2Longitude112.915IDA
MIGS-4.4AltitudeNot recorded

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30]

Images of Arthrobacter sp. B6 using scanning electron microscopy (Left) and the appearance of colony morphology on 0.1× Trypticase Soy Broth solid media (Right) Classification and general features of Arthrobacter sp. B6 [19] aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30] The strain utilizes dextrin, tween 40, D-fructose, gentiobiose, α-D-glucose, lactulose, maltotriose, D-mannose, D-mannitol, D-melezitose, palatinose, D-psicose, D-raffinose, L-rhamnose, D-ribose, D-sorbitol, sucrose, turanose, α- hydroxybutyric acid, α-ketoglutaric acid, L-malic acid, pyruvic acid, D-alanine, L-alanine, L-serine, glycerol, adenosine, 2-deoxy adenosine and inosine as tested using the Biolog GP2 microplate system. The major fatty acids of strain B6 are anteiso-C15:0 (56.58%), anteiso-C17:1ω9c (8.89%), anteiso-C17:0 (8.22%), iso-C15:0 (7.63%), iso-C16:0 (5.26%), sum in feature 3 (4.31%), summed feature 3 (containing C16:1ω6c and/or C16:1ω7c) (4.31%) and iso-C16:1 H (2.32%). These data suggested that the morphological and biochemical traits and fatty acid profile of B6 are consistent with those of other described species of the genus . The 16S rRNA gene sequence of strain B6 shares 94.67–99.59% identities with those of other known species of the genus . In order to evaluate the evolutionary relationships between B6 and other known strains of the genus , the 16S rRNA gene sequence of all of these bacteria were aligned using ClustalW [17], and a phylogenetic tree were conducted using the maximum-likelihood and neighbor-joining algorithms implemented in MEGA 6.0, respectively [18]. The phylogeny illustrated that the strain B6 is closely associated with , , and ; suggesting that B6 is affiliated with the genus (Fig. 2). We also found that sp. B6 showed high resistance to arsenic, with maximal inhibitory concentrations of 150.0 mM for arsenate and 5.0 mM for arsenite. A dynamic transformation of arsenic catalyzed by strain B6 was observed when it was cultured aerobically with arsenate.
Fig. 2

Phylogenetic tree based on 16S rRNA gene sequences showing the phylogenetic position of Arthrobacter sp. B6 (●). Sequences were aligned with the CLUSTAL W program and were constructed using maximum-likelihood method implemented in MEGA 6.0 program [17, 18]. GenBank accession numbers are listed in parentheses. Type strains are indicated with a superscript T. Strains with published genomes are shown in bold. Bootstrap support values for 1000 replications above 50% are shown near nodes. The scale bar indicates 0.05 nucleotide substitution per nucleotide position

Phylogenetic tree based on 16S rRNA gene sequences showing the phylogenetic position of Arthrobacter sp. B6 (●). Sequences were aligned with the CLUSTAL W program and were constructed using maximum-likelihood method implemented in MEGA 6.0 program [17, 18]. GenBank accession numbers are listed in parentheses. Type strains are indicated with a superscript T. Strains with published genomes are shown in bold. Bootstrap support values for 1000 replications above 50% are shown near nodes. The scale bar indicates 0.05 nucleotide substitution per nucleotide position

Genome sequencing information

Genome project history

sp. strain B6 was selected for sequencing on the basis of its high resistance to arsenic and dynamic arsenic transformation capability. The Whole Genome Shotgun project has been deposited at DDBJ/EMBL/GenBank database under the accession number LQAP00000000. A summary of the main project information on compliance with MIGS version 2.0 is shown in Table 2 [19].
Table 2

Project information

MIGS IDPropertyTerm
MIGS 31Finishing qualityHigh-Quality Permanent Draft
MIGS-28Libraries usedIllumina Std. shotgun library
MIGS 29Sequencing platformsIllumina HiSeq 2000
MIGS 31.2Fold coverage161 ×
MIGS 30AssemblersSOAPdenovo v2.04
MIGS 32Gene calling methodGlimmer v3.02
Locus TagAU175
Genbank IDLQAP01000000
GenBank Date of ReleaseJun 15, 2016
GOLD IDGs0118476
BIOPROJECTPRJNA306410
MIGS 13Source Material IdentifierCGMCC 1.15656
Project relevanceBiotechnological, Environmental
Project information

Growth conditions and genomic DNA preparation

Strain B6 was grown at 30 °C in 0.1× Trypticase Soy Broth liquid medium to mid-exponential phase. Genomic DNA was extracted from 0.5 to 1.0 g of cells using the modified method of Marmur [20]. The purity of DNA, expressed as the value of A260/A280, was assessed on a NanoDrop™ ND-1000 Spectrophotometer (Biolab).

Genome sequencing and assembly

The draft genome of sp. B6 was sequenced at the Beijing Genomics Institute (BGI, Shenzhen) using the high throughout sequencing technique. A standard Illumina shotgun library was constructed and sequenced using the Illumina HiSeq 2000 platform; this generated 8,355,450 clean reads totaling 752 Mbp. These reads were assembled using the Short Oligonucleotide Analysis Package (SOAPdenovo v2.04) with all parameters set to default [21]. The final draft assembly contains 25 contigs in 8 scaffolds. Final assembly was based on all clean reads that provide an average of 161-fold coverage of the genome. The total size of the genome is 4.66 Mbp.

Genome annotation

Genes were identified using Glimmer v3.02 [22]. The predicted CDSs were translated into amino acid sequences that were used as queries to BLAST the GenBank, Swissprot, InterPro, KEGG, COG and GO databases, respectively. These data were combined to assert a product description for each predicted protein. Additional gene prediction analysis and functional annotation was performed using the Integrated Microbial Genomes-Expert Review (IMG-ER) platform [23].

Genome properties

The assembly of the draft genome sequence consists of 8 scaffolds amounting to 4,663,437 bp. The G + C content is 64.67% (Table 3). From the genome, 4309 genes were predicted, of which 3956 are protein-coding genes. Among these protein-coding genes, 154 were assigned to putative functions, and 275 were annotated as hypothetical proteins. The assignment of genes into COGs functional categories is presented in Table 4 and Fig. 3.
Table 3

Genome statistics

AttributeValue% of Total
Genome size (bp)4,663,437100.00
DNA coding (bp)4,100,73987.93
DNA G + C (bp)3,015,84564.67
DNA scaffolds8100.00
Total genes4309100.00
Protein coding genes395691.81
RNA genes892.07
Pseudo genes2646.12
Genes in internal clusters425098.63
Genes with function prediction352781.85
Genes assigned to COGs221051.29
Genes with Pfam domains346480.39
Genes with signal peptides2205.11
Genes with transmembrane helices2495.78
CRISPR repeats1252.90
Table 4

Number of genes associated with general COG functional categories

CodeValue%ageDescription
J1456.56Translation, ribosomal structure and biogenesis
A10.05RNA processing and modification
K1627.33Transcription
L1104.98Replication, recombination and repair
B10.05Chromatin structure and dynamics
D120.54Cell cycle control, Cell division, chromosome partitioning
V261.18Defense mechanisms
T582.62Signal transduction mechanisms
M723.26Cell wall/membrane biogenesis
N00Cell motility
U180.81Intracellular trafficking and secretion
O652.94Posttranslational modification, protein turnover, chaperones
C1687.60Energy production and conversion
G22510.18Carbohydrate transport and metabolism
E27212.31Amino acid transport and metabolism
F713.21Nucleotide transport and metabolism
H1115.02Coenzyme transport and metabolism
I1034.66Lipid transport and metabolism
P1275.75Inorganic ion transport and metabolism
Q662.99Secondary metabolites biosynthesis, transport and catabolism
R26612.04General function prediction only
S1315.93Function unknown
-209948.71Not in COGs

The total is based on the total number of protein coding genes in the genome

Fig. 3

A graphical circular map of the genome performed with CGview comparison tool [31]. From outside to center, ring 1 and 4 show protein-coding genes oriented in the forward (colored by COG categories) and reverse (colored by COG categories) directions, respectively. ring 2 and 3 denote genes on forward/reverse strand; ring 5 shows G + C% content plot, and the inner-most ring shows GC skew, purple indicating negative values and olive, positive values

Genome statistics Number of genes associated with general COG functional categories The total is based on the total number of protein coding genes in the genome A graphical circular map of the genome performed with CGview comparison tool [31]. From outside to center, ring 1 and 4 show protein-coding genes oriented in the forward (colored by COG categories) and reverse (colored by COG categories) directions, respectively. ring 2 and 3 denote genes on forward/reverse strand; ring 5 shows G + C% content plot, and the inner-most ring shows GC skew, purple indicating negative values and olive, positive values

Insights from the genome sequence

Genome comparison using the RAST Prokaryotic Genome Annotation Server revealed that the genome sequence of sp. B6 is most similar to that of sp. FB24 (comparison score: 536), but less similar to those of other strains. sp. B6 shares 2035, 2011, 1958, 1930, 1850 and 1829 genes with the strains NBRC 12137, sp. FB24, NIO-1008, SJCon, DSM 16402 and BAB-32, respectively. A three-gene (arsR-acr3-arsC) operon involved in the regulation of arsenate tolerance and reduction was identified from the genome of sp. B6. The putative arsenate reductase (ArsC) of strain B6 shows 96% and 95% sequence identities to those of sp. Leaf137 and Sphe3, respectively. It also shows 89% identities to those of NBRC 12137, SJCon, NIO-1008 and sp. FB24, respectively. The amino acid sequence of ACR3 displays 85% identity to that of the arsenic transporter from sp. FB24. Numerous genes responsible for tolerance or detoxification of metals were identified from the genome of sp. B6, including copper resistance protein CopC and CopD, copper chaperone, copper-translocating P-type ATPase, cobalt-zinc-cadmium resistance protein CzcD, mercuric reductase, DNA gyrase subunit A and B involved in fluoroquinolones resistance, various polyols ABC transporter and DedA protein involved in the uptake of selenate and selenite. In addition, there are some genes in the genome responsible for osmotic stress. The high tolerance of salt (7% NaCl) of strain B6 may be explained by the presence of glycine betaine ABC transport system permease protein in the genome.

Conclusions

In the present study, we characterized the genome of sp. B6 that was isolated from the arsenic-contaminated aquifer sediment in the Datong Basin, China. It contains numerous genes involved in heavy metal tolerance and detoxification. The knowledge of the genome sequence of sp. B6 lays foundation for better understanding of the special metabolic abilities of the strain and for elucidation of the metabolic diversity of bacteria inhabiting in the high-arsenic environment. Further functional analyses of the identified genes may gain insights into the detailed molecular mechanisms by which the microbes tolerate and transform arsenic in the arsenic-contaminated environments.
  25 in total

1.  Soil Bacteria Similar in Morphology to Mycobacterium and Corynebacterium.

Authors:  H J Conn; I Dimmick
Journal:  J Bacteriol       Date:  1947-09       Impact factor: 3.490

2.  Studies in the Nomenclature and Classification of the Bacteria: II. The Primary Subdivisions of the Schizomycetes.

Authors:  R E Buchanan
Journal:  J Bacteriol       Date:  1917-03       Impact factor: 3.490

3.  MEGA6: Molecular Evolutionary Genetics Analysis version 6.0.

Authors:  Koichiro Tamura; Glen Stecher; Daniel Peterson; Alan Filipski; Sudhir Kumar
Journal:  Mol Biol Evol       Date:  2013-10-16       Impact factor: 16.240

4.  Genetic and physiological diversity of phylogenetically and geographically distinct groups of Arthrobacter isolated from lead-zinc mine tailings.

Authors:  Zhang Hanbo; Duan Changqun; Shao Qiyong; Ren Weimin; Sha Tao; Cheng Lizhong; Zhao Zhiwei; Hu Bin
Journal:  FEMS Microbiol Ecol       Date:  2004-08-01       Impact factor: 4.194

Review 5.  Review of the taxonomy of the genus Arthrobacter, emendation of the genus Arthrobacter sensu lato, proposal to reclassify selected species of the genus Arthrobacter in the novel genera Glutamicibacter gen. nov., Paeniglutamicibacter gen. nov., Pseudoglutamicibacter gen. nov., Paenarthrobacter gen. nov. and Pseudarthrobacter gen. nov., and emended description of Arthrobacter roseus.

Authors:  Hans-Jürgen Busse
Journal:  Int J Syst Evol Microbiol       Date:  2015-10-20       Impact factor: 2.747

6.  Long-term performance of rapid oxidation of arsenite in simulated groundwater using a population of arsenite-oxidizing microorganisms in a bioreactor.

Authors:  Hao Li; Xian-Chun Zeng; Zhong He; Xiaoming Chen; Guoji E; Yiyang Han; Yanxin Wang
Journal:  Water Res       Date:  2016-05-20       Impact factor: 11.236

7.  Phylogenetic and physiological diversity of Arthrobacter strains isolated from unconsolidated subsurface sediments.

Authors:  F H Crocker; J K Fredrickson; D C White; D B Ringelberg; D L Balkwill
Journal:  Microbiology       Date:  2000-06       Impact factor: 2.777

8.  Comparing thousands of circular genomes using the CGView Comparison Tool.

Authors:  Jason R Grant; Adriano S Arantes; Paul Stothard
Journal:  BMC Genomics       Date:  2012-05-23       Impact factor: 3.969

9.  Genome Sequence of Arthrobacter sp. YC-RL1, an Aromatic Compound-Degrading Bacterium.

Authors:  Lei Ren; Yanhua Shi; Yang Jia; Yanchun Yan
Journal:  Genome Announc       Date:  2015-07-09

10.  SOAPdenovo2: an empirically improved memory-efficient short-read de novo assembler.

Authors:  Ruibang Luo; Binghang Liu; Yinlong Xie; Zhenyu Li; Weihua Huang; Jianying Yuan; Guangzhu He; Yanxiang Chen; Qi Pan; Yunjie Liu; Jingbo Tang; Gengxiong Wu; Hao Zhang; Yujian Shi; Yong Liu; Chang Yu; Bo Wang; Yao Lu; Changlei Han; David W Cheung; Siu-Ming Yiu; Shaoliang Peng; Zhu Xiaoqian; Guangming Liu; Xiangke Liao; Yingrui Li; Huanming Yang; Jian Wang; Tak-Wah Lam; Jun Wang
Journal:  Gigascience       Date:  2012-12-27       Impact factor: 6.524

View more
  2 in total

1.  Biodegradation of persistent environmental pollutants by Arthrobacter sp.

Authors:  Xiaohong Guo; Chengyun Xie; Lijuan Wang; Qinfan Li; Yan Wang
Journal:  Environ Sci Pollut Res Int       Date:  2019-01-31       Impact factor: 4.223

2.  Draft Genome Sequence of Arthrobacter oryzae TNBS02, a Bacterium Containing Heavy Metal Resistance Genes, Isolated from Soil of Antarctica.

Authors:  Yong-Joon Cho; Ahnna Cho; Soon Gyu Hong; Han-Gu Choi; Ok-Sun Kim
Journal:  Microbiol Resour Announc       Date:  2019-01-24
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.