Literature DB >> 24501649

Complete genome sequence of Arthrobacter sp. strain FB24.

Cindy H Nakatsu¹, Ravi Barabote², Sue Thompson², David Bruce², Chris Detter², Thomas Brettin², Cliff Han², Federico Beasley¹, Weimin Chen¹, Allan Konopka³, Gary Xie².

Abstract

Arthrobacter sp. strain FB24 is a species in the genus Arthrobacter Conn and Dimmick 1947, in the family Micrococcaceae and class Actinobacteria. A number of Arthrobacter genome sequences have been completed because of their important role in soil, especially bioremediation. This isolate is of special interest because it is tolerant to multiple metals and it is extremely resistant to elevated concentrations of chromate. The genome consists of a 4,698,945 bp circular chromosome and three plasmids (96,488, 115,507, and 159,536 bp, a total of 5,070,478 bp), coding 4,536 proteins of which 1,257 are without known function. This genome was sequenced as part of the DOE Joint Genome Institute Program.

Entities: Chemical Disease Species

Year: 2013 PMID： 24501649 PMCID： PMC3910542 DOI： 10.4056/sigs.4438185

Source DB: PubMed Journal: Stand Genomic Sci ISSN： 1944-3277

Introduction

strain FB24 was isolated from a microcosm made from soil collected at an Indiana Department of Transport facility in Seymour, Indiana. This site was of particular interest because the soils were contaminated by mixed waste, both petroleum hydrocarbons and extreme metal (chromium and lead) levels [1]. Details of microcosm enrichment and isolation procedures used to obtain the strain have been described previously [2]. This isolate was of particular interest because of its extreme resistance to chromate [3,4]. This work is a part of a larger study determining the compositional and functional diversity of bacterial communities in soils exposed to long-term contamination with metals [5-7].

Classification and features

strain FB24 is a high G+C Gram-positive member of the (Figure 1, Table 1). The strain is a facultative, non-motile aerobe with characteristic morphology of rod-shaped cells (Figure 2) that become coccoid in stationary phase. Strain FB24 is able to use a number carbon sources for growth, including glucose, fructose, lactate, succinate, malate, xylose and aromatic hydrocarbons (hydroxybenzoates, phthalate). Additionally, this strain is resistant to multiple metals: arsenate, arsenite, chromate, cadmium, lead, nickel, and zinc.

Figure 1

Table 1

Classification and general features of strain FB24

MIGS ID	Property	Term	Evidence code^a
		Domain Bacteria	TAS [13]
		Phylum Actinobacteria	TAS [14]
		Class Actinobacteria	TAS [15]
	Current classification	Order Actinomycetales	TAS [15-18]
		Family Micrococcaceae	TAS [15-17,19]
		Genus Arthrobacter	TAS [17,20-23]
		Species Arthrobacter sp.	TAS [14]
		Type strain	TAS [15]
	Gram stain	Positive	IDA
	Cell shape	Polymorphic: Coccus to rod shape	IDA
	Motility	Non-motile	IDA
	Sporulation	Non-sporulating	IDA
	Temperature range	4-37°C	IDA
	Optimum temperature	30°C	IDA
	Carbon source	Yeast extract, glucose, fructose, lactate, succinate, malate, xylose, hydroxybenzoates, phthalate
	Energy source	Yeast extract, glucose, fructose, lactate, succinate, malate, xylose, hydroxybenzoates, phthalate
	Terminal electron receptor	Oxygen or nitrate	IDA
MIGS-6	Habitat	Soil	TAS [1]
	Isolation	Chromate and xylene enriched microcosm composed of anthropogenically disturbed soils	TAS [2]
MIGS-6.3	Salinity
MIGS-22	Oxygen	Facultative aerobe	IDA
MIGS-15	Biotic relationship	Free-living	IDA
MIGS-14	Pathogenicity	Non-pathogenic	NAS
MIGS-4	Geographic location	Seymour, Indiana, USA	TAS [1,2]
MIGS-5	Sample collection time	June 27, 2001	IDA
MIGS-4.1	Latitude	38.9591667	NAS
MIGS-4.2	Longitude	-85.8902778	NAS
MIGS-4.3	Depth	40-90 cm	NAS
MIGS-4.4	Altitude	583 feet	NAS

aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24].

Figure 2

Transmission electron micrograph of strain FB24. Cells were grown in nutrient broth for 15 h (~early stationary phase), fixed in 3% glutaraldehyde in 0.1 M cacodylate buffer, then fixed in reduced osmium, followed by a series of ethanol dehydration steps. Cells are then embedded in Spurr resin, stained with uranyl acetate and Reynold’s lead citrate. Image was captured on Kodak SO-163 film at 33,000× magnification.

Phylogenetic tree of strain FB24 relative to nearest neighboring type strains and strains with finished genome sequences: re117 (FQ311476) [8], TC1 (NC_008709) [9], A6 (NC_011886), Sphe3 (CP002379 [10], DC2201, Microccus luteus Fleming NCTC 2665, ATCC 33209, ATCC 17931, and DY-18. The sequences were aligned in ClustalX and a consensus tree was generated using a 1,000× repeated bootstrapping process [11,12]. aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [24]. Transmission electron micrograph of strain FB24. Cells were grown in nutrient broth for 15 h (~early stationary phase), fixed in 3% glutaraldehyde in 0.1 M cacodylate buffer, then fixed in reduced osmium, followed by a series of ethanol dehydration steps. Cells are then embedded in Spurr resin, stained with uranyl acetate and Reynold’s lead citrate. Image was captured on Kodak SO-163 film at 33,000× magnification.

Genome sequencing information

Genome project history

strain FB24 was chosen for sequencing by DOE-JGI because of its extreme resistance to chromate. Table 2 presents the project information and its association with MIGS version 2.0 compliance [25].

Table 2

Project information

MIGS ID	Property	Term
MIGS-31	Finishing quality	Finished
MIGS-28	Libraries used	Small and medium random shotgun clones
MIGS-29	Sequencing platforms	Sanger
MIGS-31.2	Fold coverage	~15-fold
MIGS-30	Assemblers	Parallel PHRAP
MIGS-32	Gene calling method	Critica, Generation, Glimmer
	Genome Database release	March 1, 2007
	Genbank ID	12640
	Genbank Date of Release	October 24, 2006
	GOLD ID	Gc00445
	Project relevance	Bioremediation, biotechnological, environmental

Growth conditions and DNA isolation

The FB24 culture used for DNA extraction was started from the glycerol stock (stored at -80 ºC) that was made from the original isolate. Cells were streaked onto a 0.1× nutrient agar plate, incubated at 30ºC, then a single colony was used to grow a culture in 0.25× nutrient broth (NB) (Difco, USA). Total genomic DNA was extracted from cells grown in liquid culture using the standard CTAB procedure [26].

Genome sequencing and assembly

The random shotgun method was used in Sanger sequencing the genome of strain FB24 at the DOE-Joint Genome Institution (DOE-JGI). Medium (8 kb) and small (3 kb) insert random libraries were partially sequenced with average success rate of 88% and average high-quality read lengths of 614 nucleotides. Sequences were assembled with parallel phrap (High Performance Software, LLC). Possible mis-assemblies were corrected with Dupfinisher [27] or by analysis of transposon insertions in bridge clones. Gaps between contigs were closed by editing, custom primer walk or PCR amplification. The completed genome sequence of FB24 contains 89530 reads, achieving an average of 15-fold sequence coverage per base with an error rate less than 1 in 100,000. The sequences of FB24 can be accessed using the GenBank accession number NC_008541 for the chromosome and NC_008537, NC_008538, NC_008539 for three plasmids.

Genome annotation

Automated gene prediction was performed by using the output of Critica [28], combined with the output of Generation and Glimmer [29]. The assignment of product descriptions was made by using search results of the following curated databases in this order: TIGRFam; PRIAM (e–30 cutoff); Pfam; Smart; COGs; Swissprot/TrEMBL (SPTR); and KEGG. If there was no significant similarity to any protein in another organism, it was described as “hypothetical protein.” “Conserved hypothetical protein” was used if at least one match was found to a hypothetical protein in another organism. EC numbering was based on searches in PRIAM at an e–10 cutoff; COG and KEGG functional classifications were based on homology searches in the respective databases. Additionally, the tRNAScanSE tool [30] was used to find tRNA genes, whereas ribosomal RNAs were found by using BLASTn vs. the 16S and 23S ribosomal RNA databases. Other “standard” structural RNAs (e.g., 5S rRNA, rnpB, tmRNA, SRP RNA) were found by using covariance models with the Infernal search tool [31]. The HMMTOP program was used to predict the number of transmembrane segments (TMSs) in each protein. Those predicted to have two or more TMSs (about 918 proteins) were used to interrogate the transporter database (TCDB). Peter Karp’s pathologic tool was used for pathway prediction [32]. This method largely relies on the keyword matching and other automatic methods to manually curate some of the pathways, such as aromatic compound degradation. Metabolic pathways were constructed using MetaCyc as a reference data set [33].

Genome properties

The 5,070,478- base pair genome of is composed of a single 4,698,945-base pair circular chromosome and three large circular plasmids (96,488, 115,507, and 159,536 bp) (Table 3) with GC content of 65.5, 64.7, 63.3 and 65.0%, respectively. Based on a summary of genomic features listed on the Integrated Microbial Genomes (IMG) [34] there are 4,536 protein coding sequences identified, of which 3,279 (70.94%, Table 4) have been assigned to a COG functional category (Table 5, Figure 3and Figure 4). There are 1,257 (27.19%) predicted genes without an associated function.

Table 3

Summary of genome

Label	Size (bp)	Topology	INSDC identifier	RefSeq ID
Chromosome 1	4,698,945	Circular	CP000454.1	NC_008541.1
Plasmid pFB104	96,488	Circular	CP000457.1	NC_008539.1
Plasmid pFB105	115,507	Circular	CP000456.1	NC_008538.1
Plasmid pFB136	159,536	Circular	CP000455.1	NC_008537.1

Table 4

Nucleotide content and gene count levels of the genome

Attribute	Value	% of total^a
Genome size (bp)	5,070,478	100.0
DNA coding region (bp)	4,552,065	89.78
DNA G+C content (bp)	3,315,507	65.39
Total genes^b	4,622	100.00
RNA genes	86	1.86
Protein-coding genes with function prediction	3,279	70.94
Protein coding genes without function prediction	1,257	27.19
Genes in paralog clusters	965	20.88
Genes assigned to COGs	3,361	72.72
Genes with signal peptides	1,098	23.76
Genes with transmembrane helices	1,168	25.27
Paralogous groups	373	100.00

a) The total is based on either the size of the genome in base pairs or on the total number of protein coding genes in the annotated genome.

b) Also includes 54 pseudogenes and 5 other genes

Table 5

Number of genes associated with general COG functional categories

Code	Value	%age^a	Description
J	162	4.27	Translation, ribosomal structure and biogenesis
A	1	0.03	RNA processing and modification
K	363	9.57	Transcription
L	164	4.32	Replication, recombination and repair
B	1	0.03	Chromatin structure and dynamics
D	32	0.84	Cell cycle control, cell division, chromosome partitioning
Y	-	-	Nuclear structure
V	49	1.29	Defense mechanisms
T	162	4.27	Signal transduction mechanisms
M	171	4.51	Cell wall/membrane/envelope biogenesis
N	3	0.08	Cell motility
Z	1	0.03	Cytoskeleton
W	0	0.0	Extracellular structures
U	48	1.27	Intracellular trafficking, secretion, and vesicular transport
O	124	3.27	Posttranslational modification, protein turnover, chaperones
C	239	6.3	Energy production and conversion
G	436	11.49	Carbohydrate transport and metabolism
E	364	9.6	Amino acid transport and metabolism
H	98	2.58	Nucleotide transport and metabolism
I	155	4.09	Lipid transport and metabolism
P	207	5.46	Inorganic ion transport and metabolism
Q	112	2.95	Secondary metabolites biosynthesis, transport and catabolism

R	458	12.07	General function prediction only
S	286	7.54	Function unknown
-	1,261	27.28	Not in COG

a) The total is based on the total number of protein coding genes in the annotated genome.

Figure 3

Circular map of FB24 chromosome, graphical depiction from outside to the center: genes on forward strand, genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Chromosome is not to scale with plasmid maps.

Figure 4

Circular map of three plasmids in FB24, graphical depiction from outside to the center: genes on forward strand, genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Plasmid maps not to scale with each other or with chromosome map.

a) The total is based on either the size of the genome in base pairs or on the total number of protein coding genes in the annotated genome. b) Also includes 54 pseudogenes and 5 other genes a) The total is based on the total number of protein coding genes in the annotated genome. Circular map of FB24 chromosome, graphical depiction from outside to the center: genes on forward strand, genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Chromosome is not to scale with plasmid maps. Circular map of three plasmids in FB24, graphical depiction from outside to the center: genes on forward strand, genes on reverse strand (colored by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew. Plasmid maps not to scale with each other or with chromosome map.

Genome comparisons

A comparative analysis of genome sizes and protein coding genes in FB24 and other species with finished sequences (Table 6) was made from data listed on the IMG website [34]. Included in the comparison is re117 (Gc01419, FQ311476) [8], TC1 (Gc00480, NC_008709) [9], A6 (Gc00930, NC_011886), Rue61a (Gc0006272, CP003203), and Sphe3 (Gc01621, CP002379) [10]. In addition, the draft genome of NBRC 12137 was included because its phylogenetic relatedness to FB24 based on the 16S rRNA gene sequence. Similarity between functional protein groups (based on COG, clusters of orthologous groups) in the genomes of these strains were made and visualized using hierarchical clustering (Figure 5) with tools available on the Joint Genome Institute (JGI) Integrated Microbial Genomes (IMG) site. Also included in the tree were closely related species in the family with finished genomes DC2201 (Gc00769), Microccus luteus Fleming NCTC 2665 (Gc01033), ATCC 33209 (Gc00698), ATCC 17931 (Gc01662), and DY-18 (Gc01162). Detailed information about the genome properties and genome annotation of these strains can be obtained from the JGI-IMG website at the JGI website [35].

Table 6

Comparison of genomes of the genus with finished genome sequences

Genome Name	Genome size (bp)	Gene count	Protein coding	Protein with function	Without function	Plasmid number	rRNA operons
Arthrobacter arilaitensis re117, CIP108037	3,918,192	3,518	3,436	2,390	1,046	2	6

Arthrobacter aurescens TC1	5,226,648	4,793	4,699	3,419	1,280	2	6

Arthrobacter chlorophenolicus A6	4,980,870	4,744	4,641	3,125	1,516	2	5

Arthrobacter nitroguajacolicus Rue61a	5,081,038	4,655	4,584	3,800	784	2	6

Arthrobacter phenanthrenivorans Sphe3	4,535,320	4,273	4,209	3,101	1,108	2	4

Arthrobacter sp. FB24	5,070,478	4,622	4,536	3,279	1,257	3	5

Arthrobacter globiformis NBRC 12137*	4,954,410	4,582	4,529	2,784	1,745	?	1

*Sequence not fully assembled

Figure 5

Hierarchical tree based on similarity of COG groups between genomes. Included are genomes of bacteria in the family with finished genome sequences.

*Sequence not fully assembled Hierarchical tree based on similarity of COG groups between genomes. Included are genomes of bacteria in the family with finished genome sequences.

27 in total

1. Soil Bacteria Similar in Morphology to Mycobacterium and Corynebacterium.

Authors: H J Conn; I Dimmick
Journal: J Bacteriol Date: 1947-09 Impact factor: 3.490

2. Bacterial activity, community structure, and centimeter-scale spatial heterogeneity in contaminated soil.

Authors: Joanna M Becker; Tim Parkin; Cindy H Nakatsu; Jayson D Wilbur; Allan Konopka
Journal: Microb Ecol Date: 2006-02-10 Impact factor: 4.552

3. Studies in the Nomenclature and Classification of the Bacteria: II. The Primary Subdivisions of the Schizomycetes.

Authors: R E Buchanan
Journal: J Bacteriol Date: 1917-03 Impact factor: 3.490

4. tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence.

Authors: T M Lowe; S R Eddy
Journal: Nucleic Acids Res Date: 1997-03-01 Impact factor: 16.971

5. Global proteomic analysis of the chromate response in Arthrobacter sp. strain FB24.

Authors: Kristene L Henne; Joshua E Turse; Carrie D Nicora; Mary S Lipton; Sandra L Tollaksen; Carl Lindberg; Gyorgy Babnigg; Carol S Giometti; Cindy H Nakatsu; Dorothea K Thompson; Allan E Konopka
Journal: J Proteome Res Date: 2009-04 Impact factor: 4.466

6. The arthrobacter arilaitensis Re117 genome sequence reveals its genetic adaptation to the surface of cheese.

Authors: Christophe Monnet; Valentin Loux; Jean-François Gibrat; Eric Spinnler; Valérie Barbe; Benoit Vacherie; Frederick Gavory; Edith Gourbeyre; Patricia Siguier; Michaël Chandler; Rayda Elleuch; Françoise Irlinger; Tatiana Vallaeys
Journal: PLoS One Date: 2010-11-24 Impact factor: 3.240

7. Inhibition of nitrate reduction by chromium (VI) in anaerobic soil microcosms.

Authors: Peter S Kourtev; Cindy H Nakatsu; Allan Konopka
Journal: Appl Environ Microbiol Date: 2009-08-14 Impact factor: 4.792

8. The minimum information about a genome sequence (MIGS) specification.

Authors: Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal: Nat Biotechnol Date: 2008-05 Impact factor: 54.908

9. High-level chromate resistance in Arthrobacter sp. strain FB24 requires previously uncharacterized accessory genes.

Authors: Kristene L Henne; Cindy H Nakatsu; Dorothea K Thompson; Allan E Konopka
Journal: BMC Microbiol Date: 2009-09-16 Impact factor: 3.605

10. A memory-efficient dynamic programming algorithm for optimal alignment of a sequence to an RNA secondary structure.

Authors: Sean R Eddy
Journal: BMC Bioinformatics Date: 2002-07-02 Impact factor: 3.169

10 in total

1. Biodegradation of persistent environmental pollutants by Arthrobacter sp.

Authors: Xiaohong Guo; Chengyun Xie; Lijuan Wang; Qinfan Li; Yan Wang
Journal: Environ Sci Pollut Res Int Date: 2019-01-31 Impact factor: 4.223

2. Identification, functional characterization, and crystal structure determination of bacterial levoglucosan dehydrogenase.

Authors: Masayuki Sugiura; Moe Nakahara; Chihaya Yamada; Takatoshi Arakawa; Motomitsu Kitaoka; Shinya Fushinobu
Journal: J Biol Chem Date: 2018-09-17 Impact factor: 5.157

3. Comparative genome analysis reveals the molecular basis of nicotine degradation and survival capacities of Arthrobacter.

Authors: Yuxiang Yao; Hongzhi Tang; Fei Su; Ping Xu
Journal: Sci Rep Date: 2015-02-27 Impact factor: 4.379

4. New Genome Sequence of an Echinaceapurpurea Endophyte, Arthrobacter sp. Strain EpSL27, Able To Inhibit Human-Opportunistic Pathogens.

Authors: Elisangela Miceli; Luana Presta; Valentina Maggini; Marco Fondi; Emanuele Bosi; Carolina Chiellini; Camilla Fagorzi; Patrizia Bogani; Vincenzo Di Pilato; Gian Maria Rossolini; Alessio Mengoni; Fabio Firenzuoli; Elena Perrin; Renato Fani
Journal: Genome Announc Date: 2017-06-22

5. Draft genome sequence of Arthrobacter sp. strain B6 isolated from the high-arsenic sediments in Datong Basin, China.

Authors: Linghua Xu; Wanxia Shi; Xian-Chun Zeng; Ye Yang; Lingli Zhou; Yao Mu; Yichen Liu
Journal: Stand Genomic Sci Date: 2017-01-23

6. Microbial Communities of Meat and Meat Products: An Exploratory Analysis of the Product Quality and Safety at Selected Enterprises in South Africa.

Authors: Evelyn Madoroba; Kudakwashe Magwedere; Nyaradzo Stella Chaora; Itumeleng Matle; Farai Muchadeyi; Masenyabu Aletta Mathole; Rian Pierneef
Journal: Microorganisms Date: 2021-02-27

7. Sequencing and Comparative Genomic Analysis of a Highly Metal-Tolerant Penicillium janthinellum P1 Provide Insights Into Its Metal Tolerance.

Authors: Bin-Bin Chi; Ya-Nan Lu; Ping-Chuan Yin; Hong-Yan Liu; Hui-Ying Chen; Yang Shan
Journal: Front Microbiol Date: 2021-06-04 Impact factor: 5.640

8. Genomic and phenotypic insights into the ecology of Arthrobacter from Antarctic soils.

Authors: Melissa Dsouza; Michael W Taylor; Susan J Turner; Jackie Aislabie
Journal: BMC Genomics Date: 2015-02-05 Impact factor: 3.969

9. Characterisation of an efficient atrazine-degrading bacterium, Arthrobacter sp. ZXY-2: an attempt to lay the foundation for potential bioaugmentation applications.

Authors: Xinyue Zhao; Li Wang; Fang Ma; Jixian Yang
Journal: Biotechnol Biofuels Date: 2018-04-18 Impact factor: 6.040

10. Draft Genome Sequence of Arthrobacter oryzae TNBS02, a Bacterium Containing Heavy Metal Resistance Genes, Isolated from Soil of Antarctica.

Authors: Yong-Joon Cho; Ahnna Cho; Soon Gyu Hong; Han-Gu Choi; Ok-Sun Kim
Journal: Microbiol Resour Announc Date: 2019-01-24

10 in total