Literature DB >> 27617060

Complete genome sequence of the Antarctic Halorubrum lacusprofundi type strain ACAM 34.

Iain J Anderson1, Priya DasSarma2, Susan Lucas1, Alex Copeland1, Alla Lapidus1, Tijana Glavina Del Rio1, Hope Tice1, Eileen Dalin1, David C Bruce3, Lynne Goodwin3, Sam Pitluck1, David Sims3, Thomas S Brettin3, John C Detter3, Cliff S Han3, Frank Larimer4, Loren Hauser4, Miriam Land4, Natalia Ivanova1, Paul Richardson1, Ricardo Cavicchioli5, Shiladitya DasSarma2, Carl R Woese6, Nikos C Kyrpides1.   

Abstract

Halorubrum lacusprofundi is an extreme halophile within the archaeal phylum Euryarchaeota. The type strain ACAM 34 was isolated from Deep Lake, Antarctica. H. lacusprofundi is of phylogenetic interest because it is distantly related to the haloarchaea that have previously been sequenced. It is also of interest because of its psychrotolerance. We report here the complete genome sequence of H. lacusprofundi type strain ACAM 34 and its annotation. This genome is part of a 2006 Joint Genome Institute Community Sequencing Program project to sequence genomes of diverse Archaea.

Entities:  

Keywords:  Archaea; Cold adaptation; Extremophile; Halophile; Halorubrum; Tree of life

Year:  2016        PMID: 27617060      PMCID: PMC5018182          DOI: 10.1186/s40793-016-0194-2

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

is an extremely halophilic archaeon belonging to the class within the phylum . The species is represented by the type strain, ACAM 34 (= DSM 5036 = ATCC 49239 = JCM 8891), and a second strain, ACAM 32, both of which were isolated from Deep Lake, Antarctica [1]. This organism was first described as but was later transferred to the genus [2]. Members of the genus have been found not only in Antarctica, but also in Africa [3], Asia [4], and North America [5], where they are usually found in saline lakes or salterns. Most members of the genus are neutrophiles, but some are haloalkaliphiles [6, 7]. (Fig. 1) was proposed for sequencing as part of a 2006 Joint Genome Institute Community Sequencing Program project because of its ability to grow at low temperature and its phylogenetic distance from other halophiles with sequenced genomes (Fig. 2).
Fig. 1

Photomicrograph of H. lacusprofundi type strain ACAM 34 cells. The cells were grown in Franzmann et al. [1] medium. The image was taken using a phase microscope (Nikon Labphot) with 1000× magnification. The scale bar represents 10 μm

Fig. 2

Phylogenetic tree of DNA-directed RNA polymerase subunit A’ of select haloarchaea. Sequence alignment and tree construction were carried out with Clustal W [39]. The tree was visualized with njplot [40]. Positions with gaps were excluded during tree construction. Methanosarcina acetivorans was used as the outgroup. The numbers indicate bootstrap values based on 1000 replicates

Photomicrograph of H. lacusprofundi type strain ACAM 34 cells. The cells were grown in Franzmann et al. [1] medium. The image was taken using a phase microscope (Nikon Labphot) with 1000× magnification. The scale bar represents 10 μm Phylogenetic tree of DNA-directed RNA polymerase subunit A’ of select haloarchaea. Sequence alignment and tree construction were carried out with Clustal W [39]. The tree was visualized with njplot [40]. Positions with gaps were excluded during tree construction. Methanosarcina acetivorans was used as the outgroup. The numbers indicate bootstrap values based on 1000 replicates

Organism information

Classification and features

ACAM 34 was isolated from a water-sediment sample from Deep Lake, Antarctica [1]. The water-sediment sample was incubated in the light at 18 °C, and after 3 months developed a reddish color. was isolated from the sample by streaking on Deep Lake vitamin agar, which was composed of Lake Deep water with 1 g/L yeast extract, 15 g/L agar, and vitamin solution. The physiological characteristics of were described as follows [1]. Cells were pleomorphic. Motility was not observed, and no flagella were present. Cells grew at a temperature range of −1 °C to 40 °C with an optimal growth temperature of 36 °C [8]. Growth was observed at NaCl concentration of 1.5 M to 4.5 M with an optimum salt concentration of 3.5 M. Cells lysed in distilled water. The optimum magnesium concentration for growth was 0.1 M. No growth was observed at magnesium concentrations of 0 M or 1.0 M. Ammonium could not be used as a nitrogen source; complex media such as yeast extract or peptone was required. Growth was stimulated by addition of glucose, galactose, mannose, ribose, lactose, glycerol, succinate, lactate, formate, acetate, propionate, and ethanol. Growth was not stimulated by addition of glycine. Acid was not produced from sugars.

Genome sequencing information

Genome project history

was selected for sequencing based upon its phylogenetic position relative to other haloarchaea and its cold tolerance (Table 1). It is part of a 2006 Joint Genome Institute Community Sequencing Program project that included six diverse archaeal genomes. Sequencing was done at the JGI Production Genomics Facility. Finishing was done at Los Alamos National Laboratory. Annotation was done at Oak Ridge National Laboratory and JGI. The complete genome sequence was finished in September, 2008 and was released to the public in GenBank in February, 2009. A summary of the project information is shown in Table 2.
Table 1

Classification and general features of Halorubrum lacusprofundi ACAM 34T [31]

MIGS IDPropertyTermEvidence codea
ClassificationDomain Archaea TAS [32]
Phylum Euryarchaeota TAS [33, 34]
Class Halobacteria TAS [35]
Order Halobacteriales TAS [36]
Family Halobacteriaceae TAS [37]
Genus Halorubrum TAS [3]
Species Halorubrum lacusprofundi TAS [1]
Gram stainUnknown
Cell shapePleomorphicTAS [1]
MotilityNon-motileTAS [1]
SporulationNonsporulatingNAS
Temperature range−1–40 °CTAS [1]
Optimum temperature36 °CTAS [1]
pH range, optimumUnknown
Carbon sourceSugars, organic acids, ethanolTAS [1]
MIGS-6HabitatSaline lakeTAS [1]
MIGS-6.3Salinity10–25 % NaClTAS [1]
MIGS-22Oxygen requirementAerobicTAS [1]
MIGS-15Biotic relationshipFree-livingTAS [1]
MIGS-14PathogenicityNon-pathogenNAS
MIGS-4Geographic locationDeep Lake, AntarcticaTAS [1]
MIGS-5Sample collection1988TAS [1]
MIGS-4.1 MIGS-4.2Latitude-LongitudeUnknown
MIGS-4.4AltitudeUnknown

aEvidence codes–IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [38]

Table 2

Project information

MIGS IDPropertyTerm
MIGS-31Finishing qualityFinished
MIGS-28Libraries Used3 kb, 8 kb, and fosmid DNA
MIGS-29Sequencing platformsABI3730
MIGS-31.2Fold coverage12.5×
Sequencing qualityLess than one error per 50 kb
MIGS-30AssemblersPhrap
MIGS-32Gene calling methodCRITICA, GLIMMER, GenePRIMP
Locus tagHlac
GenBank IDsCP001365, CP001366, CP001367
GenBank date of releaseFebruary 4, 2009
GOLD IDGc00952
BIOPROJECTPRJNA18455
NCBI project ID18455
IMG Taxon ID643692025
MIGS-13Source material identifierATCC 49239, DSM 5036
Project relevanceTree of Life, cold adaptation
Classification and general features of Halorubrum lacusprofundi ACAM 34T [31] aEvidence codes–IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [38] Project information

Growth conditions and genomic DNA preparation

ATCC 49239 was grown in Franzmann medium (180 g NaCl, 75 g MgCl2 · 6H2O, 7.4 g MgSO4 · 7H2O, 7.4 g KCl, 1 g CaCl2 · 2H2O, 10 g C4H4O4Na2 · 6H2O per liter, pH 7.4 with addition of 10 ml vitamin solution) [1]. The vitamin solution contained 0.1 g biotin, 0.1 g cyanocobalamin, and 0.1 g thiamine HCl per liter. Cells were grown with shaking at 220 rpm at 4 °C with illumination. The DNA extraction method was modified from [9]. Cells were grown to OD600 = 0.8, collected by centrifugation at 8000 rpm for 10 min at 4 °C, resuspended in 1/20 volume basal salts and lysed by addition of 2 volumes of deionized water and mixing at room temperature. Next, proteinase K was added to a final concentration of 100 μg/ml, mixed gently, and incubated for 1 h at 37 °C. The lysate was extracted using an equal volume of phenol, mixed gently by inverting at room temperature for 5 min, and then spinning at 8000 g for 15 min at 4 °C. The aqueous and interphase was collected and the phenol extraction was repeated twice more. The aqueous and interphase were then dialyzed against TE overnight at 4 °C with one change of buffer. The dialyzed solution was collected and RNase A was added to a final concentration of 50 μg/ml, the solution was mixed and incubated for 2 h at 37 °C with gentle shaking. Proteinase K was added to a final concentration of 100 μg/ml, mixed and incubated for an additional hour at 37 °C. The RNase A and proteinase K steps were repeated. The DNA was then dialyzed overnight against TE at 4 °C with one buffer change.

Genome sequencing and assembly

The genome of was sequenced at the Joint Genome Institute using a combination of 3 kb, 8 kb, and fosmid DNA libraries. All general aspects of library construction and sequencing were performed at the JGI [10]. Draft assemblies were based on 40,800 total reads. All libraries provided 12.5× coverage. The Phred/Phrap/Consed software package was used for sequence assembly and quality assessment [11-13]. After the shotgun stage, reads were assembled with parallel phrap (High Performance Software, LLC). Possible mis-assemblies were corrected with Dupfinisher [14] or transposon bombing of bridging clones (Epicentre Biotechnologies, Madison, WI). Gaps between contigs were closed by editing in Consed, custom primer walk or PCR amplification (Roche Applied Science, Indianapolis, IN). A total of 1722 additional reactions were necessary to close gaps and to raise the quality of the finished sequence. The completed genome sequence of contains 54,250 reads, achieving an average of 11.8× and 13.8× coverage in the chromosomes per base with an error rate of less than 1 in 50,000 bp.

Genome annotation

Protein-coding genes were identified using a combination of CRITICA [15] and Glimmer [16] followed by a round of manual curation using the JGI GenePRIMP pipeline [17]. GenePRIMP points out cases where gene start sites may be incorrect based on alignment with homologous proteins. It also highlights genes that appear to be broken into two or more pieces, due to a premature stop codon or frameshift, and genes that are disrupted by transposable elements. All of these types of broken and interrupted genes are labeled as pseudogenes. Genes that may have been missed by the gene calling programs are also identified in intergenic regions. The predicted CDSs were translated and used to search the National Center for Biotechnology Information nonredundant database, UniProt, TIGRFam, Pfam, PRIAM, KEGG, COG, and Interpro databases. Signal peptides were identified with SignalP [18], and transmembrane helices were determined with TMHMM [19]. CRISPR elements were identified with the CRISPR Recognition Tool [20]. Paralogs are hits of a protein against another protein within the same genome with an e-value of 10−2 or lower. The tRNAScanSE tool [21] was used to find tRNA genes. Additional gene prediction analysis and manual functional annotation was performed within the Integrated Microbial Genomes Expert Review (IMG-ER) [22] and HaloWeb [23] platform.

Genome properties

The genome of consists of two chromosomes of length 2,735,295 bp (Chromosome 1) and 525,943 bp (Chromosome 2 or pHL500) and one plasmid of length 431,338 bp (pHL400) (Table 3). The map of the genome is available on HaloWeb [24]. Partial sequence was obtained from a second smaller plasmid, but it appeared to be present in a minority of the cells and its complete sequence could not be determined. The GC content of the large chromosome (67 %) is larger than those of the small chromosome (57 %) and the plasmid (55 %). There are 2801 genes on the large chromosome, 522 genes on the smaller chromosome, and 402 genes on the plasmid. Two of the ribosomal RNA operons are on the large chromosome and one is found on the smaller chromosome. The properties and statistics of the genome are summarized in Table 4, and genes belonging to COG functional categories are listed in Table 5.
Table 3

Summary of genome: two chromosomes and one plasmid

LabelSize (Mb)TopologyINSDC identifierRefSeq ID
Chromosome 12.74circularCP001365.1NC012029.1
Chromosome 2 (pHL500)0.53circularCP001366.1NC012028.1
Plasmid (pHL400)0.43circularCP001367.1NC012030.1
Table 4

Genome statistics

AttributeValue% of Total
Genome size (bp)3,692,576100.00 %
DNA coding (bp)3,199,41786.64 %
DNA G + C (bp)2,362,21463.97 %
DNA scaffolds3
Number of replicons3
Extrachromosomal elements1
Total genes3725100.00 %
Protein coding genes366598.39 %
RNA genes601.61 %
Pseudo genes1052.82 %
Genes in internal clusters200953.93 %
Genes with function prediction214357.53 %
Genes assigned to COGs222659.76 %
Genes with Pfam domains216258.04 %
Genes with signal peptides39610.63 %
Genes with transmembrane helices77920.91 %
CRISPR repeats3
Table 5

Numbers of genes associated with the 25 general COG functional categories

CodeValue% agea Description
J1594.34Translation, ribosomal structure and biogenesis
A00.00RNA processing and modification
K1363.71Transcription
L2266.17Replication, recombination and repair
B40.11Chromatin structure and dynamics
D270.74Cell cycle control, Cell division, chromosome partitioning
V270.74Defense mechanisms
T1042.84Signal transduction mechanisms
M681.86Cell wall/membrane biogenesis
N280.76Cell motility
U300.82Intracellular trafficking and secretion
O1113.03Posttranslational modification, protein turnover, chaperones
C1564.26Energy production and conversion
G1133.08Carbohydrate transport and metabolism
E2276.19Amino acid transport and metabolism
F731.99Nucleotide transport and metabolism
H1223.33Coenzyme transport and metabolism
I621.69Lipid transport and metabolism
P1463.98Inorganic ion transport and metabolism
Q330.90Secondary metabolites biosynthesis, transport and catabolism
Z00.00Cytoskeleton
W00.00Extracellular structures
Y00.00Nuclear structure
R3629.88General function prediction only
S2145.84Function unknown
-143939.26Not in COGs

aThe total is based on the total number of protein coding genes in the annotated genome

Summary of genome: two chromosomes and one plasmid Genome statistics Numbers of genes associated with the 25 general COG functional categories aThe total is based on the total number of protein coding genes in the annotated genome

Conclusions

The genome sequence is the first established from a cold-adapted haloarchaeon. The genome has features typical of halophilic Archaea, including high G + C-content, large extrachromosomal replicons, and eukaryotic-like DNA replication and transcription genes. Encoded proteins are highly acidic with properties that suggest looser packing and greater flexibility important for function at cold temperatures [25-28]. co-exists in a community of three major haloarchaea in Deep Lake, Antarctica [29, 30].
  28 in total

1.  Predicting transmembrane protein topology with a hidden Markov model: application to complete genomes.

Authors:  A Krogh; B Larsson; G von Heijne; E L Sonnhammer
Journal:  J Mol Biol       Date:  2001-01-19       Impact factor: 5.469

2.  Validation of publication of new names and new combinations previously effectively published outside the IJSEM. International Journal of Systematic and Evolutionary Microbiology.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2002-05       Impact factor: 2.747

3.  GenePRIMP: a gene prediction improvement pipeline for prokaryotic genomes.

Authors:  Amrita Pati; Natalia N Ivanova; Natalia Mikhailova; Galina Ovchinnikova; Sean D Hooper; Athanasios Lykidis; Nikos C Kyrpides
Journal:  Nat Methods       Date:  2010-05-02       Impact factor: 28.547

4.  Locating proteins in the cell using TargetP, SignalP and related tools.

Authors:  Olof Emanuelsson; Søren Brunak; Gunnar von Heijne; Henrik Nielsen
Journal:  Nat Protoc       Date:  2007       Impact factor: 13.491

5.  Consed: a graphical tool for sequence finishing.

Authors:  D Gordon; C Abajian; P Green
Journal:  Genome Res       Date:  1998-03       Impact factor: 9.043

6.  WWW-query: an on-line retrieval system for biological sequence banks.

Authors:  G Perrière; M Gouy
Journal:  Biochimie       Date:  1996       Impact factor: 4.079

Review 7.  Halophiles and their enzymes: negativity put to good use.

Authors:  Shiladitya DasSarma; Priya DasSarma
Journal:  Curr Opin Microbiol       Date:  2015-06-09       Impact factor: 7.934

8.  Halorubrum ezzemoulense sp. nov., a halophilic archaeon isolated from Ezzemoul sabkha, Algeria.

Authors:  Karima Kharroub; Teresa Quesada; Raquel Ferrer; Susana Fuentes; Margarita Aguilera; Abdrahmane Boulahrouf; Alberto Ramos-Cormenzana; Mercedes Monteoliva-Sánchez
Journal:  Int J Syst Evol Microbiol       Date:  2006-07       Impact factor: 2.747

9.  Amino acid substitutions in cold-adapted proteins from Halorubrum lacusprofundi, an extremely halophilic microbe from antarctica.

Authors:  Shiladitya Dassarma; Melinda D Capes; Ram Karan; Priya Dassarma
Journal:  PLoS One       Date:  2013-03-11       Impact factor: 3.240

10.  CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats.

Authors:  Charles Bland; Teresa L Ramsey; Fareedah Sabree; Micheal Lowe; Kyndall Brown; Nikos C Kyrpides; Philip Hugenholtz
Journal:  BMC Bioinformatics       Date:  2007-06-18       Impact factor: 3.169

View more
  6 in total

1.  An Antarctic Extreme Halophile and Its Polyextremophilic Enzyme: Effects of Perchlorate Salts.

Authors:  Victoria J Laye; Shiladitya DasSarma
Journal:  Astrobiology       Date:  2017-11-30       Impact factor: 4.335

Review 2.  Extremophilic models for astrobiology: haloarchaeal survival strategies and pigments for remote sensing.

Authors:  Shiladitya DasSarma; Priya DasSarma; Victoria J Laye; Edward W Schwieterman
Journal:  Extremophiles       Date:  2019-08-28       Impact factor: 2.395

3.  Double mutations far from the active site affect cold activity in an Antarctic halophilic β-galactosidase.

Authors:  Victoria J Laye; Shiladitya DasSarma
Journal:  Protein Sci       Date:  2022-01-05       Impact factor: 6.725

4.  Minority report: small-scale metagenomic analysis of the non-bacterial kitchen sponge microbiota.

Authors:  Lena Brandau; Susanne Jacksch; Severin Weis; Sylvia Schnell; Markus Egert
Journal:  Arch Microbiol       Date:  2022-06-04       Impact factor: 2.667

5.  Genome Sequence of Salarchaeum sp. Strain JOR-1, an Extremely Halophilic Archaeon from the Dead Sea.

Authors:  Brian P Anton; Priya DasSarma; Fabiana L Martinez; Satyajit L DasSarma; Mohammad Al Madadha; Richard J Roberts; Shiladitya DasSarma
Journal:  Microbiol Resour Announc       Date:  2020-01-30

Review 6.  Hypersaline environments as natural sources of microbes with potential applications in biotechnology: The case of solar evaporation systems to produce salt in Alicante County (Spain).

Authors:  Guillermo Martínez Martínez; Carmen Pire; Rosa María Martínez-Espinosa
Journal:  Curr Res Microb Sci       Date:  2022-04-26
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.