Literature DB >> 24019991

Genome of the marine alphaproteobacterium Hoeflea phototrophica type strain (DFL-43(T)).

Anne Fiebig1, Silke Pradella, Jörn Petersen, Victoria Michael, Orsola Päuker, Manfred Rohde, Markus Göker, Hans-Peter Klenk, Irene Wagner-Döbler.   

Abstract

Hoeflea phototrophica Biebl et al. 2006 is a member of the family Phyllobacteriaceae in the order Rhizobiales, which is thus far only partially characterized at the genome level. This marine bacterium contains the photosynthesis reaction-center genes pufL and pufM and is of interest because it lives in close association with toxic dinoflagellates such as Prorocentrum lima. The 4,467,792 bp genome (permanent draft sequence) with its 4,296 protein-coding and 69 RNA genes is a part of the Marine Microbial Initiative.

Entities:  

Keywords:  Phenotype MicroArray; Phyllobacteriaceae; Prorocentrum lima; aerobic; bacteriochlorophyll a; dinoflagellates; motile; photoheterotroph; rod-shaped; symbiosis

Year:  2013        PMID: 24019991      PMCID: PMC3764936          DOI: 10.4056/sigs.3486982

Source DB:  PubMed          Journal:  Stand Genomic Sci        ISSN: 1944-3277


Introduction

Strain DFL-43T (= DSM 17068 = NCIMB 14078) is the type strain of , a marine member of the () [1]. The genus, which was named in honor of the German microbiologist Manfred Höfle [2], contains four species, with as type species [2]; the name of a fifth member of the genus, ' is until now only effectively published [3]. DFL-43T and strain DFL-44 were found in the course of a screening program for marine bacteria containing the photosynthesis reaction-center genes pufL and pufM [4]. The species epithet 'phototrophica' refers to the likely ability of strains to use light as an additional energy source [1]. Strain DFL-43T was isolated from single cells of a culture of the toxic dinoflagellate Prorocentrum lima maintained at the Biological Research Institute of Helgoland, Germany [1]. Here we present a summary classification and a set of features for DFL-43T including so far undiscovered aspects of its phenotype, together with the description of the complete genomic sequencing and annotation. This work is part of the Marine Microbial Initiative (MMI) which enabled the J. Craig Venter Institute (JCVI) to sequence the genomes of approximately 165 marine microbes with funding from the Gordon and Betty Moore Foundation. These microbes were contributed by collaborators worldwide, and represent an array of physiological diversity, including carbon fixers, photoautotrophs, photoheterotrophs, nitrifiers, and methanotrophs. The MMI was designed to complement other ongoing research at JCVI and elsewhere to characterize the microbial biodiversity of marine and terrestrial environments through metagenomic profiling of environmental samples.

Classification and features

16S rRNA analysis

A representative genomic 16S rRNA sequence of DFL-43T was compared using NCBI BLAST [5,6] under default settings (e.g., considering only the high-scoring segment pairs (HSPs) from the best 250 hits) with the most recent release of the Greengenes database [7] and the relative frequencies of taxa and keywords (reduced to their stem [8]) were determined, weighted by BLAST scores. The most frequently occurring genera were (53.7%), (24.0%), (4.5%), (4.5%) and (3.7%) (132 hits in total). Regarding the two hits to sequences from members of the species, both, the average identity within HSPs and the average coverage by HSPs were 100.0%. Regarding the single hit to sequences from other members of the genus, the average identity within HSPs was 98.2%, whereas the average coverage by HSPs was 100.0%. Among all other species, the one yielding the highest score was (AY598817), which corresponded to an identity of 98.2% and an HSP coverage of 100.0%. (Note that the Greengenes database uses the INSDC (= EMBL/NCBI/DDBJ) annotation, which is not an authoritative source for nomenclature or classification.) The highest-scoring environmental sequence was AY922224 (Greengenes short name 'whalefall clone 131720'), which showed an identity of 98.1% and an HSP coverage of 97.5%. The most frequently occurring keywords within the labels of all environmental samples which yielded hits were 'bee' (3.1%), 'singl' (3.0%), 'abdomen, bumbl, distinct, honei, microbiota, simpl' (2.9%), 'microbi' (2.8%) and 'structur' (1.8%) (118 hits in total). Environmental samples which yielded hits of a higher score than the highest scoring species were not found, indicating that is rarely found in environmental samples. Figure 1 shows the phylogenetic neighborhood of in a 16S rRNA based tree. The sequences of the two identical 16S rRNA gene copies in the genome differ by one nucleotide from the previously published 16S rRNA sequence (AJ582088)
Figure 1

Phylogenetic tree highlighting the position of relative to the type strains of the other species within the family . The tree was inferred from 1,362 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [11]. Rooting was done initially using the midpoint method [12] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [13] (left) and from 1,000 maximum-parsimony bootstrap replicates [14] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [15] are labeled with one asterisk, those also listed as 'Complete and Published' (CP002279 for ) with two asterisks.

Phylogenetic tree highlighting the position of relative to the type strains of the other species within the family . The tree was inferred from 1,362 aligned characters [9,10] of the 16S rRNA gene sequence under the maximum likelihood (ML) criterion [11]. Rooting was done initially using the midpoint method [12] and then checked for its agreement with the current classification (Table 1). The branches are scaled in terms of the expected number of substitutions per site. Numbers adjacent to the branches are support values from 1,000 ML bootstrap replicates [13] (left) and from 1,000 maximum-parsimony bootstrap replicates [14] (right) if larger than 60%. Lineages with type strain genome sequencing projects registered in GOLD [15] are labeled with one asterisk, those also listed as 'Complete and Published' (CP002279 for ) with two asterisks.
Table 1

Classification and general features of DFL-43T according to the MIGS recommendations [16].

MIGS ID    Property    Term    Evidence code
    Current classification    Domain Bacteria    TAS [17]
    Phylum Proteobacteria    TAS [18]
    Class Alphaproteobacteria    TAS [19,20]
    Order Rhizobiales    TAS [20,21]
    Family Phyllobacteriaceae    TAS [20,22]
    Genus Hoeflea    TAS [2]
    Species Hoeflea phototrophica    TAS [1]
MIGS-7    Subspecific genetic lineage (strain)    DFL-43T    TAS [1]
MIGS-12    Reference for biomaterial    Biebl et al. 2006    TAS [1]
    Gram stain    Gram-negative    TAS [1]
    Cell shape    rod-shaped    TAS [1]
    Motility    motile    TAS [1]
    Sporulation    not reported
    Temperature range    mesophile, 25-33°C    TAS [1]
    Optimum temperature    31°C    TAS [1]
    Salinity    0.5–7.0 % NaCl    TAS [1]
MIGS-22    Relationship to oxygen    aerobe    TAS [1]
    Carbon source    acetate, malate    TAS [1]
    Energy metabolism    photoheterotroph    TAS [1]
MIGS-6    Habitat    marine    TAS [1]
MIGS-6.2    pH    6–9.0    TAS [1]
MIGS-15    Biotic relationship    host-associated    TAS [1]
MIGS-14    Known pathogenicity    none    TAS [1]
MIGS-16    Specific host    Prorocentrum lima ME130    TAS [1]
MIGS-18    Health status of Host    not reported
    Biosafety level    1    TAS [23]
MIGS-19    Trophic level    not reported
MIGS-23.1    Isolation    from a culture of    Prorocentrum lima ME130    TAS [1]
MIGS-4    Geographic location    not reported
MIGS-5    Time of sample collection    April 1, 2002    TAS [1]
MIGS-4.1    Latitude    54.133    TAS [1]
MIGS-4.2    Longitude    7.867    TAS [1]
MIGS-4.3    Depth    not reported
MIGS-4.4    Altitude    not reported

Evidence codes TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [24].

Evidence codes TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). Evidence codes are from the Gene Ontology project [24].

Morphology and physiology

Cells of are small rods of 0.3–0.5 μm in width and 0.7–2.0 μm length [1] (Figure 2) and motile by means of single, polar flagellum [1] (not visible in Figure 2). Depending on the availability of light, colonies are opaque to beige (grown in the dark) on marine agar 2216 [1]. The cultures are strictly aerobic and prefer microaerobic conditions. Good growth was detectable within a range of 25-33°C (1/5 limited growth rate below this value), concentration of sea salt from 0.5-7.0% and pH values from 6.0-9.0 [1]. Acetate and malate were accepted as carbon sources, whereas ethanol and methanol were not used for growth [1]. No hydrolysis of gelatin, starch, alginate or Tween 8 was observed [1].
Figure 2

Scanning electron micrograph of DFL-43T

Scanning electron micrograph of DFL-43T The utilization of carbon compounds by DFL-43T was also determined for this study using PM01 microplates in an OmniLog phenotyping device (BIOLOG Inc., Hayward, CA, USA). The microplates were inoculated at 28°C with a cell suspension at a cell density of 85% Turbidity and dye D. Further additives were artificial sea salts, vitamins, trace elements and NaHCO3. The exported measurement data were further analyzed with the opm package for R [25], using its functionality for statistically estimating parameters from the respiration curves such as the maximum height, and automatically translating these values into negative, ambiguous, and positive reactions. The strain was studied in two independent biological replicates, and reactions with a different behavior between the two repetitions were regarded as ambiguous and are not listed below. DFL-43T was positive for D,L-malic acid, D-cellobiose, D-fructose, D-galactonic acid-γ-lactone, D-galactose, D-galacturonic acid, D-gluconic acid, D-glucuronic acid, D-malic acid, D-mannitol, D-melibiose, D-sorbitol, D-trehalose, D-xylose, L-alanine, L-arabinose, L-glutamic acid, L-glutamine, L-lactic acid, L-lyxose, L-malic acid, L-proline, L-serine, acetic acid, adonitol, α-D-glucose, α-keto-glutaric acid, α-methyl-D-galactoside, β-methyl-D-glucoside, bromo-succinic acid, citric acid, ethanolamine, fumaric acid, m-inositol, maltose, maltotriose, mono-methyl succinate, propionic acid, pyruvic acid, succinic acid, sucrose and uridine. The strain was negative for 1,2-propanediol, 2'-deoxy-adenosine, D,L-α-glycerol-phosphate, D-alanine, D-aspartic acid, D-fructose-6-phosphate, D-glucosaminic acid, D-glucose-1-phosphate, D-glucose-6-phosphate, D-mannose, D-psicose, D-serine, D-threonine, L-alanyl-glycine, L-aspartic acid, L-fucose, L-galactonic acid-γ-lactone, L-rhamnose, L-threonine, N-acetyl-D-glucosamine, N-acetyl-β-D-mannosamine, acetoacetic acid, adenosine, α-D-lactose, α-hydroxy-butyric acid, α-hydroxy-glutaric acid-γ-lactone, α-keto-butyric acid, β-phenylethylamine, dulcitol, glycolic acid, glycyl-L-aspartic acid, glyoxylic acid, inosine, m-hydroxy-phenylacetic acid, m-tartaric acid, mucic acid, thymidine, tricarballylic acid, tween 40, tween 80 and tyramine.

Chemotaxonomy

Phosphatidylglycerol, phosphatidylethanolamine and phosphatidylmonomethylethanolamine were the predominant polar lipids of the membrane. The most frequent cellular fatty acids in strain DFL-43T are the mono-unsaturated straight chain acids C18:1 ω7 (62.8%) and its methylated form C18:1 ω7 11Me (21%), followed by C16:0 (6.3%) and C19:1 (3.4%) [1]. The absorption spectrum of an acetone/methanol extract showed the presence of bacteriochlorophyll a and an additional carotenoid (possibly spheroidenone) in small amounts [1]. Further experiments indicated that the pigment production depends on the concentration of sea salts in the medium [1].

Genome sequencing and annotation

Genome project history

The genome was sequenced within the MMI supported by the Gordon and Betty Moore Foundation. Initial Sequencing was performed by the J. Craig Venter Institute, JCVI (Rockville, MD, USA), and a high-quality draft sequence was deposited at INSDC. The number of scaffolds and contigs was reduced and the assembly improved by a subsequent round of manual gap closure at HZI/DSMZ. A summary of the project information is shown in Table 2.
Table 2

Genome sequencing project information

MIGS ID     Property    Term
MIGS-31     Finishing quality    High quality draft
MIGS-28     Libraries used    Two genomic libraries: 40 kb fosmid library and 3 kb pUC18 plasmid library
MIGS-29     Sequencing platform    ABI3730
MIGS-31.2     Sequencing coverage    10.3 × Sanger
MIGS-30     Assemblers    Consed 20.0
MIGS-31.3     Contig count    5
MIGS-32     Gene calling method    Prodigal 2.0, Infernal 1.0.2
     INSDC ID    Final ID pending; previous version ABIA00000000
     Genbank Date of Release    final version not yet available
     GOLD ID    Gi01415
     NCBI project ID    19311
     Database: IMG    2509276008
MIGS-13     Source Material Identifier    DSM 17068
     Project relevance    Environmental, Marine Microbial Initiative

Growth conditions and DNA extractions

Cells of strain DFL-43T were grown for two to three days on a LB & sea-salt agar plate, containing (l-1) 10 g tryptone, 5 g yeast extract, 10 g NaCl, 17 g sea salt (Sigma-Aldrich S9883) and 15 g agar. A single colony was used to inoculate LB & sea-salt liquid medium and the culture was incubated at 28°C on a shaking platform. The genomic DNA was isolated using the Qiagen Genomic 500 DNA Kit (Qiagen 10262) as indicated by the manufacturer. DNA quality and quantity were in accordance with the instructions of the genome sequencing center. DNA is available through the DNA Bank Network [26].

Genome sequencing and assembly

The genome was sequenced with the Sanger technology using a combination of two libraries. All general aspects of library construction and sequencing performed at the JCVI can be found on the JCVI website. Base calling of the sequences were performed with the phredPhrap script using default settings. The reads were assembled and assemblies analyzed using the phred/phrap/consed pipeline [27]. The last gaps were closed by adding new reads produced by recombinant PCR and PCR primer walks. In total 21 Sanger reads were required for gap closure and improvement of low quality regions. The final consensus sequence was built from 46,086 Sanger reads (10.3 × coverage).

Genome annotation

Gene prediction was carried out using GeneMark as part of the genome annotation pipeline in the Integrated Microbial Genomes Expert Review (IMG-ER) system [28]. To identify coding genes, Prodigal [29] was used, while ribosomal RNA genes within the genome were identified using RNAmmer [30]. Other non-coding genes were predicted using Infernal [31]. Manual functional annotation was performed within the IMG platform [28] and the Artemis Genome Browser [32].

Genome properties

The draft genome consists of one circular scaffold with a total length of 4,467,822 bp containing five large contigs with a total length of 4,467,792 bp and a G+C content of 59.8%. Contig lengths vary from 133,683 bp to 2,215,172 bp (Figure 3); genome statistics are provided in Table 3. Of the 4,296 genes predicted, 4,227 were protein-coding genes, and 69 RNAs; pseudogenes were not identified. The majority of the protein-coding genes (83.1%) were assigned a putative function while the remaining ones were annotated as hypothetical proteins. The distribution of genes into COGs functional categories is presented in Table 4.
Figure 3

Graphical map of the chromosome. From outside to the centerp: Genes on forward strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.

Table 3

Genome Statistics

     Attribute     Value     % of Total
     Genome size (bp)     4,467,832     100.00
     DNA coding region (bp)     4,006,040     89.66
     DNA G+C content (bp)     2,671,973     59.81
     Number of replicons     1
     Extrachromosomal elements     0
     Total genes     4,296     100.00
     RNA genes     69     1.61
     rRNA operons     2
     tRNA genes     47     1.09
     Protein-coding genes     4,227     98.39
     Pseudo genes     0
     Genes with function prediction     3,574     83.19
     Genes in paralog clusters     1,423     33.12
     Genes assigned to COGs     3,525     82.05
     Genes assigned Pfam domains     3,580     83.33
     Genes with signal peptides     927     21.58
     Genes with transmembrane helices     994     24.57
     CRISPR repeats     0
Table 4

Number of genes associated with the general COG functional categories

Code    Value    %age     Description
J    178    4.58     Translation, ribosomal structure and biogenesis
A    0    0.00     RNA processing and modification
K    274    7.05     Transcription
L    162    4.17     Replication, recombination and repair
B    2    0.05     Chromatin structure and dynamics
D    27    0.69     Cell cycle control, cell division, chromosome partitioning
Y    -    -     Nuclear structure
V    39    1.00     Defense mechanisms
T    175    4.50     Signal transduction mechanisms
M    205    5.27     Cell wall/membrane/envelope biogenesis
N    60    1.54     Cell motility
Z    0    0.00     Cytoskeleton
W    -    -     Extracellular structures
U    66    1.70     Intracellular trafficking, secretion, and vesicular transport
O    135    3.47     Posttranslational modification, protein turnover, chaperones
C    226    5.81     Energy production and conversion
G    325    8.36     Carbohydrate transport and metabolism
E    405    10.41     Amino acid transport and metabolism
F    80    2.06     Nucleotide transport and metabolism
H    157    4.04     Coenzyme transport and metabolism
I    188    4.83     Lipid transport and metabolism
P    178    4.58     Inorganic ion transport and metabolism
Q    130    3.34     Secondary metabolites biosynthesis, transport and catabolism
R    524    13.47     General function prediction only
S    353    9.08     Function unknown
-    773    18.00     Not in COGs
Graphical map of the chromosome. From outside to the centerp: Genes on forward strand (color by COG categories), RNA genes (tRNAs green, rRNAs red, other RNAs black), GC content, GC skew.
  21 in total

1.  Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis.

Authors:  J Castresana
Journal:  Mol Biol Evol       Date:  2000-04       Impact factor: 16.240

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  [Hoeflea siderophila sp. Nov., new neutrophilic iron-oxidizing bacteria].

Authors:  A Iu Sorokina; E Iu Chernousova; G A Dubinina
Journal:  Mikrobiologiia       Date:  2012 Jan-Feb

4.  Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB.

Authors:  T Z DeSantis; P Hugenholtz; N Larsen; M Rojas; E L Brodie; K Keller; T Huber; D Dalevi; P Hu; G L Andersen
Journal:  Appl Environ Microbiol       Date:  2006-07       Impact factor: 4.792

5.  List of new names and new combinations previously effectively, but not validly, published.

Authors: 
Journal:  Int J Syst Evol Microbiol       Date:  2006-01       Impact factor: 2.747

6.  The DNA bank network: the start from a german initiative.

Authors:  Birgit Gemeinholzer; Gabriele Dröge; Holger Zetzsche; Gerhard Haszprunar; Hans-Peter Klenk; Anton Güntsch; Walter G Berendsohn; Johann-Wolfgang Wägele
Journal:  Biopreserv Biobank       Date:  2011-03       Impact factor: 2.300

7.  Prodigal: prokaryotic gene recognition and translation initiation site identification.

Authors:  Doug Hyatt; Gwo-Liang Chen; Philip F Locascio; Miriam L Land; Frank W Larimer; Loren J Hauser
Journal:  BMC Bioinformatics       Date:  2010-03-08       Impact factor: 3.169

8.  Genome organization and localization of the pufLM genes of the photosynthesis reaction center in phylogenetically diverse marine Alphaproteobacteria.

Authors:  Silke Pradella; Martin Allgaier; Christa Hoch; Orsola Päuker; Erko Stackebrandt; Irene Wagner-Döbler
Journal:  Appl Environ Microbiol       Date:  2004-06       Impact factor: 4.792

9.  The minimum information about a genome sequence (MIGS) specification.

Authors:  Dawn Field; George Garrity; Tanya Gray; Norman Morrison; Jeremy Selengut; Peter Sterk; Tatiana Tatusova; Nicholas Thomson; Michael J Allen; Samuel V Angiuoli; Michael Ashburner; Nelson Axelrod; Sandra Baldauf; Stuart Ballard; Jeffrey Boore; Guy Cochrane; James Cole; Peter Dawyndt; Paul De Vos; Claude DePamphilis; Robert Edwards; Nadeem Faruque; Robert Feldman; Jack Gilbert; Paul Gilna; Frank Oliver Glöckner; Philip Goldstein; Robert Guralnick; Dan Haft; David Hancock; Henning Hermjakob; Christiane Hertz-Fowler; Phil Hugenholtz; Ian Joint; Leonid Kagan; Matthew Kane; Jessie Kennedy; George Kowalchuk; Renzo Kottmann; Eugene Kolker; Saul Kravitz; Nikos Kyrpides; Jim Leebens-Mack; Suzanna E Lewis; Kelvin Li; Allyson L Lister; Phillip Lord; Natalia Maltsev; Victor Markowitz; Jennifer Martiny; Barbara Methe; Ilene Mizrachi; Richard Moxon; Karen Nelson; Julian Parkhill; Lita Proctor; Owen White; Susanna-Assunta Sansone; Andrew Spiers; Robert Stevens; Paul Swift; Chris Taylor; Yoshio Tateno; Adrian Tett; Sarah Turner; David Ussery; Bob Vaughan; Naomi Ward; Trish Whetzel; Ingio San Gil; Gareth Wilson; Anil Wipat
Journal:  Nat Biotechnol       Date:  2008-05       Impact factor: 54.908

10.  Visualization and curve-parameter estimation strategies for efficient exploration of phenotype microarray kinetics.

Authors:  Lea A I Vaas; Johannes Sikorski; Victoria Michael; Markus Göker; Hans-Peter Klenk
Journal:  PLoS One       Date:  2012-04-20       Impact factor: 3.240

View more
  5 in total

1.  A marine inducible prophage vB_CibM-P1 isolated from the aerobic anoxygenic phototrophic bacterium Citromicrobium bathyomarinum JL354.

Authors:  Qiang Zheng; Rui Zhang; Yongle Xu; Richard Allen White; Yu Wang; Tingwei Luo; Nianzhi Jiao
Journal:  Sci Rep       Date:  2014-11-19       Impact factor: 4.379

2.  Light Regimes Shape Utilization of Extracellular Organic C and N in a Cyanobacterial Biofilm.

Authors:  Rhona K Stuart; Xavier Mayali; Amy A Boaro; Adam Zemla; R Craig Everroad; Daniel Nilson; Peter K Weber; Mary Lipton; Brad M Bebout; Jennifer Pett-Ridge; Michael P Thelen
Journal:  MBio       Date:  2016-06-28       Impact factor: 7.867

3.  Fatal affairs - conjugational transfer of a dinoflagellate-killing plasmid between marine Rhodobacterales.

Authors:  Jürgen Tomasch; Victoria Ringel; Hui Wang; Heike M Freese; Pascal Bartling; Henner Brinkmann; John Vollmers; Michael Jarek; Irene Wagner-Döbler; Jörn Petersen
Journal:  Microb Genom       Date:  2022-03

4.  Phylogenetic Co-Occurrence of ExoR, ExoS, and ChvI, Components of the RSI Bacterial Invasion Switch, Suggests a Key Adaptive Mechanism Regulating the Transition between Free-Living and Host-Invading Phases in Rhizobiales.

Authors:  Mary Ellen Heavner; Wei-Gang Qiu; Hai-Ping Cheng
Journal:  PLoS One       Date:  2015-08-26       Impact factor: 3.240

5.  Pyrosequencing of the bacteria associated with Platygyra carnosus corals with skeletal growth anomalies reveals differences in bacterial community composition in apparently healthy and diseased tissues.

Authors:  Jenny C Y Ng; Yuki Chan; Hein M Tun; Frederick C C Leung; Paul K S Shin; Jill M Y Chiu
Journal:  Front Microbiol       Date:  2015-10-20       Impact factor: 5.640

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.