| Literature DB >> 25197450 |
Bryndan P Durham1, Jana Grote2, Kerry A Whittaker3, Sara J Bender4, Haiwei Luo5, Sharon L Grim6, Julia M Brown7, John R Casey2, Antony Dron8, Lennin Florez-Leiva9, Andreas Krupke10, Catherine M Luria11, Aric H Mine12, Olivia D Nigro2, Santhiska Pather13, Agathe Talarmin14, Emma K Wear15, Thomas S Weber16, Jesse M Wilson17, Matthew J Church2, Edward F DeLong18, David M Karl2, Grieg F Steward2, John M Eppley18, Nikos C Kyrpides19, Stephan Schuster20, Michael S Rappé21.
Abstract
Strain HIMB11 is a planktonic marine bacterium isolated from coastal seawater in Kaneohe Bay, Oahu, Hawaii belonging to the ubiquitous and versatile Roseobacter clade of the alphaproteobacterial family Rhodobacteraceae. Here we describe the preliminary characteristics of strain HIMB11, including annotation of the draft genome sequence and comparative genomic analysis with other members of the Roseobacter lineage. The 3,098,747 bp draft genome is arranged in 34 contigs and contains 3,183 protein-coding genes and 54 RNA genes. Phylogenomic and 16S rRNA gene analyses indicate that HIMB11 represents a unique sublineage within the Roseobacter clade. Comparison with other publicly available genome sequences from members of the Roseobacter lineage reveals that strain HIMB11 has the genomic potential to utilize a wide variety of energy sources (e.g. organic matter, reduced inorganic sulfur, light, carbon monoxide), while possessing a reduced number of substrate transporters.Entities:
Keywords: Roseobacter; aerobic anoxygenic phototroph; dimethylsulfoniopropionate; marine bacterioplankton
Year: 2014 PMID: 25197450 PMCID: PMC4148974 DOI: 10.4056/sigs.4998989
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Phylogenetic relationships between HIMB11 and bacterial strains belonging to the clade. SSU rRNA gene sequences were aligned with version 111 of the ‘All-Species Living Tree’ project SSU rRNA gene database [16] using the ARB software package [17]. The phylogeny was constructed from nearly full-length gene sequences using the RAxML maximum likelihood method [18] within ARB, filtered to exclude alignment positions that contained gaps or ambiguous nucleotides in any of the sequences included in the tree. Bootstrap analyses were determined by RAxML [19] via the raxmlGUI graphical front end [20]. The scale bar corresponds to 0.05 substitutions per nucleotide position. Open circles indicate nodes with bootstrap support between 50-80%, while closed circles indicate bootstrap support >80%, from 500 replicates. A variety of were used as outgroups.
Figure 2Maximum likelihood phylogenomic tree derived from a concatenation of 719 shared single-copy orthologous protein sequences from strain HIMB11, 40 other members, and the SAR116 clade member Ca. Puniceispirillum marinum IMCC1322 (outgroup) using the RAxML maximum likelihood method [18]. After a reciprocal BLAST search [22] was performed for all possible genome pairs at the amino acid level using an E value of 10-6, orthologous genes in each of the genome pairs were identified using the MSOAR software [23], which assigns orthologs by considering both sequence similarity and gene context information. One genome was picked at random as the reference genome, and pairwise orthologs were linked to the reference. Values at the nodes show the number of times the clade defined by that node appeared in the 100 bootstrapped data sets. The scale bar indicates number of substitutions per site. Bootstrap values lower than 50% were not displayed.
Figure 3Scanning electron micrograph of strain HIMB11. For scale, the membrane pore size is 0.2 μm in diameter.
Classification and general features of strain HIMB11 according to the MIGS recommendations [29].
| | | | |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus not assigned | |||
| Species not assigned | |||
| Strain HIMB11 | |||
| Gram stain | Negative | NAS | |
| Cell shape | Short irregular rods | IDA | |
| Motility | Flagella | NAS | |
| Sporulation | Non-sporulating | NAS | |
| Temperature range | Mesophile | IDA | |
| Optimum temperature | Unknown | ||
| Carbon source | Ambient seawater DOC | TAS [ | |
| Energy source | Mixotrophic | NAS | |
| Terminal electron receptor | |||
| MIGS-6 | Habitat | Seawater | IDA |
| MIGS-6.3 | Salinity | ~35.0 ‰ | IDA |
| MIGS-22 | Oxygen | Aerobic | NAS |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | None | NAS |
| MIGS-4 | Geographic location | Kaneohe Bay, Hawaii | TAS [ |
| MIGS-5 | Sample collection time | 18 May, 2005 | TAS [ |
| MIGS-4.1 | Latitude - Longitude | 21.44, -157.78 | TAS [ |
| MIGS-4.2 | |||
| MIGS-4.3 | Depth | ~1 m | TAS [ |
| MIGS-4.4 | Altitude |
a) Evidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [37].
Project information
| | | |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | One standard 454 pyrosequence titanium library |
| MIGS-29 | Sequencing platforms | 454 GS FLX Titanium |
| MIGS-31.2 | Fold coverage | 121× pyrosequence |
| MIGS-30 | Assemblers | Newbler version 2.5.3 |
| MIGS-32 | Gene calling method | Prodigal 1.4, GenePRIMP |
| Genome Database release | IMG; 2506210027 | |
| Genbank ID | AVDB00000000 | |
| Genbank Date of Release | September 15, 2013 | |
| GOLD ID | Gi09592 | |
| Project relevance | Environmental |
Nucleotide content and gene count levels of the genome
| Attribute | Value | % of totala |
|---|---|---|
| Genome size (bp) | 3,098,747 | 100.00 |
| DNA coding region (bp) | 2,812,982 | 90.78 |
| DNA G+C content (bp) | 1,541,077 | 49.73 |
| Total genes | 3,237 | 100.00 |
| RNA genes | 54 | 1.67 |
| Protein-coding genes | 3,183 | 98.33 |
| Genes in paralog clusters | ||
| Genes assigned to COGs | 2,523 | 77.94 |
| 1 or more conserved domains | ||
| 2 or more conserved domains | ||
| 3 or more conserved domains | ||
| 4 or more conserved domains | ||
| Genes with signal peptides | 919 | 28.39 |
| Genes with transmembrane helices | 654 | 20.20 |
| Paralogous groups |
a) The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
Number of genes associated with the 25 general COG functional categories
| | | | |
|---|---|---|---|
| J | 158 | 5.6 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 159 | 5.7 | Transcription |
| L | 104 | 3.7 | Replication, recombination and repair |
| B | 3 | 0.1 | Chromatin structure and dynamics |
| D | 24 | 0.9 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 22 | 0.8 | Defense mechanisms |
| T | 74 | 2.6 | Signal transduction mechanisms |
| M | 133 | 4.7 | Cell wall/membrane biogenesis |
| N | 40 | 1.4 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 38 | 1.4 | Intracellular trafficking and secretion |
| O | 118 | 4.2 | Posttranslational modification, protein turnover, chaperones |
| C | 196 | 7.0 | Energy production and conversion |
| G | 165 | 5.9 | Carbohydrate transport and metabolism |
| E | 334 | 11.9 | Amino acid transport and metabolism |
| F | 73 | 2.6 | Nucleotide transport and metabolism |
| H | 157 | 5.6 | Coenzyme transport and metabolism |
| I | 154 | 5.5 | Lipid transport and metabolism |
| P | 112 | 4.0 | Inorganic ion transport and metabolism |
| Q | 123 | 4.4 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 379 | 13.5 | General function prediction only |
| S | 238 | 8.5 | Function unknown |
| - | 714 | 22.06 | Not in COGs |
a) The total is based on the total number of protein coding genes in the annotated genome.
Figure 4Proposed mechanisms for the acquisition of carbon, nitrogen, phosphorus, sulfur, and energy in HIMB11. Substrates that are hypothesized to be transported and used by HIMB11 are in bold. Genes that designate these mechanisms are indicated. DMSP, dimethylsulfoniopropionate; DMS, dimethylsulfide; DON, dissolved organic nitrogen; C-O-P, phosphoesters; C-P, phosphonates; Pi, phosphate.
Figure 5Regression model for strain HIMB11 genome size estimation based on the genomes of 40 cultured strains. The x-axis shows the ratio of the number of conserved single-copy genes universally present in fully sequenced genomes to the number of predicted protein-encoding genes. The y-axis is the number of nucleotides sequenced. The data were fit to an exponential regression model (R2=0.94), and the model was used to estimate the genome size of HIMB11 to be 3.42 Mb.
Publicly available clade genomes use for comparative analysis with strain HIMB11, as of IMG release 3.4.
| | | | | |
|---|---|---|---|---|
| Draft | 5,523,231 | 5,499 | 67 | |
| Finished | 4,417,868 | 4,271 | 66 | |
| Finished | 4,404,049 | 4,339 | 62 | |
| Draft | 3,497,325 | 3,703 | 55 | |
| Draft | 3,063,691 | 3,117 | 60 | |
| Draft | 4,529,231 | 4,763 | 64 | |
| Draft | 4,105,524 | 4,208 | 60 | |
| Draft | 4,437,668 | 4,261 | 66 | |
| Draft | 4,039,111 | 3,855 | 70 | |
| Draft | 5,393,715 | 5,883 | 55 | |
| Draft | 4,909,025 | 5,544 | 55 | |
| Draft | 5,425,920 | 5,519 | 66 | |
| Draft | 4,157,399 | 4,017 | 60 | |
| Draft | 4,232,367 | 4,136 | 60 | |
| Draft | 4,018,415 | 4,226 | 53 | |
| Draft | 3,582,902 | 3,713 | 49 | |
| Draft | 4,344,244 | 4,206 | 64 | |
| Draft | 2,224,475 | 2,209 | 37 | |
| Finished | 4,331,234 | 4,201 | 59 | |
| Draft | 4,178,704 | 4,197 | 62 | |
| Draft | 4,652,716 | 4,605 | 57 | |
| Draft | 4,555,826 | 4,610 | 57 | |
| Draft | 3,668,667 | 3,605 | 64 | |
| Draft | 4,762,632 | 4,823 | 61 | |
| Draft | 4,209,812 | 4,158 | 61 | |
| Finished | 4,601,053 | 4,355 | 64 | |
| Draft | 4,487,498 | 4,338 | 58 | |
| Finished | 4,153,699 | 3,964 | 60 | |
| Draft | 5,262,893 | 5,121 | 65 | |
| Draft | 3,523,710 | 3,677 | 63 | |
| Draft | 4,689,084 | 4,814 | 59 | |
| Draft | 3,547,243 | 3,542 | 60 | |
| Draft | 4,527,951 | 4,258 | 59 | |
| Draft | 4,002,069 | 4,026 | 60 | |
| Draft | 3,487,925 | 3,744 | 55 |