| Literature DB >> 26203339 |
Chun-Hui Yu1, Chung-Kuang Lu2, Huang-Ming Su3, Tzen-Yuh Chiang4, Chi-Chuan Hwang5, Tsunglin Liu6, Yi-Min Chen3.
Abstract
To date, genome sequences (complete or in draft form) from only six baeocytous cyanobacteria in four genera have been reported: Xenococcus, Chroococcidiopsis, Pleurocapsa, and Stanieria. To expand our knowledge on the diversity of baeocytous cyanobacteria, this study sequenced the genome of GI1, which is a Myxosarcina-like baeocytous cyanobacterium. GI1 is of interest not only because of its phylogenetic niche, but also because it is a cyanobiont isolated from the marine cyanobacteriosponge Terpios hoshinota, which has been shown to cause the death of corals. The ~7 Mb draft GI1 genome contains 6,891 protein-coding genes and 62 RNA genes. A comparison of genomes among the sequenced baeocytous cyanobacterial strains revealed the existence or absence of numerous discrete genes involved in nitrogen metabolism. It will be interesting to determine whether these genes are important for cyanobacterial adaptations and interactions between cyanobionts and their marine sponge hosts.Entities:
Keywords: Baeocytous cyanobacterium; Comparative genomics; Cyanobiont; GI1; Myxosarcina; Pleurocapsales; Terpios hoshinota
Year: 2015 PMID: 26203339 PMCID: PMC4510999 DOI: 10.1186/s40793-015-0011-3
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Figure 1Characteristic vegetative cell aggregates (VA) and baeocytes (B) of GI1 observed under a light microscope. Cells were cultivated in ASN III medium for 1 month (~ late exponential phase). Bar = 20 μm.
Figure 2Phylogenetic position of Myxosarcina sp. strain GI1 within cyanobacteria. The 16S rRNA gene sequences of GI1 and type strains belonging to different cyanobacterial subsections were subjected to phylogenetic analysis using MEGA5 software [31] in conjunction with the multiple alignment program CLUSTAL W to construct a maximum-likelihood tree, using bootstrap values of 1000 replicates. The GenBank accession numbers for each stain are shown in parenthesis.
Classification and general features of sp. strain GI1 according to MIGS recommendations[8]
| Current classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Order | TAS [ | ||
| Genus | TAS [ | ||
| Species Unknown | |||
| Type strain PCC 7325 | TAS [ | ||
| Gram stain | Not reported | ||
| Cell shape | Spherical (baeocyte) | IDA | |
| Cubic (vegetative cell) | IDA | ||
| Motility | Gliding (newly-born baeocyte) | IDA | |
| Sporulation | Not reported | ||
| Temperature range | Not reported | ||
| Optimum temperature | Not reported | ||
| pH range; Optimum | Not reported | ||
| Carbon source | Not reported | ||
| Energy source | Phototrophic | IDA | |
| MIGS-6 | Habitat | Marine | IDA |
| MIGS-6.3 | Salinity | 3-4% | IDA |
| MIGS-22 | Oxygen | Aerobic | IDA |
| MIGS-15 | Biotic relationship | Symbiont | IDA |
| MIGS-14 | Pathogenicity | Non-pathogen | IDA |
| MIGS-4 | Geographic location | Green Island, Taiwan | IDA |
| MIGS-5 | Sample collection time | July, 2007 | IDA |
| MIGS-4.1 | Latitude | 22.6769444 | IDA |
| MIGS-4.2 | Longitude | 121.4930556 | IDA |
| MIGS-4.3 | Depth | 10 feet | IDA |
| MIGS-4.4 | Altitude | Not applicable |
aEvidence codes - IDA: Inferred from Direct Assay; TAS: Traceable Author Statement (i.e., a direct report exists in the literature); NAS: Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [30].
Project information
| MIGS-31 | Finishing quality | High quality draft |
| MIGS-28 | Libraries used | 1 paired-end and 3 mate-pairs |
| MIGS-29 | Sequencing platforms | Illumina (GAIIx, Hiseq2000) |
| MIGS-31.2 | Fold coverage | 1186x |
| MIGS-30 | Assemblers | ALLPATHS-LG (v47833) |
| MIGS-32 | Gene calling method | RAST webserver (Glimmer 3) |
| Locus Tag | KV40 (prefix) | |
| Genbank ID | GI:685984682 | |
| Genbank Date of Release | Sep 14, 2014 | |
| GOLD ID | Gi0078648 | |
| BIOPROJECT | PRJNA259928 | |
| MIGS-13 | Source Material Identifier | N.A. |
| Project relevance | Cyanobacterial ecology, cyanobiont |
Genome statistics
| Genome size (bp) | 7,069,859 |
| DNA coding (bp) | 5,958,317 |
| DNA G+C (bp) | 2,834,956 |
| DNA scaffolds | 21 |
| Total genes | 6,953 |
| Protein coding genes | 62 |
| RNA genes | 62 |
| Pseudo genes | |
| Genes in internal clusters | |
| Genes with function prediction | |
| Genes assigned to COGs | 4,118 |
| Genes with Pfam domains | 4,730 |
| Genes with signal peptides | |
| Genes with transmembrane helices | |
| CRISPR repeats |
Number of genes associated with the 25 general COG functional categories
| J | 196 | 2.84 | Translation |
| A | 0 | 0 | RNA processing and modification |
| K | 274 | 3.98 | Transcription |
| L | 307 | 4.46 | Replication, recombination and repair |
| B | 2 | 0.03 | Chromatin structure and dynamics |
| D | 60 | 0.87 | Cell cycle control, mitosis and meiosis |
| Y | 0 | 0 | Nuclear structure |
| V | 74 | 1.07 | Defense mechanisms |
| T | 396 | 5.75 | Signal transduction mechanisms |
| M | 274 | 3.98 | Cell wall/membrane biogenesis |
| N | 41 | 0.59 | Cell motility |
| Z | 0 | 0 | Cytoskeleton |
| W | 0 | 0 | Extracellular structures |
| U | 80 | 1.16 | Intracellular trafficking and secretion |
| O | 193 | 2.80 | Posttranslational modification, protein turnover, chaperones |
| C | 239 | 3.47 | Energy production and conversion |
| G | 212 | 3.08 | Carbohydrate transport and metabolism |
| E | 299 | 4.34 | Amino acid transport and metabolism |
| F | 87 | 1.26 | Nucleotide transport and metabolism |
| H | 194 | 2.82 | Coenzyme transport and metabolism |
| I | 96 | 1.39 | Lipid transport and metabolism |
| P | 287 | 4.16 | Inorganic ion transport and metabolism |
| Q | 151 | 2.19 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 677 | 9.82 | General function prediction only |
| S | 401 | 5.82 | Function unknown |
| - | 2773 | 40.24 | Not in COGs |
aThe total is based on the total number of protein coding genes in the genome.
Figure 3Circular map of GI1 chromosome.
Genome statistics comparison among baeocytous cyanobacteria
| Genome size (bp) | 6,689,401 | 5,720,887 | 4,986,817 | 7,386,997 | 5,544,990 | 5,929,641 | 7,069,859 |
| G + C content (%) | 44.5 | 35.3 | 45.2 | 38.7 | 36.2 | 39.7 | 40.1 |
| Total genes | 6,033 | 5,176 | 4,665 | 5,896 | 5,041 | 5,419 | 6,953 |
| Protein-coding genes | 5,752 | 5,116 | 4,268 | 5,762 | 4,781 | 5,373 | 6,891 |
| Protein with function prediction | 3,728 | 3,988 | 2,848 | 1,256 | 3,393 | 3,694 | 3,664 |
| RNA genes | 58 | 60 | 56 | 52 | 52 | 46 | 62 |
| COGs | 3,980 | 3,372 | 2,970 | 3,896 | 3,207 | 3,458 | 4,118 |
| %COGs | 65.97% | 65.15% | 63.67% | 66.08% | 63.62% | 63.81% | 59.23% |
| Pfam | 4,530 | 3,918 | 3,421 | 4,495 | 3,716 | 4,049 | 4,730 |
| %Pfam | 75.09% | 75.70% | 73.33% | 76.24% | 73.72% | 74.72% | 68.03% |
| TIGRfam | 3,107 | 2,527 | 2,361 | 2,872 | 2,489 | 2,584 | 3,078 |
| %TIGRfam | 51.50% | 48.82% | 50.61% | 48.71% | 49.38% | 47.68% | 44.27% |
| SMART | 1,338 | 1,202 | 1,025 | 1,294 | 1,151 | 1,154 | 1,452 |
| %SMART | 22.18% | 23.22% | 21.97% | 21.95% | 22.83% | 21.30% | 20.88% |
| PRK | 3,211 | 2,603 | 2,484 | 2,968 | 2,549 | 2,629 | 3,154 |
| %PRK | 53.22% | 50.29% | 53.25% | 50.34% | 50.57% | 48.51% | 45.36% |
Putative gene products (obtained from genome wide COG assignment) existing in all sequenced baeocytous cyanobacteria except GI1
| UreE | O | Urease accessory protein UreE |
| SpeA | E | Arginine decarboxylase (spermidine biosynthesis) |
| GltD | E, R | NADPH-dependent glutamate synthase beta chain |
| COG5551 | V | CRISPR system related protein, RAMP |
| COG2378 | K | Predicted transcriptional regulator |
| COG4235 | O | Cytochrome c biogenesis factor |
| MnhB | P | Multisubunit Na+/H+ antiporter, MnhB subunit |
| COG4942 | D | Membrane-bound metallopeptidase |
| ERG3 | I | Sterol desaturase |
| COG1468 | V | CRISPR-associated protein Cas4 (RecB family) |
| COG1343 | V | CRISPR-associated protein Cas2 |
| COG2607 | R | Predicted ATPase (AAA+ superfamily) |
| COG3689 | S | Predicted membrane protein |
*D: cell division and chromosome partitioning; E: amino acid transport and metabolism; I: lipid metabolism; K: transcription; O: posttranslational modification, protein turnover, chaperones; P: inorganic ion transport and metabolism; Q: secondary metabolites biosynthesis, transport, and catabolism; R: general function prediction only; S: function unknown; V: defense mechanisms.
Putative gene products (obtained from genome wide COG assignment) that only appeared in GI1
| ArgR | K | Arginine repressor |
| COG2070 | R | Dioxygenases related to 2-nitropropane dioxygenase |
| HutG | E | N-formylglutamate amidohydrolase |
| COG4262 | R | Predicted spermidine synthase with an N-terminal membrane domain |
| NtrB | T | Signal transduction histidine kinase, nitrogen specific |
| IQG1 | D, T | Protein involved in regulation of cellular morphogenesis/cytokinesis |
| COG3635 | G | Predicted phosphoglycerate mutase, AP superfamily |
| Rof | K | Transcriptional antiterminator |
| COG4092 | M | Predicted glycosyltransferase involved in capsule biosynthesis |
| PRI2 | L | Eukaryotic-type DNA primase, large subunit |
| ERG12 | I | Mevalonate kinase |
| MazG | R | Predicted pyrophosphatase |
| CelA | G | Cellobiohydrolase A (1,4-beta-cellobiosidase A) |
| COG4101 | G | Predicted mannose-6-phosphate isomerase |
| COG1107 | L | Archaea-specific RecJ-like exonuclease |
| COG4129 | S | Predicted membrane protein |
| PepD | E | Dipeptidase |
| COG4849 | R | Predicted nucleotidyltransferase |
| COG3103 | T | SH3 domain protein |
| AbiF | V | Abortive infection bacteriophage resistance protein |
| DRG | R | Predicted GTPase |
| COG4186 | R | Predicted phosphoesterase or phosphohydrolase |
| COG3292 | T | Predicted periplasmic ligand-binding sensor domain |
| COG4227 | L | Antirestriction protein |
| COG2837 | P | Predicted iron-dependent peroxidase |
| COG4109 | K | Predicted transcriptional regulator containing CBS domains |
| MecR1 | K, T | Antirepressor regulating drug resistance |
| Gcd | G | Glucose dehydrogenase |
| COG3588 | G | Fructose-1,6-bisphosphate aldolase |
| COG1289 | S | Predicted membrane protein |
| COG4341 | R | Predicted HD phosphohydrolase |
| SRP1 | U | Karyopherin (importin) alpha |
| COG3886 | L | Predicted HKD family nuclease |
| COG1444 | R | Predicted P-loop ATPase fused to an acetyltransferase |
| COG1204 | R | Superfamily II helicase |
| HdrC | C | Heterodisulfide reductase, subunit C |
*C: energy production and conversion; D: cell division and chromosome partitioning; E: amino acid transport and metabolism; G: carbohydrate transport and metabolism; I: lipid metabolism; K: transcription; L: DNA replication, recombination, and repair; M: cell envelope biogenesis, outer membrane; P: inorganic ion transport and metabolism; R: general function prediction only; S: function unknown; T: signal transduction mechanisms; U: intracellular trafficking and secretion; V: defense mechanisms.