| Literature DB >> 27777652 |
Natalia Velichko1, Mikhail Rayko2, Ekaterina Chernyaeva3, Alla Lapidus3, Alexander Pinevich1.
Abstract
Prochlorothrix hollandica is filamentous non-heterocystous cyanobacterium which possesses the chlorophyll a/b light-harvesting complexes. Despite the growing interest in unusual green-pigmented cyanobacteria (prochlorophytes) to date only a few sequenced genome from prochlorophytes genera have been reported. This study sequenced the genome of Prochlorothrix hollandica CCAP 1490/1T (CALU1027). The produced draft genome assembly (5.5 Mb) contains 3737 protein-coding genes and 114 RNA genes.Entities:
Keywords: Comparative genomics; Cyanobacteria; Prochlorophytes; Prochlorothrix hollandica
Year: 2016 PMID: 27777652 PMCID: PMC5069947 DOI: 10.1186/s40793-016-0204-4
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phylogenetic position of P. hollandica CALU1027 within cyanobacteria. GenBank accession numbers are indicated in parentheses. The numbers above branches indicate bootstrap support from 1000 replicates
Fig. 2Light micrograph of P. hollandica CALU1027. Scale bar 10 μm
Classification and general features of P. hollandica CALU1027
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Current classification | Domain | TAS [ | |
| Phylum BX | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Type strain CCAP 1490/1T | TAS [ | ||
| Gram stain | Not reported | ||
| Cell shape | Elongated rods | TAS [ | |
| Motility | Nonmotile | TAS [ | |
| Sporulation | Not reported | ||
| Temperature range | 15 °C − 30 °C | TAS [ | |
| Optimum temperature | 20 °C | TAS [ | |
| pH range, Optimum | 8.4 | TAS [ | |
| Carbon source | Autotroph | TAS [ | |
| Energy source | Phototroph | TAS [ | |
| MIGS-6 | Habitat | Freshwater | TAS [ |
| MIGS-6.3 | Salinity | Less than 25 mM | TAS [ |
| MIGS-22 | Oxygen requirement | Aerobic | TAS [ |
| MIGS-6.4 | Chlorophyll type | Chlorophylls | TAS [ |
| MIGS-15 | Biotic relationships | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Not reported | |
| MIGS-4 | Geographic location | Loosdrecht lake, The Netherlands | TAS [ |
| MIGS-5 | Sample collection time | 9 July, 1984 | TAS [ |
| MIGS-4.1 | Latitude | 52.20 N | TAS [ |
| MIGS-4.2 | Longitude | 5.5 E | TAS [ |
| MIGS-4.3 | Depth | 0.2 m | TAS [ |
| MIGS-4.4 | Altitude | 2 m | NAS |
aEvidence codes - TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence)
These evidence codes are from the Gene Ontology Project [25]
Project information
| MIGS ID | Property | Term |
|---|---|---|
| MIGS-31 | Finishing quality | Draft |
| MIGS-28 | Libraries used | Illumina paired-end library |
| MIGS-29 | Sequencing platform | Illumina MiSeq |
| MIGS-31.2 | Fold coverage | 30× |
| MIGS-30 | Assemblers | SPAdes v. 3.5.0 |
| MIGS-32 | Gene calling method | GeneMarkS+ |
| Locus Tag | PROH | |
| GenBank ID | GCA_000341585.2 | |
| Genbank date of release | 20 February, 2013 | |
| Gold ID | Gp0010359 | |
| BioProject | PRJNA63021 | |
| DDBJ ID | AJTX00000000.2 | |
| MIGS-13 | Source Material Identifier | CALU1027 |
| Project relevance | comparative genomics |
Genome statistics
| Attribute | Genome (total) | |
|---|---|---|
| Value | % of totala | |
| Genome size (bp) | 5,525,469 | 100.00 |
| DNA coding (bp) | 3,931,877 | 71.16 |
| DNA G + C (bp) | 2,999,78 | 54.56 |
| DNA scaffolds | 10 | − |
| Total genes | 4,294 | 100.00 |
| Protein coding genes | 3,737 | 87.00 |
| RNA genes | 57 | 1.32 |
| rRNA genes | 12 | 0.28 |
| tRNA genes | 44 | 1.02 |
| ncRNA genes | 1 | 0.02 |
| Pseudo genes | 515 | 11.99 |
| Genes in internal clusters | 235 | 5.4 |
| Genes with function prediction | 2,770 | 64,5 |
| Genes assigned to COGs | 2,855 | 66.00 |
| Genes with Pfam domains | 2,386 | 55.56 |
| Genes with signal peptides | 86 | 2 |
| Genes with transmembrane helices | 869 | 20.24 |
| CRISPR repeats | 9 | 0.2 |
a The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome
Number of genes associated with general COG functional categories
| Code | Value | % agea | Description |
|---|---|---|---|
| J | 160 | 4.28 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0 | RNA processing and modification |
| K | 141 | 3.77 | Transcription |
| L | 213 | 5.69 | Replication, recombination and repair |
| B | 3 | 0.08 | Chromatin structure and dynamics |
| D | 39 | 1.04 | Cell cycle control, cell division, chromosome partitioning |
| V | 64 | 1.71 | Defense mechanisms |
| T | 316 | 8.46 | Signal transduction mechanisms |
| M | 210 | 5.62 | Cell wall/membrane biogenesis |
| N | 56 | 1.50 | Cell motility |
| U | 76 | 2.03 | Intracellular trafficking and secretion |
| O | 144 | 3.85 | Posttranslational modification, protein turnover, chaperones |
| C | 148 | 3.96 | Energy production and conversion |
| G | 126 | 3.37 | Carbohydrate transport and metabolism |
| E | 201 | 5.37 | Amino acid transport and metabolism |
| F | 67 | 1.79 | Nucleotide transport and metabolism |
| H | 156 | 4.17 | Coenzyme transport and metabolism |
| I | 55 | 1.47 | Lipid transport and metabolism |
| P | 136 | 3.64 | Inorganic ion transport and metabolism |
| Q | 52 | 1.39 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 407 | 10.89 | General function prediction only |
| S | 409 | 10.94 | Function unknown |
| − | 8 | 0.21 | Not in COGs |
aThe total is based on the total number of protein coding genes in annotated genome
Selected functional capacities
| Cell function | Metabolic system/element | Putative gene/gene product |
|---|---|---|
| Light energy metabolism | Oxygenic phototrophy; photorespiration |
|
| Dark energy metabolism | Glycolysis and gluconeogenesis; methylglyoxal metabolism, pentose phosphate pathway; Entner-Doudoroff pathway; pyruvate cleavage; TCA with glyoxylate bypass | GlcK, HxK, PPgK, PfK1, PfK2, PPiFKa, PPiFKb, Fbp_I, B, X; Fba1- 2, TpI, GADH, G3PNP, PgK, PgM, EnO, PyK, PpS, PpD, Hyp1, GPDH; MgsA, GloA-B, AldA-B, GRE2; GPDH, PLG, RisA-B, TK, TA, FPK, XPK, PglD, OpcA; AlaDH, AlaR, AlaGAT, SerD, SerT; |
| Lipid/pigments metabolism | Chl, iron tetrapyrrols, fatty acids, isoprenoids, phospholipids | PMgCD, PMgCH, PmgMT, ChlEAe, DVR, ChlB, ChlG, ChlL-M, POR; GltR, UroM, UroD, HemQ, HemX-Y; FabA-T, HpnE-H; CruA-G, CrtL, GlyP, GarL |
| Carbon substrate intermediary metabolism | Calvin cycle; fructose, galactose, mannose, sucrose, polyglucoside, aminosugar, nucleotide sugar, C1-substrate and glycogen metabolism | PRK, Rbc, PGK, GAPDH, TPI, FBA1-2, FBP_I-X, TK, RPE; |
| Nitrogen substrate intermediary metabolism | Nitrogen and ammonia assimilation; urea cycle |
|
| Protein metabolism | Amino acids, polyamines and glutathione biosyntheses; protein processing, degradation, modification and folding; selenoproteins | GltB, GlxC-B, GldH, AspA-C, AsnA-B, GltS, GlsA, HisA-I, AstA-E, ArgR, SpeA-C, ArcA-D, MetN-T, ThrA-C, AspC, CysB-E, Lys1, LLP, CadA-C, DavA-D, CodA, LeuA-D, TrpA-E, TyrA, PheA, ProA-C, SelD, GlyA-B, AlaB, AlaR, CsdA, SufS, SerA-C; SelA-B |
| Mineral substrate metabolism | Phosphate, sulfur, iron and potassium metabolism |
|
| Enzyme cofactor metabolism | Coenzyme B12, FAD, FMN, lipoic acid, Mo-cofactor, NAD, pterines, pyridoxin, quinone, riboflavin, thiamine biosynthesis | BioC, BioH, HoxE, HoxF, HoxH, HoxU, HoxY, CobA-C, CbiA-K, ThiB-G; UbiA-H; |
| Secondary metabolism | Auxin, flavonoids, terpenes and derivatives biosynthesis | plant hormones (AUX1, APRT, PRAI, IGS,TSa, TM, IAH, IAD, AAD, AFTS), toxin-antitoxin replicon stabilization systems (RelB, E, F; CcdA-B, ParE-D, HigA-B, VapC-B, YoeB, YefM, YafQ, DinJ, YeeU, YkfI, YafW, YpjF, YgiZ) |
| Membrane transport | ABC transporters ( | |
| Cell division, cytoskeleton |
| |
| Regulation |
| |
| Stress response | Protection from reactive oxygen species; oxidative and periplasmic stress |
|
| Phages, integrons and CRISPRs | SA bacteriophages 11, TFP1-2, TFAP, TFC, Lys1-8, LysA-B, Hol1-2, TransI, endolysin; integrons (Int1-2, Int4, InyIPac); CRISPR cmr-cluster (Cmr1-6, Csx11, NEO113, TM1812, Cas02710); CasReg, Cas1-7, Csh1-2, Csd1-2, Cse1-4, Csn1-2, Csy1-4, Csa1-5, Csm1-5, Cst1-2 |