| Literature DB >> 30344889 |
Juan J Fuentes-Valdés1,2, Katia Soto-Liebe1, Danilo Pérez-Pantoja3, Javier Tamames4, Lucy Belmar1, Carlos Pedrós-Alió4, Daniel Garrido2, Mónica Vásquez1.
Abstract
Members of the genus Cylindrospermopsis represent an important environmental and health concern. Strains CS-508 and MVCC14 of C. raciborskii were isolated from freshwater reservoirs located in Australia and Uruguay, respectively. While CS-508 has been reported as non-toxic, MVCC14 is a saxitoxin (STX) producer. We annotated the draft genomes of these C. raciborskii strains using the assembly of reads obtained from Illumina MiSeq sequencing. The final assemblies resulted in genome sizes close to 3.6 Mbp for both strains and included 3202 ORFs for CS-508 (in 163 contigs) and 3560 ORFs for MVCC14 (in 99 contigs). Finally, both the average nucleotide identity (ANI) and the similarity of gene content indicate that these two genomes should be considered as strains of the C. raciborskii species.Entities:
Keywords: Bloom; Cyanobacteria; Cylindrospermopsis; Environmental toxicity; Non-ribosomal peptide-synthetase; Polyketide synthases
Year: 2018 PMID: 30344889 PMCID: PMC6186047 DOI: 10.1186/s40793-018-0323-1
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Light microscope photographs of source organisms. a-b C. raciborskii CS-508 and of c-e C. raciborskii MVCC14
Classification and general features of C. raciborskii strain CS-508 according to MIGS designation [45]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Filaments | ||
| Motility | Temporary-motile (Hormogonia) | ||
| Sporulation | None | TAS [ | |
| Temperature range | Mesophile | TAS [ | |
| Optimum temperature | 25 °C | TAS [ | |
| pH range; Optimum | pH 7.50–9.21; pH 8.33 | ||
| Carbon source | Autotroph | TAS [ | |
| MIGS-6 | Habitat | Freshwater | TAS [ |
| MIGS-6.3 | Salinity | 0.4% NaCl (maximum) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | NAS |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | non-pathogen | TAS [ |
| MIGS-4 | Geographic location | Isolated Solomon Dam, Australia | NAS |
| MIGS-5 | Sample collection | 1999 | NAS |
| MIGS-4.1 | Latitude | −18.7241 | IDA |
| MIGS-4.2 | Longitude | 146.5938 | TAS [ |
| MIGS-4.4 | Altitude | Unknown | TAS [ |
aEvidence codes - IDA: Inferred from Direct Assay; TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [54]
Classification and general features of C. raciborskii strain MVCC14 according to MIGS designation [45]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species | TAS [ | ||
| Gram stain | Negative | ||
| Cell shape | Filaments | TAS [ | |
| Motility | Non-motile | ||
| Sporulation | None | TAS [ | |
| Temperature range | Mesophile | TAS [ | |
| Optimum temperature | 25 °C | TAS [ | |
| pH range; Optimum | pH 7.50–9.21; pH 8.33 | ||
| Carbon source | Autotroph | TAS [ | |
| MIGS-6 | Habitat | Fresh water | TAS [ |
| MIGS-6.3 | Salinity | 0.4% NaCl (maximum) | IDA |
| MIGS-22 | Oxygen requirement | Aerobic | NAS |
| MIGS-15 | Biotic relationship | free-living | NAS |
| MIGS-14 | Pathogenicity | Saxitoxin (STX) | TAS [ |
| MIGS-4 | Geographic location | Isolated Laguna Blanca, Uruguay | NAS |
| MIGS-5 | Sample collection | Unknown | NAS |
| MIGS-4.1 | Latitude | −34.8984 | TAS [ |
| MIGS-4.2 | Longitude | −54.8369 | TAS [ |
| MIGS-4.4 | Altitude | Unknown | NAS |
aEvidence codes - IDA: Inferred from Direct Assay; TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [54]
Fig. 2Maximum likelihood (ML) phylogenetic tree indicating the phylogenetic relationship of C. raciborskii strains. The ML tree is based on 16S rRNA gene sequences from C. raciborskii strains CS-508 and MVCC14 and sequences retrieved from previous reports stored in the NCBI database. These sequences were aligned using MUSCLE [43] and the phylogenetic tree was constructed with the phyML using GTR substitution model and BEST option for searching the starting tree [44]. Bootstrap support values ≥50% are indicated from 1000 bootstrap replicates. In supplemental material a complete phylogenetic tree is reported (Additional file 1: Figure S1)
Project information
| MIGS ID | Property | Term (for CS-508) | Term (for MVCC14) |
|---|---|---|---|
| MIGS 31 | Finishing quality | High- Quality Draft | High- Quality Draft |
| MIGS-28 | Libraries used | Illumina | Illumina |
| MIGS 29 | Sequencing platforms | Illumina HiSeq2000 | Illumina HiSeq2000 |
| MIGS 31.2 | Fold coverage | 20× | 20× |
| MIGS 30 | Assemblers | IDBA, SPADES, VELVET and ABYSS | IDBA, SPADES, VELVET and ABYSS |
| MIGS 32 | Gene calling method | Rast | Rast |
| Locus Tag | CYL_CS508 | CYL_MVCC14 | |
| GenBank ID | MBQX00000000 | MBQY00000000 | |
| GenBank Date of Release | November 01, 2016 | November 01, 2016 | |
| GOLD ID | Gs0120410 | Gs0121371 | |
| BIOPROJECT | PRJNA327084 | PRJNA327088 | |
| MIGS 13 | Source Material Identifier | Freshwater | Freshwater |
| Project relevance | Environment | Environment |
Genome statistics of CS-508 (A) and MVCC14 (B)
| Attribute | A Value | A % of Total | B Value | B % of Total |
|---|---|---|---|---|
| Genome size (bp) | 3,558,956 | 100 | 3,594,524 | 100 |
| DNA coding (bp) | 3,039,246 | 85.34 | 3,074,946 | 85.55 |
| DNA G + C (bp) | 1,530,351 | 43 | 1,581,591 | 44 |
| DNA scaffolds | 163 | 100 | 99 | 100 |
| Total genes | 3344 | 100 | 3616 | 100 |
| Protein coding genes | 3302 | 98.74 | 3560 | 98.45 |
| RNA genes | 42 | 1.26 | 56 | 1.55 |
| Pseudo genes | – | – | – | – |
| Genes in internal clusters | – | – | – | – |
| Genes with function prediction | 2247 | 67.19 | 2337 | 64.63 |
| Genes assigned to COGs | 1747 | 56.16 | 1796 | 55.55 |
| Genes with Pfam domains | 2656 | 79.43 | 2800 | 77.43 |
| Genes with signal peptides | 71 | 2.12 | 63 | 1.74 |
| Genes with transmembrane helices | 255 | 7.63 | 748 | 20.66 |
| CRISPR repeats | 7 | – | 9 | – |
Number of genes associated with general COG functional categories
| CS-508 | MVCC14 | Description | |||
|---|---|---|---|---|---|
| Code | Value | %age | Value | %age | |
| J | 142 | 4.56 | 143 | 4.37 | Translation, ribosomal structure and biogenesis |
| A | 0 | 0.00 | 0 | 0.00 | RNA processing and modification |
| K | 69 | 2.22 | 64 | 1.96 | Transcription |
| L | 88 | 2.83 | 112 | 3.43 | Replication, recombination and repair |
| B | 0 | 0.00 | 0 | 0.00 | Chromatin structure and dynamics |
| D | 21 | 0.68 | 19 | 0.58 | Cell cycle control, Cell division, chromosome partitioning |
| V | 0 | 0.00 | 0 | 0.00 | Defense mechanisms |
| T | 49 | 1.58 | 54 | 1.65 | Signal transduction mechanisms |
| M | 123 | 3.95 | 130 | 3.98 | Cell wall/membrane biogenesis |
| N | 6 | 0.19 | 5 | 0.15 | Cell motility |
| U | 0 | 0.00 | 0 | 0.00 | Intracellular trafficking and secretion |
| O | 111 | 3.57 | 111 | 3.40 | Posttranslational modification, protein turnover, chaperones |
| C | 157 | 5.05 | 163 | 4.99 | Energy production and conversion |
| G | 99 | 3.18 | 93 | 2.84 | Carbohydrate transport and metabolism |
| E | 125 | 4.02 | 123 | 3.76 | Amino acid transport and metabolism |
| F | 45 | 1.45 | 44 | 1.35 | Nucleotide transport and metabolism |
| H | 104 | 3.34 | 107 | 3.27 | Coenzyme transport and metabolism |
| I | 32 | 1.03 | 31 | 0.95 | Lipid transport and metabolism |
| P | 128 | 4.11 | 130 | 3.98 | Inorganic ion transport and metabolism |
| Q | 40 | 1.29 | 36 | 1.10 | Secondary metabolites biosynthesis, transport and catabolism |
| R | 252 | 8.10 | 262 | 8.01 | General function prediction only |
| S | 156 | 5.01 | 169 | 5.17 | Function unknown |
| – | 1364 | 43.84 | 1473 | 45.06 | Not in COGs |
The total is based on the total number of protein coding genes in the genome
Average nucleotide identity (ANI) values for the sequenced C. raciborskii and Raphidiopsis brookii strains
| – | 99.31 | 99.73 | 99.77 | 93.73 | 93.26 | |
| 99.31 | – | 99.39 | 99.32 | 93.45 | 92.85 | |
| 99.73 | 99.39 | – | 99.76 | 93.82 | 93.25 | |
| 99.77 | 99.32 | 99.76 | – | 93.80 | 93.23 | |
| 93.73 | 93.45 | 93.82 | 93.80 | – | 97.17 | |
| 93.26 | 92.85 | 93.25 | 93.23 | 97.17 | – |