| Literature DB >> 29774086 |
Samuel M Luedin1,2,3, Joël F Pothier4, Francesco Danza1,2, Nicola Storelli1,2, Niels-Ulrik Frigaard5, Matthias Wittwer3, Mauro Tonolla1,2.
Abstract
"Thiodictyon syntrophicum" sp. nov. strain Cad16T is a photoautotrophic purple sulfur bacterium belonging to the family of Chromatiaceae in the class of Gammaproteobacteria. The type strain Cad16T was isolated from the chemocline of the alpine meromictic Lake Cadagno in Switzerland. Strain Cad16T represents a key species within this sulfur-driven bacterial ecosystem with respect to carbon fixation. The 7.74-Mbp genome of strain Cad16T has been sequenced and annotated. It encodes 6237 predicted protein sequences and 59 RNA sequences. Phylogenetic comparison based on 16S rRNA revealed that Thiodictyon elegans strain DSM 232T the most closely related species. Genes involved in sulfur oxidation, central carbon metabolism and transmembrane transport were found. Noteworthy, clusters of genes encoding the photosynthetic machinery and pigment biosynthesis are found on the 0.48 Mb plasmid pTs485. We provide a detailed insight into the Cad16T genome and analyze it in the context of the microbial ecosystem of Lake Cadagno.Entities:
Keywords: CRISPR; Chromatiaceae; Meromictic lake; Okenone; Phototrophic sulfur bacteria; Sulfur cycling
Year: 2018 PMID: 29774086 PMCID: PMC5944118 DOI: 10.1186/s40793-018-0317-z
Source DB: PubMed Journal: Stand Genomic Sci ISSN: 1944-3277
Fig. 1Phase-contrast photomicrograph of “T. syntrophicum” sp. nov. strain Cad16T. The elementary sulfur globule inclusions are visible as black dots within the cell
Fig. 2Phylogenetic tree indicating the position of “T. syntrophicum” sp. nov. strain Cad16T relative to other genomes sequenced within the family Chromatiaceae. The tree was inferred from 16S rRNA sequences (≥ 1300 bp) using the maximum likelihood algorithm of the IQ-TREE software [62]. 1000 bootstrap replicates were performed. Numbers at the nodes indicate the SH-aLRT support (%) and ultrafast bootstrap support (%), respectively. Asterisk mark strains with complete genome sequences available. Open circles as node shapes indicate bootstrap support values below 50%
Classification and general features of “T. syntrophicum” sp. nov. strain Cad16T according to the MIGS recommendations [65]
| MIGS ID | Property | Term | Evidence codea |
|---|---|---|---|
| Classification | Domain | TAS [ | |
| Phylum | TAS [ | ||
| Class | TAS [ | ||
| Order | TAS [ | ||
| Family | TAS [ | ||
| Genus | TAS [ | ||
| Species “” | TAS [ | ||
| Strain: Cad16T | TAS [ | ||
| Gram stain | Negative | TAS [ | |
| Cell shape | Coccus | TAS [ | |
| Motility | Non-motile | TAS [ | |
| Sporulation | No | NAS | |
| Temperature range | 5–25 °C | TAS [ | |
| Optimum temperature | 20–23 | TAS [ | |
| pH range; Optimum | 6.8–7.5 | TAS [ | |
| Carbon source | CO2, acetate, pyruvate, fructose | TAS [ | |
| MIGS-6 | Habitat | Fresh water, alpine meromictic lake | TAS [ |
| MIGS-6.3 | Salinity | Not determined | NAS |
| MIGS-22 | Oxygen requirement | Aerotolerant | TAS [ |
| MIGS-15 | Biotic relationship | Free-living | TAS [ |
| MIGS-14 | Pathogenicity | Non-pathogen | NAS |
| MIGS-4 | Geographic location | Switzerland, Ticino | TAS [ |
| MIGS-5 | Sample collection | 08.28.2001 | TAS [ |
| MIGS-4.1 | Latitude | 46°33’ N | TAS [ |
| MIGS-4.2 | Longitude | 8°43′ E | TAS [ |
| MIGS-4.4 | Altitude | 1923 m | TAS [ |
aEvidence codes – IDA Inferred from Direct Assay, TAS Traceable Author Statement (i.e., a direct report exists in the literature), NAS Non-traceable Author Statement (i.e., not directly observed for the living, isolated sample, but based on a generally accepted property for the species, or anecdotal evidence). These evidence codes are from the Gene Ontology project [10]
Fig. 3Schematic representation of the complete genome sequence of “Thiodictyon syntrophicum” sp. nov. strain Cad16T and gene classification according to the COG criteria. The genome comprises one circular chromosome (a) and two circular plasmids, pTs485 (b) and pTs417 (c). Beginning with the outermost circle, tracks are shown in the order of: (1) predicted PGC on pTs485 (green), (2) CRISPR arrays on the chromosome and pTs485 (alternating in blue and red) (3) protein coding genes on forward strand colored according to COG categories, (4) CDS (blue), tRNA (orange) and rRNA (violet) on forward strand, (5) CDS (blue), tRNA (orange) and rRNA (violet) on reverse strand, (6) protein coding genes on reverse strand colored according to COG categories, (7) GC content (black), (8) positive and negative GC skew (green and purple, respectively) and (9) genome region by kbp. GCView [63] was used to create this genome map
Project information for “T. syntrophicum” sp. nov. strain Cad16T
| MIGS ID | Property | Term |
|---|---|---|
| MIGS 31 | Finishing quality | Complete |
| MIGS-28 | Libraries used | SMRT 10 kb (BluePippin size selection) |
| MIGS 29 | Sequencing platforms | PacBio RS II |
| MIGS 31.2 | Fold coverage | 200× |
| MIGS 30 | Assemblers | HGAP2 |
| MIGS 32 | Gene calling method | GeneMarkS+, software revision: 4.1 |
| Locus Tag | THSYN | |
| GeneBank ID | CP020370.1, CP020371.1, CP020372.1 | |
| GeneBank Date of Release | 07/12/2017 | |
| GOLD ID | Gp0131589 | |
| BIOPROJECT | PRJNA354524 | |
| MIGS 13 | Source Material Identifier | NA |
| Project relevance | Environmental |
Summary of genome of “T. syntrophicum” sp. nov. strain Cad16T: one circular chromosome and two circular plasmids
| Label | Size (Mb) | Topology | INSDC identifier | RefSeq ID |
|---|---|---|---|---|
| Chromosome | 6.84 | Circular | CP020370 | NA |
| pTs417 | 0.42 | Circular | CP020371 | NA |
| pTs485 | 0.49 | Circular | CP020372 | NA |
Genome statistics for the “T. syntrophicum” sp. nov. strain Cad16T genome
| Attribute | Value | % of Total |
|---|---|---|
| Genome size (bp) | 7,738,984 | 100.00 |
| DNA coding (bp) | 6,663,511 | 86.10 |
| DNA G + C (bp) | 5,124,386 | 66.22 |
| DNA scaffolds | 3 | 100.00 |
| Total genes | 6601 | 100.00 |
| Protein coding genes | 6237 | 94.49 |
| RNA genes | 59 | 0.89 |
| rRNA genes | 6 | 0.09 |
| tRNA genes | 49 | 0.74 |
| ncRNA genes | 4 | 0.06 |
| Pseudo genes | 305 | 4.62 |
| Genes in internal clusters | NA | NA |
| Genes with function prediction | 2737 | 41.46 |
| Genes assigned to COGs | 3157 | 47.83 |
| Genes with Pfam domains | 4675 | 70.82 |
| Genes with signal peptides | 436 | 6.61 |
| Genes with transmembrane helices | 1185 | 17.95 |
| CRISPR repeats | 5 | – |
NA not applicable
Number of genes associated with general COG functional categories of the genome of “T. syntrophicum” sp. nov. strain Cad16T. The percentage is set relative to the total number of protein coding genes in the genome
| Code | Value | % age | Description |
|---|---|---|---|
|
| 210 | 3.37 | Translation, ribosomal structure and biogenesis |
|
| 1 | 0.02 | RNA processing and modification |
|
| 144 | 2.31 | Transcription |
|
| 276 | 4.43 | Replication, recombination and repair |
|
| 0 | 0.00 | Chromatin structure and dynamics |
|
| 42 | 0.67 | Cell cycle control, Cell division, chromosome partitioning |
|
| 0 | 0.00 | Nuclear structure |
|
| 186 | 2.98 | Defense mechanisms |
|
| 297 | 4.76 | Signal transduction mechanisms |
|
| 274 | 4.39 | Cell wall/membrane biogenesis |
|
| 10 | 0.16 | Cell motility |
|
| 0 | 0.00 | Cytoskeleton |
|
| 0 | 0.00 | Extracellular Structures |
|
| 62 | 0.99 | Intracellular trafficking and secretion |
|
| 226 | 3.62 | Posttranslational modification, protein turnover, chaperones |
|
| 97 | 1.56 | Energy production and conversion |
|
| 245 | 3.93 | Energy production and conversion |
|
| 122 | 1.96 | Carbohydrate transport and metabolism |
|
| 167 | 2.68 | Amino acid transport and metabolism |
|
| 49 | 0.79 | Nucleotide transport and metabolism |
|
| 135 | 2.16 | Coenzyme transport and metabolism |
|
| 93 | 1.49 | Lipid transport and metabolism |
|
| 184 | 2.95 | Inorganic ion transport and metabolism |
|
| 29 | 0.46 | Secondary metabolites biosynthesis, transport and catabolism |
|
| 308 | 4.94 | General function prediction only |
|
| 1522 | 24.40 | Function unknown |
| No COG | 1543 | 24.74 | Not in COGs |
| Multi COG | 320 | 5.13 | Multiple COG assignments |
CRISPR-Cas loci detected in “T. syntrophicum” sp. nov. strain Cad16T genome
| Localization | Name | CRISPR start | CRISPR end | CRISPR length (bp) | DR consensus | DR length | No. of spacers | CRISPR-Cas locia |
|---|---|---|---|---|---|---|---|---|
| chromosome | CRR1CRR1 | 1,879,131 | 1,881,639 | 2508 | GCTTCAATGAGGCCGCGGCGAATTCGCCGCGGAAAC | 36 | 34 | type I-U |
| CRR2CRR2 | 1,883,646 | 1,894,325 | 10,679 | GCTTCAATGAGGCCGCGGCGAATTCGCCGCGGAAAC | 36 | 146 | ||
| CRR3CRR3 | 4,626,522 | 4,629,249 | 2727 | GCATCGCCCGGCCAATTGGCCGGGCGCGGATTGAAAC | 37 | 37 | type I-C | |
| CRR4CRR4 | 5,078,034 | 5,085,199 | 7165 | GTTTCCGCGGCGAATTCGCCGCGGCCTCATTGAAGC | 36 | 98 | – | |
| pTs485 | CRR5CRR5 | 391,741 | 393,104 | 1363 | GTAGCGCTACTCCGAGCCGCAAAGGCTATTGAAAC | 35 | 19 | – |
a CRISPR-Cas classification according to Makarova et al. [58]