| Literature DB >> 26473841 |
Qinglong Ding1, Gu Chen2, Yuling Wang3, Dong Wei4.
Abstract
Cyanobacterium Synechocystis sp. PCC 6803 is a widely used model organism in basic research and biofuel biotechnology application. Here, we report the genomic sequence of chromosome and seven plasmids of a glucose-tolerant, non-motile strain originated from ATCC 27184, GT-G, in use at Guangzhou. Through high-throughput genome re-sequencing and verification by Sanger sequencing, eight novel variants were identified in its chromosome and plasmids. The eight novel variants, especially the five non-silent mutations might have interesting effects on the phenotype of GT-G strains, for example the truncated Sll1895 and Slr0322 protein. These resequencing data provide background information for further research and application based on the GT-G strain and also provide evidence to study the evolution and divergence of Synechocystis 6803 globally.Entities:
Keywords: Synechocystis sp. PCC 6803; deletion; insertion; single nucleotide polymorphism; whole genome resequencing
Mesh:
Substances:
Year: 2015 PMID: 26473841 PMCID: PMC4632739 DOI: 10.3390/ijms161024081
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Location and effects of SNPs and indels identified in GT-G compared with the nucleotide sequence of GT-Kazusa in the database. Specific variants identified in GT-G are highlighted in bold. The errors of database are in grey. SNP, insertions, deletions and intergenic region are labeled as S, I, D and IGR respectively.
| Event | Effect | Locus | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| # | M | Start | End | Size | Nucl Change | Ref→mut | AA Change | Result | Locus | Gene Name | Product |
| 1 | I | 386410 | 386411 | 102 | - | - | - | 34 additional AAs | slr1084 | - | Hypothetical protein |
| 2 | I | 781625 | 781626 | 154 | - | - | - | 5'extension of reading frame | IGR slr2030-slr2031 | - | - |
| 3 | S | 831647 | 831647 | 1 | C→T | - | - | Possible effect on infA promoter | IGR adk-infA | - | - |
| 4 | S | 943495 | 943495 | 1 | G→A | GTC→ATC | V→I | AA 'change | slr1834 | P700 apoprotein subunit Ia | |
| 5 | S | 1012958 | 1012958 | 1 | G→T | - | - | - | IGR ssl3177-sll1633 | - | - |
| 6 | D | 1200294 | 1201476 | 1183 | - | - | - | sll1780 | - | Transposase | |
| 7 | S | 1364187 | 1364187 | 1 | A→G | TTG→CTG | L→L | –silent- | sll0838 | Orotidine 5' monophosphate decarboxylase | |
| * | |||||||||||
| 9 | S | 1819782 | 1819782 | 1 | A→G | TCT→TCC | S→S | –silent- | sll1867 | Photosystem II D1 protein | |
| 10 | S | 1819788 | 1819788 | 1 | A→G | CTT→CTC | L→L | –silent- | sll1867 | Photosystem II D1 protein | |
| 11 | D | 2048412 | 2049594 | 1183 | - | - | - | slr1635 | - | Transposase | |
| 12 | S | 2092571 | 2092571 | 1 | A→T | TTA→TAA | L→* | New stop codon | sll0422 | - | Asparaginase |
| 13 | S | 2198893 | 2198893 | 1 | T→C | TTA→TTG | L→L | –silent- | sll0142 | - | Probable cation efflux system protein |
| 14 | D | 2204584 | 2204584 | 1 | G→* | GGT→GTT | G→V | Frameshift | slr0162 | A part of pilC, pilin biogenesis protein, required for twitching motility | |
| 18 | S | 2301721 | 2301721 | 1 | A→G | AAG→GAG | K→E | AA change | slr0168 | - | Hypothetical protein, no conserved domains |
| 19 | I | 2350285 | 2350286 | 1 | *→A | - | - | - | IGR sml0001-slr0363 | - | - |
| 20 | I | 2360245 | 2360246 | 1 | *→C | GCG→GCC | A→A | Frameshift | slr0364/slr0366 | - | Hypothetical protein, no conserved domains |
| 21 | D | 2409244 | 2409244 | 1 | C→* | GGA→GAT | G→D | Frameshift | sll0762 | - | Hypothetical protein, no conserved domains |
| 22 | D | 2419399 | 2419399 | 1 | T→* | AAT→ATG | N→M | Frameshift | sll0751(ycf22);sll0752 | Hypothetical protein YCF22 | |
| 23 | I | 2544044 | 2544045 | 1 | *→C | AGG→GAG | R→E | Frameshift | ssl0787/ssl0788 | - | Hypothetical protein, no conserved domains |
| 24 | S | 2602717 | 2602717 | 1 | C→A | CAC→CAA | H→Q | AA change | slr0468 | - | Hypothetical protein, no conserved domains |
| 25 | S | 2602734 | 2602734 | 1 | T→A | ATT→AAT | I→N | AA change | slr0468 | - | Hypothetical protein, no conserved domains |
| 26 | S | 2748897 | 2748897 | 1 | C→T | - | - | - | IGR slr0210-ssr0332 | - | - |
| 27 | S | 3014665 | 3014665 | 1 | T→C | ACT→ACC | T→T | –silent- | slr0302 | PleD-like protein | |
| 28 | S | 3096187 | 3096187 | 1 | T→C | ATA→ACA | I→T | AA change | ssr1175(transposase) | - | Located in a mobile element(ISY100v1) |
| 29 | S | 3098707 | 3098707 | 1 | T→C | TGT→CGT | C→R | AA change | ssr1176(transposase) | - | Located in a mobile element(ISY100v3) |
| 30 | S | 3110189 | 3110189 | 1 | G→A | - | - | - | IGR sll0665-sll0666 | - | Located in a mobile element(ISY523) |
| 31 | S | 3110343 | 3110343 | 1 | G→T | CCA→CAA | P→Q | AA change | sll0665 | - | Transposase |
| 32 | S | 3142651 | 3142651 | 1 | A→G | CTT→CTC | L→L | –silent- | sll0045 | Sucrose phosphate synthase | |
| 33 | D | 3260096 | 3260096 | 1 | C→* | - | - | - | IGR sll0528-sll0529 | - | - |
| 34 | D | 3400331 | 3401513 | 1183 | - | - | - | sll1474 | - | Transposase | |
| 35 | D | 117269 | 118451 | 1183 | - | - | - | sll5131 | - | Transposase | |
| 39 | S | 82405 | 82405 | 1 | A→G | AAC→AGC | N→S | AA change | ssr6089 | Hypothetical protein, no conserved domains | |
Comparison of SNPs and indels identified in GT-G with sequences from other reported strains. The errors of database are in grey. NIa: not investigated.
| Variation | Strains Reported in Literatures and This Work | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| # | Event | GT-Kazusa [ | GT-S [ | GT-I [ | GT-O1 [ | GT-O2 [ | GT-G | PCC-P [ | PCC-N [ | PCC-M [ |
| 1 | I | - | - | √ | √ | √ | √ | √ | √ | √ |
| 2 | I | - | - | - | - | - | √ | √ | √ | √ |
| 3 | S | - | - | - | - | - | √ | √ | √ | √ |
| 4 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 5 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 6 | D | - | √ | √ | √ | √ | √ | √ | √ | √ |
| 7 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 8 | I | - | - | - | - | - | √ | - | - | - |
| 9 | S | √ | √ | - | √ | √ | √ | - | - | - |
| 10 | S | √ | √ | - | √ | √ | √ | - | - | - |
| 11 | D | - | - | √ | √ | √ | √ | √ | √ | √ |
| 12 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 13 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 14 | D | - | √ | √ | √ | √ | √ | √ | √ | √ |
| 15 | S | - | - | - | - | - | √ | - | - | - |
| 16 | S | - | - | - | - | - | √ | - | - | - |
| 17 | D | - | - | - | - | - | √ | - | - | - |
| 18 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 19 | I | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 20 | I | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 21 | D | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 22 | D | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 23 | I | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 24 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 25 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 26 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 27 | S | - | - | - | - | - | √ | √ | √ | √ |
| 28 | S | √ | √ | √ | √ | √ | √ | √ | √ | - |
| 29 | S | - | - | - | - | - | √ | √ | √ | √ |
| 30 | S | √ | √ | - | √ | √ | √ | - | - | √ |
| 31 | S | √ | √ | - | √ | √ | √ | - | - | - |
| 32 | S | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 33 | D | √ | √ | √ | √ | √ | √ | √ | √ | √ |
| 34 | D | - | √ | √ | √ | √ | √ | √ | √ | √ |
| 35 | D | NIa | NI | NI | - | - | √ | NI | NI | √ |
| 36 | S | NI | NI | NI | - | - | √ | NI | NI | - |
| 37 | S | NI | NI | NI | - | - | √ | NI | NI | - |
| 38 | S | NI | NI | NI | - | - | √ | NI | NI | - |
| 39 | S | NI | NI | NI | √ | √ | √ | NI | NI | - |
| 40 | D | NI | NI | NI | - | - | √ | NI | NI | - |
Figure 1Domain organization of mutated genes coding protein in GT-Kazusa and GT-G. The black box indicates the conserved motif. (a) Domain organization of Sll1895. FHA: Forkhead-associated domain for phosphopeptide recognition; GGDEF: diguanylate cyclase domain; EAL: candidate for a diguanylate phosphodiesterase function. Sll1895 in GT-G loses EAL domain and part of the GGDEF domain; (b) Domain organization of Slr0322. The ATPase domain and CheW like domain are lost in GT-G, which is indicated by the dash lines. The white star indicates the P280T residue change in the histidine kinase domain in GT-G.
Figure 2Surface structure and motility of GT-G. (a,b) Electron micrograph of negatively stained GT-G cells; (c) Phototactic movement of colonies of GT-G. The arrow indicated the direction of lateral light and the black line under the colony shows the initial position before lateral illumination.
Figure 3Visualization of phylogenetic relationships among various sequenced stains of Synechocystis sp. PCC 6803. The presence of identified SNPs and indels are indicated along the branches. The variants in plasmid are in italic. Putative insertions, deletions, and intergenic regions are labeled as Ins, Del, and IGR respectively. Modified after Trautmann et al., Kanesaki et al. and Morris et al. [11,12,13].