| Literature DB >> 23069868 |
Danika Trautmann1, Björn Voss, Annegret Wilde, Salim Al-Babili, Wolfgang R Hess.
Abstract
Synechocystis sp. PCC 6803 is a widely used model cyanobacterium for studying photosynthesis, phototaxis, the production of biofuels and many other aspects. Here we present a re-sequencing study of the genome and seven plasmids of one of the most widely used Synechocystis sp. PCC 6803 substrains, the glucose tolerant and motile Moscow or 'PCC-M' strain, revealing considerable evidence for recent microevolution. Seven single nucleotide polymorphisms (SNPs) specifically shared between 'PCC-M' and the 'PCC-N and PCC-P' substrains indicate that 'PCC-M' belongs to the 'PCC' group of motile strains. The identified indels and SNPs in 'PCC-M' are likely to affect glucose tolerance, motility, phage resistance, certain stress responses as well as functions in the primary metabolism, potentially relevant for the synthesis of alkanes. Three SNPs in intergenic regions could affect the promoter activities of two protein-coding genes and one cis-antisense RNA. Two deletions in 'PCC-M' affect parts of clustered regularly interspaced short palindrome repeats-associated spacer-repeat regions on plasmid pSYSA, in one case by an unusual recombination between spacer sequences.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23069868 PMCID: PMC3514855 DOI: 10.1093/dnares/dss024
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Location and effects of SNPs and indels found in ‘PCC-M’ compared with the nucleotide sequence of ‘GT-Kazusa’ in the database
| Event | Effect | Locus | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| # | M | Start | End | Size | Nucl change | Ref →mut | AA change | Result | Locus | Gene name | Product |
| Chromosome | |||||||||||
| 1 | S | 144 507 | 14 4507 | 1 | A → G | GTA → GTG | V → V | - silent - | slr0242 | bcp | Bacterioferritin comigratory protein homolog |
| 2 | I | 386 410 | 386 411 | 102 | 34 additional AAs | slr1084 | Hypothetical protein | ||||
| 3 | S | 489 109 | 489 109 | 1 | T → C | TTA → TCA | L → S | AA change | slr1609 | Acyl-ACP synthetase (AAS) | |
| 4 | D | 527 395 | 527 994 | 600 | a | slr1753 | Hypothetical protein | ||||
| 5 | D | 731 367 | 731 367 | 1 | A → * | AAT → ATT | N → I | Frameshift | sll1574 | spkA | Part of SpkA, cellular motility regulator |
| 6 | I | 781 625 | 781 626 | 154 | 5′ extension of reading frame | ||||||
| 7 | S | 831 647 | 831 647 | 1 | C → T | Possible effect on infA promoter | |||||
| 8 | S | 848 078 | 848 078 | 1 | G → A | AGC → AAC | S → N | AA change | slr1898 | argB | |
| 9 | S | 943 495 | 943 495 | 1 | G → A | GTC → ATC | V → I | AA changea | slr1834 | psaA | P700 apoprotein subunit Ia |
| 10 | S | 1 012 958 | 1 012 958 | 1 | G → T | a | |||||
| 11 | S | 1 070 839 | 1 070 839 | 1 | T → A | AAT → AAA | N → K | AA change | sll1359 | Predicted cytochrome | |
| 12 | D | 1 200 290 | 1 201 474 | 1185 | sll1780 (transposase); slr1862/3 | Hypothetical protein | |||||
| 13 | S | 1 204 616 | 1 204 616 | 1 | G → A | TGT → TAT | C → Y | AA change | slr1865 | Hypothetical protein | |
| 14 | S | 1 364 187 | 1 364 187 | 1 | T → C | TTG → CTG | L → L | - silent -a | sll0838 | pyrF | Orotidine 5′ monophosphate decarboxylase |
| 15 | D | 1 423 340 | 1 423 340 | 1 | A → * | GAC → GCA | D → A | Frameshift, protein truncated | sll1951 | hlyA | Haemolysin |
| 16 | S | 1 812 419 | 1 812 419 | 1 | C → T | GCG → GTG | A → V | AA change | slr1983 | Two-component hybrid sensor and regulator | |
| 17 | D | 2 048 403 | 2 049 585 | 1183 | slr1635 (transposase); slr1636 | hypothetical protein | |||||
| 18 | S | 2 092 571 | 2 092 571 | 1 | T → A | TTA → TAA | L → * | AA change, new stop codona | sll0422 | Asparaginase | |
| 19 | S | 2 198 893 | 2 198 893 | 1 | A → G | TTA → TTG | L → L | - silent -a | sll0142 | Probable cation efflux system protein | |
| 20 | D | 2 204 584 | 2 204 584 | 1 | G → * | GGT → GTT | G → V | Frameshift | slr0162 | pilC | Part of PilC, pilin biogenesis protein, twitching motility |
| 21 | S | 2 301 721 | 2 301 721 | 1 | A → G | AAG → GAG | K → E | AA changea | slr0168 | Hypothetical protein, no conserved domains | |
| 22 | I | 2 350 285 | 2 350 286 | 1 | * → A | a | |||||
| 23 | I | 2 360 245 | 2 360 246 | 1 | * → C | GCG → GCC | A → A | Frameshifta | slr0364/slr0366 | Hypothetical protein, no conserved domains | |
| 24 | S | 2 400 722 | 2 400 722 | 1 | C → A | Possible effect on glcP promoter | |||||
| 25 | D | 2 409 244 | 2 409 244 | 1 | G → * | GGA → GAT | G → D | Frameshifta | sll0762 | Hypothetical protein, no conserved domains | |
| 26 | D | 2 419 399 | 2 419 399 | 1 | A → * | AAT → ATG | N → M | Frameshifta | sll0751 (ycf22); sll0752 | ycf22 | Hypothetical protein YCF22 |
| 27 | S | 2 521 013 | 2 521 013 | 1 | T → C | TTT → TCT | F → S | AA change | slr0222 | hik25 | Two-component hybrid sensor and regulator |
| 28 | I | 2 544 045 | 2 544 046 | 1 | * → G | AGG → GAG | R → E | Frameshifta | ssl0787/ssl0788 | Hypothetical protein, no conserved domains | |
| 29 | S | 2 602 717 | 2 602 717 | 1 | C → A | CAC → CAA | H → Q | AA change a | slr0468 | Hypothetical protein, no conserved domains | |
| 30 | S | 2 602 734 | 2 602 734 | 1 | T → A | ATT → AAT | I → N | AA change a | slr0468 | Hypothetical protein, no conserved domains | |
| 31 | S | 2 748 897 | 2 748 897 | 1 | C → T | a | |||||
| 32 | S | 3 014 665 | 3 014 665 | 1 | T → C | ACT → ACC | T → T | - silent - | slr0302 | pleD | PleD-like protein |
| 33 | S | 3 098 707 | 3 098 707 | 1 | T → C | TGT → CGT | C → R | AA change | ssr1176 (transposase) | Located in a mobile element (ISY100v3) | |
| 34 | S | 3 110 189 | 3 110 189 | 1 | G → A | Located in a mobile element (ISY523) | |||||
| 35 | S | 3 142 651 | 3 142 651 | 1 | T → C | CTT → CTC | L → L | - silent -a | sll0045 | spsA | Sucrose phosphate synthase |
| 36 | I | 3 194 022 | 3 194 023 | 1 | * → A | Possible effect on slr0534_as3 promoter | |||||
| 37 | D | 3 260 096 | 3 260 096 | 1 | C → * | ||||||
| 38 | D | 3 364 288 | 3 364 288 | 1 | A → * | ATT → TTG | I → L | Frameshift | sll1496 | Mannose-1-phosphate guanyltransferase | |
| 39 | S | 3 371 938 | 3 371 938 | 1 | T → A | ATG → AAG | M → K | AA change | slr1564 | sigF | Group 3 RNA polymerase sigma factor |
| 40 | D | 3 400 331 | 3 401 513 | 1183 | sll1474 (transposase) | ccaS | sll1473/5 = His-Kinase/ATPase | ||||
| 41 | S | 3 423 372 | 3 423 372 | 1 | C → T | CCC → CTC | P → L | AA change | slr0753 | Probable transport protein (anion permease) | |
| Plasmid pCA2.4 | |||||||||||
| 42 | S | 1127 | 1127 | 1 | T → G | CGT → CGG | R → R | - silent -a | repA | Replication initiation factor | |
| Plasmid pSYSA | |||||||||||
| 43 | D | 17 343 | 19 741 | 2399 | ssl7018/19/20/21 [CRISPR 1] | ||||||
| 44 | D | 71 558 | 71 596 | 159 | new CRISPR spacer | CRISPR 2 | |||||
| Plasmid pSYSM | |||||||||||
| 45 | D | 117 269 | 118 451 | 1183 | sll5130/32 | Hypothetical protein | |||||
The events are numbered (column #), the type of mutation (M) is indicated as S, substitution, D, deletion or I, insertion, together with the respective start and end positions in the ‘GT-Kazusa’ reference sequence. For each event the respective nucleotide change is indicated on the forward strand, together with the resulting codon modification (Ref. → Mut) and amino acid change, if any. Highlighted in italics are four instances of missing ISY203 copies and in bold all SNPs affecting intergenic spacer regions (IGR).
aIndicate errors in the database.
Comparison of SNPs and indels found in the chromosome of ‘PCC-M’ with sequences from other substrains
| Event | Comparison of strains: literature + this work | ||||||
|---|---|---|---|---|---|---|---|
| # | Event | GT-Kazusa[ | GT-S[ | GT-I[ | PCC-P[ | PCC-N[ | PCC-M |
| 1 | S | — | — | — | — | — | √ |
| 2 | I | — | — | √ | √ | √ | √ |
| 3 | S | — | — | — | — | — | √ |
| 4 | D | √a | — | — | — | — | √ |
| 5 | D | — | — | — | √ | √ | √ |
| 6 | I | — | — | — | √ | √ | √ |
| 7 | S | — | — | — | — | — | √ |
| 8 | S | — | — | — | — | — | √ |
| 9 | S | √ | √ | √ | √ | √ | √ |
| 10 | S | √ | √ | √ | √ | √ | √ |
| 11 | S | — | — | — | — | — | √ |
| 12 | D | — | √ | √ | √ | √ | √ |
| 13 | S | — | — | — | √ | √ | √ |
| 14 | S | √ | √ | √ | √ | √ | √ |
| 15 | D | — | — | — | √ | √ | √ |
| 16 | S | — | — | — | √ | √ | √ |
| 17 | D | — | — | √ | √ | √ | √ |
| 18 | S | √ | √ | √ | √ | √ | √ |
| 19 | S | √ | √ | √ | √ | √ | √ |
| 20 | D | — | √ | √ | √ | √ | √ |
| 21 | S | √ | √ | √ | √ | √ | √ |
| 22 | I | √ | √ | √ | √ | √ | √ |
| 23 | I | √ | √ | √ | √ | √ | √ |
| 24 | S | — | — | — | — | — | √ |
| 25 | D | √ | √ | √ | √ | √ | √ |
| 26 | D | √ | √ | √ | √ | √ | √ |
| 27 | S | — | — | — | √ | √ | √ |
| 28 | I | √ | √ | √ | √ | √ | √ |
| 29 | S | √ | √ | √ | √ | √ | √ |
| 30 | S | √ | √ | √ | √ | √ | √ |
| 31 | S | √ | √ | √ | √ | √ | √ |
| 32 | S | — | — | — | √ | √ | √ |
| S | — | — | — | √ | √ | √ | |
| 34 | S | — | √ | — | — | — | √ |
| 35 | √ | √ | √ | √ | √ | √ | |
| 36 | I | — | — | — | — | — | √ |
| 37 | D | — | √ | √ | √ | √ | √ |
| 38 | D | — | — | — | — | — | √ |
| 39 | S | — | — | — | — | — | √ |
| 40 | D | — | √ | √ | √ | √ | √ |
| 41 | S | — | — | — | — | — | √ |
All events are numbered (column #) as in Table 1. The presence of the respective ‘PCC-M’ mutation in the different substrains is indicated by the check marks.
aThe deletion of 0.6 kb in the gene slr1753 compared with the reference was also verified here in ‘GT-Kazusa’.
Figure 1.Alignment of the possible indel region in gene slr1753. The sequence obtained in the verification experiment is aligned with that of the ‘GT-Kazusa’ reference. Two types of DNA repeats are indicated by the filled and non-filled lozenges.
Figure 2.Non-homologous recombination in the plasmid pSYSA affecting spacers 40, 41 and 42 of CRISPR2. As a result of the 159-bp deletion in ‘PCC-M’ compared with ‘GT-Kazusa’, a novel hybrid spacer 40 was generated. The direct repeats are presented as squares and the nucleotide positions in the ‘GT-Kazusa’ are given according to the GenBank file NC_005230.
Figure 3.Visualization of phylogenetic relationships between various strains of Synechocystis sp. PCC 6803. The occurrence of the identified SNPs and other known events are indicated along the branches. The eight events separating the ‘GT’ and ‘PCC’ strains from each other are given at the branch point where these two lineages split or on the respective branches where they occurred. Putative insertions and deletions are labelled ‘Ins’. and ‘Del’., respectively.