| Literature DB >> 35475644 |
Marie Adomako1, Dustin Ernst1, Ryan Simkovsky1, Yi-Yun Chao1, Jingtong Wang1, Mingxu Fang1,2, Christiane Bouchier3, Rocio Lopez-Igual4,5, Didier Mazel4, Muriel Gugger6, Susan S Golden1,2.
Abstract
Strains of the freshwater cyanobacterium Synechococcus elongatus were first isolated approximately 60 years ago, and PCC 7942 is well established as a model for photosynthesis, circadian biology, and biotechnology research. The recent isolation of UTEX 3055 and subsequent discoveries in biofilm and phototaxis phenotypes suggest that lab strains of S. elongatus are highly domesticated. We performed a comprehensive genome comparison among the available genomes of S. elongatus and sequenced two additional laboratory strains to trace the loss of native phenotypes from the standard lab strains and determine the genetic basis of useful phenotypes. The genome comparison analysis provides a pangenome description of S. elongatus, as well as correction of extensive errors in the published sequence for the type strain PCC 6301. The comparison of gene sets and single nucleotide polymorphisms (SNPs) among strains clarifies strain isolation histories and, together with large-scale genome differences, supports a hypothesis of laboratory domestication. Prophage genes in laboratory strains, but not UTEX 3055, affect pigmentation, while unique genes in UTEX 3055 are necessary for phototaxis. The genomic differences identified in this study include previously reported SNPs that are, in reality, sequencing errors, as well as SNPs and genome differences that have phenotypic consequences. One SNP in the circadian response regulator rpaA that has caused confusion is clarified here as belonging to an aberrant clone of PCC 7942, used for the published genome sequence, that has confounded the interpretation of circadian fitness research. IMPORTANCE Synechococcus elongatus is a versatile and robust model cyanobacterium for photosynthetic metabolism and circadian biology research, with utility as a biological production platform. We compared the genomes of closely related S. elongatus strains to create a pangenome annotation to aid gene discovery for novel phenotypes. The comparative genomic analysis revealed the need for a new sequence of the species type strain PCC 6301 and includes two new sequences for S. elongatus strains PCC 6311 and PCC 7943. The genomic comparison revealed a pattern of early laboratory domestication of strains, clarifies the relationship between the strains PCC 6301 and UTEX 2973, and showed that differences in large prophage regions, operons, and even single nucleotides have effects on phenotypes as wide-ranging as pigmentation, phototaxis, and circadian gene expression.Entities:
Keywords: Synechococcus; biofilm; circadian rhythms; comparative genomics; cyanobacteria; phototaxis
Mesh:
Substances:
Year: 2022 PMID: 35475644 PMCID: PMC9239245 DOI: 10.1128/mbio.00862-22
Source DB: PubMed Journal: mBio Impact factor: 7.786
FIG 1The legacy strains and UTEX 3055 share high average nucleotide identity (A), cluster in a monophyletic group separately from other Prochlorococcus and Synechococcus species (see expanded phylogenetic tree in Fig. S1 in the supplemental material) (B), and have unique combinations of phenotypes (gray checkmark represents presumed circadian rhythms in UTEX 2973; the strain has never been explicitly tested for rhythmic gene expression) (C).
FIG 2(A) Alignment of PCC 7942 (representing legacy strains) and UTEX 3055 chromosomes showing gaps, inversions, and regions of high variability illustrated by SNP density per 1,000 bp. (B) Alignment of UTEX 3055 pMAL and pANL of legacy strains; repeated regions of homology and plasmid maintenance genes are highlighted. (C and D) Gene maps of the small plasmids UTEX 3055 pMAS (C) and legacy strain pNAS (D).
FIG 3Activation of a theophylline riboswitch driving overexpression of Synpcc7942_0766 induces a loss of culture density (A) and visible lysis (B) by day 3 postinduction. (C) Excision and circularization of the prophage genome following theophylline induction was observed through whole-cell PCR using primer sets with annealing sites just within and outside the prophage region. Cell lysis in induced cultures resulted in faint PCR bands from day 3 samples.
FIG 4Prophage genes control pigmentation in PCC 7942. (A) Absorbance readings of spot cultures show that PCC 7942 loses phycocyanin and chlorophyll pigments over time compared to the phageless strain D1K3. (B) Loss of either Synpcc7942_0759 or Synpcc7942_0760 in section 7 of the prophage leads to a dark pigmentation phenotype similar to the phageless strain. (C) Expression of both Synpcc7942_0759 and Synpcc7942 _0760 together, but not Synpcc7942_0759 alone in the phageless strain, led to WT light pigmentation.
FIG 5Replacement of UTEX3055_pg2263-pg2266 with a kanamycin resistance cassette (aphI) leads to a loss of phototaxis. The site of Tn5 insertion in the initial phototaxis mutant is marked by an asterisk. Complementation with the full four-gene operon restores phototaxis. Introduction of UTEX3055_pg2263-2266 is not sufficient to restore a phototaxis phenotype to PCC 7942.
SNPs relative to PCC 7942 that lead to amino acid changes
See Data Set S2 for all SNPs and indels. SNPs shared among several genomes are in light green; SNPs in only one strain are yellow. Two SNPs of the published PCC 7942 sequence (in bold) result in a mutant allele (rpaA) and a likely suppressor mutation (aas).
FIG 6(A) Reconstruction of the crm1 mutant leads to arrhythmic and rhythmic populations due to the presence (clones represented in gray) or absence (clones represented in magenta) of the RpaA-Q121 allele, respectively. (B) The conserved amino acid at position 121 of RpaA in cyanobacteria is arginine. An RpaA-L4-Q121 suppressor mutation restores LD fitness (C) and rhythmic gene expression (D). Dilution series of strains were grown in constant light (LL) or in 12-h light/dark cycles (12:12 LD) for 48 h to assess LD fitness. Bioluminescence from strains carrying a P reporter at NS2 was recorded as an assay for circadian rhythms of gene expression. LL, constant light after entrainment.