| Literature DB >> 27894276 |
Desiree Baumgartner1, Matthias Kopf1,2, Stephan Klähn1, Claudia Steglich1, Wolfgang R Hess3.
Abstract
BACKGROUND: Despite their versatile functions in multimeric protein complexes, in the modification of enzymatic activities, intercellular communication or regulatory processes, proteins shorter than 80 amino acids (μ-proteins) are a systematically underestimated class of gene products in bacteria. Photosynthetic cyanobacteria provide a paradigm for small protein functions due to extensive work on the photosynthetic apparatus that led to the functional characterization of 19 small proteins of less than 50 amino acids. In analogy, previously unstudied small ORFs with similar degrees of conservation might encode small proteins of high relevance also in other functional contexts.Entities:
Keywords: Cyanobacteria; Nitrogen deprivation; Photosynthesis; Small proteins; Synechocystis
Mesh:
Substances:
Year: 2016 PMID: 27894276 PMCID: PMC5126843 DOI: 10.1186/s12866-016-0896-z
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
List of oligonucleotides
| Name of Oligonucleotide | Sequence (in 5′ – 3′ direction) | Application |
|---|---|---|
| Probe_norf1_fw | G | Amplification of probe template |
| Probe_norf1_rev | TTGAGATGCTACAGGACCTTATGC | |
| Pnorf1_fw | taccggtGCCTAGGGGATACCTCTCCCC | Amplification of putative |
| Pnorf1_rev | tggccggcCTCCGTCCCAATGGGGGAAAC | |
| nsiR6_probe_fw | G | Amplification of probe template |
| nsiR6_probe_rev | TGTGTGGCGTCACCATTGAAAATG | |
| hliR1_probe_fw | G | Amplification of probe template |
| hliR1_probe_rev | ATGTCTAATTTGATTGCTGTTGCTTTC | |
| norf4_probe_fw | G | Amplification of probe template |
| norf4_probe_rev | ATGACCGCCGATCAACTGTTG | |
| nsiR6_fw | gccaagaagtATGAGTGTTTTCCCCGCAGA | Amplification of |
| nsiR6_rev | tataatccatGTCGTAATAATCCCGGCTGG | |
| PpetE::hliR1_fw | gccaagaagtATGTCTAATTTGATTGCTGTTG | Amplification of |
| 3xFlag_hliR1_rev | tataatccatCTCGGGAAGATTAAGACTGG | |
| PpetE::ssr1169_fw | gccaagaagtATGGATATTGTTAAGATCATTTGTGCGATTC | Amplification of |
| 3xFlag_ssr1169_rev | tataatccatACGTTCCCTGGCAATGACCC | |
| PpetE::Norf4_fw | gccaagaagtATGACCGCCGATCAACTGTT | Amplification of |
| 3xFlag_Norf4_rev | tataatccatACCCCCTTTAGCAAAACTAC | |
| pUC19-XbaI_PpetE_fw | gctcggtacccggggatcc | Amplification of P |
| nsiR6::PpetE_rev | aaacactcatACTTCTTGGCGATTGTATCTATAGG | |
| hliR1::PpetE_rev | aattagacatACTTCTTGGCGATTGTATCTATAGG | |
| ssr1169::PpetE_rev | caatatccatACTTCTTGGCGATTGTATCTATAGG | |
| Norf4::PpetE_rev | gatcggcggtcatACTTCTTGGCGATTGTATCTATAGG | |
| 3xFlag_PstI-pUC19_rev | gccaagcttgcatgc | Amplification of 3xFLAG::3′UTR |
| nsiR6_3xFlag_fw | ttattacgacATGGATTATAAAGATCATGATGGCGATTATAAAG | |
| hliR1_3xFlag_fw | tcttcccgagATGGATTATAAAGATCATGATGGCGATTATAAAG | |
| ssr1169_3xFlag_fw | cagggaacgtATGGATTATAAAGATCATGATGGCGATTATAAAG | |
| Norf4_3xFlag_fw | taaagggggtATGGATTATAAAGATCATGATGGCGATTATAAAG | |
| CDSnsiR6::PnsiR6_fw | ataaatactcATGAGTGTTTTCCCCGCAGAAAC | |
| CDShliR1::PhliR1_fw | aaattaactaaATGTCTAATTTGATTGCTGTTGCTTTCTG | |
| CDSNorf4::PNorf4_fw | aatttttaccATGACCGCCGATCAACTGTTG | |
| CDSssr1169::Pssr1169_fw | gagtgaactaATGGATATTGTTAAGATCATTTGTGCGATTC | |
| pUC19::PnsiR6_fw | gctcggtacccggggatcc | Amplification of P |
| PnsiR6::CDSnsiR6_rev | aaacactcatGAGTATTTATTCCTAGTGAATGAATTAGAAG | |
| pUC19::PhliR1_fw | gctcggtacccggggatcc | Amplification of P |
| PhliR::CDShliR1_rev | aattagacatTTAGTTAATTTTTGTAACGGGAG | |
| pUC19::PNorf4_fw | gctcggtacccggggatcc | Amplification of P |
| PNorf4::CDSNorf4_rev | gatcggcggtcatGGTAAAAATTCCACTAATTCAAAAAAC | |
| pUC19::Pssr1169_fw | gctcggtacccggggatcc | Amplification of P |
| Pssr1169::CDSssr1169_rev | caatatccatTAGTTCACTCCAATATGTCGGGATAATTAG |
Fig. 1Scheme of computational prediction of small ORFs in cyanobacteria. Small ORFs were detected based on the transcriptome information for expressed intergenic regions [20–22] and their coding potential evaluated with RNAcode [40]. This information was merged with the pre-existing annotation [32]. Orthologs between the two small ORF populations were detected when they were identified as reciprocal best hits (RBH) by blastP with e ≤ 1e−2
Predicted and previously annotated proteins ≤50 amino acids in Synechocystis 6803
| Start | End | S | L | Locus_tag | Gene | Product | 6714 | A. th. | E. coli |
|---|---|---|---|---|---|---|---|---|---|
| 608828 | 608748 | - | 26 | Chr_ORF_4 | NA | Hypothetical protein | N | N | N |
| 1840846 | 1840926 | + | 26 | Chr_ORF_10 | NA | Hypothetical protein | Y | N | N |
| 160004 | 160093 | - | 29 | sml0004 |
| Cytochrome | Y | Y | N |
| 586617 | 586525 | - | 30 | Chr_ORF_3 | NA | Hypothetical protein | Y | N | N |
| 2414584 | 2414679 | + | 31 | smr0001 |
| Photosystem II PsbT protein | Y | Y | N |
| 467201 | 467296 | + | 31 | smr0005 |
| Photosystem I subunit XII | Y | N | N |
|
|
|
|
|
|
|
|
|
|
|
| 1643502 | 1643600 | - | 32 | ssl3803 |
| Cytochrome | Y | N | N |
| 3097275 | 3097379 | + | 34 | smr0002 | NA | Transposase, fragment | Y | N | N |
| 146724 | 146831 | - | 35 | sml0003 |
| Photosystem II reaction center M protein | Y | Y | N |
| 468997 | 468887 | - | 36 | Chr_ORF_2 | NA | Hypothetical protein | Y | N | N |
| 3118192 | 3118302 | + | 36 | smr0003 |
| Cytochrome | Y | N | N |
|
|
|
|
|
|
|
|
|
|
|
| 473802 | 473915 | - | 37 | sml0009 | NA | VapC fragment | Y | N | Y |
| 32865 | 32978 | - | 37 | ssl5031 | NA | NA | Y | N | N |
| 831101 | 831217 | - | 38 | sml0006 |
| 50S ribosomal protein L36 | Y | Y | Y| |
| 1823570 | 1823686 | + | 38 | smr0010 |
| Cytochrome | Y | Y | N |
| 2350140 | 2350256 | - | 38 | sml0001 |
| Photosystem II reaction center PsbI protein | Y | Y | N |
| 571084 | 571203 | + | 39 | smr0007 |
| Photosystem II PsbL protein | Y | Y | N |
| 571236 | 571355 | + | 39 | smr0008 |
| Photosystem II PsbJ protein | Y | Y | N |
| 1268189 | 1268308 | - | 39 | sml0007 |
| Photosystem II protein Y | Y | N | N |
| 2613481 | 2613600 | - | 39 | sml0002 |
| Photosystem II PsbX protein | Y | N | N |
| 2816991 | 2817110 | - | 39 | sgl0001 | NA | Hypothetical protein | Y | N | N |
| 3140045 | 3140164 | - | 39 | sll0047 |
| Psb30, YCF12 | Y | N | N |
| 1687326 | 1687448 | - | 40 | sml0008 |
| Photosystem I subunit IX | Y | Y | N |
| 3458023 | 3458145 | + | 40 | smr0004 |
| Photosystem I subunit VIII | Y | Y | N |
| 3188105 | 3188227 | - | 40 | sml0013 |
| NdhP | Y | Y | N |
| 2138496 | 2138621 | + | 41 | Chr_ORF12 | NA | Hypothetical protein | Y | N | N |
| 633626 | 633754 | + | 42 | Chr_ORF5 | NA | Transposase, fragment | Y | N | N |
| 273512 | 273381 | - | 43 | Chr_ORF1 | NA | Hypothetical protein | Y | N | N |
| 1167333 | 1167464 | + | 43 | smr0009 |
| Photosystem II PsbN protein | Y | Y | N |
| 570940 | 571074 | + | 44 | smr0006 |
| Cytochrome b559 b subunit | Y | Y | N |
| 3067172 | 3067306 | - | 44 | Norf8 | NA | NA | Y | N | N |
| 30142 | 30008 | - | 44 | pSYSA_ORF3 | NA | N terminus of bifunctional aconitate hydratase 2/2-methylisocitrate dehydratase; similar short homologs also in other bacteria | Y | N | Y |
| 553065 | 553202 | - | 45 | sml0005 |
| Photosystem II PsbK protein | Y | Y | N |
| 1826764 | 1826901 | + | 45 | smr0011 |
| 50S ribosomal protein L34 | Y | Y | Y |
| 903627 | 903764 | - | 45 | sml0012 | NA | Hypothetical protein | Y | N | N |
| 14477 | 14340 | - | 45 | pSYSM_ORF1 | NA | O-acetyl-N-acetylneuraminate esterase, partial | Y | N | N |
| 1842716 | 1842856 | - | 46 | Norf6 |
| NdhQ subunit of the NDH-1 L complex | Y | N | N |
| 1141803 | 1141946 | - | 47 | ssl1633 |
| HliC, CAB/ELIP /HLIP superfamily | Y | Y | N |
|
|
|
|
|
|
|
| Y |
|
|
| 572978 | 573124 | - | 48 | sml0010 | NA | Transposase | Y | N | N |
| 159812 | 159961 | - | 49 | sgl0002 | NA | Hypothetical protein | Y | N | N |
| 2595112 | 2595264 | - | 50 | ssl0090 | NA | Hypothetical protein | Y | N | N |
The start and end positions according to the chromosomal or plasmid sequences in Genbank files (accessions NC_000911, AP004311 and AP004310), the location (S) on the forward (+) or reverse strand (−) and respective length (L; in amino acids) are given, followed by the locus tag ID, gene name and product if assigned. Location on chromosome or one of the plasmids is prefixed by “Chr” or the name of the plasmid. The existence of homologs in Synechocystis 6714, A. thaliana and E. coli is indicated by “Y” for yes or “N” for no. Homologs tagged and detected in this study are highlighted in boldface letters. Names of genes tested in this work are in boldface
Fig. 2Western blot detection of small proteins. Recombinant Synechocystis 6803 cells carrying the genes of interest (goi) under control of the petE promoter on pVZ322 vector were collected before (−) or 24 h after induction of gene expression (+) for the extraction of total proteins. FLAG-tagged superfolder GFP (sfGFP) under the control of the petJ promoter [37] served as positive control, a WT strain carrying an empty pVZ322 vector was used as negative control (n.c.). Theoretical protein masses are listed in Table 3. Two gels were run in parallel. a Proteins (30 μg) were separated on a 15% (w/v) SDS polyacrylamide gel and subjected to colloidal Coomassie G-250 staining as a loading control. b Immunoblot with the same loading order probed with specific ANTI-FLAG® M2-Peroxidase (HRP) antibody
Physicochemical properties of μ-proteins overexpressed in Synechocystis 6803. TMR, putative transmembrane domains, predicted using TMHMM v. 2.0 [41]
| Protein | Length (aa) | MW untagged | + FLAG tag | Predicted protein | ||
|---|---|---|---|---|---|---|
| [kDa] | pI | [kDa] | pI | Domains | ||
| Norf4 | 31 | 3.16 | 5.59 | 5.99 | 4.08 | 1 TMR |
| HliR1 | 37 | 4.11 | 7.92 | 6.93 | 4.30 | 1 TMR |
| Norf1 | 48 | 5.47 | 3.76 | 8.30 | 3.88 | |
| Ssr1169 | 54 | 6.09 | 7.96 | 8.92 | 4.48 | Pmp3; 2 TMR |
| NsiR6 | 66 | 7.10 | 6.05 | 9.92 | 4.40 | PHA02325 |
Fig. 4The NsiR6 peptide. a Transcriptomic datasets indicated high read coverage in a region without annotation in Synechocystis 6803 [21], which contains the here defined nsiR6 gene. The homolog in Synechocystis 6714 is D082_18940 [20]. Shown is the read coverage (grey) resulting from previous transcriptome analysis, including the respective transcriptional units (TU) defined in that work [20, 21] and a putative NtcA binding site, centered 42 nt upstream the transcription initiation site in both strains. Relevant transcription initiation sites appear as steep increase in read coverage and are labelled by a black arrow. The length of the 5’-UTRs is 26 nt in both strains. Other non-coding TUs are colored orange. There is transcription in antisense orientation in both strains but with much lower coverage. b Northern blot showing the nitrogen stress-induced transcript accumulation of the NsiR6 mRNA in Synechocystis 6803 over 72 h. Time point 0 refers to the nitrogen-replete condition. c Time course of NsiR6 mRNA accumulation after normalization to 5S rRNA. The data are presented as relative to the signal at 6 h after diminishing N (=100%). d Sequence comparison of NsiR6 homologs from the two Synechocystis strains, Cyanothece ATCC 51142, Crocosphaera watsonii WH 8502 and the two marine Synechococcus strains WH 8102 and WH 8103 which harbor an identical protein. Four conserved cysteine residues are highlighted by arrows. These are conserved in all 63 homologs detected throughout the cyanobacterial phylum
Fig. 3Heatmap indicating the expression of the genes encoding the five investigated small proteins in Synechocystis strains PCC 6803 and PCC 6714 under 10 different growth conditions: exponential (Exp.) and stationary growth phase (Stat.); cold (15 °C) and heat (42 °C) stress for 30 min each; depletion of inorganic carbon (−C), cells were washed 3 times with carbon-free BG11 and cultivated further for 20 h; dark, no light for 12 h; Fe2+ limitation (−Fe), addition of iron-specific chelator desferrioxamine B (DFB) and further cultivation for 24 h; high light (HL), 470 μmol photons m−2s−1 for 30 min; nitrogen depletion (−N), cells were washed 3 times with nitrogen-free BG11 and cultivated further for 12 h; phosphate depletion (−P), cells were washed 3 times with phosphate-free BG11 and further incubated for 12 h. Data derived from previous genome-wide expression analysis by differential RNA-Seq [20, 21]. Values indicate sequencing read counts for the primary 5′ end (= transcriptional start site [TSS]) of the corresponding transcriptional unit (TU). The TSS positions are given for the Synechocystis genomes available under accession numbers BA000022 and CP007542. The colour varies from red (no expression) to yellow (intermediate expression) to green (high expression)
Fig. 5Sequence comparison and regulated expression of the norf1 gene. a Sequence comparison of Norf1 homologs from the two Synechocystis strains, Cyanothece ATCC 51142, 5 different strains of Crocosphaera encoding an identical protein and Anabaena (Nostoc) sp. PCC 7120. b Expression of norf1 in Synechocystis 6803 is strongly upregulated after transfer to darkness. c Bioluminescence of a Synechocystis 6803 reporter strain harboring a transcriptional fusion of Pnorf1 (−328 to +137, TSS at +1) and luxAB genes in response to transfer to darkness. Representative bioluminescence dataset indicating means ± SD of measurements for two biological replicates (= independent transformants). A strain carrying a promoterless luxAB was used as a negative control (measured in two independent cultures each)
Fig. 6The Norf4 peptide. a Datasets from the previously performed primary transcriptome analysis showed that Norf4 expression responded positively to nitrogen depletion in both Synechocystis 6803 [21] and Synechocystis 6714 [20], when 11 different growth conditions were tested. The mRNAs of norf4 and gap1 are co-regulated and overlap by several hundred nt. The previously mapped transcriptional start sites are labelled by black arrows. b Northern blot analysis of norf4 mRNA accumulation in a time course experiment up to 72 h after the removal of nitrogen. The same RNA samples were used as in Fig. 4. c The signals obtained from the Northern blots (panel b) were evaluated densitometrically after normalization by the level of 5S rRNA. The relative norf4 expression is shown with respect to the maximum expression after transfer to nitrogen-free conditions (24 h = 100%). The bands at 200 nt (filled circles) and at 800 nt (empty circles) were analyzed separately from each other. d Multiple sequence alignment of 35 homologs from 51 different cyanobacterial genome sequences (homologs are identical among 12 Microcystis, five Crocosphaera watsonii and two Fischerella genome sequences)
Fig. 7The HliR1 peptide in Synechocystis 6803. a Pairwise sequence alignment of the HliR1 peptides from Synechocystis 6803 and Synechocystis 6714. A predicted transmembrane region is boxed. b Data replotted from the primary transcriptome analysis of Synechocystis 6803 suggest that HliR1 expression is induced by high light and that transcripts may extend into the subsequent TU1649 covering the sodB gene [21]. c Northern analysis of hliR1 mRNA accumulation upon transfer to high light (HL) or normal light (NL). d Quantification of the hliR1 mRNA accumulation shown in panel c after normalization to the 5S rRNA level. Relative values refer to the maximum level at 0.5 h after HL shift (=100%)
Fig. 8Sequence alignment of Ssr1169 homologs from cyanobacteria with those from Arabidopsis thaliana, Desulfococcus oleovorans Hxd3, and identical proteins in 4 strains of Rhodospirillum rubrum. Putative transmembrane domains were predicted using TMHMM v. 2.0 [41] and are boxed
Fig. 9Detection of μ-proteins upon expression from their native promoters and 5′ UTRs. Recombinant Synechocystis 6803 cells carrying the respective genes under control of their own promoters and 5′UTRs on vector pVZ322 were collected from cultures grown at standard conditions (0) and after transfer to the respective inducing condition at indicated time points (6 or 24 h) or in case of Ssr1169 after 24 h at standard condition. The Western blot was probed with specific ANTI-FLAG® M2-Peroxidase (HRP) antibody. All samples were separated on the same gel and transferred to the same membrane but the part probed for Norf4 had to be exposed longer because of its lower expression. Prestained Protein Ladder (10–170 kDa, Fermentas) was used as molecular weight marker