| Literature DB >> 18769676 |
Claudia Steglich1, Matthias E Futschik, Debbie Lindell, Bjoern Voss, Sallie W Chisholm, Wolfgang R Hess.
Abstract
Prochlorococcus, an extremely small cyanobacterium that is very abundant in the world's oceans, has a very streamlined genome. On average, these cells have about 2,000 genes and very few regulatory proteins. The limited capability of regulation is thought to be a result of selection imposed by a relatively stable environment in combination with a very small genome. Furthermore, only ten non-coding RNAs (ncRNAs), which play crucial regulatory roles in all forms of life, have been described in Prochlorococcus. Most strains also lack the RNA chaperone Hfq, raising the question of how important this mode of regulation is for these cells. To explore this question, we examined the transcription of intergenic regions of Prochlorococcus MED4 cells subjected to a number of different stress conditions: changes in light qualities and quantities, phage infection, or phosphorus starvation. Analysis of Affymetrix microarray expression data from intergenic regions revealed 276 novel transcriptional units. Among these were 12 new ncRNAs, 24 antisense RNAs (asRNAs), as well as 113 short mRNAs. Two additional ncRNAs were identified by homology, and all 14 new ncRNAs were independently verified by Northern hybridization and 5'RACE. Unlike its reduced suite of regulatory proteins, the number of ncRNAs relative to genome size in Prochlorococcus is comparable to that found in other bacteria, suggesting that RNA regulators likely play a major role in regulation in this group. Moreover, the ncRNAs are concentrated in previously identified genomic islands, which carry genes of significance to the ecology of this organism, many of which are not of cyanobacterial origin. Expression profiles of some of these ncRNAs suggest involvement in light stress adaptation and/or the response to phage infection consistent with their location in the hypervariable genomic islands.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18769676 PMCID: PMC2518516 DOI: 10.1371/journal.pgen.1000173
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Figure 1Computational pipeline for the identification of novel genetic elements from microarray data.
The data were derived from experiments studying stress of Prochlorococcus MED4 induced by light changes, phosphorus starvation, and phage infection. Note that the sum of classified transcripts is unequal to the number of transcriptional units due to the occurrence of dicistronic elements. In addition to the 12 ncRNAs newly detected by microarray analysis, two ncRNAs Yfr12 and Yfr18 were found based on similarity.
Synopsis of ncRNAs in Prochlorococcus MED4. Newly identified ncRNAs are based on microarray expression data or on sequence similarity (Yfr12 and Yfr18) according to the scheme shown in Figure 1.
| probe names | new name | genomic location | length in nt | adjacent genes | genomic context | occurence in | Genomic island | mean MFE z score | SVM RNA-class probability |
| IG0367f11-17 | Yfr8 | 339952..340241 | 180, 220, 290 | PMED4_03781 and PMED4_03791 (PMM0356) | → → → | MIT9301, MIT9515, MIT9312, AS9601, MIT9215 | 1 | −4.64 | 0.999616 |
| IG0367r2-5 | Yfr9 | complement(339887..340176) | 180, 220, 290 | PMED4_03781 and PMED4_03791 (PMM0356) | → ← → | MIT9301, MIT9515, MIT9312, AS9601, MIT9215 | 1 | −2.54 | 0.676448 |
| IG0375f18-20 | Yfr10 | 346661..346738 | 78, 270 | PMED4_03891 and Yfr2 | ← → → | MIT9301, MIT9312 | 1 | −4.69 | 0.991539 |
| IG0390r20-22 | Yfr11 | complement(358606..358754) | 149 | PMED4_04111 and PMED4_04121 | |||||
| (PMM0378) | ← ← ← | MIT9515 | 1 | −2.38 | 0.998347 | ||||
| - | Yfr12 | 383230..383312 | 83 | PMED4_04471 and Yfr4 | → → → | MIT9301, MIT9312 | - | −3.09 | 0.996117 |
| IG0668r5 | Yfr13 | complement(621601..621701) | 101 | PMED4_07021 and PMED4_07031 | |||||
| (PMM0652) | ← ← ← | MIT9301, MIT9515, MIT9312, MIT9215, NATL1a, NATL2a | - | −1.78 | 0.600913 | ||||
| IG0676f22-24 | Yfr14 | 627811..628062 | 252 | PMED4_07142 and PMED4_07151 | → → → | n.d. | - | n.d. | n.d. |
| IG0702r18-21 | Yfr15 | complement(654301..654562) | 86, 262 | PMED4_07441 and Yfr3 | → ← → | n.d. | 2 | n.d. | n.d. |
| IG0704r24-25 | Yfr16 | complement(657392..657529) | 138 | PMED4_07521 and PMED4_07522 | ← ← → | MIT9515 | 2 | −2.38 | 0.998347 |
| IG0714f13 | Yfr17 | 664667..664930 | 76, 285, 450 | PMED4_07691 and PMED4_07701 (PMM0699) | ← → → | MIT9301, MIT9515, MIT9312,AS9601, MIT9215 | - | −1.05 | 0.000000 |
| - | Yfr18 | complement(972284..972388) | 29, 76 | Yfr5 and PMED4_11681 | ← ← → | MIT9301, MIT9515, MIT9312, AS9601, MIT9215 | - | −0.92 | 0.996639 |
| IG1142r19-25 | Yfr21 | complement(1069553..1069757) | 205 | PMED4_12671 and PMED4_12681 | |||||
| (PMM1121) | ← ← ← | no homologues | - | n.d. | n.d. | ||||
| IG1157r8 | Yfr19 | complement(1086081..1086162) | 82 | PMED4_12891 and PMED4_12901 | |||||
| (PMM1135) | → ← → | MIT9301, MIT9312, AS9601, MIT9215 | - | −3.89 | 0.999504 | ||||
| IG1412f30 | Yfr20 | 1336435..1336523 | 89 | PMED4_15791 and PMED4_15792 | → → → | MIT9515, MIT9312 | 5 | −0.77 | 0.142982 |
Microarray probe IDs are given together with the respective ncRNA name, location and length. Computational support values (mean MFE z score and SVM RNA class probability) have been calculated according to Washietl et al. [65].
length determined from Northern results and verified by 5′RACE.
values derived from RNAz.
not detected by Northern blot.
length and location determined from 5′RACE results.
precursor of Yfr10 and Yfr2.
values derived from consensus structure of Yfr11, Yfr16 and a single homologue in MIT9515.
values without MIT9301, MIT9312 and MIT9215.
values derived from 264 nt long RNA as determined by 5′RACE; without AS9601, MIT9301 and MIT9215.
values without MIT9301 and MIT9215.
values derived using the same MIT9301 and MIT9312 homologues.
Figure 2Detection of novel ncRNAs by Northern hybridization.
The lengths of hybridizing fragments were calculated by comparison to two different size markers (M) – size in nt is shown to the left of the markers. 50 µg of total RNA was loaded per lane on a 10% polyacrylamide gel that was electro-blotted on Hybond-N nylon membranes. All ncRNAs were detected with single strand RNA probes except for those labeled with an asterisk, which were hybridized to an oligonucleotide probe.
Figure 3Genomic location of Yfr13 (red boxes) in seven different Prochlorococcus strains.
Homologous genes in different strains are indicated by blue boxes and those without homology in this genomic region are shown in white. Gene designations are given according to published genome sequences for MED4 (BX548174.1), MIT9312 (CP000111.1), MIT9515 (CP000552.1), MIT9215 (CP000825), MIT9301 (CP000576.1), NATL1A (CP000553.1), NATL2A (CP000095.2).
Figure 4Regulation of ncRNAs under light and phage stress (no changes were found under P-stress).
Fold changes (log2) of triplicate experiments are shown. For the light experiment grey vertical lines indicate the border of ≥2-fold change after 45 min of light treatment. Only ncRNAs with significant differential expression and their oppositely located ncRNA pair (when present) are illustrated for the phage experiment. Standard errors derive from experimental repetitions of all probes that gave a signal above threshold.
Figure 5Genomic location of Yfr20 (red boxes) in three high-light adapted Prochlorococcus strains.
Homologous genes are indicated by the same colors in the different strains and those without homology in this genomic region are shown in white. Gene designations are given according to published genome sequences for MED4 (BX548174.1), MIT9312 (CP000111.1) and MIT9515 (CP000552.1).
Figure 6A highly conserved RNA motif.
The conserved motif (5′-ACUCCUCACAC-3′) is highlighted in blue and the Prochlorococcus–specific extension of the motif is indicated by green letters. The two base mutations in Yfr12 are colored in purple. The color code for shaded bases is as follows: red, all sequences have the same two nucleotides; ochre, two types of base pairs occur; green, three types of base pairs occur; turquoise, four types of base pairs occur. The saturation decreases with the number of sequences unable to form a base pair at this position. Circles around letters indicate compensatory mutations. Consensus structures for Yfr1, Yfr10, Yfr18 were predicted with RNAlishapes [63] and drawn with RNAplot [64]. Homologous ncRNAs found in other Prochlorococcus strains that were used for structure predictions are listed in parentheses. The structure of Yfr12 was folded using RNAshapes and relies on the single sequence of Yfr12 from MED4.
Figure 7Gene arrangement of ncRNA pairs Yfr8/Yfr9 and Yfr6/Yfr14.
Potential peptide-coding open reading frames within the sequence of the RNAs are shown as white boxes. Positions in the genome are indicated by numbers for the forward and reverse (c) strand.