| Literature DB >> 26024923 |
Isabelle Rosinski-Chupin1,2, Elisabeth Sauvage3,4, Odile Sismeiro5, Adrien Villain6,7, Violette Da Cunha8,9, Marie-Elise Caliot10, Marie-Agnès Dillies11, Patrick Trieu-Cuot12, Philippe Bouloc13, Marie-Frédérique Lartigue14,15,16,17, Philippe Glaser18,19.
Abstract
BACKGROUND: Streptococcus agalactiae, or Group B Streptococcus, is a leading cause of neonatal infections and an increasing cause of infections in adults with underlying diseases. In an effort to reconstruct the transcriptional networks involved in S. agalactiae physiology and pathogenesis, we performed an extensive and robust characterization of its transcriptome through a combination of differential RNA-sequencing in eight different growth conditions or genetic backgrounds and strand-specific RNA-sequencing.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26024923 PMCID: PMC4448216 DOI: 10.1186/s12864-015-1583-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Characterization of transcription start sites in S. agalactiae. A. Visualization of sequence reads mapped to the genome of strain NEM316 in conditions of dRNA-seq: strand-specific sequencing of transcript 5′ ends with (TAP+) and without (TAP-) TAP treatment, and strand-specific RNA-seq. Two dRNA-seq experiments are shown corresponding to 1: RNA from multiple growth conditions (MG sample); 2: RNA from a ∆covRS mutant grown to mid-exponential phase. The RNA-seq library was prepared with the wt strain at mid-exponential phase. B. Detailed view of the 958000–978000 region. Protein coding genes annotated on the (+) and (−) strands are indicated by red and blue large arrows. TSSs are depicted as small arrows. Based on dRNA-seq and RNA-seq data, a transcript corresponding to a ncRNA (srn040/tmRNA) was annotated and is shown as a large green arrow. C. Proportions of the total TSSs detected under each experimental condition. Grey: TSSs detected with RNA from multiple growth conditions (wt, MG); red: TSSs not detected in wt, MG. MG: mixture of growth conditions; LE: late exponential phase; midE: mid-exponential growth phase; AcStr: acid stress. D: Proportions of TSSs according to the number of experiments in which they were detected.
Main characteristics of the transcriptome (strain NEM316)
|
|
|
|
|---|---|---|
|
|
| |
| Coding genes | 2084 | 2082 previously annotated CDS, including 11 for which the initiation codon has been reannotated; 2 new CDS identified in this study |
| Pseudogenes | 36 | |
| rRNA operons | 7 | |
| tRNAs genes | 80 | 14 located outside the rRNA regions |
|
|
| |
| Primary TSS | 891 | |
| Secondary TSS | 36 | |
| TSS internal to CDS, same orientation as CDS | 191 | |
| TSS initiating antisense transcription | 39 | 34 TSS inside CDS, 5 TSS located close to CDS |
| Intergenic TSS | 53 | including 8 TSS upstream tRNAs genes and one upstream a rRNA operon |
|
|
| |
|
|
| 602 corresponding to predicted rho-independent terminators |
|
|
| |
| Operons identified in the study | 407 | including 56 operons with internal promoters, 26 operons with internal terminators and 15 operons with both internal promoters and internal terminators |
| Monocistrons | 484 | |
|
|
| |
| Cis-regulatory sequences | 39 | 23 belonging to rfam families and acting through attenuation of transcription |
| 5 belonging to rfam families and acting through regulation of translation (no sRNA) | ||
| 11 not belonging to rfam families and acting through attenuation of transcription | ||
| Cis-antisense sRNAs | 39 | |
| Trans acting sRNAs | 47 | 44 with TSS in intergenic regions, 1 with TSS at the end of a transcriptionally inactive gene and 2 likely resulting from the cleavage of longer transcripts |
Figure 2The vast majority of S. agalactiae promoters are sigma70 dependent. A. Motif search upstream of 1179/1210 S. agalactiae TSSs reveals extended Pribnow or −10 boxes and less conserved −35 boxes. The consensus sequence was generated using WebLogo (http://weblogo.berkeley.edu/logo.cgi). B. Mean distances between the −10 box and the TSS (d1) and between the −10 and −35 boxes (d2). Distances were calculated between the 3′ end nucleotide of the −10 box and the TSS and between the 3′ nucleotide of the −35 box and the 5′ nucleotide of the TATAAT sequence.
Figure 3Extensive reiterative transcription in S. agalactiae. A. Pie chart of the occurrence of pseudo-templated nucleotides at the transcription initiation sites. B. Functional classification of genes submitted to reiterative transcription according to the KEGG categories (AA: Amino acid metabolism; Carb.,E.: carbohydrate, glycan and energy metabolism; Lip.: Lipid metabolism; Transport: Membrane transport; Cof.,Vit: metabolism of cofactors and vitamins; Nucl.: Nucleotide metabolism; Rep.: Replication and Repair; Transc, ST: Transcription, Signal transduction; Transl., deg.: translation, protein folding, sorting and degradation; Unknown: genes for which the KEGG category was not defined). TSSs were classified according to the functional class of the first gene of the transcription unit. The number of TSSs linked to each class is indicated in brackets. The mean proportion of TSSs with reiterative transcription is indicated by the red line. The proportion of genes showing reiterative transcription was found to be significantly higher (p < 0.01) for genes involved in nucleotide metabolism and lower (p < 0.01) for genes involved in carbohydrate, glycan and energy metabolism using chi-square and Fisher exact tests. C. Reiterative transcription at the thyA gene. The promoter of thyA directs initiation of transcription at a T or a G (shown in red on the non-template strand of the DNA sequence). The G position also corresponds to the first nucleotide of the Shine-Dalgarno (SD) sequence. Initiation at the T position leads to reiterative incorporation of pseudo-templated U. D. The proportion of reads containing pseudo-templated nucleotides in thyA transcripts is found to vary depending on growth conditions: mid-exponential and late exponential phases in the wt and the ∆covRS mutant of strain NEM316. The total number of reads corresponding to the TSS is indicated in brackets next to the experimental condition in the figure legend.
Figure 4Distribution of 5′UTR lengths in S. agalactiae. Numbers of mRNA individual 5′UTRs according to their length based on 891 primary TSSs upstream coding genes. 78 genes were considered as leaderless with 5′UTRs shorter than 5 nucleotides, as indicated by the red box. 49% of the 5′UTRs have a size comprised between 15 and 35 nucleotides as indicated by black vertical lines
Figure 5Identification of a novel riboswitch upstream gbs1262. A . Transcriptional organization deduced from dRNA-seq and RNA-seq experiments. Reads aligning upstream gbs1262 were visualized by IGV browser [67]. The sequence of the sRNA resulting from transcription premature arrest at a rho-independent terminator is indicated as a grey arrow. B. Secondary structure predicted by RNalifold [71], based on the alignment of 14 sequences similar to gbs1262 5′UTR in Lactobacillales and upstream a potential tryptophan-related gene in F. nucleatum (Additional file 9). The two conserved folded structures are indicated in red and green boxes, with the green box corresponding to a rho-independent terminator. Sequence covariations supporting the consensus structure are marked by color: red marks pairs with no sequence variation; ochre and green mark pairs with 2 or 3 types of pairs, respectively. C and D. Alternative structures of gbs1262 putative riboswitch as determined by mfold [72], showing the formation of an antiterminator structure.
Figure 6Modifications in sRNA expression in response to an acidic stress. A. Detection of Srn015/csRNA10 and Srn070/csRNA12 by combined analysis of RNA-seq and dRNA-seq experiments is given as an example of sRNA detection B. sRNA expression was quantified using RNA-seq (triplicate experiments) on exponentially growing bacteria submitted or not to a 20 min-acid stress. Normalization and statistical assessment were performed using the EdgeR software. Expression values were arbitrarily fixed to 1 for samples grown at pH7. Ten sRNAs were shown differentially regulated (p <0.05 after correction for multiple-testing adjustment) in acidic conditions: 6 up-regulated, left panel and 4 down-regulated, right panel.