
The dawning era of comprehensive transcriptome analysis in cellular microbiology.

Chihiro Aikawa, Fumito Maruyama, Ichiro Nakagawa.

Abstract

Bacteria rapidly change their transcriptional patterns during infection in order to adapt to the host environment. To investigate host-bacteria interactions, various strategies have been used, including animal infection models, in vitro assay systems and microscopic observations. However, these studies primarily focused on a few specific genes and molecules in bacteria. High-density tiling arrays and massively parallel sequencing analyses are rapidly improving our understanding of complex host-bacteria interactions through the identification and characterization of bacterial transcriptomes. These high-throughput techniques will continue to provide novel information on the complexity, plasticity, and regulation of bacterial transcriptomes as well as their adaptive responses relative to pathogenicity. Here we summarize recent studies using these new technologies and discuss the utility of transcriptome analysis.


Keywords:  massively parallel sequencing; tiling array; transcriptome

Year:  2010        PMID: 21687718      PMCID: PMC3109594          DOI: 10.3389/fmicb.2010.00118

Source DB:  PubMed          Journal:  Front Microbiol        ISSN: 1664-302X            Impact factor:   5.640


Introduction

The host expresses various defense systems against bacterial infection. At the same time, pathogens attempt to protect themselves from recognition and removal by the host immune system through changes in their gene expression patterns. Thus, multidirectional investigations of the bacterial factors involved in adaptation to host environments are required to understand the interactions between the host and bacteria and to clarify the acquired and innate immune responses against infection. To address this issue, various strategies have been used, including animal infection models, in vitro assay systems and microscopic observations. Until recently, the majority of these reports focused on the roles of a few specific genes of either the pathogen or its host, primarily because of technical limitations. These detailed and focused studies greatly strengthened our understanding, but only of a limited portion of host–bacteria interactions; they did not reflect the dynamics of transcriptional regulation at the whole genome level between the host and pathogen upon infection. Comprehensive analyses of the function and regulation of global factors involved in bacteria–host interactions should be invaluable because such approaches may reveal novel virulence genes or mechanisms that have not previously been linked to bacterial infection.

Recently, whole genomic tiling arrays and massively parallel sequencing approaches have emerged as powerful tools in microbiology. Genomic tiling arrays use a set of overlapping oligonucleotide probes that represent a subset of the genome or the whole genome at very high resolution (Wang et al., 2009). Two general types of tiling arrays are most widely used (Mockler et al., 2005). The first type generally contains relatively short probes (<100-mer) synthesized directly on the surface of a chip by a photolithographic method (Fodor et al., 1991; Hughes et al., 2001; Nuwaysir et al., 2002). This type of array can be made with more than 6 million discrete features, each of which contains millions of copies of a distinct probe. The second type is made by mechanically printing probes, including amplified PCR products, oligonucleotides or cloned DNA fragments, onto the chip. This type of array can hold up to nearly 40,000 features per chip (Mockler et al., 2005).

For massively parallel sequencing, three commercial technologies are now widely used: Roche 454 FLX Titanium (Roche Diagnostics, Basel, Switzerland), Illumina Genome Analyzer (Illumina Inc., San Diego, CA, USA) and Life Technologies SOLiD (Applied Biosystems by Life Technologies, CA, USA). Each can produce from millions to a billion sequences in a single run. These high-throughput sequencing technologies allow cost-effective DNA sequencing compared with standard dye-terminator Sanger methods. Roche 454 FLX Titanium technology is based on pyrosequencing, and its advantages are long sequence reads (400 bp) and a relatively rapid sequencing run (approximately 10 h per run). However, it generates the smallest amount of data of the three sequencers (>400 Mbp per run) and may produce homopolymer errors because multiple nucleotides can be incorporated in a given cycle (Engstrand, 2009). The Illumina GA technology is based on massively parallel sequencing of millions of fragments using a reversible terminator-based sequencing chemistry. Its advantages are the large amount of data generated (100 Gbp total per run) and fewer homopolymer errors than Roche 454 technology. However, it generates relatively short reads (100 bp) and requires a long sequencing run (7 days). Finally, the Life Technologies SOLiD technology is based on sequencing by ligation of dye-labeled oligonucleotides. This technology can handle many samples in a single run using multiple sequence tags and generates large datasets (>100 Gbp total per run). However, its disadvantages are the short read length (50 bp) and, as with Illumina technology, the long run time.

Applications of tiling arrays and massively parallel sequencing include de novo assembly, chromatin immunoprecipitation analyses, genome resequencing, and metagenomics. Here we describe several major applications of both technologies (Figure 1) and briefly introduce their relevance to bacteria–host interactions.
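The throughput figures quoted above can be related to one another with simple arithmetic: dividing the data yield per run by the read length gives a rough number of reads per run. The following Python sketch does only that. The yields and read lengths are the approximate values stated in the text (several are lower bounds), the function and dictionary names are illustrative, and the results should be read as orders of magnitude rather than specifications.

```python
# Per-run figures as quoted in the text above; the yields are approximate
# (several are given as lower bounds), so results are order-of-magnitude only.
PLATFORMS = {
    "Roche 454 FLX Titanium": {"yield_bp": 400_000_000, "read_len_bp": 400},
    "Illumina GA": {"yield_bp": 100_000_000_000, "read_len_bp": 100},
    "SOLiD": {"yield_bp": 100_000_000_000, "read_len_bp": 50},
}


def estimated_reads(yield_bp, read_len_bp):
    """Rough number of reads per run: total bases divided by read length."""
    return yield_bp // read_len_bp


for name, spec in PLATFORMS.items():
    reads = estimated_reads(spec["yield_bp"], spec["read_len_bp"])
    print(f"{name}: about {reads:.1e} reads per run")
```

Under these assumptions the 454 platform yields on the order of a million reads per run and the two short-read platforms on the order of a billion, which is consistent with the range of "millions to a billion" sequences given in the text.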
Figure 1

Applications of tiling array and massively parallel sequencing. Transcriptome analysis, genome resequencing and protein–DNA interaction (ChIP-) studies can employ both tiling array and massively parallel sequencing while applications like metagenomic studies and de novo assembly can only be performed using massively parallel sequencing. Tiling array: Blue, Massively parallel sequencing: Red.

De novo genome assembly using massively parallel sequencing and/or Sanger sequencing has been performed for several bacterial genomes, including Mycoplasma conjunctivae (Calderon-Copete et al., 2009), Brucella microti (Audic et al., 2009), and Helicobacter pylori strain G27 (Baltrus et al., 2009). This approach provides rapid and low-cost closure of whole genome assemblies and is useful for producing fine drafts of genome assemblies for other bacteria.

Genome resequencing using both approaches can accurately characterize mutant genomes relative to previously sequenced parental (reference) strains. In this approach, sequence differences such as insertions/deletions or single nucleotide polymorphisms (SNPs) are identified by comparing mutant and reference strains. This approach has been applied to methicillin-resistant Staphylococcus aureus (Kennedy et al., 2008), Chlamydia trachomatis (Kari et al., 2008), Brucella species (Foster et al., 2009), and Salmonella enterica serovar Typhimurium (Holt et al., 2008). These studies have demonstrated the value of genome resequencing for defining distinct virulence factors.

Chromatin immunoprecipitation analyses followed by microarrays (ChIP-chip) or sequencing (ChIP-seq) have been developed as powerful methods for the study of genome-wide protein–DNA interactions. These approaches can accurately identify transcriptional factors regulating bacterial pathogenesis at the whole genome level. ChIP-chip analysis using tiling arrays has been performed for Bacillus subtilis (Ishikawa et al., 2007) and Escherichia coli (Cho et al., 2008a,b). In addition, ChIP-seq analysis using massively parallel sequencing has been carried out with Mycobacterium tuberculosis (Lun et al., 2009). Few studies have used ChIP-seq to date; however, as sequencing becomes faster and cheaper, ChIP-seq will likely become more widely available for mapping sites of protein–DNA interactions.

Metagenomics is the genomic analysis of microbial communities by direct extraction of DNA from an assemblage of microorganisms (Handelsman, 2004) and reveals landscapes of bacterial diversity for a wide range of environments. This analysis was initially performed with Sanger sequencing (Venter et al., 2004; Kurokawa et al., 2007) but has recently been conducted using massively parallel sequencing. Several projects including the characterization of the soil metagenome (Roesch et al., 2007), the honey bee metagenome (Cox-Foster et al., 2007), the human gut metagenome (McKenna et al., 2008), the mouse gut metagenome (Turnbaugh et al., 2006), the mine metagenome (Edwards et al., 2006), and the chicken cecum metagenome (Qu et al., 2008) have been recently completed.

In addition to the above applications, transcriptome analysis is a novel application for a better understanding of host–bacteria interactions. Eukaryotic transcriptome analyses by massively parallel sequencing have recently been carried out because of the effectiveness and power of this approach in collecting data (Mardis, 2008; Shendure and Ji, 2008; Wang et al., 2009; Wilhelm and Landry, 2009). For bacteria, these analytical strategies are now available for elucidating the complexity of transcriptomes, but only a few applications have been carried out so far. In addition to high-throughput mRNA sequencing (RNA-seq) using massively parallel sequencing, genomic tiling arrays have been used in genome-wide transcriptome analyses. In this review, we summarize recent significant reports in the field of cellular microbiology in which these two powerful tools, RNA-seq and genomic tiling arrays, have been used. We also describe the significance of these technologies for gaining further knowledge of the transcriptional regulation of pathogenicity.

Use of Tiling Array Technology for Bacterial Transcriptomes

Compared with massively parallel sequencing, tiling arrays do not always require mRNA enrichment, and their experimental protocols are now well established. Despite these advantages, tiling arrays have one major drawback: their transcriptome maps are usually of lower resolution than those produced by RNA-seq. Ideally, a tiling array would include a probe beginning at every single base position in the genome (Sorek and Cossart, 2010). However, most tiling arrays have lower densities, mainly because of cost. In addition, tiling arrays often produce high backgrounds because of non-specific or cross-hybridization reactions (van Vliet, 2010). Thus, the raw data from tiling arrays must be subjected to extensive normalization. After suitable normalization, tiling arrays reveal dynamic and abundant units of transcription. Conventional open reading frame (ORF) microarrays are designed to detect gene expression with relatively few probes for known or predicted genes. In contrast, tiling arrays can identify many novel non-coding RNAs (ncRNAs) because their probes span the entire genome. Tiling arrays have therefore been the major technique for transcriptome analysis to date and have been applied to several bacterial transcriptome studies, including Bacillus subtilis, Caulobacter crescentus, Halobacterium salinarum, and Mycobacterium leprae (McGrath et al., 2007; Koide et al., 2009; Rasmussen et al., 2009; Akama et al., 2009). In addition, one excellent earlier study and several more recent studies in cellular microbiology that used tiling arrays to examine transcriptomes are summarized in the following sections (Table 1).
Table 1

Bacterial transcriptome analyses using genomic tiling array.

Bacteria | Experiments | Annotation | References
Anaplasma phagocytophilum | Comparison of transcriptomes expressed in tick or human cell | Characteristics of differently expressed transcripts: approximately 50% were membrane associated gene products (including 7 paralogs of virB2 that showed exclusive expressions) | Nelson et al. (2008)
Bacillus subtilis | Identification of the transcriptionally active regions in the genome | Composition of identified 3662 strand specific transcriptionally active regions (TARs); 77.3% of currently annotated genes: 84 putative non-coding RNAs (ncRNAs): 127 antisense transcripts. Predicted transcripts; a ncRNA ncr22 possibly contributes to translational control on cstA gene: an antisense transcript opposite to a housekeeping sigma factor sigA | Rasmussen et al. (2009)
Caulobacter crescentus | High-throughput identification of transcription start sites (TSS) of genes | Genes of TSS identified; 769 genes including 53 genes with multiple TSS. Regulatory-protein binding motifs identified; 27 including 17 novel motifs | McGrath et al. (2007)
Escherichia coli | Comparison of transcriptomes expressed in log and stationary phases | Characteristics of 1529 differently expressed transcripts. Upregulated in log phase; translation- and cellular membrane synthesis-related (including lpp) genes. Upregulated in stationary phase; starvation responding genes (including dps, rmf), putative receptor gene b0836 and a 30S ribosomal protein subunit S22 gene | Selinger et al. (2000)
Halobacterium salinarum | Characterization of transcription promoters within operons and coding sequences | Characterized features; widespread environment-dependent regulation of operon architectures, transcriptional starts and terminations within coding sequences; extensive overlaps in 3′-ends of transcripts initiated from convergent genes occur in a relation to the binding location of 11 transcriptional factors and regulators binding sites | Koide et al. (2009)
Listeria monocytogenes | Drawing the whole genome transcriptional landscape employing wild-type and mutant strains (prfA, sigB, hfq) | Classifications of RNA species; 50 low molecular weight species (less than 500 nucleotides): antisense RNAs covering several ORFs and 3′- and 5′-untranslated regions. Two colonization-related mechanisms were suggested; SigB controls the expression of genes important for the bacterial adaptation to the intestinal environment; PrfA and a cluster of pathogenic genes contribute to both survival and replication in blood | Toledo-Arana et al. (2009)
Mycobacterium leprae | Characterization of expression patterns of pseudogenes and non-coding regions | Characteristics of expression patterns; non-coding regions expressed in higher signal intensities than pseudogenes; these regions included M. leprae unique repetitive sequence (RLEP) and other novel non-coding sequences | Akama et al. (2009)
Salmonella enterica serovar Typhimurium (S. typhimurium) | Investigation of mutations to establish efficient live-vaccine strains through Transposon Mediated Differential Hybridization (TMDH) method | Candidate mutation-patterns: 47. Defined mutations for practically effective live-vaccine establishment: trxA and atpA | Chaudhuri et al. (2009)

Escherichia coli

In 2000, the transcriptomes of E. coli in the log and stationary phases were compared using tiling arrays (Selinger et al., 2000). The authors used probes averaging 25 nucleotides in length, spaced every 6 bases in intergenic regions and every 60 bases in ORFs. In nutrient-rich medium, transcripts detected in the stationary and log phases covered 97 and 87% of the ORFs, respectively. Under these conditions, 1529 transcripts showed differential expression. In log phase, genes involved in translation (rRNA, tRNA, and ribosomal proteins) and in the synthesis of the cell membrane (lpp) were expressed at higher levels than in stationary phase, whereas genes encoding proteins involved in the response to starvation (such as dps and rmf) were expressed at higher levels in stationary phase. In addition, genes for a putative receptor (b0836) and a 30S ribosomal protein subunit (S22) were shown for the first time to be highly upregulated in stationary phase (Selinger et al., 2000). The probe density in this study was higher than in previous studies, and significant expression of RNAs from antisense strands and intergenic regions was clearly detected. Thus, this study has been recognized as a milestone in the technical development of tiling arrays for prokaryotic transcriptome analyses.
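The probe layout described for this study (25-mer probes placed every 6 bases in intergenic regions and every 60 bases in ORFs) can be illustrated with a short Python sketch. This is not the authors' design software: the function names and the toy region coordinates are hypothetical, and the sketch simply enumerates probe start positions at a fixed step. With 25-mer probes, a 6-base step means neighbouring probes overlap by 19 bases, whereas a 60-base step leaves 35 untiled bases between probes.

```python
def probe_starts(region_start, region_end, step, probe_len=25):
    """Start coordinates of probes tiled every `step` bases within
    [region_start, region_end), keeping only probes that fit entirely."""
    last_start = region_end - probe_len
    return list(range(region_start, last_start + 1, step))


def tile_regions(regions, probe_len=25):
    """Tile a list of (start, end, kind) regions, where kind is 'orf' or
    'intergenic', using the spacings described in the text."""
    spacing = {"intergenic": 6, "orf": 60}
    probes = []
    for start, end, kind in regions:
        for s in probe_starts(start, end, spacing[kind], probe_len):
            probes.append((s, s + probe_len, kind))
    return probes


# Toy layout with hypothetical coordinates: one ORF followed by one
# intergenic region.
regions = [(0, 300, "orf"), (300, 400, "intergenic")]
probes = tile_regions(regions)
print(len(probes), "probes")
```

In this toy layout the 300-base ORF receives 5 probes and the 100-base intergenic region receives 13, which shows how the denser intergenic tiling favours the detection of short transcripts outside annotated genes.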

Anaplasma phagocytophilum

Anaplasma phagocytophilum causes the tick-borne disease human anaplasmosis. A. phagocytophilum can replicate in the tick cell line ISE6 (Munderloh et al., 2003) and in two human cell lines, HL-60 and HMEC-1, which have been used as models of human infection (Ades et al., 1992). Transcriptomes of A. phagocytophilum in tick cells (ISE6) and human cells (HL-60 and HMEC-1) were compared to obtain clues about life cycle regulation and the pathogenicity of this bacterium (Nelson et al., 2008). No significant difference was found between the bacterial transcriptomes expressed in the two human cell lines; however, distinct differences in the transcriptional activities of bacterial genes were observed between the two host species. Specifically, the transcriptional levels of half of the membrane-associated protein genes, including seven virB2 paralogs (associated with the bacterial type IV secretion system), were markedly distinct. Moreover, a few paralogs of the major surface protein genes p44/msp2 were newly identified through hybridization between transcripts and hypervariable regions (HVRs) in human cells, whereas none were found in ISE6. This study indicated the flexibility of the bacterium in adapting and altering its pathogenicity for different hosts by changing its transcriptional patterns.

Salmonella enterica serovar Typhimurium

Systemic typhoid fever is an important target for vaccine therapy (Mastroeni and Menager, 2003; Girard et al., 2006). Chaudhuri et al. (2009) used a microarray-based technology designated transposon mediated differential hybridization (TMDH) (Charles, 2001), together with tiling arrays, to identify attenuated transposon mutants of the bacterium in which genes required for virulence in mice had been inactivated, and examined selected genes from these mutants as live vaccine candidates. In this approach, modified transposons carrying outward-facing T7 and SP6 promoters were introduced into the bacterium, and the resulting pool of mutants was either used to infect mice or cultured in vitro. Subsequently, genomic DNA libraries from both infecting and cultured bacteria were prepared and subjected to in vitro transcription in the presence of isotope-labeled UTP. The isolated DNA/RNA mixtures were then digested with the restriction endonuclease RsaI and applied to the tiling arrays for analysis. The data from the infecting and cultured samples were used to calculate attenuation scores and yielded 47 candidate mutations, each affecting a distinct gene. Among these, subsequent analyses focused on two mutants as candidates for live vaccine strains: trxA, encoding thioredoxin 1, which is known to be important for infection of mice (Bjur et al., 2006), and atpA, which is involved in oxidative phosphorylation (Turner et al., 2003). Finally, mice immunized with either of the two strains carrying the respective candidate mutations were confirmed to be resistant to infection with wild-type S. typhimurium (Chaudhuri et al., 2009). Thus, the TMDH method with tiling arrays could be applied to other bacterial species to identify genes whose inactivation attenuates virulence.

Listeria monocytogenes

Listeria monocytogenes ubiquitously inhabits many different environments and often causes severe food-borne diseases. Toledo-Arana et al. (2009) used wild-type and mutant (prfA, sigB, hfq) strains to describe the complete operon map of the pathogen. It is known that PrfA controls transcription of virulence genes in the blood (Scortti et al., 2007) and that SigB mediates virulence activation in the host intestine (Chakraborty et al., 2000). Hfq is an RNA-binding protein involved in stress tolerance and virulence control (Christiansen et al., 2004). In this study, total RNA was extracted from ex vivo and in vitro cultures of the strains and was used with tiling arrays to analyze whole genome transcriptomes. As a result, a variety of RNA species were observed. These RNAs included 50 low molecular weight species (less than 500 nucleotides), at least two of which were involved in virulence in mice. Antisense RNAs covering several ORFs and 3′ and 5′ untranslated regions (UTRs) were also detected. Following detailed analysis, a possible role for a riboswitch functioning in the termination of an upstream gene was suggested. In addition, this study proposed that SigB specifically controls the expression of genes important for bacterial adaptation to the intestinal environment and that PrfA and a pathogenic gene cluster are involved in survival and replication in the blood. Interestingly, this analysis revealed that changes in the transcriptional levels of ncRNAs were similar to those of virulence genes in L. monocytogenes, although no such changes were observed in non-pathogenic L. innocua. As a consequence, it was suggested that successive and coordinated global transcriptional changes occur during infection (Toledo-Arana et al., 2009). This study represented significant progress in comprehensive whole-transcriptome analysis of a bacterial species. In addition, this report provided insight into the greater complexity of bacterial transcription than was previously predicted.

Use of Massively Parallel Sequencing for Bacterial Transcriptome Analysis

Approaches for studying pathogenic bacteria with massively parallel sequencing have greatly improved our knowledge of their pathogenicity, evolution and adaptation to different environments, including the host. The basic procedures involved are summarized below. Isolated bacterial RNA consists of approximately 80% rRNA and tRNA (Condon, 2007). Therefore, tRNA and rRNA are usually removed before reverse transcription (Passalacqua et al., 2009; Perkins et al., 2009; Yoder-Himes et al., 2009). Size fractionation of RNA prior to cDNA synthesis has optionally been used for the removal of mRNA and rRNA (Liu et al., 2009). Unlike eukaryotic mRNA, most bacterial mRNA lacks a poly-A tail, and thus hybridization to immobilized poly-T cannot be used to enrich mRNA relative to other RNA species. As a consequence, cDNA synthesis (reverse transcription) should use one of the following priming methods: random hexamers (Passalacqua et al., 2009; Perkins et al., 2009; Yoder-Himes et al., 2009), oligo(dT) priming after polyadenylation of mRNA (Frias-Lopez et al., 2008), or priming after ligation of specific RNA adaptors to mRNA (Sittka et al., 2008; Wurtzel et al., 2009). Sequencers such as Illumina GA/Solexa, ABI SOLiD, or 454 FLX/Titanium are now widely available for high-throughput analyses. As the first step in data processing, the adaptor sequences ligated before cDNA synthesis should be removed from the reads, and the reads should then be mapped to the genome sequence.
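The first data-processing step described above (removing adaptor sequence from each read and then mapping the read to the genome) can be sketched in a few lines of Python. This is a deliberately simplified illustration under stated assumptions: the adaptor and the toy genome are hypothetical, the adaptor is assumed to sit at the 5′ end of the read, and mapping is done by exact string matching on both strands, whereas real analyses rely on dedicated alignment software that tolerates mismatches and sequencing errors.

```python
def reverse_complement(seq):
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def trim_adaptor(read, adaptor):
    """Remove a 5' adaptor if the read starts with it; otherwise
    return the read unchanged."""
    return read[len(adaptor):] if read.startswith(adaptor) else read


def map_read(read, genome):
    """Return (position, strand) of the first exact match of the read
    on either strand of the genome, or None if it does not match."""
    pos = genome.find(read)
    if pos != -1:
        return pos, "+"
    pos = genome.find(reverse_complement(read))
    if pos != -1:
        return pos, "-"
    return None


# Hypothetical toy genome and adaptor, for illustration only.
genome = "ATGGCGTACGTTAGCCTAGGATCCGATTACA"
adaptor = "GGGTTT"

raw_reads = [
    adaptor + "GCGTACGTTAG",                    # plus-strand read
    adaptor + reverse_complement(genome[20:]),  # minus-strand read
    adaptor + "TTTTTTTTTT",                     # read that does not map
]
for raw in raw_reads:
    print(map_read(trim_adaptor(raw, adaptor), genome))
```

Reporting the strand alongside the position matters for the studies discussed here, because antisense transcripts can only be recognized when the strand of origin of each read is retained.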

Important Notes for Sample Preparation

Several reports that used RNA-seq with the relevant sequencers are summarized in Table 2. Among these, several excellent recent studies that were not covered in previous reviews (Sorek and Cossart, 2010; van Vliet, 2010) are introduced in the following sections.
Table 2

Bacterial transcriptome analyses using massively parallel sequencers.

Bacteria | Sequencer | Experiments | Annotation | References
Acinetobacter baumannii | Solexa | Investigation of transcriptional modulation in the presence of ethanol | Novel observations; ethanol upregulates 49 different genes including metabolic enzymes and stress-related genes such as uspA, hsp90, groEL and lon genes | Camarena et al. (2010)
Bacillus anthracis | Solexa/SOLiD | Combined transcriptome analyses on various growth conditions using Solexa and SOLiD sequencers | Improvement; sufficient correlation was achieved between RNA-seq and microarray data. Novel observations; previously non-annotated regions were identified | Passalacqua et al. (2009)
Burkholderia cenocepacia | Solexa | Comparison of isolates from soil and cystic fibrosis (CF) patient | Novel observations; 12 ncRNAs preferentially expressed in soil isolate; 1 ncRNA expression biased in patient isolate; large number of regulatory differences detected between soil and CF strain | Yoder-Himes et al. (2009)
Chlamydia trachomatis | 454 | Comparison of gene expression between elementary bodies (EB) and reticulate bodies (RB) | Novel observations; transcripts in 84 genes were differently expressed; 42 genome and 1 plasmid-derived ncRNAs were identified; the ncRNA ctrR0332 was predicted to be involved in the EB-RB transition | Albrecht et al. (2010)
Helicobacter pylori | 454/Solexa | Identification of growth condition- and host-specific TSSs using dRNA-seq | Novel observations; hundreds of TSSs reside within operons and antisense to annotated genes; about 60 small RNAs and regulators of mRNAs were found | Sharma et al. (2010)
Listeria monocytogenes | Solexa | Comparison of transcriptome between L. monocytogenes 10403S strain and its sigB deficient strain | Novel observations; expression of 96 genes depends on SigB factor; 67 ncRNAs including 7 putative ncRNAs were expressed at the stationary phase | Oliver et al. (2009)
Mycoplasma pneumoniae | 454/Solexa | Transcriptome analysis under various conditions (growth phase, heat shock, DNA damage, and halt of cell cycle) using both RNA-seq and tiling array | Novel observations; 117 novel transcripts that seemed to be ncRNAs were identified under differential conditions; 89 of these transcripts located antisense to previously annotated genes; 139 transcripts among the identified 341 operons were polycistronic; half of these operons showed staircase-like expression pattern; the operons could be classified into 447 smaller transcriptional units | Guell et al. (2009)
Salmonella enterica serovar Typhi | Solexa | Identification of transcriptional template strands using strand-specific cDNA sequencing (ssRNA-seq) | Improvement; ssRNA-seq facilitates the re-annotation of a number of genes. Novel observations; 40 novel ncRNAs were identified | Perkins et al. (2009)
Vibrio cholerae | 454 | Investigation of improvement in novel direct cloning technique with RNA size selection and depletion of tRNA and 5S RNA | Improvement; depletion of tRNA and 5S RNA by specific oligos and RNaseH was preferentially improved in short RNA investigation. Novel observations; 500 putative intergenic sRNAs: 127 putative antisense RNAs | Liu et al. (2009)

Chlamydia trachomatis

Chlamydia trachomatis, a causative agent of a sexually transmitted disease and/or a contagious eye infection, was subjected to transcriptome sequencing (Albrecht et al., 2010). The authors compared the transcriptome of C. trachomatis in two states: metabolically inactive elementary bodies (EB) and metabolically active reticulate bodies (RB), which can replicate in vacuoles inside host cells. In this study, the Roche/454 GS-FLX system was used and the sequences obtained were used to determine transcriptional start sites (TSS). To identify primary TSS, cDNA libraries from both EB and RB were sequenced; one library was generated from untreated total RNA and the other was constructed following enrichment of primary transcripts by selective enzymatic degradation of “processed RNA species” (see section for details). Transcripts of 84 genes showed distinct expression levels between EB and RB. In addition, 42 genome-derived ncRNAs and 1 plasmid-derived ncRNA were identified. Among these ncRNAs, ctrR0332, encoded in the genome, showed approximately ten times greater expression in EB than in RB. This result suggests that ctrR0332 plays an important role in the EB-RB transition (Albrecht et al., 2010). The precise identification of TSS should lead to a better understanding of genome organization as well as the control of bacterial behavior.

Acinetobacter baumannii

Transcriptome analysis of A. baumannii was carried out using Illumina technology (Camarena et al., 2010). In this study, cDNA libraries (obtained from mRNA that was enriched by removal of 23S and 16S rRNA) were prepared from cultures with or without ethanol, since previous reports showed that ethanol increases the virulence of the pathogen in both Caenorhabditis elegans and Dictyostelium discoideum (Wanner, 1987; Smith et al., 2004). Sequence data showed that 49 genes were upregulated in the presence of ethanol. Some of these genes encoded metabolic enzymes, including several highly induced dehydrogenases for ethanol, suggesting that A. baumannii uses these enzymes to oxidize ethanol to acetate. Genes encoding stress proteins, including hsp90, groEL, and lon, were also induced in the presence of ethanol. These genes are involved in the heat-shock stress response (HSR) in many bacterial species (Asadulghani et al., 2003; Green and Donohue, 2006; Qin et al., 2006; Slamti et al., 2007; Audia et al., 2008; Martinez-Salazar et al., 2009). The HSR is controlled by the rpoH gene encoding sigma factor σ32 (Yura et al., 1993) and has been shown to be required for optimal virulence in some bacteria, including Vibrio tapetis and Neisseria gonorrhoeae (Du et al., 2005; Lakhal et al., 2008). In addition, a previous report showed that A. baumannii carrying a transposon insertion in rpoH showed attenuated virulence in the presence of ethanol (Smith et al., 2007). Therefore, the authors suggested that ethanol could increase the virulence of the bacterium through the induction of heat-shock proteins such as Hsp90, GroEL and Lon. Furthermore, ethanol-dependent upregulation was also observed for a secretory phospholipase C. Since deletion of the phospholipase C gene in the bacterium diminished its cytotoxicity toward epithelial cells, this gene may be significantly involved in the virulence of the pathogen.

Listeria monocytogenes

RNA-seq was also used to examine the transcriptome of L. monocytogenes, in addition to the tiling array approach noted above. In this study, the authors analyzed differences in the transcriptome between L. monocytogenes strain 10403S and its sigB-deficient strain using Illumina sequencing (Oliver et al., 2009). cDNA libraries were obtained following enrichment of mRNA by removal of 23S and 16S rRNA and were size-fractionated to 60-200 nucleotides. The authors identified transcripts of 96 genes that were expressed in a sigB-dependent manner. According to the RNA-seq data, the bacterium expressed 67 ncRNAs, including seven novel ncRNAs. Furthermore, a total of 65 putative sigB promoters were identified upstream of 82 of the 96 sigB-dependent genes and upstream of the one sigB-dependent ncRNA. This study provided comprehensive insight into prokaryotic transcriptional regulation through comparison of a mutant lacking a transcriptional regulator with its parent strain.
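Comparing a regulator mutant with its parent strain, as in the sigB study above, ultimately comes down to asking which genes have lower normalized read counts in the mutant. The Python sketch below illustrates one generic way to express this: length- and depth-normalized expression (reads per kilobase per million mapped reads, RPKM) followed by a log2 fold change. It is an assumption made for illustration, not the statistical procedure used by Oliver et al. (2009); the gene names, lengths, read counts and library sizes are all hypothetical.

```python
import math


def rpkm(read_count, gene_length_bp, total_mapped_reads):
    """Reads per kilobase of gene per million mapped reads."""
    return read_count * 1e9 / (gene_length_bp * total_mapped_reads)


def log2_fold_change(parent, mutant, pseudocount=1.0):
    """log2 ratio of parent to mutant expression; the pseudocount
    avoids division by zero for genes with no reads in the mutant."""
    return math.log2((parent + pseudocount) / (mutant + pseudocount))


# Hypothetical data: gene -> (length in bp, reads in parent, reads in mutant)
counts = {"geneA": (1000, 500, 50), "geneB": (2000, 400, 390)}
TOTAL_PARENT = 1_000_000  # hypothetical mapped reads, parent library
TOTAL_MUTANT = 1_000_000  # hypothetical mapped reads, mutant library

for gene, (length, n_parent, n_mutant) in counts.items():
    lfc = log2_fold_change(
        rpkm(n_parent, length, TOTAL_PARENT),
        rpkm(n_mutant, length, TOTAL_MUTANT),
    )
    print(f"{gene}: log2 fold change (parent/mutant) = {lfc:.2f}")
```

In this toy example geneA is strongly reduced in the mutant and would be a candidate regulator-dependent gene, whereas geneB is essentially unchanged. A real analysis would also need replicates and a statistical test before calling any gene dependent on the regulator.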

Helicobacter pylori

Sharma et al. (2010) analyzed the transcriptome of H. pylori strain 26695 with the Roche/454 GS-FLX system and Illumina technology. In this study, a new approach named differential RNA-seq (dRNA-seq) was employed to identify primary TSS. Primary transcripts include most precursor mRNAs and small RNAs (sRNAs) and carry a 5′ tri-phosphate (5′PPP) group, whereas processed transcripts, including mature rRNA and tRNA, harbor a 5′ mono-phosphate (5′P). The authors presented a single-nucleotide resolution map of the primary transcriptome of H. pylori by discriminating primary transcripts with native 5′PPP ends from processed species (5′P) following treatment with a 5′P-dependent exonuclease. Total RNA was extracted from the pathogen under various conditions, such as different growth phases, acid stress, and different host cells. After removal of genomic DNA with DNase I and treatment with the 5′P-dependent exonuclease, the cDNA libraries were analyzed with the 454 system and mapped to the H. pylori chromosome to identify TSS. Solexa sequencing for operon mapping was also performed under the same growth conditions. TSSs were identified within operons and antisense to annotated genes. These observations suggested that the major factors increasing transcriptional complexity in H. pylori were the uncoupling of polycistrons and genome-wide antisense transcription. Approximately 60 small ncRNAs were detected in this study. These included 6S RNA, a ubiquitous riboregulator of RNA polymerase that had not previously been identified in ε-proteobacteria (Barrick et al., 2005; Weinberg et al., 2007). dRNA-seq could identify TSS at the genome-wide level and uncovered a surprisingly large number of novel ncRNAs and antisense transcripts in H. pylori. This approach should be applicable to all bacterial species in which native transcripts carry a 5′PPP and is likely to be widely adopted.
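The logic of dRNA-seq described above can be sketched as a comparison of read 5′ ends between two libraries: positions where reads start much more often in the exonuclease-treated library (enriched for 5′PPP transcripts) than in the untreated library are candidate primary TSS. The Python sketch below is a minimal illustration of that idea only. The thresholds, the pseudocount, the function name and the example counts are all assumptions, and the sketch ignores library-size normalization and the manual curation that a real analysis would require; it is not the procedure of Sharma et al. (2010).

```python
def call_primary_tss(treated, untreated, min_reads=10, min_ratio=2.0,
                     pseudocount=1.0):
    """Return genome positions whose read 5'-end counts are enriched in
    the exonuclease-treated library relative to the untreated library.

    treated, untreated: dicts mapping position -> number of read starts.
    min_reads, min_ratio, pseudocount: illustrative thresholds (assumed).
    """
    tss = []
    for pos in sorted(treated):
        t = treated[pos]
        u = untreated.get(pos, 0)
        ratio = (t + pseudocount) / (u + pseudocount)
        if t >= min_reads and ratio >= min_ratio:
            tss.append(pos)
    return tss


# Hypothetical read-start counts at three genome positions.
treated = {100: 50, 250: 40, 400: 3}
untreated = {100: 10, 250: 45, 400: 0}

print(call_primary_tss(treated, untreated))
```

Here position 100 is called as a primary TSS because its read starts are enriched after treatment, position 250 is rejected because it is equally abundant in both libraries (as expected for a processed 5′ end), and position 400 is rejected for lack of read support.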

Mycoplasma pneumoniae

A different approach was used to study the transcriptome of M. pneumoniae (Guell et al., 2009), in which three methods (spotted arrays, tiling arrays, and RNA-seq) were used in combination. RNA-seq and tiling array data obtained from bacteria grown under four different conditions (growth phase, heat shock, DNA damage, and interruption of the cell cycle) revealed 117 novel transcripts. Almost all of the novel transcripts appeared to be ncRNAs rather than structural RNAs, and 89 of them were antisense with respect to previously annotated genes. Among the 341 operons identified, 139 were polycistronic, and half of the operons showed decay patterns in transcription. This suggests that such staircase-like expression is a widespread phenomenon in bacteria. Comparison of transcriptomes obtained under various growth conditions suggested that the operons could be divided into 447 smaller transcriptional units. In addition, alternative transcripts that depended on growth conditions were detected in the spotted array data. These studies clearly indicate an unexpected complexity of the bacterial transcriptome, comparable to that known in eukaryotes.
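
Classifying a novel transcript as antisense, as described above, amounts to checking whether it overlaps an annotated gene on the opposite strand. The sketch below illustrates that check under simple assumptions (1-based inclusive coordinates, strands given as "+" or "-"); the class, function names, and coordinates are hypothetical and are not taken from Guell et al. (2009).

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Feature:
    name: str
    start: int   # 1-based, inclusive
    end: int     # 1-based, inclusive
    strand: str  # "+" or "-"

def overlaps(a, b):
    """True if the two features share at least one base."""
    return a.start <= b.end and b.start <= a.end

def antisense_partners(transcript, genes):
    """Names of annotated genes overlapped by the transcript on the opposite strand."""
    return [g.name for g in genes
            if g.strand != transcript.strand and overlaps(transcript, g)]

# Hypothetical annotation and novel transcripts, for illustration only.
genes = [Feature("gene1", 100, 900, "+"), Feature("gene2", 1200, 1800, "-")]
novel1 = Feature("novel1", 500, 700, "-")     # opposite strand, inside gene1
novel2 = Feature("novel2", 950, 1100, "+")    # intergenic
novel3 = Feature("novel3", 1300, 1400, "-")   # same strand as gene2
print(antisense_partners(novel1, genes))
```

Transcripts that overlap a gene on the same strand, such as novel3, are not reported because they are not antisense.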

Concluding Remarks

In a very short period of time, bacterial transcriptomics using tiling arrays and massively parallel sequencing has improved remarkably and has become a powerful tool for understanding host–bacteria interactions. A number of studies have revealed that the bacterial transcriptome is much more complicated than previously thought. As in eukaryotes, RNA molecules are key factors in regulating gene expression in prokaryotes (Waters and Storz, 2009). Among these, regulatory RNAs, including antisense RNAs and riboswitches, have been shown to modulate pathogenesis (Toledo-Arana et al., 2007), iron metabolism (Masse et al., 2007), and quorum sensing (Bejerano-Sagie and Xavier, 2007) through regulation of gene expression. The number of novel types of RNA molecules found by tiling arrays and massively parallel sequencing is increasing rapidly (Toledo-Arana and Solano, 2010). Current bioinformatic algorithms and databases cannot readily predict the existence and functions of all of the novel RNAs or small proteins detected, but several studies have attempted to clarify their functions by validating their differential expression patterns (see reviews: Romby et al., 2006; Sharma and Vogel, 2009; Sorek and Cossart, 2010). During infection, bacteria colonize the host environment not as single entities but as communities. Therefore, elucidating the transcriptome of bacterial communities is essential for a more complete understanding of host–bacteria interactions. Metatranscriptomics has emerged as an approach for this purpose. Several metatranscriptomic studies have recently been performed on microbial communities in soil or ocean water (Leininger et al., 2006; Frias-Lopez et al., 2008; Gilbert et al., 2008; Urich et al., 2008). In metatranscriptomics, total RNA is extracted from a microbial community, converted into cDNA, and sequenced without gene-specific primers (DeLong, 2009). 
Moreover, in this approach there is no need to limit the number of genes surveyed or to select specific genes to target (Moran, 2009). Therefore, metatranscriptomics may become one of the most powerful tools for understanding bacterial regulation and adaptation upon infection within complex microbial communities. Bacterial transcriptomics using tiling arrays and/or massively parallel sequencing will be used more frequently in the coming years. These high-throughput technologies will continue to improve through smaller amounts of starting material, longer reads, larger numbers of reads, and lower costs. As the volume of data becomes almost overwhelming, new approaches for information management and interpretation will also be developed. In the future, these technologies will therefore become more convenient and can serve as general tools for bacterial transcriptome analysis. Selecting the appropriate technology for a given purpose remains an issue for many researchers. Massively parallel sequencing provides clear advantages over tiling arrays because it offers both single-base resolution and high mapping resolution (Marguerat et al., 2008; Wang et al., 2009). Tiling arrays, on the other hand, are inherently biased by the chip design and frequently miss alternative and antisense transcripts (Wang et al., 2009). However, massively parallel sequencing also has several drawbacks. It is more expensive than array-based analysis, and the large datasets it produces require highly efficient software and high-performance computers. In contrast, tiling arrays are a good tool for a first screening of a bacterial transcriptome because they are more cost effective and the resulting data can be analyzed with conventional computers at the level of a laboratory or an individual researcher. 
In this regard, Guell et al. (2009) provided a valuable report in which they used tiling arrays and massively parallel sequencing to study the transcriptome of M. pneumoniae. They reported that sequencing data alone were insufficient to clearly detect operon boundaries when genes were expressed at low levels. They also found that a combined analysis using both technologies provides a more accurate landscape of the bacterial transcriptome. Taken together, tiling arrays give valuable data for a first analysis of a bacterial transcriptome. If more detailed data are necessary, for example to determine the boundaries of an mRNA, we recommend adding massively parallel sequencing data to the tiling array data. Transcriptome analysis allows the identification or prediction of novel bacterial virulence factors required for adaptation and survival within host environments as well as for the enhancement of disease potential. In addition, combining transcriptome analyses with clinical or other experimental analyses (e.g., proteomic or metabolic analysis) will enable us to identify novel functions relating to gene expression. This will provide new insights into the molecular mechanisms of host–bacteria interactions and also enhance our ability to develop potential targeting molecules more efficiently. Such comprehensive analyses will therefore continue to increase our understanding of the molecular complexity of host–bacteria interactions.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  86 in total

Review 1.  Applications of DNA tiling arrays for whole-genome analysis.

Authors:  Todd C Mockler; Simon Chan; Ambika Sundaresan; Huaming Chen; Steven E Jacobsen; Joseph R Ecker
Journal:  Genomics       Date:  2005-01       Impact factor: 5.736

Review 2.  Metagenomics: application of genomics to uncultured microorganisms.

Authors:  Jo Handelsman
Journal:  Microbiol Mol Biol Rev       Date:  2004-12       Impact factor: 11.056

Review 3.  The role of RNAs in the regulation of virulence-gene expression.

Authors:  Pascale Romby; François Vandenesch; E Gerhart H Wagner
Journal:  Curr Opin Microbiol       Date:  2006-03-10       Impact factor: 7.934

4.  6S RNA is a widespread regulator of eubacterial RNA polymerase that resembles an open promoter.

Authors:  Jeffrey E Barrick; Narasimhan Sudarsan; Zasha Weinberg; Walter L Ruzzo; Ronald R Breaker
Journal:  RNA       Date:  2005-04-05       Impact factor: 4.942

5.  Global gene expression and the role of sigma factors in Neisseria gonorrhoeae in interactions with epithelial cells.

Authors:  Ying Du; Jonathan Lenz; Cindy Grove Arvidson
Journal:  Infect Immun       Date:  2005-08       Impact factor: 3.441

6.  Activity of Rhodobacter sphaeroides RpoHII, a second member of the heat shock sigma factor family.

Authors:  Heather A Green; Timothy J Donohue
Journal:  J Bacteriol       Date:  2006-08       Impact factor: 3.490

7.  Microbial synergy via an ethanol-triggered pathway.

Authors:  Michael G Smith; Shelley G Des Etages; Michael Snyder
Journal:  Mol Cell Biol       Date:  2004-05       Impact factor: 4.272

8.  The RNA-binding protein Hfq of Listeria monocytogenes: role in stress tolerance and virulence.

Authors:  Janne K Christiansen; Marianne H Larsen; Hanne Ingmer; Lotte Søgaard-Andersen; Birgitte H Kallipolitis
Journal:  J Bacteriol       Date:  2004-06       Impact factor: 3.490

9.  Genome-wide transcriptional analysis of temperature shift in L. interrogans serovar lai strain 56601.

Authors:  Jin-Hong Qin; Yue-Ying Sheng; Zhi-Ming Zhang; Yao-Zhou Shi; Ping He; Bao-Yu Hu; Yang Yang; Shi-Gui Liu; Guo-Ping Zhao; Xiao-Kui Guo
Journal:  BMC Microbiol       Date:  2006-06-09       Impact factor: 3.605

10.  Using pyrosequencing to shed light on deep mine microbial ecology.

Authors:  Robert A Edwards; Beltran Rodriguez-Brito; Linda Wegley; Matthew Haynes; Mya Breitbart; Dean M Peterson; Martin O Saar; Scott Alexander; E Calvin Alexander; Forest Rohwer
Journal:  BMC Genomics       Date:  2006-03-20       Impact factor: 3.969
