| Literature DB >> 28317029 |
Christophe Bécavin1, Mikael Koutero2, Nicolas Tchitchek3, Franck Cerutti4, Pierre Lechat5, Nicolas Maillet5, Claire Hoede4, Hélène Chiapello4, Christine Gaspin4, Pascale Cossart2.
Abstract
As for many model organisms, the amount of Listeria omics data produced has recently increased exponentially. There are now >80 published complete Listeria genomes, around 350 different transcriptomic data sets, and 25 proteomic data sets available. The analysis of these data sets through a systems biology approach and the generation of tools for biologists to browse these various data are a challenge for bioinformaticians. We have developed a web-based platform, named Listeriomics, that integrates different tools for omics data analyses, i.e., (i) an interactive genome viewer to display gene expression arrays, tiling arrays, and sequencing data sets along with proteomics and genomics data sets; (ii) an expression and protein atlas that connects every gene, small RNA, antisense RNA, or protein with the most relevant omics data; (iii) a specific tool for exploring protein conservation through the Listeria phylogenomic tree; and (iv) a coexpression network tool for the discovery of potential new regulations. Our platform integrates all the complete Listeria species genomes, transcriptomes, and proteomes published to date. This website allows navigation among all these data sets with enriched metadata in a user-friendly format and can be used as a central database for systems biology analysis. IMPORTANCE In the last decades, Listeria has become a key model organism for the study of host-pathogen interactions, noncoding RNA regulation, and bacterial adaptation to stress. To study these mechanisms, several genomics, transcriptomics, and proteomics data sets have been produced. We have developed Listeriomics, an interactive web platform to browse and correlate these heterogeneous sources of information. Our website will allow listeriologists and microbiologists to decipher key regulation mechanism by using a systems biology approach.Entities:
Keywords: Listeria; database; genomics; proteomics; systems biology; transcriptomics
Year: 2017 PMID: 28317029 PMCID: PMC5350546 DOI: 10.1128/mSystems.00186-16
Source DB: PubMed Journal: mSystems ISSN: 2379-5077 Impact factor: 6.496
FIG 1 Overview of the Listeriomics platform. (Center) The five major tools of Listeriomics, i.e., gene conservation and synteny, coexpression network, genome viewer, expression, and protein atlas. (Left) Summary of all the available genomic information available on the website. (Right) List of all the transcriptomic information available in Listeriomics. (Bottom) View of all the proteomic information that can be accessed.
Summary of omics data sets included in the Listeriomics database
| Category | Data sizes | Data type(s) | Tools available |
|---|---|---|---|
| Genomics | 83 complete genomes (NCBI), all protein coding genes and noncoding RNAs, 304 small RNAs | Genome, phylogeny, genome elements, homologs | Genome summary, gene panel, small RNA panel, genome viewer |
| Transcriptomics | 362 biological conditions, 8 | Gene expression array, tiling array, TSS, RNA-Seq | Transcriptome summary, expression atlas, heat map, genome viewer |
| Proteomics | 74 biological conditions, 4 | Mass spectrometry | Proteome summary, protein atlas, heat map, genome viewer |
FIG 2 Transcriptomic and proteomic data sets available in the Listeriomics database. (A) Summary of all the transcriptomic data sets available at the Listeriomics website. In parentheses is the number of transcriptomics data sets available in the Listeriomics database for a specific biological condition. (B) Schematic representation of all the L. monocytogenes mutants for which transcriptomic data sets are available in the Listeriomics database. (C) Schematic representation of the number of transcriptomics data sets available for each L. monocytogenes growth phase. (D) Summary of all the proteomics data sets available at the Listeriomics website. In parentheses is the number of proteomics data sets available in the Listeriomics database for a specific biological condition.
FIG 3 Multi-omics genome viewer and coexpression network tool. (A) Genome viewer of representative omics data sets for L. monocytogenes EGD-e grown in BHI at 37°C to the exponential and stationary growth phases as indicated in the text. The genome viewer shows positive genome strand genes (in red), negative genome strand genes (blue), tRNAs and rRNAs (in yellow), small RNAs (in purple), riboswitches (in green), asRNAs (in light green), predicted operons (in orange) from reference 56, and predicted transcription terminators (22) (in blue circles). Exp, exponential. (B) Coexpression network of the virulence locus genes (lmo0200 to lmo0207) of L. monocytogenes EGD-e. Network nodes are genome elements (genes and noncoding RNAs) with the same color code as in the genome viewer tool. (C) Circular graph visualization of the coexpression network of the virulence locus genes (lmo0200 to lmo0207). Coexpression edges are displayed overlaid on a circular representation of the EGD-e genome.
FIG 4 Meta-analysis of the Listeriomics transcriptomic data sets. (A) Relational network built on the 362 transcriptomic biological conditions found in the Listeriomics database. Each node corresponds to a growth condition. The size of each node is proportional to the occurrence of each condition in the whole database. A link is drawn between two growth conditions if they are present in the same transcriptomic data set. (B) Heat map of the 15 genes with the highest ratio of differential expression. The value used for colorization is the number of data sets in which each gene has been found to be differently expressed. (C) Heat map of the six genes with no variability. (D) Pathway enrichment analysis of the 651 genes of L. monocytogenes EGD-e that are found differently expressed in >10% of the 279 data sets. We performed a pathway enrichment analysis by using COG information and the Fisher exact test P value.