| Literature DB >> 26586805 |
Marco Moretto1, Paolo Sonego2, Nicolas Dierckxsens3, Matteo Brilli4, Luca Bianco2, Daniela Ledezma-Tejeida5, Socorro Gama-Castro5, Marco Galardini6, Chiara Romualdi7, Kris Laukens8, Julio Collado-Vides5, Pieter Meysman8, Kristof Engelen9.
Abstract
COLOMBOS is a database that integrates publicly available transcriptomics data for several prokaryotic model organisms. Compared to the previous version it has more than doubled in size, both in terms of species and data available. The manually curated condition annotation has been overhauled as well, giving more complete information about samples' experimental conditions and their differences. Functionality-wise cross-species analyses now enable users to analyse expression data for all species simultaneously, and identify candidate genes with evolutionary conserved expression behaviour. All the expression-based query tools have undergone a substantial improvement, overcoming the limit of enforced co-expression data retrieval and instead enabling the return of more complex patterns of expression behaviour. COLOMBOS is freely available through a web application at http://colombos.net/. The complete database is also accessible via REST API or downloadable as tab-delimited text files.Entities:
Mesh:
Year: 2015 PMID: 26586805 PMCID: PMC4702885 DOI: 10.1093/nar/gkv1251
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Overview of the data available in COLOMBOS
| Strain | Number of genes | Number of contrasts | Missing values (%) | First inclusion | Samples | Experiments | Platforms | |
|---|---|---|---|---|---|---|---|---|
| MG1655 | 4321 | 4077 | 3.6 | v1.0 | 5510 | 254 [15] | 73 | |
| 168 | 4176 | 1259 | 3.7 | v1.0 | 1814 | 45 | 35 | |
| cross-strain | 6261 | 1066 | 41.6 | v1.0 | 1856 | 36 | 22 | |
| LT2 | 4556 | 172 | 6.4 | v2.0 | 316 | 8 | 10 | |
| 14028S | 5416 | 681 | 22.7 | v2.0 | 1252 | 17 | 7 | |
| SL1344 | 4655 | 213 | 9.8 | v2.0 | 288 | 11 | 9 | |
| A3(2) | 8239 | 371 | 7.3 | v2.0 | 546 | 7 [2] | 7 | |
| PAO1 | 5647 | 559 | 1.6 | v2.0 | 592 | 33 | 2 | |
| 26695 | 1616 | 133 | 3.1 | v2.0 | 256 | 8 | 5 | |
| Ames | 5039 | 66 | 3.0 | v3.0 | 75 | 4 | 4 | |
| ATCC 14579 | 5231 | 283 | 2.4 | v3.0 | 392 | 16 | 10 | |
| VPI-5482 | 4816 | 333 | 1.9 | v3.0 | 353 | 19 | 4 | |
| NCTC 11168 | 1572 | 152 | 12.5 | v3.0 | 260 | 14 | 11 | |
| ATCC 824 | 3778 | 377 | 2.4 | v3.0 | 419 | 12 | 11 | |
| GG | 2834 | 79 | 3.6 | v3.0 | 158 | 3 | 2 | |
| S2 | 1722 | 364 | 1.5 | v3.0 | 728 | 19 | 3 | |
| 301 | 4315 | 35 | 17.0 | v3.0 | 38 | 3 | 3 | |
| 1021 | 6218 | 424 | 2.7 | v3.0 | 713 | 20 [19] | 10 | |
| D39 | 1914 | 68 | 5.7 | v3.0 | 136 | 12 | 2 | |
| HB8 | 2173 | 444 | 1.4 | v3.0 | 480 | 6 | 3 | |
| CO92 | 3979 | 36 | 6.1 | v3.0 | 72 | 5 | 2 |
Rows of the table represent all the species and strains for which a gene expression compendium is hosted. Columns represent (from left to right): the species name, the strain used as reference genome for microarray probe to gene mapping and RNA-Seq read alignment, the total number of genes in the compendium, the total number of contrasts in the compendium, the percentage of missing values, the COLOMBOS version of the first inclusion of the respective species or strain, the total number of samples from which the compendium's contrasts are built, the total number of corresponding experiments on GEO and ArrayExpress (the latter indicated between square brackets) and the total number of platforms represented.