Alexandra J Lee, Taylor Reiter, Georgia Doing, Julia Oh, Deborah A Hogan, Casey S Greene.
Abstract
A gene expression compendium is a heterogeneous collection of gene expression experiments assembled from data collected for diverse purposes. The widely varied experimental conditions and genetic backgrounds across samples create a tremendous opportunity for gaining a systems-level understanding of the transcriptional responses that influence phenotypes. Variety in experimental design is particularly important for studying microbes, where transcriptional responses integrate many signals and demonstrate plasticity across strains, including responses to which nutrients are available and which microbes are present. Advances in high-throughput measurement technology have made it feasible to construct compendia for many microbes. In this review, we discuss how these compendia are constructed and analyzed to reveal transcriptional patterns.
Keywords: Compendia; Machine learning; Microbiology; Transcriptomics
Year: 2022 PMID: 36016717 PMCID: PMC9396250 DOI: 10.1016/j.csbj.2022.08.012
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 6.155
Examples of existing microbial compendia.
| Name | Organism | Description | Experiments | Samples | Genes | Platform |
|---|---|---|---|---|---|---|
| | | Compendium containing | 109 | 950 | 5,549 | Affymetrix platform GPL84 |
| | | Compendium containing | >100 | 2,333 | 5,563 (PAO1) | RNA-seq |
| | | Compendium containing | 127 | 2,198 | 4,189 | Affymetrix E. coli Genome 2.0 Array GPL3154; |
| EcoGEC | | Compendium containing | 144 | 2,262 | 4,166 | Affymetrix E. coli Genome 2.0 Array; |
| Unnamed | | Compendium containing E. coli gene array data downloaded from GEO, ArrayExpress, and the Stanford Microarray Database. It includes a mixture of different experimental conditions. | 74 | 870 | NA | Affymetrix; P33; spotted cDNA/DNA; spotted oligonucleotides |
| Unnamed | | Compendium containing E. coli RNA-seq data downloaded from GEO. It includes a mixture of different experimental conditions. | 21 | 278 | 3,923 | RNA-seq |
| Unnamed | | Compendium containing S. aureus RNA-seq data downloaded from GEO, combined with RNA-seq data generated by this publication. It includes expression profiles from exposure to various media conditions, antibiotics, nutrient sources, and other stressors. | 8 | 109 | 2,581 | RNA-seq |
| Unnamed | | Compendium containing | >151 | 1,909 | >2,000 | two-color cDNA microarray hybridization assay |
| Refine.bio | Many prokaryotes | Database containing processed compendia for multiple prokaryotes including P. aeruginosa, E. coli and sacch. The data for these compendia were downloaded from SRA, GEO, and ArrayExpress. | ~40 to >500 | ~300 to ~13,000 | ~5,000 | microarray; |
*Note: in SRA, a sample is referred to as an “Experiment” and a group of samples forming an experiment is referred to as a “Study”.
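
The SRA terminology in the note above can be made concrete with a small sketch. It assumes a hypothetical, simplified record layout (study accession, experiment accession, per-gene expression values); the accessions, gene names, and values below are invented for illustration and do not come from any compendium listed here. The function tallies the experiment, sample, and gene counts typically used to describe a compendium.

```python
from collections import defaultdict

# Hypothetical toy records: (SRA Study, SRA Experiment, {gene: expression}).
# All accessions, gene names, and values are made up for illustration.
records = [
    ("SRP000001", "SRX000001", {"geneA": 5.1, "geneB": 0.3}),
    ("SRP000001", "SRX000002", {"geneA": 4.8, "geneB": 0.9}),
    ("SRP000002", "SRX000003", {"geneA": 1.2, "geneB": 7.4}),
]


def summarize(records):
    """Count experiments (SRA Studies), samples (SRA Experiments), and genes."""
    samples_by_experiment = defaultdict(list)
    genes = set()
    for study, sra_experiment, expression in records:
        # An SRA "Study" corresponds to an experiment;
        # an SRA "Experiment" corresponds to a single sample.
        samples_by_experiment[study].append(sra_experiment)
        genes.update(expression)  # adds the gene names (dict keys)
    return {
        "experiments": len(samples_by_experiment),
        "samples": sum(len(s) for s in samples_by_experiment.values()),
        "genes": len(genes),
    }


summary = summarize(records)
print(summary)  # {'experiments': 2, 'samples': 3, 'genes': 2}
```

Real compendia would be built from downloaded metadata and expression matrices rather than inline tuples; the grouping logic is the only part this sketch is meant to show.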