| Literature DB >> 22559291 |
Jumamurat R Bayjanov1, Douwe Molenaar, Vesela Tzeneva, Roland J Siezen, Sacha A F T van Hijum.
Abstract
BACKGROUND: Linking phenotypes to high-throughput molecular biology information generated by ~omics technologies allows revealing cellular mechanisms underlying an organism's phenotype. ~Omics datasets are often very large and noisy with many features (e.g., genes, metabolite abundances). Thus, associating phenotypes to ~omics data requires an approach that is robust to noise and can handle large and diverse data sets.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22559291 PMCID: PMC3366882 DOI: 10.1186/1471-2164-13-170
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Phenotypic measurements of strains
| Medium enriched with | Phenotypesa |
|---|---|
| D-arabitol | Yes (7), Maybeb (11), No (21) |
| D-melezitose | Yes (34), No (5) |
| D-raffinose | Yes (33), Maybe (1), No(5) |
| D-sorbitol | Yes (35), No (4) |
| D-turanose | Yes (32), No (7) |
| Glycerol | Maybe (4), No (35) |
| K-gluconate | Yes (26), Maybe (9), No (4) |
| L-Arabinose | Yes (26), Maybe (3), No (10) |
| L-Rhamnose | Yes (6), Maybe (8), No (25) |
| Methyl-.d-glucopyranoside | Yes (8), Maybe (1), No (30) |
| Methyl-.d-mannopyranoside | Yes (27), Maybe (1), No (12) |
| Nitrogen-dioxide production | Yes (6), No (36) |
a: Numbers in parenthesis show number of strains with given phenotype, for instance there are 32 strains that could grow on sugar D-turanose and 7 strains that could not grow on this sugar. Phenotype and ~omics data are available at the web address of PhenoLink.
b: For some strains phenotypes could not be determined accurately resulting in an ambiguous phenotype "Maybe". Using the web-interface of PhenoLink such phenotypes can be discarded from the association analysis (see Methods and User's Guide available at PhenoLink's web page).
Figure 1Genes related to growth on Methyl-d-mannopyranoside. Relations of genes to growth on Methyl-d-mannopyranoside as visualized by PhenoLink. For gene annotations see Additional file 2. Color codes are explained in Figure 4. Colors of surrounding genes can be seen by running PhenoLink with the same configurations used here.
Figure 2Gene clusters related to utilization of multiple sugars. Relations of L-arabinose (A) and L-rhamnose (B) gene clusters to utilization of multiple sugars as visualized by PhenoLink. A cluster of 9 genes is related to multiple sugars and nitrogen-dioxide production (C). Partial relations between genes and growth on sorbitol are shown in Additional file 3. For gene annotations see Additional file 2. Color codes are explained in Figure 4. Colors of surrounding genes can be seen by running PhenoLink with the same configurations used here.
Figure 3Flow diagram for a web-tool PhenoLink. Default values of m and r are 3 and 5 respectively (see Methods section for more information).
Figure 4Integration of gene significance with its presence/absence in different strains. A feature (in this study a gene) that is found to be important to separate strains of different phenotypes is assumed important. Present (for the majority of strains): feature is present in at least p percent (default of 75%) of strains for a given phenotype. Absent (for the majority of strains): feature is absent in at least 75% of strains of a given phenotype. Remaining genes are present in a subset of strains.