| Literature DB >> 19091052 |
Kevin Kontos1, Patrice Godard, Bruno André, Jacques van Helden, Gianluca Bontempi.
Abstract
BACKGROUND: Nitrogen is an essential nutrient for all life forms. Like most unicellular organisms, the yeast Saccharomyces cerevisiae transports and catabolizes good nitrogen sources in preference to poor ones. Nitrogen catabolite repression (NCR) refers to this selection mechanism. All known nitrogen catabolite pathways are regulated by four regulators. The ultimate goal is to infer the complete nitrogen catabolite pathways. Bioinformatics approaches offer the possibility to identify putative NCR genes and to discard uninteresting genes.Entities:
Year: 2008 PMID: 19091052 PMCID: PMC2654973 DOI: 10.1186/1753-6561-2-s4-s5
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Abbreviations and short descriptions of variables.
| Abbreviation | Description |
| NUM | Number of GATA boxes |
| 1-GAP, 2-GAP, 3-GAP, B-GAP | First, second and third smallest, and biggest GATA gaps |
| M-GAP, MI-GAP, SD-GAP | Mean, median and standard deviation (sd) of all GATA gaps |
| Minimum number of bp spanning over | |
| UP- | N{1, i}GATA |
| DOWN- | GATAN{1, i} |
| GAP- | N{1, i}GATAN{1, i} |
| F-POS, L-POS | Positions of the first and of the last GATA boxes, resp. |
| M-POS, MI-POS, SD-POS | Mean, median and sd of the positions of all GATA boxes |
Figure 1GATA boxes in the upstream non-coding sequences of ANCR genes. Graphical map of the GATA boxes in the upstream non-coding sequences of ANCR genes generated with RSAT [6].
Figure 2GATA boxes in the upstream non-coding sequences of NNCR genes. Graphical map of the GATA boxes in the upstream non-coding sequences of NNCR genes generated with RSAT [6].
Performance assessment. VS and CLASS stand for variable selection method and classifier, respectively.
| Negative control | |||||||
| VS | CLASS | BER | AUC | AUCext | BER | AUC | AUCext |
| Filter | NB | 0.31 | 0.93 | 0.95 | 0.49 ± 0.022 | 0.50 ± 0.072 | 0.63 ± 0.023 |
| KNN | 0.18 | 0.90 | 0.91 | 0.51 ± 0.021 | 0.51 ± 0.077 | 0.34 ± 0.088 | |
| SVM | 0.13* | 0.93 | 0.98 | 0.48 ± 0.060 | 0.50 ± 0.097 | 0.67 ± 0.026 | |
| NB | 0.24 | 0.95 | 0.91 | 0.49 ± 0.054 | 0.50 ± 0.130 | 0.48 ± 0.016 | |
| Wrapper | KNN | 0.20 | 0.97 | 0.66 | 0.48 ± 0.045 | 0.52 ± 0.100 | 0.41 ± 0.073 |
| SVM | 0.13* | 0.95 | 0.88 | 0.47 ± 0.066 | 0.58 ± 0.130 | 0.58 ± 0.042 | |
Gene set comparisons. VS and CLASS stand for variable selection method and classifier, respectively.
| VS | CLASS | Bar-Joseph et al., 2003 [ | Godard et al., 2007 [ | Scherens et al., 2006 [ |
| Filter | NB | 0.05 2.9 × 10-16 | 0.09 (3.5 × 10-7) | 0.06 (2.4 × 10-13) |
| KNN | 0.06 (9.4 × 10-9) | 0.09 (4.8 × 10-5) | 0.07 (1.1 × 10-7) | |
| SVM | 0.11 (1.5 × 10-13) | 0.15 (9.0 × 10-10) | 0.14 (8.2 × 10-14) | |
| Wrapper | NB | 0.07 (9.1 × 10-11) | 0.11 (7.7 × 10-18) | 0.08 (4.3 × 10-16) |
| KNN | 0.12 (7.7 × 10-14) | 0.20 (7.0 × 10-28) | 0.16 (5.2 × 10-26) | |
| SVM | 0.13 (8.9 × 10-11) | 0.16 (7.2 × 10-14) | 0.13 (2.6 × 10-11) | |