| Literature DB >> 20515496 |
Christian Hödar1, Rodrigo Assar, Marcela Colombres, Andrés Aravena, Leonardo Pavez, Mauricio González, Servet Martínez, Nibaldo C Inestrosa, Alejandro Maass.
Abstract
BACKGROUND: The importance of in silico predictions for understanding cellular processes is now widely accepted, and a variety of algorithms useful for studying different biological features have been designed. In particular, the prediction of cis regulatory modules in non-coding human genome regions represents a major challenge for understanding gene regulation in several diseases. Recently, studies of the Wnt signaling pathway revealed a connection with neurodegenerative diseases such as Alzheimer's. In this article, we construct a classification tool that uses the transcription factor binding site motifs composition of some gene promoters to identify new Wnt/beta-catenin pathway target genes potentially involved in brain diseases.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20515496 PMCID: PMC2996972 DOI: 10.1186/1471-2164-11-348
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Number of CART trees that declare a gene to be a Wnt/β-catenin pathway target. Red points correspond to genes already known to be targets of the Wnt/β-catenin pathway, and black points represent genes not previously identified as Wnt/β-catenin pathway targets. The vertical axis denotes the "score" of each individual gene. Genes are ordered decreasingly with the score. The horizontal line represents the threshold value C associated to the highest percentile.
A sample of relevant transcription factors
| Entrez ID | Symbol | Name | I1 | Score |
|---|---|---|---|---|
| 2908 | NR3C1 (GR) | 822.6 | 1500 | |
| 5077 | PAX3 | 1389.7 | 1489 | |
| 6932 | TCF -1 | 3.3 | 1485 | |
| 51176 | LEF1 | 68.5 | 1500 | |
| 3172 | HNF4a | 90.2 | 1497 | |
| 4150 | MAZ | 6.3 | 1316 | |
| 4520 | MTF1 | 27.2 | 1476 |
Figure 2Gene Ontology enrichment for training and proposed Wnt/β-catenin pathway targets. Enriched nodes are colored in different intensities of green depending of adjusted p-values. Thus, more significant enrichment corresponds to the more intense green. First ratio corresponds to the proportion of GO terms in the human genome and the second ratio corresponds to the proportion of GO terms in the study group. A) GO terms enrichment for the known Wnt/β-catenin pathway target genes used as training group. B) GO terms enrichment in the same terms, for the proposed Wnt/β-catenin pathway target genes.
A sample of predicted Wnt/β-catenin pathway target genes
| Entrez ID | Symbol | Name | Score |
|---|---|---|---|
| 814 | CAMK4 | 1489 | |
| 84152 | PPP1R1B | 489 | |
| 8503 | PIK3R3 | 310 | |
| 27124 | PIB5PA | 196 | |
| 7168 | TPM1 | 422 | |
| 8871 | SYNJ2 | 316 | |
| 1917 | EEF1A2 | 384 | |
| 3705 | ITPK1 | 213 | |
| 6854 | SYN2 | 184 | |
| 10236 | HNRPR | 258 | |
| 8507 | ENC1 | 177 |
Gene expression change measured by RT-qPCR for 9 predicted Wnt/β -catenin targets and 2 controls
| Entrez ID | Symbol | Name | CART Score | Fold Change |
|---|---|---|---|---|
| 133 | ADM | 260 | 1.91* | |
| 1399 | CRKL | 138 | 2.57* | |
| 1917 | EEF1A2 | 384 | 1.13¥ | |
| 3705 | ITPK1 | 213 | 1.84* | |
| 8503 | PIK3R3 | 310 | 1.47* | |
| 8507 | ENC1 | 177 | 1.31¥ | |
| 27242 | TNFRSF21 | 404 | 1.14¥ | |
| 50674 | NEUROG3 | 586 | 1.90* | |
| 11040 | PIM2 | 266 | 1.18 | |
| 334 | APLP2 | Ctrol | 1.06 | |
| 9997 | SCO2 | Ctrol | 1.03 |
*p-value < 0.05, ¥ involved in brain development or abnormalities (see text for references).
Figure 3Classic structure of a CART tree. The first node of the tree is subdivided into two finer nodes depending on whether v1 is lower than c1. The resulting nodes are further subdivided to determine the assigned class.
Figure 4General structure of the proposed method. We trained 1,500 CART trees using 66 known target genes marked as black dots and two groups of 8,000 randomly chosen genes from a list of 15,476 genes in the human genome. The first group is used to produce a first tree and the second to prune and evaluate it. The classification method is the consolidation of the results of the 1,500 CART trees.
Comparative analysis of the method and robustness
| Method | Instance 1 | Instance 2 | Instance 3 | Instance 4 | Prior | New |
|---|---|---|---|---|---|---|
| Instance 1 | 155 (100%) | 150 (97%) | 147 (95%) | 151 (97%) | 66 (100%) | 89 |
| Instance 2 | 150 (97%) | 155 (100%) | 147 (95%) | 150 (97%) | 66 (100%) | 89 |
| Instance 3 | 147 (95%) | 147 (95%) | 155 (100%) | 149 (96%) | 66 (100%) | 89 |
| Instance 4 | 151 (97%) | 150 (97%) | 149 (96%) | 155 (100%) | 66 (100%) | 89 |
| KNN 1 | 1 (1%) | 1 (1%) | 1 (1%) | 1 (1%) | 0 (0%) | 30 |
| KNN 2 | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 17 |
| SVM | 66 (43%) | 66 (43%) | 66 (43%) | 66 (43%) | 66 (100%) | 0 |
| CART | 58 (37%) | 58 (37%) | 58 (37%) | 58 (37%) | 44 (67%) | 46 |
| L-1-O (avg) | 147,8 (95%) | 145,6 (94%) | 144 (93%) | 145,7 (94%) | 66 (100%) | 89 |
List of primers used for RT-qPCR in this study
| Entrez ID | Symbol | Primer Sense (5'- > 3') | Primer Antisense (5'- > 3') |
|---|---|---|---|
| 133 | ADM | TGGGTTCGCTCGCCTTCCTA | CATCCGCAGTTCCCTCTTCC |
| 1399 | CRKL | TGATTCCTGTCCCTTATGT | GGTCTGAGGTTGAGCGTAT |
| 1917 | EEF1A2 | CCTTCAAGTATGCCTGGGTG | CAGTCCGCCTGGGATGTAC |
| 3705 | ITPK1 | CGGCTTGACTTTCCCATTC | CTCGCCAACCACGAACACC |
| 8503 | PIK3R3 | CATTACCAGCAGACATCC | CTCTTCCCACTTCCTCTTT |
| 8507 | ENC1 | TGGGAGATGTGACAGCAA | CAGTAGGAATCAGCGAGTA |
| 27242 | TNFRSF21 | CCCACAGGACAAGAACAA | AGCCGCTGGATGTAGAGT |
| 79962 | DNAJC22 | CAGCTTGAGGGTCTAAGGATA | GGTTACTCGCAGCACAGAA |
| 50674 | NEUROG3 | GGCTGTGGGTGCTAAGGGTAA | CAGGGAGAAGCAGAAGGAACAAG |
| 11040 | PIM2 | CTCAGCCCAGGATTCTTTA | AGAGCACTTGGGATAACAGA |
| 334 | APLP2 | GTGGAATAGGGAACTGTAAT | GGGGAAGTGAACGGTAAAA |
| 9997 | SCO2 | AGTGGGTGCTGATGTACTTTG | CGCAGCCCGTTTAATGATGG |