| Literature DB >> 17650321 |
Merja Heinäniemi1, J Oskari Uski, Tatjana Degenhardt, Carsten Carlberg.
Abstract
BACKGROUND: Peroxisome proliferator-activated receptors (PPARs) are known for their critical role in the development of diseases, such as obesity, cardiovascular disease, type 2 diabetes and cancer. Here, an in silico screening method is presented, which incorporates experiment- and informatics-derived evidence, such as DNA-binding data of PPAR subtypes to a panel of PPAR response elements (PPREs), PPRE location relative to the transcription start site (TSS) and PPRE conservation across multiple species, for more reliable prediction of PPREs.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17650321 PMCID: PMC2323243 DOI: 10.1186/gb-2007-8-7-r147
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Systematic variation from consensus DR1-type PPRE
| Percent binding strength | PPRE position | ||||||||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | |
| Consensus (90-100) | A/G | G | G | T | A/C | A | A | A | G | G | T | C | A |
| Class I (60-90) | T | C | G | T | G | T | C/G | A/G | G | ||||
| Class II (30-60) | C | T | A/T | A/C/G | T | T | C/G | C | A/C/T | T | C/T | ||
| Class III (0-30) | A/C | C/G | T | A/C | A | ||||||||
| Consensus (90-100) | A/G | G | G | T | C/G | A | A | A | G | G | T | C | A |
| Class I (60-90) | C/G | A/T | T | G | T | C/G | A/G/T | G | |||||
| Class II (30-60) | C/T | A/T | T | A | C | C | A/C/T | C/T | |||||
| Class III (0-30) | C | A/C | C/G/T | G | T | A/C | A | ||||||
| Consensus (90-100) | A/G | G | G | T | C | A | A | A | G | G | T | C | A |
| Class I (60-90) | C/G | G/T | T | G | T | G/T | |||||||
| Class II (30-60) | C | A/T | T | A | A | A/T | C/G | A | G/C/T | ||||
| Class III (0-30) | T | C | A/C | C/G/T | C/G | C/T | C | A/C | A | ||||
The binding strengths of in vitro translated PPAR-RXR heterodimers to 39 systematic variations of the DR1-type consensus PPRE AGGTCAAAGGTCA were determined by gelshift assays in reference to this consensus PPRE. Based on their average binding strength, all variations are sorted into three classes.
Figure 1Testing the RE classification scheme on natural DR1-type sequences. The average binding strength of in vitro translated PPAR-RXR heterodimers to DR1-type PPREs was determined by gelshift assays in reference to the consensus PPRE AGGTCAAAGGTCA, including all categories (that is, combinations of the classes I, II and III) that resulted in an average binding of at least 1%. Variations from the consensus PPRE are highlighted in green for PPARα, in dark blue for PPARγ and in light blue for PPARβ/δ. In total, the in vitro binding data of 136 different REs were used (the non-binding DR1-type REs are shown in Additional data file 1), with a minimum of six sequences for each category. SD, standard deviation.
Figure 2ROC curves comparing in silico methods. (a) A PSWM constructed from 20 medium and strong PPREs that contain multiple variations, and a PSAM constructed using the single nucleotide data and ten initializations of PPRE classifier created based on Table 1 and random sampling of Figure 1 and Additional data file 1 were compared for their ability to detect binding. True positive rates (TPRs) and false positive rates (FPRs) were calculated, with false positives given when no binding was detected despite prediction, and false negatives given when binding was detected but not predicted (correlation of matrix scores to predicted binding was done based on lines fitted to correlation plots shown in Additional data file 3). A line of no discrimination is a diagonal line and optimum performance approaches the value (0, 1). For clarity, only one representative instance of a PPRE classifier is shown in (a). (b) To assess how good the predicted experimental binding estimates were, the performance of the method used was tested with a 15% tolerance interval for a match to experimental binding (5% when prediction was 15% or less) using a single cut-off (the optimal cut-off was 3% for the classifier, 25% or a score of 0.0000015 for PSAM, and 20% or a score of 4.7 for PSWM) and calculating again the FPR and TPR for each method. False positives in this case represented predictions that were too high and false negatives predictions that were too low.
Figure 3In silico analysis of selected primary PPAR target genes. Overview of the genomic organization of eight human PPAR target genes; 10 kB upstream and downstream of the TSSs are shown (horizontal black line). Putative REs (red boxes, no conservation; orange boxes, within conserved area; yellow boxes, conserved) were identified using the classifier by in silico screening of the genomic sequences and are classified according to their degree of conservation compared to the orthologous mouse gene. Already published PPREs are indicated by an asterisk. For each predicted RE the calculated binding strengths of PPARα (green), PPARγ (dark blue) and PPARβ/δ (light blue) in reference to a consensus DR1-type PPRE are represented by column height. All putative PPRE sequences are available on request. For the UCP3 gene REs, the average in vitro DNA binding strength of PPAR-RXR heterodimers was also determined by gelshift experiments and is shown in the same color code scheme. Horizontal red bars indicate the genomic regions that were subcloned for reporter gene assays (Figure 4) and were analyzed by ChIP assays (Figure 5).
Genomic PCR primers
| Gene (region) | Location | Primer sequences (5'-3') |
| -4919 to -4643 | TGAGCTCTT | |
| -1646 to -1374 | TGAGCTCTT | |
| +599 to +716 | TGAGCTCTT | |
| +2822 to +3154 | TGAGCTCTT | |
| -6765 to -6535 | TGAGCTCTT | |
| +2829 to +3610 | TGAGCTCTT | |
| -6429 to -6143 | TGAGCTCTT | |
| -4249 to -3886 | TGAGCTCTT | |
| -870 to -568 | TGAGCTCTT | |
| -262 to -3 | ATTTCTAGA | |
| +2424 to +2722 | TGAGCTCTT | |
| +7701 to +8022 (relative to | TGAGCTCTT | |
| -306 to -64 | ATTTCTAGA | |
| -1376 to -1156 | TGAGCTCTT | |
| -938 to -634 | TGAGCTCTT | |
| -7279 to -7040 | TGAGCTCTT | |
| -510 to -70 | TGAGCTCTT | |
| -510 to +119 (subcloned -266 to +119) | TGAGCTCTT | |
| -6104 to -5797 | ATTTCTAGA | |
| -9680 to -9349 | TGAGCTCTT | |
| -396 to -89 | TGAGCTCTT | |
| +2036 to +2303 | TGAGCTCTT | |
| +5971 to +6236 | TGAGCTCTT | |
| -5297 to -4917 | TGAGCTCTT | |
| -2819 to -2499 | TGAGCTCTT | |
| -1389 to -978 | TGAGCTCTT |
Sequence and location of the primer pairs used for real-time PCR quantification of genomic regions containing putative REs within the nine PPAR target genes. The positions indicated are in relation to the respective annotated gene TSS. The same primers were used for subcloning; the gene-specific sequences are indicated in bold.
Figure 4Extra-genomic functionality of the PPRE-containing promoter regions of PPAR target genes. Reporter gene assays were performed with extracts from (a-c) HEK293 and (d-f) HepG2 cells that were transiently transfected with luciferase reporter constructs containing genomic regions of eight human PPAR target genes (please note that the APOC3 gene forms a cluster with the genes APOC1 and APOC4). These were co-transfected with empty expression vector (endogenous PPAR) or the indicated expression vectors for PPARα, PPARγ and PPARβ/δ. Cells were then treated for 16 h with solvent or PPAR subtype-specific ligands. Relative luciferase activity was determined and normalized to the activity of empty cloning vector control co-transfected with empty expression vector (dashed horizontal red line). The genomic regions were subdivided according to their location into close to TSS (a, d), upstream of TSS (b, e) and downstream of TSS (c, f); for further details see Figure 3 and Table 2. Columns represent the means of at least three experiments and bars indicate standard deviations. Two-tailed Student's t-tests were performed to determine the significance of the ligand induction in reference to solvent controls (*p < 0.05, **p < 0.01, ***p < 0.001).
Functionality of genomic regions
| Genomic region | Predicted binding | Response in RGA | Association of PPARα | Association of RXRα | Association of pPol II | PPRE status |
| Strong | + | + | + | + | + | |
| Weak | Down | + | + | + | + | |
| Medium | Down | + | + | - | + | |
| Weak | +/down | + | + | + | + | |
| Strong | + | - | - | - | ± | |
| Medium | + | + | + | + | + | |
| Medium | + | + | + | + | + | |
| No DR1 | - | +* | +* | +* | - | |
| Medium | Down | + | + | + | ||
| Medium | - | - | - | - | - | |
| Not binding | - | - | - | - | - | |
| Weak | +/down | + | + | - | + | |
| Medium | ± | - | - | - | - | |
| Medium | ± | - | - | - | - | |
| Strong | +/down | + | + | + | + | |
| Weak | - | + | + | - | ± | |
| Strong | + | + | + | + | + | |
| Strong | + | + | + | + | + | |
| Medium | + | + | + | + | + | |
| Strong | + | - | + | + | + | |
| Strong | + | - | - | - | ± | |
| Weak | - | - | - | - | - | |
| Medium | Down | + | + | + | + |
The data from reporter gene assay (RGA) and ChIP assays are summarized for each genomic region tested. The PPRE status indicates the conclusion drawn from the assays concerning the functionality of each region, with '+' assigned to functional regions, '-' to non-functional regions and '±' where the two assays were not in agreement. *Impossible to assess independent of adjacent region.
Figure 5Association of genomic regions of PPAR target genes with PPARs and their partner proteins. Chromatin was extracted from HEK293 cells that had been treated with solvent (DMSO) or for 120 minutes with 100 nM GW7647. The association of PPARα, RXRα and pPol II was monitored by ChIP assays with respective antibodies on genomic regions of the eight PPAR target genes that are (a) close to the TSS, (b) upstream of the TSS and (c) downstream of the TSS; for location see Figure 3 and Table 2. Since the APOC3 gene is not expressed in HEK293 cells, the data for its four genomic regions were obtained using chromatin derived from HepG2 cells. Real-time quantitative PCR was performed on chromatin templates and the fold change of the antibody-precipitated template in relation to an IgG-precipitated specificity control template was calculated. (d) PPARα shows specific association with 15 of the 23 tested regions and the relative association with these regions is shown. Columns represent means of at least three experiments and bars indicate standard deviations. Two-tailed Student's t-tests were performed to determine the significance of association in reference to IgG controls (*p < 0.05, **p < 0.01, ***p < 0.001).
Figure 6SOM analysis of established primary PPAR target genes, clusters I and II. Overview of the genomic organization of 38 known human PPAR target genes (left) and their mouse orthologs (right); 10 kB upstream and downstream of the TSS are shown in this and Figure 7. Please note that for the mouse CYP1A1 gene and the human FADS2 gene, there are discrepancies between the Ensembl (E) and NCBI (N) databases; therefore, both versions are shown. Putative PPREs (red boxes, no conservation; orange boxes, within conserved area; yellow boxes, conserved) were identified by in silico screening of the genomic sequences and are classified according to their degree of conservation between mouse and human. Already published PPREs are indicated by an asterisk. For each of the predicted PPREs, the calculated binding strengths of PPARα (green), PPARγ (dark blue) and PPARβ/δ (light blue) in reference to a consensus DR1-type PPRE are represented by column height. All putative PPRE sequences are available on request. The genes were sorted by SOM analysis with respect to overall PPRE pattern similarity and their evolutionary conservation into (a) cluster I and (b) cluster II.
Figure 7SOM analysis of established primary PPAR target genes, clusters III and IV. The genes were sorted by SOM analysis with respect to overall PPRE pattern similarity and their evolutionary conservation into (a) cluster III and (b) cluster IV. For more details, see the Figure 6 legend.
Figure 8Conservation patterns across multiple species. The genes ACOX1 and ANGPLT4 from chicken, chimpanzee, dog, rat and zebrafish were also analyzed. Putative PPREs (red boxes, no conservation; orange boxes, within conserved area; yellow boxes, conserved in human; pink boxes, conserved in mouse) were identified by in silico screening of the genomic sequences. For each of the predicted PPREs, the calculated binding strengths of PPARα (green), PPARγ (dark blue) and PPARβ/δ (light blue) in reference to a consensus DR1-type PPRE are represented by column height. All putative PPRE sequences are available on request.
Predicted PPAR target genes in human chromosome 19
| Ensembl ID (human) | Gene name | Ensembl ID (mouse) |
| ENSG00000004776 | ENSMUSG00000036854 | |
| ENSG00000004777 | ENSMUSG00000036882 | |
| ENSG00000005007 | ENSMUSG00000058301 | |
| ENSG00000010310 | ENSMUSG00000030406 | |
| ENSG00000032444 | ENSMUSG00000004565 | |
| ENSG00000039987 | ENSMUSG00000052819 | |
| ENSG00000060566 | ENSMUSG00000035041 | |
| ENSG00000063176 | ENSMUSG00000057342 | |
| ENSG00000063241 | ENSMUSG00000052605 | |
| ENSG00000064547 | ENSMUSG00000031861 | |
| ENSG00000072954 | ENSMUSG00000031791 | |
| ENSG00000072958 | ENSMUSG00000003309 | |
| ENSG00000076944 | ENSMUSG00000004626 | |
| ENSG00000077348 | ENSMUSG00000061286 | |
| ENSG00000079435 | ENSMUSG00000053714 | |
| ENSG00000080031 | ENSMUSG00000035429 | |
| ENSG00000080511 | ENSMUSG00000053773 | |
| ENSG00000083807 | ENSMUSG00000030382 | |
| ENSG00000083838 | ENSMUSG00000033961 | |
| ENSG00000089327 | ENSMUSG00000009687 | |
| ENSG00000089639 | ENSMUSG00000036246 | |
| ENSG00000099203 | ENSMUSG00000032180 | |
| ENSG00000099308 | ENSMUSG00000031833 | |
| ENSG00000099331 | ENSMUSG00000004677 | |
| ENSG00000099617 | ENSMUSG00000003070 | |
| ENSG00000099622 | ENSMUSG00000045193 | |
| ENSG00000099800 | ENSMUSG00000020219 | |
| ENSG00000104826 | ENSMUSG00000038194 | |
| ENSG00000104859 | ENSMUSG00000061028 | |
| ENSG00000104863 | ENSMUSG00000003872 | |
| ENSG00000104870 | ENSMUSG00000003420 | |
| ENSG00000104918 | ENSMUSG00000012705 | |
| ENSG00000104936 | ENSMUSG00000030409 | |
| ENSG00000104946 | ENSMUSG00000038520 | |
| ENSG00000104960 | ENSMUSG00000038502 | |
| ENSG00000104980 | ENSMUSG00000002949 | |
| ENSG00000105066 | ENSMUSG00000061374 | |
| ENSG00000105173 | ENSMUSG00000002068 | |
| ENSG00000105204 | ENSMUSG00000002409 | |
| ENSG00000105287 | ENSMUSG00000041187 | |
| ENSG00000105289 | ENSMUSG00000034917 | |
| ENSG00000105364 | ENSMUSG00000003299 | |
| ENSG00000105374 | ENSMUSG00000004612 | |
| ENSG00000105379 | ENSMUSG00000004610 | |
| ENSG00000105398 | ENSMUSG00000074375 | |
| ENSG00000105447 | ENSMUSG00000053801 | |
| ENSG00000105467 | ENSMUSG00000040231 | |
| ENSG00000105516 | ENSMUSG00000059824 | |
| ENSG00000105552 | ENSMUSG00000030826 | |
| ENSG00000105664 | ENSMUSG00000031849 | |
| ENSG00000105701 | ENSMUSG00000019428 | |
| ENSG00000105707 | ENSMUSG00000001249 | |
| ENSG00000108106 | ENSMUSG00000060860 | |
| ENSG00000118046 | ENSMUSG00000003068 | |
| ENSG00000119574 | ENSMUSG00000049600 | |
| ENSG00000123154 | ENSMUSG00000005150 | |
| ENSG00000125910 | ENSMUSG00000044199 | |
| ENSG00000125912 | ENSMUSG00000020238 | |
| ENSG00000126246 | ENSMUSG00000036826 | |
| ENSG00000126247 | ENSMUSG00000001794 | |
| ENSG00000127526 | ENSMUSG00000019731 | |
| ENSG00000129355 | ENSMUSG00000066860 | |
| ENSG00000129451 | ENSMUSG00000030693 | |
| ENSG00000129455 | ENSMUSG00000047884 | |
| ENSG00000130165 | ENSMUSG00000013822 | |
| ENSG00000130288 | ENSMUSG00000036199 | |
| ENSG00000130300 | ENSMUSG00000034845 | |
| ENSG00000130303 | ENSMUSG00000046718 | |
| ENSG00000130402 | ENSMUSG00000054808 | |
| ENSG00000130520 | ENSMUSG00000031848 | |
| ENSG00000130522 | ENSMUSG00000071076 | |
| ENSG00000130669 | ENSMUSG00000030602 | |
| ENSG00000130687 | ENSMUSG00000042831 | |
| ENSG00000130755 | ENSMUSG00000060791 | |
| ENSG00000130818 | ENSMUSG00000059475 | |
| ENSG00000130881 | ENSMUSG00000001802 | |
| ENSG00000131398 | ENSMUSG00000062785 | |
| ENSG00000132024 | ENSMUSG00000036686 | |
| ENSG00000133246 | ENSMUSG00000032739 | |
| ENSG00000141837 | ENSMUSG00000034656 | |
| ENSG00000142009 | ENSMUSG00000056204 | |
| ENSG00000142290 | ENSMUSG00000036578 | |
| ENSG00000142513 | ENSMUSG00000012777 | |
| ENSG00000142538 | ENSMUSG00000038300 | |
| ENSG00000142539 | ENSMUSG00000008193 | |
| ENSG00000160113 | ENSMUSG00000002393 | |
| ENSG00000160318 | ENSMUSG00000038973 | |
| ENSG00000160396 | ENSMUSG00000040424 | |
| ENSG00000161249 | ENSMUSG00000060962 | |
| ENSG00000161558 | ENSMUSG00000002781 | |
| ENSG00000161677 | ENSMUSG00000038695 | |
| ENSG00000167460 | ENSMUSG00000031799 | |
| ENSG00000167470 | ENSMUSG00000035621 | |
| ENSG00000167578 | ENSMUSG00000053291 | |
| ENSG00000167754 | ENSMUSG00000074155 | |
| ENSG00000167757 | ENSMUSG00000067616 | |
| ENSG00000167772 | ENSMUSG00000002289 | |
| ENSG00000167775 | ENSMUSG00000002308 | |
| ENSG00000168813 | ENSMUSG00000044452 | |
| ENSG00000171236 | ENSMUSG00000037095 | |
| ENSG00000171443 | ENSMUSG00000051184 | |
| ENSG00000171570 | ENSMUSG00000058709 | |
| ENSG00000174521 | ENSMUSG00000007944 | |
| ENSG00000174562 | ENSMUSG00000055193 | |
| ENSG00000176531 | ENSMUSG00000061511 | |
| ENSG00000178093 | ENSMUSG00000047654 | |
| ENSG00000180448 | ENSMUSG00000035697 | |
| ENSG00000180739 | ENSMUSG00000045087 | |
| ENSG00000185761 | ENSMUSG00000043822 | |
| ENSG00000185800 | ENSMUSG00000030410 | |
| ENSG00000186474 | ENSMUSG00000044430 | |
| ENSG00000196867 | ENSMUSG00000062861 | |
| ENSG00000197050 | ENSMUSG00000058402 | |
| ENSG00000198356 | ENSMUSG00000052456 | |
| ENSG00000204673 | ENSMUSG00000011096 | |
| ENSG00000205155 | ENSMUSG00000036835 |
All 956 genes of human chromosome 19 that have known mouse orthologs were screened in silico for strong and medium PPREs within 10 kB upstream and downstream of the gene's annotated TSS. All putative PPRE sequences are available on request. The 116 genes that carry, in both species, a strong PPRE or two or more medium PPREs, or a medium PPRE within 500 bp upstream of the TSS are listed. The 50 genes that pass the even more stringent criterion of three PPREs, including one strong, are highlighted in bold.
Figure 9Validation of novel PPAR target genes on human chromosome 19. (a) Real-time quantitative PCR was used to determine the inducibility of the mRNA expression of the indicated eight PPAR target genes, relative to the control gene RPLP0, in HepG2 cells. The cells were stimulated for 2, 4 and 6 h with 100 nM GW7647. (b) An overview of the genomic organization of the human LASS1 gene; 10 kB upstream and downstream of the TSS are shown. Putative REs were identified by in silico screening and the calculated binding strengths of the PPAR subtypes are represented by columns in reference to a consensus DR1-type PPRE. All putative PPRE sequences are available on request. (c) Reporter gene assays were performed with extracts from HepG2 cells that were transiently transfected with luciferase reporter constructs containing genomic regions of the LASS1 gene together with empty expression vector (endogenous PPAR) or the indicated expression vectors for PPARα, PPARγ and PPARβ/δ. Cells were then treated for 16 h with solvent or PPAR subtype-specific ligands. Relative luciferase activity was determined and normalized to the activity of empty cloning vector control co-transfected with empty expression vector. (d) Chromatin was extracted from HepG2 cells that had been treated with solvent or for 120 minutes with 100 nM GW7647. The association of PPARα, RXRα and pPol II was monitored by ChIP assays with respective antibodies on three genomic regions of the LASS1 gene. Real-time quantitative PCR was performed on chromatin templates and fold change of antibody-precipitated template in relation to IgG-precipitated specificity control template was calculated. Columns in (a, c, d) represent means of at least three experiments and bars indicate standard deviations. Two-tailed Student's t-tests were performed to determine the significance (*p < 0.05, **p < 0.01).
PCR primer pairs for quantitative real-time PCR
| Gene | Primer pairs (5'-3') | Product size (bp) |
| GTATGGAATCAGTCAGAACGC | 261 | |
| GAGCCTCTCTGGAGGCTGGTG | 334 | |
| CATGCAGGGTTACATGAAGCAC | 325 | |
| TTCTGCCTTTACTTGGTCTCCA | 124 | |
| TGCTGTCTTCTGTGATGAAC | 268 | |
| CATCTTCGCAATCCATCACAAC | 174 | |
| CAGCTTGAGTTCACCAAGCTC | 266 | |
| GTGGCTTTCATGGACCAG | 344 | |
| GAGCGACTCGATCCTGCTGAC | 173 | |
| AGGACCAGACAGTGATGTTC | 343 | |
| CAGGTTGTGAGGGTAAGGTG | 169 | |
| GATTATGTAGTGGACAAAGCAC | 296 | |
| CACCTGCTCACTGACAACTTC | 247 | |
| CAGAGGATGACGGACAAGTG | 172 | |
| AGATGCAGCAGATCCGCAT | 318 |
PCR product sizes are indicated.