| Literature DB >> 22978616 |
Pierre-Yves Dupont1, Audrey Guttin, Jean-Paul Issartel, Georges Stepien.
Abstract
BACKGROUND: The analysis of gene promoters is essential to understand the mechanisms of transcriptional regulation required under the effects of physiological processes, nutritional intake or pathologies. In higher eukaryotes, transcriptional regulation implies the recruitment of a set of regulatory proteins that bind on combinations of nucleotide motifs. We developed a computational analysis of promoter nucleotide sequences, to identify co-regulated genes by combining several programs that allowed us to build regulatory models and perform a crossed analysis on several databases. This strategy was tested on a set of four human genes encoding isoforms 1 to 4 of the mitochondrial ADP/ATP carrier ANT. Each isoform has a specific tissue expression profile linked to its role in cellular bioenergetics.Entities:
Mesh:
Substances:
Year: 2012 PMID: 22978616 PMCID: PMC3477019 DOI: 10.1186/1471-2164-13-482
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Steps of the bioinformatics study. 1.1 - Phylogenetic alignment of ANT genes in various mammalian species to filter unusable sequences and identification of matrices shared by aligned mammalian sequences from each ANT isoform gene (via Genomatix FrameWorker tool); 1.2 identification of TFBS in each human ANT isoform gene (MatInspector tool); 2 - Construction of regulation models by combining the selected matrices TFBS, IUPAC strings from bibliography; 3 - location of models in the whole human genome (3.1), in the ElDorado promoter database (3.2), and search for genes with models conserved in orthologous promoters (3.3); 4 - Identification of potentially co-regulated genes by GeneProm analysis.
Figure 2Evolutionary relationships of taxa. The evolutionary history was inferred using the Neighbor-Joining method [15]. The percentage of replicate trees in which the associated taxa clustered together in the bootstrap test (100 replicates) is shown next to the branches [16]. The tree is drawn to scale, with branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree. The evolutionary distances (number of base substitutions per site) were computed using the Maximum Composite Likelihood method [17]. The analysis involved 29 nucleotide sequences. All positions containing gaps and missing data were eliminated.
Examples of gene models in mammals
| OXBOX | V$GAT1. 04* | CAAT | O$VTATA.01 | (TSS) | |
| GRBOX | V$MZF1.01 | V$EGR1.02 | V$SP1.01 | O$VTATA.01 | |
| V$CTCF* | V$CHRE* | V$RXRF* | V$RORA* | ATG | |
| V$SMAD3.01* | V$MZF1.02* | V$MAZ.01* | V$HIFF* | V$HIFF* |
Matrix families (i.e. V$MEF2) or matrices (i.e. V$NRF2.01) with an asterisk were identified from phylogenetic analyses and showed to be conserved in orthologous promoters (Genomatix FrameWorker tool). The other matrices are selected with the Genomatix MatInspector tool on the human promoter sequences.
Figure 3Examples of human models constructed from the 4 genes sequences. Part of the promoter sequences and the beginning of the first exon (in bold and underlined) are shown for each isoform. Matrix core sequences, nucleotide strings and TSS are highlighted in black and matrix flanking sequences in grey. Amodel: GGCTCTAAA (OXBOX) / GATAA (V$GATA1.04) / CAAT / TATAA (O$VTATA.01) / G (TSS) /ATG; model: CATTGTT (GRBOX) / TCCCC (V$MZF1.01) / GGGC (EGR1.02) / TATAA (O$VTATA.01) / G (TSS) / ATG; model: GAGGG (V$CTCF) / CACGGG (V$CHRE) / ACCTA (V$RORA) / C (TSS); model: GTCTGG (V$SMAD3.01) / GGGGA (V$MZF1.02) / CCCCTC (V$MAZ.01) / CACGTG (V$HIFF) / G (TSS)/ ATG.
Genes co-regulated with
| Anoctamin-1 | Ca++-activated chloride channel, higher levels in liver and | |
| | ||
| Arrestin domain-containing protein 3 | Associated with plasma membrane, highly expressed in | |
| | ||
| ATP synthase, beta polypeptide | H + transporting, | |
| | ||
| ATP synthase subunit δ, mitochondrial | ||
| | ||
| Cation-transporting ATPase 13A4 | ATP + H2O = ADP + phosphate; expressed in heart, placenta, liver, | |
| | ||
| Protein chibby homolog 1 | Expressed at higher levels in heart, | |
| | ||
| Ubiquinone biosynthesis protein COQ7 | Involved in ubiquinone biosynthesis, expressed in heart and | |
| | ||
| Cytochrome c oxidase subunit VIb 2 | Connects the two | |
| | ||
| cytochrome c oxidase subunit VIIb | One of the nuclear-coded polypeptide chains of | |
| | ||
| Myosin IF | ||
| | ||
| NADH dehydrogenase 1 α- subcomplex, 9 | Subunit of the | |
| | ||
| NADH-ubiquinone oxido- reductase 75 kDa subunit | Subunit of the | |
Several constructed models of the ANT1 promoter region were screened as described in Figure 1 either on the full chromosomal human sequences or the human promoter library (results with an asterisk). Genes in bold are shown overexpressed in microarrays ( Additional file 3). Gene IDs are with 15 numbers (ex. ENSG00000023228).
Genes co-regulated with
| Serine/threonine-protein kinase 13 | Organizing microtubules during | |
| | ||
| B-cell translocation gene 1 protein | Anti-proliferative protein, associated with the | |
| | ||
| CDKN2A-interacting protein | Activates | |
| | ||
| CCAAT/enhancer-binding protein beta | Transcriptional activator of genes involved in immune responses | |
| | ||
| Creatine kinase B-type | ||
| | ||
| Collagen alpha-1(I) chain | Type I collagen is a member of group I collagen | |
| | ||
| DNA damage-indu- cible transcript 4 | ||
| | ||
| Dickkopf-related protein 1 | Antagonizes Wnt signalling by inhibiting LRP5/6 interaction with Wnt | |
| | ||
| Fibrinogen-like protein 1 | Hepatocyte | |
| | ||
| Growth arrest inducible GADD45 β | ||
| | ||
| Growth/differentiation factor 15 | Transforming | |
| | ||
| Hypoxia-inducible factor 1-alpha | Master transcriptional regulator of the | |
| | ||
| ATP-sensitive rectifier K+ channel 8 | This potassium channel is controlled by G proteins | |
| | ||
| Lymphocyte antigen 96 | Cooperates with TLR4 in response to bacterial lipopolysaccharide | |
| 154589 | | |
| C-type natriuretic peptide | ||
| | ||
| RAR responder protein 1 | Negative regulation of | |
| | ||
| Chromosome domain maintenance | ||
Several constructed models of the ANT2 promoter region were screened as described in Figure 1 either on the whole chromosome human sequences or the human promoter library (results with an asterisk). Function involved in cell proliferation is shown in bold characters. Genes in bold are shown overexpressed in microarrays ( Additional file 3). δNo correct probe set on Affymetrix chips corresponding to this gene. Gene IDs are with 15 numbers (ex. ENSG00000023228).
Genes co-regulated with
| Acetylglucosamine-6-phosphate deacetylase | N-acetyl-D-glucosamine 6-P + H2O = D-glucosamine 6-P + acetate | |
| | ||
| APEX nuclease (DNA repair enzyme) 1 | Repair of apurinic / apyrimidinic sites | |
| | ||
| cyclin-dependent kinase 4 | Cell cycle G1 phase progression | |
| | ||
| Caseinolytic peptidase B protein homolog | Function as a | |
| | ||
| Pre-mRNA 3'-end-processing factor FIP1 | Contributes to poly(A) site recognition and poly(A) addition | |
| | ||
| Unknown | Unknown ( | |
| | ||
| Fibronectin type-III domain-containing 3A | Mediates spermatid-Sertoli adhesion during | |
| | ||
| Glucose-6-phosphatase 2 | Glucose production, expressed | |
| | ||
| Glucose-6-phosphatase 3 | Glucose production in endoplasmic reticulum, expressed | |
| | ||
| K(lysine) acetyltransferase 5 | Chromatin remodelling with an abundant | |
| | ||
| Kelch-like protein 12 | Ubiquitin-protein E3 ligase complex adapter, highly expressed | |
| | ||
| lysosomal-associated membrane protein 1 | Binds amelogenin, differentially expressed in | |
| | ||
| RNA pseudouridylate synthase domain-containing | Unknown, | |
| | ||
| solute carrier family 2, member 4 (GLUT4) | Facilitated glucose transporter, | |
| | ||
| Germ cell-specific, oogenesis regulator and | ||
| | ||
| chr. 7 unc-84 homolog A | Nuclear anchorage/migration, expres- sion of | |
| | ||
| tudor domain containing 1 | Essential for | |
| | ||
| O-sialoglycoprotein endopeptidase | Unknown ( | |
| | ||
| transketolase-like 1 | Important role in transketolase activity, | |
| | ||
| transmembrane protein 184A = Sdmg1 | ||
| | ||
| ubiquitin-conjugating enzyme E2B | Post-replicative DNA damage repair | |
Several constructed models of the ANT4 promoter region were screened as described in Figure 1 either on the full chromosomal human sequences or the human promoter library (results with an asterisk). Function involved in spermatogenesis or in testis or prostate metabolism is shown in bold characters. Genes in bold are shown overexpressed in microarrays ( Additional file 3). Genes identified in the previous ModelInspector / GeneProm analysis [2]. δNo correct probe set on Affymetrix chips corresponding to this gene.
Figure 4Schematic representation of the role of ANT1 isoform in muscle cell oxidative phosphorylation. ADP3- and inorganic phosphate (Pi) are transported across the mitochondrial inner membrane (MIM) into the mitochondrial matrix by the mitochondrial ANT and phosphate carrier (PiC), respectively. F1F0-ATPase combines Pi and ADP to form ATP, which is then exchanged for ADP across the MIM by ANT1 then across MOM (mitochondrial outer membrane). The whole reaction is driven by a proton gradient maintained mainly by the respiratory chain. Six of the genes identified from our promotology analysis encode proteins included in the oxidative phosphorylation (respiratory chain and F0-F1 ATP synthase proteins).
Figure 5Schematic representation of the proposed indirect / co-regulation in glycolytic conditions. In glycolytic conditions, the mitochondrial hexokinase isoform, HK II, generates ATP from cytoplasmic glucose 6-P (G-6P). The ATP4− is then imported into mitochondria by the ANT2 isoform, contributing to the maintenance of the mitochondrial membrane potential (ΔΨm). The HKII gene transcription is induced by the HIF1-alpha protein. CBP/p300, CREB-binding protein; ARNT, aryl hydrocarbon nuclear translocator; HRE, hypoxia response element.
Figure 6Schematic representation of the role of ANT4 isoform in bioenergetics during spermatogenesis. The glucose-6-phosphatases 2 and 3 (G6PC) generates ATP from glucose 6-P (G-6P) produced by the cytoplasmic hexokinase, HK. The ATP4− is imported into mitochondria across the MOM (mitochondrial outer membrane) through the voltage-dependent anion channel (VDAC), and then across the MIM (the mitochondrial inner membrane) by the ANT4 isoform. ATP4− contributes to the maintenance of the mitochondrial membrane potential (ΔΨm) in spermatozoid mitochondria. The hydrolysis of imported glycolytic ATP4− by the F1 component of the ATP synthase leads to 1 - the release of ADP3− in mitochondria with the gain of a negative charge on the matrix side; and 2 - the ejection of a proton into the intermembrane space through the F0 component.
Figure 7GeneProm filters for the respective orientations of the model, the identified gene, the EMBL contig and the chromosomal DNA strand. 2000 nt was the maximal distance between the 5’ end of the model and the TSS of a gene. A (model + / clone + / chromosome +): a’ = x + a and z = x + a + 2000; B (−/+/−): a’ = x + a and z = x + a – 2000; C (− /- / +): a’ = y - a and z = y – a + 2000; D (+/−/-): a’ = y - a and z = y – a – 2000.