| Literature DB >> 19403458 |
Wei Zhang1, Lijun He, Wanqing Liu, Chang Sun, Mark J Ratain.
Abstract
The putative role of (TG/CA)n repeats in the regulation of transcription has recently been reported for several cancer- and disease-related genes, including the genes encoding the epidermal growth factor receptor (EGFR), hydroxysteroid (11-beta) dehydrogenase 2 (HSD11B2) and interferon-gamma (IFNG). These studies indicated a correlation between gene expression levels and the presence or length of (TG/CA)n repeats in their intron 1 regions. A genome-wide search for genes with similar features may provide evidence of whether these dinucleotide repeats represent a class of universal regulators of gene expression, which has recently begun to be investigated as a quantitative complex phenotype. Using a public database of simple repeats, we identified 330 genes containing potentially polymorphic long (TG/CA)n repeats (n >or= 12) in their intron 1 regions. One known physiological pathway, the calcium signalling pathway, was found to be enriched among the genes containing long repeats. In addition, certain biological processes, such as cation transport, signal transduction and ion transport, were found to be enriched in these genes. Genotyping of the long repeats showed that the majority of these dinucleotide repeats were polymorphic in the HapMap CEU (Caucasians from Utah, USA) samples of northern and western European ancestry. Evidence for a significant association between these repeats and gene expression was not observed in the genes selected based on their expression profiles in the HapMap CEU samples. Our current findings, therefore, do not support a role for these repeats as a class of universal gene expression regulators. A more comprehensive evaluation of the relationship between these repeats and gene expression, potentially in other tissues, may be necessary to illustrate their roles in gene regulation in the future.Entities:
Mesh:
Year: 2009 PMID: 19403458 PMCID: PMC2735212 DOI: 10.1186/1479-7364-3-3-236
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Figure 1Schematic of the intron 1 regions covered in the analysis. The 5' windows (2.5 kb) and 3' windows (2.5 kb) of the intron 1 regions were included in the analysis. (a) The intron 1 region is smaller than 5 kb. (b) The intron 1 region is larger than 5 kb.
Primers used in genotyping and real-time PCR
| Experiment | Affymetrix transcript cluster IDa | Gene | Description | Direction | Primer |
|---|---|---|---|---|---|
| Genotyping | 3930360 | Runt-related transcription factor 1 | Forward | 5'-CCTCTGCTGCCAGTAAAGAAG-3' | |
| Reverse | 5'-Fam-CCGGCCACTTTATCACAGC-3' | ||||
| 2719656 | CD38 molecule | Forward | 5'-CCTGTCCAGATACTTGATACTTTC-3' | ||
| Reverse | 5'-Fam-AGTCCAGAGTGCTGCCAGT-3' | ||||
| 3577940 | Calmin | Forward | 5'-Fam-TGCCTCGAAGGTACTTAGGAAGT-3' | ||
| Reverse | 5'-AGAAGTTTGTACGTGGGCTCTTG-3' | ||||
| 3160175 | Very low density lipoprotein receptor | Forward | 5'-AGCACACTGTACTCCTAATGCCTG-3' | ||
| Reverse | 5'-Fam-AGGACAGGCATCAGCATTTTC-3' | ||||
| 2370433 | Calcium channel, voltage-dependent, R type, alpha 1E subunit | Forward | 5'-Fam-AATGTTGTGTGCTGTCCCAA-3' | ||
| Reverse | 5'-TTGCATACAGTGGGCACACA-3' | ||||
| 2991233 | Aryl hydrocarbon receptor | Forward | 5'-Fam-AGTTGTACAGACAACCTGGAGA-3' | ||
| Reverse | 5'-CAGTGATGTATGTGGAGAAGTAAT-3' | ||||
| 2878074 | Neuregulin 2 | Forward | 5'-TTCCGGCCTAAAGAACTAGCTC-3' | ||
| Reverse | 5'-Fam-AGACTGGCCTGGGATGTGT-3' | ||||
| 3284596 | Par-3 partitioning defective 3 homolog | Forward | 5'-GCTTTGGACACACTAGTGTCTGC-3' | ||
| Reverse | 5'-Fam-ATGAGGATCCAGAGCAGGAAA-3' | ||||
| 3848243 | Insulin receptor | Forward | 5'-GAAGTCTTTAGATGGCTTCTCTCTG-3' | ||
| Reverse | 5'-Fam-GCCTTATATTCTAGGTGCTGAGG-3' | ||||
| 3721010 | Insulin-like growth factor binding protein 4 | Forward | 5'-Fam-GGTGGTAGAAAAGTCTTGGCTC-3' | ||
| Reverse | 5'-GTGTAAGAATGAGGTTAGGGCAA-3' | ||||
| Real-time PCR | 3848243 | Insulin receptor | Forward | 5'-AAGACCATCGACTCGGTGAC-3' | |
| Reverse | 5'-GGATCGGCGGATTTTTAGAT-3' | ||||
| 3577940 | Calmin | Forward | 5'-GGTTCTTGGGCTGATATGGA-3' | ||
| Reverse | 5'-GGCCTTGATAGCCTTCCTCT-3' | ||||
aFrom GSE7851.
Summary of the (TG/CA)repeats in intron 1 regions (version hg 16)
| All ( | Short (6 ≤ | Long ( | Long ( | Long (5' windowb) | Long (3' windowa) | |
|---|---|---|---|---|---|---|
| Number of repeats | 1137 | 704 | 433 | 371 | 205 | 194 |
| Number of genes | 839 | 536 | 388 | 330 | 187 | 187 |
aAll intron 1 regions.
bIntron 1 regions larger than 2.5 kb.
Figure 2Chromosomal distribution of genes containing long (TG/CA). In total, 330 genes containing long repeats were identified in the 5' and/or 3' 2.5 kb windows of the intron I regions in the human genome. Chromosome II was under-represented relative to the human genome reference (version hg16).
Enriched biological processes and pathways in the genes containing II (TG/CA)repeats
| Category | Database | Database ID | Name | Count | P | FDR |
|---|---|---|---|---|---|---|
| Pathway | KEGG | hsa:04020 | Calcium signalling pathway | 16 | 2.10E-04 | 4.10E-02 |
| Biological process | GO | GO:0030001 | Metal ion transport | 25 | 1.20E-05 | 3.10E-02 |
| GO:0030182 | Neurone differentiation | 18 | 1.40E-05 | 1.80E-02 | ||
| GO:0006812 | Cation transport | 28 | 2.20E-05 | 1.90E-02 | ||
| GO:0015672 | Monovalent inorganic cation transport | 21 | 2.50E-05 | 1.60E-02 | ||
| PANTHER | BP00143 | Cation transport | 97 | 6.20E-11 | 1.40E-08 | |
| BP00142 | Ion transport | 57 | 4.10E-07 | 4.50E-05 | ||
| BP00044 | mRNA transcription regulation | 160 | 8.90E-07 | 6.50E-05 | ||
| BP00071 | Proteolysis | 90 | 4.60E-05 | 2.50E-03 | ||
| BP00040 | mRNA transcription | 105 | 8.50E-05 | 3.70E-03 | ||
| BP00102 | Signal transduction | 75 | 1.20E-04 | 4.50E-03 | ||
| BP00141 | Transport | 34 | 3.40E-04 | 1.10E-02 | ||
| BP00019 | Lipid, fatty acid and steroid metabolism | 24 | 4.10E-04 | 1.10E-02 | ||
| BP00112 | Calcium-mediated signalling | 15 | 9.50E-04 | 2.10E-02 | ||
| BP00063 | Protein modification | 53 | 9.90E-04 | 2.00E-02 | ||
| BP00284 | Haematopoiesis | 6 | 1.10E-03 | 2.10E-02 | ||
| BP00124 | Cell adhesion | 18 | 1.20E-03 | 2.10E-02 | ||
| BP00199 | Neurogenesis | 25 | 1.80E-03 | 2.80E-02 | ||
| BP00286 | Cell structure | 59 | 2.40E-03 | 3.50E-02 | ||
| BP00064 | Protein phosphorylation | 43 | 2.90E-03 | 3.90E-02 | ||
| BP00111 | Intracellular signalling cascade | 24 | 3.20E-03 | 4.00E-02 | ||
| BP00060 | Protein metabolism and modification | 54 | 3.40E-03 | 4.10E-02 | ||
Polymorphic I1 repeats in the unrelated CEU samples
| Gene | Chromosome | Repeat lengtha |
|---|---|---|
| 19 | 1-8 | |
| 5 | 1-6 | |
| 10 | 1-5 | |
| 4 | 1-5 | |
| 1 | 1-8 | |
| 7 | 1-4 | |
| 14 | 1-6 | |
| 9 | 1-11 | |
aRelative repeat length.