| Literature DB >> 17916572 |
Ren Zhang1, Yan Lin, Chun-Ting Zhang.
Abstract
The double helix is a conformation that genomic DNA usually assumes; under certain conditions, however, guanine-rich DNA sequences can form a four-stranded structure, G-quadruplex, which is found to play a role in regulating gene expression. Indeed, it has been demonstrated that the G-quadruplex formed in the c-MYC promoter suppresses its transcriptional activity. Recent studies suggest that G-quadruplex motifs (GQMs) are enriched in human gene promoters. To facilitate the research of G-quadruplex, we have constructed Greglist, a database listing potentially G-quadruplex regulated genes. Greglist harbors genes that contain promoter GQMs from genomes of various species, including humans, mice, rats and chickens. Many important genes are found to contain previously unreported promoter GQMs, such as ATM, BAD, AKT1, LEPR, UCP1, APOE, DKK1, WT1, WEE1, WNT1 and CLOCK. Furthermore, we find that not only protein coding genes, 126 human microRNAs also contain promoter GQMs. Greglist therefore provides candidates for further studying G-quadruplex functions and is freely available at http://tubic.tju.edu.cn/greglist.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17916572 PMCID: PMC2238908 DOI: 10.1093/nar/gkm787
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Descriptive statistics of genes in Greglist
| Species (Latin name) | Species | Genome version | Number of genes having promoter GQMs | Total gene number | Percentage of genes having promoter GQMs | Average GQMs a gene has | GQM density in promoter regions (GQMs/Kb) | Average GQM length (mean ± SD) |
|---|---|---|---|---|---|---|---|---|
| Human | NCBI36 | 10 277 | 31 524 | 32.60% | 1.93 | 0.63 | 29.19 ± 13.57 | |
| Mouse | NCBIM36 | 8962 | 28 390 | 31.57% | 1.61 | 0.51 | 28.18 ± 12.57 | |
| Rat | RGSC3.4 | 7013 | 27 302 | 25.69% | 1.43 | 0.37 | 26.39 ± 8.82 | |
| Chicken | WASHUC2 | 5949 | 17 438 | 34.12% | 1.75 | 0.60 | 28.70 ± 14.44 |
A list of 30 human genes that have not been previously reported to contain promoter G-quadruplex motifs
| No. | Abbreviation | Gene name | Ensembl ID | Function or associated disease | Reference | Number of GQM | Distance to TSS |
|---|---|---|---|---|---|---|---|
| 1 | WNT1 | Wingless-type MMTV integration site family, member 1 | ENSG00000125084 | The Wnt signaling pathway, CNS development | ( | 1 | 193 |
| 2 | WNT5A | Wingless-type MMTV integration site family, member 5A | ENSG00000114251 | The Wnt signaling pathway, vertebrate development | ( | 2 | 567, 936 |
| 3 | LEPR | LEPTIN receptor | ENSG00000116678 | Energy metabolism | ( | 3 | 310, 372, 495 |
| 4 | UCP1 | Uncoupling protein 1 | ENSG00000109424 | Energy metabolism | ( | 2 | 89, 224 |
| 5 | APOE | Apolipoprotein E | ENSG00000130203 | Alzheimer's disease | ( | 4 | 46, 65, 407, 739 |
| 6 | ATM | Ataxia telangiectasia mutated | ENSG00000149311 | Ataxia telangiectasia | ( | 1 | 59 |
| 7 | PAX8 | Paired box gene 8 | ENSG00000125618 | Permanent congenital hypothyroidism | ( | 1 | 133 |
| 8 | SOX1 | SRY (sex determining region Y)-box 1 | ENSG00000203883 | Lens development | ( | 3 | 80, 726, 826 |
| 9 | SOX10 | SRY (sex determining region Y)-box 10 | ENSG00000100146 | Waardenburg–Hirschsprung disease | ( | 2 | 130, 313 |
| 10 | HDAC1 | Histone deacetylase 1 | ENSG00000116478 | Histone modification | ( | 1 | 34 |
| 11 | TGFβ1 | Transforming growth factor, beta 1 | ENSG00000105329 | TGFβ signaling | ( | 1 | 151 |
| 12 | SMAD2 | MAD homolog 2 | ENSG00000175387 | TGFβ signaling | ( | 2 | 235, 450 |
| 13 | DKK1 | Dickkopf homolog 1 | ENSG00000107984 | TGFβ signaling | ( | 1 | 136 |
| 14 | CLOCK | Clock homolog | ENSG00000134852 | Circadian rhythms | ( | 3 | 147, 341, 692 |
| 15 | WEE1 | WEE1 homolog | ENSG00000166483 | Cell cycle control | ( | 1 | 542 |
| 16 | BAD | BCL2-antagonist of cell death | ENSG00000002330 | Apoptosis | ( | 3 | 116, 628, 756 |
| 17 | AKT1 | V-akt murine thymoma viral oncogene homolog 1 | ENSG00000142208 | Apoptosis | ( | 1 | 61 |
| 18 | GATA4 | GATA-binding protein 4 | ENSG00000136574 | Heart development | ( | 1 | 314 |
| 19 | MYOD1 | Myogenic differentiation 1 | ENSG00000129152 | Muscle development | ( | 2 | 128, 216 |
| 20 | WT1 | Wilms tumor 1 | ENSG00000184937 | Kidney development | ( | 2 | 168, 900 |
| 21 | GDF1 | Growth differentiation factor 1 | ENSG00000135414 | Left–right patterning | ( | 4 | 78, 166, 327, 766 |
| 22 | BMP2 | Bone morphogenetic protein 2 | ENSG00000125845 | Bone development | ( | 1 | 163 |
| 23 | MEF2D | MADS box transcription enhancer factor 2D | ENSG00000116604 | Heart development | ( | 4 | 18, 85, 169, 232 |
| 24 | STAT6 | Signal transducer and activator of transcription 6 | ENSG00000166888 | Immunity | ( | 1 | 505 |
| 25 | SOCS1 | Suppressor of cytokine signaling 1 | ENSG00000185338 | Immunity | ( | 5 | 112, 211, 534, 578, 758 |
| 26 | MMP2 | Matrix metallopeptidase 2 | ENSG00000167346 | Function of extracellular matrix | ( | 1 | 576 |
| 27 | MAPK2 | Mitogen-activated protein kinase 2 | ENSG00000162889 | MAP kinase pathway | ( | 2 | 100, 137 |
| 28 | ATF1 | Activating transcription factor 1 | ENSG00000123268 | Transcriptional regulation | ( | 1 | 36 |
| 29 | TAF2 | TAF2 RNA polymerase II | ENSG00000064313 | Transcriptional regulation | ( | 1 | 296 |
| 30 | RING1 | Ring finger protein 1 | ENSG00000204227 | Transcriptional regulation | ( | 4 | 501, 559, 677, 938 |