| Literature DB >> 18851760 |
Diana G F Ross1, Josephine Bowles, Peter Koopman, Sigrid Lehnert.
Abstract
BACKGROUND: SRY is the pivotal gene initiating male sex determination in most mammals, but how its expression is regulated is still not understood. In this study we derived novel SRY 5' flanking genomic sequence data from bovine and caprine genomic BAC clones.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18851760 PMCID: PMC2572636 DOI: 10.1186/1471-2199-9-85
Source DB: PubMed Journal: BMC Mol Biol ISSN: 1471-2199 Impact factor: 2.946
Figure 1Homology of human, caprine, porcine and mouse . Pink shading indicates 70% or higher homologies calculated over 100 bp. Peaks of homology are labelled Region A to D above the graph. Repetitive elements (LINEs and SINEs) are indicated in green, and the SRY coding region in blue. Grey line below each graph shows the extent of sequence used.
Figure 2Spacing of and co-ordinates of conserved . Sequence information for porcine region A was not available for this study. Numbering represents number of nucleotides 5' to the transcription start site in each species, ? denotes unknown positions.
Figure 3DNA sequence homologies calculated across the whole of regions A, B, C and D. Species are human (hum), bovine (bov), porcine (por), caprine (cap), and murine (mur). na, porcine sequence not available.
Transcription factor binding sites found in SRY 5' regions
| BRNF | b/g/p (×2) | b/g/h (×2) | 18 | ||
| OCT1 | b/g/p (×2) | b/g/p (×2) | 18 | ||
| HOXF | b/g/h | b/g/h | b/g/h | 13 | |
| PARF | b/g/p | b/g/p (×3) | 12 | ||
| FKHD | b/g/p | b/g/p | 10 | ||
| GATA | b/g/h (×2) | b/g/p | 9 | ||
| CREB | b/g/p | b/g/p (×2) | 9 | ||
| CDXF | 8 | ||||
| SRFF | b/g/h | 7 | |||
| MEF2 | b/g/h | g/h/p | 6 | ||
| ETSF | b/g/h, b/g/p | 6 | |||
| HNF1 | b/g/p | b/g/h | 6 | ||
| SORY | b/g/p | g/h/p | 6 | ||
| NKXH | b/g/h | 6 | |||
| LHXF | 4 | ||||
| MYT1 | 4 | ||||
| PLZF | 4 | ||||
| NFKB | 4 | ||||
| EVI1 | b/g/h | 3 | |||
| TBPF | b/g/h | 3 | |||
| HOXC | b/g/h | 3 | |||
| GFI1 | b/g/h | 3 | |||
| PITI | b/g/h | 3 | |||
| OCTP | b/g/h | 3 | |||
| RORA | b/g/h | 3 | |||
| HAML | b/g/h | 3 | |||
| RBPF | b/g/h | 3 | |||
| IRFF | b/g/p | 3 | |||
| PAX6 | b/g/p | 3 | |||
| MZF1 | b/g/p | 3 | |||
| GZF1 | b/g/h | 3 | |||
| ZFHX | b/g/p | 3 | |||
| PLAG | b/g/p | 3 | |||
| MOKF | b/g/h | 3 | |||
| HOMF | b/g/h | 3 | |||
| RBIT | b/g/h | 3 | |||
| SATB | b/g/p | 3 | |||
| CLOX | b/g/h | 3 | |||
| 27 | 18 | 62 | 103 | 210 | |
Only sites conserved between 3 or more species are shown. Sites conserved between 4 species are marked in bold. Numbers in parentheses indicate the number of times each binding site was found in the same region. Data are sorted in order of most common to least common transcription factor binding sites. Bovine (b), goat (g), human (h), pig (p), mouse (m).
Transcription factor family members
| BRNF | Brn POU domain factors |
| BRN2/3/4/5 | |
| OCT1 | Octamer binding protein |
| OCT1/2/3 | |
| HOXF | Factors with moderate activity to homeodomain consensus sequence |
| Barx2, CRX, GSC, Gsh-1/2, HOX1, HOXA9, HOXB9, HOXC13, NANOG, OTX2, PCE1, PHOX2a/2b, PTX1 pituitary homeobox. | |
| PARF | PAR/bZIP family |
| DBP Albumin D-box binding protein, HLF hepatic leukemia factor, TEF Thyrotrophic embryonic factor, VBP PAR-type chicken vitellogenin promoter binding protein. | |
| FKHD | Fork head domain factors |
| FHXA/B, FKHRL1 (FOXO), FREAC2/3/4/7 fork head related activators (FOXF2, FOXC1, FOXD1, FOXL1), HFH1/2/3/8 (FOXQ1, FOXD3, FOXI1, Freac-6. FXF1), HNF3B (FOXA2), IlF1 (FOXK2), XFD1/2/3. | |
| GATA | GATA binding factors |
| GATA, GATA1/2/3. | |
| CREB | Camp-responsive element binding proteins |
| ATF, ATF2/6 activation transcription factors, c-Jun/ATF2 heterodimers, CREB, CREB1/2, CREB2/cJun, E4BP4, TAX/CREB complex, XBP1 X-box-binding protein. | |
| CDXF | Vertebrate caudal related homeodomain Protein |
| CDX1/2 Intestine specific homeodomain factor and mammalian caudal related intestinal TF. | |
| SRFF | Serum response element binding factor |
| SRF.01/02/03 | |
| MEF2 | Myocyte-specific enhancer binding Factor |
| MEF2, RSRFC4 related to serum response factor, SL1 member of RSRF | |
| ETSF | Human and murine ETS1 factors |
| c-Ets-1/2(p54), ELF-2(NERF1a), ELK1, FLI, GABP GA binding protein, GABPB1 GA repeat binding protein beta 1, NRF2 nuclear respiratory factor 2, PDEF Prostate-derived Ets factor, PEA3 polyomavirus enhancer A binding protein 3, ETV4, PU1, SPI1, SpiB. | |
| HNF1 | Hepatic nuclear factor 1 |
| HNF1 | |
| SORY | SOX/SRY-sex/testis determining and related HMG box factors |
| HBP1, HMGA1/2, HMGIY, SOX5/9, SRY. | |
| NKXH | NKX homeodomain factors |
| Hmx2/Nkx5-2 homeodomain transcription factor, NKX31 prostate-specific homeodomain protein, TTF1 thyroid transcription factor | |
| LHXF | Lim homeodomain factors |
| LHX3 and LMXB1 | |
| MYT1 | MYT1 C2HC zinc finger protein |
| MyT1 myelin transcription factor, and MyT1L. | |
| PLZF | C2H2 zinc finger protein |
| PLZF promyelocytic leukemia zinc finger (TF with 9 Kruppel-like zinc fingers) | |
| NFKB | Nuclear factor kappa B/c-rel |
| NF-kappaB (p50 and p65), HIVEP1; ZAS Domain TF human immunodeficiency virus type 1 enhancer-binding protein-1 (HIVEP1), major histocompatibility complex-binding protein-1 (MBP-1), positive regulatory domain II-binding factor (PRDII-BF1) | |
| EVI1 | Myleoid transforming protein |
| EVI1 ecotropic viral integration site 1 encoded factor, amino-terminal zinc finger domain. MEL1 (MDS1/EVI1-like gene 1) DNA-binding domain 1. | |
| TBPF | TATA-binding protein factors |
| ATATA avian C-type LTR TAT box, LTATA Lentivirus LTR TAT box, MTATA muscle TATA box, TATA cellular and viral TATA box elements, and Mammalian C-type LTR TATA box. | |
| HOXC | HOX – PBX complexes |
| HOX/PBX binding sites, PBX1, PBX-HOXA9 binding site. | |
| GFI1 | Growth factor independence transcriptional Repressor |
| GFI1.01/02, GFI1B.01. | |
| PITI | GHF-1 pituitary specific pou domain TF |
| Pit1, GHF1. | |
| OCTP | OCT1 binding factor (POU-specific domain) |
| OCT1P Octamer-binding factor 1, POU-specific domain) | |
| RORA | v-ERB and RAR-related orphan receptor alpha |
| REV-ERBA orphan nuclear receptor rev-erb alpha (NR1D1), RORA/RORA1/2 RAR-related orphan receptor alpha/1/2, RORGAMMA RAR-related orphan receptor gamma, VERBA viral homolog of thyroid hormone receptor alpha1 | |
| HAML | Human acute myelogenous leukemia factors |
| AML1/CBFA2 Runt domain binding site, AML3 runt-related transcription factor 2/CBFA1 | |
| RBPF | RBPJ kappa |
| Mammalian transcriptional repressor RBP-Jkappa/CBF1 | |
| IRFF | Interferon regulatory factors |
| IRF1/2/3/4(NF-EM5, PIP, LSIRF, ICSAT)/7, ISRE interferone stimulated response element. | |
| PAX6 | PAX-4/PAX-6 paired domain binding sites |
| PAX4 and PAX6 paired domain binding site | |
| MZF1 | Myeloid zinc finger 1 factors |
| MZF1 | |
| GZF1 | GDNF-inducible zinc finger gene 1 |
| GZF1 (ZNF336) | |
| ZFHX | Two-handed zinc finger homeodomain transcription factors |
| AREB6 (Atp1a1 regulatory element binding factor 6), deltaEF1 (Delta-crystallin enhancer binding factor, transcription factor 8, zinc finger homeobox 1a), SIP1 (Smad-interacting protein) | |
| PLAG | Pleomorphic adenoma gene |
| (PLAG) 1, a developmentally regulated C2H2 zinc finger protein | |
| MOKF | Mouse Kruppel like factors |
| MOK2.01/02 Ribonucleoprotein associated zinc finger protein MOK-2 | |
| HOMF | Homeodomain transcription factors |
| DLX1/2/5, Distal-less 3, EN1 homeobox protein engrailed, HHEX, MSX1/2, NOBOX, S8. | |
| RBIT | Regulator of B-Cell IgH transcription |
| Bright, B cell regulator of IgH transcription | |
| SATB | Special AT-rich sequence binding |
| Protein SATB1 | |
| CLOX | CLOX and CLOX homology (CDP) factors |
| CDP cut-like homeodomain protein, transcriptional repressor CDP, CDPCR3, CDPCR3HD, CLOX, CUT2. | |
List of transcription factor families found in Regions A-D and the specific transcription factors that comprise them.
Figure 4Conserved transcription factor binding sites in each region of homology. Black text indicates conservation between 3 species of which one is human, grey text indicates 3-species conservation without human, and red text indicates conservation between 4 species (human, bovine, porcine and caprine). An example of the highly conserved area of region D is shown as a sequence alignment with conserved transcription factor binding sites boxed or shaded.
Matrix similarity scores for putative binding sites
| A | HOXF | 0.96 | 0.988 | 0.947 | - | 0.965 |
| GATA (a) | 0.924 | 0.963 | 0.956 | - | 0.948 | |
| GATA (b) | 0.944 | 0.972 | 0.916 | - | 0.944 | |
| PITI | 0.942 | 0.93 | 0.945 | - | 0.939 | |
| GFI1 | 0.96 | 0.911 | 0.918 | - | 0.930 | |
| HOXC | 0.911 | 0.922 | 0.951 | - | 0.928 | |
| OCTP | 0.922 | 0.875 | 0.968 | - | 0.922 | |
| EVI1 | 0.958 | 0.86 | 0.904 | - | 0.907 | |
| TBPF | 0.923 | 0.813 | 0.933 | - | 0.890 | |
| B | RBPF | 0.944 | 0.943 | 0.961 | - | 0.949 |
| RORA | 0.958 | 0.983 | 0.897 | - | 0.946 | |
| HAML | 0.943 | 0.935 | 0.943 | - | 0.940 | |
| HOXF | 0.884 | 0.889 | 0.884 | - | 0.886 | |
| MEF2 | 0.905 | 0.885 | 0.775 | - | 0.855 | |
| SRFF | 0.697 | 0.717 | 0.681 | - | 0.698 | |
| C | MZF1 | 1.000 | - | 1.000 | 0.995 | 0.998 |
| ZFHX | 0.984 | - | 0.984 | 0.984 | 0.984 | |
| ETSF | 0.983 | 0.982 | 0.983 | - | 0.983 | |
| FKHD | 0.962 | - | 0.962 | 0.962 | 0.962 | |
| GATA | 0.973 | 0.936 | 0.973 | 0.954 | 0.959 | |
| IRFF | 0.964 | - | 0.887 | 0.945 | 0.932 | |
| CREB | 0.938 | - | 0.938 | 0.914 | 0.930 | |
| HOXF | 0.975 | 0.870 | 0.975 | 0.857 | 0.919 | |
| BRNF | 0.946 | - | 0.906 | 0.899 | 0.917 | |
| PARF | 0.940 | - | 0.864 | 0.921 | 0.908 | |
| ETSF | 0.880 | - | 0.890 | 0.925 | 0.898 | |
| OCT1 | 0.905 | - | 0.894 | 0.894 | 0.898 | |
| SORY | 0.879 | - | 0.879 | 0.927 | 0.895 | |
| PLAG | 0.900 | - | 0.882 | 0.887 | 0.890 | |
| BRNF | 0.810 | - | 0.810 | 0.916 | 0.845 | |
| LHXF | 0.839 | 0.846 | 0.839 | 0.849 | 0.843 | |
| OCT1 | 0.846 | - | 0.841 | 0.820 | 0.836 | |
| GZF1 | 0.761 | 0.858 | 0.858 | - | 0.826 | |
| HNF1 | 0.801 | - | 0.803 | 0.819 | 0.808 | |
| PAX6 | 0.778 | - | 0.769 | 0.781 | 0.776 | |
| D | SORY | - | 0.991 | 0.987 | 0.986 | 0.988 |
| MOKF | 0.983 | 0.983 | 0.983 | - | 0.983 | |
| HOMF | 0.989 | 0.950 | 0.989 | - | 0.976 | |
| SATB | 0.958 | - | 0.958 | 0.967 | 0.961 | |
| CLOX | 0.948 | 0.967 | 0.948 | - | 0.954 | |
| CDXF | 0.980 | 0.855 | 0.980 | 0.980 | 0.949 | |
| PARF | 0.921 | - | 0.921 | 0.995 | 0.946 | |
| RBIT | 0.924 | 0.965 | 0.924 | - | 0.938 | |
| NKXH | 0.933 | 0.928 | 0.933 | - | 0.931 | |
| HOXF | 0.923 | - | 0.923 | 0.942 | 0.929 | |
| NKXH | 0.946 | - | 0.835 | 1.000 | 0.927 | |
| FKHD | 0.922 | - | 0.922 | 0.909 | 0.918 | |
| NFKB | 0.864 | 0.992 | 0.841 | 0.947 | 0.911 | |
| CREB | 0.918 | - | 0.918 | 0.893 | 0.910 | |
| MEF2 | - | 0.890 | 0.791 | 0.991 | 0.891 | |
| OCT1 | 0.849 | 0.954 | 0.849 | - | 0.884 | |
| OCT1 | 0.873 | 0.893 | 0.873 | - | 0.880 | |
| SRFF | 0.844 | 0.855 | 0.918 | 0.884 | 0.875 | |
| HNF1 | 0.943 | 0.854 | 0.828 | - | 0.875 | |
| PLZF | - | 0.883 | 0.874 | 0.866 | 0.874 | |
| PARF | 0.860 | - | 0.865 | 0.897 | 0.874 | |
| OCT1 | 0.862 | - | 0.856 | 0.899 | 0.872 | |
| FKHD | 0.867 | 0.861 | 0.836 | 0.919 | 0.871 | |
| PARF | 0.867 | - | 0.867 | 0.867 | 0.867 | |
| BRNF | 0.902 | 0.785 | 0.902 | - | 0.863 | |
| CDXF | 0.872 | 0.850 | 0.870 | 0.860 | 0.863 | |
| MYT1 | 0.875 | - | 0.775 | 0.875 | 0.842 | |
| BRNF | 0.805 | 0.892 | 0.819 | - | 0.839 | |
| CREB | 0.844 | - | 0.833 | 0.833 | 0.837 | |
| BRNF | 0.796 | - | 0.807 | 0.889 | 0.831 | |
| BRNF | 0.790 | - | 0.790 | 0.898 | 0.826 | |
| OCT1 | 0.790 | - | 0.783 | - | 0.787 | |
List of matrix similarity scores (the similarity of each putative site to the canonical binding site for the relevant transcription factor) generated by MatInspector software for each putative transcription factor binding site in each species, for each region of homology. Matrix scores are ranked from the highest to lowest mean score.