| Literature DB >> 17166261 |
Jungsu Oh1, Jiae Lee, Jong-Min Woo, Eunyoung Choi, Inju Park, Cecil Han, Namhoe Baek, Hoyong Lee, Do Han Kim, Chunghee Cho.
Abstract
BACKGROUND: Maturation of spermatozoa, including development of motility and the ability to fertilize the oocyte, occurs during transit through the microenvironment of the epididymis. Comprehensive understanding of sperm maturation requires identification and characterization of unique genes expressed in the epididymis.Entities:
Mesh:
Substances:
Year: 2006 PMID: 17166261 PMCID: PMC1764739 DOI: 10.1186/1471-2164-7-314
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Classification of genes in the epididymis library
| Genes | Number |
| Total entries | 1505 |
| Known (named or assigned) | 205 |
| Specific | 10 |
| Predominant | 12 |
| Widely expressed | 183 |
| Unknown | 204 |
| Specific | 58 |
| Predominant | 25 |
| Widely expressed | 121 |
| Single EST | 1096 |
Gene entries from the epididymis UniGene library (as of September, 2004) were classified into known and unknown genes. Each category was further classified into epididymis-specific, epididymis-predominant, and widely-expressed genes. Single ESTs were counted and excluded in the present study. All of the known and unknown genes are listed in Additional data files 1 and 2, respectively.
List of genes and gene-specific primers for RT-PCR
| UniGene ID (GenBank ID) | Gene descriptiona | PCR primers | |
| Forward | Reverse | ||
| Mm.99495 ( | 9230112N15Rik, hypothetical protein | GTCCGGTGCTAATAGAGCCGGCTAG | GGCTGATGAGGTCAACTGGAACTAG |
| Mm.190454 ( | 9230107O10Rik, hypothetical protein (Defb20) | GGTTATGGGCAGTGAGTGGCACAC | GGACAACGGCCTTCGTGAACAAG |
| Mm.99123 ( | 9230102M18Rik, hypothetical lipocalin protein (Lcn12) | CCACCACCAGCCATGCAGTTTCAG | GAGGCTCTACTGGCAGGAACCTGTTC |
| Mm.297297 ( | LOC219026, hypothetical protein (Gene model 75) | AGCGACGGGTGCACTGATTAGATG | CAGAACCATCCAGAGGTGATGAGAC |
| Mm.99530 ( | 9230102D03Rik, hypothetical protein (Defb41) | TCTTGTCCAAGAAACTGTACCATGAAG | CAGTAAGTAGTACTTCTGTGTGGCAG |
| Mm.99350 ( | 9230104O11Rik, hypothetical protein | CATGAAAGGCTTCAGAAGAGAG | CAGAATAATAGGAATTAACACACC |
| Mm.235619 ( | 9230113P08Rik, unknown EST | GCCGCAATGGCATGAAATCATGCTG | ACATAGAGAGGAGTATGGGGCCTG |
| Mm.99733 ( | 9230116B18Rik, similar to ACBP | GAGTCACAGCATTCCGTGTCTCATC | GGAGATCTGATTTCTCCGTCACC |
| Mm.297745 ( | LOC330921, hypothetical protein (Gene model 846) | GGTTGCAGGATGTTTGTGCTGGTG | CAGAACTAGAGTCCCATTGGGAGG |
| Mm.335028 ( | LOC382065, hypothetical protein (Gene model 1110) | GGCCGGTGGCTTCAATACTCTTACCACG | AACTTGCACGGCAATGATGGGGCCGC |
| Mm.229362 ( | 9230117D22Rik, hypothetical protein | CACTGATCTCCAAGGCCGTGA | AGAGCTCTGCCGTCAACACCAGC |
| Mm.159846 ( | 9230110F15Rik, hypothetical protein | ATCCCAGACTGAGATGGGCAAGC | ACTGGGACACAGTGCCATTGCTG |
| Mm.99387 ( | LOC432867, hypothetical protein (Defb42) | CCTTCCACCATGAGACTGTATCTGC | CCATTGCTTTAGCCGGCCGTGTGAG |
| Mm.99065 ( | 9230002F21Rik, similar to 2D6 Gylcoprotein (Defb22) | GTTCCTGGCCCATTTGGTCACAG | GCACCAACCATTGCAGCAGTGCTGGC |
| Mm.82875 ( | 2410125J01Rik, unknown EST (Defb30) | GTCTTGCTCTCCTATGTTCCAC | GTAGAGAACACTAGCCGGGATC |
| Mm.319913 ( | 9230112K08Rik, unknown EST (CRISP4) | GAGTTGGAGTTCAGCTGCTGCAGAG | CACAGCCAACGAGGTAGGTAGAGGC |
| Mm.332572 ( | LOC381667, hypothetical protein (Gene model 1679) | GTCTGCTGCTGCCTTAAAGCAG | GTCCTGATGCAACAGTTGCTGTGGC |
| Mm.291102 ( | LOC330470, hypothetical protein (Gene model 767) | CAACCTGACAGCAGGAACATGGC | GGGAAAGGCACATGGAGCATAGTCTG |
| Mm.261496 ( | 9230106F14Rik, hypothetical protein | CACCCGCATGACTGGTGACATCAAC | CACAGCCCCAGCTTTGGAATAGG |
| Mm.252404 ( | C630025C03, hypothetical protein | CAGATGTCTCATCTTTCCCTTC | ATCAGATAATGAAAGTCCAGGGC |
| Mm.234248 ( | LOC629747, similar to Eppin | ACCATGCAGCTCCAGGCCTACTTC | GGCTTGACAAGTCAGGTGTTGGTG |
| Mm.159975 ( | 9230104L09Rik, hypothetical protein (Cystatin E2) | ATGTCCAGGGAGCTCAGGCATGG | GAGCCTGGGCTGCTGATGCTG |
| Mm.245908b ( | 9230118I06Rik, hypothetical protein (Defb44) | GACCCTCCACAGCTATGAACC | CTGGAGCTGTGAGGCTAGGTC |
| Mm.117440 ( | LOC433181, hypothetical protein | GTTGCAGAGTCTGCTGTTGC | GCCACCAGTTGAGAACATTCC |
| Mm.99782 ( | 9230117E20Rik, hypothetical serine protease inhibitor | GCATGTTCAACGCCCCTAAC | GTTTGGCAGAATGCACAGCGG |
| Mm.190482 ( | 9230107M04Rik, unclassifiable | CATCCTCCAGAACAAGTTG | GTTAGGAGAACATTGCTTCC |
| Mm.99576 ( | 9230106D23Rik, hypothetical protease protein (Ovch2) | GTTTGTGAGGCCTGTGTGTC | CACTCCAGCCAGAGTCCAGG |
| Mm.99499 ( | 9230101D24Rik, hypothetical lipocalin protein (Lcn6) | ATCTTTCCTAGGCCAGGCGG | TTCTGCAGCTGAGCCTGCTG |
| Mm.293365 ( | LOC209351, similar to WAP four-disulfide core 6-like 1 | GAGCATCCAGGAACCTGAGC | CAGACAACGGTGCAGATACC |
| Mm.190489 ( | 9230010P13Rik, unknown EST | AGACTACAAACCACAGCAGC | CAGTAAGTCCAGCAGCACG |
| Mm.99690b ( | 9230111O07Rik, unclassifiable | TCATGAAGCCCTCGTGGTTC | CCAGCGGGAAGTCAGGGTC |
| Mm.99400 ( | LOC328788, hypothetical protein (Gene model 749) | CACAGCGACAGGGACTTATC | GCCAGATCGACAGGGACAC |
a Genes annotated or named during the updates of the library are shown in parentheses.
b UniGene ID numbers of Mm.245908 and Mm.99690 have been changed into Mm.387101 and Mm.389336, respectively, during the update of the UniGene database.
Figure 1Tissue distribution of the genes by RT-PCR analysis in various tissues of adult male mice. All of the genes were specifically or predominantly expressed in the epididymis. Estimation of intensity of bands amplified using primers specific to the glyceraldehyde-3-phosphate dehydrogenase (G3PDH) gene indicates the equivalent amounts of cDNA template in each tissue. M, skeletal muscle; B, brain; Lu, lung; H, heart; Li, liver; K, kidney; T, testis; S, spleen; E, epididymis; V, vas deferens.
Figure 2Transcript analysis by Northern-blot hybridization. Total RNA from epididymis (E) and spleen (S) were hybridized with cDNA probes of the genes. Agarose gels were stained with ethidium bromide to visualize 28S and 18S RNAs as a control to ensure loading of the same amount of RNA in each lane. There were significant differences in transcript size between the Northern blots and the UniGene predictions for eight genes, and these genes were subjected to RACE. This resulted in an additional new sequence for a gene (Mm.335028). Transcript sizes from known sequences (UniGene database and RACE), transcripts with significant differences in size between the Northern blots and cDNA sequences, and transcripts with isoforms are indicated below the blots. In the RACE analysis, genes with results in which transcript sequences were not extended (N.E.) or no result (N.R.), i.e. no amplification of transcript sequence in the analysis, are indicated. In the case of results from both 5'- and 3'-RACE in a given gene, the results are shown as 5'/3'.
Figure 3Genomic and transcript characteristics of the novel genes. Gene structure and exon organization were determined by genome database searches. In the gene structure, vertical bars and connecting horizontal lines represent the position of exons and introns, respectively. The orientation of each gene is indicated by a broken arrow. In the exon organization, diagonal lines represent additional unknown sequences. Coding regions were determined by selecting the longest open reading frames deduced from the cDNA sequences, and the predicted coding regions are shaded. The position of the poly(A) signal is marked by filled arrowheads. The calculated transcript sizes are summarized from the recent UniGene databases and the results from the Northern blots shown in Figure. 3. The numbers of amino acids (No. aa) corresponding to the predicted coding regions are listed. Chromosomal locations were determined by searches of the assembled human (UCSC Build 36.1) and mouse (UCSC Build 36) genomes.
Figure 4Hormonal regulation of gene expression. Using epididymides from wild-type mice (lane 1) and from castrated mice treated with oil (lane 2) or dihydrotestosterone (lane 3), Northern-blot analysis was performed as described in Fig. 2. Genes were divided into three groups based on whether their expression was regulated by androgen or testicular factors: androgen-dependent (A), testicular factor-dependent (B), and androgen- and testicular factor-independent gene expression (C).
Figure 5Region-specific and developmental expression profile of novel genes by RT-PCR. (A) Region-dependent expression of the novel genes in the epididymis. The schematic organization of the mouse epididymis is shown at the top, highlighting the initial segment, caput, corpus, and cauda regions. RT-PCR analysis was performed with cDNAs prepared from four different regions of epididymis. The house-keeping gene G3PDH was used as a control to normalize the template input. The genes were divided into 5 different groups, based on regional expression pattern and each group is indicated by a vertical bar. IS, initial segment; Cp, caput; Cr; corpus; Cd, cauda. (B) Postnatal developmental expression of the novel genes. Schematic diagram of the postnatal development of the epididymis is shown, highlighting the epithelial cell differentiation stage. RT-PCR was performed with cDNAs prepared from epididymides of mice of different ages. Genes were divided into three groups based on the stage of development at which they were expressed: before epithelial cell differentiation (Group I), at epithelial cell differentiation (Group II), and at puberty (Group III). Estimation of intensity of bands amplified using primers specific to the G3PDH gene indicates the equivalent amounts of cDNA template at each stage. The numbers above the lanes indicate days after birth.
Figure 6Putative domains and motifs in proteins encoded by the novel genes. The predicted amino acid sequences of the novel genes were analyzed using various bioinformatics tools (see Materials and Methods) and genes containing putative domains or motifs are listed. The proteins are indicated by boxes and, the putative domains or motifs are shaded. The size of the scale bar is shown as number of amino acids (aa) below each protein. Domain/motif abbreviations are as follows: WFDB, whey-acid-protein four-disulfide binding core domain; ACBP, acyl-CoA binding protein; Glyco_hydro_35, Glycosyl hydrolases family 35; SCP, sperm coating protein-like extracellular protein; FN2, type II fibronectin collagen-binding domain; ABC transport, ATP binding cassette transport; KU, BPTI (bovine pancreatic trypsin inhibitor)/Kunitz family of serine protease inhibitors; KAZAL_PSTI, Kazal-type pancreatic secretory trypsin inhibitors (PSTI) and related proteins; Tryp_SPc, trypsin-like serine protease; CUB, CUB domain.
Figure 7Secretion of proteins encoded by the novel genes. COS-7 cells were transfected with pcDNA3.1-UniGene-myc/His. After 48 hours, UniGene-myc/His were immunoprecipitated with anti-myc mAb from the culture medium and the cell lysates, and then subjected to Western-blot analysis using α-myc. Proteins were divided into two groups based on their secretion profile: secretory (A) and intracellular (B) proteins. Vector alone (Mock) and cysteine-rich secretory protein 1 (CRISP1) were used as a negative control and secretion marker, respectively. The lower bands in the immunoblot of Mm.99576 represent immunoglobulin G (IgG) heavy chain. The arrow indicates the molecular weight of the each protein.