| Literature DB >> 15748293 |
Petr Divina1, Cestmír Vlcek, Petr Strnad, Václav Paces, Jirí Forejt.
Abstract
BACKGROUND: We generated the gene expression profile of the total testis from the adult C57BL/6J male mice using serial analysis of gene expression (SAGE). Two high-quality SAGE libraries containing a total of 76 854 tags were constructed. An extensive bioinformatic analysis and comparison of SAGE transcriptomes of the total testis, testicular somatic cells and other mouse tissues was performed and the theory of male-biased gene accumulation on the X chromosome was tested.Entities:
Mesh:
Substances:
Year: 2005 PMID: 15748293 PMCID: PMC1079818 DOI: 10.1186/1471-2164-6-29
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Parameters of constructed SAGE libraries from B6 mouse total testis
| SAGE library | Total testis 1 (TT 1) | Total testis 2 (TT 2) |
| Sequenced clones | 811 | 1 510 |
| Total tags* | 24 975 | 51 879 |
| Unique transcripts | 10 516 | 18 848 |
| Single copy tags | 7 731 | 13 422 |
| Quality parameters | ||
| Average tags per clone | 30.8 | 34.4 |
| Duplicated ditags | 157 (1.2 %) | 276 (1.0 %) |
| Linker derived tags | 147 (0.6 %) | 223 (0.4 %) |
* excluding tags from duplicated ditags and linker-derived tags (i.e. two linker tags TCCCTATTAA, TCCCCGTACA and all possible 1-bp linker tag variations)
Figure 1Comparison of mouse testis SAGE libraries represented by dot plots and fold factor distribution graphs. Comparison of tag counts between two total testis libraries (A), and between the combined total testis library and adult testis somatic cells library (B). Tags with significant p-chance are depicted in blue (0.01 < p < 0.05) and red (p < 0.01). Tags missing in one of the libraries are plotted at -0.5 coordinates. Point size is proportional to the number of represented tags. Distribution of the fold factor between two total testis libraries (C), and between the combined total testis library and adult testis somatic cells library (D). Fold factor is the ratio of normalized tag abundances in two SAGE libraries with ratios < 1 converted to reciprocal negatives. For tags missing in one library, normalized tag count of single copy tags was assumed. Abbreviations: TT 1 = total testis library 1; TT 2 = total testis library 2; TT 1+2 = combined total testis libraries; ATSC = adult testis somatic cells library.
Identification of tags in the combined total testis SAGE library (TT 1+2) The tags matching the mitochondrial genome were omitted in this summary. Tags in group "Unreliable matches" (*) are considered not reliable according to Mouse SAGE Site and RNA evidence mappings, because they are not supported by the required number of mRNA and EST sequences. These tags are, however, included in reliable single/multiple match groups in the SAGEmap reliable mapping, which results in a highly increased number of reliable multiple matches and a slightly increased number of reliable single matches.
| NCBI SAGEmap reliable mapping | Mouse SAGE Site reliable mapping | RNA evidence mapping | ||||
| tags | % | tags | % | tags | % | |
| Reliable single match | 4 061 | 54.3 | 3 553 | 47.5 | 3 367 | 45.0 |
| Reliable multiple matches | 2 865 | 38.3 | 1 157 | 15.5 | 472 | 6.3 |
| Unreliable match(es)* | - | - | 2 216 | 29.6 | 3 087 | 41.3 |
| No match | 555 | 7.4 | 555 | 7.4 | 555 | 7.4 |
| Total tags (tag count > 1) | 7 481 | 100.0 | 7 481 | 100.0 | 7 481 | 100.0 |
Figure 2Classification of genes expressed in total testis according to the biological process ontology of the GO database. Bar graphs represent the sum of tag abundances corresponding to genes associated with a particular GO term. Only selected GO terms with the sum of tag abundances > 500 are displayed. The number of genes associated with each GO term is indicated inside the bars.
Parameters of the SAGE libraries constructed from total testis and somatic cells of adult testis
| SAGE library | Total testis | Adult testis somatic cells |
| TT 1+2 | ATSC | |
| Total tags | 76 854 | 81 478 |
| Unique tags | 24 529 | 22 809 |
| Unique tags with count > 1 | 7 481 | 7 435 |
| Proportions of unique tags with count > 1 | ||
| % of total tags | 77.8 | 81.1 |
| % of unique tags | 30.5 | 32.6 |
Figure 3Dissimilarities of mouse SAGE libraries illustrated by a dendrogram. Thirty-two SAGE libraries constructed from bulk tissues containing 190 871 unique tags (including single copy tags) were selected (see Additional file 4). Pairwise library distances based on differences between normalized tag counts were computed according to [28]. The average agglomeration method was used in hierarchical clustering due to the highest cophenetic correlation (0.936) between observed and predicted distances resulting from the dendrogram. The number of tags in each SAGE library is indicated.
Distribution of testis-specific genes on autosomes and the X chromosome The total of 14 222 LocusLink genes were identified in total testis, adult testis somatic cells and non-testis tissue SAGE libraries (see Additional file 4) using RNA evidence mapping (tags matching multiple LocusLink genes were discarded). The genes identified by total tag count = 1 were then excluded from analysis. The genes expressed only in one tissue type (total testis, adult testis somatic cells, brain, eye, heart, liver, kidney, limbs and adipose tissue) were considered to be tissue-specific genes. Chromosomal distribution of genes specific for total testis (a) and testis somatic cells (b) in comparison to the non-testis tissue-specific genes was evaluated. The significance was tested by permutations (100 000 random shufflings of the chromosomes while keeping the sum of genes on autosomes and the X chromosome fixed) and confirmed by Fisher's exact test. Abbreviations: total t. = total testis; t. somatic = testicular somatic cells; other = non-testis tissues; ChrA = autosomes; ChrX = X chromosome.
| a) Total testis: 395 genes specific for the combined total testis SAGE library (TT 1+2) Other tissues: 877 genes specific for one tissue type in the pool of other SAGE libraries | |||||||
| Chrom | Observed gene counts | Gene counts in randomized genome | % observed gene counts | Ratio of observed proportions | |||
| total t. | other | total t. | other | total t. | other | total t./other | |
| ChrA | 381 | 836 | 378 | 839 | 96.5 | 95.3 | 1.0 |
| ChrX | 14 | 41 | 17 | 38 | 3.5 | 4.7 | 0.7 |
| Permutations yielding < = observed gene counts in total t. on ChrX | 22 395 | ||||||
| Permutations, p-value (two tailed) | 0.4479 | ||||||
| Fisher's exact, p-value (two tailed) | 0.4563 | ||||||
| Confidence interval (0.95) | 0.70 – 2.68 | ||||||
| b) Testis somatic cells: 81 genes specific for the adult testis somatic cells SAGE library (ATSC) Other tissues: 924 genes specific for one tissue type in the pool of other SAGE libraries | |||||||
| Chrom | Observed gene counts | Gene counts in randomized genome | % observed gene counts | Ratio of observed proportions | |||
| t.somatic | other | t.somatic | other | t.somatic | other | t. somatic/other | |
| ChrA | 70 | 885 | 77 | 878 | 86.4 | 95.8 | 0.9 |
| ChrX | 11 | 39 | 4 | 46 | 13.6 | 4.2 | 3.2 |
| Permutations yielding > = observed gene counts in t. somatic on ChrX | 121 | ||||||
| Permutations, p-value (two tailed) | 0.0024 | ||||||
| Fisher's exact, p-value (two tailed) | 0.0013 | ||||||
| Confidence interval (0.95) | 0.13 – 0.64 | ||||||
Figure 4Chromosomal positions of clusters containing preferentially expressed genes in testis. The positions of 103 gene clusters according to the physical map are displayed on an ideogram with corresponding cytogenetic bands on chromosomes. The clusters revealed in total testis and testicular somatic cells are not distinguished. Tight clusters (long dashes) form a subset of loose clusters (short dashes).
Number of preferentially expressed genes in testis located in clusters within tandem duplicate-free mouse genome Out of the 19 684 known genes (LocusLink) mapped on mouse genome assembly (NCBI, build32), 16 858 genes remained in tandem duplicate-free genome, including 1 300 and 1 050 preferentially expressed genes in total testis and testicular somatic cells, respectively. Chromosome search found clusters containing at least three adjacent preferentially expressed genes (tight clusters) or at least three preferentially expressed genes among the six adjacent genes (loose clusters). The tight clusters therefore form a subset of the loose clusters. Observed gene counts were evaluated using permutations (100000 random shufflings of the expression status of genes while keeping the gene positions constant) and the average number of genes located in clusters in the randomized genomes was computed.
| Total testis (TT 1+2) | Adult testis somatic cells (ATSC) | |||
| 1 300 | 1 050 | |||
| in tight clusters | in loose clusters | in tight clusters | in loose clusters | |
| Observed gene counts | 44 | 230 | 36 | 120 |
| Proportion of preferentially expressed genes | 3.4 % | 17.7 % | 3.4 % | 11.4 % |
| Gene counts in randomized genomes (mean ± std. dev.) | 21.9 ± 8.1 | 168.4 ± 20.1 | 11.7 ± 5.9 | 94.2 ± 15.6 |
| Ratio observed/mean in randomized genomes | 2.0 | 1.4 | 3.1 | 1.3 |
| Permutations yielding > = observed gene counts | 741 | 180 | 52 | 5 722 |
| p-value (one tailed) | 0.0074 | 0.0018 | 0.0005 | 0.0572 |