| Literature DB >> 24710137 |
Yali Xue1, Chris Tyler-Smith2.
Abstract
The TSPY gene stands out from all other human protein-coding genes because of its high copy number and tandemly-repeated organization. Here, we review its evolutionary history in great apes in order to assess whether these unusual properties are more likely to result from a relaxation of constraint or an unusual functional role. Detailed comparisons with chimpanzee are possible because a finished sequence of the chimpanzee Y chromosome is available, together with more limited data from other apes. These comparisons suggest that the human-chimpanzee ancestral Y chromosome carried a tandem array of TSPY genes which expanded on the human lineage while undergoing multiple duplication events followed by pseudogene formation on the chimpanzee lineage. The protein coding region is the most highly conserved of the multi-copy Y genes in human-chimpanzee comparisons, and the analysis of the dN/dS ratio indicates that TSPY is evolutionarily highly constrained, but may have experienced positive selection after the human-chimpanzee split. We therefore conclude that the exceptionally high copy number in humans is most likely due to a human-specific but unknown functional role, possibly involving rapid production of a large amount of TSPY protein at some stage during spermatogenesis.Entities:
Year: 2011 PMID: 24710137 PMCID: PMC3924835 DOI: 10.3390/genes2010036
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1Comparison of human and chimpanzee gene clusters. Left panels, human TSPY minor cluster at 6.1 Mb; right panels, human TSPY major array at 9.1–9.4 Mb including assembly gap shown as a blank region (GRCh37). Top panels, chimpanzee TSPY cluster 1 at 4.3 Mb (4 genes); middle panels, chimpanzee TSPY cluster 2 at 9.9 Mb (1 gene); bottom panels, chimpanzee TSPY cluster 3 at 13.6 Mb (1 gene) (panTro2). Red circles indicate the region analyzed in Figure 2.
Numbers of TSPY gene and pseudogene copies in the human and chimpanzee reference sequences.
| ∼35 + 1 | 5 | ∼41 | [ | |
| 4 + 1 + 1 | 21 | 27 | [ |
Figure 2Phylogenetic relationships of the two human and three chimpanzee TSPY arrays deduced from comparisons of the flanking sequences circled in Figure 1 (coordinates in text). The arrays, named according to their approximate genomic coordinates, are shown at the bottom, with active TSPY genes indicated by black arrowheads and the flanking region by a red rectangle. The topology of the tree is shown and the number of mutational events on each branch indicated, but the branches are not drawn to scale. Note that no outgroup sequence is available and the tree is unrooted.
Divergence of human and chimpanzee ampliconic gene sequences. All data are from [18]; NA = not applicable.
| 0.935 | 1.504 | |
| 1.705 | NA | |
| 2.188 | 1.422 | |
| 2.728 | 2.511 | |
| 0.794 | 2.888 | |
| 4.298 | 1.579 | |
| 1.626 | NA |
Selective forces on the TSPY amino acid sequence assessed using the dN/dS statistic.
| 71 | 35 | 0.58 | |
| 73 | 34 | 0.61 | |
| 5 | 1 | 1.49 | |
| 73 | 34 | 0.62 | |
| 73 | 35 | 0.61 | |
| 7 | 0 | NA | |