| Literature DB >> 15869325 |
Rasmus Nielsen1, Carlos Bustamante, Andrew G Clark, Stephen Glanowski, Timothy B Sackton, Melissa J Hubisz, Adi Fledel-Alon, David M Tanenbaum, Daniel Civello, Thomas J White, John J Sninsky, Mark D Adams, Michele Cargill.
Abstract
Since the divergence of humans and chimpanzees about 5 million years ago, these species have undergone a remarkable evolution with drastic divergence in anatomy and cognitive abilities. At the molecular level, despite the small overall magnitude of DNA sequence divergence, we might expect such evolutionary changes to leave a noticeable signature throughout the genome. We here compare 13,731 annotated genes from humans to their chimpanzee orthologs to identify genes that show evidence of positive selection. Many of the genes that present a signature of positive selection tend to be involved in sensory perception or immune defenses. However, the group of genes that show the strongest evidence for positive selection also includes a surprising number of genes involved in tumor suppression and apoptosis, and of genes involved in spermatogenesis. We hypothesize that positive selection in some of these genes may be driven by genomic conflict due to apoptosis during spermatogenesis. Genes with maximal expression in the brain show little or no evidence for positive selection, while genes with maximal expression in the testis tend to be enriched with positively selected genes. Genes on the X chromosome also tend to show an elevated tendency for positive selection. We also present polymorphism data from 20 Caucasian Americans and 19 African Americans for the 50 annotated genes showing the strongest evidence for positive selection. The polymorphism analysis further supports the presence of positive selection in these genes by showing an excess of high-frequency derived nonsynonymous mutations.Entities:
Mesh:
Year: 2005 PMID: 15869325 PMCID: PMC1088278 DOI: 10.1371/journal.pbio.0030170
Source DB: PubMed Journal: PLoS Biol ISSN: 1544-9173 Impact factor: 8.029
Figure 1Distribution of Mutations
The figure shows the number of synonymous and nonsynonymous nucleotide differences in 13,731 human–chimpanzee orthologous gene pairs.
Biological Process Categories with an Excess of Putatively Positively Selected Genes (Nominal p less than 0.05; MWU) among a Total of 133 Biological Process Categories
Note that the categories overlap; e.g., “T-cell-mediated immunity” is entirely nested within “Immunity and defense.”
Test for an Excess of Putatively Positively Selected Genes by Tissue Type
Small p-values (MWU; nominal p-values not corrected for multiple testing) indicate an excess of putatively positively selected genes in the tissue type.
The Top 50 Genes Showing Evidence for Positive Selection
aReference number used in Dataset S1.
bNumber of nonsynonymous differences between humans and chimps.
cNumber of synonymous differences between humans and chimps.
dNumber of nonsynonymous polymorphism in humans.
eNumber of synonymous polymorphism in humans.
fLikelihood ratio from the likelihood ratio test of d/d equals one versus d/d is greater than one in the human–chimp alignment.
Figure 2Frequency Spectra
The figure shows the frequency spectra of nonsynonymous (red) and synonymous (black) mutations among the 50 genes showing the strongest evidence for positive selection in the interspecific comparison. Also shown is the expectation from the standard neutral model, expectations from the neutral model taking the protocol used to select the 50 genes into account (see text), and from the prediction of the selection model. On the x-axis is the number of derived allele in a sample of size 30 chromosomes (Count), and on the y-axis is the proportion of sites expected in the sample with a particular frequency.
Figure 3Power of the Likelihood Ratio Test for Positive Selection
The power is shown as a function of the proportion of the d/d ratio, and for sequence lengths (n) of 150 and 500 codons. Power is defined as the proportion of tests that are significant at the 5% level. Simulation parameters, including codon frequencies, transition/transversion bias, and divergence times, are equal to the values estimated from the data. Notice the logarithmic x-axis.