| Literature DB >> 21551350 |
Seong Hwan Park1, Ondrej Podlaha, Wendy E Grus, Jianzhi Zhang.
Abstract
Vomeronasal sensitivity is important for detecting intraspecific pheromonal cues as well as environmental odorants and is involved in mating, social interaction, and other daily activities of many vertebrates. Two large families of seven-transmembrane G-protein-coupled receptors, V1rs and V2rs, bind to various ligands to initiate vomeronasal signal transduction. Although the macroevolution of V1r and V2r genes has been well characterized throughout vertebrates, especially mammals, little is known about their microevolutionary patterns, which hampers a clear understanding of the evolutionary forces behind the rapid evolutionary turnover of V1r and V2r genes and the great diversity in receptor repertoire across species. Furthermore, the role of divergent vomeronasal perception in enhancing premating isolation and maintaining species identity has not been evaluated. Here we sequenced 44 V1r genes and 25 presumably neutral noncoding regions in 14 wild-caught mice belonging to Mus musculus and M. domesticus, two closely related species with strong yet incomplete reproductive isolation. We found that nucleotide changes in V1rs are generally under weak purifying selection and that only ∼5% of V1rs may have been subject to positive selection that promotes nonsynonymous substitutions. Consistent with the low functional constraints on V1rs, 18 of the 44 V1rs have null alleles segregating in one or both species. Together, our results demonstrate that, despite occasional actions of positive selection, the evolution of V1rs is in a large part shaped by purifying selection and random drift. These findings have broad implications for understanding the driving forces of rapid gene turnovers that are often observed in the evolution of large gene families.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21551350 PMCID: PMC3114644 DOI: 10.1093/gbe/evr039
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FAn unrooted tree of 188 putatively functional mouse V1rs (Shi et al. 2005). Red branches denote the 44 V1rs surveyed in this study. Branches denoted with * have putatively functional rat V1r orthologs (Grus and Zhang 2008). Gene family names are from Rodriguez et al. (2002). The tree was reconstructed using the Neighbor-Joining method (Saitou and Nei 1987) with Poisson-corrected protein distances. The scale bar shows 0.1 amino acid substitution per site.
Intra- and Interspecific Sequence Variations of 44 Mouse V1r Genes
Intra- and Interspecific Sequence Variations of 25 Noncoding Regions in Mice
| Region name1 | Chromos. no. | Nucleotide position | length (nt) | Within | Within | Between species | |||||||||
| θ (×104) | π (×104) | Tajima's D | Fu and Li's D* | θ (×104) | π (×104) | Tajima's D | Fu and Li's D* | ||||||||
| 032 | 3 | 61421835–61422742 | 908 | 1 | 3.5 | 1.6 | −1.16 | −1.40 | 10 | 34.6 | 26.4 | −0.93 | 0.58 | 2 | 85.7 |
| con1 | 5 | 4750295–4751185 | 885 | 4 | 14.2 | 22.4 | 1.16 | 5 | 17.8 | 23.6 | 1.14 | 1.23 | 0 | 44.7 | |
| con2 | 5 | 4177067–4178013 | 948 | 6 | 19.9 | 26.9 | 1.26 | 1.29 | 4 | 13.3 | 11.1 | −0.53 | 1.16 | 0 | 40.0 |
| 062 | 6 | 13370671–13371601 | 930 | 1 | 3.6 | 5.7 | 1.38 | 0.75 | 5 | 16.9 | 17.7 | 0.17 | 0.51 | 9 | 122.8 |
| 063 | 6 | 13335122–13336060 | 940 | 0 | 0.0 | 0.0 | 0.00 | 0.00 | 7 | 23.5 | 32.5 | 1.41 | 0.77 | 7 | 93.5 |
| 071 | 7 | 34198101–34199051 | 951 | 6 | 19.8 | 14.9 | −0.89 | −0.61 | 8 | 26.5 | 29.5 | 0.43 | 1.37 | 9 | 150.0 |
| 072 | 7 | 1485242–1486253 | 1,012 | 2 | 6.2 | 5.2 | −0.44 | 0.94 | 1 | 3.1 | 4.9 | 1.21 | 0.72 | 11 | 117.9 |
| 073 | 7 | 14736902–14737811 | 902 | 15 | 55.1 | 90.7 | 7 | 24.2 | 30.1 | 0.90 | 0.21 | 4 | 148.9 | ||
| 074 | 7 | 13161241–13162059 | 818 | 1 | 3.8 | 1.7 | −1.16 | −1.40 | 10 | 38.4 | 65.4 | 1 | 71.6 | ||
| con3 | 7 | 12877447–12878480 | 1,034 | 3 | 9.1 | 12.4 | 0.07 | 0.30 | 4 | 12.2 | 10.5 | −0.44 | −0.55 | 1 | 26.2 |
| 102 | 10 | 54958829–54959659 | 812 | 0 | 0.0 | 0.0 | 0.00 | 0.00 | 1 | 3.8 | 1.7 | −1.16 | −1.40 | 9 | 111.7 |
| 131 | 13 | 4938296–4939215 | 920 | 2 | 6.8 | 10.5 | 1.46 | 0.94 | 3 | 10.3 | 13.3 | 0.90 | 0.02 | 9 | 117.2 |
| con4 | 13 | 5177266–5178204 | 939 | 3 | 10.0 | 14.4 | 1.32 | 1.07 | 6 | 20.1 | 28.8 | 1.55 | 1.29 | 4 | 81.4 |
| 133 | 13 | 14942772–14943791 | 1,021 | 1 | 3.1 | 5.3 | 1.51 | 0.72 | 7 | 21.6 | 18.1 | −0.60 | 1.33 | 4 | 60.9 |
| con5 | 13 | 10340883–10341849 | 984 | 3 | 9.6 | 8.3 | −0.42 | −1.04 | 8 | 25.5 | 21.9 | −0.53 | 0.86 | 6 | 86.4 |
| con6 | 14 | 12999764–13000706 | 948 | 5 | 16.6 | 15.1 | −0.32 | 0.51 | 2 | 6.6 | 4.3 | −0.96 | −0.45 | 1 | 39.9 |
| con7 | 14 | 55603471–55604414 | 944 | 3 | 10.5 | 5.3 | − | − | 17 | 56.6 | 29.3 | − | − | 1 | 45.9 |
| con8 | 17 | 1418876–1419796 | 921 | 5 | 17.1 | 13.2 | −0.78 | −0.95 | 5 | 17.1 | 13.6 | −0.70 | −0.22 | 2 | 42.7 |
| 173 | 17 | 27883797–27884681 | 885 | 2 | 7.1 | 6.6 | −0.20 | −0.45 | 1 | 3.6 | 5.1 | 0.84 | 0.72 | 15 | 181.6 |
| con9 | 17 | 64305167–64306026 | 860 | 10 | 36.6 | 30.2 | −0.68 | −1.52 | 3 | 11.0 | 6.4 | −1.28 | −1.04 | 0 | 64.8 |
| GGH | 4 | 20166795–201667294 | 499 | 1 | 5.0 | 3.5 | −0.34 | 0.72 | 4 | 18.3 | 19.3 | 0.00 | 0.00 | 0 | 2.9 |
| MELK | 4 | 2162811–2163729 | 918 | 0 | 0.0 | 0.0 | 0.00 | 0.00 | 14 | 50.6 | 75.0 | 1.20 | 5 | 116.2 | |
| NKD1 | 8 | 91403263–91404440 | 1,178 | 8 | 16.9 | 14.9 | −0.87 | −0.22 | 11 | 21.1 | 12.8 | 0.83 | 0.02 | 0 | 26.7 |
| PUM1 | 4 | 130022004–130023187 | 1,183 | 4 | 8.8 | 2.2 | 0.00 | 0.00 | 5 | 10.0 | 3.9 | −1.16 | −1.40 | 0 | 7.9 |
| SFRP1 | 8 | 2480594–2481312 | 1,071 | 1 | 3.0 | 5.0 | 1.43 | 0.72 | 12 | 30.1 | 24.3 | −0.75 | −0.68 | 10 | 122.0 |
| Mean | 936.4 | 3.5 | 11.7 | 12.6 | 0.17 | 0.04 | 160 | 20.7 | 21.2 | 0.13 | 0.21 | 4 | 80.4 | ||
NOTE.—1Most noncoding regions do not have standard names, and the names listed are the idendification numbers used in our laboratory.
Nucleotide positions in the mouse genome sequence of National Center for Biotechnology Information Build 37.
Length in M. musculus. Length in M. domesticus may be slightly different due to indels.
Number of polymorphic sites.
Watterson's polymorphism per site.
Nucleotide diversity per site.
Values significantly different from 0 at the 5% level are underlined. Significance is determined by 10,000 coalescent simulations.
Number of fixed nucleotide differences between species.
Mean number of nucleotide differences between species.
FFrequency distributions of Tajima's D and Fu and Li's D* among 44 V1rs and 25 noncoding regions in mouse. (A) Tajima's D in M. musculus; (B) Tajima's D in M. domesticus; (C) Fu and Li's D* in M. musculus; and (D) Fu and Li's D* in M. domesticus.
Numbers of Synonymous and Nonsynonymous Sequence Variations in V1rs
| Nonsynonymous/synonymous | Nonsynonymous/(noncoding + synonymous) | ||||||
| Nonsynonymous | Synonymous | Noncoding1 | Ratio | Ratio | |||
| Polymorphisms in | 111 | 97 | 80 | 1.144 | 0.774 | 0.627 | <0.001 |
| Polymorphisms in | 102 | 56 | 144 | 1.821 | 0.068 | 0.510 | <0.001 |
| High-frequency | 78 | 77 | 65 | 1.013 | 1.000 | 0.549 | <0.001 |
| High-frequency | 62 | 44 | 104 | 1.409 | 0.330 | 0.419 | 0.01 |
| Fixed differences between the two species | 32 | 31 | 110 | 1.032 | 0.227 | ||
NOTE.—1Variations in the 25 noncoding regions.
P value is from Fisher's exact test in comparison with fixed differences.
Frequency of the derived allele is equal to or greater than 2 of 14.
Comparison Between V1rs and the 25 Noncoding Regions by the HKA Test
| A1 | 1.95 | 2.6E-03 | 1.63 | 3.5E-01 |
| A9 | 1.5E-04 | 1.4E-04 | ||
| B1 | 2.11 | 5.0E-04 | 3.31 | 9.1E-01 |
| B2 | 2.41 | 4.4E-03 | 2.76 | 8.5E-01 |
| B3 | 2.35 | 1.0E-01 | 1.01 | 1.5E-02 |
| B7 | 4.32 | 8.5E-01 | 2.16 | 8.7E-01 |
| B8 | 1.6E-03 | 2.4E-03 | ||
| C5 | 3.2E-10 | 2.1E-03 | ||
| C6 | 5.13 | 3.3E-01 | 2.85 | 8.3E-01 |
| C21 | 2.73 | 9.5E-01 | 1.36 | 9.9E-01 |
| C27 | 1.35 | 2.9E-07 | 6.42 | 5.9E-01 |
| C28 | 1.09 | 1.2E-07 | 1.96 | 6.9E-01 |
| C30 | 4.44 | 7.7E-01 | 1.48 | 2.4E-01 |
| D6 | 1.65 | 1.7E-03 | 2.20 | 9.6E-01 |
| D7 | 12.61 | 7.0E-02 | 5.04 | 8.6E-01 |
| D13 | 1.56 | 1.8E-01 | 0.45 | 1.7E-05 |
| E1 | 1.05 | 5.9E-08 | 4.20 | 9.7E-01 |
| E2 | 4.43 | 7.7E-01 | 4.43 | 9.5E-01 |
| E3 | 0.76 | 4.5E-09 | 4.54 | 4.0E-01 |
| E4 | 2.14 | 4.5E-03 | 3.56 | 9.8E-01 |
| E7 | 1.97 | 4.2E-02 | 5.91 | 1.6E-01 |
| E8 | 1.14 | 3.4E-05 | 0.91 | 2.8E-02 |
| E11 | 1.66 | 1.9E-03 | 1.11 | 4.9E-02 |
| F1 | 0.41 | 2.2E-11 | 1.02 | 9.5E-01 |
| F2 | 0.37 | 1.7E-14 | 1.24 | 6.4E-01 |
| F3 | 0.87 | 7.2E-10 | 1.5E-03 | |
| F4 | 1.38 | 1.8E-05 | 4.15 | 9.7E-01 |
| F5 | 1.76 | 1.4E-03 | 4.2E-06 | |
| G1 | 2.99 | 1.2E-01 | 0.54 | 9.3E-06 |
| G6 | 1.11 | 1.1E-08 | 3.16 | 9.0E-01 |
| G9 | 1.56 | 2.5E-05 | 4.16 | 9.6E-01 |
| G11 | 1.14 | 6.4E-04 | 1.14 | 5.0E-01 |
| G12 | 2.40 | 3.2E-03 | 4.32 | 9.2E-01 |
| H4 | 8.52 | 4.2E-01 | 3.41 | 9.4E-01 |
| H15 | 4.69 | 5.2E-01 | 1.17 | 1.6E-02 |
| H19 | 2.05 | 1.2E-02 | 2.05 | 8.4E-01 |
| H20 | 1.23 | 3.2E-07 | 3.07 | 9.6E-01 |
| I1 | 1.70 | 1.0E+00 | 0.34 | 3.6E-05 |
| I6 | 4.09 | 4.7E-01 | 3.07 | 9.6E-01 |
| I7 | 2.12 | 2.8E-03 | 4.24 | 9.6E-01 |
| J2 | 1.48 | 2.2E-05 | 2.21 | 8.8E-01 |
| J3 | 1.91 | 5.4E-04 | 1.8E-04 | |
| K1 | 1.29 | 1.4E-04 | 1.72 | 8.9E-01 |
| L1 | 0.76 | 4.5E-09 | 0.76 | 3.0E-03 |
| All | 2.59 | 1.5E-96 | 3.63 | 6.9E-05 |
| 25 noncoding regions | 7.30 | 4.20 | ||
NOTE.—1Interspecific divergence per site divided by Watterson's polymorphism per site. The actual number of polymorphc site plus 1 was used in calculating Watterson's polymorphism per site. Values of d/θ that are significantly greater than expected from the 25 noncoding regions are underlined.
P values are from chi-squares tests.
Mouse V1rs With Segregating Null Alleles1
| A9 | 3 (deletion) | |
| B1 | 8 (deletion) | |
| B2 | 1 (deletion) | |
| C6 | 4 (SNP) | |
| C27 | 5 (SNP) + 1 (deletion) | |
| C28 | 4 (no amplification) | 1 (SNP) |
| E1 | 2 (deletion) | |
| F1 | 1 (deletion) + 2 (deletion) | |
| F3 | 10 (no amplification) + 2 (SNP) | |
| F5 | 1 (deletion) | |
| H4 | 1 (insertion) | |
| H15 | 6 (SNP) | |
| I6 | 2 (insertion) | 1 (insertion) + 1 (SNP) |
| I7 | 2 (deletion) | 1 (insertion) |
| J2 | 2 (SNP) | |
| J3 | 1 (SNP) | |
| K1 | 1 (SNP) + 2 (deletion) | |
| L1 | 1 (SNP) |
NOTE.—Numbers in the table are the numbers of null alleles (of 14 per species) for V1rs that harbor null alleles in the species. The type of null mutations is also indicated.