| Literature DB >> 24398377 |
Anne D Yoder1, Lauren M Chan, Mario dos Reis, Peter A Larsen, C Ryan Campbell, Rodin Rasoloarison, Meredith Barrett, Christian Roos, Peter Kappeler, Joseph Bielawski, Ziheng Yang.
Abstract
Vomeronasal receptor genes have frequently been invoked as integral to the establishment and maintenance of species boundaries among mammals due to the elaborate one-to-one correspondence between semiochemical signals and neuronal sensory inputs. Here, we report the most extensive sample of vomeronasal receptor class 1 (V1R) sequences ever generated for a diverse yet phylogenetically coherent group of mammals, the tooth-combed primates (suborder Strepsirrhini). Phylogenetic analysis confirms our intensive sampling from a single V1R subfamily, apparently unique to the strepsirrhine primates. We designate this subfamily as V1Rstrep. The subfamily retains extensive repertoires of gene copies that descend from an ancestral gene duplication that appears to have occurred prior to the diversification of all lemuriform primates excluding the basal genus Daubentonia (the aye-aye). We refer to the descendent clades as V1Rstrep-α and V1Rstrep-β. Comparison of the two clades reveals different amino acid compositions corresponding to the predicted ligand-binding site and thus potentially to altered functional profiles between the two. In agreement with previous studies of the mouse lemur (genus, Microcebus), the majority of V1Rstrep gene copies appear to be intact and under strong positive selection, particularly within transmembrane regions. Finally, despite the surprisingly high number of gene copies identified in this study, it is nonetheless probable that V1R diversity remains underestimated in these nonmodel primates and that complete characterization will be limited until high-coverage assembled genomes are available.Entities:
Keywords: G-protein-coupled receptors; chemosensory genes; gene family evolution; lemurs; olfaction; positive selection
Mesh:
Substances:
Year: 2014 PMID: 24398377 PMCID: PMC3914689 DOI: 10.1093/gbe/evu006
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
FTime-scaled phylogeny for strepsirrhine primates included in this study. Data and 95% credible divergence ages were derived from Horvath et al. (2008). Some nodes do not show age estimates as not all species examined in this study were included in the Horvath et al. (2008) analysis. Branches are colored as follows: Lorisiformes (red), Daubentoniidae (orange), Indriidae (yellow), Cheirogaleidae (blue), and Lemuridae (green). First column on right indicates whether urine washing is practiced by that species. Second column indicates whether the species is nocturnal (N), diurnal (D), or cathemeral (C). Behavioral data were taken from Delbarco-Trillo et al. (2011). Latin binomials with common names of species are O. garnettii (small-eared galago), Galago moholi (mohol bushbaby), Nycticebus pygmaeus (pygmy slow loris), Daubentonia madagascariensis (aye-aye), Propithecus coquereli (Coquerel's sifaka), Propithecus tattersalli (Tattersall's sifaka), Phaner pallescens (pale fork-marked lemur), Cheirogaleus medius (fat-tailed dwarf lemur), C. major (greater dwarf lemur), Allocebus trichotis (hairy-eared dwarf lemur), Microcebus simmonsi (Simmon's mouse lemur), M. murinus (gray mouse lemur), M. griseorufus (gray–brown mouse lemur), Varecia rubra (red ruffed lemur), V. variegata (black and white ruffed lemur), Hapalemur griseus (eastern lesser bamboo lemur), Lemur catta (ring-tailed lemur), Eulemur mongoz (mongoose lemur), and Eulemur collaris (collared brown lemur).
Average Pairwise Genetic Distances within and between Alpha and Beta Lineages of Cheirogaleidae and Lemuridae
| Alpha (%) | Beta (%) | Alpha vs. Beta (%) | |
|---|---|---|---|
| Average nucleotide pairwise genetic distances | |||
| Cheirogaleidae | 11.0 | 8.6 | 13.8 |
| Lemuridae | 5.0 | 1.7 | 13.6 |
| Average amino acid pairwise genetic distances | |||
| Cheirogaleidae | 20.5 | 16.0 | 26.7 |
| Lemuridae | 9.3 | 3.3 | 23.7 |
Note.—Pairwise distances were calculated with BASEML and CODEML (PAML 4.6). Nucleotide sequences were corrected with the F84 + Gamma substitution model; amino acid sequences were corrected with the LG + F + Gamma substitution model. Alpha was fixed at 0.5 in both cases.
FRAxML maximum likelihood tree of V1R sequences generated herein combined with a subset of those identified by Young et al. (2010). (A) V1R sequences originating from strepsirrhine primates (lemurs and lorises) are shown in red. The largest red clade corresponds to the V1Rstrep subfamily. (B) Primate V1R sequences: lemuriforms (blue); lorisiforms (red); tarsiers (yellow); and anthropoids (green).
FMaximum likelihood phylogeny based on DNA sequence data showing the relationships among V1Rstrep gene copies with the α and β clades indicated. Red represents the lorisiform primates sampled herein (Otolemur, Galago, and Nycticebus), orange represents family Daubentoniidae (genus Daubentonia), green indicates the family Lemuridae (genera Lemur, Hapalemur, Eulemur, and Varecia), yellow indicates family Indriidae (genus Propithecus), and blue represents family Cheirogaleidae (genera Microcebus, Cheirogaleus, Phaner, and Allocebus). It is notable that lemurids (green) are far more abundant in the α clade than in the β clade; conversely, indriids (yellow) are more abundant in the β clade than in the α clade. For precise taxonomic identities represented, refer to supplementary table S1, Supplementary Material online.
FMaximum likelihood phylogenies based on clustered DNA sequence data (98% similarity threshold) of the V1Rstrep α and β repertoires for Cheirogaleidae and Lemuridae. Phylogenetic analyses were performed on centroid sequences of clusters sharing less than 98% similarity. Results of RaxML analyses on combined (A) and family specific analyses of Cheirogaleidae (B) and Lemuridae (C) are shown. Nodal support was measured by performing 500 bootstrap iterations using RaxML and PhyML (top percentages, respectively) and with Bayesian posterior probabilities based on 2 million iterations (bottom score). Daubentonia was used as the outgroup for rooting both trees (not shown). NS, no support in all three phylogenetic methods.
FPredicted protein structure and function for amino acid sequences in our alignment using I-TASSER (http://zhanglab.ccmb.med.umich.edu/I-TASSER/, last accessed January 17, 2014). (A) Model of a seven TM G protein-coupled receptor: yellow, TM1; light blue, TM2; green, TM3; purple, TM4; dark blue, TM5; orange, TM6; and dark gray, TM7. Arrow identifies the orientation of the protein with respect to extracellular (EXT) and intracellular (INT) regions. (B) I-TASSER predicted ligand-binding site with putative ligand highlighted in gold. The amino acid variation defining V1Rstrep α and β lineages is highlighted in red (4th and 5th TM regions and second extracellular loop). This region is part of the hypothesized ligand-binding pocket (see Discussion).
Analysis under Site Models
| Model | 2Δ | Mean ω | Parameters |
|---|---|---|---|
| M1a (neutral) | 0.471 | ω0 = 0.233, ω1 = 1, | |
| M2a (selection) | 999.35 | 0.759 | ω0 = 0.243, ω1 = 1, ω2 = 3.10, |
| M7 (beta) | 0.433 | ||
| M8 (beta and ω) | 923.85 | 0.636 |
***P value < 0.001.
Analysis under Branch-Site and Clade Models
| Clade | Branch-Site Models | Clade Model, | |||||
|---|---|---|---|---|---|---|---|
| 2Δ | ω0 | ω2 | |||||
| Lorisiformes | 92.44 | 0.213 | 2.58 | 0.574 | 0.245 | 0.182 | 0.727 |
| 142.36 | 0.237 | 6.43 | 0.642 | 0.265 | 0.093 | 2.34 | |
| β-Cheirogaleidae | 195.69 | 0.236 | 4.65 | 0.648 | 0.321 | 0.031 | 3.73 |
| β- | 338.01 | 0.238 | 5.05 | 0.635 | 0.276 | 0.089 | 4.06 |
| β-Lemuridae | 1.67 | 0.231 | 2.68 | 0.636 | 0.288 | 0.076 | 2.46 |
| α-Cheirogaleidae | 393.60 | 0.236 | 3.71 | 0.593 | 0.342 | 0.065 | 3.81 |
| α- | 0.00 | 0.233 | 1 | 0.689 | 0.311 | 0 | 5.91 |
| α-Lemuridae | 116.64 | 0.230 | 3.85 | 0.627 | 0.268 | 0.105 | 2.96 |
aThe foreground branches are all branches within the specific clade.
bEleven ω values are estimated in the clade model, ω0 = 0.242 and ω1 = 1 are the same for all clades, ωclade is estimated for each clade in particular. For the ancestral branches (those around the root of the tree), ωclade = 3.13.
***P value < 0.001.
Positively Selected Sites for the Foreground Branches Identified under Branch-Site Models, Using BEB
| Clade | Positively Selected Sites |
|---|---|
| Lorisiformes | |
| β-Cheirogaleidae | |
| β- | |
| β-Lemuridae | No sites |
| α-Cheirogaleidae | |
| α- | No sites |
| α-Lemuridae |
Note.—BEB, Bayes Empirical Bayes.
aThe foreground branches are all branches within the specific clade.
bSites have posterior probability > 95%. Sites in bold-type face have posterior probability > 99%.
F(A) Snake diagram of the V1Rstrep protein. TM regions 1–7 (labeled) appear in light gray, extracellular regions in white, and intracellular regions in dark gray. Red amino acid residues identify the start (left) and end (right) of the V1Rstrep sequence data reported herein. (B) Circos diagram (Krzywinski et al. 2009) showing residues under significant positive selection (see table 4) along a generic V1Rstrep protein. White, light gray, and dark gray bands correspond to the extracellular, TM (TM1–TM7), and intracellular regions, respectively (shown in panel A). Red bands in TM2 and TM7 identify start and end of the sequence data reported herein. Ribbons are colored according to taxonomic group and/or V1Rstrep α and β lineages. Outer edge histogram (light purple) identifies position and quantity of positively selected residues within each region.