Literature DB >> 30456442

Genes Encoding Mammalian Oviductal Proteins Involved in Fertilization are Subjected to Gene Death and Positive Selection.

Carla Moros-Nicolás1, Sophie Fouchécourt1, Ghylène Goudet1, Philippe Monget2.   

Abstract

Oviductal proteins play an important role in mammalian fertilization, as proteins from seminal fluid. However, in contrast with the latter, their phylogenetic evolution has been poorly studied. Our objective was to study in 16 mammals the evolution of 16 genes that encode oviductal proteins involved in at least one of the following steps: (1) sperm-oviduct interaction, (2) acrosome reaction, and/or (3) sperm-zona pellucida interaction. Most genes were present in all studied mammals. However, some genes were lost along the evolution of mammals and found as pseudogenes: annexin A5 (ANXA5) and deleted in malignant brain tumor 1 (DMBT1) in tarsier; oviductin (OVGP1) in megabat; and probably progestagen-associated endometrial protein (PAEP) in tarsier, mouse, rat, rabbit, dolphin, and megabat; prostaglandin D2 synthase (PTGDS) in microbat; and plasminogen (PLG) in megabat. Four genes [ANXA1, ANXA4, ANXA5, and heat shock 70 kDa protein 5 (HSPA5)] showed branch-site positive selection, whereas for seven genes [ANXA2, lactotransferrin (LTF), OVGP1, PLG, S100 calcium-binding protein A11 (S100A11), Sperm adhesion molecule 1 (SPAM1), and osteopontin (SPP1)] branch-site model and model-site positive selection were observed. These results strongly suggest that genes encoding oviductal proteins that are known to be important for gamete fertilization are subjected to positive selection during evolution, as numerous genes encoding proteins from mammalian seminal fluid. This suggests that such a rapid evolution may have as a consequence that two isolated populations become separate species more rapidly.

Entities:  

Keywords:  Fertilization; Oviduct; Positive selection; Protein evolution; Pseudogene

Mesh:

Substances:

Year:  2018        PMID: 30456442      PMCID: PMC6267676          DOI: 10.1007/s00239-018-9878-0

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


Introduction

The oviduct, also called the Fallopian tube in human, is a tubular organ that plays an important role in mammalian fertilization. The oviduct is the location where fertilization takes place in amniotes. It is involved in gamete transportation and maturation, sperm capacitation, and it provides the appropriate microenvironment for the early embryo development (Coy et al. 2012a; Hunter 1998, 2012). The oviductal fluid (OF) is formed by the secretion of the epithelial cells and also results from a transudate of the blood plasma, and contains glycosaminoglycans, metabolites, amino acids, inorganic salts, and proteins (Aviles et al. 2010; Leese et al. 2008, 2001). The proteome of the OF has been studied in several mammalian species, such as cow (Lamy et al. 2016), ewe (Soleilhavoup et al. 2016), mare (Smits et al. 2017), pig (Georgiou et al. 2005; Mondejar et al. 2012), and rabbit (Yu et al. 2016). It has been demonstrated that the oviductal proteome changes with the different stages of the oestrous cycle (Lamy et al. 2016; Soleilhavoup et al. 2016), the presence of gametes (Georgiou et al. 2005) or the presence of embryos (Smits et al. 2017). The proteome of the oviduct epithelial cells has also been studied in humans (Wang et al. 2016) and pigs (Seytanoglu et al. 2008). We have chosen to study the evolution of 16 genes that have been demonstrated to be crucial for the biological function of oviduct. Oviductal proteins, like oviductin (OVGP1), annexins (ANXA), heat shock proteins (HSP), deleted in malignant brain tumor (DMBT1), or osteopontin (SPP1) among others have been extensively studied (Coy and Yanagimachi 2015). These proteins are involved in different steps that are crucial for fertilization. For instance, annexins and DMBT1 participate in the binding between the sperm and the oviduct (Ignotz et al. 2007; Teijeiro et al. 2011). Progestagen-associated endometrial protein (PAEP) also known as glycodelin (Gd), lactotransferrin (LTF), heat shock 70 kDa protein 5 (HSPA5), OVGP1, prostaglandin D2 synthase (PTGDS), plasminogen (PLG), and SPP1 participate in the zona pellucida–sperm interaction controlling polyspermy (Algarra et al. 2016; Chiu et al. 2007; Coy et al. 2008, 2012b; Goncalves et al. 2008a; Hao et al. 2006; Lachance et al. 2007; Marin-Briggiler et al. 2010; Mondejar et al. 2013; Zumoffen et al. 2013). Heat shock 70 kDa protein 8 (HSPA8) has a beneficial effect on sperm viability (Elliott et al. 2009; Lloyd et al. 2009; Moein-Vaziri et al. 2014). Natriuretic peptide A (NPPA) and glycodelin-A (GdA) induce the acrosome reaction (Chiu et al. 2010; Zhang et al. 2006). Sperm adhesion molecule 1 (SPAM1) plays a role in the dispersion of the cumulus cells (Griffiths et al. 2008) and S100 calcium-binding protein A11 (S100A11) plays a role in sperm selection through its action on cumulus cells (Hanaue et al. 2011). It is well known that genes related to reproduction evolve faster than genes expressed in most other tissues (Singh and Kulathinal 2000; Swanson and Vacquier 2002; Turner and Hoekstra 2008; Vacquier 1998); genes subjected to rapid evolution show a big percentage of amino acids substitution between species. For example, some genes involved in oocyte–sperm interaction were highlighted as ZP2, ZP3, or acrosine (Makalowski and Boguski 1998). Rapid evolution can also be associated to a loss of functionality, sometimes leading to pseudogenization (Bodmer and Ashburner 1984; Hellberg and Vacquier 1999; Li et al. 1981), or it can be related to an adaptive evolution induced by natural selection (Swanson and Vacquier 2002). This rapid evolution can be species-specific and it could even play a role in speciation, as it has been described for instance for CD9 in the oocyte and IZUMO1 in the sperm (Claw et al. 2014). Several studies have been focused on the positive selection of the seminal plasma proteins in Drosophila, in rodents, and in primates (Meslin et al. 2015). However, to our knowledge, there are no studies focusing on the evolution and an eventual positive selection of the genes encoding oviductal proteins in any mammalian species. For this reason, the aim of this work was to study the evolution of these genes in mammals.

Materials and Methods

Phylogenetic and Syntenic Analyses

Sixteen oviductal proteins (ANXA1, ANXA2, ANXA4, ANXA5, DMBT1, PAEP, HSPA5, HSPA8, LTF, NPPA, OVGP1, PLG, PTGDS, SPAM1, SPP1, and S100A11), described in the literature as important for fertilization, were chosen for the analysis. On this study, the genomes of 16 placental mammal species were sampled; different species were chosen due to the availability of their genome and to include species from all the phylogenetic tree of mammals (Ailuropoda melanoleuca, Bos taurus, Canis lupus familiaris, Equus caballus, Felis catus, Homo sapiens, Myotis lucifugus, Mus musculus, Oryctolagus cuniculus, Ovis aries, Pongo abelii, Pteropus vampyrus, Rattus norvegicus, Sus scrofa, Tarsius syrichta, and Tursiops truncatus). The version of Ensembl used to perform the analyses was Ensembl 87: Dec 2016, and the following versions of genomes were used: cat (Felis_catus_6.2), cow (UMD3.1), horse (EquCab2), human (GRCh38.p7), dog (CanFam3.1), dolphin (turTru1), megabat (pteVam1), microbat (Myoluc2.0), mouse (GRCm38.p5), orangutan (PPYG2), panda (ailMel1), pig (Sscrofa10.2), rabbit (OryCun2.0), rat (Rnor_6.0), sheep (Oar_v3.1), and tarsier (tarSyr1). For all identified genes, we used the PhyleasProg web server v3.1 (http://phyleasprog.inra.fr/) (Busset et al. 2011) to study the evolution of the corresponding Ensembl protein ID (Table 1), retrieved from the Ensembl database. We carefully examined all reconstructed phylogenetic trees before the interpretation of the results, that we eventually corrected by synteny analysis as previously described (Tian et al. 2009), so that calculations were performed with correct orthologs.
Table 1

List of the 16 proteins studied. Human sequences of the proteins were used for analyses

ProteinAbbreviationProtein accession number
Annexin 1ANXA1ENSP00000366109
Annexin 2ANXA2ENSP00000346032
Annexin 4ANXA4ENSP00000377833
Annexin 5ANXA5ENSP00000296511
Deleted in malignant brain tumorDMBT1ENSP00000357905
Heat shock 70 kDa protein 5HSPA5ENSP00000324173
Heat shock 70 kDa protein 8HSPA8ENSP00000432083
LactotransferrinLTFENSP00000231751
Natriuretic peptide ANPPAENSP00000365663
OviductinOVGP1ENSP00000358747
Progestagen-associated endometrial proteinPAEPENSP00000417898
PlasminogenPLGENSP00000308938
Prostaglandin D2 synthasePTGDSENSP00000360687
Sperm adhesion molecule 1SPAM1ENSP00000345849
OsteopontinSPP1ENSP00000237623
S100 calcium-binding protein A11S100A11ENSP00000271638
List of the 16 proteins studied. Human sequences of the proteins were used for analyses

Identification of Pseudogenes—Inference of Positive Selection

We systematically performed the search for pseudogenes by using tBlastn in the studied genomes for genes for which one of the species of interest presented no ortholog. As previously described for seminal plasma genes, this methodology allowed us to test the hypothesis that evolution of oviductal genes in mammals is characterized by a gene loss pattern (Meslin et al. 2011, 2015; Tian et al. 2009). The pseudogene status was inferred in a genome if we found a premature stop codon or an indel in the sequence identified by the similarity search in the syntenic locus in comparison with the other species of interest. In the cases of absent genes without pseudogene observed in the synteny locus, we could only make the hypothesis that the gene has been lost. The PhyleasProg web server used the CODEML application from the PAML package version 4.4 to investigate selective pressure (Yang 2007). This allows the calculation of the ratio dN/dS which varied across codons and the estimation of the probability for each codon to be under positive selection. We used MUSCLE and PAL2NAL softwares to obtain the alignments (Thompson et al. 1994; Suyama et al. 2006). We systematically and carefully examined multiple alignments to avoid false-positive results. In particular, we did not consider amino acids that were at the boundary of the alignments. We also systematically eliminated genes that presented sequence errors in Ensembl according to other databases such as RefSeq in NCBI, in particular for ANXA1 and PLG in rabbit; OVG1 in horse, megabat, and mouse; and SPAM1 in megabat. Site Models implemented in PAML were used to evaluate if the intensity of selective pressure varies among sites in the sequences studied (Nielsen and Yang 1998). This allowed the estimation of ω ratio to vary among sites (Nielsen and Yang 1998; Yang 2000). Five models and three comparisons are used in PhyleasProg: M1a (0 < ω0 < 1 and ω1 = 1) versus M2a (0 < ω0 < 1, ω1 = 1 and ω2 > 1) (Wong et al. 2004; Yang et al. 2005), M7 (0 < ω < 1) versus M8 (0 < ω < 1 and ωS > 1) (Yang 2000), and M8 versus M8a (0 < ω < 1 and ωS = 1) (Swanson et al. 2003). We used likelihood ratio tests (LRTs) to compare the log likelihood values (Nielsen and Yang 1998). We used Bayes Empirical Bayes (BEB) method (Yang et al. 2005) implemented in PAML to estimate posterior probabilities of selection on each codon. Probabilities higher than 0.95 were considered significant. Studying whether one or more species have undergone selection pressure on some of their genes during evolution has led us to use the Phyleasprog tool using branch-site models of PAML (Yang and Nielsen 2002; Zhang et al. 2005). These models can detect positive selection episodes on specific branches of the phylogenetic tree, designated by the user. Two types of branches are defined in these models, the so-called foreground branches for which positive selection will be allowed, (defined a priori by the user), and the so-called background branches, for which the dN/dS ratio is allowed to vary only between 0 and 1. Two models are used, one model allowing positive selection, called alternative model, and the other not allowing it, called null model. In the latter model, the dN/dS ratio ω2 is fixed to 1. For the alternative model, three classes are defined: ω0: dN/dS < 1, ω1: dN/dS = 1, and ω2: dN/dS ≥ 1. As for the site model, LRT (Nielsen and Yang 1998) and BEB (Yang et al. 2005) were used. We tested simultaneously each branch of each phylogenetic tree for positive selection. The q value was used to control the statistical evidence associated with each branch tested because multiple-hypothesis tests were performed. Similar to the p value, we used the q value to measure the significance in terms of false discovery rate rather than false-positive rate. We used R package QVALUE to compute the q values (Storey and Tibshirani 2003). We considered positive selection on the foreground branch to be significant with a threshold of q < 10% of false positives. After validation of the branch with the q value, we only considered sites with posterior probabilities of Bayes Empirical Bayes analyses superior to 95% or 99%. We discarded from subsequent analysis datasets with less than ten sequences, which is the minimum threshold required to obtain significant results, with excessively divergent sequences, or with sequences of genes for which annotations are not reliable.

Results

Identification of Pseudogenes and Paralogs

The present search for pseudogenes showed that ANXA5 and DMBT1 were lost in tarsier; OVGP1 in megabat; and probably PAEP in tarsier, mouse, rat, rabbit, dolphin, and megabat; PTGDS in microbat; and PLG in megabat (examples of traces of pseudogenes in Fig. 1; pseudogenes and species affected in Fig. 2).
Fig. 1

Identification of marks of pseudogenes. A tBlastn analysis allowed the identification of exons presenting STOP codons (*) within tarsier ANXA5, tarsier DMBT1, and megabat OVGP1. For each alignment, the upper number corresponds to the amino acid position and the lower number to the genomic position of nucleotides on the corresponding chromosome

Fig. 2

Proteins present in oviductal fluids and phylogenetic results showing gene loss, gene duplication, and positive selection. The list of oviductal proteins involved in the different processes of sperm storage and gamete fertilization in different mammals was established in the “Material and Methods” section

Identification of marks of pseudogenes. A tBlastn analysis allowed the identification of exons presenting STOP codons (*) within tarsier ANXA5, tarsier DMBT1, and megabat OVGP1. For each alignment, the upper number corresponds to the amino acid position and the lower number to the genomic position of nucleotides on the corresponding chromosome Proteins present in oviductal fluids and phylogenetic results showing gene loss, gene duplication, and positive selection. The list of oviductal proteins involved in the different processes of sperm storage and gamete fertilization in different mammals was established in the “Material and Methods” section Paralogues (n) were found for the next genes: OVGP1 in sheep (1), cow (2), and pig (1); HSPA8 in rat (3) and dog (1); and SPAM1 in mouse (1), rat (1), horse (1), cow (1), and sheep (1) (Fig. 2).

Inference of Positive Selection

Five genes did not show positive selection: DMBT1, HSPA8, NPPA, PAEP, and PTGDS. Seven genes showed site model positive selection: ANXA2, LTF, OVGP1, PLG, S100A11, SPAM1, and SPP1; and 11 genes showed branch model positive selection: ANXA1, ANXA2, ANXA4, ANXA5, HSPA5, LTF, OVGP1, PLG, S100A11, SPAM1, and SPP1. Site model positive selection was found for OVGP1, PLG, S100A11, and SPP1 with the M8a versus M8 comparison, whereas LTF and SPAM1, LRTs for both comparisons (M1 vs. M2, M8a vs. M8) were significant (p < 0.001) for the dataset. However, as these genes evolve particularly rapidly, the protein sequences are very divergent between species, and the identification of amino acids that are under positive selection was very difficult. So, we decided not to include this part of result because they are not confident. The branch model showed positive selection for ANXA1, ANXA 2, ANXA4, ANXA5, HSPA5, LTF, OVGP1, PLG, S100A11, SPAM1, and SPP1 genes. The comparison between site models of PAML detects positive selection only if the ω ratio averaged over all branches on the tree is greater than 1, but it can also be expected positive selection to affect only a few amino acid residues along particular lineages. So, the branch-site models were used (Yang and Nielsen 2002), which are dedicated to detect such signals of local episodic positive selection, to determine whether different species have undergone selective pressure. Sixteen species were tested as foreground branch with branch-site models of PAML, branches that were tested are indicated in Fig. 2. Significant LRTs (with at least p < 0.05) were found for ANXA1 in dolphin; ANXA 2 and OVGP1 in dog; ANXA4 in ruminants (cow and sheep); ANXA5 in sheep; HSPA5 in rabbit; PLG in pig; S100A11 in human, orangutan, and rat; SPP1 in microbat, megabat, horse, dog, panda, cat, pig, dolphin, cow, sheep, and mouse; and LTF and SPAM1 in all the studied species suggesting that orthologs of these genes might have been subjected to positive selection in these species (Table 2).
Table 2

Parameter estimates and likelihood scores for branch-site evolutionary models

SpeciesModel ɭ a Estimates of parametersb2∆lcPositively selected sites (BEB)d
ANXA1—parameter estimates and likelihood scores for branch-site models for species
 DolphinNull− 2590.515664 ρ 0 = 0.557, (ρ1 = 0.123), ω0 = 0.046, (ω1 = 1)3.915*Not allowed
Alternative− 2588.557995 ρ 0 = 0.750, ρ1 = 0.160, (ρ2 = 0.087), ω0 = 0.049, (ω1 = 1), ω2 = 7.2751 Site p > 95%: 5N
ANXA2—parameter estimates and likelihood scores for branch-site models for species
 DogNull− 3141.701658 ρ 0 = 0.312, (ρ1 = 0.007), ω0 = 0.071, (ω1 = 1)6.423*Not allowed
Alternative− 3138.489876 ρ 0 = 0.535, ρ1 = 0.011, (ρ2 = 0.072), ω0 = 0.072, (ω1 = 1), ω2 = 2.7268 Sites p > 99%: 32A, 108A109E, 110S, 111L, 172I, 249H, 281Q. 16 sites p > 95%: 80T, 143P, 159T, 162P, 184W, 190W, 209T, 235G, 242G, 246Q, 282I
ANXA4—parameter estimates and likelihood scores for branch-site models for species
 Ruminants (cow, sheep)Null− 2537.951438 ρ 0 = 0.886, (ρ1 = 0.074), ω0 = 0.034, (ω1 = 1)4.451*Not allowed
Alternative− 2535.725639 ρ 0 = 0.915, ρ1 = 0.074, (ρ2 = 0.009), ω0 = 0.034, (ω1 = 1), ω2 = 18.872Cow: 1 site p > 95%: 216ASheep: 1 site p > 95%: 217T
ANXA5—parameter estimates and likelihood scores for branch-site models for species
 SheepNull− 2580.297972 ρ 0 = 0.734, (ρ1 = 0.063), ω0 = 0.018, (ω1 = 1)3.937*Not allowed
Alternative− 2578.329334 ρ 0 = 0.903, ρ1 = 0.077, (ρ2 = 0.018), ω0 = 0.018, (ω1 = 1), ω2 = 29.2981 Site p > 95%: 309R
HSPA5—parameter estimates and likelihood scores for branch-site models for species
 RabbitNull− 5737.704417 ρ 0 = 0.962, (ρ1 = 0.017), ω0 = 0.001, (ω1 = 1)10.735**Not allowed
Alternative− 5732.336651 ρ 0 = 0.975, ρ1 = 0.016, (ρ2 = 0.007), ω0 = 0.001, (ω1 = 1), ω2 = 998.9991 Site p > 95%: 329S
LTF—parameter estimates and likelihood scores for branch-site models for species
 MammalsNull− 12709.754534 ρ 0 = 0.632, (ρ1 = 0.169), ω0 = 0.108, (ω1 = 1)90.879***Not allowed
Alternative− 12664.314683 ρ 0 = 0.605, ρ1 = 0.357, (ρ2 = 0.036), ω0 = 0.106, (ω1 = 1), ω2 = 3.836Rabbit: 4 sites p > 99%: 32P, 39S, 62L, 642A; 3 sites p > 95%: 36A, 61A, 183QMouse: 4 sites p > 99%: 31N, 38L, 61R, 641Q; 3 sites p > 95%: 35E, 60T, 182DRat: 4 sites p > 99%: 32R, 39F, 62P, 641C; 3 sites p > 95%: 36Q, 61M, 182STarsier: 4 sites p > 99%: 32N, 39Y, 62R, 534R; 3 sites p > 95%: 36T, 61V, 183ROrangutan: 4 sites p > 99%: 34Q, 41F, 64T, 646S; 3 sites p > 95%: 38T, 63P, 185GHuman: 4 sites p > 99%: 32Q, 39F, 62I, 644S; 3 sites p > 95%: 36T, 61P, 183GDog: 4 sites p > 99%: 32K, 39S, 62Q, 642T; 3 sites p > 95%: 36K, 61R, 183KPanda: 4 sites p > 99%: 32K, 39S, 62Q, 640A; 3 sites p > 95%: 36A, 61H, 183RCat: 4 sites p > 99%: 32Q, 39T, 62Q, 645D; 3 sites p > 95%: 36T, 61R, 183KMicrobat: 4 sites p > 99%: 32P, 39F, 62T, 644R; 3 sites p > 95%: 36T, 61Y, 183RMegabat: 4 sites p > 99%: 32K, 39S, 62K, 641S; 3 sites p > 95%: 36A, 61H, 183THorse: 4 sites p > 99%: 32P, 39A, 62F, 642P; 3 sites p > 95%: 36A, 61S, 183KPig: 4 sites p > 99%: 32T, 39R, 60T, 638K; 3 sites p > 95%: 36S, 59P, 178NDolphin: 4 sites p > 99%: 32L, 39Y, 62F, 627K; 3 sites p > 95%: 36L, 61R, 183KCow: 4 sites p > 99%: 32Q, 39R, 62L, 642K; 3 sites p > 95%: 36F, 61A, 183QSheep: 4 sites p > 99%: 32P, 39Y, 62L, 642K; 3 sites p > 95%: 36S, 61A, 183K
 Artiodactyls (pig, dolphin, cow, sheep)Null− 12706.881324 ρ 0 = 0.608, (ρ1 = 0.337), ω0 = 0.104, (ω1 = 1)10.855**Not allowed
Alternative− 12701.453814 ρ 0 = 0.622, ρ1 = 0.354, (ρ2 = 0.022), ω0 = 0.108, (ω1 = 1), ω2 = 6.298Dolphin: 1 site p > 95%: 583VSheep: 1 site p > 95%: 597T
 PigNull− 12707.737211 ρ 0 = 0.587, (ρ1 = 0.333), ω0 = 0.106, (ω1 = 1)18.188***Not allowed
Alternative− 12698.643012 ρ 0 = 0.621, ρ1 = 0.356, (ρ2 = ), ω0 = 0.022 0.107, (ω1 = 1), ω2 = 999.0001 Site p > 95%: 49N
 DolphinNull− 12709.096501 ρ 0 = 0.594, (ρ1 = 0.338), ω0 = 0.108, (ω1 = 1)7.501**Not allowed
Alternative− 12705.345589 ρ 0 = 0.629, ρ1 = 0.353, (ρ2 = 0.015), ω0 = 0.109, (ω1 = 1), ω2 = 16.0851 Site p > 95%: 507D
 Rodents (mouse, rat)Null− 12694.848599 ρ 0 = 0.543, (ρ1 = 0.306), ω0 = 0.091, (ω1 = 1)4.198*Not allowed
Alternative− 12692.749587 ρ 0 = 0.586, ρ1 = 0.327, (ρ2 = 0.085), ω0 = 0.097, (ω1 = 1), ω2 = 2.516Mouse: 1 site p > 99%: 180QRat: 1 site p > 99%: 180Q
 RatNull− 12694.360640 ρ 0 = 0.481, (ρ1 = 0.260), ω0 = 0.100, (ω1 = 1)21.139***Not allowed
Alternative− 12683.791136 ρ 0 = 0.561, ρ1 = 0.300, (ρ2 = 0.136), ω0 = 0.101, (ω1 = 1), ω2 = 5.6621 Site p > 99%: 79S; 5 sites p > 95%: 86F, 145A, 186R, 514F, 696I
 Chiroptera (microbat, megabat)Null− 12709.210769 ρ 0 = 0.614, (ρ1 = 0.350), ω0 = 0.106, (ω1 = 1)7.019**Not allowed
Alternative− 12705.700835 ρ 0 = 0.630, ρ1 = 0.355, (ρ2 = 0.012), ω0 = 0.109, (ω1 = 1), ω2 = 11.651Microbat: 1 site p > 95%: 12LMegabat: 1 site p > 95%: 12W
OVGP1—parameter estimates and likelihood scores for branch-site models for species
 DogNull− 4378.052266 ρ 0 = 0.519, (ρ1 = 0.239), ω0 = 0.074, (ω1 = 1)3.908*Not allowed
Alternative− 4376.097858 ρ 0 = 0.621, ρ1 = 0.279, (ρ2 = 0.098), ω0 = 0.076, (ω1 = 1), ω2 = 4.4541 Site p > 95%: 70T
PLG—parameter estimates and likelihood scores for branch-site models for species
 PigNull− 13182.998657 ρ 0 = 0.775, (ρ1 = 0.197), ω0 = 0.072, (ω1 = 1)6.215*Not allowed
Alternative− 13179.890971 ρ 0 = 0.792, ρ1 = 0.199, (ρ2 = 0.006), ω0 = 0.073, (ω1 = 1), ω2 = 21.8051 Site p > 95%: 620K
S100A11—parameter estimates and likelihood scores for branch-site models for species
 Primates (human, orangutan)Null− 1468.549296 ρ 0 = 0.795, (ρ1 = 0.153), ω0 = 0.044, (ω1 = 1)4.798*Not allowed
Alternative− 1466.149841 ρ 0 = 0.825, ρ1 = 0.158, (ρ2 = 0.015), ω0 = 0.044, (ω1 = 1), ω2 = 19.752Human: 1 site p > 95%: 78SOrangutan: 1 site p > 95%: 78S
 RatNull− 1464.292877 ρ 0 = 0.697, (ρ1 = 0.128), ω0 = 0.038, (ω1 = 1)6.934**Not allowed
Alternative− 1460.825580 ρ 0 = 0.811, ρ1 = 0.150, (ρ2 = 0.036), ω0 = 0.041, (ω1 = 1), ω2 = 45.0871 Site p > 95%: 8L
SPAM1—parameter estimates and likelihood scores for branch-site models for species
 MammalsNull− 9332.838271 ρ 0 = 0.486, (ρ1 = 0.222), ω0 = 0.088, (ω1 = 1)55.457***Not allowed
Alternative− 9305.109656 ρ 0 = 0.461, ρ1 = 0.470, (ρ2 = 0.067), ω0 = 0.086, (ω1 = 1), ω2 = 2.833Rabbit: 3 sites p > 99%: 121Q, 131L, 403GMouse: 3 sites p > 99%: 121S, 131L, 401QRat: 3 sites p > 99%: 121S, 131H, 403RTarsier: 3 sites p > 99%: 122K, 132S, 403IOrangutan: 3 sites p > 99%: 122D, 132L, 403DHuman: 3 sites p > 99%: 122D, 132T, 403DMicrobat: 3 sites p > 99%: 121A, 131E, 403KDog: 3 sites p > 99%: 122K, 132S, 404VPanda: 3 sites p > 99%: 122K, 132A, 404DCat: 3 sites p > 99%: 122K, 132S, 404VPig: 3 sites p > 99%: 122R, 132L, 403ADolphin: 3 sites p > 99%: 122E, 132S, 404QCow: 3 sites p > 99%: 123S, 133A, 405MSheep: 3 sites p > 99%: 122N, 132A, 404MHorse: 3 sites p > 99%: 118A, 128S, 399K
 Rodents (mouse, rat, rabbit) and primates (human, orangutan, tarsier)Null− 9332.291017 ρ 0 = 0.481, (ρ1 = 0.494), ω0 = 0.085, (ω1 = 1)20.661***Not allowed
Alternative− 9321.960143 ρ 0 = 0.474, ρ1 = 0.485, (ρ2 = 0.039), ω0 = 0.087, (ω1 = 1), ω2 = 4.121Rabbit: 1 site p > 95%: 131LMouse: 1 site p > 95%: 131LRat: 1 site p > 95%: 131HTarsier: 1 site p > 95%: 132SOrangutan: 1 site p > 95%: 132LHuman: 1 site p > 95%: 132T
 MicrobatNull− 9328.404724 ρ 0 = 0.415, (ρ1 = 0.432), ω0 = 0.077, (ω1 = 1)21.585***Not allowed
Alternative− 9317.611891 ρ 0 = 0.443, ρ1 = 0.456, (ρ2 = 0.100), ω0 = 0.081, (ω1 = 1), ω2 = 11.3221 site p > 99%: 374L ; 2 sites p > 95%: 392A, 423M
 PigNull− 9328.072812 ρ 0 = 0.400, (ρ1 = 0.422), ω0 = 0.078, (ω1 = 1)10.202**Not allowed
Alternative− 9322.971449 ρ 0 = 0.468, ρ1 = 0.491, (ρ2 = 0.039), ω0 = 0.081, (ω1 = 1), ω2 = 13.4071 site p > 95%: 77T ; 1 site p > 99%: 413N
SPP1—parameter estimates and likelihood scores for branch-site models for species
 Microbat, megabat, horse, dog, panda, cat, pig, dolphin, cow, sheepNull− 3405.427792 ρ 0 = 0.521, (ρ1 = 0.158), ω0 = 0.180, (ω1 = 1)4.387*Not allowed
Alternative− 3403.234241 ρ 0 = 0.611, ρ1 = 0.142, (ρ2 = 0.245), ω0 = 0.226, (ω1 = 1), ω2 = 1.603Microbat: 3 sites p > 95%: 291H, 299PMegabat: 3 sites p > 95%: 108S, 254R, 262PHorse: 3 sites p > 95%: 107S, 280S, 288RDog: 3 sites p > 95%: 104L, 244H, 252SPanda: 3 sites p > 95%: 104G, 266H, 274SCat: 3 sites p > 95%: 104T, 266N, 274PPig: 3 sites p > 95%: 107A, 271H, 279SDolphin: 3 sites p > 95%: 87A, 256R, 264LCow: 3 sites p > 95%: 103S, 244L, 252HSheep: 3 sites p > 95%: 105S, 247L, 255H
 Dog, panda, cat, pig, dolphin, cow, sheepNull− 3408.017517 ρ 0 = 0.559, (ρ1 = 0.230), ω0 = 0.201, (ω1 = 1)6.525*Not allowed
Alternative− 3404.754608 ρ 0 = 0.602, ρ1 = 0.234, (ρ2 = 0.162), ω0 = 0.221, (ω1 = 1), ω2 = 2.002Dog: 1 site p > 95%: 104LPanda: 1 site p > 95%: 104GCat: 1 site p > 95%: 104TPig: 1 site p > 95%: 107ADolphin: 1 site p > 95%: 87ACow: 1 site p > 95%: 103SSheep: 1 site p > 95%: 105S
 Dog, panda, catNull− 3404.586199 ρ 0 = 0.443, (ρ1 = 0.204), ω0 = 0.197, (ω1 = 1)18.383***Not allowed
Alternative− 3395.394529 ρ 0 = 0.556, ρ1 = 0.245, (ρ2 = 0.197), ω0 = 0.207, (ω1 = 1), ω2 = 4.297Dog: 2 sites p > 99%: 104L, 240L; 2 sites p > 95%: 75S, 182SPanda: 2 sites p > 99%: 104G, 262I; 2 sites p > 95%: 75S, 182SCat: 2 sites p > 99%: 104T, 262T; 2 sites p > 95%: 75A, 182S
 Dog, pandaNull− 3410.889147 ρ 0 = 0.600, (ρ1 = 0.399), ω0 = 0.208, (ω1 = 1)10.162**Not allowed
Alternative− 3405.808026 ρ 0 = 0.586, ρ1 = 0.393, (ρ2 = 0.018), ω0 = 0.208, (ω1 = 1), ω2 = 33.914Dog: 1 site p > 95%: 104LPanda: 1 site p > 95%: 104G
 Pig, dolphin, cow, sheepNull− 3410.383104 ρ 0 = 0.584, (ρ1 = 0.368), ω0 = 0.204, (ω1 = 1)8.998**Not allowed
Alternative− 3405.883945 ρ 0 = 0.569, ρ1 = 0.372, (ρ2 = 0.057), ω0 = 0.206, (ω1 = 1), ω2 = 4.776Pig: 1 site p > 95%: 176QDolphin: 1 site p > 95%: 156YCow: 1 site p > 95%: 172SSheep: 1 site p > 95%: 175S
 MouseNull− 3408.601339 ρ 0 = 0.350, (ρ1 = 0.254), ω0 = 0.186, (ω1 = 1)7.352**Not allowed
Alternative− 3404.925140 ρ 0 = 0.565, ρ1 = 0.394, (ρ2 = 0.039), ω0 = 0.195, (ω1 = 1), ω2 = 30.7131 Site p > 95%: 35L

aLog-likelihood values

b ρ 0, ρ1, and ρ2 are the proportions of codons subject to purifying, neutral, and positive selection, respectively. ω0, ω1, and ω2 represented dN/dS for each class (purifying, neutral, and positive selection, respectively)

c*Significant at p < 0.05 threshold; **significant at p < 0.01 threshold; ***significant at p < 0.001 threshold

dBayes Empirical Bayes inference of amino acid sites under positive selection

Parameter estimates and likelihood scores for branch-site evolutionary models aLog-likelihood values b ρ 0, ρ1, and ρ2 are the proportions of codons subject to purifying, neutral, and positive selection, respectively. ω0, ω1, and ω2 represented dN/dS for each class (purifying, neutral, and positive selection, respectively) c*Significant at p < 0.05 threshold; **significant at p < 0.01 threshold; ***significant at p < 0.001 threshold dBayes Empirical Bayes inference of amino acid sites under positive selection

Discussion

Previous studies have identified numerous proteins in the OF from several mammalian species (Georgiou et al. 2005; Lamy et al. 2016; Mondejar et al. 2012; Smits et al. 2017; Soleilhavoup et al. 2016; Yu et al. 2016). Their expression changes with stages of the oestrous cycle (Lamy et al. 2016; Soleilhavoup et al. 2016), presence of gametes (Georgiou et al. 2005), or presence of embryos (Smits et al. 2017). Some of these proteins are involved in crucial steps for fertilization, such as binding between sperm and oviduct (Ignotz et al. 2007; Teijeiro et al. 2011), zona pellucida–sperm interaction (Algarra et al. 2016; Coy et al. 2012a; Goncalves et al. 2008a; Hao et al. 2006; Lachance et al. 2007; Mondejar et al. 2013; Zumoffen et al. 2013), sperm viability (Elliott et al. 2009; Lloyd et al. 2009; Moein-Vaziri et al. 2014), acrosome reaction (Chiu et al. 2010; Zhang et al. 2006), dispersion of cumulus cells (Griffiths et al. 2008), and sperm selection (Hanaue et al. 2011). The present work suggests that the high diversity of proteins present in the OF of mammals is associated with a species-specific evolutionary pattern of the corresponding genes by fairly frequent pseudogenization. Gene death is usually caused by introducing any changes that lead to a premature stop codon. Now, with the availability of complete animal genome sequences, it is possible to test hypotheses concerning gene death by searching for the trace of a pseudogene. Pseudogenization has been previously demonstrated for OVGP1 in the rat (Tian et al. 2009). Our study shows that OVGP1 is lost in megabat. OVGP1, also known as oviductin, has been shown to play a role in the fertilization process in goat (Pradeep et al. 2011), pig (McCauley et al. 2003), cow (Martus et al. 1998), buffalo (Choudhary et al. 2017), hamster (Saccary et al. 2013; Schmidt et al. 1997; Yang et al. 2015), and human (O’Day-Bowman et al. 1996; Zhao et al. 2016): it attaches to the zona pellucida and the spermatozoa, and improves in vitro fertilization and embryo development. Interestingly, targeted invalidation of Ovgp1 in the mouse is without any consequence in female reproduction (Araki et al. 2003), implying that at least in murine species, Ovgp1 is not essential for fertility. Gene loss by pseudogenization has already been reported for genes encoding seminal fluid proteins, such as for prostate-specific transglutaminase (TGM4) genes, and semenogelines (SEM1 and SEM2) in gorillas (Carnahan and Jensen-Seaman 2008), as well as TGM4 in pigs (Meslin et al. 2015). Our study shows that PAEP is lost in tarsier, rat, mouse, rabbit, megabat, and dolphin. PAEP, also named glycodelin-A, interacts with fucosyltransferase on human sperm plasma membrane thus inhibiting spermatozoa-zona pellucida binding (Chiu et al. 2007). Moreover, pre-treatment of spermatozoa with glycodelin-A enhances zona pellucida-induced calcium influx and zona pellucida-induced acrosome reaction (Chiu et al. 2010). Thus, PAEP is involved in sperm selection and preparation for fertilization. In tarsier, rat, mouse, rabbit, megabat, and dolphin, this role may not be essential, or may be assumed by other proteins. Two other proteins have been lost in tarsier: ANXA5 and DMBT1. ANXA5 is a candidate for the bovine oviductal epithelium sperm receptors, it binds with high affinity to heparin and related glycosaminoglycans and antibodies to this protein block sperm–oviduct binding (Talevi and Gualtieri 2010). However, since other annexins, such as ANXA1, ANXA2, and ANXA4 are also candidates for the oviductal epithelium sperm receptors, the loss of ANXA5 may be replaced by other annexins. DMBT1 has been shown to be involved in the mechanism of fertilization in equine and porcine species (Ambruosi et al. 2013): pre-incubation of oocytes with DMBT1 induces an increase in in vitro fertilization rate and an interaction between DMBT1 and spermatozoa has been shown using surface plasmon resonance studies. Moreover, DMBT1 has been proposed to be implicated in sperm selection in the oviduct through acrosome alteration and suppression of motility (Teijeiro and Marini 2012). As previously mentioned for ANXA5, in tarsier, other candidates for sperm binding and selection are probably involved. Finally, our study shows that PTGDS and PLG are lost in microbat and megabat, respectively. In bovine, prostaglandin D2 synthase (PTGDS) is involved in sperm binding to the ZP, in vitro fertilization and embryonic development, reaction with both oocytes and spermatozoa (Goncalves et al. 2008a, b). In swine, plasminogen (PLG) regulates sperm entry into the oocyte: sperm binding to oocytes triggers the releasing of plasminogen activators and the generated plasmin causes supernumerary spermatozoa detachment from the zona pellucida (Coy et al. 2012b). Since these proteins are lost in microbat and megabat, some other proteins are probably involved in this regulation. Overall, this study revealed that the evolution of mammals was accompanied by the progressive loss of genes coding for oviductal proteins involved in the process of fertilization. The significance of such loss remains to be further investigated. It may be related, at least for bats, to sperm storage in the oviduct. Prolonged sperm storage in the female genital tract over winter is a common feature of reproduction in some bats, so that they can mate in the autumn but postpone fertilization until the spring (Holt 2011). Thus, oviductal proteins involved in sperm attachment to the oviduct or sperm selection within the oviduct may have evolved in a different way in these species. In our study, gene duplication has been observed in several species: orangutan, rat, mouse, cow, sheep, pig, horse, cat, and dog. The duplication could be a preservation mechanism through which some daughter copies keep ancient functions, whereas others evolve toward new biological functions. OVGP1 is lost in rat and megabat. Cow, sheep, and pig have two copies of the gene. Porcine OVGP1 has been shown to have an effect on the number of sperm bound to the zona pellucida, on sperm penetration rate, and polyspermy rate (Kouba et al. 2000). Sperm adhesion molecule 1 (SPAM1) is duplicated in rat, mouse, cow, sheep, and horse. SPAM1 is a hyaluronidase which can bind to sperm during capacitation and increase cumulus dispersal efficiency and ability of sperm to penetrate the cumulus of oocytes (Griffiths et al. 2008; Martin-DeLeon 2006). Prothmann et al. (2012) found that the rate of evolution of the gene encoding SPAM1 could be correlated with relative testis weight in monkeys as well as the uni-male versus multi-male breeding system. This suggests that different levels of sperm competition might account for species-specific sequence evolution of SPAM1 in these species. The evolution of gene encoding the male zonadhesin gene also may be related to the uni-male breeding system in other primates (Herlyn and Zischler 2007, 2008). In the present work, we find a positive selection of this gene in all the studied species. However, it is too speculative here to make a hypothesis concerning breeding system because we did the work on a greater diversity of mammals, with wild and domesticated animals, and very different breeding systems. However, such a question would be very interesting to investigate with a smaller and especially more homogenous group of species. S100 calcium-binding protein A11 (S100A11) is duplicated in rat, pig, and cat. This protein plays a role in sperm selection through its action on cumulus cells (Hanaue et al. 2011). Annexin A2 (ANXA2) is duplicated in sheep and dog. ANXA2 has been shown to be present at the apical surface of the oviductal epithelial cells in sow and it is the main sperm binding isoform in pig (Teijeiro et al. 2009). Other annexins, such as ANXA1, ANXA4, and ANXA5 are other candidates for the oviductal epithelium sperm receptors (Talevi and Gualtieri 2010). Finally, HSPA8 is duplicated in orangutan, rat, sheep, and pig. HSPA8 is involved in maintenance of bovine, ovine, and porcine sperm survival in the oviduct (Elliott et al. 2009; Lloyd et al. 2009). Overall, all the proteins that are duplicated are involved in sperm selection and competence to reach the oocyte. In our study, we have observed branch-site positive selection in all the species that we have studied. SPAM1 showed positive selection in all the species analyzed: human, orangutan, tarsier, rat, mouse, rabbit, microbat, megabat, cow, sheep, dolphin, pig, horse, cat, dog, and panda. Moreover, SPAM1 has not been lost in the sampled species, but was duplicated in rat, mouse, cow, sheep, and horse. Since it is involved in the ability of sperm to penetrate the cumulus of oocytes (Griffiths et al. 2008; Martin-DeLeon 2006), this could highlight its species-specific role in gamete recognition. Lactotransferrin (LTF) also showed positive selection in all the species analyzed. Moreover, LTF has neither been lost nor duplicated in the sampled species. LTF has been shown to interact with spermatozoa and oocytes and to modulate gamete interaction (Zumoffen et al. 2013). This rapid evolution may contribute to the diversity of mating systems and may explain part of the loss in interspecific fecundity after speciation (Meslin et al. 2015). Osteopontin (SPP1) showed positive selection in many species: mouse, microbat, megabat, cow, sheep, dolphin, pig, horse, cat, dog, and panda. Moreover, SPP1 has neither been lost nor duplicated in the sampled species. SPP1 has been shown to improve bovine sperm capacitation (Monaco et al. 2009) and in vitro development of porcine embryos (Hao et al. 2008). Moreover, osteopontin plays an important role in the regulation of pig polyspermic fertilization; it decreases polyspermy and increases fertilization efficiency during IVF (Hao et al. 2006). Annexins showed positive selection in cow (ANXA4), sheep (ANXA4 and ANXA5), dolphin (ANXA1), and dog (ANXA2). Several annexins, such as ANXA1, ANXA2, ANXA4, and ANXA5 are involved in oviductal epithelium sperm receptors. Their rapid evolution may also contribute to the loss in interspecific fecundity. S100A11, also involved in the interaction between sperm and cumulus cells, showed positive selection only in human, orangutan and rat, and gene duplication in rat, pig and cat. Heat shock 70 kDa protein 5 (HSPA5) showed positive selection only in the rabbit. It has neither been lost nor duplicated in the sampled species, contrary to HSPA8, which was duplicated in orangutan, rat, sheep, and pig. Heat shock proteins A5 and A8 play a role in the modulation of boar sperm function in the oviductal reservoir (Yeste et al. 2014). Plasminogen (PLG) showed positive selection only in the pig. In porcine, PLG regulates sperm entry into the oocyte (Coy et al. 2012b). Its rapid evolution could highlight the species-specific gamete recognition. OVGP1 showed positive selection only in the dog. It is lost in rat and megabat, and duplicated in cow, sheep, and pig. It has been shown to have an effect on the number of sperm bound to the zona pellucida, on sperm penetration rate, and polyspermy rate (Kouba et al. 2000).

Conclusion

Overall, the present work highlights the evolution of genes encoding proteins involved in gamete transportation and maturation, sperm capacitation, and early embryo development. These genes are subjected to a particularly rapid evolution by duplication as well as species-specific positive selection, suggesting a diversification of their functions and of their role in the different mammalian species, with possible impact on the process of speciation, as proposed for the evolution of genes encoding proteins of seminal fluid of different species (Meslin et al. 2015). Moreover, several previous works have suggested that genes encoding proteins involved in sperm–egg interactions might undergo co-evolution and will also have correlated evolutionary rates due to compensatory changes on both the sperm and egg (Claw et al. 2014). In the present paper and one previous one (Meslin et al. 2015), one could hypothesize such a kind of co-evolution for proteins of seminal fluid and proteins of oviductal fluid.
  81 in total

1.  Evaluation of an improved branch-site likelihood method for detecting positive selection at the molecular level.

Authors:  Jianzhi Zhang; Rasmus Nielsen; Ziheng Yang
Journal:  Mol Biol Evol       Date:  2005-08-17       Impact factor: 16.240

Review 2.  Causes and consequences of the evolution of reproductive proteins.

Authors:  Leslie M Turner; Hopi E Hoekstra
Journal:  Int J Dev Biol       Date:  2008       Impact factor: 2.203

3.  The expression of atrial natriuretic peptide in the oviduct and its functions in pig spermatozoa.

Authors:  Meijia Zhang; Haiyan Hong; Bo Zhou; Shiying Jin; Chao Wang; Maoyong Fu; Songbo Wang; Guoliang Xia
Journal:  J Endocrinol       Date:  2006-06       Impact factor: 4.286

4.  Effects of the porcine oviduct-specific glycoprotein on fertilization, polyspermy, and embryonic development in vitro.

Authors:  A J Kouba; L R Abeydeera; I M Alvarez; B N Day; W C Buhi
Journal:  Biol Reprod       Date:  2000-07       Impact factor: 4.285

5.  Pseudogenes as a paradigm of neutral evolution.

Authors:  W H Li; T Gojobori; M Nei
Journal:  Nature       Date:  1981-07-16       Impact factor: 49.962

6.  Rapid evolution of fertilization selectivity and lysin cDNA sequences in teguline gastropods.

Authors:  M E Hellberg; V D Vacquier
Journal:  Mol Biol Evol       Date:  1999-06       Impact factor: 16.240

7.  Proteome of equine oviducal fluid: effects of ovulation and pregnancy.

Authors:  Katrien Smits; Hilde Nelis; Katleen Van Steendam; Jan Govaere; Kim Roels; Cyrillus Ververs; Bart Leemans; Eline Wydooghe; Dieter Deforce; Ann Van Soom
Journal:  Reprod Fertil Dev       Date:  2017-06       Impact factor: 2.311

8.  Identification of potential oviductal factors responsible for zona pellucida hardening and monospermy during fertilization in mammals.

Authors:  Irene Mondéjar; Irene Martínez-Martínez; Manuel Avilés; Pilar Coy
Journal:  Biol Reprod       Date:  2013-09-27       Impact factor: 4.285

9.  Oviductal fluid proteins associated with the bovine zona pellucida and the effect on in vitro sperm-egg binding, fertilization and embryo development.

Authors:  R F Gonçalves; A L Staros; G J Killian
Journal:  Reprod Domest Anim       Date:  2008-05-13       Impact factor: 2.005

10.  The C-terminal region of OVGP1 remodels the zona pellucida and modifies fertility parameters.

Authors:  B Algarra; L Han; C Soriano-Úbeda; M Avilés; P Coy; L Jovine; M Jiménez-Movilla
Journal:  Sci Rep       Date:  2016-09-07       Impact factor: 4.379

View more
  3 in total

1.  Rapid divergence of a gamete recognition gene promoted macroevolution of Eutheria.

Authors:  Emma K Roberts; Steve Tardif; Emily A Wright; Roy N Platt; Robert D Bradley; Daniel M Hardy
Journal:  Genome Biol       Date:  2022-07-11       Impact factor: 17.906

Review 2.  β-Lactoglobulin and Glycodelin: Two Sides of the Same Coin?

Authors:  Lindsay Sawyer
Journal:  Front Physiol       Date:  2021-05-20       Impact factor: 4.566

Review 3.  New Insights into the Mammalian Egg Zona Pellucida.

Authors:  Carla Moros-Nicolás; Pascale Chevret; María Jiménez-Movilla; Blanca Algarra; Paula Cots-Rodríguez; Leopoldo González-Brusi; Manuel Avilés; Mª José Izquierdo-Rico
Journal:  Int J Mol Sci       Date:  2021-03-23       Impact factor: 5.923

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.