| Literature DB >> 21619679 |
Camille Meslin1, Fanny Brimau, Patricia Nagnan-Le Meillour, Isabelle Callebaut, Géraldine Pascal, Philippe Monget.
Abstract
BACKGROUND: SAL1 (salivary lipocalin) is a member of the OBP (Odorant Binding Protein) family and is involved in chemical sexual communication in pig. SAL1 and its relatives may be involved in pheromone and olfactory receptor binding and in pre-mating behaviour. The evolutionary history and the selective pressures acting on SAL1 and its orthologous genes have not yet been exhaustively described. The aim of the present work was to study the evolution of these genes, to elucidate the role of selective pressures in their evolution and the consequences for their functions.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21619679 PMCID: PMC3128046 DOI: 10.1186/1471-2148-11-148
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Lineage specific expansion of the SAL1 family. Genomic localization of SAL1 and its orthologous genes in 13 species: pig (Sus scrofa), cow (Bos taurus), horse (Equus caballus), dog (Canis familiaris), guinea pig (Cavia porcellus), rat (Rattus norvegicus), mouse (Mus musculus), rabbit (Oryctolagus cuniculus), macaque (Macaca mulatta), chimpanzee (Pan troglodytes), gorilla (Gorilla gorilla), marmoset (Callithrix jacchus) and elephant (Loxodonta Africana). Genes that belong to SAL1 family are in black. Pseudogenes are in gray. Orthologous flanking genes SNX30, SLC46A2, ZFP37, SLC31A2 and FKBP15 are in pink, red, green, blue and purple, respectively. Numbers of chromosomes and localization of genes on chromosomes are indicated near each species.
Figure 2Loss of SAL1 orthologous gene in human and Neandertal genomes. The underlined A represents the G-to-A substitution in the donor site of the second intron in human and Neandertal genomic sequences, resulting in a shift in the ORF and in the pseudogenization of the gene in the two species.
Figure 3Phylogenetic tree of SAL1 family. The phylogenetic tree was reconstructed using maximum likelihood (ML) and rooted by midpoint rooting. Bootstrap values are given for main branches, in bold when nodes are strongly supported (>80%). The 50 mammalian SAL1 amino-acid sequences related to the 50 identified functional genes were used to reconstruct the phylogenetic tree. After removal of gaps, the dataset comprised 119 sites. Blue circles represent speciation and red squares duplication. Branches on the tree that were tested for positive selection on species clades are indicated by black arrows.
Interlocus gene conversion events
| Species | Converted tract length (bp) | |||
|---|---|---|---|---|
| min | max | |||
| Horse | 5 | 3 | 57 | 74 |
| Rat | 10 | 4 | 161 | 273 |
| Mouse | 21 | 20 | 107 | 529 |
| Guinea pig | 5 | 5 | 11 | 135 |
(1) All sequences are indicated in the table S1 of the additional file 1.
(2) Sequences involved in gene conversion events are indicated in bold in the table S1 of the additional file 1.
Parameter estimates and likelihood scores for branch-site models for paralogs
| Genes | Model | Positively selected sites (BEB) | |||
|---|---|---|---|---|---|
| Cow LOC783399 | Null | -5660.941163 | ρ0 = 0.32, (ρ1 = 0.68), ω0 = 0.24, (ω1 = 1) | 27.10 *** | Not allowed |
| Alternative | -5647.392083 | ρ0 = 0.30, ρ1 = 0.66, (ρ2 = 0.05), ω0 = 0.23, (ω1 = 1), ω2 = ∞ | 1 site p > 99%: 36Y | ||
| Guinea pig ENSCPOG00000023399 | Null | -5660.050892 | ρ0 = 0.25, (ρ1 = 0.50), ω0 = 0.24, (ω1 = 1) | 56.40 *** | Not allowed |
| Alternative | -5631.851037 | ρ0 = 0.27, ρ1 = 0.55, (ρ2 = 0.18), ω0 = 0.23, (ω1 = 1), ω2 = 336.70 | 7 sites p > 99%: 24R, 27A, 113L, 118Q, 123T, 125T, 128T, 8 sites p > 95%: 31L, 25E, 26T, 85V, 114T, 117T, 122V, 126L | ||
| Horse LOC100053653 | Null | -5665.171131 | ρ0 = 0.22, (ρ1 = 0.47), ω0 = 0.23, (ω1 = 1) | 15.86 *** | Not allowed |
| Alternative | -5657.239232 | ρ0 = 0.09, ρ1 = 0.19, (ρ2 = 0.72), ω0 = 0.23, (ω1 = 1), ω2 = 6.44 | 6 sites p > 95%: 81E, 97A, 123Q, 135K, 166K, 168F | ||
| Mouse Mup19 | Null | -5660.941163 | ρ0 = 0.32, (ρ1 = 0.67), ω0 = 0.24, (ω1 = 1) | 53.87 *** | Not allowed |
| Alternative | -5634.004112 | ρ0 = 0.30, ρ1 = 0.66, (ρ2 = 0.04), ω0 = 0.23, (ω1 = 1), ω2 = ∞ | 2 sites p > 99%: 174K, 177F | ||
| Rat LOC298116 | Null | -5660.941163 | ρ0 = 0.32, (ρ1 = 0.67), ω0 = 0.24, (ω1 = 1) | 8.69 *** | Not allowed |
| Alternative | -5656.597218 | ρ0 = 0.32, ρ1 = 0.66, (ρ2 = 0.02), ω0 = 0.24, (ω1 = 1), ω2 = ∞ | 1 site p > 99%: 135A | ||
(1) Log- likelihood values.
(2) ρ0, ρ1, and ρ2 are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively. ω0, ω1 and ω2 represented dN/dS for each class (purifying, neutral and positive selection, respectively).
(3) *** significant at p < 0.001.
Parameter estimates and likelihood scores for site models
| Model | ||||
|---|---|---|---|---|
| M0 | -5729.43479 | ω = 0.92 | Not allowed | |
| M1a | -5660.94116 | ρ0 = 0.32 | Not allowed | |
| M2a | -5643.14631 | ρ0 = 0.24, ρ1 = 0.53, ρs = 0.23, ωs = 2.08 | 35.59 *** (M2a vs M1a) | 1 site p > 99%: |
| M7 | -5661.09613 | p = 0.64, q = 0.24 | Not allowed | |
| M8a | -5657.82124 | ρ0 = 0.41, ρ1 = 0.59, p = 1.69, q = 3.62 | Not allowed | |
| M8 | -5641.30866 | ρ0 = 0.71, ρs = 0.29, p = 0.83, q = 0.44, ωs = 1.84 | 33.02 *** (M8 vs M8a) | 9 sites p > 99%: |
| M8a | 18637.95571 | p = 1.10, q = 1.73 | Not allowed | |
| MEC | 18220.66368 | p = 0.80, q = 2.56 | 5 sites p > 95%: | |
(1) Log- likelihood values.
(2) ωS: average dN/dS ratio for sites subject to positive selection (models M2a and M8), p and q: shape parameters for the beta distribution of ω (models M7, M8 and MEC). ρ0, ρ1, and ρS are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively.
(3) *** significant at p < 0.001.
(4) Bold: P > 95% for the three comparisons (M2a vs. M1a, M8 vs. M8a, MEC vs. M8a).
Italic: P > 95% for the two comparisons (M2a vs. M1a and M8 vs. M8a).
Underlined: amino acids involved in androstenol and androstenone binding.
Site numbers and amino acids refer to the pig SAL1 reference sequence PDB: 1GM6.
Figure 4Positive selection acting on orthologs. Map of the amino acids involved in the binding of ligand [13] on the SAL1 3D structure are in blue, and positively selected sites in pink (PDB: 1GM6). Amino acids shown in a van der Waals representation (orange) were both involved in ligand binding and were positively selected. Positively selected amino acids were identified by PAML computations using site models.
Figure 5Positively selected sites and solvent accessibility. Positively charged, negatively charged, polar and non-polar residues are in blue, red, green and gray respectively, the same as in the ASAView. Bold: amino acids with P > 95% for the three pairs of comparison (M2a vs. M1a, M8 vs. M8a, MEC vs. M8a). Italics: amino acids with P > 95% for the two pairs of comparison (M2a vs. M1a and M8 vs. M8a). Underlined: amino acids involved in androstenol and androstenone binding.
Parameter estimates and likelihood scores for branch- site models for 5 species
| Species | Model | Positively selected sites (BEB) | |||
|---|---|---|---|---|---|
| Marmoset | Null | -5660.37202 | ρ0 = 0.18, (ρ1 = 0.39), ω0 = 0.23, (ω1 = 1) | 18.47 *** | Not allowed |
| Alternative | -5651.1375 | ρ0 = 0.30, ρ1 = 0.63, (ρ2 = 0.07), ω0 = 0.23, (ω1 = 1), ω2 = 60.38 | 2 sites p > 95%: 159R, 161F | ||
| Dog | Null | -5660.55998 | ρ0 = 0.25, (ρ1 = 0.52), ω0 = 0.24, (ω1 = 1) | 6.78 ** | Not allowed |
| Alternative | -5657.16845 | ρ0 = 0.28, ρ1 = 0.58, (ρ2 = s0.14), ω0 = 0.23, (ω1 = 1), ω2 = 11.15 | 2 sites p > 95%: | ||
| Guinea pig | Null | -5660.94116 | ρ0 = 0.32, (ρ1 = 0.67), ω0 = 0.24, (ω1 = 1) | 13.68 *** | Not allowed |
| Alternative | -5654.10045 | ρ0 = 0.27, ρ1 = 0.51, (ρ2 = 0.22), ω0 = 0.25, (ω1 = 1), ω2 = 3.94 | 3 sites p > 95%: 11S, | ||
| Horse | Null | -5660.94121 | ρ0 = 0.32, (ρ1 = 0.67), ω0 = 0.24, (ω1 = 1 ) | 27.22 *** | Not allowed |
| Alternative | -5647.32999 | ρ0 = 0.30, ρ1 = 0.61, (ρ2 = 0.09), ω0 = 0.24, (ω1 = 1), ω2 = 7.57 | 2 sites p > 99%: | ||
| Mouse | Null | -5658.60187 | ρ0 = 0.24, (ρ1 = 0.55), ω0 = 0.19, (ω1 = 1 ) | 10.75 ** | Not allowed |
| Alternative | -5653.22783 | ρ0 = 0.25, ρ1 = 0.58, (ρ2 = 0.17), ω0 = 0.19, (ω1 = 1), ω2 = 3.25 | 1 site p > 95%: 162Q | ||
(1) Log- likelihood values.
(2) ρ0, ρ1, and ρ2 are the proportions of codons subject to purifying selection, neutral evolution, and positive selection, respectively. ω0, ω1 and ω2 represented dN/dS for each class (purifying, neutral and positive selection, respectively).
(3) ** significant at p < 0.01
*** significant at p < 0.001.
(4) Underlined: amino acids involved in androstenol and androstenone binding.
Site numbers and amino acids refer to the pig SAL1 reference sequence PDB: 1GM6, except for the amino acid in bold, which is not part of the structure but comes from the dog genome sequence XP_855342.1.
Figure 6Positive selection acting on species clades. Positively selected sites identified on species clades are shown in a van der Waals representation on the SAL1 3D structure. Positively selected sites that matched amino acids involved in ligand binding are in yellow, the others in pink. Amino acids involved in ligand binding are in blue, the same as in Figure 4. Positively selected amino acids were identified by PAML computations using branch site models. A: in marmoset, B: in guinea pig, C: in horse, D: in mouse and E: in dog.