| Literature DB >> 31953474 |
Mónica C Silva1, Marcus Chibucos2, James B Munro2, Sean Daugherty2, M Manuela Coelho3, Joana C Silva2,4.
Abstract
Olfactory receptors (ORs), encoded by the largest vertebrate multigene family, enable the detection of thousands of unique odorants in the environment and consequently play a critical role in species survival. Here, we advance our knowledge of OR gene evolution in procellariiform seabirds, an avian group which relies on the sense of olfaction for critical ecological functions. We built a cosmid library of Cory's Shearwater (Calonectris borealis) genomic DNA, a model species for the study of olfaction-based navigation, and sequence OR gene-positive cosmid clones with a combination of sequencing technologies. We identified 220 OR open reading frames, 20 of which are full length, intact OR genes, and found a large ratio of partial and pseudogenes to intact OR genes (2:1), suggestive of a dynamic mode of evolution. Phylogenetic analyses revealed that while a few genes cluster with those of other sauropsid species in a γ (gamma) clade that predates the divergence of different avian lineages, most genes belong to an avian-specific γ-c clade, within which sequences cluster by species, suggesting frequent duplication and/or gene conversion events. We identified evidence of positive selection on full length γ-c clade genes. These patterns are consistent with a key role of adaptation in the functional diversification of olfactory receptor genes in a bird lineage that relies extensively on olfaction.Entities:
Mesh:
Substances:
Year: 2020 PMID: 31953474 PMCID: PMC6969042 DOI: 10.1038/s41598-019-56950-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Classification and schematic distribution of olfactory receptor genes in major vertebrate lineages.
Functional annotation and location of the 220 Olfactory receptor genes found in the Cory’s Shearwater genome.
| Clade | Sub-family | Number of OR genes | Location | Integrity* |
|---|---|---|---|---|
| γ - c | 73 | Scaffold (9)/Degenerate (64) | 9 intact, 4 partial 5 pseudogene, 55 truncated | |
| γ - c | 58 | Scaffold (13)/Degenerate (45) | 6 intact, 5 partial 5 pseudogene, 42 truncated | |
| γ - c | 7 | Scaffold (2)/Degenerate (5) | 5 partial, 2 truncated | |
| γ - c | 47 | Scaffold (6)/Degenerate (41) | 2 intact, 4 partial 3 pseudogene, 38 truncated | |
| γ - c | 1 | Scaffold | 1 truncated | |
| γ | 1 | Scaffold | 1 intact | |
| γ | 2 | Scaffold | 1 intact, 1 partial | |
| γ | 1 | Degenerate | 1 truncated | |
| γ | 2 | Scaffold (1)/Degenerate (1) | 2 truncated | |
| γ | 1 | Scaffold | 1 intact | |
| γ | 2 | Degenerate | 2 truncated | |
| γ | 1 | Scaffold | 1 partial | |
| γ | 10 | Scaffold (1)/Degenerate (9) | 3 partial, 2 pseudogene 5 truncated | |
| ? | 14 | Degenerate | 12 truncated |
Genes that could not be reliable assigned to an OR sub-family with the functional annotation pipeline belong to an “unknown” family. *Genes were classified as Intact if they had start and stop codons, no premature stops or frameshift mutations; Partial if they had an incomplete coding region and were not at the edge of a contig; Pseudogene if they had the full coding region but had internal stop or frameshifts, Truncated if they were located at the edge of a contig.
Figure 2Sequence conservation in complete OR γ-c shearwater genes. The open reading frames of the aligned gene set (n = 17) was used to build a sequence logo. Putative location of transmembrane regions, (TM1-7), intra-cellular loops (IC1-3), extra-cellular loops (EC1-3), conserved motifs (□) and residues (✶) are shown.
Figure 3Maximum-likelihood phylogenetic analysis of sauropsid olfactory receptor genes. (a) Analysis includes the Cory’s Shearwater’s 220 OR genes aligned with 98 Northern fulmar genes from GenBank, 134 zebra finch genes, 214 chicken genes, and 112 green anole genes (from [8]). (b) Analysis using only the set of complete genes from the Cory’s Shearwater (n = 20), aligned with 44 full fulmar genes (from GenBank), 134 zebra finch genes, 214 chicken genes, and 112 green anole genes (from [8]). Both analyses are based on the JTT model as the best fitting substitution model. Outgroup sequences of the Adenosine receptor A2b are from GenBank. Nodal support values are shown for major clades.
Figure 4Secondary structure of a full-length γ-c OR gene from Cory’s shearwater. (a) Structure showing the α-helices of the trans-membrane domains in blue. The amino-acid sites estimated to be evolving under positive selection are plotted, showing two predicted by all methods to be under positive selection (in red) as well as those predicted by at least two methods to be under positive selection (in orange). (b). Detail of the secondary structure of the same OR receptor, showing the side chain of the two amino-acids with the strongest signal of positive selection, aa109 (TM3 domain) and aa274 (TM7 domain) (in red), orienting towards the center of the receptor, suggesting they might form part of the ligand binding pocket.
Positive selected sites detected by different likelihood approaches, as implemented in HyPhy.
| # of gene sequences | Integrative analysis | ||||
|---|---|---|---|---|---|
| 18 | 6, | 107, | 99, 103, | 6, 103, 107, |
Sites detected by all methods are underlined and shown in bold, those detected by two or more methods are shown by an asterisk. All methods were used on two alignment partitions due to breakpoint at codon site 264, as estimated by GARD.
†Significance level considered at P < 0.1 (Pond & Frost 2005b);
ΨBayes factor >50.