| Literature DB >> 26963722 |
Seong-Il Eyun1,2, Hideaki Moriyama1, Federico G Hoffmann3, Etsuko N Moriyama1,4.
Abstract
Trace amine-associated receptors (TAARs) are a member of the G-protein-coupled receptor superfamily and are known to be expressed in olfactory sensory neurons. A limited number of molecular evolutionary studies have been done for TAARs so far. To elucidate how lineage-specific evolution contributed to their functional divergence, we examined 30 metazoan genomes. In total, 493 TAAR gene candidates (including 84 pseudogenes) were identified from 26 vertebrate genomes. TAARs were not identified from non-vertebrate genomes. An ancestral-type TAAR-like gene appeared to have emerged in lamprey. We found four therian-specific TAAR subfamilies (one eutherian-specific and three metatherian-specific) in addition to previously known nine subfamilies. Many species-specific TAAR gene duplications and losses contributed to a large variation of TAAR gene numbers among mammals, ranging from 0 in dolphin to 26 in flying fox. TAARs are classified into two groups based on binding preferences for primary or tertiary amines as well as their sequence similarities. Primary amine-detecting TAARs (TAAR1-4) have emerged earlier, generally have single-copy orthologs (very few duplication or loss), and have evolved under strong functional constraints. In contrast, tertiary amine-detecting TAARs (TAAR5-9) have emerged more recently and the majority of them experienced higher rates of gene duplications. Protein members that belong to the tertiary amine-detecting TAAR group also showed the patterns of positive selection especially in the area surrounding the ligand-binding pocket, which could have affected ligand-binding activities and specificities. Expansions of the tertiary amine-detecting TAAR gene family may have played important roles in terrestrial adaptations of therian mammals. Molecular evolution of the TAAR gene family appears to be governed by a complex, species-specific, interplay between environmental and evolutionary factors.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26963722 PMCID: PMC4786312 DOI: 10.1371/journal.pone.0151023
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The number of TAAR genes identified in the 30 animal genomes.
| Group/Species name | Common name | Totalnumber | Number of TAAR subfamily genes | |||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| T1 | T2 | T3 | T4 | T5 | T6 | T7 | T8 | T9 | TE1 | TM1 | TM2 | TM3 | TFI | TFII | TFIII | TL | TV | |||
| | human | 6 (3) | 1 | 1 | 0 (1) | 0 (1) | 1 | 1 | 0 (1) | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | house mouse | 15 (1) | 1 | 1 | 1 | 1 | 1 | 1 | 5 (1) | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Norway rat | 17 (2) | 1 | 1 | 1 | 1 | 1 | 1 | 7 (2) | 3 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | cow | 21 (8) | 1 | 1 | 1 | 1 | 1 | 5 (2) | 7 (4) | 3 (2) | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | bottlenosed dolphin | 0 (3) | 0 (2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | horse | 11 (4) | 1 | 1 | 1 | 1 | 2 | 1 | 1 (1) | 2 (1) | 1 (1) | 0 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | dog | 2 (2) | 0 (1) | 0 (1) | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Malayan flying fox | 26 (10) | 1 | 1 | 0 (1) | 1 | 1 | 4 (6) | 16 | 1 (3) | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | little brown bat | 6 (1) | 1 | 1 | 1 | 1 | 1 | 0 (1) | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | common shrew | 9 [ | 1 | [ | 1 | 1 (1) | 1 | 3 | 2 | 0 | 0 (1) | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | hedgehog | 6 [ | [ | 1 | 1 | 0 (2) | 0 (1) | [ | 2 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | lesser hedgehog tenrec | 9 [ | 1 (1) | 1 | 1 | 1 | 0 | 0 (1) | 2 (1) | 1 | [ | 2 (2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | African elephant | 9 [ | [ | 1 | [ | 1 [ | 1 | 1 | 2 | 2 (3) | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | nine-banded armadillo | 5 (4) | 1 | 1 | 1 | 0 | 1 | 0 | 1 (1) | 0 (2) | 0 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | tammar wallaby | 18 [ | (1) | [ | 1 | 1 | 1 | 0 | 0 | 0 | 1 | 0 | 1 (1) | 9 (1) | 4 | 0 | 0 | 0 | 0 | 0 |
| | opossum | 22 (4) | 1 | 1 | 1 | 3 (1) | 1 | 0 | 0 | 0 | 7 (1) | 0 | 1 | 2 (1) | 5 (1) | 0 | 0 | 0 | 0 | 0 |
| | platypus | 4 (1) | 1 | 1 | 1 | 1 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | chicken | 4 (1) | 1 | 2 | 0 | 0 | 1 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | zebra finch | 1 (0) | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | Carolina anole | 3 (0) | 1 | 1 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | pipid frog | 7 (0) | 1 | 0 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
| | fugu (Japanese pufferfish) | 18 (1) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 (1) | 0 | 3 | 0 | 1 |
| | spotted green pufferfish | 34 (3) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 12 (3) | 0 | 21 | 0 | 1 |
| | zebrafish | 110 (10) | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 92 (8) | 11 | 6 (2) | 0 | 1 |
| | elephant shark | 2 (3) | 1 (1) | 0 | 0 | 1 (2) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| | sea lamprey | 25 (3) | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 25 (3) | 0 |
| | amphioxus | 0 | ||||||||||||||||||
| | vase tunicate | 0 | ||||||||||||||||||
| | tunicate | 0 | ||||||||||||||||||
| | sea anemone | 0 | ||||||||||||||||||
aTAAR gene candidates are divided into three categories: intact, incomplete, and pseudogenes. The first number shown is that of "intact" genes, which contain full-length open reading frames with seven complete transmembrane regions. The number of "incomplete" genes due to incomplete genome sequences (e.g., long ambiguous sequences such as a run of N’s or contig ends) or incompletely identified exons (e.g., TAAR2, see below) is given in square brackets. The number in parentheses is that of possible pseudogenes, which contain premature stop codons or frame-shifting insertions or deletions.
bT1-T9, TE1, TM1-TM3, TFI-III, TL, and TV indicate TAAR1-9, TAAR E1, TAAR M1-M3, fish-specific TAAR I-III, lamprey TAAR-like genes, and TAAR V, respectively. The group names of fish-specific TAAR I-III and TAAR V are given by Hashiguchi and Nishida [26].
cOnly the exon2 sequences (coding 304 to 331 amino acids) were identified from these TAAR2 genes. The exon1 (coding 8 to 20 amino acids) can be located more than 6000 bp upstream.
dThe sequences are from Hashiguchi and Nishida [26]. We classified them into five subfamilies.
eThese three shark sequences (S2a, S2bP, and S2cP) are most similar to TAAR4. However, as we described, these shark TAARs may have diverged from the ancestral TAARs before the divergence of TAAR2-4 (see also phylogenies in Figs 2 and 3, and S1 Fig).
Fig 2The maximum-likelihood phylogeny of TAAR proteins from ten representative vertebrate species.
Only representative TAAR proteins are included for each species. Four biogenic amine receptors (5HT4R: serotonin receptors, and H2R: histamine receptors) are used as the outgroup. The genes newly identified in this study are shown in italics. The numbers at internal branches show the bootstrap support values (%) for the maximum-likelihood and neighbor-joining phylogenies and the posterior probability (%) for the Bayesian phylogeny in this order. Supporting values are shown only for the internal branches that have at least one method supporting higher than 70%. For TAAR V, teleost TAARs, and lamprey TAAR-like, we followed the gene names given by Hashiguchi and Nishida [26]. The inset illustrates a current consensus of the vertebrate phylogeny with their approximate divergence times (MYA) [33, 34].
Fig 3The maximum-likelihood phylogeny of TAAR proteins from 24 gnathostome genomes.
All functional proteins in tetrapods, nine representative teleost proteins, and two elephant shark TAARs are included in the analysis. TAAR V as well as the lamprey TAAR-like sequences are used as the outgroup. The genes newly identified in this study are shown in italics. The numbers at internal branches show the bootstrap support values (%) for the maximum-likelihood and neighbor-joining phylogenies and the posterior probability (%) for the Bayesian phylogeny in this order. Supporting values are shown only for the major internal branches that have at least one method supporting higher than 70%. Blue-colored branches indicate the species-specific gene duplications within a cluster supported by higher than 80% of bootstrap values or posterior probability for all methods. Red-colored branches and arrows indicate those identified to be under positive selection by the branch-site models of PAML analysis (see S4 Table). Brown-colored branches indicate nine representative teleost TAARs, elephant shark TAARs, and lamprey TAAR-like proteins. The inset illustrates the evolution of vertebrate TAARs with approximate timing of various gain (green) and loss (blue) events. The vertebrate phylogeny is based on Blair and Hedges [33].
Fig 1Syntenic relationship of the TAAR genes in nine vertebrate genomes.
Only genomes in which all TAAR genes are located in one chromosome or no more than two scaffolds were examined (Xenopus genes are found in two scaffolds). TAAR and adjacent non-TAAR genes are depicted by the closed and open boxes, respectively (pseudogenes are not included). TAAR genes are shown in different colors based on their taxonomic distributions as follows: TAAR1 found in jawed vertebrates in red, amniote-specific TAAR2 and TAAR5 in purple, mammalian-specific TAAR3 in orange, tetrapod-specific TAAR4 in dark blue, eutherian-specific TAAR6, 7, and 8 in light green, therian-specific TAAR9 in cyan, and metatherian-specific TAARM1-M3 in dark green. Note that the same color scheme is used in Figs 2 and 3. When tandemly duplicated functional copies exist for a TAAR gene, the copy number is also shown. Gene locations are not in scale (see S2 Table for the actual positional information). Black arrows indicate transcriptional directions. A current consensus of the tetrapod phylogeny with their approximate divergence times (million years ago; MYA) is illustrated at the top [33, 34]. The chromosome or scaffold numbers are shown below the genus names.
Fig 4The number of TAAR genes within each TAAR subfamily for each therian species.
The size of bubbles denotes the number of species where the corresponding TAAR genes are found. The average ω (dN/dS) calculated by the PAML M0 model for each TAAR subfamily is also plotted (open squares).
Fig 5The 3D-structural model of the elephant TAAR7a protein (cyan) superimposed with the turkey β1-adrenergic receptor (β1AR, gray).
The ligand of the β1AR, dobutamine, is shown with the stick model. Positively selected sites are indicated by red (detected by the site model in TAAR7), green (detected by the branch-site model in flying fox TAAR7c and elephant TAAR7a), purple (detected by the site model in TAAR8), and brown (detected by the branch-site model in mouse TAAR8a). The transmembranes (TM) and internal/external loop (IC1-3 and EC1-3) regions as well as N-terminal (N) are labeled. The C-terminal is invisible locating behind TM1. See S6 Fig for more details.