Literature DB >> 34151983

Expansion and Accelerated Evolution of 9-Exon Odorant Receptors in Polistes Paper Wasps.

Andrew W Legan1, Christopher M Jernigan1, Sara E Miller1, Matthieu F Fuchs1, Michael J Sheehan1.   

Abstract

Independent origins of sociality in bees and ants are associated with independent expansions of particular odorant receptor (OR) gene subfamilies. In ants, one clade within the OR gene family, the 9-exon subfamily, has dramatically expanded. These receptors detect cuticular hydrocarbons (CHCs), key social signaling molecules in insects. It is unclear to what extent 9-exon OR subfamily expansion is associated with the independent evolution of sociality across Hymenoptera, warranting studies of taxa with independently derived social behavior. Here, we describe OR gene family evolution in the northern paper wasp, Polistes fuscatus, and compare it to four additional paper wasp species spanning ∼40 million years of evolutionary divergence. We find 200 putatively functional OR genes in P. fuscatus, matching predictions from neuroanatomy, and more than half of these are in the 9-exon subfamily. Most OR gene expansions are tandemly arrayed at orthologous loci in Polistes genomes, and microsynteny analysis shows species-specific gain and loss of 9-exon ORs within tandem arrays. There is evidence of episodic positive diversifying selection shaping ORs in expanded subfamilies. Values of omega (dN/dS) are higher among 9-exon ORs compared to other OR subfamilies. Within the Polistes OR gene tree, branches in the 9-exon OR clade experience relaxed negative (relaxed purifying) selection relative to other branches in the tree. Patterns of OR evolution within Polistes are consistent with 9-exon OR function in CHC perception by combinatorial coding, with both natural selection and neutral drift contributing to interspecies differences in gene copy number and sequence.
© The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  antennal lobe glomeruli; birth-and-death process; comparative genomics; olfaction; social insect; tandem array

Mesh:

Substances:

Year:  2021        PMID: 34151983      PMCID: PMC8383895          DOI: 10.1093/molbev/msab023

Source DB:  PubMed          Journal:  Mol Biol Evol        ISSN: 0737-4038            Impact factor:   16.240


Introduction

Odorant/olfactory receptors (ORs) are among the largest gene families in animal genomes, and variation in the OR repertoire is hypothesized to reflect aspects of species chemosensory ecology. From the standpoint of molecular evolution, the ORs of insects and mammals have been widely studied as a model to understand the dynamics of gene family evolution (Young et al. 2002; Robertson et al. 2003; Nozawa and Nei 2007; Eirín-López et al. 2012; Nei 2013; Benton 2015; McKenzie and Kronauer 2018). Yet fundamental features of odorant receptor (OR) evolution remain unclear—why do some groups show predominantly conserved OR repertoires across species while others show rapid turnover in gene content or accelerated rates of evolution? Moreover, the relative importance of social interactions, sexual selection, and ecology in shaping patterns of OR evolution within and between clades is poorly understood. Comparative studies of distantly related species have provided insights into the evolutionary processes shaping the insect and mammal OR gene families at broad phylogenetic scales, where there is relatively little 1:1 orthology of receptors among species (Tsutsui 2013; Roux et al. 2014; Freeman et al. 2020; Yan et al. 2020). At the same time, studies within species and between closely related species can reveal the dynamics of receptor evolution at finer timescales and elucidate the process of gene family turnover, as evidenced by studies of insects and mammals (Guo and Kim 2007; McBride et al. 2014; Brand et al. 2015; Karpe et al. 2016; Brand and Ramírez 2017; Cohanim et al. 2018; Miller CH et al. 2020). Recent efforts to sequence a growing number of social insect genomes have suggested that social evolution is associated with expansions within the OR gene family, and the 9-exon OR subfamily in particular has experienced increased gene turnover and sequence evolution relative to other OR subfamilies (Zhou et al. 2012, 2015; LeBoeuf et al. 2013; Engsontia et al. 2015; Kapheim et al. 2015; Karpe et al. 2016, 2017; McKenzie et al. 2016; Saad et al. 2018). Given the importance of olfaction for social insect behavioral ecology, ORs provide a key route to linking genes to diverse and complex behaviors among ants, bees, and wasps. However, there are two major gaps in our knowledge of OR evolution in social insects. First, the hypothesis that social evolution is associated with OR expansions has yet to be tested in all of the independent origins of sociality among Hymenoptera. The independent origins of sociality in wasps provide an opportunity to compare patterns of OR gene family evolution to those that have been observed within social bees and ants (Hines et al. 2007). Second, the fine-scale dynamics of OR gene family turnover between social insect species remain understudied, since most studies have focused on comparisons between genera or families. A better understanding of the short-term mechanisms of OR evolution provides additional insights into the molecular evolutionary dynamics shaping receptor diversity across more distantly related taxa. The recent release of five genomes of Polistes paper wasps spanning ∼1–40 million years of divergence provides an opportunity to fill these gaps in our knowledge. Organisms use chemoreceptor proteins to detect stimuli and provide input to neural circuits that regulate decision-making (Su et al. 2009; Yapici et al. 2014). The insect OR gene family evolved in the ancestor of all insects and constitutes the largest among the insect chemoreceptor gene families, which also include gustatory receptors and ionotropic receptors (Hansson and Stensmyr 2011; Suh et al. 2014; Brand et al. 2018; Fleischer et al. 2018; Vizueta et al. 2020). OR genes code for subunits of heterotetrameric ligand-gated ion channels embedded in the membranes of olfactory receptor neurons (ORNs) (Sato et al. 2008; Butterwick et al. 2018). The odorant receptor coreceptor (Orco) gene codes for a component of all OR complexes and is ubiquitously expressed in olfactory tissue and highly conserved across insects (Fleischer et al. 2018). In addition to expressing Orco, each ORN generally expresses only one OR gene, and the specificity with which an OR binds chemical compounds or families of compounds (ligands) determines the response spectrum of the ORN in which it is expressed (Hallem et al. 2004). At the molecular level, the ORN responses are dependent upon a complex interaction of OR, ligand concentration, and odorant-binding proteins (Vogt et al. 1991; Hallem et al. 2004; Hallem and Carlson 2006; Stensmyr et al. 2012; Mathew et al. 2013; Dweck et al. 2015; Ebrahim et al. 2015; Münch and Galizia 2016). In flies, ORNs expressing a particular OR project to a common glomerulus in the antennal lobe (AL), which generally appears to be true across insects (Couto et al. 2005; Su et al. 2009; Galizia and Rössler 2010). The molecular evolution of the OR gene family is best described as a birth-and-death process, in which genes are duplicated and deleted over evolutionary time (Nei 2007; Nozawa and Nei 2007; Eirín-López et al. 2012). Both random drift and natural selection are present during this process, determining the extent of OR gene copy number variation and the rate of gene sequence evolution (Nei 2007; Nozawa and Nei 2007). First identified in the common fruit fly Drosophila melanogaster and the African malaria mosquito Anopheles gambiae, much of our understanding of the relationship between insect OR function and evolution comes from studies of Diptera, which show that the OR gene family can be conserved among species in a genus, with exceptions arising when chemosensory landscapes differ between species (Vosshall et al. 1999; Fox et al. 2001; Robertson et al. 2003). There is prevalent negative (purifying) selection conserving ORs across Drosophila species, and the majority of D. melanogaster ORs form simple orthologous relationships across the genus (Clark et al. 2007; Guo and Kim 2007; McBride and Arguello 2007; Nozawa and Nei 2007; Sánchez-Gracia et al. 2009; Mansourian and Stensmyr 2015). However, there is evidence of gene loss and accelerated evolution of some ORs during the evolution of host specialization and herbivory in Drosophilids (McBride 2007; McBride and Arguello 2007; Goldman-Huertas et al. 2015). Between genera, the OR repertoire is more variable. The Aedes aegypti OR repertoire includes about 131 genes organized in clades that are largely divergent from Anopheles gambiae's 79 OR genes (Fox et al. 2001; Hill et al. 2002; Bohbot et al. 2007). Between Dipteran families, there is considerable variation in OR sequences and copy number, indicating that OR evolution is more dynamic at this phylogenetic scale (Fox et al. 2001; Bohbot et al. 2007; Carey et al. 2010). The molecular evolution of the OR gene family is dynamic among Hymenoptera genera, with prevalent lineage-specific gene expansions and losses, especially in the 9-exon OR subfamily (Engsontia et al. 2015; Zhou et al. 2015; McKenzie and Kronauer 2018). The 9-exon ORs constitute about one-third of all ant ORs, and have evolved rapidly in ants, leading researchers to propose that 9-exon ORs facilitate recognition of cuticular hydrocarbons (CHCs) (Smith CR et al. 2011; Smith CD et al. 2011; Zhou et al. 2012, 2015; Engsontia et al. 2015; McKenzie et al. 2016). CHCs are used by insects to waterproof the cuticle and to communicate with conspecifics (Blomquist and Bagnères 2010). While less pronounced than in ants, dynamic evolution is also characteristic of 9-exon OR evolution in social bees, which rely on CHCs in communication (Sadd et al. 2015; Karpe et al. 2016, 2017). Functional studies in which ORs were transfected into an empty D. melanogaster ORN have verified that at least some 9-exon ORs of the ant Harpegnathos saltator overlap in their responses to ligands, with multiple 9-exon ORs responding to the same CHC molecule and unique 9-exon ORs responding to multiple different CHC molecules (Pask et al. 2017; Slone et al. 2017). This is characteristic of combinatorial coding: the process of combining input from multiple ORs that bind overlapping sets of ligands in order to discriminate a larger variety of odors (Malnic et al. 1999; Touhara and Vosshall 2009). Functional ORs are necessary for normal nesting behavior and for nestmate recognition in ants, a process which involves detecting variation in the CHCs on the cuticles of conspecifics (Lavine et al. 1990; van Zweden and d’Ettorre 2010; Sturgis and Gordon 2012; Trible et al. 2017; Yan et al. 2017; Ferguson et al. 2020). Together these studies suggest that 9-exon ORs function in combinatorial coding of CHC perception in ants and potentially in general across Hymenoptera. Like other Hymenopterans, vespid wasps, including the genus Polistes, use CHCs in complex social behaviors (Gamboa et al. 1986, 1996; Dani and Turillazzi 2018). Polistes use chemicals as signals and cues during mate attraction, mate compatibility recognition, queen recognition, dominance/fertility signaling, and nestmate recognition (Post and Jeanne 1984; Reed and Landolt 1990; Espelie et al. 1994; Sledge et al. 2001a, 2001b, 2004; Dapporto et al. 2007; Jandt et al. 2014; Oi et al. 2019). The molecular mechanistic basis of chemical signal perception in Polistes has not been explored, but the importance of olfaction in mediating social behaviors is expected to favor increased copy number of OR genes encoding chemical signal receptors. Neuroanatomical analyses of the antennal lobe of social wasps suggest they possess expanded OR repertoires. In the clonal raider ant, Ooceraea biroi, the T6 cluster of the antennal lobe receives inputs from OR neurons expressing 9-exon subfamily ORs in sensilla basiconica (McKenzie et al. 2016). The large TB cluster in the antennal lobe of Vespid wasps is homologous to the T6 cluster in ants, suggesting the 9-exon subfamily of ORs has also expanded in wasps (Masson and Strambi 1977; Couto et al. 2016, 2017). Recent efforts to sequence Polistes genomes provide an opportunity to resolve patterns of OR evolution among closely related species as an independent test of 9-exon OR gene subfamily expansion during social evolution (Patalano et al. 2015; Standage et al. 2016; Miller et al. 2020). We annotated the OR repertoires of five Polistes species representing ∼40 million years of evolution: P. fuscatus, P. metricus, P. dorsalis, P. canadensis, and P. dominula (fig. 1). Combining neuroanatomy, manual gene annotation, and molecular evolution analysis, we examined the evolution of Polistes ORs with a focus on the 9-exon subfamily. We discover that social wasps, like ants, have an expanded set of 9-exon ORs. Between Polistes species, 9-exon ORs exhibit dynamic evolution and relaxed negative selection relative to ORs in other subfamilies, which are highly conserved. Patterns of molecular evolution of the 9-exon OR subfamily in social wasps are consistent with a unique function in combinatorial coding perception of CHCs.
. 1.

Phylogeny of five Polistes species considered in this study: P. fuscatus, P. metricus, P. dorsalis, P. canadensis, and P. dominula. The photo to the right of the phylogeny shows P. fuscatus foundresses on a nest. Phylogenetic tree based on the 16S ribosomal RNA gene and the cytochrome oxidase subunit I gene.

Phylogeny of five Polistes species considered in this study: P. fuscatus, P. metricus, P. dorsalis, P. canadensis, and P. dominula. The photo to the right of the phylogeny shows P. fuscatus foundresses on a nest. Phylogenetic tree based on the 16S ribosomal RNA gene and the cytochrome oxidase subunit I gene.

Results

Antennal Lobe Neuroanatomy and Manual Gene Annotation Predict 200 ORs in P. fuscatus

In order to predict the OR repertoires of P. fuscatus and four other Polistes species, we combined fluorescent confocal microscopy of the P. fuscatus antennal lobe with manual genome annotation informed by antennal RNAseq. We found 229 glomeruli in the antennal lobe of an adult gyne (female reproductive) (supplementary fig. S1, Supplementary Material online). Across a sample of insects, the number of intact OR genes in the genome correlates with the number of glomeruli in the antennal lobe, predicting 229 ORs in the P. fuscatus genome (fig. 2). Here, we focus on the P. fuscatus genome because it has nearly chromosome level scaffolds and is the best assembled Polistes genome (Patalano et al. 2015; Standage et al. 2016; Miller et al. 2020) (supplementary table S1, Supplementary Material online). Automated annotation using the MAKER pipeline (Holt and Yandell 2011) without guidance from antennal mRNA predicted 115 OR gene models in the P. fuscatus genome. A combined P. fuscatus male and gyne (reproductive female) antennal transcriptome generated using Trinity (Haas et al. 2013) yielded 89 OR genes greater than 900 nucleotides in length. Some long Trinity genes contain multiple 7-transmembrane domains and likely represent concatenated OR genes. The small fraction of the P. fuscatus OR repertoire predicted by transcriptome assembly is consistent with previous observations that annotation of OR repertoires using only transcriptome data typically fails to recover all ORs (Karpe et al. 2016, 2017, 2021).
Fig. 2.

The number of functional ORs is correlated with the number of antennal lobe glomeruli across insect species (50 glomeruli and 62 ORs in the genome of the common fruit fly D. melanogaster, Fishilevich and Vosshall 2005; 166 glomeruli in the worker and 163 intact ORs in the genome of the honey bee A. mellifera, Arnold et al. 1985, Robertson and Wanner 2006; ∼200 glomeruli in females and 225 intact ORs in the genome of the parasitic wasp N. vitripennis, Groothuis et al. 2019, Robertson et al. 2010; ∼434 glomeruli in the worker and 352 functional ORs in the genome of the ant C. floridanus, Zube and Rössler 2008, Zhou et al. 2012; 493 glomeruli in the worker and 503 intact ORs in the genome of the ant O. biroi, McKenzie et al. 2016; McKenzie and Kronauer 2018). The diagonal line represents a line of equality with slope of 1.

The number of functional ORs is correlated with the number of antennal lobe glomeruli across insect species (50 glomeruli and 62 ORs in the genome of the common fruit fly D. melanogaster, Fishilevich and Vosshall 2005; 166 glomeruli in the worker and 163 intact ORs in the genome of the honey bee A. mellifera, Arnold et al. 1985, Robertson and Wanner 2006; ∼200 glomeruli in females and 225 intact ORs in the genome of the parasitic wasp N. vitripennis, Groothuis et al. 2019, Robertson et al. 2010; ∼434 glomeruli in the worker and 352 functional ORs in the genome of the ant C. floridanus, Zube and Rössler 2008, Zhou et al. 2012; 493 glomeruli in the worker and 503 intact ORs in the genome of the ant O. biroi, McKenzie et al. 2016; McKenzie and Kronauer 2018). The diagonal line represents a line of equality with slope of 1. Manual gene annotation of P. fuscatus ORs recovered 231 gene models across 28 scaffolds (supplementary fig. S2, Supplementary Material online), of which 28 are pseudogenes and 10 are incomplete gene models (seven missing N termini, two missing C termini, and one missing both N and C termini). Since functional insect ORs are typically composed of 400 amino acids, we defined gene models as putatively functional if they coded for proteins greater than or equal to 300 amino acids in length, even if the gene models were incomplete. In P. fuscatus, the 200 putatively functional gene models encode protein sequences with an average length of 395 ± 15 (SD) amino acids, and 198 of these gene models encode protein sequences greater than 350 amino acids in length (table 1). OR proteins possess seven transmembrane domains (Wicher 2015). The putatively functional P. fuscatus OR proteins possess on average 5.95 ± 0.91 (SD) transmembrane domains as predicted by TMHMM version 2.0c (Sonnhammer et al. 1998) and 6.43 ± 1.13 (SD) as predicted by Phobius version 1.01 (Käll et al. 2004). For comparison, transmembrane domain prediction in 61 D. melanogaster ORs coding for proteins greater than 375 amino acids in length found on average 5.77 ± 1.12 (SD) transmembrane domains as predicted by TMHMM version 2.0c and 6.18 ± 1.09 (SD) as predicted by Phobius version 1.01 (sequences from Supplemental Data 1, Supplementary Material online, in Hopf et al. 2015). The close match between the number of ORs predicted by neuroanatomy and the number recovered from manual annotation suggests that we have identified nearly all of the OR genes in P. fuscatus. The number of transmembrane domains predicted are comparable to annotations of D. melanogaster and approach the seven transmembrane domains expected for insect ORs. Manual OR gene annotation in P. fuscatus and four other Polistes genomes is summarized in table 1.
Table1.

Summary of Odorant Receptor Gene Annotations in Five Polistes Genomes.

SpeciesFunctional ORsaMean LengthbMean TM TMHMMcMean TM PhobiusdOR ModelsPSEPartial Models
P. fuscatus 200395 ± 155.95 ± 0.916.43 ± 1.132312810
P. metricus 204396 ± 135.96 ± 0.856.45 ± 1.17217129
P. dorsalis 177393 ± 205.90 ± 0.906.40 ± 1.212031624
P. canadensis 188394 ± 175.95 ± 0.916.48 ± 1.202351359
P. dominula 180392 ± 195.99 ± 0.886.59 ± 1.33202733

OR gene models encoding proteins ≥ 300 amino acids in length.

Mean length in amino acids (± SD).

Mean transmembrane domains predicted by TMHMM.

Mean transmembrane domains predicted by Phobius.

Summary of Odorant Receptor Gene Annotations in Five Polistes Genomes. OR gene models encoding proteins ≥ 300 amino acids in length. Mean length in amino acids (± SD). Mean transmembrane domains predicted by TMHMM. Mean transmembrane domains predicted by Phobius.

9-Exon OR Subfamily Expanded During the Evolution of Social Wasps

We conducted a Hymenoptera-wide analysis of OR evolution to test the prediction that the 9-exon OR subfamily was independently expanded during the evolution of eusociality in vespid wasps. By comparing the P. fuscatus OR repertoire to other Hymenopterans, our findings reinforce previous results showing that across Hymenopteran families, ORs evolve with lineage-specific expansions of multiple OR subfamilies (fig. 3). Gene gain and loss events were predicted using NOTUNG (Chen et al. 2000) and mapped onto a species cladogram of 14 Hymenopterans (fig. 4). NOTUNG estimated an ancestral Apocritan repertoire of 56 ORs, which has expanded independently during the evolution of braconid wasps, ants, bees, and paper wasps (fig. 4). The 9-exon subfamily is commonly expanded across Hymenoptera (∼90 genes on average), and comprises ∼36% of social insect OR repertoires. The largest lineage-specific expansions of Hymenopteran 9-exon ORs have occurred independently during the evolution of ants and social wasps. In P. fuscatus, this clade has expanded to 105 genes, comprising 53% of the OR gene set (fig. 4). Given the well-documented use of CHCs as signal molecules in Polistes (Singer 1998; Dani et al. 2001; Dani 2009; Beani et al. 2019), it is not surprising to find expansions in the putatively CHC-detecting 9-exon subfamily in this genus. Subfamilies L, T, H, E, and V have also expanded in Polistes, but not to the extent of the 9-exon OR subfamily.
. 3.

Maximum likelihood OR protein tree constructed using data from four Hymenopterans (Apis mellifera, Robertson and Wanner 2006; Camponotus floridanus, Zhou et al. 2012; Nasonia vitripennis, Robertson et al. 2010). Branches are colored by species (Red: C. floridanus; Light blue: A. mellifera; Green: P. fuscatus; Purple: N. vitripennis). The L and 9-exon OR subfamilies are highlighted. Scale bar represents 0.5 mean substitutions per site.

. 4.

Cladogram of Hymenoptera species showing estimated number of OR gene gain and loss events along branches and estimated size of ancestral and extant species OR repertoires in boxes. To the right is a bar chart showing numbers of ORs broken down by subfamily. Non-Polistine OR data are from Robertson et al. (2010) and Zhou et al. (2012, 2015). The set of intact ORs that were longer than 300 amino acids was used except for C. floridanus in the bar chart, where only ORs considered putatively functional by Zhou et al. (2012) were used.

Maximum likelihood OR protein tree constructed using data from four Hymenopterans (Apis mellifera, Robertson and Wanner 2006; Camponotus floridanus, Zhou et al. 2012; Nasonia vitripennis, Robertson et al. 2010). Branches are colored by species (Red: C. floridanus; Light blue: A. mellifera; Green: P. fuscatus; Purple: N. vitripennis). The L and 9-exon OR subfamilies are highlighted. Scale bar represents 0.5 mean substitutions per site. Cladogram of Hymenoptera species showing estimated number of OR gene gain and loss events along branches and estimated size of ancestral and extant species OR repertoires in boxes. To the right is a bar chart showing numbers of ORs broken down by subfamily. Non-Polistine OR data are from Robertson et al. (2010) and Zhou et al. (2012, 2015). The set of intact ORs that were longer than 300 amino acids was used except for C. floridanus in the bar chart, where only ORs considered putatively functional by Zhou et al. (2012) were used.

The 9-Exon OR Subfamily Shows a Distinct Pattern of Orthology within Polistes

We next examined the evolutionary history of OR genes among the five Polistes species to reveal patterns of orthology and paralogy within subfamilies. Across the Polistes genus, most OR subfamilies are highly conserved (fig. 5supplementary fig. S3, Supplementary Material online). About 70% of non-9-exon family P. fuscatus ORs are in 1:1 orthology with all other Polistes species sampled as predicted by OrthoFinder (Emms and Kelly 2015) (supplementary table S3, Supplementary Material online). The remaining orthologous groups contain an expansion in one or more species (fig. 5). Considering non-9-exon ORs, most ORs are shared by all five Polistes species examined, and most expansions are shared across all five species. Given that the species examined here span ∼40 million years of divergence (Peters et al. 2017), the conservation of most of the OR repertoire is notable and may be related to the similarity of ecological and social niches found among Polistes wasps. While a common evolutionary history has led to large 9-exon OR complements in all Polistes species examined, lineage-specific gains and losses of 9-exon ORs account for most of the variation in OR repertoire size across Polistes species (fig. 4). In contrast with the other OR subfamilies, the 9-exon OR subfamily shows more lineage specificity with only 32% of P. fuscatus 9-exon ORs showing simple 1:1 orthology across all five Polistes examined (supplementary table S3, Supplementary Material online). Most 9-exon subfamily orthologous groups contain gene copies from four or fewer species, and lineage-specific expansions are more common in 9-exon OR orthologous groups (fig. 5). The relative lack of orthology among 9-exon OR genes compared to the rest of the OR gene subfamilies suggests unique evolutionary processes shaping 9-exon ORs.
Fig. 5.

(A) Maximum likelihood OR protein tree with branches colored by species (Green: Polistes fuscatus; Yellow: P. metricus; Orange: P. dorsalis; Magenta: P. canadensis; Blue: P. dominula). The L and 9-exon subfamilies are highlighted. Scale bar represents 0.4 mean substitutions per site. (B) Stacked bar chart showing the number of Polistes species (x-axis) represented in each orthologous group (y-axis), and whether or not each orthologous group is single copy (shaded bottom portion of bar) or contains an expansion in at least one species (top-striped portion of bar). Orthologous groups are split into two categories: non-9-exon orthologous groups (“non-9e” left bar) and 9-exon orthologous groups (right bar).

(A) Maximum likelihood OR protein tree with branches colored by species (Green: Polistes fuscatus; Yellow: P. metricus; Orange: P. dorsalis; Magenta: P. canadensis; Blue: P. dominula). The L and 9-exon subfamilies are highlighted. Scale bar represents 0.4 mean substitutions per site. (B) Stacked bar chart showing the number of Polistes species (x-axis) represented in each orthologous group (y-axis), and whether or not each orthologous group is single copy (shaded bottom portion of bar) or contains an expansion in at least one species (top-striped portion of bar). Orthologous groups are split into two categories: non-9-exon orthologous groups (“non-9e” left bar) and 9-exon orthologous groups (right bar).

Microsynteny Reveals Recent Birth and Death Events in Polistes 9-Exon OR Subfamily

Expanded gene families often occur as tandem arrays, a genomic architecture that can contribute to increased rates of gene birth and death, increasing copy number variation among species (Ohno 1970). Therefore, we examined how genomic organization varies between OR subfamilies in Polistes species to generate insights into the molecular evolutionary mechanisms shaping OR subfamily function. Genomic organization of ORs across Polistes is consistent with a model of birth and death evolution shaping OR repertoires. As in bees, gene gain and loss at a small number of loci containing tandem arrays are responsible for most copy number variation in the OR family across closely related species (Brand and Ramírez 2017). In P. fuscatus, 62% of ORs occur in tandem arrays of six or more genes (fig. 6). The frequency of tandem arrays and the tail-to-head orientations of neighboring genes point to tandem duplication as the primary mechanism of OR expansion, likely caused by nonallelic homologous recombination (Lynch 2007; Ramdya and Benton 2010). We examined microsynteny of OR genes and pseudogenes in the four longest tandem arrays of ORs in Polistes. Gene birth and death events have resulted in more complex orthology among genes in orthologous 9-exon OR arrays compared to tandem arrays of L and T subfamily ORs in Polistes genomes (fig. 6). The longest OR gene tandem array in P. fuscatus is comprised of 44 genes in the 9-exon subfamily on scaffold 13 (s13), which corresponds to homologous arrays of 50 genes in P. metricus, 25 genes in P. dorsalis, 33 genes in P. canadensis, and 29 genes in P. dominula. Only 34% of P. fuscatus ORs in this array have orthologs across all Polistes species sampled, and collinear orthologs in this array are frequently interrupted by inparalogs (fig. 6). The second longest OR gene tandem array in P. fuscatus contains 24 ORs in the L subfamily on scaffold 17 (s17), and these ORs show 1:1 orthology across P. fuscatus, P. metricus, and P. dorsalis, while P. canadensis possesses an array of ∼23 genes split between two scaffolds, and P. dominula possesses an array of 21 ORs at this locus. This tandem array, widely expanded across Hymenoptera, has been expanded and conserved across Polistes. The T subfamily, located on scaffold 8 (s8) of the P. fuscatus genome, is composed of 14 tandemly arrayed genes that show 1:1 orthology across five Polistes (fig. 6). Differences between OR subfamilies in patterns of orthology within microsyntenic regions highlight the unique evolutionary processes shaping 9-exon OR evolution in paper wasps. At the same time, the orthology of collinear genes within syntenic L and T subfamily tandem arrays across all examined Polistes species highlights the strong conservation of much of the Polistes OR repertoire.
Fig. 6.

(A) Frequency of OR gene singletons and tandem arrays in the Polistes fuscatus genome. 62% of ORs in P. fuscatus occur in tandem arrays of six or more genes. The longest tandem array is a 44 gene cluster on scaffold 13 (s13) containing 9-exon subfamily ORs. The first row of x-axis labels is the number of OR genes in a tandem array cluster, and the second row labels the OR subfamily and scaffold number (abbreviated s# in parentheses) of the six longest tandem arrays. (B) Genome alignments of four loci containing tandem arrays of OR genes in all Polistes species examined. Each alignment is labeled with the corresponding OR subfamily and P. fuscatus scaffold number. Black boxes represent putatively functional genes (≥300 amino acids) and gray boxes represent pseudogenes. Directionality of genes is denoted by curved corners at the 3' (tail) end. Black lines connect orthologous genes between species. Genomic scaffolds are represented by horizontal, gray lines, and scaffold ends are represented by vertical gray lines. Scale bars beneath each alignment represent 5 kb.

(A) Frequency of OR gene singletons and tandem arrays in the Polistes fuscatus genome. 62% of ORs in P. fuscatus occur in tandem arrays of six or more genes. The longest tandem array is a 44 gene cluster on scaffold 13 (s13) containing 9-exon subfamily ORs. The first row of x-axis labels is the number of OR genes in a tandem array cluster, and the second row labels the OR subfamily and scaffold number (abbreviated s# in parentheses) of the six longest tandem arrays. (B) Genome alignments of four loci containing tandem arrays of OR genes in all Polistes species examined. Each alignment is labeled with the corresponding OR subfamily and P. fuscatus scaffold number. Black boxes represent putatively functional genes (≥300 amino acids) and gray boxes represent pseudogenes. Directionality of genes is denoted by curved corners at the 3' (tail) end. Black lines connect orthologous genes between species. Genomic scaffolds are represented by horizontal, gray lines, and scaffold ends are represented by vertical gray lines. Scale bars beneath each alignment represent 5 kb. Microsynteny analysis suggests a process of ongoing gene turnover in 9-exon arrays but stasis in most other expanded subfamilies. More recent turnover should be associated with higher pairwise amino acid identity between neighboring genes in an array if they are the result of recent duplication events (Ohno 1970; Bohbot et al. 2007; Engsontia et al. 2015). To explore the relationship between amino acid divergence and tandem array locus, we compared the mean percent amino acid identity among neighboring genes within an array between the eight loci containing the longest tandem arrays of ORs in the P. fuscatus genome using one-way ANOVA (fig. 7). Mean percent amino acid identity of neighboring genes was significantly separated by OR array identity (DF = 7; F = 5.39; P-value = 2.67e−05). Differences between particular OR tandem arrays were identified using Tukey HSD post hoc tests. The mean percent amino acid identity among neighboring genes within one tandem array of nine 9-exon ORs on scaffold 12 (s12) of the P. fuscatus genome is higher than in the s13 9-exon array (P Adj = 0.04586), the s17 L array (P Adj = 0.00013), the s8 T array (P Adj = 0.00458), the s16 9-exon array (P Adj = 0.00124), and the s19 V array (P Adj = 0.02247). The s12 9-exon OR array is composed of a larger proportion of pseudogenes (5 PSE, 10 intact gene models) than the other two 9-exon arrays (s13: 9 PSE, 44 intact gene models; s16: 0 PSE, 12 intact gene models). ORs in the P. fuscatus s12 9-exon array lack clear orthologous relationships with ORs in species other than P. metricus. Taken together, the high within array sequence similarity, high frequency of pseudogenes, and low orthology exhibited by this array indicate that it is the result of one or more recent gene duplication events since the divergence of P. fuscatus and P. metricus from the other three Polistes species. The s6 H subfamily array also shows higher amino acid sequence identity among neighboring genes than the s17 L subfamily array (P-value Adj = 0.01625) and the s16 9-exon array (P-value Adj = 0.04038). Increased amino acid similarity may also occur within older tandem arrays as a result of gene conversion (Nagawa et al. 2002). However, we searched for gene conversion using GENECONV (Sawyer 1989) and did not detect gene conversion events within the s12 9-exon array or in the s6 H array after Bonferroni correction. Patterns of genomic organization of OR genes in Polistes genomes lead to the conclusion that gene gain and loss in the 9-exon OR subfamily is an ongoing process within this genus, in contrast to the stable and conserved tandem arrays in most other OR subfamilies.
. 7.

Percent amino acid identity between neighboring genes at eight loci containing the longest OR gene tandem arrays in the P. fuscatus genome. Arrays are ordered by length in gene number, from longest (44 9-exon subfamily ORs in the s13 tandem array) to shortest (6 H subfamily ORs in the s6 tandem array and 6 V subfamily ORs in the s19 tandem array).

Percent amino acid identity between neighboring genes at eight loci containing the longest OR gene tandem arrays in the P. fuscatus genome. Arrays are ordered by length in gene number, from longest (44 9-exon subfamily ORs in the s13 tandem array) to shortest (6 H subfamily ORs in the s6 tandem array and 6 V subfamily ORs in the s19 tandem array).

Positive Selection in Expanded OR Subfamilies and Accelerated Evolution of 9-Exon ORs

We examined the variation in omega (dN/dS) and used codon models to characterize sequence evolution of Polistes OR genes. We were especially interested in patterns of selection in the 9-exon OR clade. Consecutive analyses of Polistes OR subfamilies using HyPhy adaptive branch-site random effects likelihood (aBSREL) model (Smith et al. 2015; Pond et al. 2020) detected eight branches under episodic positive selection, all in OR subfamilies with expansions: three branches in the 9-exon subfamily (0.33% of 918 9-exon subfamily branches; supplementary fig. S4, Supplementary Material online); three branches in the L subfamily (1.28% of 234 L subfamily branches; supplementary fig. S5, Supplementary Material online); one branch in the E subfamily (1.67% of 60 E subfamily branches; supplementary fig. S6, Supplementary Material online); and one branch in the H subfamily (1.54% of 65 H subfamily branches; supplementary fig. S7, Supplementary Material online). This supports the hypothesis that gene duplication releases duplicate genes from selective constraints, allowing duplicate sequences to evolve towards other evolutionary optima (Ohno 1970; Saad et al. 2018). To visualize the range of patterns of synonymous and nonsynonymous substitutions in Polistes ORs, we computed the values of dN and dS for pairwise alignments of 151 single copy orthologs between P. fuscatus and P. dorsalis (fig. 8) using model yn00 of PAML (Yang 2007). Values of dN are significantly higher in 9-exon (mean dN = 0.015) compared to other OR ortholog pairs (mean dN = 0.006) (Welch Two Sample t-test; P-value = 5.317e−07). Values of dS are not significantly elevated among 9-exon ortholog pairs compared to other OR subfamilies (mean dS = 0.029 in 9-exon ORs and 0.025 in non-9-exon ORs; P-value = 0.343). Omega values (dN/dS) greater than 1 are often considered evidence of positive selection, while dN/dS = 1 corresponds to neutral drift, and dN/dS < 1 is evidence of negative (purifying) selection. The omega value (dN/dS) for the majority of OR genes is less than one, suggesting negative selection (mean omega = 0.454). However, omega is significantly higher in 9-exon ORs than in non-9-exon ORs, indicating that negative selection is weaker on 9-exon ORs (Welch Two Sample t-test: mean omega = 0.644 in 9-exon ORs (N = 62) and 0.32 in non-9-exon ORs (N = 89); P-value = 8.027e−05). This 1:1 orthology analysis excludes patterns of molecular evolution among genes with more complex orthology relationships, though an analysis considering all orthogroups identified by OrthoFinder containing at least four genes (N = 145) confirms the pattern. Estimates of omega by M0 in CodeML are significantly higher in 9-exon orthogroups than in non-9-exon orthogroups (Welch Two Sample t-test: mean omega = 0.407 in 9-exon ORs (N = 66) and 0.189 in non-9-exon ORs (N = 79); P-value < 2.2e−16) (supplementary fig. S8, Supplementary Material online). An elevated omega ratio in 9-exon 1:1 ortholog pairs and orthogroups implies that either a relaxation of negative selection or an intensification of positive selection is responsible for sequence evolution in the 9-exon relative to other OR subfamilies. We explicitly tested the hypothesis that relaxed negative selection is responsible for higher omega values in branches of the 9-exon OR clade compared to branches in other OR subfamilies using HyPhy RELAX (Wertheim et al. 2015; Pond et al. 2020). This analysis found a significant pattern of reduced selection intensity in the 9-exon OR clade compared to the rest of the Polistes OR tree (LRT = 505.81; mean selection intensity parameter k = 0.48; P-value < 0.0001). Relaxed selection in the 9-exon OR subfamily may allow ORs to explore phenotypic space and develop novel response spectra for behaviorally relevant chemical signals.
. 8.

The values of dS (x-axis) and dN (y-axis) from pairwise alignments of Polistes fuscatus and P. dorsalis 1:1 orthologs. Values of dN are elevated in the 9-exon OR subfamily (data points represented by red triangles) relative to other OR subfamilies (data points represented by circles). The diagonal line represents a line of equality with slope of 1.

The values of dS (x-axis) and dN (y-axis) from pairwise alignments of Polistes fuscatus and P. dorsalis 1:1 orthologs. Values of dN are elevated in the 9-exon OR subfamily (data points represented by red triangles) relative to other OR subfamilies (data points represented by circles). The diagonal line represents a line of equality with slope of 1.

Discussion

Expansion of 9-Exon OR Subfamily during Independent Evolution of Sociality in Wasps

By carefully annotating the OR repertoires of five social wasp species spanning ∼40 million years of divergence in the Polistes genus, this study adds a higher resolution lens to our view of the evolution of social insect ORs. During the diversification of Polistes, evolutionary patterns show genus-wide conservation of their ∼200 ORs except for the 9-exon genes, which show elevated turnover and lower sequence conservation. The 9-exon OR subfamily has expanded in paper wasps, and now makes up over half of the Polistes OR gene set. Social and ecological niches are relatively conserved within Polistes, though there is considerable variation in social behavior and ecological niches among vespid wasps (Ross and Matthews 1991; O’Neill 2001). An analysis of three hornet genomes suggested that the highly eusocial hornets may have even larger OR repertoires compared to the primitively eusocial Polistes (Harrop et al. 2020). That analysis recovered less than half of the ORs reported here for Polistes, likely due to a lack of manual annotation informed by antennal transcriptome data, suggesting that hornets may have larger OR repertoires than reported. Evidence from the hornet Vespa velutina, including the discovery of ∼265 antennal lobe glomeruli, indicates that the hornet OR repertoire has expanded (Couto et al. 2016). Interestingly, 96 glomeruli populate the TB cluster in V. velutina, an antennal lobe region innervated by CHC-detecting sensilla basiconica and proposed to be homologous with the T6 antennal lobe cluster of ants (Couto et al. 2017). Given that the number of T6 glomeruli correlates with the number of 9-exon OR genes expressed in ant antennae, the large number of TB glomeruli in hornets strongly suggests an expanded complement of 9-exon ORs (McKenzie et al. 2016). Future analysis of additional genomes and antennal transcriptomes of diverse social and solitary vespid wasps will allow further examination of the relationship between social behavior and OR subfamily expansion.

Combinatorial Coding of CHCs by 9-Exon ORs Facilitates Recognition

Electrophysiological deorphanization studies of 9-exon ORs in the ant Harpegnathos saltator offer key insights into how 9-exon OR coding might relate to gene expansion. Through combinatorial coding, 9-exon ORs can detect a large variety of structurally diverse CHCs. Pask et al. (2017) examined 22 H. saltator 9-exon ORs, a subset of the 118 annotated 9-exon ORs in this species, and found that 9-exon ORs were responsive to CHCs, and overlapped in their responses to multiple CHC compounds. The combined responses of these 22 ORs to CHC extracts from different castes were sufficient to map the CHC profiles of males, workers, and reproductive females (gamergates) to separate regions of a 22-dimensional receptor space (Pask et al. 2017). This highlights the ability of 9-exon ORs to facilitate social recognition by combinatorial coding. In social insect colonies, CHC variation holds information at multiple levels of conspecific recognition, from inter-colony nestmate recognition to within colony individual recognition (Greene and Gordon 2003; d’Ettorre and Heinze 2005; d’Ettorre and Moore 2008; Leonhardt et al. 2016). Expansion of the 9-exon OR subfamily might result from selection for more combinations of ORs that together can discriminate between subtle qualitative and quantitative variations in CHC blends of conspecifics. Nest-specific quantitative variation in CHCs has been documented across Polistes species (Espelie et al. 1990; Singer et al. 1992; Espelie et al. 1994; Layton et al. 1994), but the molecular mechanisms underlying nestmate recognition in Polistes are still obscure. Increased copy number of 9-exon ORs may not only expand the qualitative range of compounds perceived by paper wasps, but also the perceived quantitative olfactory space, since wasps may be able to discern unique concentration differences between CHC blends as a result of the combined action of 9-exon ORs with various response thresholds. Gene duplication can also promote regulatory diversification (Kucharski et al. 2016; Dyson and Goodisman 2020). In P. metricus, CHCs vary between castes and across stages of the colony cycle (Toth et al. 2014). Regulatory subfunctionalization of duplicate ORs could be responsible for caste- and colony phase-specific expression of ORs involved in detecting caste-specific and seasonally variable CHCs. In addition to adaptive expansion of ORs, neutral processes contribute to OR gene birth-and-death events. There may be an advantage for a large 9-exon OR gene copy number up to a point, followed by random gene duplication and deletion around this optimal copy number. This random genomic drift has been proposed to shape mammalian olfactory receptor evolution and copy number variation in other large multigene families (Nei 2007; but see Hayden et al. 2010). Indeed, we find evidence of relaxed selection on the 9-exon OR subfamily compared to other wasp OR subfamilies, which is consistent with predictions for the evolution of combinatorial coding (Andersson et al. 2015). Mutations that slightly alter the response profiles of functionally redundant ORs may not be eliminated by negative selection, since other ORs can help compensate (Fishilevich et al. 2005; Keller and Vosshall 2007).

Evolution of ORs Reflects Distinct Chemosensory Ecologies of Species

Social insect species differ in their level of sociality and extent of olfactory recognition abilities (Stuart 1988; Page et al. 1991; d’Ettorre and Moore 2008; Peeters and Liebig 2009; Rehan and Toth 2015). Some aspects of the Polistes colony cycle vary across species. For example, the average number of cooperative foundresses varies from 1 to ∼6, and average sizes of mature nests may vary from ∼60 cells in P. metricus to ∼490 cells in P. annularis (Rabb 1960; Downing and Jeanne 1986; Reeve 1991; Sheehan et al. 2015; Miller SE et al. 2018). Increased 9-exon OR copy number may facilitate complex olfactory recognition in species with larger colony sizes, higher cooperative nest-founding rates, and greater sympatry with related species. However, expansions of 9-exon ORs are not exclusive to social wasps, suggesting that the specific chemical ecology of an insect is a more influential factor shaping OR evolution than level of sociality (Karpe et al. 2017). Furthermore, a meta-analysis found that the complexity of CHC phenotypes does not differ between social and solitary Hymenopteran species (Kather and Martin 2015). The CHC profile of Nasonia vitripennis includes at least 52 CHC compounds, and detection of CHCs on prey items may help Microplitis identify prey (Lewis et al. 1988; Niehuis et al. 2011). The need for parasitoid wasps to perceive CHCs could explain why genomes of N. vitripennis and M. demolitor exhibit expansions in the 9-exon OR subfamily. The OR repertoire of the fig wasp Ceratosolen solmsi is about one-third of the size of those described in N. vitripennis and M. demolitor, which likely reflects the specialized sensory demands of identifying one host plant species (Xiao et al. 2013). In general, the 9-exon OR subfamily comprises a smaller proportion of the OR gene set in bees, which do not predate insects, relative to ants and wasps. The extreme expansion of 9-exon ORs in the myrmecophagous clonal raider ant relative to other ant species provides further evidence suggesting that the need to detect insect prey may influence 9-exon OR gene content. Dietary selective pressures may play a part in shaping the evolution of Polistes ORs. Paper wasps appear to primarily predate Lepidoptera larvae, and sympatric Polistes species predate overlapping sets of prey species (Rabb and Lawson 1957; Rabb 1960; Southon et al. 2019). P. dominula workers were found to predate a wider range of insects than other Polistes species (Cervo et al. 2000). Variation in the OR repertoire could underlie variation in foraging behavior between Polistes species.

Lineage-Specific Molecular Evolution of Polistes 9-Exon ORs

Most expanded OR subfamilies are highly conserved in copy number across five Polistes species, with the exception of the 9-exon OR subfamily. In particular, one portion of the 9-exon subfamily arranged in a tandem array (P. fuscatus 9e s13; fig. 6) has experienced dynamic evolution. What selective pressures might drive rapid gain and loss of 9-exon ORs? Divergent chemical signaling between species may lead to gene turnover as 9-exon OR evolution tracks evolutionarily labile chemical signals. For example, P. fuscatus and P. metricus are relatively closely related, and both species possess CHC profiles consisting of linear and methyl-branched alkanes (Espelie et al. 1990; 1994). However, the P. fuscatus CHC profile includes a higher proportion of alkenes than P. metricus or P. dominulus, and the position of the methylated carbon of methyl-branched alkanes is sometimes shifted between species (Espelie et al. 1990; Singer et al. 1992; Espelie et al. 1994; Layton et al. 1994). Ant 9-exon ORs respond differently to subtle variations in CHC structure (Pask et al. 2017). Between closely related Polistes species, structural isomers of methyl-branched alkanes probably activate different ensembles of ORs. If a chemical evolves new behavioral relevance in a population, gene duplication could allow the olfactory system to explore chemical space in the direction of this compound. HyPhy aBSREL analyses identified eight branches in expanded OR subfamilies, including the 9-exon subfamily, that have undergone positive selection during the last ∼40 million years, consistent with neofunctionalization or subfunctionalization of duplicated genes. Signatures of positive selection on OR genes may indicate directional selection to perceive species-specific chemical signals. Perception of species-specific CHCs might be important in mate compatibility recognition. In Polistes, mating occurs at sites defended by males and visited by females of multiple species (Post and Jeanne 1984, 2010; Reed and Landolt 1990). However, the frequency of interspecific mating is low, suggesting Polistes use vision and/or olfaction to inform their mating decisions (Miller et al. 2019). Duplication and deletion of ORs would facilitate evolution of species-specific chemical signaling systems that could contribute to reproductive isolation of sympatric species. If a chemical signal is lost in a species, the corresponding ORs may become obsolete, and would be expected to pseudogenize and be purged from the genome. Duplication and deletion of ORs could also lead to species-specific chemical signaling in the absence of evolutionary change in chemical signals (Cande et al. 2013). However, OR evolution is not strictly necessary for such a difference to evolve between species, and circuit-level changes can prescribe new valence to chemical signals that are shared between species and perceived by common peripheral receptors (Seeholzer et al. 2018).

Conservation of Most OR Subfamilies Suggests Conserved Functions

Aside from the 9-exon OR subfamily, gene expansions have occurred in subfamilies L, T, H, E, and V (fig. 4). A larger variety of ORs relaying information through odorant receptor neurons (ORNs) to a larger number of antennal lobe glomeruli will increase sensory acuity in any olfactory discrimination task, social, or otherwise. An ancient locus of tandemly duplicated L subfamily ORs observed across social insects has expanded in Polistes, although to a lesser extent than in other social insects (∼50 L subfamily ORs in honeybee and ants, 25 L subfamily ORs in a tandem array on P. fuscatus scaffold 17). ORs in the L subfamily are thought to detect queen pheromone components and fatty acids in bees as well as CHCs in ants (Wanner et al. 2007; Karpe et al. 2016; Slone et al. 2017). The T subfamily has expanded to a greater degree in P. fuscatus (14 genes) than in ants (∼7 genes) and the honeybee (2 genes), but no ORs in this clade have been functionally characterized. The P. fuscatus genome encodes nine H subfamily ORs, which are putative floral odorant detectors in bees, and which also respond to CHCs and other general odorants in ants (Claudianos et al. 2014; Slone et al. 2017). Fatty acids and volatile organic compounds are produced by flowers that wasps rely on as a source of carbohydrates (Raguso 2008). Expansions in several OR subfamilies may increase olfactory discrimination of chemicals with diverse behavioral relevance. Polistes species are distributed globally in temperate and tropical regions, occupying similar social and ecological niches as generalist predators of Lepidoptera and floral foragers that form primitively eusocial societies (Reeve 1991; Richter 2000). A conserved subset of the OR repertoire may perform common functions in conserved behaviors across paper wasp species. High levels of OR conservation are also consistent with a specialist molecular function of an OR in a dedicated channel of olfaction (Andersson et al. 2015).

Distinct Patterns of OR Evolution within the Same Genome

In paper wasps, we report both highly conserved OR expansions similar to those seen in Drosophila as well as elevated gene turnover and drift among the 9-exon ORs, reminiscent of a more mammal-like evolutionary pattern. The differences between the conserved OR repertoires in Drosophila and the more dynamic evolution of mammal OR gene families have given rise to speculation about the relationship between OR function and evolution (Nozawa and Nei 2007; Andersson et al. 2015). If the highly dynamic clades of 9-exon ORs of social wasps are involved in more combinatorial coding compared to other more conserved 9-exon or non-9-exon ORs, that would indicate a link between molecular evolution of ORs and neural coding. Further investigations into the relative tuning of 9-exon as well as more conserved ORs in social wasps and other social insects provide a promising research direction to investigate the links between molecular evolutionary patterns, OR tuning, and neural coding.

Materials and Methods

Antennal Lobe Imaging

Antennal lobe glomeruli of male and female P. fuscatus wasps were stained with anti-synapsin and imaged using a confocal laser scanning microscope. Details of the immunocytochemistry and imaging are included as Supplementary Material online.

Gene Annotation

The genomes and annotations of P. canadensis, P. dominula, P. fuscatus, P. dorsalis, and P. metricus were accessed through NCBI (Patalano et al. 2015; Standage et al. 2016; Miller SE et al. 2020). Coding regions of ORs were identified by using TBLASTN (Altschul et al. 1997) with a sample of OR proteins from 19 insect species used as query sequences: Atta cephalotes, Acromyrmex echinatior, Apis mellifera, Camponotus floridanus, Cardiocondyla obscurior, Ceratosolen solmsi, Drosophila melanogaster, Eulaema bombiformis, Euglossa dilemma, Euglossa flammea, Euglossa imperialis, Eulaema meriana, Eufriesea mexicana, Lasioglossum albipes, Microplitis demolitor, Monomorium pharaonis, Melipona quadrifasciata, Nasonia vitripennis, Solenopsis invicta (Robertson et al. 2003, 2010; Zhou et al. 2012, 2015; Brand and Ramírez 2017; McKenzie and Kronauer 2018). Genomes were queried iteratively with TBLASTN, adding newly annotated Polistes ORs to the query file, until no new OR coding regions were identified. To guide annotation of exon–intron boundaries, antennal mRNA from P. fuscatus males and females (gynes) was mapped to P. fuscatus, P. metricus, and P. dorsalis genomes using STAR (Dobin et al. 2013) and assembled into transcripts using Trinity (Haas et al. 2013) (supplementary table S2, Supplementary Material online). Predicted transcripts were aligned to genomes using BLAT. Uncertain gene models in P. metricus, P. dorsalis, P. canadensis,and P. dominula were aligned to their orthologs in P. fuscatus using Muscle version 3.8.425 with maximum four iterations (Edgar 2004), and gene models were manually adjusted. All annotation evidence was imported into Geneious v11.1.5 genome browser for manual annotation. The majority of apparently functional ORs that were not detected by the automated annotation and required extensive manual curation were 9-exon subfamily receptors (e.g. supplementary fig. S9, Supplementary Material online). Gene models were called pseudogenes if they exhibited frame-shift mutations, premature stop codons, or unacceptable 5ʹ donor or 3ʹ receptor splice sites. Transmembrane helices of all putatively functional ORs (>300 amino acids) were predicted using TMHMM version 2.0c (Sonnhammer et al. 1998) and Phobius version 1.01 (Käll et al. 2004). Throughout the main text, “putatively functional ORs” are OR proteins at least 300 amino acids in length. Details of mRNA library preparation, sequencing, read mapping, and manual gene annotation are included in Supplementary Material online.

Phylogenetic Reconstruction

Phylogenetic trees were constructed using RAxML (Jones et al. 1992; Stamatakis 2014). Gene duplication and loss events were reconstructed by reconciling a gene tree with a species tree in NOTUNG version 2.9.1.3 (Durand et al. 2006; Vernot et al. 2008). Orthologous genes were determined using OrthoFinder (Emms and Kelly 2015), bootstrap support, and microsynteny. Phylogenetic reconstruction methods are explained in detail in Supplementary Material online.

Genomic Organization

OR genes and pseudogenes were considered to be in a tandem array if they were uninterrupted by non-OR genes and were within 5 kb of each other. The lengths of OR arrays correspond to the number of putatively functional ORs and exclude the pseudogenes contained within the array. The pairwise percent amino acid identity between neighbors in an array was calculated using only putatively functional ORs that neighbored another putatively functional OR within 5 kb. Details of analyses of genomic organization are included in the Supplementary Material online.

Sequence Analyses

All putatively functional Polistes ORs greater than 350 amino acids in length were used in analyses of selection. The adaptive branch-site relative effects likelihood model (aBSREL) was used to test for signatures of episodic diversifying positive selection in HyPhy version 2.5.15 (Smith et al. 2015; Pond et al. 2020). Values of pairwise dN/dS for orthologs shared by P. fuscatus and P. dorsalis were estimated using PAML version 4.9 (Yang 2007) program yn00 with the Yang and Nielsen (2000) method. Values of dN/dS for Polistes orthogroups were estimated using PAML version 4.9 program CodeML with the M0 (one-ratio) model (Yang 2007). Finally, RELAX was run in HyPhy version 2.5.15 to test for relaxed negative (relaxed purifying) selection (Wertheim et al. 2015; Pond et al. 2020). RELAX was used to test for relaxed negative selection on 9-exon OR branches relative to non-9-exon OR branches in the Polistes OR tree. Gene subfamily codon alignments used in the above aBSREL analysis were tested with GENECONV to identify gene conversion (Sawyer 1989). Details of sequence analyses are included in the Supplementary Material online.

Data Availability

The genome assemblies analyzed in this article are available on Genbank (see supplementary table S1, Supplementary Material online). Gene models, amino acid sequences, and nucleotide sequences underlying this article, as well as alignments analyzed in HyPhy aBSREL and RELAX selection analyses and phylogenetic trees in newick format, are available in its Supplementary Material online. Click here for additional data file.
  137 in total

1.  A spatial map of olfactory receptor expression in the Drosophila antenna.

Authors:  L B Vosshall; H Amrein; P S Morozov; A Rzhetsky; R Axel
Journal:  Cell       Date:  1999-03-05       Impact factor: 41.582

Review 2.  Evolution of insect olfaction.

Authors:  Bill S Hansson; Marcus C Stensmyr
Journal:  Neuron       Date:  2011-12-08       Impact factor: 17.173

3.  Draft genome of the red harvester ant Pogonomyrmex barbatus.

Authors:  Chris R Smith; Christopher D Smith; Hugh M Robertson; Martin Helmkampf; Aleksey Zimin; Mark Yandell; Carson Holt; Hao Hu; Ehab Abouheif; Richard Benton; Elizabeth Cash; Vincent Croset; Cameron R Currie; Eran Elhaik; Christine G Elsik; Marie-Julie Favé; Vilaiwan Fernandes; Joshua D Gibson; Dan Graur; Wulfila Gronenberg; Kirk J Grubbs; Darren E Hagen; Ana Sofia Ibarraran Viniegra; Brian R Johnson; Reed M Johnson; Abderrahman Khila; Jay W Kim; Kaitlyn A Mathis; Monica C Munoz-Torres; Marguerite C Murphy; Julie A Mustard; Rin Nakamura; Oliver Niehuis; Surabhi Nigam; Rick P Overson; Jennifer E Placek; Rajendhran Rajakumar; Justin T Reese; Garret Suen; Shu Tao; Candice W Torres; Neil D Tsutsui; Lumi Viljakainen; Florian Wolschin; Jürgen Gadau
Journal:  Proc Natl Acad Sci U S A       Date:  2011-01-31       Impact factor: 11.205

4.  The new mutation theory of phenotypic evolution.

Authors:  Masatoshi Nei
Journal:  Proc Natl Acad Sci U S A       Date:  2007-07-17       Impact factor: 11.205

5.  An Engineered orco Mutation Produces Aberrant Social Behavior and Defective Neural Development in Ants.

Authors:  Hua Yan; Comzit Opachaloemphan; Giacomo Mancini; Huan Yang; Matthew Gallitto; Jakub Mlejnek; Alexandra Leibholz; Kevin Haight; Majid Ghaninia; Lucy Huo; Michael Perry; Jesse Slone; Xiaofan Zhou; Maria Traficante; Clint A Penick; Kelly Dolezal; Kaustubh Gokhale; Kelsey Stevens; Ingrid Fetter-Pruneda; Roberto Bonasio; Laurence J Zwiebel; Shelley L Berger; Jürgen Liebig; Danny Reinberg; Claude Desplan
Journal:  Cell       Date:  2017-08-10       Impact factor: 41.582

Review 6.  Evolution, developmental expression and function of odorant receptors in insects.

Authors:  Hua Yan; Shadi Jafari; Gregory Pask; Xiaofan Zhou; Danny Reinberg; Claude Desplan
Journal:  J Exp Biol       Date:  2020-02-07       Impact factor: 3.312

7.  Rapid evolution of smell and taste receptor genes during host specialization in Drosophila sechellia.

Authors:  Carolyn S McBride
Journal:  Proc Natl Acad Sci U S A       Date:  2007-03-09       Impact factor: 11.205

8.  Odor memories regulate olfactory receptor expression in the sensory periphery.

Authors:  Charles Claudianos; Julianne Lim; Melanie Young; Shanzhi Yan; Alexandre S Cristino; Richard D Newcomb; Nivetha Gunasekaran; Judith Reinhard
Journal:  Eur J Neurosci       Date:  2014-03-13       Impact factor: 3.386

9.  Obligate mutualism within a host drives the extreme specialization of a fig wasp genome.

Authors:  Jin-Hua Xiao; Zhen Yue; Ling-Yi Jia; Xin-Hua Yang; Li-Hua Niu; Zhuo Wang; Peng Zhang; Bao-Fa Sun; Shun-Min He; Zi Li; Tuan-Lin Xiong; Wen Xin; Hai-Feng Gu; Bo Wang; John H Werren; Robert W Murphy; David Wheeler; Li-Ming Niu; Guang-Chang Ma; Ting Tang; Sheng-Nan Bian; Ning-Xin Wang; Chun-Yan Yang; Nan Wang; Yue-Guan Fu; Wen-Zhu Li; Soojin V Yi; Xing-Yu Yang; Qing Zhou; Chang-Xin Lu; Chun-Yan Xu; Li-Juan He; Li-Li Yu; Ming Chen; Yuan Zheng; Shao-Wei Wang; Shuang Zhao; Yan-Hong Li; Yang-Yang Yu; Xiao-Ju Qian; Yue Cai; Lian-Le Bian; Shu Zhang; Jun-Yi Wang; Ye Yin; Hui Xiao; Guan-Hong Wang; Hui Yu; Wen-Shan Wu; James M Cook; Jun Wang; Da-Wei Huang
Journal:  Genome Biol       Date:  2013-12-20       Impact factor: 13.583

10.  Evolution of Olfactory Functions on the Fire Ant Social Chromosome.

Authors:  Amir B Cohanim; Etya Amsalem; Rana Saad; DeWayne Shoemaker; Eyal Privman
Journal:  Genome Biol Evol       Date:  2018-11-01       Impact factor: 3.416

View more
  3 in total

1.  Annotation and Analysis of 3902 Odorant Receptor Protein Sequences from 21 Insect Species Provide Insights into the Evolution of Odorant Receptor Gene Families in Solitary and Social Insects.

Authors:  Pablo Mier; Jean-Fred Fontaine; Marah Stoldt; Romain Libbrecht; Carlotta Martelli; Susanne Foitzik; Miguel A Andrade-Navarro
Journal:  Genes (Basel)       Date:  2022-05-20       Impact factor: 4.141

2.  Interspecific variation of antennal lobe composition among four hornet species.

Authors:  Antoine Couto; Gérard Arnold; Hiroyuki Ai; Jean-Christophe Sandoz
Journal:  Sci Rep       Date:  2021-10-22       Impact factor: 4.379

3.  Genomic and transcriptomic analyses of the subterranean termite Reticulitermes speratus: Gene duplication facilitates social evolution.

Authors:  Shuji Shigenobu; Yoshinobu Hayashi; Dai Watanabe; Gaku Tokuda; Masaru Y Hojo; Kouhei Toga; Ryota Saiki; Hajime Yaguchi; Yudai Masuoka; Ryutaro Suzuki; Shogo Suzuki; Moe Kimura; Masatoshi Matsunami; Yasuhiro Sugime; Kohei Oguchi; Teruyuki Niimi; Hiroki Gotoh; Masaru K Hojo; Satoshi Miyazaki; Atsushi Toyoda; Toru Miura; Kiyoto Maekawa
Journal:  Proc Natl Acad Sci U S A       Date:  2022-01-18       Impact factor: 11.205

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.