Literature DB >> 28515876

Population genomics reveals a candidate gene involved in bumble bee pigmentation.

Meaghan L Pimsler1, Jason M Jackson1, Jeffrey D Lozier1.   

Abstract

Variation in bumble bee color patterns is well-documented within and between species. Identifying the genetic mechanisms underlying such variation may be useful in revealing evolutionary forces shaping rapid phenotypic diversification. The widespread North American species Bombus bifarius exhibits regional variation in abdominal color forms, ranging from red-banded to black-banded phenotypes and including geographically and phenotypically intermediate forms. Identifying genomic regions linked to this variation has been complicated by strong, near species level, genome-wide differentiation between red- and black-banded forms. Here, we instead focus on the closely related black-banded and intermediate forms that both belong to the subspecies B. bifarius nearcticus. We analyze an RNA sequencing (RNAseq) data set and identify a cluster of single nucleotide polymorphisms (SNPs) within one gene, Xanthine dehydrogenase/oxidase-like, that exhibit highly unusual differentiation compared to the rest of the sequenced genome. Homologs of this gene contribute to pigmentation in other insects, and results thus represent a strong candidate for investigating the genetic basis of pigment variation in B. bifarius and other bumble bee mimicry complexes.

Entities:  

Keywords:  Bombus; RNAseq; SNPs; adaptation; coloration; mimicry

Year:  2017        PMID: 28515876      PMCID: PMC5433978          DOI: 10.1002/ece3.2935

Source DB:  PubMed          Journal:  Ecol Evol        ISSN: 2045-7758            Impact factor:   2.912


Introduction

Pigment variation within and between populations and species has provided some of the most charismatic evidence for ecological and evolutionary forces involved in phenotypic differentiation (Kronforst et al., 2012). Although color variation can arise from neutral forces such as genetic drift, more often such patterns are the result of divergent selection on color morphs and can thus have implications for diverse processes from local adaptation to adaptive radiation of species (Rausher, 2008; Runemark, Hansson, Pafilis, Valakos, & Svensson, 2010; Wlasiuk, Carlos Garza, Lessa, & Smith, 2003). Insect pigmentation, in particular, has been studied in the context of adaptive strategies such as thermoregulation, crypsis, and warning coloration (Kronforst et al., 2012; True, 2003; Williams, 2007). Such color pattern variation is especially intriguing for its potential to reveal the genetic mechanisms of adaptive phenotypic change, and whether convergence across lineages occurs by shared versus independent molecular mechanisms (Oxford, 2005; Van Belleghem et al., 2017). For example, do species use the same basic tool sets to evolve pigmentation changes, co‐opt genes that serve related functions in other species in novel ways, or does convergent evolution occur through unique sets of genes in different lineages with similar phenotypes? Bumble bees (Hymenoptera: Apidae: Bombus) commonly exhibit adaptive color pattern variation within and between species (Plowright & Owen, 1980; Williams, 2007). Co‐distributed species may converge on pigmentation patterns to produce Müllerian mimicry complexes (Müller, 1879), while intraspecific populations in different geographic regions can evolve different color patterns to match these local complexes (Hines & Williams, 2012; Plowright & Owen, 1980; Rapti, Duennes, & Cameron, 2014; Williams, 2007). Much of the flexibility appears to stem from partially independent “ground plan” color pattern elements that subdivide the abdomen inter‐ and intrasegmentally along anterioposterior and mediolateral axes, which are likely encoded by a combination of regulatory and downstream pigmentation genes (Rapti et al., 2014). Bumble bee color pattern complexes thus provide a model system in which to investigate genetic mechanisms underlying phenotypic polymorphism, the evolution and maintenance of aposematic coloration, and rapid phenotypic diversification. Traditional genetic approaches for identifying molecular mechanisms underlying phenotypic variation is often challenging in bumble bees owing to difficulties of rearing and working with colonies in the laboratory (Plowright & Jay, 1966; Velthuis & van Doorn, 2006). Experimental research in bumble bees has thus often focused on a small set of species, especially domesticated species such as B. impatiens and B. terrestris (Amarasinghe, Clayton, & Mallon, 2014; Sadd et al., 2015; Stolle et al., 2011; Velthuis & van Doorn, 2006; Woodard et al., 2015) that may not be representative of the diversity of color pattern complexes in nature. Population genomics approaches have become a powerful tool for revealing adaptive genetic variation in nonmodel species where traditional genetics approaches are challenging and a priori knowledge of candidate genes is limited (Beaumont, 2005; Hohenlohe et al., 2010; Stapley et al., 2010). Phenotypically variable wild bumble bee populations are thus excellent targets for a population genomics approach (Lozier & Zayed, 2016), especially if discovered candidate genes can be cross‐validated against functional data from model organisms. Bombus (Pyrobombus) bifarius Cresson is a widespread polymorphic species found most commonly at medium to high elevations throughout montane western North America (Stephen, 1957; Williams, Thorp, Richardson, & Colla, 2014). The species exhibits a striking red‐black dimorphism in abdominal coloration across its range (Lozier, Jackson, Dillon, & Strange, 2016; Stephen, 1957; Williams et al., 2014). Easternmost B. bifarius populations in the southern Colorado Rocky Mountains exhibit bright red pile on the second and third abdominal tergites (red‐banded subspecies B. bifarius bifarius‐red), while westernmost Pacific populations have black pile on these segments (black‐banded subspecies B. bifarius nearcticus‐blk). Geographically intermediate populations of B. b. nearcticus have a range of intermediate pigmentation, often exhibiting a clouded orange appearance from a variable mixture of pile colors (B. b. nearcticus‐int) (Lozier, Strange, & Koch, 2013; Lozier et al., 2016) (Figure 1a). The geographic distributions of these color patterns are consistent with a hypothesis of adaptive convergence on local color pattern complexes through Müllerian mimicry (Plowright & Owen, 1980; Thorp & Horning, 1983).
Figure 1

(a) Color patterns and distributions of B. b. nearcticus‐blk (top) and nearcticus‐int (bottom). RNAseq sample sites shown as white dots; B. b. bifarius‐red range indicated (red‐dashed line) for reference. (b) SNPs sorted by F ST, expanded view of 88 top‐0.5% SNPs (inset‐left), and histogram of top‐0.5% SNPs per gene (inset‐right), highlighting SNP's present in Xdh‐like (red triangles). (c) F ST‐per‐window across nine comparable scaffolds indicating SNP density and significance (see Section 2 and Appendix S2 for selection criteria). Three of five significant windows (see Fig. S4 for example test‐statistic distribution) overlap Xdh‐like; the comparison scaffold NT_176438.1 and has one (predicted triple functional domain protein); NT_176882.1 has the third (not shown) which encompasses three genes (predicted OTU domain‐containing protein 7B‐like, glutathione S‐transferase C‐terminal domain‐containing protein homolog, ubiquitin‐conjugating enzyme E2 R2). (d) Allele frequencies and F ST (*** indicates p < .001) between B. b. nearcticus‐blk and B. b. nearcticus‐int at DNA sequencing‐validated SNP's

(a) Color patterns and distributions of B. b. nearcticus‐blk (top) and nearcticus‐int (bottom). RNAseq sample sites shown as white dots; B. b. bifarius‐red range indicated (red‐dashed line) for reference. (b) SNPs sorted by F ST, expanded view of 88 top‐0.5% SNPs (inset‐left), and histogram of top‐0.5% SNPs per gene (inset‐right), highlighting SNP's present in Xdh‐like (red triangles). (c) F ST‐per‐window across nine comparable scaffolds indicating SNP density and significance (see Section 2 and Appendix S2 for selection criteria). Three of five significant windows (see Fig. S4 for example test‐statistic distribution) overlap Xdh‐like; the comparison scaffold NT_176438.1 and has one (predicted triple functional domain protein); NT_176882.1 has the third (not shown) which encompasses three genes (predicted OTU domain‐containing protein 7B‐like, glutathione S‐transferase C‐terminal domain‐containing protein homolog, ubiquitin‐conjugating enzyme E2 R2). (d) Allele frequencies and F ST (*** indicates p < .001) between B. b. nearcticus‐blk and B. b. nearcticus‐int at DNA sequencing‐validated SNP's Divergent selection for red versus black abdominal pigmentation in B. bifarius might be expected to produce unique detectable differentiation patterns at genomic regions controlling color compared to the majority of the genome (i.e., “outlier” loci) (Beaumont, 2005; Stapley et al., 2010). Our previous analysis of relationships among red‐banded, black‐banded, and intermediate B. bifarius populations with genomewide data suggested unexpectedly large differentiation between B. b. nearcticus and B. b. bifarius‐red that is consistent with distinct species. Therefore, outlier loci were difficult to isolate from extensive neutral background divergence by comparing phenotypically extreme populations (i.e., B. b. nearcticus‐blk vs. B. b. bifarius‐red) (Lozier et al., 2016). However, weak genomewide differentiation among the much more closely related B. b. nearcticus populations may permit more sensitive tests to discover loci linked to phenotype, despite the less dramatic color differences (Figure 1a). In this study, we reanalyze an existing transcriptome data set for B. b. nearcticus (Lozier et al., 2016) to identify outlier loci between black‐banded and intermediate color forms. We identify a cluster of SNPs exhibiting unusual behavior compared to the rest of the sequenced genome. Annotation with the sequenced Bombus impatiens genome (Sadd et al., 2015) reveals that SNPs all fall within Xanthine dehydrogenase/oxidase‐like (Xdh‐like), part of a gene family with established pigmentation roles in other animals (Frost, Borchert, Thorsteinsdottir, & Robinson, 1985; McGraw, 2005; Watt, 1972).

Methods

We utilize B. bifarius RNAseq data from Lozier et al. (2016) (doi:10.5061/dryad.k21r5; Appendix S1; Figure 1a) that includes samples of B. b. nearcticus‐blk (Oregon, USA), B. b. nearcticus‐int (Wyoming and Utah, USA), and B. b. bifarius‐red (Colorado, USA). We focus on RNAseq as a tool for characterizing SNPs from large numbers of gene regions (e.g., De Wit, Pespeni, & Palumbi, 2015). Briefly, RNA was extracted from the head and thorax of wild‐caught female bees preserved in RNAlater (Thermo Fisher Scientific Inc.; Waltham, MA USA) and sequenced with 100‐bp paired‐end reads on an Illumina HiSeq 2000 (Illumina, Inc.; San Diego, CA, USA). Reads were filtered and trimmed with TrimGalore (Krueger, 2015) to remove adaptor sequences, low‐quality bases (<20), and short reads (<20 bp). Trimmed reads were then mapped to the closely related B. impatiens (both subgenus Pyrobombus, common ancestry ~4–6 MYA; Hines, 2008) genome (Lozier et al., 2016; Sadd et al., 2015) using TopHat v2.0.10 (Kim et al., 2013). Bombus impatiens is the most closely related species to B. bifarius with a published genome (Cameron, Hines, & Williams, 2007), and Bombus as a whole exhibits a high degree of synteny between more distantly related species pairs (Sadd et al., 2015). Following suggested best practices (Quinn et al., 2013), reads with a map quality <20 were excluded and duplicate reads were removed with SAMTools (Li et al., 2009) and Picard (http://broadinstitute.github.io/picard/), respectively. Variant calling was performed with SAMtools (mpileup, bcftools, and varFilter; Li et al., 2009), and indels and SNP's within 4 bp of a gap were excluded (Singhal (2013) see Lozier et al. (2016) for details). A final filtering with vcftools (Danecek et al., 2011) was performed for this study to include only those SNPs variable within B. b. nearcticus exons (annotated with snpEff v4.11: Cingolani et al., 2012), resulting in 17,450 SNPs for B. b. nearcticus‐blk (N = 7) and B. b. nearcticus‐int (N = 8). SNPs were identified in 261 B. impatiens scaffolds. Because whole chromosomes are not available in the B. impatiens assembly, for some comparisons and visual presentations, we examined the scaffold containing the discovered gene of interest (NT_176739.1) with respect to a subset of comparable scaffolds. We identified comparable scaffolds by first ranking scaffolds by length (±10 rank centered around NT_176739.1) and then by ranking this subset according to SNP density. We retained ±4 rank length‐selected scaffolds (ordered by Number of SNP's/Total length of scaffold) around NT_176739.1 for visual presentation. Summary statistics for this scaffold set are provided in Appendix S2.

Population genetics and outlier analysis

The fixation index F ST (Weir & Cockerham, 1984) and nucleotide diversity (π) were calculated per‐SNP and in sliding windows (10 kb windows, 5 kb increments) using vcftools v0.1.12b (Danecek et al., 2011). Tajima's D (Tajima, 1989) was calculated as an average across whole scaffolds and in nonoverlapping 10 kb windows that contained at least ten SNPs. Narrow windows are required because of rapid linkage disequilibrium decay in Bombus (Sadd et al., 2015). We tested for extreme window‐specific divergence using nested hierarchical bootstrapping to identify significantly elevated F ST accounting for SNP density per window (similar to Hohenlohe et al., 2010), as follows. A null distribution of 500,000 pseudo‐replicate data sets was simulated. For each pseudo‐replicate, mean F ST was calculated in windows matching the empirical data set by randomly sampling individual SNP F ST values with replacement from the whole transcriptome data set, with the number of SNPs in the window corresponding to the number of SNPs observed in its corresponding empirical window. Simulations and figures used R version 3.2.4 (R Core team 2015; code is available on DRYAD). Given the large number of windows, we only considered a window as significant if no simulation produced a value as extreme as observed (p < 2 × 10−6). In addition to demonstrating that Xdh‐like SNPs are outliers within the genome using the sliding window F ST bootstrap approach (Figure 1c), we also tested whether Xdh‐like SNPs were identified as outliers with a suite of several alternative methods that can identify targets of selection in wild populations while accounting for population structure. First, we used a clustering approach to identify SNPs contributing most to genetic divergence between B. b. nearcticus‐blk and B. b. nearcticus‐int using discriminant analysis of principal components (DAPC). The DAPC was performed for filtered synonymous/nonsynonymous RNAseq SNP set (N = 17,450 loci) in the R package adegenet 2.0.1 (Jombart, 2008), grouping individuals into the two regional phenotypes, and retaining five principle components and a single discriminant axis. SNPs contributing most significantly to the discriminant analysis (“selected vs. unselected”) were identified by applying the snpzip function to the DAPC followed by hierarchical clustering using the “average” clustering method (hclust function). A second principal components analysis method, pcadapt, is specifically designed to detect adaptive outliers in the face of population structure (Duforet‐Frebourg, Luu, Laval, Bazin, & Blum, 2016). The pcadapt R package uses principal components to control for population structure (in this case, we retained a single principal component for analysis based on initial evaluation with larger K, as above) and performs well under island models and models of divergence with admixture (high power, low false‐positive errors) (Duforet‐Frebourg et al., 2016). We implemented the analysis using default settings, excluding 8,314 SNPs with minor allele frequencies (MAF) < 0.05. To detect outliers, we employed a false discovery rate (FDR) correction with the R package qvalue (Storey, Bass, Dabney, & Robinson, 2016) with a FDR cutoff of 5%. We next applied two outlier detection methods implemented in Arlequin 3.5 (Excoffier & Lischer, 2010), also filtered to remove loci with MAF < 0.05. The first approach is based on identifying outliers in a simulated distribution (100,000 coalescent simulations) of F ST as a function of heterozygosity under the finite island model of 100 demes (FDIST approach in Appendix S3; similar to Beaumont & Balding, 2004). The second approach is similar, but relies on a hierarchical model of population structure—here using four groups (two times the number sampled) with 100 demes each—and has a reduced false‐positive rate under several more realistic demographic scenarios and should be more conservative (Excoffier, Hofer, & Foll, 2009) (HeirFDIST in Appendix S3). This approach has recently proven useful for detecting recent signatures of selection associated with recent divergence (e.g., Marques et al., 2016). For the hierarchical analysis, individuals were first grouped by site and then by regional color group. SNPs were removed from the hierarchical analysis if they were not present in any individual from a particular site. Finally, we used a recently developed outlier detection method outFLANK (Whitlock & Lotterhos, 2015) that compares differentiation at each locus against a trimmed null distribution of F ST values for loci not seemingly affected by strong selection. We removed loci with heterozygosity <0.1 and applied 5% trimming of the upper and lower F ST distribution tails. Because some results are presented spatially across scaffolds, to confirm structural conservation across our genome region of interest, we used Mauve (Darling, 2004) (implemented in Geneious R7; Kearse et al., 2012) to align NT_176739.1 from B. impatiens and orthologous scaffold NW_003566384.1 from B. terrestris (Sadd et al., 2015), assuming that strong synteny between these distantly related genomes (~20 mya) would reflect greater genome structure conservation between B. bifarius and B. impatiens (~4–6 mya) across the region of interest.

Sanger sequencing validation of SNPs

To confirm allele frequency differentiation at our target gene, including possible errors in the RNAseq data for this region from limited sampling, missing data, and incorrect SNP calling or read mapping, we performed Sanger DNA sequencing of four high‐F ST Xdh‐like SNPs in a larger data set (53 individuals from 16 populations) (Appendix S1). Primer sets were designed using Geneious R7 such that one primer per pair sat within an intron and the other within an exon for the following loci: SNP's 348647 and 348649 with Xd_343538F (5′‐ TGTTGGAAGGCAGGTCAGTC‐3′) and Xd_343915R (5′‐AAGATAAGCCGTGCCACACA‐3′); SNP's 343850 and 343759 with Xd_348265F (5′‐CGCCAGTTACCAACCACTCT‐3′) and Xd_348680R (5′‐ACCGACCAATTCTGGATGCA‐3′). Amplification from specimens with available genomic DNA (modified DNeasy protocol, following Lozier, 2014; Qiagen, Valencia, CA) used the GoTaq® G2 system (Promega, Madison, WI) with the following 25 μl reaction conditions [5 μl 5× PCR Buffer, 0.8–1 μl 25 mmol/L MgCl2, 0.5 μl 40 mmol/L (10 mmol/L each) dNTPs, 1 μl per primer (10 μmol/L each), 0.25 μl taq and 1–1.5 μl genomic DNA] and thermocycling conditions [5:00 denaturation at 95°C followed by 32–35 cycles of 95°C for 30 s, annealing at 52°C (for Xd_343538F/Xd_343915R) or 60–62°C (for Xd_348265F/Xd_348680R) for 30 s, and 72°C for 30 s, with a final 5‐min extension at 72°C]. PCR products were purified with ExoSAP‐IT (Affymetrix, Santa Clara, CA USA), and sequencing was performed at Eurofins Genomics (Louisville, KY USA). Sequences were manually inspected, edited, and aligned in Geneious R7 (Kearse et al., 2012). In all cases, heterozygotes were clearly identifiable from overlapping double peaks in the chromatograms. F ST was calculated in Arlequin 3.5 (Excoffier & Lischer, 2010).

Phylogenetic analysis

We characterized phylogenetic relationships among B. bifarius Xdh‐like region sequences, including B. b. bifarius‐red, using statistical parsimony in the software TCS v1.21 (Clement, Posada, & Crandall, 2000). For this analysis, we extracted the RNAseq consensus sequences for the Xdh‐like region (NT_176739.1:340,370‐357,224) in IGV (Robinson et al., 2011), manually removed introns and other sites with missing data, and phased the sequences using the fastPHASE algorithm (Scheet & Stephens, 2006) in DNAsp v5.10.1 (Librado & Rozas, 2009), resulting in 41 complete sequences (40 B. bifarius haplotypes and the B. impatiens reference) of length 4,049 bp.

Results

Average differentiation for B. b. nearcticus is minimal (F ST = 0.008 ± 0.078 SD), and few SNPs show a large degree of differentiation (N = 13, F ST > 0.5 and N = 2, F ST > 0.75) (Figure 1b). The two most highly differentiated SNPs map to Xdh‐like (LOC100741462 on NT_176739.1: 340,335–357,287 bp) (Appendices S1 and S3, Table S1). Of the top 0.5% highly differentiated SNPs (N = 88, F ST = 0.379–0.790), 13 (seven synonymous, six nonsynonymous) map to Xdh‐like, a notable outlier compared to other top‐F ST genes (Figure 1b). As expected from whole‐genome comparisons showing strong synteny in bumble bees (Sadd et al., 2015), NT_176739.1 structure is highly conserved between subgenera (Appendix S4), suggesting that SNP positioning from B. impatiens is suitable for B. bifarius (both subgenus Pyrobombus). A spike in differentiation in Xdh‐like is apparent for individual SNPs and windows (Figure 1c; Figs. S1–S4; Table S1). F ST was significantly elevated (p < 2 × 10−6) in all three windows overlapping Xdh‐like. Xdh‐like SNPs were recovered as significant in multiple outlier detection approaches (Appendix S3; Fig. S1), and no comparable patterns are observed elsewhere in the transcriptome. Tajima's D is also elevated in Xdh‐like (Fig. S3). F ST from sequencing‐validated SNPs (F ST = 0.66; Figure 1d, Table S2) were essentially identical to RNAseq values (average F ST = 0.64), suggesting that RNAseq reliably estimates differentiation despite smaller sample sizes. Statistical parsimony analysis of 4,049 sites in the Xdh‐like region shows two groups of highly divergent B. b. nearcticus sequences, one set restricted to B. b. nearcticus‐blk, and another set including all B. b. nearcticus‐int and alleles from B. b. nearcticus‐blk heterozygotes. In contrast to previous work demonstrating that most of the genome shows B. b. nearcticus samples more closely related to each other than to B. b. bifarius (Lozier et al., 2016), both sets of these Xdh‐like sequences are approximately equally divergent from the B. b. bifarius‐red Xdh‐like sequences (Figure 2).
Figure 2

90% statistical parsimony TCS network for phased Xdh‐like sequences from the complete RNAseq data extracted for all B. bifarius subspecies (4,049 bp with no missing sequence). Colored circles represent unique haplotypes, shaded by population of origin (largest circle = 12 observed haplotypes, smallest = 1 haplotype) with unobserved mutation steps as small gray dots. More stringent haplotype networks produce the same topology within B. bifarius, and so 90% was selected to include the B. impatiens reference

90% statistical parsimony TCS network for phased Xdh‐like sequences from the complete RNAseq data extracted for all B. bifarius subspecies (4,049 bp with no missing sequence). Colored circles represent unique haplotypes, shaded by population of origin (largest circle = 12 observed haplotypes, smallest = 1 haplotype) with unobserved mutation steps as small gray dots. More stringent haplotype networks produce the same topology within B. bifarius, and so 90% was selected to include the B. impatiens reference

Discussion

Color pattern variation is among the most striking examples of adaptive phenotypic evolution that is excellently represented in the high degree of polymorphism within and between species of the genus Bombus. One of the major goals for investigating color variation in bumble bees is to understand the genetic mechanism underlying “color pattern elements” that facilitate adaptive divergence and convergence of color forms across the Bombus phylogeny (Rapti et al., 2014). Such loci have been identified in other color polymorphic mimicry complexes (e.g., optix in Heliconius) and are providing insights into evolutionary processes and genetic architectures underlying phenotypic variation (Kronforst et al., 2012; Van Belleghem et al., 2017). We show that Xanthine dehydrogenase/oxidase‐like contains unusual SNP density and divergence between B. b. nearcticus color variants. Bombus b. nearcticus is genetically well‐connected (Lozier et al., 2016), but Xdh‐like starkly contrasts with such genomewide patterns. Xdh‐like homologs are known to be involved in pigmentation (Frost et al., 1985; Watt, 1972; Yasukochi, Kanda, & Tamura, 1998). Xdh‐like is thus a strong candidate for one element contributing to bumble bee color pattern variation, representing a new example of selection using common pathways in novel evolutionary contexts. Xanthine dehydrogenase and related genes code for molybdenum‐containing hydroxylases that catalyze purine and pteridine reactions (Brondino, Romão, Moura, & Moura, 2006). This family of genes contribute red, orange, and yellow pigmentation in many animals (Frost et al., 1985; Shamim, Ranjan, Pandey, & Ramani, 2014; Watt, 1972), including well‐known Drosophila melanogaster loci responsible for the classic red‐eyed phenotype (Reaume, Knecht, & Chovnick, 1991). In bumble bees, black, red, and yellow are the main pile colors (Rapti et al., 2014); black and red are melanins while yellow is likely a pterin (Hines, 2008). Results implicating a Xanthine dehydrogenase homolog thus suggest that the lighter reddish‐orange pigmentation in B. b. nearcticus‐int may be a unique state involving yellow, rather than a mix of pure red and black hairs. Although orange‐red tinged hairs are present in B. b. nearcticus‐int individuals, they are usually not as intense as in B. b. bifarius‐red bees, possibly indicating a mixture of pigments that includes pterins. This could also explain the lack of any noteworthy patterns across this genomic region from comparisons with B. b. bifarius‐red in our previous study (Lozier et al., 2016). Comparisons across B. bifarius do provide some insights into the dynamics of phylogeographic divergence and phenotypic divergence within the species, although the importance of Xdh‐like in the evolution of the group remains to be fully understood. Bumble bee color pattern complexes are believed to be adaptive outcomes of Müllerian mimicry (Plowright & Owen, 1980; Williams, 2007). Color pattern may be more indicative of geography than phylogeny, as populations of the same species can diverge from one another and converge on a prominent local phenotype (Plowright & Owen, 1980). There is a clear relationship between geographic, phenotypic, and genetic divergence for B. b. bifarius‐red and B. b. nearcticus, although any signal at genes important for color pattern differentiation in this comparison is swamped by the extent of genomewide divergence (Lozier et al., 2016). In the case of the less extreme phenotypic differences within B. b. nearcticus, color variation is evident without differentiation across most of the genome, likely due to the greater range connectivity for these populations (Lozier et al., 2013). The broad geographic structure in coloration and in differentiation at Xdh‐like despite this demographic connectivity is consistent with strong selection that might be imposed by mimicry complexes. Nonetheless, there are regions of overlap for black‐banded and intermediate forms of B. b. nearcticus, especially in the north (e.g., Washington, Idaho), which, together with lack of fixation for Xdh‐like alleles in B. b. nearcticus‐blk (Figure 1d, SI), suggests that selection is imperfect. Parts of the B. b. nearcticus range overlap with other red/black color polymorphic species, for example, B. melanopygous (Plowright & Owen, 1980; Williams et al., 2014), and it may therefore be advantageous to maintain alleles associated with color pattern variation to facilitate convergence on local phenotypes (Williams, 2007). The unusual phylogenetic relationships among Xdh‐like haplotypes are particularly intriguing from this perspective (Figure 2), with the B. b. nearcticus haplotypes more divergent from each other than either is to B. b. bifarius‐red, potentially reflecting retention and modification of ancient alleles in the maintenance of phenotypic polymorphism. Alternatively, the partially overlapping distributions of phenotypes and Xdh‐like alleles in parts of the B. b. nearcticus range may simply reflect the balance between selection, migration, and drift. Additional sampling of finer scale transects, including sites where both B. b. nearcticus forms are present, should help refine these hypotheses. One consideration is that Xdh‐like is a gene prediction, and its exact function in B. bifarius pigmentation thus remains uncertain. As discussed above, no SNPs perfectly segregate with color form; although intermediate B. b. nearcticus‐int populations were largely fixed for one allele at highly differentiated SNPs, this was not true in B. b. nearcticus‐blk. We were unable to confirm any obvious phenotypic associations between B. b. nearcticus genotype and abdominal or thoracic phenotype with post hoc examination of western B. b. nearcticus specimens with central B. b. nearcticus alleles. Sequenced SNPs may be incompletely linked to a causal structural or cis‐regulatory mutation that might show even stronger associations. However, causal or regulatory sites producing the unusual signatures at Xdh‐like detected here must lie near the sequenced SNPs due to weak linkage in bees (Sadd et al., 2015). The rapid decay of differentiation is highlighted by analysis of restriction site‐associated DNA sequences that show no elevated F ST for intergenic SNPs flanking Xdh‐like (Fig. S2). It is also unclear, at this point, how Xdh‐like alleles might interact with each other and other loci through dominance, epistasis, or regulatory differences. Additional work is thus necessary to determine which factors, either singly or in concert, are responsible for the development and maintenance of color pattern variation in B. bifarius and other bumble bee species. Examining differential expression of this gene and its alleles throughout development in different body segments will be a next step to determine how Xdh‐like contributes to color pattern variation (Mallarino et al., 2016). Experimental work with laboratory colonies will be necessary for functional genomics; however, maintaining successful laboratory colonies with bumble bees over multiple generations can be challenging (Plowright & Jay, 1966; Velthuis & van Doorn, 2006). Given the role of Xdh‐like homologs in pteridine production, screening for Xdh‐like outlier SNPs in other species complexes exhibiting color variation involving yellow pigmentation, such as B. perplexus, B. terrestris, B. fervidus/B. californicus, or B. pensylvanicus/B. sonorous (Williams et al., 2014), could provide complementary evidence for the gene's role in adaptive color pattern variation. In conclusion, we have identified a gene that may facilitate understanding the formation of some color pattern complexes in bumble bees. Knowledge that Xdh‐like homologs affect pigmentation strengthens this hypothesis, but confirming mechanisms of action in B. bifarius will require additional experimentation. We also suspect that color in B. bifarius is likely to be complex, with multiple contributing loci. Our RNAseq data are likely to reveal only the strongest outlier loci, and by sequencing adult bees, will exclude sequence from genes expressed only during development. Developmental patterning genes (e.g., Hox‐genes) are likely key players in bumble bee color pattern evolution, for instance (Rapti et al., 2014). Finally, it will be important to consider ecological pressures driving color variation, and why selection produces certain complexes in particular geographic regions. Finer scale spatial sampling that incorporates whole‐genome data will help resolve some of these issues, while functional genomics and multispecies comparative approaches will be useful for targeting Xdh‐like's possible functions. Our findings provide a starting point for such studies of ontogenesis and evolution of color pattern in B. bifarius and other bumble bees.

Data accessibility

Data, results, and analytical codes area available in the Supplementary Information, on DRYAD (http://dx.doi.org/10.5061/dryad.44c0r) and validation sequencing results are on NCBI Genbank (Nos. KX851406–KX851511). Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  43 in total

1.  Reliable Detection of Loci Responsible for Local Adaptation: Inference of a Null Model through Trimming the Distribution of F(ST).

Authors:  Michael C Whitlock; Katie E Lotterhos
Journal:  Am Nat       Date:  2015-09-15       Impact factor: 3.926

2.  DnaSP v5: a software for comprehensive analysis of DNA polymorphism data.

Authors:  P Librado; J Rozas
Journal:  Bioinformatics       Date:  2009-04-03       Impact factor: 6.937

3.  THE EVOLUTIONARY SIGNIFICANCE OF BUMBLE BEE COLOR PATTERNS: A MIMETIC INTERPRETATION.

Authors:  R C Plowright; Robin E Owen
Journal:  Evolution       Date:  1980-07       Impact factor: 3.694

4.  Revisiting comparisons of genetic diversity in stable and declining species: assessing genome-wide polymorphism in North American bumble bees using RAD sequencing.

Authors:  J D Lozier
Journal:  Mol Ecol       Date:  2014-02       Impact factor: 6.185

5.  Xanthine dehydrogenase and pteridine metabolism in Colias butterflies.

Authors:  W B Watt
Journal:  J Biol Chem       Date:  1972-03-10       Impact factor: 5.157

6.  The rosy locus in Drosophila melanogaster: xanthine dehydrogenase and eye pigments.

Authors:  A G Reaume; D A Knecht; A Chovnick
Journal:  Genetics       Date:  1991-12       Impact factor: 4.562

7.  Genetic and geographic differentiation in the Rio Negro tuco-tuco (Ctenomys rionegrensis): inferring the roles of migration and drift from multiple genetic markers.

Authors:  Gabriela Wlasiuk; John Carlos Garza; Enrique P Lessa
Journal:  Evolution       Date:  2003-04       Impact factor: 3.694

Review 8.  Molecular tools and bumble bees: revealing hidden details of ecology and evolution in a model system.

Authors:  S Hollis Woodard; Jeffrey D Lozier; David Goulson; Paul H Williams; James P Strange; Shalene Jha
Journal:  Mol Ecol       Date:  2015-05-19       Impact factor: 6.185

9.  The variant call format and VCFtools.

Authors:  Petr Danecek; Adam Auton; Goncalo Abecasis; Cornelis A Albers; Eric Banks; Mark A DePristo; Robert E Handsaker; Gerton Lunter; Gabor T Marth; Stephen T Sherry; Gilean McVean; Richard Durbin
Journal:  Bioinformatics       Date:  2011-06-07       Impact factor: 6.937

10.  The genomes of two key bumblebee species with primitive eusocial organization.

Authors:  Ben M Sadd; Seth M Barribeau; Guy Bloch; Dirk C de Graaf; Peter Dearden; Christine G Elsik; Jürgen Gadau; Cornelis J P Grimmelikhuijzen; Martin Hasselmann; Jeffrey D Lozier; Hugh M Robertson; Guy Smagghe; Eckart Stolle; Matthias Van Vaerenbergh; Robert M Waterhouse; Erich Bornberg-Bauer; Steffen Klasberg; Anna K Bennett; Francisco Câmara; Roderic Guigó; Katharina Hoff; Marco Mariotti; Monica Munoz-Torres; Terence Murphy; Didac Santesmasses; Gro V Amdam; Matthew Beckers; Martin Beye; Matthias Biewer; Márcia M G Bitondi; Mark L Blaxter; Andrew F G Bourke; Mark J F Brown; Severine D Buechel; Rossanah Cameron; Kaat Cappelle; James C Carolan; Olivier Christiaens; Kate L Ciborowski; David F Clarke; Thomas J Colgan; David H Collins; Andrew G Cridge; Tamas Dalmay; Stephanie Dreier; Louis du Plessis; Elizabeth Duncan; Silvio Erler; Jay Evans; Tiago Falcon; Kevin Flores; Flávia C P Freitas; Taro Fuchikawa; Tanja Gempe; Klaus Hartfelder; Frank Hauser; Sophie Helbing; Fernanda C Humann; Frano Irvine; Lars S Jermiin; Claire E Johnson; Reed M Johnson; Andrew K Jones; Tatsuhiko Kadowaki; Jonathan H Kidner; Vasco Koch; Arian Köhler; F Bernhard Kraus; H Michael G Lattorff; Megan Leask; Gabrielle A Lockett; Eamonn B Mallon; David S Marco Antonio; Monika Marxer; Ivan Meeus; Robin F A Moritz; Ajay Nair; Kathrin Näpflin; Inga Nissen; Jinzhi Niu; Francis M F Nunes; John G Oakeshott; Amy Osborne; Marianne Otte; Daniel G Pinheiro; Nina Rossié; Olav Rueppell; Carolina G Santos; Regula Schmid-Hempel; Björn D Schmitt; Christina Schulte; Zilá L P Simões; Michelle P M Soares; Luc Swevers; Eva C Winnebeck; Florian Wolschin; Na Yu; Evgeny M Zdobnov; Peshtewani K Aqrawi; Kerstin P Blankenburg; Marcus Coyle; Liezl Francisco; Alvaro G Hernandez; Michael Holder; Matthew E Hudson; LaRonda Jackson; Joy Jayaseelan; Vandita Joshi; Christie Kovar; Sandra L Lee; Robert Mata; Tittu Mathew; Irene F Newsham; Robin Ngo; Geoffrey Okwuonu; Christopher Pham; Ling-Ling Pu; Nehad Saada; Jireh Santibanez; DeNard Simmons; Rebecca Thornton; Aarti Venkat; Kimberly K O Walden; Yuan-Qing Wu; Griet Debyser; Bart Devreese; Claire Asher; Julie Blommaert; Ariel D Chipman; Lars Chittka; Bertrand Fouks; Jisheng Liu; Meaghan P O'Neill; Seirian Sumner; Daniela Puiu; Jiaxin Qu; Steven L Salzberg; Steven E Scherer; Donna M Muzny; Stephen Richards; Gene E Robinson; Richard A Gibbs; Paul Schmid-Hempel; Kim C Worley
Journal:  Genome Biol       Date:  2015-04-24       Impact factor: 13.583

View more
  4 in total

1.  A combined RAD-Seq and WGS approach reveals the genomic basis of yellow color variation in bumble bee Bombus terrestris.

Authors:  Sarthok Rasique Rahman; Jonathan Cnaani; Lisa N Kinch; Nick V Grishin; Heather M Hines
Journal:  Sci Rep       Date:  2021-04-12       Impact factor: 4.379

2.  Pterin-based pigmentation in animals.

Authors:  Pedro Andrade; Miguel Carneiro
Journal:  Biol Lett       Date:  2021-08-18       Impact factor: 3.812

3.  De Novo Genome Assemblies for Three North American Bumble Bee Species: Bombus bifarius, Bombus vancouverensis, and Bombus vosnesenskii.

Authors:  Sam D Heraghty; John M Sutton; Meaghan L Pimsler; Janna L Fierst; James P Strange; Jeffrey D Lozier
Journal:  G3 (Bethesda)       Date:  2020-08-05       Impact factor: 3.154

4.  Population genomics reveals a candidate gene involved in bumble bee pigmentation.

Authors:  Meaghan L Pimsler; Jason M Jackson; Jeffrey D Lozier
Journal:  Ecol Evol       Date:  2017-04-04       Impact factor: 2.912

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.