Literature DB >> 24223982

Molecular markers reveal limited population genetic structure in a North American corvid, Clark's nutcracker (Nucifraga columbiana).

Kimberly M Dohms1, Theresa M Burg.   

Abstract

The genetic impact of barriers and Pleistocene glaciations on high latitude resident species has not been widely investigated. The Clark's nutcracker is an endemic North American corvid closely associated with Pinus-dominated forests. The nutcracker's encompasses known barriers to dispersal for other species, and glaciated and unglaciated areas. Clark's nutcrackers also irruptively disperse long distances in search of pine seed crops, creating the potential for gene flow among populations. Using the highly variable mitochondrial DNA control region, seven microsatellite loci, and species distribution modeling, we examined the effects of glaciations and dispersal barriers on population genetic patterns and population structure of nutcrackers. We sequenced 900 bp of mitochondrial control region for 169 individuals from 15 populations and analysed seven polymorphic microsatellite loci for 13 populations across the Clark's nutcracker range. We used species distribution modeling and a range of phylogeographic analyses to examine evolutionary history. Clark's nutcracker populations are not highly differentiated throughout their range, suggesting high levels of gene flow among populations, though we did find some evidence of isolation by distance and peripheral isolation. Our analyses suggested expansion from a single refugium after the last glacial maximum, but patterns of genetic diversity and paleodistribution modeling of suitable habitat were inconclusive as to the location of this refugium. Potential barriers to dispersal (e.g. mountain ranges) do not appear to restrict gene flow in Clark's nutcracker, and postglacial expansion likely occurred quickly from a single refugium located south of the ice sheets.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24223982      PMCID: PMC3817134          DOI: 10.1371/journal.pone.0079621

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Phylogeography is the study of processes underlying spatial and temporal dimensions of genetic variation [1]. Patterns of genetic variation attained through phylogeographic methods can provide insights into population-level response to disturbance and the processes responsible for historical dispersal and colonization [2,3]. Examining patterns of genetic variation can help us to evaluate the roles of gene flow, bottlenecks, and historical or geological barriers in explaining contemporary patterns of diversity across geographical regions [4]. Contemporary genetic patterns are strongly influenced by postglacial expansion from refugia [5,6], historical and contemporary barriers to dispersal [7,8], and dispersal potential [9]. The repeated glaciations and climate oscillations of the Pleistocene epoch provide a natural tool to address population responses to large-scale landscape changes. Multiple expansions and contractions of ice sheets created a dynamic landscape that repeatedly fragmented historical populations, alternately creating barriers to dispersal and creating new habitats for colonization [10,11]. During the Pleistocene glacial maxima in North America, many populations were confined to refugia (ice-free areas) [10]. Plant and animal species expanded from several known refugia following the retreat of the ice sheets, including Beringia (present-day Bering Sea and parts of Alaska) and three main areas south of the ice sheets (Pacific Coast, Rockies, and Taiga) [10,12]. Additional areas along the periphery of the ice sheets are contested to have been ice-free, such as an Atlantic shelf near present-day Newfoundland [5,10,13]. High latitude resident bird species provide a unique opportunity to investigate patterns of postglacial and barrier-mediated dispersal; historical events shaping current population structure should be particularly evident in these species. Non-migratory species have the potential to retain patterns of genetic variation longer due to limited dispersal, allowing researchers to make inferences about past historic events [14-16]. For example, species of trees show distinct patterns of population genetic structure and have retained information on historic environmental changes [13,15]. As the number of studies on resident species increase, similar patterns are emerging in vertebrate taxa [16-18]. However, other resident species show limited differentiation despite potential barriers to dispersal [19,20]. Clark’s nutcracker (Nucifraga columbiana) is a high latitude resident corvid species (Family Corvidae) that inhabits North American coniferous forests. Much of the published work on this species has focused on spatial cognition exhibited by their extensive food caching behaviours [21]. Additional research has concentrated on Clark’s nutcracker’s role as essential seed dispersers for many conifers, particularly Pinus species [22-24]. To date, no published phylogeographic work exists for this species, though genetic work has been identified as a priority [21]. Clark’s nutcracker prefers higher altitude montane forests dominated by one or more Pinus species. Their close association with pine-dominated forests is due to a specialised diet of pine seeds, which influences dispersal patterns of both nutcrackers and seeds [21,25]. After nestlings fledge, Clark’s nutcrackers have been known to undergo seasonal altitudinal movement to nearby Pinus-dominated subalpine areas [21]. In times of food shortage, and occasionally during the non-breeding season, large numbers of birds have been shown to irruptively disperse, leaving home ranges, and travelling more than 100 km in search of large crops of pine seeds [21,26]. Range-wide morphological and plumage variation is well documented for some high latitude resident species [27,28], but there is little evidence of morphological variation in Clark’s nutcracker [21]. Some suggest that this lack of variation is a reflection of high gene flow between populations due to irruptive dispersal, but this has not been investigated to date [21,29]. To address these gaps in phylogeographic knowledge, our goals for this study were to investigate patterns of dispersal and postglacial colonization in Clark’s nutcracker as reflected in genetic structure. We predict that contemporary Clark’s nutcracker populations will exhibit low levels of genetic differentiation between populations due to irruptive food-related dispersal patterns. Mountain ranges and unsuitable habitats will not act as strong barriers to gene flow due to this species’ preference for high altitude sub-alpine habitats [21] where barriers are limited. Rather, dispersal will be restricted by availability of pine-dominated coniferous forests and seed crops. We expect that postglacial colonization likely occurred from a single refugium south of the ice sheets, similar to patterns of postglacial colonization in several North America Pinus species [13].

Materials and Methods

Ethics statement

All animals captured in the field were handled following animal welfare protocols (#0614 and #1028) approved by the University of Lethbridge Animal Welfare Committee using guidelines set by the Canadian Council on Animal Care (CCAC). Banding in Canada was performed under Canadian Wildlife Service banding permit #10425W (2007) and #10804 (2008 onward) and in the US under US Fish and Wildlife banding permit #23522. All netting and blood sampling was conducted under the appropriate state, provincial, federal, and institutional authorities; detailed permit information is available upon request.

Sample collection and preparation

From 2009-2012, we captured up to 30 individual Clark’s nutcrackers at each sampling site (Table 1) using standard mistnetting techniques with call playback. For call playback, we used a medley of group interaction vocalizations sourced from multiple tracks found on xeno-canto.org. Sample collection locations for each defined population (i.e. sampling site) were limited to within a 50 km of radius and were not separated by any obvious barriers to dispersal. We chose sites on either side of possible barriers to dispersal (e.g., Rocky Mountains, Great Basin) and from areas that were previously glaciated and unglaciated during the last glacial maximum (e.g., Central Alberta and New Mexico, respectively; Figure 1). We collected less than 100 μL of blood from each bird by pricking the brachial vein with a sterile needle and collecting blood in a capillary tube. Blood was stored in 95% ethanol and archived at -80°C upon return to the University of Lethbridge. Each bird was banded with a US Fish & Wildlife aluminum band, aged and sexed (if possible), and mass, tarsus, and other morphological measurements taken. Additional tissue and feather samples were obtained from museum collections taken from birds during the breeding season within the past 20 years to ensure all samples were from contemporary populations (Table S1).
Table 1

Summary table of and mitochondrial DNA information for populations included in this study.

Population Lat (N) Long (W) n Hn Hd π
CAB52.687-118.056860.8930.342
SAB49.774-114.397950.5830.127
MT46.592-112.15820180.9640.281
WY43.756-110.58119130.9010.295
UT40.703-110.61219100.6730.226
CO39.813-106.19513100.8590.221
NM35.815-106.852870.9640.396
SCA34.279-117.5511490.9120.312
CCA37.771-118.9261190.9640.375
NECA41.151-120.205221.0000.456
SOR42.745-122.1441260.8490.354
NEOR45.243-117.52420150.9530.402
WA47.554-120.958760.8570.329
NEWA48.845-119.625640.8000.222
SEBC49.162-119.27011--
Overall 169 0.854 0.324

Population abbreviations are as follows: CAB = central Alberta, SAB = southern Alberta, MT = Montana, WY = Wyoming, UT = Utah, CO = Colorado, NM = New Mexico, SCA = Southern California, CCA = central California, NECA = northeast California, SOR = southern Oregon, NEOR = northeast Oregon, WA = central Washington, NEWA = northeast Washington, and SEBC = southeast British Columbia. Column headers are: Lat (N) = latitude and Long (W) = longitude of central locations; n = number of samples used for mitochondrial DNA analyses; Hn = number of haplotypes; Hd = haplotype diversity; and π = nucleotide diversity multiplied by 100 for ease of interpretation.

Figure 1

Clark’s nutcracker population sampling locations.

Current range overlaid in yellow [76]. Darker shades imply higher elevation areas. See Table 1 for coordinates, sample sizes, and abbreviations per population.

Population abbreviations are as follows: CAB = central Alberta, SAB = southern Alberta, MT = Montana, WY = Wyoming, UT = Utah, CO = Colorado, NM = New Mexico, SCA = Southern California, CCA = central California, NECA = northeast California, SOR = southern Oregon, NEOR = northeast Oregon, WA = central Washington, NEWA = northeast Washington, and SEBC = southeast British Columbia. Column headers are: Lat (N) = latitude and Long (W) = longitude of central locations; n = number of samples used for mitochondrial DNA analyses; Hn = number of haplotypes; Hd = haplotype diversity; and π = nucleotide diversity multiplied by 100 for ease of interpretation.

Clark’s nutcracker population sampling locations.

Current range overlaid in yellow [76]. Darker shades imply higher elevation areas. See Table 1 for coordinates, sample sizes, and abbreviations per population.

Laboratory procedures

DNA extraction

DNA was extracted from blood, feather, and tissue samples using a modified Chelex protocol [30]. Once extracted, DNA was stored at -20°C in low TE buffer.

Mitochondrial DNA

We amplified a section of the mitochondrial DNA control region (CR) using primers L46 SJ (5’-TTT GGC TAT GTA TTT CTT TGC-3’; Birt & Lemmen, unpublished data) and H1030 JCR 18 [31] corresponding to positions 46 (Domain I) and 1030 (Domain III) of the corvid mitochondrial control region, respectively. Where the complete fragment would not amplify, we used internal primers designed in-house, H560 clnuCR (5’-GCA AAG GGA GGA GTA TGC AG-3’) or L530 corvidCR (5’-CGC CTC TGG TTC CTA TTT CA-3’), with L46 SJ or H1030 JCR 18, respectively, to amplify two overlapping fragments. Polymerase chain reactions (PCR) were performed on a Master gradient thermocycler (Eppendorf: Hauppauge, NY) in a 25 μL volume containing 1x goTaq Flexi buffer (Promega: Madison, WI), 2.5 mM MgCl2, 200 μM dNTP, 0.4 μM of each primer, and 0.5 units goTaq Flexi taq polymerase (Promega) under the following conditions: one cycle of 94°C for 120 s, 52°C for 45 s, and 72°C for 60 s, 37 cycles of 94°C for 30 s, 52°C for 45 s and 72°C for 60 s and one cycle of 72°C for five min. PCR products were run on a 0.8% agarose gel to confirm DNA amplification. DNA sequencing was performed at McGill University and Génome Québec Innovation Centre on a 3730xl DNA Analyzer (Applied Biosystems: Carlsbad, CA) or at the University of Lethbridge on a 3130 DNA Analyzer (Applied Biosystems). For in-house sequencing we used a shrimp alkaline phosphatase-exonuclease clean up followed by sequencing and sodium acetate precipitation [28] prior to gel electrophoresis.

Microsatellite DNA

Three individuals from geographically distant populations (Montana, southern Oregon, and Colorado) were initially screened for 32 microsatellite primer pairs developed for and used in other corvids. Thirteen additional nutcracker-specific microsatellite loci have since been developed by another independent research group [32]. If screened loci were suspected to be polymorphic and/or amplified clearly, two additional individuals from each of the initial test populations plus three individuals from another population potentially separated by dispersal barriers were screened. Eight loci were polymorphic (Table 2). All forward primers were modified by adding an M13 sequence (5’ – CAC GAC GTT GTA AAA CGA C - 3’) to the 5’ end, which allowed the integration of a fluorescently labeled primer (700 nm or 800 nm) directly into the PCR product. DNA was amplified in a 10 μL reaction with 1x buffer, 1 mM MgCl2, 200 μM dNTP (Fisher Scientific: Hampton, NH), 1 μM of each primer (forward and reverse), 0.05 μM of the fluorescent primer (Eurofins MWG Operon: Huntsville, AL) and 0.5 units taq polymerase under the following conditions: one cycle of 94°C for 120 s, T1 for 45 s, and 72°C for 60 s, seven cycles of 94°C for 60 s, T1 for 30 s and 72°C for 45 s, 31 cycles of 94°C for 30 s, T2 for 30 s, and 72°C for 45 s, and one final elongation cycle at 72°C for five minutes (Table 2).
Table 2

Microsatellite primer pairs used in this study.

Primer Sequence (5' to 3') Buffer taq T1 T2
ApCo19F CAG ACT GCA GTC TTG CTA TAG C Flexicrimson4548
ApCo19R GCC TTG GAT GCT TTT ACG
ApCo30F GCC CTG ATG CTG TTG ATG GT FlexiFlexi4548
ApCo30R CTG GAG CCT GGT TTA GAG TTA TGC
ApCo37F TGC CAA ATG CAA CCA AAT CTT FlexiFlexi5052
ApCo37R CAT CAC TTG CAG AGA GGG CA
ApCo41F CCT ACT CTG GGC ACT GTT ATT ATC crimsoncrimson5052
ApCo41R CCC ATT ATC AGC ATG TCG TAC A
ApCo46F GGG AGC CTA GTA TGT TAA GAT GCT Flexicrimson5052
ApCo46R TTC CAG GTG AGG TGA TTC AGC
ApCo91F GTA GTC CCA ATG GTT TCT CTG TC crimsoncrimson4548
ApCo91R GAT GAA GTA ATG TGA AAC CTG G
PnuA106WF GTA TTT TGG GAT GTC TTA GGG TTG FlexiFlexi5052
PnuA106WR CAC ACT CGA GCT ACA ATA AAC CTG

Primer names, sequence, focal species, source, and reaction conditions for this study (buffer and taq polymerase, and annealing temperatures, T1 and T2 (°C)). Forward primers (shown with an ‘F’ suffix) were modified to include a short sequence at the 5’ end allowing for incorporation of the florescent tag. All loci except PnuA106W are sourced from Florida Scrub Jay (Aphelocoma coerulescens) [77]. PnuA106W is sourced from Yellow-billed Magpie (Pica nuttalli) [78].

Primer names, sequence, focal species, source, and reaction conditions for this study (buffer and taq polymerase, and annealing temperatures, T1 and T2 (°C)). Forward primers (shown with an ‘F’ suffix) were modified to include a short sequence at the 5’ end allowing for incorporation of the florescent tag. All loci except PnuA106W are sourced from Florida Scrub Jay (Aphelocoma coerulescens) [77]. PnuA106W is sourced from Yellow-billed Magpie (Pica nuttalli) [78]. PCR products were mixed with a stop solution (95% formamide, 20 mM EDTA and bromophenol blue), denatured for 3 min at 94°C, and run on a 6% polyacrylamide gel using a LI-COR 4300 DNA Analyzer (LI-COR Inc.: Lincoln, NE). Alleles were scored via visual inspection, and genotypes were independently confirmed by a second person. Three controls of known allele sizes (pre-screened individuals) plus a size standard were included on each gel to ensure consistent scoring along with a negative control to ensure no contamination was present.

Analyses of genetic structure

Chromatograms were edited and sequences aligned manually in MEGA v5.0 [33]. Shared haplotypes were determined using DnaSP v5.10 [34]. We assessed population structure and relationships among haplotypes using a maximum parsimony network constructed using Network v4.611, which is designed to construct the shortest, least complex network [35]. The median-joining (MJ) tree algorithm was used as this allows for multi-state data (e.g., four nucleotide base options at each variable site), with settings as follows: weight = 10 for each site, epsilon = 0, and the MJ squared option turned off. After the tree was constructed, the MP (maximum parsimony) option was used to remove superfluous links and unnecessary median vectors from the network, which can be produced during the initial network calculations [36]. Genetic variation within populations was measured by calculating haplotype (Hd) and nucleotide (π) diversity using DnaSP v5.10 [34]. We calculated pairwise ΦST values (an analogue of Wright’s fixation index (F ) using Arlequin v3.5.1.3 [37] to examine population structure and test for genetic differentiation among populations and haplogroups. Significance values were corrected using a Benjamini-Hochberg correction [38], to control for false discovery rate (FDR), the expected proportion of falsely rejected hypotheses in multiple significance testing situations where multiplicity might occur. A Mantel’s test was performed to examine the correlation between genetic and geographic distances in GenAlEx v6.0 [39]. We calculated geographic distances using weighted central coordinates for each population (Table 1) and the Geographic Distance Matrix Generator v1.2.3 [40]. Linearised ΦST values (ΦST / 1- ΦST) were used for genetic distances and significance was tested using 9,999 permutations. A mismatch distribution was used to help visualize signatures of demographic expansion, and to test the null hypothesis of historic population growth and expansion [41]. Genetic variation allocated within and among populations was examined using an analysis of molecular variance (AMOVA) in Arlequin v3.5.1.3 [37]. We investigated the potential impact of barriers on genetic variation by performing a spatial analysis of molecular variance (SAMOVA; K = 2-12; 100 iterations) [42]; however, the process cannot test for range-wide panmixia as K cannot be set to K = 1. Alternatively, BAPS v6.0 [43], a Bayesian method for analysing population structure, can detect range-wide panmixia. A preliminary population mixture analysis for spatial clustering of individuals in BAPS used no a priori population information and K = 13. A subsequent BAPS run was then conducted using a priori population information. Allele frequencies, deviation from Hardy-Weinberg equilibrium (observed (Ho) versus expected (He) heterozygosity), and pairwise F values [44] were calculated with 999 permutations using GenAlEx v6.0 [39] and P values were corrected for multiple tests using a Benjamini-Hochberg correction for FDR [38]. Allelic richness was calculated in FSTAT v2.9.3 [45]. A Mantel’s test was performed to test for isolation by distance using the same procedure as for mitochondrial DNA. Bayesian clustering analyses were conducted using the programs STRUCTURE v2.3.3 [46,47] and BAPS v6.0 [48]. STRUCTURE was run with the following settings: K = 1-15, a burn-in of 100,000 followed by 500,000 runs, admixture assumed, correlated allele frequencies and including a priori population information 10 replicates were performed for each value of K. In STRUCTURE, it can be difficult to decide when K captures major structure in the data due to similar lnP(X|K) values, thus two additional analyses were conducted using STRUCTURE results: Bayes factor calculations [46] and ΔK [49] using STRUCTURE HARVESTER [50]. One limitation with using STRUCTURE HARVESTER is that it cannot detect when K = 1 is the true number of clusters. However, Bayes factors and BAPS are able to detect if K = 1, so the addition of these methods is useful in species where panmixia is a possibility. A preliminary population mixture analysis for spatial clustering of individuals in BAPS used no a priori population information and K = 13. A subsequent BAPS run was then conducted using a priori population information.

Species distribution modeling

Sample points

In addition to our sample (field and museum) locations, geo-referenced Clark’s nutcracker locations were obtained from the Global Biodiversity Information Facility (GBIF; http://data.gbif.org/, accessed on 22 January 2013). We excluded occurrences outside of North America, without geo-references, or recorded before 1950 from modeling and further inspected accuracy by plotting points using ArcMap v10.1 (ESRI: Redlands, CA). We removed duplicate records prior to model-building.

Bioclimatic data

Current bioclimatic data were extracted from the WORLDCLIM dataset (v1.4, http://www.worldclim.org/) and LGM bioclimatic data from the Model for Interdisciplinary Research on Climate (MIROC) database [51] at 2.5 min resolution (~4 x 4 km tiles). The current bioclimatic dataset ranges over a 50 year period (1950 - 2000), hence exclusion of observations prior to 1950. Nineteen bioclimatic variables are included in the WORLDCLIM current and LGM dataset [52]. ArcMap v10.1 (ESRI) was used to clip climatic variable layers to include only North America as using smaller geographic areas can improve the predictive power of Maxent models [53]. Prior to constructing SDM, ENMTools v1.3 [54] was used to determine which bioclimatic variables were correlated using R > 0.90 as a cutoff. Nine variables were correlated with at least one other variable and all but one from each set of correlated variables was removed.

Maximum entropy (Maxent) distribution modeling

Maxent v3.3.3 [55] was used to model suitable habitat distributions for Clark’s nutcracker at present and during the LGM using hinge features only, regularization multiplier = 1, max number of background points = 10,000, replicate run type of 10 cross-validations, 500 maximum iterations, and 0.00001 convergence threshold. Resulting layers were then exported to ArcMap v10.1, overlayed on a digital elevation map, and processed into images.

Results

Genetic analyses

Mitochondrial DNA control region sequences (n = 169; Table S1) from 15 populations range-wide revealed 48 variable sites in a 900 base pair (bp) region. In total, 68 haplotypes were recognized in the median-joining analysis (GenBank accession nos. KF687612-KF687679; Figure 2; Table 3). Eighteen haplotypes accounted for 119 individuals; 50 singleton haplotypes were found (Figure 2; Table 3). The largest haplotype group contained a total of 50 individuals from 14 of the 15 populations analysed. Most haplotypes were a single mutational step removed from each other. Overall, limited geographic clustering was observed in the maximum parsimony network (Figure 2).
Figure 2

Clark’s nutcracker mitochondrial DNA haplotype network.

Legend: Maximum parsimony network constructed using a median joining algorithm and post-processed with a maximum parsimony procedure for a 900 base pair fragment of mitochondrial control region from 169 Clark’s nutcrackers across North America. Each colour represents a different population (n = 15). Circle size is proportional to number of individuals with that haplotype. Haplotypes represented as pie charts include individuals from multiple populations with pie segments proportional to the number of individuals from each population. Capital letters are those assigned to shared haplotypes (see Table 3). Small black circles represent inferred nodes. See Table 1 for population abbreviations.

Table 3

Geographic distribution of shared haplotypes.

Haplotype CAB SAB MT WY UT CO NM SCA CCA NECA SOR NEOR WA NEWA SEBC Total
A 364510412114333- 50
B --3411-12-23--- 17
C --11-1--1--3--- 7
D -------41------ 5
E ----2--11----1- 5
F ---1-21-------- 4
G ------1---3---- 4
H --11-------11-- 4
I ------21------- 3
J --1---11------- 3
K --------2--1--- 3
L --1--------1--- 2
M ---2----------- 2
N -------2------- 2
O --1----1------- 2
P 1-1------------ 2
Q ----1---1------ 2
R -----1------1-- 2
Total 4 6 13 14 14 9 6 13 9 1 9 12 5 4 0 119
Unique 437554212138221 50

Haplotype codes correspond to those found in Figure 2. Population abbreviations and locations are found in Table 1.

Clark’s nutcracker mitochondrial DNA haplotype network.

Legend: Maximum parsimony network constructed using a median joining algorithm and post-processed with a maximum parsimony procedure for a 900 base pair fragment of mitochondrial control region from 169 Clark’s nutcrackers across North America. Each colour represents a different population (n = 15). Circle size is proportional to number of individuals with that haplotype. Haplotypes represented as pie charts include individuals from multiple populations with pie segments proportional to the number of individuals from each population. Capital letters are those assigned to shared haplotypes (see Table 3). Small black circles represent inferred nodes. See Table 1 for population abbreviations. Haplotype codes correspond to those found in Figure 2. Population abbreviations and locations are found in Table 1. Haplotype diversity (Hd) values ranged from 0.583 (southern Alberta (SAB)) to 0.964 (Montana (MT), New Mexico (NM), and central California (CCA)) and nucleotide diversity (π) ranged from 0.00127 (SAB) to 0.00402 (northeastern Oregon) for populations with greater than five samples (Table 1). When all samples were combined, overall Hd was 0.854, and overall π was 0.00324 indicating high levels of genetic diversity in Clark’s nutcracker. In 78 population pairwise comparisons of genetic differentiation, ΦST values ranged from -0.050 (P corrected = 0.779) for Utah (UT) and northeast Washington (NEWA) to 0.359 (P corrected = 0.100) for NM and SAB (Table 4). Most population pairs (57/78) had ΦST values less than 0.050, though 14 pairs had ΦST values between 0.050 and 0.150, four pairs had ΦST values between 0.150 and 0.250, and three pairs had ΦST values above 0.250. Only four of those ΦST values were significant after the FDR correction and all occurred between NM and other populations (UT, Wyoming (WY), southern California (SCA), and NEWA). The mismatch distribution did not deviate significantly from the expected signature, so we cannot reject the null hypothesis of historical population growth and expansion [41]. Geographic distance in 78 paired populations ranged from 339 km for WY and UT to 2074 km for NM and central Alberta (CAB). A low, but significant correlation between genetic and geographic distance was detected for mitochondrial DNA (R 2 = 0.082, P = 0.040; Figure 3). When the NM population was removed from the analyses, the relationship between genetic and geographic distance still trended towards isolation by distance, though it was no longer significantly correlated (R 2 = 0.064, P = 0.058; not shown).
Table 4

Population pairwise ΦST values for Clark’s nutcracker mitochondrial DNA.

MT CO
NM
UT
WY NEOR SOR
CCA
SCA
WA NEWA
SAB CAB
MT

0.832
0.1000.5090.714
0.373
0.316
0.5090.373
0.8250.530
0.468
0.574
CO
-0.026

0.2700.4140.883
0.546
0.714
0.7710.414
0.7710.373
0.373
0.414
NM
0.186
0.147
<0.001 <0.001
0.234
0.509
0.289 <0.001
0.373 <0.001
0.100
0.234
UT
0.005
0.017
0.281 0.589
0.296
0.289
0.5090.296
0.7140.779
0.473
0.270
WY
-0.012
-0.031
0.156 0.001
0.462
0.509
0.7710.373
0.7710.654
0.479
0.414
NEOR 0.0230.0030.0990.0420.0070.3830.7560.2700.7560.4140.2890.414
SOR 0.039-0.0160.0090.0720.0120.0130.7790.3730.7140.3730.1000.366
CCA 0.008-0.0250.1020.011-0.014-0.015-0.0310.6540.7710.5130.2700.456
SCA 0.0220.023 0.202 0.0430.0340.0450.063-0.0130.6910.4990.3340.366
WA -0.041-0.0350.102-0.019-0.032-0.020-0.025-0.0320.0030.7710.3730.714
NEWA 0.0050.037 0.282 -0.050-0.0050.0310.0800.0100.023-0.0460.8760.509
SAB 0.0170.0770.3590.0170.0190.0660.1530.0760.0590.032-0.0340.289
CAB 0.0030.0280.1800.0550.0280.0290.0610.0210.048-0.0180.0290.066

ΦST values are below diagonal, significance values corrected for false discovery rate above diagonal. Values are based on 110 permutations for mitochondrial DNA control region in 166 Clark’s nutcrackers from 13 populations (n ≥ 5) in North America. Bolded values indicate significance. See Table 1 and Figure 1 for sampling locations and abbreviations.

Figure 3

Mantel test of isolation by distance for mitochondrial DNA.

Legend: R 2 = 0.082, P = 0.040. Each point represents one population pairwise ΦST/ (1- ΦST) plotted against geographic distance between paired populations.

ΦST values are below diagonal, significance values corrected for false discovery rate above diagonal. Values are based on 110 permutations for mitochondrial DNA control region in 166 Clark’s nutcrackers from 13 populations (n ≥ 5) in North America. Bolded values indicate significance. See Table 1 and Figure 1 for sampling locations and abbreviations.

Mantel test of isolation by distance for mitochondrial DNA.

Legend: R 2 = 0.082, P = 0.040. Each point represents one population pairwise ΦST/ (1- ΦST) plotted against geographic distance between paired populations. Both AMOVA and SAMOVA showed that most genetic variation is contained within rather than among population groups. The highest value of between-group variation was observed when K = 1 for AMOVA analyses (3.4% between groups, 96.6% within groups) and for K = 2 in SAMOVA (17.8% between groups, 80.7% within groups), with one group consisting of NM and the other of all other populations pooled. The preliminary (no a priori population information) and secondary (using a priori population information) BAPS analyses for spatial clustering of individuals found no significant differentiation between individuals or populations (i.e., K = 1). A total of seven polymorphic loci (Table 5) were used for analyses; an eighth locus (ApCo37) was removed due to limited amplification success and subsequent missing data (~50%). We used the thirteen populations with greater than five samples for microsatellite analyses (Table 1; Table S1). The total number of alleles detected ranged from four in PnuA106W to 10 in ApCo40 with an average of 6.43. Only 13 of the 91 loci-population comparisons showed significant deviations from Hardy-Weinberg and all showed lower heterozygosity than expected. ApCo41 has a significant heterozygote deficit in five of the 13 populations sampled and while two of these (NEWA and CAB) had fewer samples (n ≤ 8), the others had a larger number of samples (n ≥ 14; Table 5).
Table 5

Summary table of seven microsatellite loci used to analyze 13 Clark’s nutcracker populations.

ApCo19 ApCo30 ApCo40 ApCo41 ApCo46 ApCo91 PnuA106W
Central Alberta (n=8)
An 3352442
Ar 2.3592.3613.1601.6922.9672.9631.429
Ho 0.5710.6250.7500.0000.4290.6250.143
He 0.5000.5550.6250.2450.6020.6480.133
P nsnsns**nsnsns
Southern Alberta (n=12)
An 3352622
Ar 2.1902.2743.0392.0004.0181.8461.908
Ho 0.5000.4000.5830.6670.7500.1430.417
He 0.4970.5350.6110.4440.8060.3370.413
P nsnsnsNsnsnsns
Montana (n=25)
An 3364553
Ar 2.1422.5663.4301.8353.4463.0621.878
Ho 0.6400.4400.8400.3200.7830.7500.320
He 0.4860.5860.7340.2820.7380.6840.351
P ns*nsNsnsnsns
Wyoming (n=30)
An 3493533
Ar 2.0222.5103.9371.6202.6862.3141.800
Ho 0.4830.3570.7240.1360.5500.4000.241
He 0.4630.5430.7930.2080.5560.4600.313
P nsns*******ns
Utah (n=20)
An 2463533
Ar 1.9622.5583.2471.8912.9262.4261.862
Ho 0.5790.2940.6500.3160.4710.5710.278
He 0.4780.5280.7000.3420.6190.5380.323
P ns***nsNs*nsns
Colorado (n=13)
An 3362543
Ar 2.2472.3533.5091.2733.6432.6322.062
Ho 0.3850.4170.7690.0910.7500.4440.385
He 0.4620.5170.7280.0870.7600.5120.411
P nsnsnsNsnsnsns
New Mexico (n=9)
An 3352432
Ar 2.2412.3173.4521.8673.0842.4671.975
Ho 0.6670.7780.7500.4000.4290.4000.111
He 0.4750.5490.7190.3200.6630.4600.475
P nsnsnsNsnsns*
Southern California (n=14)
An 3373542
Ar 2.1112.3753.6151.6463.2652.6451.976
Ho 0.4290.5711.0000.0770.5710.6360.286
He 0.4570.4870.7370.2100.6940.5660.490
P nsnsns**nsnsns
Central California (n=11)
An 3372443
Ar 1.9092.2314.2331.8343.1732.9612.247
Ho 0.3640.5450.6360.2730.8000.6360.636
He 0.3100.5170.8180.3510.6850.6650.533
P nsnsnsNsnsnsns
Southern Oregon (n=12)
An 2563442
Ar 1.9402.8943.7592.2022.8432.6451.908
Ho 0.5000.7501.0000.3000.7500.4170.417
He 0.4440.5970.7600.4050.6180.5490.413
P nsnsnsNsnsnsns
Northeast Oregon (n=20)
An 2383642
Ar 1.9702.6813.2941.8913.7982.4061.764
Ho 0.4500.6000.7000.1050.7500.5000.100
He 0.4890.6250.7040.3420.7900.5100.320
P nsnsns**nsns**
Washington (n=7)
An 2372333
Ar 1.9922.6224.0451.5002.7582.6822.359
Ho 0.5000.5710.8570.1670.5000.5000.714
He 0.4860.5710.7650.1530.6250.5690.500
P nsnsnsNsnsnsns
Northeast Washington (n=6)
An 2342532
Ar 1.9982.5452.9691.7733.9242.5451.992
Ho 0.3330.6670.6670.0000.8330.5000.833
He 0.5000.5000.6250.2780.7640.5000.486
P nsnsns*nsnsns
Overall An = 10An = 5An = 10An = 7An = 7An = 7An = 4

Only populations with greater than five samples were used; n = number of samples used in genotyping and analyses; An = number of alleles; Ar = allelic richness; Ho = observed and He = expected heterozygosity; P = departures from Hardy-Weinberg equilibrium (ns = not significant, *P < 0.05, **P < 0.01, ***P < 0.001. See Table 1 for population locations.

Only populations with greater than five samples were used; n = number of samples used in genotyping and analyses; An = number of alleles; Ar = allelic richness; Ho = observed and He = expected heterozygosity; P = departures from Hardy-Weinberg equilibrium (ns = not significant, *P < 0.05, **P < 0.01, ***P < 0.001. See Table 1 for population locations. Limited population differentiation was detected with a global F value of 0.070. Paired F values ranged from 0.000 (14/78 comparisons) to 0.123 (CCA and central Alberta (CAB); Table 6). After Benjamini-Hochberg FDR correction, 21 of 78 paired comparisons were significant (Table 6). A weak, but significant, correlation was found between genetic and geographic distance using Mantel’s test (R 2 = 0.078, P = 0.029; Figure 4).
Table 6

Population pairwise F values for Clark’s nutcracker.

MT CO NM UT WY NEOR SOR CCA SCA WA NEWA SAB CAB
MT 0.1850.4540.2050.1000.4540.210 0.032 0.028 0.4540.3110.0930.196
CO 0.0160.2380.189 0.035 0.1890.2050.017 0.035 0.4540.1890.268 0.016
NM 0.0010.0170.0840.1890.4540.3780.1960.4540.4540.4540.4140.189
UT 0.0090.0160.0330.4540.1890.454 0.023 0.016 0.4540.142 0.017 0.093
WY 0.012 0.028 0.0170.0000.0810.189 0.016 0.016 0.4540.072 0.017 0.035
NEOR 0.0000.0150.0000.0120.0160.167 0.024 0.050 0.4540.4540.1890.210
SOR 0.0120.0170.0070.0010.0130.0200.058 0.024 0.4670.293 0.031 0.032
CCA 0.040 0.072 0.021 0.063 0.059 0.054 0.0360.2340.0930.394 0.017 0.016
SCA 0.038 0.044 0.000 0.071 0.066 0.039 0.062 0.0140.1960.4540.093 0.024
WA 0.0000.0020.0000.0000.0000.0000.0000.0430.0260.4540.3580.267
NEWA 0.0140.0330.0000.0360.0400.0000.0170.0100.0000.0000.1960.072
SAB 0.0260.0110.004 0.045 0.046 0.017 0.052 0.090 0.0340.0080.027 0.037
CAB 0.020 0.075 0.0320.034 0.041 0.020 0.066 0.123 0.096 0.0210.071 0.057

F values are below diagonal and significance values corrected for false discovery rate are above diagonal. Values are based on 110 permutations using seven polymorphic microsatellite loci in 187 Clark’s nutcrackers from 13 populations in North America. See Table 2 and Figure 1 for population abbreviations.

Figure 4

Mantel test of isolation by distance for seven microsatellite loci.

Legend: R 2 = 0.078, P = 0.029. Each point represents one population pairwise F / (1- F) plotted against straight line geographic distance between paired populations.

F values are below diagonal and significance values corrected for false discovery rate are above diagonal. Values are based on 110 permutations using seven polymorphic microsatellite loci in 187 Clark’s nutcrackers from 13 populations in North America. See Table 2 and Figure 1 for population abbreviations.

Mantel test of isolation by distance for seven microsatellite loci.

Legend: R 2 = 0.078, P = 0.029. Each point represents one population pairwise F / (1- F) plotted against straight line geographic distance between paired populations. STRUCTURE results suggested that the optimal number of groups was K = 1 (Figure 5A). Further analyses produced peaks at K = 1, K = 4, and K = 12 (Figure 5B), though K = 1 produced the highest peak and highest posterior probabilities (mean (Pr (K=1)) = -2625.45) and is supported by Bayes factor values (Pr (K=1) = 1) and BAPS results.
Figure 5

Measures of optimal K values from STRUCTURE analyses.

Legend: A) Penalised log likelihood test (mean ± SE) and B) ∆K depicting clusters (K). The penalised log likelihood test takes the maximum ln Pr (X | K) as the correct number of clusters and ∆K infers the number of clusters from the difference between ln Pr (X | K).

Measures of optimal K values from STRUCTURE analyses.

Legend: A) Penalised log likelihood test (mean ± SE) and B) ∆K depicting clusters (K). The penalised log likelihood test takes the maximum ln Pr (X | K) as the correct number of clusters and ∆K infers the number of clusters from the difference between ln Pr (X | K).

Species distribution modeling

A total of 1563 presence records were used for training and 174 for testing the distribution models. After removing correlated bioclimatic layers, 10 layers were used for model construction (bio1-4, 8, 12, 14-15, 18-19). Mean area under the receiver operating curve (AUC) was 0.907 (range = 0.906 - 0.908), suggesting that the current model was highly suitable for backcasting to the paleodistribution model. Isothermality (mean diurnal temperature range/mean annual temperature range (bio3)), temperature seasonality (bio4), and mean annual temperature (bio1) were the most important variables contributing to the model at 44.3%, 37.7%, and 11.1% respectively. The current distribution predicted by the Maxent model (Figure 6A) closely approximated the known Clark’s nutcracker range in North America (Figure 1). The model returned a maximum 66% probability of suitable habitat throughout the range. The paleodistribution model returned a maximum 25% probability of predicted suitable habitat, mostly concentrated in the western mountainous regions and along the southeast edge of the ice sheets during the last glacial maximum (Figure 6B).
Figure 6

Current and paleodistribution models of suitable Clark’s nutcracker habitat in North America.

Legend: Probability of suitable Clark’s nutcracker habitat at A) present (overlaid on digital elevation model and B) during last glacial maximum (~21 kya), overlaid on current North American boundaries generated by species distribution modeling. In B), blue areas outside of current coastlines represent historical extent of terrestrial landmass.

Current and paleodistribution models of suitable Clark’s nutcracker habitat in North America.

Legend: Probability of suitable Clark’s nutcracker habitat at A) present (overlaid on digital elevation model and B) during last glacial maximum (~21 kya), overlaid on current North American boundaries generated by species distribution modeling. In B), blue areas outside of current coastlines represent historical extent of terrestrial landmass.

Discussion

Population genetic structure

Using a combination of measures of population differentiation and clustering analyses, our study suggests high levels of population connectivity between most Clark’s nutcracker populations in North America. We found limited differentiation in mitochondrial DNA (Table 4), with only the New Mexico population showing significant pairwise differences from four other populations. Microsatellite data indicate higher levels of population differentiation between Clark’s nutcracker populations, though this predominantly occurs between populations at the edges of the species’ range (e.g. CAB and SCA; Table 6). Geographic distance rather than barriers appear to influence mitochondrial and nuclear gene flow, as there is a weak, but significant effect of isolation by distance on nutcracker populations for both molecular markers (Figures 4 and 5). However, for mitochondrial DNA, this pattern seems to be driven by populations in New Mexico, suggesting limited peripheral isolation for this marker. For microsatellite data, additional peripheral populations (CAB, SAB, and SCA) are significantly differentiated in multiple comparisons (Table 6), suggesting peripheral isolation in addition to isolation by distance. Clustering analyses do not support strong population differentiation for either nuclear or mitochondrial markers, with a single population being the best fit for the data in both cases. In the case of microsatellite data, these results should be interpreted with caution as both STRUCTURE and BAPS may struggle with situations where differentiation is low (F < 0.030) [56], though global values (F = 0.070) suggest this is likely not the case. High levels of haplotype, nucleotide, and allelic diversity throughout most nutcracker populations suggest large population sizes and few founder effects or population bottlenecks. The exception to this is the southern Alberta population, which exhibits reduced haplotype and nucleotide diversity for mitochondrial markers relative to the rest of the populations, though microsatellite loci have high levels of allelic richness and heterozygosity (Table 5). This difference between marker genetic diversity may be due to a historical founder effect in previously glaciated southern Alberta (mtDNA), with nuclear gene flow being less limited (microsatellites). Patterns of potential panmixia and limited population differentiation are not unique to Clark’s nutcracker. Many other species that undergo irruptive and seasonal dispersal also exhibit limited population differentiation despite the presence of proposed barriers to dispersal. Studies of Pinus seed specialists, such as pygmy nuthatch (Sitta pygmaea) [57], crossbill species (Loxia spp.) [19], and Clark’s nutcracker’s sister species, Eurasian nutcracker (Nucifraga caryocatactes) [58], found no significant genetic differentiation among populations despite potential barriers to dispersal. Limited differentiation has been found in other species with irruptive or seasonal dispersal, such as boreal owl (Aegolius funereus) [59], and European redpoll species (Carduelis spp.) [60], yet Steller’s jay (Cyanocitta stelleri) [14] and white-breasted nuthatch (Sitta carolinensis) [61] exhibit significant differentiation between populations and limited gene flow corresponding with barriers to dispersal. In other taxa, peripheral populations are more likely to be isolated due to reduced gene flow [16,62,63], while maintaining panmixia in the majority of the range [64]. Overall, Clark’s nutcracker population connectivity seems to be primarily limited by distance with some peripheral isolation at the southern and northern edges of the range, which is reasonable for a high elevation species with irruptive and seasonal altitudinal dispersal.

Postglacial colonization

After the LGM, species that rapidly expanded from a single refugium with high levels of gene flow are expected to exhibit three characteristics: 1) a star-like phylogeny with many low frequency single haplotypes separated from high frequency central ancestral haplotypes by few mutational steps [65]; 2) low levels of genetic subdivision between and within populations [4,66]; and 3) a mismatch distribution of pairwise differences among haplotypes indicating a sudden increase in expansion from a single population [67]. These signals of expansion from a single refugium have been reported in several North American bird species (e.g., multiple species reviewed in 68, downy woodpecker (Picoides pubescens) [20]) as well as many plant species, including several Pinus species that Clark’s nutcracker specializes upon (reviewed in 13). Clark’s nutcracker mitochondrial DNA exhibits a star-like phylogeny with one major shared haplotype, and short connections between this haplotype and others (Figure 2), in addition to a modeled paleodistribution that suggests that this species expanded from a single refugium after the LGM. This pattern is less complex than that found in whitebark pine postglacial expansion: colonization is thought to have occurred from three refugia (Oregon, Idaho, and Colorado) [25]. Given that Clark’s nutcrackers range extends farther south than that of whitebark pine, nutcrackers may have been isolated in a more southern refugium and expanded into whitebark pine refugia from there, subsequently assisting with seed distribution northward. In addition to the aforementioned characteristics, temperate species that expanded north from a southern refugium have been shown to exhibit genetic differentiation between northern and southern populations corresponding to previously unglaciated and glaciated areas [12]. Southern populations were less impacted by glaciation and should show higher levels of nucleotide diversity and phylogeographic structure compared to northern populations [69]. This pattern occurs to varying degrees in Pinus species on which Clark’s nutcracker commonly feed: whitebark pine (Pinus albicaulis) [25], white pine (P. monticola) [70], limber pine (P. flexilis) [71,72], and Ponderosa pine (P. ponderosa) [73,74]. However, this pattern does not distinctly appear in Clark’s nutcracker. All southern and northern populations sampled exhibit high levels of heterozygosity, nucleotide and allelic diversity, and limited differentiation from other populations, making it very challenging to pinpoint a single refugium for this species. In addition, while species distribution models for current suitable nutcracker habitat performed very well (high AUC values), paleodistribution models returned reduced probabilities (< 30%) of suitable LGM habitat throughout unglaciated areas. Taken together, these results point to a single refugium or multiple small, connected refugia, the location of which is not obvious through genetic signatures or paleodistribution modeling.

Conclusions and future research

Throughout its range, Clark’s nutcracker exhibits low levels of population differentiation and does not appear to be limited by potential barriers to dispersal, though peripheral populations may be slightly isolated. Future research could integrate additional mitochondrial markers to elucidate the location of a glacial refugium or refugia for this species. Mitochondrial markers with lower rates of mutation in corvids (e.g. cytochrome B) [31] may retain historic genetic signals longer and thus provide additional historic information to this study. Samples at the northern and isolated southern extremes of the range may also increase our understanding of peripheral isolation and postglacial colonization for this species. Given the importance of Clark’s nutcracker to dispersal of Pinus species in North America [21,22] and current conservation and management concerns for many Pinus species [75], further understanding of nutcracker dispersal may also assist in future forest management and recovery plans. Summary table of Clark’s Nutcracker samples used in analyses. Sequence (mtDNA): Y = successfully sequenced. N = not sequenced. Genotyped (SSR): Y = successfully genotyped. N = not genotyped. Sources include Burg lab (wild), TS = T. Schaming, The Field Museum (TFM), Burke Museum at the University of Washington (UWBM), Museum of Southwest Biology (MSB), Louisiana State University Museum of Natural Sciences (LSU), American Museum of Natural History (AMNH), Smithsonian National Museum of Natural History (USNM), Royal Saskatchewan Museum (RSM) and Royal Alberta Museum (RAB). (XLS) Click here for additional data file.
  42 in total

1.  Inference of population structure using multilocus genotype data.

Authors:  J K Pritchard; M Stephens; P Donnelly
Journal:  Genetics       Date:  2000-06       Impact factor: 4.562

2.  Dynamics and phylogenetic implications of MtDNA control region sequences in New World Jays (Aves: Corvidae).

Authors:  M A Saunders; S V Edwards
Journal:  J Mol Evol       Date:  2000-08       Impact factor: 2.395

3.  Microsatellite and mitochondrial DNA homogeneity among phenotypically diverse crossbill taxa in the UK.

Authors:  S B Piertney; R Summers; M Marquiss
Journal:  Proc Biol Sci       Date:  2001-07-22       Impact factor: 5.349

4.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies.

Authors:  Daniel Falush; Matthew Stephens; Jonathan K Pritchard
Journal:  Genetics       Date:  2003-08       Impact factor: 4.562

5.  Ice sheets promote speciation in boreal birds.

Authors:  Jason T Weir; Dolph Schluter
Journal:  Proc Biol Sci       Date:  2004-09-22       Impact factor: 5.349

6.  Rapid divergence and postglacial colonization in western North American Steller's jays (Cyanocitta stelleri).

Authors:  Theresa M Burg; Anthony J Gaston; Kevin Winker; Vicki L Friesen
Journal:  Mol Ecol       Date:  2005-10       Impact factor: 6.185

7.  Testing hypotheses of Pleistocene population history using coalescent simulations: phylogeography of the pygmy nuthatch (Sitta pygmaea).

Authors:  Garth M Spellman; John Klicka
Journal:  Proc Biol Sci       Date:  2006-12-22       Impact factor: 5.349

8.  Phylogeography of the white-breasted nuthatch (Sitta carolinensis): diversification in North American pine and oak woodlands.

Authors:  Garth M Spellman; John Klicka
Journal:  Mol Ecol       Date:  2007-04       Impact factor: 6.185

9.  Global relationships amongst black-browed and grey-headed albatrosses: analysis of population structure using mitochondrial DNA and microsatellites.

Authors:  T M Burg; J P Croxall
Journal:  Mol Ecol       Date:  2001-11       Impact factor: 6.185

10.  COMPARATIVE PHYLOGEOGRAPHY IN NORTH AMERICAN BIRDS.

Authors:  Robert M Zink
Journal:  Evolution       Date:  1996-02       Impact factor: 3.694

View more
  3 in total

1.  Limited geographic genetic structure detected in a widespread Palearctic corvid, Nucifraga caryocatactes.

Authors:  Kimberly M Dohms; Theresa M Burg
Journal:  PeerJ       Date:  2014-06-17       Impact factor: 2.984

2.  Clark's Nutcracker Breeding Season Space Use and Foraging Behavior.

Authors:  Taza D Schaming
Journal:  PLoS One       Date:  2016-02-16       Impact factor: 3.240

3.  The influence of latitude, geographic distance, and habitat discontinuities on genetic variation in a high latitude montane species.

Authors:  J A Hindley; B A Graham; P C Pulgarin-R; T M Burg
Journal:  Sci Rep       Date:  2018-08-07       Impact factor: 4.379

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.