Kung-Ping Lin1, Shu-Miaw Chaw2, Yun-Hwa Lo1, Teruo Kinjo3, Chien-Yi Tung4, Hsi-Chi Cheng5, Quintin Liu6, Yoko Satta6, Masako Izawa7, Shiang-Fan Chen8, Wen-Ya Ko1. 1. Department of Life Sciences and Institute of Genome Sciences, National Yang Ming Chiao Tung University, Taipei, Taiwan. 2. Biodiversity Research Center, Academia Sinica, Taipei, Taiwan. 3. Okinawa Zoo and Museum, Okinawa, Japan. 4. Cancer Progression Research Center, National Yang Ming Chiao Tung University, Taipei, Taiwan. 5. Endemic Species Research Institute, Nantou, Taiwan. 6. Department of Evolutionary Studies of Biosystems, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan. 7. Kitakyushu Museum of Natural History and Human History, Fukuoka, Japan. 8. Center for General Education, National Taipei University, New Taipei City, Taiwan.
Island species are often characterized by restricted geographic distributions and small
population sizes, which are considered to be more prone to adverse genetic factors, such as
low genetic diversity and inbreeding depression, than continental species (Ellstrand and Elam 1993; Frankham 1996, 1997, 2005). Both of these genetic factors can increase the
extinction risks of the threatened island species. Loss of genetic diversity can result from
inbreeding, population decline, or restricted gene flow, and is known to reduce the long-term
evolutionary potential of a species (Frankham et al.
1999; Frankham et al. 2002). Inbreeding
was shown to have deleterious effects on multiple aspects of reproductive and survival
characteristics in naturally outbreeding species (Lynch
and Walsh 1998). Therefore, resolving the magnitude of these genetic factors has been
considered important for the development of conservation plans, especially for threatened
insular species with limited gene flow and small population size. In addition, defining
diverged evolutionary units for a threatened population is also considered critical to prevent
potential outbreeding depression in crosses between units (Frankham 2010). Thus, distinct evolutionarily significant units (ESUs) with
significant genetic and ecological divergence in the wild need to be managed separately. To
provide the above critical information for conservation management, genomic data is especially
useful as, by gathering numerous neutral loci, they can provide precise population genetics
estimates of genetic diversity, inbreeding level, and demographic independence among the
target populations (Allendorf et al. 2010; Funk et al. 2012).Bats in the genus Pteropus of the family Pteropodidae are commonly known as
flying foxes or fruit bats. Pteropus is primarily an insular genus with most
species distributed on islands (Vincenot et al.
2017). Flying foxes are found in tropical and subtropical regions, ranging from
islands in East Africa to the south-central Pacific. In these island ecosystems,
Pteropus spp. play critical roles in pollination and seed dispersal for
both endemic plants and plants of economic importance to humans (Cox et al. 1991; Fujita and Tuttle
1991). However, the populations of many Pteropus spp. are declining
due to human disturbances (Mickleburgh et al.
1992). The major anthropogenic menaces to flying foxes include habitat loss caused by
deforestation and urbanization, overhunting for bushmeat, and conflicts with fruit growers.
Moreover, the consequences of population decline in these species are expected to be
exacerbated by their generally low reproductive rates, long maturation times, and small
population sizes (Mickleburgh et al. 2009; Aziz et al. 2017). Hitherto, 7 of 65 flying fox species
became extinct in the past 200 years, and 34 of the remaining species are currently considered
threatened, including Pteropus dasymallus, the Ryukyu flying fox, which has
been classified as Vulnerable (Vincenot 2017).The Ryukyu flying fox is indigenous to Taiwan, the Ryukyu Islands, and several islands off
the north coast of the Philippines (Kinjo and Nakamoto
2015). There are 5 recognized subspecies of P. dasymallus that can
be distinguished by their geographical distributions and by slight differences in coloration
pattern (Yoshiyuki 1989). Among them, the Formosan
flying fox (P. d. formosus) is the most threatened subspecies and considered
to be on the verge of extinction by the IUCN. Other subspecies include the Orii’s flying fox
(P. d. inopinatus; near threatened) with a population size of ~5000
individuals on Okinawa Island (Funakoshi et al.
2006, 2012), the Yaeyama flying fox
(P. d. yayeyama; near threatened) with a few thousand individuals on
Yaeyama Islands, and the Erabu flying fox (P. d. dasymallus; critically
endangered) and Daito flying fox (P. d. daitoensis; critically endangered) on
the Osumi Islands and Daito Islands, respectively (Figure
1).
Figure 1.
Geographic distribution of Pteropus dasymallus subspecies and sample
locations. The geographic ranges of the 5 subspecies of P. dasymallus are
circled. These 5 subspecies include: Erabu flying fox (P. d. dasymallus),
Daito flying fox (P. d. daitoensis), Orii’s flying fox (P. d.
inopinatus), Yaeyama flying fox (P. d. yayeyamae), and
Formosan flying fox (P. d. formosus). The classification of P.
dasymallus populations in the Gueishan Island and northern islands of
Philippines requires further investigation. The original geographic data was downloaded
from the IUCN Red List website (Vincenot 2017)
and edited by QGIS 3.4 (QGIS
Development Team 2019). Three samples of P. d. formosus were
collected from Taiwan and Green Island (designated as 178, 617, and 690). Four samples of
P. d. inopinatus were collected from Okinawa Island (designated as R10,
R36, R4, and R5). See online version for full colors.
Geographic distribution of Pteropus dasymallus subspecies and sample
locations. The geographic ranges of the 5 subspecies of P. dasymallus are
circled. These 5 subspecies include: Erabu flying fox (P. d. dasymallus),
Daito flying fox (P. d. daitoensis), Orii’s flying fox (P. d.
inopinatus), Yaeyama flying fox (P. d. yayeyamae), and
Formosan flying fox (P. d. formosus). The classification of P.
dasymallus populations in the Gueishan Island and northern islands of
Philippines requires further investigation. The original geographic data was downloaded
from the IUCN Red List website (Vincenot 2017)
and edited by QGIS 3.4 (QGIS
Development Team 2019). Three samples of P. d. formosus were
collected from Taiwan and Green Island (designated as 178, 617, and 690). Four samples of
P. d. inopinatus were collected from Okinawa Island (designated as R10,
R36, R4, and R5). See online version for full colors.Early studies suggested that Formosan flying foxes are native to Green Island, located in
southeast of Taiwan (Kano 1929; Kuroda 1933). Individuals of P. d.
formosus were also recorded on the east coast of Taiwan but they were presumed to
be solitary without a stable and sustainable population. The population size of P. d.
formosus on Green Island was estimated to be up to a few thousand before a dramatic
decline during the 1970s, which led to a near extirpation of the population on the island with
only a few individuals observed over the last 2 decades (Lin and Pei 1999; Chen et al. 2009).
Overhunting was thought to be the main factor causing this population decline, combined with
the effect of habitat loss due to windbreak planting in the coastal region. Although the
hunting pressure was greatly reduced after the implementation of a conservation law in 1983,
the lost habitat (mostly Ficus species) has never recovered. Therefore,
habitat abandonment by P. d. formosus could also be responsible for the
apparent disappearance of this population, especially since flying foxes are generally
considered highly mobile species that are capable of long-range migrations and interisland
movements (Webb and Tidemann 1996; McConkey and Drake 2007; Brown et al. 2011; Roberts et al.
2012). For example, the Orii’s flying fox has been reported to be able to travel 50
km across the sea between islands (Nakamoto et al.
2011). In 2004, a small but stable population (~50 individuals) was newly recorded on
Gueishan Island off the northeast coast of Taiwan (Wu
2010). However, the origin of this newly established population is unclear since
Gueishan Island is geographically closer to Yaeyama Islands (inhabited by the Yaeyama flying
fox) than to Green Island (Figure 1). Due to its small
population size, severe population decline, and fragmented habitats, the Formosan flying fox
is listed as an Endangered Species by the Wildlife Conservation Act in Taiwan (Cheng et al. 2017). Thus, a precise and detailed
conservation policy for this endangered taxon is urgently needed. In contrast, whether to
treat P. d. formosus as a distinct ESU from other populations on the Ryukyu
Islands remains an obscure issue given their high migratory abilities.Here, we applied double digest restriction-site associated DNA sequencing (ddRADSeq) method
to study the conservation genetics of P. d. formosus in Taiwan and P.
d. inopinatus in the main island of Okinawa. First, we assessed the adverse genetic
factors of these 2 populations by estimating their levels of nucleotide diversity and
inbreeding. Second, we evaluated whether these 2 populations should be treated as distinct
ESUs and managed separately by uncovering the pattern of genetic structure between these 2
populations. We further inferred the hypothetical demographic model of these 2 populations
based on their joint folded site frequency spectrum and revealed very different trajectories
of their recent population history. Our results can provide critical insights into the
conservation management decisions of both P. d. formosus and P. d.
inopinatus.
Materials and Methods
Sample Collection, DNA Extraction, and RAD Sequencing
We generated genome-wide single nucleotide polymorphism (SNP) data from 7 individuals.
Three individuals are P. d. formosus, an endangered population endemic to
Taiwan, while 4 are P. d. inopinatus, which inhabits Okinawa and has been
considered a stable population. The 3 P. d. formosus individuals,
designated as 178, 617, and 690, were captured from Hualien (eastern Taiwan), Green Island
(an island offshore of eastern Taiwan), and the open sea off the coast of Yilan
(northeastern Taiwan), respectively. The 4 P. d. inopinatus individuals,
designated as R10, R36, R4, and R5, are captive individuals collected from the wild and
kept at the University of the Ryukyus. The detailed sampling locations for all individuals
are presented in Figure 1 and Supplementary Table S1. For the 3
P. d. formosus individuals, frozen muscle tissues were obtained from
the Wildlife Cryobank of Taipei Zoo. For each of the 4 P. d. inopinatus
individuals, skin samples were excised from the wing membranes using a biopsy punch (3 mm
in diameter); 8 punches were collected per individual. All biopsies were preserved in
Allprotect Tissue Reagent (Qiagen) and stored at −20 °C. Genomic DNA was extracted from
samples using the DNeasy Blood and Tissue kit (Qiagen) following the manufacturer’s
instructions. To better lyse the tissues, the wing membrane samples were incubated in
buffer ATL and proteinase K at 56 °C for approximately 17 hours. An
additional 20 μL of proteinase K was then added and further incubation
for a few hours was conducted depending on lysis conditions.Considering that P. d. formosus and P. d. inopinatus
are morphologically similar taxa and that few genetic variants have been discovered among
P. dasymallus subspecies at mitochondrial loci (Hidetoshi Ota, personal
communication, 2010), we applied a genomic approach, double digest restriction-site
associated DNA sequencing (ddRADSeq) method, which is designed for sequencing orthologous
loci across individual genomes of the same or closely related species (Peterson et al. 2012). For a given individual
genome, 2 restriction enzymes, EcoRI and MseI, were used to cleave the DNA. The cleaved
DNA fragments were ligated with adaptors that contain barcodes and sequences complementary
to the PCR primers. After these ligated fragments were amplified by PCR, fragments with
lengths between 250 base pairs (bp) and 450 bp were retained and sent for paired-end
sequencing. Sequencing was carried out using the Illumina HiSeqTM 2000
sequencing system in the core facility center of Genomics company in New Taipei City.
Quality Controls and RADSeq Assembly
The demultiplexed paired-end sequence reads were processed with
SOAPnuke2 to discard low-quality reads and unwanted sequences (Chen et al. 2018). First, reads with correct
barcodes were retained exclusively and their barcodes were trimmed. Reads without correct
restriction enzyme cleavage sites were subsequently removed from the dataset. Low-quality
reads were also removed if the proportions of base pairs with Phred quality scores lower
than 15 were >40% or the proportions of unidentified base pairs were >10%. Next, we
removed the restriction enzyme cleavage sites from all remaining reads. The remaining
forward reads were trimmed to 87 bp to accommodate the differences in sequence lengths
caused by barcode trimmings and restriction enzyme cleavage site removals.We performed paired-end RAD sequence assemblies using Stacks 2.2 (Catchen et al. 2011). This software consists of
several core programs. The first program, ustacks, established “stacks,”
the piles of reads with exact matches that serve as putative alleles, for each individual
by aligning the raw forward reads with each other. The minimum number of reads (read
depth) that is required to start a stack was governed by m. Next, similar
stacks were aligned against each other within each individual to form putative loci (pairs
of putative alleles) if the number of mismatches between them was ≤ M
(maximum number of allowed mismatches). The second program, cstacks, was
performed to align the putative loci across all 7 individuals to synthesize a catalog with
the maximum number of allowed mismatches defined by the parameter n. The
putative loci from all individuals were subsequently matched against the catalog to
identify the unique loci combinations on each RAD locus across all individuals; this
procedure was conducted using the sstacks program. Thereafter, the reads
from both ends were merged into contigs using the gstacks program.
Lastly, the genetic variants and assembled contigs were called using the
populations program. We executed the above programs using a built-in
pipeline, denovo_maps.pl, with different sets of m,
M, and n; secondary read incorporations were disabled
and the deleveraging algorithm was enabled. Only sites that were sequenced in all
individuals were assembled to prevent systematic biases on population genetics statistics,
including estimates of nucleotide diversity and Tajima’s D (Arnold et al. 2013; Gautier et
al. 2013). RAD loci without correct matches with reverse reads were omitted from
the remaining studies.Stacks was run multiple times to search for the optimal set of assembled
parameters (m, M, n). First, since
m controls the starting read depth of each stack, a higher
m is expected to give higher sample coverage while resulting in fewer
assembled loci and SNPs; this trend diminishes with increased m (Paris et al. 2017). We therefore plotted the
distribution of sample coverage against different values of m (while
fixing M and n to 3). An optimal value of
m was chosen if its corresponding value was close to the plateau of
sample coverage on the inward curve. Next, we searched for the optimal values of
M and n while fixing m to the
previously determined value. We took a likelihood framework to search for the optimal
M and n by incorporating the observed SNP density
estimated from the data. For each given pair of (M, n),
the empirical per-site SNP density was calculated by the total number of SNPs in all
forward reads divided by the total number of base pairs of all forward reads. The
distribution of the number of SNPs on each sequence read can be constructed using a
binomial distribution with the number of trials equivalent to read length and the success
probability for each trial equivalent to SNP density. Subsequently, the likelihood
estimate of the empirical distribution of the SNP number per read can be calculated based
on the multinomial probability mass function,where x is the empirical number of reads
with i SNPs observed, q is
the binomial probability of observing i SNPs on an l-bp
long read (l = 87 in this study), and r is the total
number of reads (i.e., ∑x). For each given
pair of (M, n), we randomly sampled 100 000 reads 20
times from the assembly without replacement. The pair of (M,
n) that gives the highest likelihood estimate is the parameter set that
fits best to the empirical distribution of SNP number across all forward reads. In
addition, Kullback–Leibler (KL) divergences were also calculated to measure the similarity
between the empirical distribution of the number of SNPs per read and the theoretical
binomial distribution (Kullback and Leibler
1951). We also constructed an alternative likelihood estimator based on
Watterson’s theoretical distribution of segregating sites under one panmictic population
(Watterson 1975). All the parameter
optimization procedures were carried out using our custom Python scripts (available on
request).Since SNPs with excessive heterozygosity across all individuals can result from
misaligning reads of paralogous or repetitive loci, we applied the Hardy-Weinberg
equilibrium (HWE) test to exclude the assembled reads or SNPs that showed significant
deviations from HWE (P < 0.05). Two types of heterozygosity filters
were applied to the dataset to generate datasets for population genetics analyses that
rely on accurate nucleotide diversity estimates or only genetically unlinked SNPs. The
heterozygosity filters were conducted using the R package HardyWeinberg
(Graffelman and Camarena 2008; Graffelman 2015).
Nucleotide Diversity and Inbreeding Coefficient Estimations
We estimated genome-wide nucleotide diversity based on both pairwise differences and
Watterson’s estimator for both P. d. formosus and P. d.
inopinatus. For each estimate, the per-site nucleotide diversity was calculated
for each RAD locus. The mean and standard deviation were then computed based on its
empirical distribution across all RAD loci. Tajima’s D was also
calculated using the R package pegas 0.12 (Paradis 2010) with parametric P-values constructed
from the beta distribution (Tajima 1989a).
Fixation indices were calculated using GENEPOP 1.1.7 implemented in R
(Raymond and Rousset 1995; Rousset 2008). For comparison, we also computed the
above summary statistics for the RADSeq data of 3 widespread bat species (Lasiurus
cinereus, L. borealis, and Lasionycteris
noctivagans) that were obtained from Sovic
et al. (2016).
Genetic Structure Detection
We next performed both principal component analysis (PCA, conducted using
PLINK 1.9; Chang et al.
2015; Purcell and Chang 2020) and
STRUCTURE 2.3.4 (Pritchard et al.
2000) to detect the genetic structure between P. d. formosus and
P. d. inopinatus. A subset of SNPs was generated by randomly selecting
one SNP from each locus to assure that these SNPs are genetically unlinked. For PCA, a
Kruskal-Wallis H test was further performed on the coordinates of the first principal
component to detect the significance between 2 assigned groups. For
STRUCTURE analysis, the data-collecting period was set to 50 000 and
the burn-in period was set to 10 000, 30 000, 40 000, 40 000, and 40 000 for
K = 1, 2, 3, 4, and 5, respectively, where K is the
putative number of ancestral populations; these values were determined by observing the
trend of several summary statistics throughout the iterations of the program (Pritchard et al. 2000). The best-fit
K value was determined using the ΔK estimator (Evanno et al. 2005) and the built-in posterior
probability of STRUCTURE. To calculate the ΔK estimator,
we performed 20 repetitions for each given K value. We summarized the
ancestral components of these 20 repetitions using CLUMPP 1.1.2 (Jakobsson and Rosenberg 2007), for which the
FullSearch algorithm was applied for K = 1 and 2 while the Greedy
algorithm was applied for K = 3, 4, and 5. In addition, we also
quantified the degree of population differentiation between the 2 populations using the
ΦST estimator of the Analysis of Molecular Variance (AMOVA)
implemented in pegas 0.12. The P-value of the
ΦST estimator was calculated from 10 000 permutations.
Demographic History Inference
To infer the demographic history of the 2 P. dasymallus populations, we
compared the folded site frequency spectrum (fSFS) for each population with the simulated
fSFS under the steady-state neutral model. The simulated fSFS of an idealized
Wright-Fisher population was constructed from 1000 simulations generated by the coalescent
simulator MSMS with segregating sites set to the observed values in each
population (Ewing and Hermisson 2010). We
further applied fastsimcoal2 2.6 to infer the demographic history of
P. d. formosus and P. d. inopinatus based on the joint
fSFS of these 2 populations (Excoffier and Foll
2011; Excoffier et al. 2013). A number
of possible demographic models were tested. These models begin with the simplest model
(one population of constant population size over time), then toward several more realistic
models to which population splits, gene flow between split populations, and population
size changes over time in the progeny populations were introduced. For a given model, the
simulated joint fSFS was generated based on 100 000 coalescent simulations assuming the
observed total sequence length. The maximum number of cycles of the conditional
maximization algorithm (ECM) was set to 40 to search for the estimates that best-fit the
observed joint fSFS. This process was repeated 100 times and the estimate with highest
likelihood was selected to represent the model. To determine the model that fits our data
best, all tested models were compared with each other through the Akaike Information
Criterion (AIC; Akaike 1974), calculated using
a custom Python script. For the best-fit model, we performed 100 parametric bootstraps to
construct the confidence intervals (CIs) of each estimated demographic parameter of
interest. The mutation rate (μ) was set to 2.5 × 10–8 per
generation per nucleotide site, which was estimated based on human genomic data (Nachman and Crowell 2000). We also repeated all
simulations by assuming a different mutation rate of 2.0 × 10–9 according to an
estimate from a few introns in vespertilionid and miniopterid bats (Ray et al. 2008). The detailed workflow of all analyses can be
found in Supplementary Figure
S1.
Results
Determining the Optimal Parameters for RADSeq Assembly
We have successfully obtained the genomic sequence reads for all 7 sampled individuals
using ddRADSeq. The total clean base pairs obtained from each individual ranged from 0.91
Gb to 6.90 Gb with an average of 4.82 Gb. The average number of clean reads per genome was
47.28 million (M), ranging from 7.76 to 68.82 M. The detailed information for each
individual genome is given in Supplementary Table S1. To determine the most suitable assembly parameters in
Stacks, we first plotted the sample coverage against the parameter
m (the minimum depth required to start a stack) as shown in Figure 2A. The sample coverage increased with
m but gradually reached a plateau. In contrast, the total number of
polymorphic loci and monomorphic loci decreased considerably with increased
m (Figure 2B). Determining the
suitable m value appeared to become a trade-off between the coverage and
the number of assembled loci. We set m = 3 to be our assembly parameter
since its corresponding value on the curve begins to show a slower increment of sample
coverage. As a result, the average number of sample coverages among the 7 individuals was
38.49 (range 13.10–51.83; see also Supplementary Table S1) and the number of polymorphic loci was ~28 K (Supplementary Figure S1). Figure 2C shows the plot of log-likelihood estimates of
the empirical distribution of SNPs per read based on different sets of
(M, n) given m = 3. The highest
likelihood estimate was found to be at M = 1 and n = 1
within the range of M = 1–7 where n = M
or M+1. A similar result was also observed using the KL divergence
method: the similarity between the empirical distribution of SNPs per read and the
theoretical distribution based on the estimated SNP density also appeared to be the
highest (i.e., the lowest value of KL divergence) at M = 1 and
n = 1 (Figure 2C). We also
computed the log-likelihood estimates of the empirical distribution of SNPs per read based
on Watterson’s theoretical distribution of segregating site, which resulted in the same
optimal parameter set (Supplementary
Figure S2). Consequently, the parameters for Stacks were set to
M = 1, n = 1, and m = 3 for contig
assemblies and SNP callings. Based on the above assembly parameters, we obtained 188,588
RAD loci for a total of 43,675,221 bp with a mean locus length of 231.6 ± 36.7 bp (Figure 2D). Among them, 32,935 SNPs were identified in
28,047 polymorphic RAD loci after a series of quality control steps. The workflow of
sequence assemblies and quality controls is detailed in Supplementary Figure S1.
Figure 2.
Determining the optimal assembly parameters of Stacks and the length
distribution of restriction-site associated DNA (RAD) loci. (A) The
sample coverages under different values of m (the minimum read depth
required to start a stack) with M (the maximum mismatches allowed
within each individual) and n (the maximum mismatches allowed between
each pair of individuals) fixed to 3. The box plot summarizes the coverage of all 7
individuals while the dash indicates the mean value. (B) The number of
polymorphic loci and monomorphic loci under different values of m
given M = n = 3. (C) The multinomial
log-likelihoods and KL divergences under various (M,
n) values with m set to 3. (D) The
distribution of the lengths of each assembled RAD locus without unidentified
nucleotides under the optimal assembly parameters (m = 3 and
M = n = 1).
Determining the optimal assembly parameters of Stacks and the length
distribution of restriction-site associated DNA (RAD) loci. (A) The
sample coverages under different values of m (the minimum read depth
required to start a stack) with M (the maximum mismatches allowed
within each individual) and n (the maximum mismatches allowed between
each pair of individuals) fixed to 3. The box plot summarizes the coverage of all 7
individuals while the dash indicates the mean value. (B) The number of
polymorphic loci and monomorphic loci under different values of m
given M = n = 3. (C) The multinomial
log-likelihoods and KL divergences under various (M,
n) values with m set to 3. (D) The
distribution of the lengths of each assembled RAD locus without unidentified
nucleotides under the optimal assembly parameters (m = 3 and
M = n = 1).
Estimating Levels of Nucleotide Diversity, Inbreeding, and Genetic Structure
The Watterson’s estimator (θ) and the
pairwise differences (θ) were estimated for
P. d. formosus, P. d. inopinatus, and the pooled
population using 32,935 SNPs and 188,588 loci (Table
1). The estimates of Watterson’s θ
appeared to be almost the same but considerably low in both populations (2.14 ×
10–4 and 2.13 × 10–4 per site for the formosus
and inopinatus populations, respectively). Similar
θ values were also observed in the
formosus (2.17 × 10–4) and inopinatus (2.09
× 10–4) populations. Both θ and
θ estimates increased slightly when the 2
populations were pooled (2.39×10–4 and 2.27 × 10–4, respectively).
We further computed Tajima’s D for the 2 populations separately. Although
different signs of Tajima’s D were observed in the
formosus (0.11) and inopinatus (−0.12) populations,
both estimates did not deviate from neutrality (P = 0.88 and 0.95,
respectively). To estimate the level of inbreeding, we computed the fixation index
(F) based on genetically unlinked SNPs (Table 1). Positive F values were observed in both populations
in which the average F of the formosus (0.12) was
slightly higher than that of the inopinatus (0.10).
Table 1.
Estimates of nucleotide diversity, fixation index, and Tajima’s D of
2 subspecies of Pteropus dasymallus
Estimates of nucleotide diversity, fixation index, and Tajima’s D of
2 subspecies of Pteropus dasymallusθ
, Watterson’s estimator; θ,
pairwise difference; F, fixation index; D,
Tajima’s D.To evaluate the genetic structure between these 2 populations, we performed PCA among all
7 individuals. As a result, the formosus and inopinatus
samples were clearly divided into 2 groups at PC1 (P = 0.03 using
Kruskal-Wallis H test), while PC2 and PC3 reflected only the variances of within-group
individuals (Figure 3A and Supplementary Figure S3). We also
conducted STRUCTURE, a model-based analysis for population substructure
detection among these individuals and Figure 3B
illustrates the result at K = 3, which was the best-fit number of
putative ancestral populations (Supplementary Figure S4A and B). Comparable to the result of PCA, explicit
genetic structure was detected between the formosus and
inopinatus populations in which distinct patterns of ancestral
components were observed between them. In contrast, there was no evident genetic structure
within each population. Similar patterns were also observed when
STRUCTURE was performed at K = 2–5 (Supplementary Figure S4C). The
degree of genetic structure between these 2 populations was estimated to be 0.178 based on
the test statistic ΦST (AMOVA), revealing a considerable
proportion of between-population variance among these individuals (P =
0.03; Table 2).
Figure 3.
Principal component analysis (PCA) and STRUCTURE analysis for
detecting genetic structure between Pteropus dasymallus formosus and
P. d. inopinatus. (A) PCA was performed with 3
P. d. formosus and 4 P. d. inopinatus samples.
(B) STRUCTURE result of the 7 P.
dasymallus samples by assuming K = 3 where
K is the number of ancestral populations. Both PCA and
STRUCTURE were performed based on a total of 28,074 genetically
unlinked SNPs from the 7 samples. See online version for full colors.
Table 2.
Analysis of molecular variance (AMOVA) results of Pteropus dasymallus
formosus and Pteropus dasymallus inopinatus
df
Sum of squares
Variance components
Variance %
P-value
Between populations
1
15512.5
1929.2
17.8
0.03
Within populations
5
44490.1
8898.0
82.2
Total
6
60002.6
10827.2
Analysis of molecular variance (AMOVA) results of Pteropus dasymallus
formosus and Pteropus dasymallus inopinatusPrincipal component analysis (PCA) and STRUCTURE analysis for
detecting genetic structure between Pteropus dasymallus formosus and
P. d. inopinatus. (A) PCA was performed with 3
P. d. formosus and 4 P. d. inopinatus samples.
(B) STRUCTURE result of the 7 P.
dasymallus samples by assuming K = 3 where
K is the number of ancestral populations. Both PCA and
STRUCTURE were performed based on a total of 28,074 genetically
unlinked SNPs from the 7 samples. See online version for full colors.
Inferring Demographic History
To uncover their demographic history, we first estimated the fSFS of P. d.
formosus and P. d. inopinatus separately and detected
significant differences between them (Figure 4A). The
fSFS of P. d. formosus possessed excessive intermediate-frequency alleles
compared to the simulated fSFS under the neutral model of constant population size over
time (P = 10–62 using chi-square goodness of fit test), which
is consistent with the population decline that has been observed in the field during the
1970s. In contrast, the fSFS of P. d. inopinatus carried excessive
low-frequency alleles in comparison with the simulated steady-state fSFS
(P = 10–19), which is suggestive of a population
expansion.
Figure 4.
Folded site frequency spectra (fSFS) of Pteropus dasymallus formosus
and P. d. inopinatus. (A) The fSFS of 3 P. d.
formosus and 4 P. d. inopinatus were inferred separately
based on 21,106 and 24,020 SNPs, respectively. Both spectra were compared to the fSFS
of 1000 coalescent simulations with constant population size and the identical
segregating sites of the observed fSFS. Chi-square goodness of fit test was performed
on each fSFS; both spectra deviate from the steady-state neutral model
(P = 10–62 for P. d. formosus and
P = 10–19 for P. d. inopinatus).
(B) The joint fSFS of P. d. formosus and P. d.
inopinatus were constructed from 32,935 SNPs. The heatmap indicates the
proportions of SNP counts across different joint frequency classes. See online version
for full colors.
Folded site frequency spectra (fSFS) of Pteropus dasymallus formosus
and P. d. inopinatus. (A) The fSFS of 3 P. d.
formosus and 4 P. d. inopinatus were inferred separately
based on 21,106 and 24,020 SNPs, respectively. Both spectra were compared to the fSFS
of 1000 coalescent simulations with constant population size and the identical
segregating sites of the observed fSFS. Chi-square goodness of fit test was performed
on each fSFS; both spectra deviate from the steady-state neutral model
(P = 10–62 for P. d. formosus and
P = 10–19 for P. d. inopinatus).
(B) The joint fSFS of P. d. formosus and P. d.
inopinatus were constructed from 32,935 SNPs. The heatmap indicates the
proportions of SNP counts across different joint frequency classes. See online version
for full colors.Since these 2 populations share a common ancestor, we further constructed a joint fSFS
for our samples to infer their demographic history, time of divergence, and the level of
gene flow between populations using fastsimcoal2 (Figure 4B). The joint fSFS separates private and shared variants of the
2 populations and increases the number of frequency classes. Consequently, the statistical
power enhances considerably for detecting population demography and can thus be used to
infer complex demographic events such as population divergence and admixture. We performed
fastsimcoal2 likelihood estimations on a series of demographic models
listed in Table 3 assuming μ = 2.5
× 10–8 per site per generation. To determine the best model, the Akaike weights
(w) of these models were calculated and,
as a result, Model F was the model that best-fit the observed joint fSFS among all models
(w ≈ 1.0), whereas the other models fit
the data significantly worse (w ≤
1.1×10–19; see Table 3). Model F
assumed an ancestral population split into 2 progeny populations (i.e.,
formosus and inopinatus) and introduced parameters that
describe a recent population decline in P. d. formosus and a population
expansion in P. d. inopinatus, as well as gene flow between these 2
progeny populations (Figure 5A). Under Model F, the
maximum likelihood estimate (MLE) of the ancestral effective population size
(N) of the formosus and
inopinatus populations was estimated to be as large as 9,468 (Table 4). The effective population sizes dropped to
2324 and 2110 for formosus
(N) and inopinatu
(N), respectively, after they split. The
split time was dated to be 8518 generations ago. However, the formosus
population experienced a very recent and severe decline 4 generations ago
(t), which led to its low current
population size (N) of 223. In contrast, the
inopinatus population went through a population expansion 441
generations ago (t) and the current
effective population size (N) was estimated
to be 9547. The level of gene flow between the 2 populations was limited. The estimated
migration rate from the inopinatus population to the
formosus population was 1.0×10–3 (per generation) and that
of the reverse direction was 2.7×10–4. The prior search ranges for these MLEs
are provided in Supplementary Table
S2. We further constructed the bootstrap distributions (Figure 5B) and computed the 95% CIs for these parameters (Table 4).
Table 3.
Demographic model comparison of Pteropus dasymallus formosus and
Pteropus dasymallus inopinatus
Model A illustrates a population with constant size from which samples of
P. d. formosus and P. d. inopinatus were drawn
randomly. The rest of models describe one ancestral population split into 2 progeny
populations (“f” represents P. d. formosus and
“i” represents P. d. inopinatus). Models C–F and
H introduce gene flow (indicated by arrows) between the 2 progeny populations.
Models D–H allow population size changes in at least one progeny population.
Figure 5.
The best-fit demographic model and estimates of demographic parameters.
(A) Demographic history and estimates of demographic parameters were
inferred based on the observed joint fSFS of Pteropus dasymallus
formosus and P. d. inopinatus by assuming the best-fit
Model F (Table 3) using
fastsimcoal2. The maximum likelihood estimates (MLEs) of population
size (N), demographic event times (t with unit of
generation), and per-generation migration rate (m) are shown. Note
that the times of recent demographic events are not scaled to better demonstrate the
changes in the population sizes of P. d. formosus and P. d.
inopinatus. (B) Bootstrap distributions and quantiles for each demographic
parameter. Parametric bootstraps were performed 100 times based on Model F. Their
distributions and 5%, 25%, 50%, 75%, and 95% quantiles are shown. The white dot
represents the MLEs of each parameter. All inferred demographic parameters are labeled
as: N = population size of the common
ancestral population of the formosus (f) and
inopinatus (i);
N = the ancestral population size
of the formosus; N =
the ancestral population size of the inopinatus;
N = the current population size of
the formosus; N = the
current population size of the inopinatus;
t = the time of the divergence of
the 2 population; t = the time of a
recent decline in the formosus;
t = the time of a recent
expansion in the inopinatus;
m→ =
migration rate from the formosus to inopinatus;
m→ =
migration rate from the inopinatus to formosus.
Table 4.
Maximum likelihood estimates of demographic parameters of Pteropus dasymallus
formosus and Pteropus dasymallus inopinatus based on Model
F
Parameter
MLE
95% CI
Na
9468
(6014, 13599)
Nf
223
(201, 266)
Nf.a
2324
(2107, 2427)
Ni
9547
(8118, 11405)
Ni.a
2110
(2025, 2381)
tdiv
8518
(7823, 16480)
tf.de
4
(1, 5)
ti.ex
441
(369, 493)
mf→ i
2.7×10–4
(2.2 × 10–4, 3.0 × 10–4)
mi→ f
1.0×10–3
(8.6 × 10–4, 1.0 × 10–3)
Demographic model comparison of Pteropus dasymallus formosus and
Pteropus dasymallus inopinatusModel A illustrates a population with constant size from which samples of
P. d. formosus and P. d. inopinatus were drawn
randomly. The rest of models describe one ancestral population split into 2 progeny
populations (“f” represents P. d. formosus and
“i” represents P. d. inopinatus). Models C–F and
H introduce gene flow (indicated by arrows) between the 2 progeny populations.
Models D–H allow population size changes in at least one progeny population.Maximum likelihood estimates of demographic parameters of Pteropus dasymallus
formosus and Pteropus dasymallus inopinatus based on Model
FThe best-fit demographic model and estimates of demographic parameters.
(A) Demographic history and estimates of demographic parameters were
inferred based on the observed joint fSFS of Pteropus dasymallus
formosus and P. d. inopinatus by assuming the best-fit
Model F (Table 3) using
fastsimcoal2. The maximum likelihood estimates (MLEs) of population
size (N), demographic event times (t with unit of
generation), and per-generation migration rate (m) are shown. Note
that the times of recent demographic events are not scaled to better demonstrate the
changes in the population sizes of P. d. formosus and P. d.
inopinatus. (B) Bootstrap distributions and quantiles for each demographic
parameter. Parametric bootstraps were performed 100 times based on Model F. Their
distributions and 5%, 25%, 50%, 75%, and 95% quantiles are shown. The white dot
represents the MLEs of each parameter. All inferred demographic parameters are labeled
as: N = population size of the common
ancestral population of the formosus (f) and
inopinatus (i);
N = the ancestral population size
of the formosus; N =
the ancestral population size of the inopinatus;
N = the current population size of
the formosus; N = the
current population size of the inopinatus;
t = the time of the divergence of
the 2 population; t = the time of a
recent decline in the formosus;
t = the time of a recent
expansion in the inopinatus;
m→ =
migration rate from the formosus to inopinatus;
m→ =
migration rate from the inopinatus to formosus.We repeated all the simulations for inferring the demographic history with a different
mutation rate (μ = 2.0 × 10–9) that was estimated from 4
introns in vespertilionid and miniopterid bats (Ray et
al. 2008), and found that Model F remained the fittest demographic scenario among
all models tested. However, by assuming a lower mutation rate, the estimated effective
population sizes and dates of demographic events appeared to be 6–66 times larger and
2–300 times older, respectively, than those under the assumption of μ =
2.5 × 10–8 (Supplementary
Table S3).
Discussion
Low Level of Genetic Diversity and High Degree of Inbreeding in P. d.
formosu and P. d. inopinatus
In this study, we have successfully applied ddRADSeq to uncover genetic variants from 3
P. d. formosus and 4 P. d. inopinatus individuals.
Given that both subspecies are protected by local conservation laws, it is difficult to
obtain any new sample from the natural population. Nonetheless, we were able to obtain a
sufficiently large number of SNPs, which allowed us to obtain and compare several key
population genetics estimates between these 2 closely related populations. A very low
level of nucleotide diversity (θ = 2.14 ×
10–4 per site) was detected in the formosus population
(Table 1), which is ~10-fold lower than the mean
θ of Lasiurus cinereus
(2.57×10–3) and Lasiurus borealis (2.14 × 10–3),
2 widespread continental vespertilionid bats in America (Sovic et al. 2016; also see Supplementary Table S4). Surprisingly, P. d.
inopinatus, a larger population than P. d. formosus, also
harbors a similarly low level of nucleotide diversity
(θ = 2.13 × 10–4). Tajima (1989b) pointed out that changes in
nucleotide polymorphism estimates are slow in response to population fluctuations under
small sample sizes. The similarity in nucleotide diversity between these 2 populations can
be partially attributed to the small sample sizes in both populations. With a small number
of sample sizes, genetic variants with intermediate allele frequency in a population are
more likely to be sampled than those with low allele frequency. Therefore, the nucleotide
diversity estimates are less sensitive for detecting alteration in nucleotide diversity
caused by low-frequency alleles owing to recent change of population sizes. In this study,
we indeed detected signatures of recent population fluctuations in both populations based
on the joint fSFS from which the estimated population sizes are similar in both
populations prior to their recent demographic changes. Therefore, the estimates of
nucleotide diversity could differ considerably between P. d. formosus and
P. d. inopinatus with a larger sample size.Nucleotide diversity could be underestimated using ddRADSeq due to allele dropout (ADO),
that is, restriction enzymes fail to cleave their recognition sites because of mutations
(Arnold et al. 2013; Gautier et al. 2013; Cariou et
al. 2016). However, the effects of ADO are only significant when the sequence
divergence is >2%, and can be mitigated by the use of complete sampling of genetic
variants (Cariou et al. 2016). Therefore, our
estimates of nucleotide diversity were most likely little affected by ADO.Our estimated fixation index (F) revealed a considerable amount of
homozygosity departing from HWE in both populations (0.12 for the
formosus and 0.10 for the inopinatus). Based on the
results of PCA and STRUCTURE, we did not detect any obvious genetic
structure within each population (Figure 3).
Therefore, the observed excessive homozygosity in each population can be better explained
as the outcome of inbreeding. Insular populations are known to be more inbred (and thus
more prone to extinction) than mainland populations due to their smaller population sizes
and founder effect (Frankham 2008). In the
present study, we detected rather high levels of inbreeding in both P. d.
formosus and P. d. inopinatus, suggesting that both
populations might be vulnerable to the risk of inbreeding depression. However, this
concern of possible inbreeding depression is based only on a small sample size number.
Therefore, additional samples would be needed to confirm our estimates.
Distinct Recent Demographic History in P. d. formosus and P.
d. inopinatus
We inferred the demographic history based on the joint fSFS of P. d.
formosus and P. d. inopinatus by comparing the likelihood
estimates among several candidate models. The best-fit demographic model appeared to be
the population split model that assumes a recent population decline in the
formosus and a recent population expansion in the
inopinatus irrespective of the underlying assumption of mutation rates
(Table 3). It is important to point out that MLEs
for most parameters vary considerably with different mutation rates assumed in the model.
Particularly, the current population size of P. d. formosus
(N) becomes as large as 14 719 by assuming
μ = 2.0×10–9, whereas
N is only 223 when assuming
μ = 2.5 × 10–8 (Table
4 and Supplementary Table
S3). The assumed mutation rate of 2.0 × 10–9 was based on the study of
only 4 introns in several species of vespertilionid and miniopterid bats (Ray et al. 2008), while the mutation rate of 2.5 ×
10–8 was estimated from many pseudogenes across multiple chromosomes of human
genomes. Given that estimates of the nuclear mutation rates in Pteropodidae species are
currently unavailable, we are slightly in favor of reporting the MLEs of demographic
parameters under the assumption of μ = 2.5 × 10–8 due to its
genome-wide approach. We consider that this higher mutation rate is a conservative
approach for estimating the population size, which is also in good agreement with the
observed population status of P. d. formosus in the field.Lin and Pei (1999) reported that P. d.
formosus was once abundant on Green Island (the major habitat of
formosus) but could be barely sighted after the 1970s. However, it is
unclear whether the population on Green Island actually declined or this habitat was
simply abandoned. In this study, our best-fit demographic model suggested a very recent
population decline in the formosus population. The MLE for the time of
population collapse (t) is 4 generations
ago, which would be 28 years ago assuming a generation time of 7 years (Fox et al. 2008; Tidemann and Nelson 2011; Vincenot et al. 2017). The 95% CI spans from 7 years ago to 35 years ago,
supporting that the event likely occurred very recently (see Figure 5B and Table 4). The magnitude of
the decline was estimated to be ~10-fold, dropping from 2324
(N) to 223
(N) individuals. Therefore, it is more
likely that the Green Island population did suffer from a severe population decline rather
than habitat abandonment based on our findings. In contrast, we detected that P.
d. inopinatus on Okinawa Island may have experienced a recent population
expansion from 2,110 (N) to 9,547
(N) (a ~4.5-fold increase) based on the
MLEs of the observed fSFS. The time of this population expansion was estimated to be 441
generations ago (~3 kya).Our estimates also suggest that these 2 populations diverged around 8,518 generations ago
(~60 kya) with a 95% CI spanning from ~55 kya to ~115 kya in the late Pleistocene (Figure 5 and Table
4). During that time, the widening Kerama Gap between Okinawa Island and Yaeyama
Islands likely served as a geographic barrier to prevent gene flow between these 2
populations (Kizaki and Oshiro 1977; Ota 1998). Our results are compatible with previous
studies of the divergence of amphibians, reptiles, and the elegant scops owl (Otus
elegans) between the Central and Southern Ryukyus (Ota 1998 2000; Hsu
2005), supporting that deep-sea channels, such as the Kerama Gap, serve as
effective geographic barriers to drive the genetic differentiation of island species, even
for species with high migration ability like flying foxes. After the divergence of
P. d. formosus and P. d. inopinatus, both populations
suffered from population declines and had been maintained at relatively small population
sizes for a long period of time, which might have led to their present low levels of
nucleotide diversity.It should be noted that our inferences of demographic model and parameter estimates are
based on the joint fSFS of a relatively small number of samples (n) in
both populations (2n = 6 and 8 for P. d. formosus and
P. d. inopinatus, respectively). Although Beichman et al. (2018) noted that recent changes in population size
are relatively difficult to capture by SFS-based methods with small sample sizes,
theoretical studies have shown that the configuration of SFS is sensitive to subtle
changes in population genetic parameters (e.g., effective population size,
Ne) and the power of detecting such changes in SFS depends
more on the number of SNPs rather than on the sample size (Akashi 1999). In the study by Akashi (1999), the number of haplotypes used in the simulations can be as small
as 5 for successfully detecting changes in SFS with 2500 nucleotide sites. By analyzing
~30 000 SNPs, Robinson et al. (2014) also
showed that a small sample size (as low as 3 individuals) is sufficient to detect the
correct model of recent population decline and to accurately estimate the decline
time.Although the small sample sizes in our study may have potentially reduced the statistical
power to distinguish between the competing demographic models (causing false negative
outcomes), we found that the best model appears to fit significantly better than the
remaining models (Table 3). This suggests that the
effects of the underlying demographic events (in both populations) on the observed SFS
were strong and, therefore, allowed us to determine the best model. Finally, the validity
of our inferred model and MLEs could also suffer from biased samplings due to small sample
sizes. While our samples in both populations were collected from different locations and
time points (Figure 1), future studies will focus on
collecting more individuals from each of the 5 P. dasymallus subspecies
populations.Since the key parameters assumed during the RADSeq assembly could potentially bias
population genetics statistics (Harvey et al.
2015; Mastretta-Yanes et al. 2015;
Shafer et al. 2017), we also explored other
combinations of assembly parameter sets (m, M,
n) in Stacks, including the optimal parameters
determined by the procedure of Paris et al.
(2017). Subsequently, we repeated all coalescent simulations (using
fastsimcoal2) to infer the demographic history of both populations. As
a result, Model F remained the best-fit demographic scenario under the assembly parameters
of m = 3, M = 2, and n = 3 (referred to
as m3M2n3 hereafter; Supplementary Table S5). Most
inferred MLEs were also comparable between the 2 optimal parameter sets with differences
in the range of 0.64–2.29-fold changes (Supplementary Figure S5), except for the time of population decline in
P. d. formosus (t).
The difference in t was as large as
11.8-fold (t = 4 and 47 for
m3M1n1 and
m3M2n3, respectively; see Table 4 and Supplementary Tabe S2). In either case, we consider that the
population of P. d. formosus experienced a recent population decline. We
also tested the sensitivity of the estimates of nucleotide diversity, inbreeding
coefficient, and between-populations differentiation (ΦST)
under these 2 assembly parameter sets. These parameters remained robust using different
assembly parameters. The detailed results are provided in Supplementary Figures S6 and S7.
Strong Genetic Structure Between P. d. formosus and P. d.
inopinatus
Our results from PCA and STRUCTURE analyses provided evidence of genetic
structure between the formosus and inopinatus
populations (Figure 3). AMOVA also detected
significant between-population variance (Table 3).
Particularly, the results of STRCUTURE showed distinct patterns of
genetic ancestries for individuals between these 2 populations, while little individual
variation was observed within each population (Figure
3B). The evidence of genetic structure remained robust regardless of the number
of ancestral populations assumed in STRUCTURE (Supplementary Figure S4C). In Figure 3B, it is interesting to point out that, while
STRUCTURE assigned only one major ancestry to the individuals of
P. d. formosus, 2 major ancestries were assigned to all 4 individuals
of P. d. inopinatus. However, the observed pattern of mixed ancestry in
the P. d. inopinatus individuals is not necessarily the outcome of
population admixture, but may also result from shared ancestries with other populations
that are closely related to P. d. inopinatus. Under this scenario, the
proportion of mixed ancestries for a given individual genome reflects the phylogenetic
distance rather than admixture fraction of populations (Lawson et al. 2018). To unravel the most likely demographic
scenario resulting in the observed mixed ancestry, more samples from the other neighboring
subspecies populations should be included and tested with methods specific for detecting
admixtures (e.g., Patterson et al. 2012).
Nevertheless, our results provide strong evidence of genetic structure between the
formosus and inopinatus populations with limited gene
flow between them (Table 2). The migration rate
from the inopinatus to formosus
(m = 1.0 ×
10–3 per generation) is about 3 times higher than that of the reverse
direction (m = 2.7 ×
10–4; see Figure 5 and Table 4). Similarly, by analyzing microsatellite and
mitochondrial loci, Chen et al. (2021) also
suggested that the formosus is genetically differentiated from the
inopinatus. Therefore, based on our molecular evidence, P. d.
formosus and P. d. inopinatus perhaps should be treated as
distinct ESUs and managed separately in terms of conservation. In contrast, P. d.
formosus individuals from Green Island and the eastern coast of Taiwan could be
managed together since there is no apparent genetic structure within them.The Yaeyama flying fox (P. d. yayeyamae) inhabiting Yaeyama Islands and
Miyako Islands between Taiwan and Okinawa Island is the geographically closest P.
dasymallus subspecies to P. d. formosus. Chen et al. (2021) observed strong population substructure between
the yayeyamae and inopinatus but nonsignificant
differentiation between the yayeyamae and formosus.
Taki et al. (2020, unpublished data) detected
genetic differentiations among the 3 yayeyamae populations inhabiting
Yaeyama Islands: Miyako, Ishigaki-Taketomi-Kohama-Iriomote, and Yonaguni (from east to
west). Among them, the Yonaguni population, which is geographically closest to Taiwan
(~110 km apart), showed the highest differentiation from the other 2
yayeyamae populations. Therefore, it is possible that the
formosus population is also genetically distinct from the
yayeyamae populations inhabiting the islands of Miyako, Ishigaki,
Taketomi, Kohana, and Iriomote. Future studies should focus on elucidating the levels of
genetic structure between P. d. formosus and these
yayeyamae populations.In conclusion, we have revealed critical information for the conservation of the
endangered Formosan flying fox and the Orii’s flying fox. Pteropus dasymallus
formosus suffers from low genetic diversity and high inbreeding pressure,
likely resulting from its long-term small population history; the situation was
exacerbated by a drastic and very recent population decline. Conservation acts for its
population recovery are urgently needed. Habitat should be preserved and recovered on
Green Island as well as the eastern coast of Taiwan. In contrast, P. d.
inopinatus is a relatively stable population with recent growth. However, the
current population bears low genetic diversity and inbreeding pressure. Long-term
monitoring for the potential adverse consequences of these genetic factors should be
considered. Outcrossing between P. d. formosus and P. d.
inopinatus should be prevented before conducting any further studies assessing
the degree of outbreeding depression between these subspecies.Click here for additional data file.
Authors: David A Ray; Cedric Feschotte; Heidi J T Pagan; Jeremy D Smith; Ellen J Pritham; Peter Arensburger; Peter W Atkinson; Nancy L Craig Journal: Genome Res Date: 2008-03-13 Impact factor: 9.043
Authors: Julian M Catchen; Angel Amores; Paul Hohenlohe; William Cresko; John H Postlethwait Journal: G3 (Bethesda) Date: 2011-08-01 Impact factor: 3.154