| Literature DB >> 35754729 |
Katherine A Farquharson1, Elspeth A McLennan1, Yuanyuan Cheng1, Lauren Alexander1, Samantha Fox2,3, Andrew V Lee2,3, Katherine Belov1, Carolyn J Hogg1,4.
Abstract
Conservation breeding programs aim to maintain 90% wild genetic diversity, but rarely assess functional diversity. Here, we compare both genome-wide and functional diversity (in over 500 genes) of Tasmanian devils (Sarcophilus harrisii) within the insurance metapopulation and across the species' range (64,519 km2). Populations have declined by 80% since 1996 due to a contagious cancer, devil facial tumor disease (DFTD). However, predicted local extinctions have not occurred. Recent suggestions of selection for "resistance" alleles in the wild precipitated concerns that insurance population devils may be unsuitable for translocations. Using 830 wild samples collected at 31 locations between 2012 and 2021, and 553 insurance metapopulation devils, we show that the insurance metapopulation is representative of current wild genetic diversity. Allele frequencies at DFTD-associated loci were not substantially different between captive and wild devils. Methods presented here are valuable for others investigating evolutionary potential in threatened species, particularly ones under significant selective pressures.Entities:
Keywords: Animals; Genetics; Zoology
Year: 2022 PMID: 35754729 PMCID: PMC9218385 DOI: 10.1016/j.isci.2022.104474
Source DB: PubMed Journal: iScience ISSN: 2589-0042
Figure 1Genetic diversity across the range of the Tasmanian devil
(A) Image of a DFTD-free Tasmanian devil.
(B) Image of DFTD-infected devil.
(C) Map of sampling locations, with size of points proportional to the number of samples sequenced. Scale bar represents kilometers and arrow indicates North.
(D) PCoA of genome-wide diversity with all wild devils sequenced using DArTseq (N = 1780 loci, 830 devils), with samples shaded by their location as per the sampling map.
(E) PCoA of functional diversity at immune and reproductive genes with all wild devils sequenced using the target capture approach (N = 260 loci, 105 devils).
(F) PCoA of functional diversity at DFTD-associated loci with all wild devils sequenced using the target capture approach (N = 53 loci, 105 devils).
(G) PCoA of genome-wide diversity with wild devils shown in gray (N = 830), the zoo-based insurance population (N = 299) in pale orange, and Maria Island (N = 254) in dark orange sequenced using DArTseq (N = 1819 loci). (H) PCoA of functional diversity at immune and reproductive genes with wild devils shown in gray (N = 105), the zoo-based insurance population (N = 28) in pale orange, and Maria Island (N = 25) in dark orange sequenced using the target capture approach. (I) PCoA of functional diversity at DFTD-associated loci with wild devils shown in gray (N = 105), the zoo-based insurance population (N = 28) in pale orange, and Maria Island (N = 25) in dark orange sequenced using the target capture approach.
Figure 2Genetic clusters across the species’ range
fastSTRUCTURE plot showing the six different population clusters across the state as indicated by the map, in addition to the insurance population (IP) and Maria Island. The cluster assignment of each population is provided in Table 1; dotted lines are indicative of cluster boundaries.
Sample details and diversity statistics for genome-wide SNPs generated via DArTseq
| Location (primary | N | Sampling Years | HO (SE) | HE (SE) | HS (SE) | FIS (95% CI) | Ne (95% CI) | Harmonic mean N |
|---|---|---|---|---|---|---|---|---|
| Antill Ponds (3) | 2 | 2016–2019 | ||||||
| Bronte (3) | 39 | 2018–2020 | 0.173 (0.004) | 0.182 (0.004) | 1.002 (0.073) | 0.0531 (0.0166; 0.0949)∗ | 48.8 (28.2; 115.4) | 33.3 |
| Buckland (3) | 55 | 2015–2020 | 0.180 (0.004) | 0.184 (0.004) | 1.072 (0.034) | 0.0186 (−0.0071; 0.0474) | 64.8 (44.3; 109.0) | 50 |
| Cherry Tree Hill (3) | 1 | 2019 | ||||||
| Cockle Creek (3) | 1 | 2020 | ||||||
| Coles Bay (4) | 3 | 2018–2019 | ||||||
| Fentonbury (3) | 75 | 2018–2019 | 0.176 (0.004) | 0.181 (0.004) | 1.035 (0.036) | 0.0275 (0.0068; 0.0493)∗ | 55.1 (43.2; 73.0) | 63.7 |
| Fingal Valley (4) | 13 | 2012 | 0.182 (0.005) | 0.177 (0.004) | 1.084 (0.037) | −0.0241 (−0.0758; 0.0103) | 208 (85.4; ∞) | 12.1 |
| Forestier/Murdunna (4) | 24 | 2011–2012 | 0.159 (0.004) | 0.161 (0.004) | 0.928 (0.074) | 0.0114 (−0.0528; 0.0753) | 18.8 (11.6; 35.8) | 18.8 |
| Granville Harbor (1) | 80 | 2016–2019 | 0.158 (0.004) | 0.166 (0.004) | 0.936 (0.029) | 0.0476 (0.0224; 0.0724)∗ | 44.1 (35.6; 55.7) | 70.8 |
| Hobart Area (3) | 5 | 2014–2018 | ||||||
| IP | 299 | 2014–2021 | 0.172 (0.004) | 0.186 (0.004) | 0.999 (0.025) | 0.0747 (0.0612; 0.0892)∗ | 42.0 (26.8; 65.0) | 261.1 |
| Kempton/Brighton (3) | 38 | 2016–2019 | 0.181 (0.004) | 0.187 (0.004) | 1.081 (0.035) | 0.0278 (0.0039; 0.0500)∗ | 114.2 (73.8; 248.5) | 35.4 |
| Launceston (3) | 2 | 2019 | ||||||
| Lucaston (3) | 1 | 2018 | ||||||
| Lyell Highway (3) | 1 | 2020 | ||||||
| Maria Island | 254 | 2012–2021 | 0.164 (0.004) | 0.169 (0.004) | 0.974 (0.015) | 0.0321 (0.0211; 0.0436)∗ | 31.7 (28.5; 35.4) | 231.2 |
| Mayfield (3) | 1 | 2014 | ||||||
| Narawntapu (5) | 97 | 2014–2021 | 0.177 (0.004) | 0.181 (0.004) | 1.030 (0.035) | 0.0228 (0.0011; 0.0466)∗ | 32.3 (26.8; 39.5) | 81.9 |
| Robbins Island (1) | 20 | 2018 | 0.126 (0.004) | 0.128 (0.004) | 0.745 (0.040) | 0.0127 (−0.0261; 0.0488) | 29.8 (18.4; 62.4) | 18.1 |
| Rosebery/Zeehan (1) | 2 | 2015–2016 | ||||||
| Sandy Cape (1) | 2 | 2014 | ||||||
| Silver Plains (3) | 20 | 2020 | 0.169 (0.004) | 0.175 (0.004) | 1.004 (0.056) | 0.0369 (−0.0089; 0.0765) | 51.6 (26.3; 243.9) | 17.2 |
| South West (2) | 14 | 2017 | 0.133 (0.005) | 0.127 (0.004) | 0.779 (0.052) | −0.0479 (−0.1217; 0.0145) | 11.2 (5.6; 27.8) | 11.9 |
| Stanley (1) | 1 | 2020 | ||||||
| Stony Head (6) | 188 | 2014–2021 | 0.170 (0.004) | 0.173 (0.004) | 1.006 (0.013) | 0.0205 (0.0080; 0.0337)∗ | 31.8 (23.1; 43.7) | 172.1 |
| Strathgordon (3) | 9 | 2020 | 0.184 (0.005) | 0.169 (0.004) | 1.088 (0.047) | −0.0856 (−0.1755; −0.0364) | 12.1 (2.8; ∞) | 7.5 |
| Takone (1) | 4 | 2016 | ||||||
| Tasman Peninsula (4) | 22 | 2018–2019 | 0.155 (0.005) | 0.152 (0.004) | 0.913 (0.068) | −0.0182 (−0.0784; 0.0393) | 10.9 (6.3; 20.3) | 19.7 |
| Tomahawk (4) | 1 | 2018 | ||||||
| Woolnorth (1) | 24 | 2014–2019 | 0.140 (0.004) | 0.140 (0.004) | 0.832 (0.029) | −0.0010 (−0.0374; 0.0290) | 89.9 (55.1; 225.1) | 22.6 |
| wukalina (4) | 82 | 2017–2020 | 0.161 (0.004) | 0.175 (0.004) | 0.937 (0.042) | 0.0811 (0.0527; 0.1115)∗ | 41.8 (33.0; 54.3) | 65.7 |
| Macquarie Heads | 3 | 2008 |
N refers to sample size; HO, observed heterozygosity; SE, standard error; HE, expected heterozygosity; HS, standardized heterozygosity; FIS, inbreeding coefficient; 95% CI represent the lower and upper 95% confidence intervals; Ne, effective population size; and the harmonic mean N is the weighted sample size used in the calculation of Ne, adjusted for missing data. ∗denotes statistical significance at α = 0.05 based on bootstrapped 95% CI not including zero.
Samples collected as part of the founder intake for the Insurance population during 2008. These samples were included to understand how devils were moving into South Western Tasmania.
Figure 3Genetic diversity at each sampling location
Locations are ordered from west to east with the insurance population (IP) and Maria Island at the far right. Locations are shaded as per Figure 1C. (A) Standardized heterozygosity (HS) measured by 1819 genome-wide SNPs; error bars represent standard errors. (B) Standardized heterozygosity measured by 260 immune and reproductive SNPs sequenced using the target capture approach; error bars represent standard errors. (C) Inbreeding (FIS) measured by 1819 genome-wide SNPs. Statistically significant inbreeding estimates are indicated by ∗ and determined as the 95% CI (error bars) obtained by bootstrapping excluding zero.
Figure 4Functional diversity at four immune gene families
Each column represents a sampling location, with top row shaded as per Figure 1C. Each row represents a haplotype. Haplotypes are shaded and ordered by their frequencies (darker blue = more frequent). The dendrogram groups sampling locations by similarity. A full-sized heatmap with haplotype names is provided in Figures S3–S6.
Figure 5Frequency of common wild allele at DFTD-associated loci
Heatmap of the frequency of the most common wild allele at the 53 DFTD-associated loci for the wild clusters; insurance population (IP) and Maria Island. Each column represents a cluster (sampling locations designated from results of fastSTRUCTURE analysis, see Table 1 for cluster assignments). Each row represents a DFTD-associated locus (labeled by genomic position). The dendrogram groups clusters by similarities in the frequencies of the common allele. Loci are shaded by the frequency of the most common wild allele (darker purple = more frequent) at each cluster. The genomic positions of outlier SNPs identified from the FST analysis are bolded.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| MagAttract HMW DNA kit | Qiagen, Germany | Cat #:67563 |
| KAPA HyperPlus Kit | Roche | Cat #:0896385001 |
| myBaits Custom DNA-seq kit | Arbor Biosciences | Cat #:300596.v5 |
| dsDNA HS Assay Kit | ThermoFisher Scientific | Cat #:Q32851 |
| Quantifast Sybr Green PCR MasterMix | Qiagen | Cat #: 204054 |
| Target capture genotypes | This paper | DRYAD: |
| DArTseq data genotypes | This paper | DRYAD: |
| Tasmanian devil reference genome mSarHar1.11 | Ensembl | Ensembl: GCF_902635505.1 |
| Probes for custom target capture baits | This paper | Supplementary Data 1 and DRYAD ( |
| Stacks v2.53 | ||
| FastQC | ||
| MultiQC | ||
| Trimmomatic v0.38 | ||
| Picard v2.18.4 | Broad Institute Github Repository | |
| ngsCAT v0.1 | ||
| GATK v4.1.9.0 | ||
| PHASE v2.1.1 | ||
| BEDTools v2.25.0 | ||
| VCFtools v0.1.14 | ||
| SAMtools v1.7 | ||
| BWA v0.7.17 | ||
| R v4.1.1 | ||
| GenAlEx v6.502 | ||
| NeEstimator v2.1 | ||
| PMx | ||
| Illumina NovaSeq 6000 | Illumina | |
| Illumina HiSeq 2000 | Illumina | |