| Literature DB >> 32231688 |
Karim Karimi1, A Hossain Farid1, Mehdi Sargolzaei2,3, Sean Myles4, Younes Miar1.
Abstract
Knowledge of linkage disequilibrium (LD) patterns is necessary to determine the minimum density of markers required for genomic studies and to infer historical changes as well as inbreeding events in the populations. In this study, we used genotyping-by-sequencing (GBS) approach to detect single nucleotide polymorphisms (SNPs) across American mink genome and further to estimate LD, effective population size (Ne), and inbreeding rates based on excess of homozygosity (FHOM) and runs of homozygosity (ROH). A GBS assay was constructed based on the sequencing of ApeKI-digested libraries from 285 American mink using Illumina HiSeq Sequencer. Data of 13,321 SNPs located on 46 scaffolds was used to perform LD analysis. The average LD (r 2 ± SD) between adjacent SNPs was 0.30 ± 0.35 over all scaffolds with an average distance of 51 kb between markers. The average r 2 < 0.2 was observed at inter-marker distances of >40 kb, suggesting that at least 60,000 informative SNPs would be required for genomic selection in American mink. The Ne was estimated to be 116 at five generations ago. In addition, the most rapid decline of population size was observed between 100 and 200 generations ago. Our results showed that short extensions of homozygous genotypes (500 kb to 1 Mb) were abundant across the genome and accounted for 33% of all ROH identified. The average inbreeding coefficient based on ROH longer than 1 Mb was 0.132 ± 0.042. The estimations of FHOM ranged from -0.44 to 0.34 among different samples with an average of 0.15 over all individuals. This study provided useful insights to determine the density of SNP panel providing enough statistical power and accuracy in genomic studies of American mink. Moreover, these results confirmed that GBS approach can be considered as a useful tool for genomic studies in American mink.Entities:
Keywords: American mink; effective population size; genotyping-by-sequencing; inbreeding rate; linkage disequilibrium; single nucleotide polymorphism
Year: 2020 PMID: 32231688 PMCID: PMC7083153 DOI: 10.3389/fgene.2020.00223
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
The average r2 and D′ between adjacent SNPs on each scaffold in American mink genome and the percentage of SNP pairs with r2 > 0.2 and r2 > 0.8.
| Scaffold number | Scaffold length (Mb) | Number of SNPs | Average | Average D′ ± SD | ||
| Scaffold 1 | 37.08 | 503 | 0.33 ± 0.36 | 0.82 ± 0.29 | 45.45 | 19.81 |
| Scaffold 2 | 29.80 | 406 | 0.29 ± 0.37 | 0.79 ± 0.29 | 35.65 | 19.13 |
| Scaffold 3 | 24.17 | 456 | 0.32 ± 0.35 | 0.83 ± 0.28 | 44.78 | 18.26 |
| Scaffold 4 | 24.19 | 311 | 0.25 ± 0.34 | 0.80 ± 0.28 | 32.58 | 14.04 |
| Scaffold 5 | 23.24 | 376 | 0.28 ± 0.34 | 0.78 ± 0.32 | 38.50 | 16.43 |
| Scaffold 6 | 20.31 | 758 | 0.25 ± 0.32 | 0.77 ± 0.32 | 34.76 | 11.76 |
| Scaffold 7 | 25.40 | 282 | 0.34 ± 0.39 | 0.81 ± 0.32 | 43.71 | 22.16 |
| Scaffold 8 | 26.72 | 1,234 | 0.25 ± 0.33 | 0.80 ± 0.31 | 33.16 | 13.29 |
| Scaffold 9 | 15.31 | 139 | 0.42 ± 0.35 | 0.83 ± 0.30 | 69.64 | 21.43 |
| Scaffold 10 | 22.50 | 1,320 | 0.27 ± 0.34 | 0.79 ± 0.31 | 36.59 | 15.41 |
| Scaffold 11 | 13.42 | 141 | 0.25 ± 0.34 | 0.77 ± 0.34 | 31.96 | 14.43 |
| Scaffold 12 | 13.10 | 284 | 0.29 ± 0.34 | 0.82 ± 0.29 | 37.85 | 15.25 |
| Scaffold 13 | 13.34 | 50 | 0.31 ± 0.37 | 0.74 ± 0.32 | 42.42 | 18.18 |
| Scaffold 14 | 13.56 | 188 | 0.31 ± 0.38 | 0.80 ± 0.31 | 41.12 | 21.50 |
| Scaffold 16 | 16.70 | 278 | 0.24 ± 0.32 | 0.81 ± 0.29 | 31.51 | 12.33 |
| Scaffold 17 | 13.29 | 56 | 0.34 ± 0.36 | 0.89 ± 0.19 | 48.39 | 22.58 |
| Scaffold 18 | 12.91 | 329 | 0.26 ± 0.33 | 0.75 ± 0.33 | 36.90 | 12.30 |
| Scaffold 19 | 12.66 | 173 | 0.38 ± 0.39 | 0.87 ± 0.23 | 47.44 | 23.08 |
| Scaffold 20 | 13.35 | 104 | 0.32 ± 0.37 | 0.81 ± 0.30 | 42.11 | 18.42 |
| Scaffold 21 | 12.91 | 176 | 0.26 ± 0.35 | 0.78 ± 0.30 | 34.26 | 12.96 |
| Scaffold 22 | 13.11 | 144 | 0.35 ± 0.37 | 0.84 ± 0.31 | 48.00 | 22.67 |
| Scaffold 23 | 16.07 | 79 | 0.37 ± 0.35 | 0.81 ± 0.28 | 54.17 | 16.67 |
| Scaffold 24 | 11.92 | 50 | 0.31 ± 0.37 | 0.84 ± 0.21 | 37.50 | 18.75 |
| Scaffold 25 | 23.23 | 203 | 0.24 ± 0.35 | 0.75 ± 0.35 | 29.92 | 14.96 |
| Scaffold 26 | 12.29 | 226 | 0.24 ± 0.32 | 0.82 ± 0.31 | 30.56 | 10.42 |
| Scaffold 27 | 10.49 | 150 | 0.25 ± 0.34 | 0.73 ± 0.34 | 35.71 | 14.29 |
| Scaffold 28 | 12.49 | 961 | 0.26 ± 0.33 | 0.81 ± 0.30 | 35.29 | 12.71 |
| Scaffold 29 | 10.52 | 126 | 0.39 ± 0.40 | 0.74 ± 0.35 | 46.30 | 25.93 |
| Scaffold 30 | 16.12 | 68 | 0.23 ± 0.33 | 0.75 ± 0.32 | 27.66 | 12.77 |
| Scaffold 31 | 12.85 | 145 | 0.30 ± 0.35 | 0.79 ± 0.33 | 39.56 | 16.48 |
| Scaffold 32 | 11.01 | 116 | 0.24 ± 0.35 | 0.76 ± 0.32 | 28.07 | 14.04 |
| Scaffold 33 | 10.69 | 56 | 0.24 ± 0.31 | 0.88 ± 0.24 | 30.30 | 12.12 |
| Scaffold 34 | 10.66 | 89 | 0.18 ± 0.28 | 0.71 ± 0.39 | 24.07 | 7.41 |
| Scaffold 36 | 18.43 | 1,211 | 0.25 ± 0.33 | 0.80 ± 0.30 | 31.77 | 13.67 |
| Scaffold 41 | 16.55 | 219 | 0.24 ± 0.34 | 0.73 ± 0.35 | 29.92 | 14.17 |
| Scaffold 45 | 11.91 | 85 | 0.36 ± 0.39 | 0.88 ± 0.22 | 45.65 | 19.57 |
| Scaffold 47 | 10.04 | 147 | 0.36 ± 0.38 | 0.80 ± 0.33 | 50.00 | 21.28 |
| Scaffold 49 | 10.33 | 104 | 0.37 ± 0.41 | 0.76 ± 0.33 | 46.67 | 24.44 |
| Scaffold 64 | 15.38 | 203 | 0.25 ± 0.37 | 0.73 ± 0.33 | 30.36 | 16.96 |
| Scaffold 66 | 11.16 | 52 | 0.19 ± 0.33 | 0.60 ± 0.42 | 21.62 | 10.81 |
| Scaffold 68 | 10.66 | 90 | 0.54 ± 0.41 | 0.90 ± 0.25 | 64.52 | 35.48 |
| Scaffold 70 | 12.54 | 215 | 0.30 ± 0.37 | 0.83 ± 0.28 | 36.19 | 20.00 |
| Scaffold 72 | 10.47 | 379 | 0.28 ± 0.36 | 0.75 ± 0.34 | 33.82 | 17.87 |
| Scaffold 73 | 13.59 | 237 | 0.27 ± 0.34 | 0.81 ± 0.29 | 40.13 | 13.82 |
| Scaffold 93 | 12.16 | 189 | 0.31 ± 0.33 | 0.79 ± 0.33 | 46.75 | 14.29 |
| Scaffold 99 | 10.59 | 213 | 0.32 ± 0.34 | 0.85 ± 0.24 | 45.05 | 16.22 |
| Overall | 719.22 | 13,321 | 0.30 ± 0.35 | 0.79 ± 0.30 | 39.09 | 16.97 |
FIGURE 1The average r2 per each scaffold (46 scaffolds) in American mink using GBS data.
Average r2 and D′ ± SD over physical distances up to 1000 kb, pooled over all scaffolds, in American mink.
| SNP pairs distance (kb) | LD parameters | ||
| Number of pairs | Average | Average D′ ± SD | |
| 0–10 | 9,024 | 0.38 ± 0.28 | 0.79 ± 0.27 |
| 10–20 | 4,659 | 0.30 ± 0.23 | 0.73 ± 0.28 |
| 20–30 | 4,513 | 0.25 ± 0.22 | 0.72 ± 0.28 |
| 30–40 | 4,055 | 0.21 ± 0.21 | 0.71 ± 0.28 |
| 40–50 | 4,355 | 0.19 ± 0.20 | 0.71 ± 0.29 |
| 50–60 | 4,163 | 0.18 ± 0.20 | 0.70 ± 0.29 |
| 60–70 | 4,160 | 0.17 ± 0.19 | 0.70 ± 0.29 |
| 70–80 | 4,110 | 0.16 ± 0.19 | 0.68 ± 0.29 |
| 80–90 | 4,164 | 0.15 ± 0.18 | 0.67 ± 0.29 |
| 90–100 | 3,980 | 0.14 ± 0.18 | 0.67 ± 0.28 |
| 100–200 | 38,171 | 0.14 ± 0.17 | 0.66 ± 0.31 |
| 200–300 | 35,858 | 0.13 ± 0.17 | 0.66 ± 0.31 |
| 300–400 | 34,748 | 0.12 ± 0.16 | 0.64 ± 0.31 |
| 400–500 | 34,209 | 0.11 ± 0.16 | 0.61 ± 0.32 |
| 500–600 | 33,502 | 0.10 ± 0.14 | 0.60 ± 0.32 |
| 600–700 | 33,036 | 0.09 ± 0.14 | 0.57 ± 0.32 |
| 700–800 | 32,085 | 0.09 ± 0.13 | 0.56 ± 0.32 |
| 800–900 | 30,318 | 0.08 ± 0.13 | 0.55 ± 0.32 |
| 900–1000 | 29,765 | 0.08 ± 0.12 | 0.55 ± 0.32 |
FIGURE 2The LD decay represented by the average r2 for three SNP sets: SNP pairs separated by inter-marker distances of (A) 0 up to 100 kb using consecutive 10 kb bins, (B) 0 up to 1000 kb using consecutive 100 kb bins, and (C) 0 up to 10 Mb using consecutive 1 Mb bins.
FIGURE 3Effective population sizes (Ne) in American mink as a function of generations.
Descriptive statistics of runs of homozygosity (ROH) in different length categories: average number of ROH per individual, average length per ROH class, minimum and maximum number of ROH detected per individual, cumulative frequency of each ROH class, and correlation coefficient between FROH and FHOM in each length category.
| ROH class | Average number of ROH per individual ± SD | Average ROH length (Mb) ± SD | Minimum number of ROH per individual | Maximum number of ROH per individual | Cumulative frequency (%) | Correlation coefficient with FHOM |
| ROH > 500 kb | 216 ± 56 | 2.16 ± 2.00 | 40 | 332 | 100 | 0.87 |
| ROH > 1 Mb | 115 ± 44 | 2.87 ± 2.11 | 22 | 241 | 67 | 0.84 |
| ROH > 2 Mb | 84.50 ± 25 | 4.09 ± 2.20 | 8 | 155 | 36.5 | 0.78 |
| ROH > 4 Mb | 31 ± 10.90 | 6.30 ± 2.22 | 2 | 71 | 13.35 | 0.68 |
| ROH > 8 Mb | 5.60 ± 2.90 | 10.08 ± 2.12 | 0 | 16 | 2.35 | 0.49 |
| ROH > 16 Mb | 0.08 ± 0.29 | 17.65 ± 2.90 | 0 | 2 | 0.34 | 0.12 |
FIGURE 4Trend of FROH based on the cut-off length categories for defining ROH.