| Literature DB >> 32299342 |
Christian Reimer1,2, Ngoc-Thuy Ha3,4, Ahmad Reza Sharifi3,4, Johannes Geibel3,4, Lars Friis Mikkelsen5, Martin Schlather4,6, Steffen Weigend4,7, Henner Simianer3,4.
Abstract
BACKGROUND: Göttingen Minipigs (GMP) is the smallest commercially available minipig breed under a controlled breeding scheme and is globally bred in five isolated colonies. The genetic isolation harbors the risk of stratification which might compromise the identity of the breed and its usability as an animal model for biomedical and human disease. We conducted whole genome re-sequencing of two DNA-pools per colony to assess genomic differentiation within and between colonies. We added publicly available samples from 13 various pig breeds and discovered overall about 32 M loci, ~ 16 M. thereof variable in GMPs. Individual samples were virtually pooled breed-wise. FST between virtual and DNA pools, a phylogenetic tree, principal component analysis (PCA) and evaluation of functional SNP classes were conducted. An F-test was performed to reveal significantly differentiated allele frequencies between colonies. Variation within a colony was quantified as expected heterozygosity.Entities:
Keywords: Conservation; Göttingen Minipigs; Population structure
Mesh:
Year: 2020 PMID: 32299342 PMCID: PMC7161115 DOI: 10.1186/s12864-020-6590-4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Mean coefficients of relationship within a colony sample and between the sample and the remaining stock, including number of successfully extracted probes
| RE | DA2 | DA3 | NR | NI | |
|---|---|---|---|---|---|
| Relationship within sample | 0.357 | 0.396 | 0.396 | 0.403 | 0.387 |
| Relationship Sample/ Remaining Stock | 0.376 | 0.404 | 0.402 | 0.410 | 0.397 |
| Successful DNA extractions | 28 | 24 | 23 | 28 | 24 |
Fig. 1PCA based on pairwise comparisons of reference allele frequencies in all pools (top) and GMP DNA pools only (bottom); Variance explained by PC in brackets. Distribution of variance explained by PC’s on the right
Mean FST and DR between European and Asian breeds and GMP
| GMP | Asian | European | ||
|---|---|---|---|---|
| GMP | 0.066 | 0.262 | 0.307 | |
| FST | Asian | 0.271 | 0.327 | |
| European | 0.163 | |||
| GMP | 0.113 | 0.364 | 0.408 | |
| DR | Asian | 0.372 | 0.428 | |
| European | 0.246 |
Fig. 2Genome wide FST vs. Reynolds distances for all pairwise comparisons. Comparisons within breed types in the respective colors, comparisons between breed types in grey
Fig. 3UPGMA tree based on genome-wide FST values; resampling frequency based on 100 random samples of 100 loci in rectangles
Proportion of SNP significantly different between colony and remote pool in F-test at 5%
| RE_1 | RE_2 | DA2_1 | DA2_2 | DA3_1 | DA3_2 | NR_1 | NR_2 | NI_1 | NI_2 | |
|---|---|---|---|---|---|---|---|---|---|---|
| RE | 0.00 | 0.00 | 0.06 | 0.06 | 0.06 | 0.06 | 0.06 | 0.05 | 0.06 | 0.06 |
| DA2 | 0.07 | 0.07 | 0.00 | 0.00 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 | 0.05 |
| DA3 | 0.07 | 0.07 | 0.05 | 0.05 | 0.00 | 0.00 | 0.05 | 0.05 | 0.04 | 0.04 |
| NR | 0.08 | 0.08 | 0.06 | 0.06 | 0.07 | 0.06 | 0.00 | 0.00 | 0.06 | 0.07 |
| NI | 0.07 | 0.07 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.04 | 0.00 | 0.00 |
Fig. 4Proportion of significantly different loci at 5% Bonferroni corrected F-test level against number of tested loci
Expected Heterozygosity within pools
| RE_1 | RE_2 | DA2_1 | DA2_2 | DA3_1 | DA3_2 | NR_1 | NR_2 | NI_1 | NI_2 | |
|---|---|---|---|---|---|---|---|---|---|---|
| Hexp | 0.284 | 0.285 | 0.278 | 0.277 | 0.283 | 0.276 | 0.272 | 0.271 | 0.280 | 0.283 |
| SD | 0.188 | 0.185 | 0.189 | 0.189 | 0.189 | 0.190 | 0.193 | 0.193 | 0.189 | 0.185 |
| Nloci [mio.] | 16.497 | 16.497 | 16.496 | 16.496 | 16.496 | 16.497 | 16.497 | 16.497 | 16.497 | 16.498 |
| NNA [mio.] | 1′831 | 1′455 | 2′556 | 2′754 | 2′493 | 2′167 | 2′116 | 2′204 | 1′380 | 1′154 |
Expected Heterozygosity estimated from the virtual union of both unit pools
| RE | DA2 | DA3 | NR | NI | |
|---|---|---|---|---|---|
| Hexp | 0.298 | 0.292 | 0.294 | 0.285 | 0.295 |
| SD | 0.175 | 0.175 | 0.178 | 0.181 | 0.176 |
| Nloci [M] | 16.498 | 16.498 | 16.498 | 16.499 | 16.499 |
| NNA | 260 | 452 | 346 | 441 | 231 |
Correlation between genotypes for loci that were completely fixed within each unit
| RE | DA2 | DA3 | NR | NI | |
|---|---|---|---|---|---|
| RE | 1.00 | −0.10 | 0.02 | 0.05 | 0.14 |
| DA2 | −0.10 | 1.00 | −0.07 | −0.02 | 0.01 |
| DA3 | 0.02 | −0.07 | 1.00 | 0.02 | 0.12 |
| NR | 0.05 | −0.02 | 0.02 | 1.00 | 0.11 |
| NI | 0.14 | 0.01 | 0.12 | 0.11 | 1.00 |
Number of private polymorphism; left: completely recorded loci; right: missing information (NA) allowed
| Without NA | with NA | |
|---|---|---|
| RE | 555′591 | 555′765 |
| DA2 | 163′853 | 163′935 |
| DA3 | 134′158 | 134′265 |
| NR | 192′896 | 192′974 |
| NI | 156′502 | 156′579 |
Relative amount [%] of significantly differentiated and oppositely fixed loci per functional class and relative abundance of loci in differentiated classes in comparison to all background loci
| Relative amount of loci per class | Relative abundance compared to background | |||
|---|---|---|---|---|
| 5% bonf | Opp. Fixed | 5% bonf | Opp. fixed | |
| 3_prime_UTR_variant | 1.0790 | 1.1905 | 0.9906 | 1.0913 |
| 5_prime_UTR_variant | 0.2728 | 0.3345 | 0.9919 | 1.2232 |
| coding_sequence_variant | 0.0002 | 0.0000 | 0.9558 | 0.0000 |
| downstream_gene_variant | 6.4463 | 7.1600 | 1.0199 | 1.1224 |
| intergenic_variant | 19.9130 | 19.4910 | 0.9892 | 0.9698 |
| intron_variant | 65.4566 | 64.3371 | 1.0019 | 0.9848 |
| missense_variant | 0.2416 | 0.3683 | 1.0033 | 1.5229 |
| start_lost | 0.0009 | 0.0000 | 1.2111 | 0.0000 |
| stop_gained | 0.0020 | 0.0047 | 0.9747 | 2.1566 |
| stop_gained,splice_region_variant | 0.0001 | 0.0000 | 0.5296 | 0.0000 |
| stop_gained,start_lost | 0.0000 | 0.0000 | 5.2287 | 0.0000 |
| stop_lost | 0.0005 | 0.0000 | 1.0325 | 0.0000 |
| synonymous_variant | 0.5686 | 0.4842 | 1.0205 | 0.8975 |
| upstream_gene_variant | 6.0184 | 6.6297 | 0.9945 | 1.0989 |
Fig. 5Relative abundance of functional SNP classes in dependence from pairwise FST bewteen units
Annotation of deleterious missense variants with pairwise FST of 1
| Chr | Pos [bp] | Pool 1 | Pool 2 | Ensembl ID | RS ID | SIFT | Gene name |
|---|---|---|---|---|---|---|---|
| 3 | 1′741‘574 | DA2 | DA3 | ENSSSCG00000007567 | rs1108271931 | 0 | CHST12 |
| 3 | 1′741‘574 | DA2 | NI | ENSSSCG00000007567 | rs1108271931 | 0 | CHST12 |
| 9 | 9′545‘629 | DA3 | NR | ENSSSCG00000014852 | rs793754709 | 0 | ARRB1 |
| 9 | 9′545‘629 | NR | RE | ENSSSCG00000014852 | rs793754709 | 0 | ARRB1 |
| 10 | 2′699‘680 | DA2 | DA3 | ENSSSCG00000042899 | – | 0.03 | – |
| 10 | 2′699‘680 | DA2 | NR | ENSSSCG00000042899 | – | 0.03 | – |
| 13 | 207′155‘372 | DA2 | NR | ENSSSCG00000025294 | rs343158168 | 0.03 | AIRE |
| 14 | 132′394‘490 | DA2 | DA3 | ENSSSCG00000046807 | – | 0.04 | – |
| 14 | 132′394‘490 | DA2 | NR | ENSSSCG00000046807 | – | 0.04 | – |
| 18 | 50′955‘382 | DA3 | NR | ENSSSCG00000039535 | rs713007753 | 0.01 | CAMK2B |
Relative amount (in per cent) of private polymorphism loci per functional class (absolute number of loci in brackets)
| RE | DA2 | DA3 | NR | NI | |
|---|---|---|---|---|---|
| 3_prime_UTR_variant | 1.2176 (6317) | 1.1844 (1790) | 1.2121 (1516) | 1.2375 (2211) | 1.1192 (1634) |
| 5_prime_UTR_variant | 0.378 (1961) | 0.3474 (525) | 0.3622 (453) | 0.4125 (737) | 0.3199 (467) |
| coding_sequence_variant | 0.0002 (1) | NA (NA) | NA (NA) | 0.0006 (1) | NA (NA) |
| downstream_gene_variant | 7.346 (38113) | 7.1572 (10817) | 7.1015 (8882) | 7.2189 (12898) | 6.6622 (9727) |
| intergenic_variant | 39.5045 (204959) | 39.6937 (59991) | 41.8575 (52352) | 39.7633 (71045) | 40.8015 (59571) |
| intron_variant | 43.3518 (224920) | 43.5875 (65876) | 41.7496 (52217) | 43.179 (77148) | 43.6521 (63733) |
| missense_variant | 0.4587 (2380) | 0.4268 (645) | 0.4302 (538) | 0.525 (938) | 0.389 (568) |
| start_lost | 0.001 (5) | 0.0013 (2) | NA (NA) | NA (NA) | 0.0007 (1) |
| stop_gained | 0.0021 (11) | 0.0053 (8) | 0.0048 (6) | 0.0022 (4) | 0.0048 (7) |
| stop_gained,splice_region_variant | NA (NA) | NA (NA) | NA (NA) | NA (NA) | NA (NA) |
| stop_gained,start_lost | NA (NA) | NA (NA) | NA (NA) | NA (NA) | NA (NA) |
| stop_lost | 0.0008 (4) | 0.0007 (1) | NA (NA) | 0.0011 (2) | NA (NA) |
| synonymous_variant | 0.6557 (3402) | 0.6213 (939) | 0.5525 (691) | 0.6571 (1174) | 0.5082 (742) |
| upstream_gene_variant | 7.0837 (36752) | 6.9746 (10541) | 6.7297 (8417) | 7.0029 (12512) | 6.5424 (9552) |
Additional porcine samples used in Reimer et al. (2014)
| Breed | Number of Samples | Average Depth | Class | Subclass |
|---|---|---|---|---|
| Duroc | 4 | 5.98 | European | Domestic |
| Hampshire | 2 | 6.49 | European | Domestic |
| Jiangquhai | 1 | 8.20 | Asian | Domestic |
| Large White | 14 | 6.46 | European | Domestic |
| Landrace | 5 | 6.36 | European | Domestic |
| Meishan | 4 | 6.83 | Asian | Domestic |
| Pietrain | 5 | 5.61 | European | Domestic |
| Xiang | 2 | 6.27 | Asian | Domestic |
| European wild boar | 6 | 6.44 | European | Wild |
| Asian wild boar | 5 | 6.27 | Asian | Wild |
| Göttingen Minipigs external | 1 | 12.76 | Minipig | Göttingen |
| Göttingen Minipigs | 10 | 13.01 | Minipig | Göttingen |
| Mini-LEWE | 2 | 13.93 | Minipig | Berlin |
| Mini-LEWE pool | 10 | 13.14 | Minipig | Berlin |
| Wuzhishan | 1 | 11.02 | Asian | Domestic |