| Literature DB >> 27461929 |
Z Wang1,2, Q Chen1,2, R Liao1,2, Z Zhang1,2, X Zhang1,2, X Liu3, M Zhu3, W Zhang3, M Xue4, H Yang4, Y Zheng4, Q Wang1,2, Y Pan1,2.
Abstract
The Chinese Taihu pig breeds are an invaluable component of the world's pig genetic resources, and they are the most prolific breeds of swine in the world. In this study, the genomes of 252 pigs of the six indigenous breeds in the Taihu Lake region were sequenced using the genotyping by genome reducing and sequencing approach. A total of 950 million good reads were obtained using an Illumina Hiseq2000 at an average depth of 13× (for SNP calling) and an average coverage of 2.3%. In total, 122 632 indels, 31 444 insertions, 44 056 deletions and 455 CNVs (copy number variants) were identified in the genomes of the pigs. Approximately 2.3% of these genetic markers were mapped to gene exon regions, and 25% were in QTL regions related to economically important traits. The KEGG pathway or GO enrichment analyses revealed that genetic variants assumed to be large-effect mutations were significantly overrepresented in 22 SNP, 56 indel, 26 insertion, 28 deletion and three CNV gene sets. A total of 343 breed-specific SNPs were also identified in the six Chinese indigenous pigs. The findings from this study can contribute to future investigations of the genetic diversity, population structure, positive selection signals and molecular evolutionary history of these pigs at the genome level and can serve as a valuable reference for improving the breeding and cultivation of these pigs.Entities:
Keywords: zzm321990GGRSzzm321990; Chinese indigenous pigs; breed-specific SNPs; genome sequencing
Mesh:
Year: 2016 PMID: 27461929 PMCID: PMC5248613 DOI: 10.1111/age.12465
Source DB: PubMed Journal: Anim Genet ISSN: 0268-9146 Impact factor: 3.169
Summary statistics for the sequencing data
| Breed | Meishan | Erhualian | Mi | Fengjing | Shawutou | Jiaxing Black | Overall | Average | |
|---|---|---|---|---|---|---|---|---|---|
| Middle | Small | ||||||||
| Number | 50 | 69 | 31 | 36 | 16 | 21 | 29 | 252 | – |
| Average genome coverage (%) | 2.0 | 2.9 | 1.7 | 2.7 | 1.2 | 1.8 | 2.9 | – | 2.3 |
| Average sequencing depth | 11.1 | 10.5 | 23.5 | 8.8 | 21.2 | 20.4 | 9.4 | – | 13.0 |
| Breed‐specific SNPs | 38 | 54 | 7 | 88 | 4 | 1 | 151 | 343 | – |
Sequencing data used for ultimate SNP identification.
Number of identified bred‐specific SNPs of the 105 550 overall SNPs.
Figure 1(a) Distance distribution of adjacent SNPs and (b) distribution of indels of different lengths.
Figure 2Distribution of the density of different types of genetic variants across chromosomes calculated as the number of variants per 100 kb.
Number distribution of variants detected in each chromosome
| Chr | SNP | Indel | Insertion | Deletion | CNV | Genes | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Genes | Ratio (%) | Gain | Loss | Genes | Ratio (%) |
| Genes | Ratio (%) |
| Genes | Ratio (%) |
| Genes | Ratio (%) |
| |
| 1 | 3632 | 662 | 30.00 | 3323 | 1321 | 1023 | 46.30 | 1420 | 516 | 23.36 | 1013 | 639 | 28.93 | 20 | 22 | 1.00 | 2209 |
| 2 | 2857 | 640 | 29.80 | 2284 | 922 | 904 | 42.10 | 859 | 361 | 16.81 | 575 | 491 | 22.87 | 10 | 11 | 0.51 | 2147 |
| 3 | 2065 | 482 | 33.40 | 2072 | 789 | 727 | 50.30 | 709 | 295 | 20.43 | 471 | 385 | 26.66 | 7 | 8 | 0.55 | 1444 |
| 4 | 1427 | 389 | 31.00 | 1430 | 550 | 547 | 43.60 | 637 | 266 | 21.21 | 456 | 335 | 26.71 | 9 | 10 | 0.80 | 1254 |
| 5 | 4764 | 369 | 31.50 | 3907 | 1507 | 540 | 46.10 | 576 | 239 | 20.39 | 411 | 290 | 24.74 | 8 | 9 | 0.77 | 1172 |
| 6 | 2490 | 635 | 32.70 | 2342 | 897 | 944 | 48.60 | 884 | 349 | 17.98 | 591 | 480 | 24.73 | 13 | 13 | 0.67 | 1941 |
| 7 | 2191 | 487 | 29.50 | 1785 | 769 | 632 | 38.30 | 771 | 280 | 16.96 | 526 | 358 | 21.68 | 11 | 12 | 0.73 | 1651 |
| 8 | 918 | 247 | 29.20 | 1161 | 442 | 422 | 49.80 | 518 | 227 | 26.80 | 390 | 269 | 31.76 | 8 | 9 | 1.06 | 847 |
| 9 | 2053 | 397 | 27.80 | 1701 | 668 | 560 | 39.20 | 738 | 293 | 20.53 | 540 | 353 | 24.74 | 6 | 6 | 0.42 | 1427 |
| 10 | 898 | 185 | 35.30 | 778 | 308 | 254 | 48.50 | 324 | 130 | 24.81 | 225 | 152 | 29.01 | 8 | 9 | 1.72 | 524 |
| 11 | 695 | 156 | 37.00 | 533 | 266 | 208 | 49.30 | 222 | 99 | 23.46 | 149 | 128 | 30.33 | 2 | 2 | 0.47 | 422 |
| 12 | 1408 | 392 | 34.20 | 1337 | 555 | 559 | 48.80 | 409 | 183 | 15.98 | 266 | 252 | 22.01 | 6 | 10 | 0.87 | 1145 |
| 13 | 1718 | 479 | 31.20 | 2045 | 804 | 717 | 46.70 | 1051 | 396 | 25.81 | 733 | 487 | 31.75 | 12 | 12 | 0.78 | 1534 |
| 14 | 2627 | 514 | 37.50 | 2369 | 963 | 717 | 52.30 | 1103 | 380 | 27.74 | 746 | 471 | 34.38 | 10 | 11 | 0.80 | 1370 |
| 15 | 1519 | 296 | 30.80 | 1228 | 532 | 445 | 46.40 | 562 | 221 | 23.02 | 398 | 266 | 27.71 | 10 | 11 | 1.15 | 960 |
| 16 | 642 | 154 | 34.70 | 642 | 285 | 225 | 50.70 | 339 | 122 | 27.48 | 234 | 146 | 32.88 | 4 | 4 | 0.90 | 444 |
| 17 | 3817 | 265 | 38.50 | 2892 | 1250 | 334 | 48.50 | 390 | 149 | 21.66 | 251 | 196 | 28.49 | 5 | 5 | 0.73 | 688 |
| 18 | 1023 | 195 | 38.10 | 817 | 315 | 248 | 48.40 | 352 | 121 | 23.63 | 241 | 147 | 28.71 | 2 | 3 | 0.59 | 512 |
| X | 739 | 169 | 15.10 | 1366 | 584 | 405 | 36.20 | 572 | 191 | 17.05 | 429 | 226 | 20.18 | 17 | 22 | 1.96 | 1120 |
| Y | 1 | 0 | 0.00 | 19 | 6 | 3 | 23.10 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 0 | 0 | 0.00 | 13 |
| Total | 37 484 | 7113 | 31.20 | 34 031 | 13 733 | 10 414 | 45.60 | 12 436 | 4818 | 21.11 | 8645 | 6071 | 26.60 | 168 | 189 | 0.83 | 22 824 |
Number of identified variants within genic regions.
Number of total genes contained in the Ensembl gene database of pig.
Statistics of variants in functional regions
| Category | SNP (%) | Indel (%) | Insertion (%) | Deletion (%) |
|---|---|---|---|---|
| Intergenic | 68 066 (64.5) | 74 868 (61.1) | 22 799 (72.5) | 31 620 (71.8) |
| Start/End codon | 4 (0.0) | 7 (0.0) | 64 (0.2) | 99 (0.2) |
| Exonic | 2400 (2.3) | 3314 (2.7) | 672 (2.1) | 1007 (2.3) |
| Intronic | 20 249 (19.2) | 25 398 (20.7) | 3344 (10.6) | 4870 (11.1) |
| Untranslated region | 14 831 (14.1) | 19 045 (15.5) | 4565 (14.5) | 6460 (14.7) |
| Total | 105 550 | 122 632 | 31 444 | 44 056 |
Figure 3Population structures of Chinese indigenous pigs in the Taihu Lake region revealed by principal components analysis based on (a) the dataset of the SNPs in genic regions, (b) the dataset of the SNPs in the intergenic regions, (c) the dataset of the SNPs in the QTL regions related to meat and carcass quality traits, (d) the dataset of the SNPs in the QTL regions related to reproduction traits. MMS, Middle Meishan; SWT, Shawutou; EHL, Erhualian; M, Mi; FJ, Fengjing; JXB, Jiaxing Black; SMS, Small Meishan.
Distribution of SNPs and indels among different traits
| Trait | SNP | Indel | Insertion | Deletion | CNV | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| Genes |
|
| Genes |
|
| Genes |
|
| Genes |
|
| Genes | |
| Meat and carcass quality | 15 428 | 4309 | 1214 | 18 883 | 5856 | 1733 | 4950 | 1319 | 779 | 7044 | 1922 | 997 | 30 | 24 | 30 |
| Health | 2987 | 851 | 257 | 3807 | 1140 | 360 | 1023 | 280 | 165 | 1416 | 396 | 216 | 4 | 3 | 3 |
| Production | 658 | 1684 | 420 | 7675 | 2260 | 608 | 2012 | 542 | 300 | 2852 | 768 | 367 | 7 | 7 | 7 |
| Reproduction | 8077 | 357 | 78 | 9531 | 2876 | 795 | 2427 | 634 | 387 | 3451 | 943 | 492 | 13 | 11 | 13 |
| Exterior | 3876 | 946 | 263 | 4696 | 1348 | 383 | 1188 | 300 | 178 | 1718 | 152 | 220 | 12 | 11 | 11 |
Variants located in QTL regions.
Variants located in both gene and QTL regions.
Number of genes related to variants.