| Literature DB >> 26117497 |
Jung-Woo Choi1, Won-Hyong Chung2, Kyung-Tai Lee1, Eun-Seok Cho1, Si-Woo Lee1, Bong-Hwan Choi1, Sang-Heon Lee3, Wonjun Lim3, Dajeong Lim1, Yun-Gyeong Lee3, Joon-Ki Hong4, Doo-Wan Kim4, Hyeon-Jeong Jeon1, Jiwoong Kim5, Namshin Kim6, Tae-Hun Kim7.
Abstract
Pigs have been one of the most important sources of meat for humans, and their productivity has been substantially improved by recent strong selection. Here, we present whole-genome resequencing analyses of 55 pigs of five breeds representing Korean native pigs, wild boar and three European origin breeds. 1,673.1 Gb of sequence reads were mapped to the Swine reference assembly, covering ∼99.2% of the reference genome, at an average of ∼11.7-fold coverage. We detected 20,123,573 single-nucleotide polymorphisms (SNPs), of which 25.5% were novel. We extracted 35,458 of non-synonymous SNPs in 9,904 genes, which may contribute to traits of interest. The whole SNP sets were further used to access the population structures of the breeds, using multiple methodologies, including phylogenetic, similarity matrix, and population structure analysis. They showed clear population clusters with respect to each breed. Furthermore, we scanned the whole genomes to identify signatures of selection throughout the genome. The result revealed several promising loci that might underlie economically important traits in pigs, such as the CLDN1 and TWIST1 genes. These discoveries provide useful genomic information for further study of the discrete genetic mechanisms associated with economically important traits in pigs.Entities:
Keywords: Pig; SNP; signature of selection; whole-genome sequencing
Mesh:
Year: 2015 PMID: 26117497 PMCID: PMC4535618 DOI: 10.1093/dnares/dsv011
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Summary of sequencing results and SNPs from the five pig breeds including KWB, KNP, DUR, LAN, and YOR used in this study
| Sample name | No. of sample | Raw_Reads | Mapped reads | A_Coveragea | Ave_Foldb |
|---|---|---|---|---|---|
| Korean wild boar | 10 | 4,637,563,674 | 2,881,666,665 | 98.72% | 11.21 X |
| Korean native pig | 10 | 4,738,183,150 | 3,017,226,894 | 98.94% | 11.74 X |
| Duroc | 6 | 2,134,670,142 | 1,788,806,414 | 98.16% | 11.60 X |
| Landrace | 14 | 6,092,030,408 | 4,652,217,189 | 99.18% | 12.93 X |
| Yorkshire | 15 | 7,330,204,916 | 4,225,021,030 | 98.86% | 10.96 X |
| Total |
aA_Coverage, assembly coverage calculated as the proportion of bases in the genome assembly that were covered by at least one read.
bAve_Fold, average fold that was calculated as the average depth of coverage across the whole genome.
Summary of all the detected SNPs identified from all of the five breeds used in this study
| Fields | Total | KWB | KNP | DUR | LAN | YOR |
|---|---|---|---|---|---|---|
| Sample counts | 55 | 10 | 10 | 6 | 14 | 15 |
| SNP | 20,123,573 | 13,973,333 | 9,592,404 | 6,625,918 | 10,872,881 | 11,032,246 |
| Transition | 13,352,480 | 9,266,133 | 6,304,688 | 4,351,967 | 7,166,396 | 7,259,773 |
| Transversion | 6,771,093 | 4,707,200 | 3,287,716 | 2,273,951 | 3,706,485 | 3,772,473 |
| SNP categoriesa | ||||||
| Synonymous coding | 53,057 | 35,517 | 24,367 | 15,340 | 26,903 | 28,430 |
| Non-synonymous coding | 34,990 | 22,267 | 16,146 | 10,871 | 18,058 | 19,101 |
| Start lost | 26 | 15 | 15 | 9 | 14 | 15 |
| Stop gained | 393 | 230 | 189 | 92 | 189 | 214 |
| Stop lost | 49 | 39 | 25 | 22 | 31 | 28 |
| Non-coding exon | 96,102 | 60,277 | 46,017 | 26,706 | 51,131 | 51,305 |
| UTR 5′ | 11,888 | 7,430 | 5,562 | 2,886 | 6,188 | 6,602 |
| UTR 3′ | 68,432 | 46,001 | 31,269 | 21,670 | 35,645 | 36,640 |
| Splice site acceptor | 266 | 175 | 137 | 94 | 159 | 165 |
| Splice site donor | 275 | 193 | 133 | 96 | 161 | 160 |
| Intron | 4,955,607 | 3,445,188 | 2,337,787 | 1,602,182 | 2,660,083 | 2,690,109 |
| Intergenic | 14,902,488 | 10,356,001 | 7,130,757 | 4,945,950 | 8,074,319 | 8,199,477 |
| Functional categoriesb | ||||||
| Missense | 35,065 | 22,321 | 16,186 | 10,902 | 18,103 | 19,144 |
| Nonsense | 393 | 230 | 189 | 92 | 189 | 214 |
| Silent | 53,057 | 35,517 | 24,367 | 15,340 | 26,903 | 28,430 |
UTR, untranslated region.
aSNP categories: categorized by the effects of SNPs.
bFunctional categories: categorized by functional effects of coding SNPs.
Figure 1.Overview of the identified SNPs. (a) The number of SNPs identified from 55 pigs of the five breeds used in this study (KWB, KNP, DUR, LAN, and YOR). (b) The pie chart showing functional categories of all the detected SNPs. In the functional categories, the ‘Others’ represents the sum of three functional categories including ‘start lost’, ‘stop gained’, and ‘stop lost’. This figure is available in black and white in print and in colour at DNA Research online.
Figure 2.Genetic structure of the 55 pigs of the five pig breeds (KWB, KNP, DUR, LAN, and YOR) in this study. (a) Neighbour-joining phylogenetic tree of all the 55 pigs. (b) Population structure using the STRUCTURE program, which Bayesian cluster all 55 samples from the five breeds. (c) The similarity matrix of the 55 pigs based on the calculated SNP identity. Each of all the 55 pigs was presented in the same order on those three (a–c) plots. This figure is available in black and white in print and in colour at DNA Research online.
Figure 3.Z-scores of pooled heterozygosity (ZHp) overlapping the CLDN1 gene region. The highlighted orange-coloured box in the dotted line indicates genomic regions including the CLDN1 gene. (a) ZHp scores at the region of 136–139 Mb on Swine chromosome 13. Each point represents a ZHp score estimated from a 150 kb window. Five different shapes of the points represent each breed, respectively (see the legend for details). (b) The structure of RefSeq genes included in the 136–139 Mb region, which were presented by the UCSC genome browser. (c) Hp score of each SNP detected in the CLDN1. Colour gradation was used to show the degree of homozygosity: red (Hp = 0) and green (Hp = 0.5) colours represent the complete homozygosity and heterozygosity, respectively. Namely, high level of homozygosity represents the deepest red colour, while high level of heterozygosity increases green colour intensity. This figure is available in black and white in print and in colour at DNA Research online.
Figure 4.Z-scores of pooled heterozygosity (ZHp) overlapping the TWIST1 gene region. The highlighted orange-coloured box in the dotted line indicates genomic regions including the TWIST1 gene. (a) ZHp scores at the region of 96–99 Mb on Swine chromosome 9. Each point represents a ZHp score estimated from a 150 kb window. (b) The structure of Ensembl and RefSeq genes included in the 96–99 Mb region, which were presented by the UCSC genome browser. (c) Hp score of each SNP detected in the TWIST1. The presented region includes 5 kb flanking regions at both sides of the TWIST1 gene. This figure is available in black and white in print and in colour at DNA Research online.