| Literature DB >> 26018558 |
Jung-Woo Choi1, Bong-Hwan Choi1, Seung-Hwan Lee2, Seung-Soo Lee3, Hyeong-Cheol Kim4, Dayeong Yu1, Won-Hyong Chung1, Kyung-Tai Lee1, Han-Ha Chai1, Yong-Min Cho1, Dajeong Lim1.
Abstract
Over the last 30 years, Hanwoo has been selectively bred to improve economically important traits. Hanwoo is currently the representative Korean native beef cattle breed, and it is believed that it shared an ancestor with a Chinese breed, Yanbian cattle, until the last century. However, these two breeds have experienced different selection pressures during recent decades. Here, we whole-genome sequenced 10 animals each of Hanwoo and Yanbian cattle (20 total) using the Illumina HiSeq 2000 sequencer. A total of approximately 3.12 and 3.07 billion sequence reads were mapped to the bovine reference sequence assembly (UMD 3.1) at an average of approximately 10.71- and 10.53-fold coverage for Hanwoo and Yanbian cattle, respectively. A total of 17,936,399 single nucleotide polymorphisms (SNPs) were yielded, of which 22.3% were found to be novel. By annotating the SNPs, we further retrieved numerous nonsynonymous SNPs that may be associated with traits of interest in cattle. Furthermore, we performed whole-genome screening to detect signatures of selection throughout the genome. We located several promising selective sweeps that are potentially responsible for economically important traits in cattle; the PPP1R12A gene is an example of a gene that potentially affects intramuscular fat content. These discoveries provide valuable genomic information regarding potential genomic markers that could predict traits of interest for breeding programs of these cattle breeds.Entities:
Keywords: Hanwoo; SNP; Signature of selection; Whole-genome sequencing; Yanbian cattle
Mesh:
Substances:
Year: 2015 PMID: 26018558 PMCID: PMC4443289 DOI: 10.14348/molcells.2015.0019
Source DB: PubMed Journal: Mol Cells ISSN: 1016-8478 Impact factor: 5.034
Summary of sequencing results for Hanwoo and Yanbian cattle breeds used in this study.
| Sample Name | No. Sample | Raw_Reads | PCR duplication reads | Mapped reads | Properly paired reads | A_Coverage | Ave_Fold |
|---|---|---|---|---|---|---|---|
| Hanwoo | 10 | 3,625,402,354 | 3,182,976,738 | 3,123,899,715 | 2,989,529,698 | 98.78 | 10.71X |
| Yanbian cattle | 10 | 3,643,308,976 | 3,069,068,463 | 3,069,068,463 | 2,954,705,966 | 98.74 | 10.53X |
|
| |||||||
| Total | 20 | 7,268,711,330 | 6,306,225,775 | 6,192,968,178 | 5,944,235,664 | 98.76 | 10.62X |
Properly Paired reads, “properly paired” means that both ends of the reads were mapped with correct orientation and their fragment sizes were less than 500 bp.
A_Coverage, assembly coverage calculated as the proportion of bases in the genome assembly that were covered by at least one read.
Ave_Fold, average fold that was calculated as the average depth of coverage across the whole genome.
Summary of SNPs identified from Hanwoo and Yanbian cattle in this study.
| Fields | Hanwoo | Yanbian cattle | Total | |
|---|---|---|---|---|
| Sample counts | 10 | 10 | 20 | |
| SNP count | 13,544,560 | 15,857,687 | 17,926,093 | |
| dbSNP | 11,477,894 | 12,892,393 | 13,936,399 | |
| Ti/Tv ratio | 2.26 | 2.26 | 2.26 | |
| Exon | Synonymous variant | 49,684 | 59,232 | 70,079 |
| Non-synonymous variant | 36,457 | 42,595 | 50,827 | |
| Initiator codon variant | 4 | 4 | 5 | |
| Start lost | 34 | 43 | 48 | |
| Stop gained | 399 | 491 | 580 | |
| Stop lost | 16 | 25 | 27 | |
| Stop retained variant | 39 | 43 | 52 | |
| Non coding exon variant | 5,393 | 6,180 | 7,322 | |
| Splice site | Splice region variant | 8,499 | 10,098 | 11,961 |
| Splice acceptor variant | 224 | 269 | 318 | |
| Splice donor variant | 230 | 268 | 331 | |
| Intron | Intron variant | 3,347,348 | 4,047,275 | 4,703,213 |
| Intragenic variant | 40,614 | 49,950 | 56,469 | |
| UTR | 5 prime UTR variant | 4,094 | 4,917 | 5,855 |
| 5prime UTR premature - startcodon gain variant | 674 | 818 | 956 | |
| 3prime UTR variant | 26,280 | 31,680 | 37,323 | |
| Intergenic | Upstream | 531,123 | 637,361 | 742,563 |
| Downstream | 545,133 | 656,190 | 764,392 | |
| Intergenic region | 9,502,323 | 11,284,480 | 13,090,691 | |
| Missense | 36,511 | 42,667 | 50,907 | |
| Nonsense | 399 | 491 | 580 | |
| Silent | 49,723 | 59,275 | 70,131 |
SNP count; the overlapped SNP loci between samples were counted as one.
Because the analysis to categorize the SNPs was done non-exclusively, some SNPs were counted at multiple categories.
SNP categories were clustered by six genomic regions: Exon, Splice site, Intron, UTR, Flanking region and Intergenic
Upstream/downstream: 5 Kbp regions that are adjacent to the both ends of a gene were defined as upstream and downstream regions respectively.
Fig. 1.The SNP distribution according to the functional categories
Fig. 2.Network of functional terms of nsSNP in Hanwoo. Nodes (circles) are annotated functional terms. Edges connecting nodes represent gene share, being thickness proportional to the number of genes shared between terms (i.e., the degree of gene set overlap).
Fig. 3.Distribution of ZHp scores across all 29 Bos taurus autosomes for HAN and YAN. The upper and lower plots indicate the ZHp score distribution of Hanwoo and Yanbian cattle, respectively. Even numbered chromosomes are presented in gray and odd numbered chromosomes are black. The blue and red horizontal lines indicate ZHp thresholds of −4 and −5 that could be strong candidates of selective sweeps in this study.
Fig. 4.A putative selective sweep at the PPP1R12A locus in Hanwoo. (A) and (B) present the Z score of the pooled heterozygosity (ZHp) plot for Hanwoo and Yanbian cattle, respectively. The red and blue horizontal lines in these plots indicate ZHp scores of −5 and −4, respectively. Each circle-point represents ZHp score in each window used in this study. The yellow-highlighted marks the vicinity of the sweep including the PPP1R12A gene. ZHp scores in the interval were indicated as a green color. (C) The annotated gene structure of the PPP1R12A gene as well as underlying QTLs for the genomic region as visualized using the Animal QTLdb.