Literature DB >> 25049829

Identification of recently selected mutations driven by artificial selection in hanwoo (korean cattle).

Dajeong Lim1, Cedric Gondro2, Hye Sun Park1, Yong Min Cho1, Han Ha Chai1, Hwan Hoo Seong1, Bo Suk Yang3, Seong Koo Hong1, Won Kyung Chang1, Seung Hwan Lee3.   

Abstract

Hanwoo have been subjected over the last seventy years to intensive artificial selection with the aim of improving meat production traits such as marbling and carcass weight. In this study, we performed a signature of selection analysis to identify recent positive selected regions driven by a long-term artificial selection process called a breeding program using whole genome SNP data. In order to investigate homozygous regions across the genome, we estimated iES (integrated Extended Haplotype Homozygosity SNP) for the each SNPs. As a result, we identified two highly homozygous regions that seem to be strong and/or recent positive selection. Five genes (DPH5, OLFM3, S1PR1, LRRN1 and CRBN) were included in this region. To go further in the interpretation of the observed signatures of selection, we subsequently concentrated on the annotation of differentiated genes defined according to the iES value of SNPs localized close or within them. We also described the detection of the adaptive evolution at the molecular level for the genes of interest. As a result, this analysis also led to the identification of OLFM3 as having a strong signal of selection in bovine lineage. The results of this study indicate that artificial selection which might have targeted most of these genes was mainly oriented towards improvement of meat production.

Entities:  

Keywords:  Hanwoo; SNP; Signatures of Selection

Year:  2013        PMID: 25049829      PMCID: PMC4093327          DOI: 10.5713/ajas.2012.12456

Source DB:  PubMed          Journal:  Asian-Australas J Anim Sci        ISSN: 1011-2367            Impact factor:   2.509


INTRODUCTION

Modern breeds of cattle were domesticated about 10,000 years ago to produce the distinct breed characteristics for milk or meat products from natural and human artificial selection (Bradley et al., 1999). During this history of artificial selection, mutations in genes that control important characteristics, such as high milk yield in modern dairy cows, have been selected to fixation. Hanwoo (Korean cattle) have become highly specialized for meat quality undergoing strong artificial selection (Yoon et al., 2008). Hanwoo have been intensively selected for marbling (intramuscular fat) through a progeny test in a breeding program since the 1930s. As a result of artificial selection, such as the Hanwoo progeny test, the breeding value for the marbling score could increase to 0.05 standard deviation (SD) in the Hanwoo population (Lee et al., 2011). Artificial selection might also affect genomic regions controlling Hanwoo marbling. Understanding the genetic mechanism leading to phenotypic differentiation requires identification of the genome regions that have been under long term artificial selection. This strong artificial selection will increase the frequency of favorable alleles at the loci affecting meat quality traits in the meat production breeds. In this process a small region of the genome surrounding the mutations is also selected, resulting in a small genome region that shows reduced variation. This region of reduced variation is referred to as a signature of selection that is identified by distributions of nucleotides around favorable mutations that differ statistically from that expected purely by chance (Kim and Stephan, 2002). Many methods have been developed for detection of selection signatures from genome analyses. Most of methods are used to compare the distribution of allelic frequencies by calculating population genetics statistics such as FST (Weir et al., 2005), linkage disequilibrium (Kim and Nielsen, 2004), Tajima (Tajima, 1989), Wu’s H-test (Fay and Wu, 2000) and the integrated Haplotype Score (iHS) (Voight et al., 2006), which is a method based on extended haplotype homozygosity (EHH) statistics (Sabeti, et al., 2002). By identifying the signatures of past selection and then identifying the functional genes and mutations involved, it is possible to identify the major genetic and metabolic pathways that control important agricultural characteristics of our modern breeds. When data are available from a large number of populations by large scale SNP data, the analysis can distinguish the genetic variation between similar populations. Searches for signatures of selection have successfully revealed many genes that are important in livestock. For example, the International Bovine HapMap project described a range of breeds that has been historically selected for different phenotypic traits. Hayes also proposed to identify divergently selected regions of the genome between dairy cattle and beef cattle breeds within Bos taurus (Hayes et al., 2009). MacEachern et al. reported the results of comparison of allelic frequencies between Australian Angus and Holstein cattle (MacEachern et al., 2009). The aim of this study is to perform a test for selection signatures within the Hanwoo population that share a similar phenotype and to detect divergently selected genomic regions. Analysis of within-population selection signatures indicate that at least some mutations, which have been differentially selected in Hanwoo, are still segregated within its population. Finally, positional candidate genes are determined in proximity to the genomic positions showing the most significant indication of selection.

MATERIALS AND METHODS

Animals and genotype assay

Carcass data and DNA samples for QTL analysis were obtained from 266 Hanwoo steers descending from 66 sires and unrelated dams (2 to 10 progeny per sire) from two NIAS experimental stations, Dae-Kwan-Ryoung and Nam-Won. Genomic DNA for genotyping assays was extracted from a blood sample and SeoLin Bioscience (Seoul, Korea) performed the SNP genotyping using the Affymetrix MegAllele GeneChip Bovine Mapping 10K SNP array (Affymetrix Inc., 2006). Three hundred steers were genotyped but 34 steers failed to genotype due to low DNA quality from phenol and chloroform contamination. Genotype data were received on 8,344 SNP and all those SNP were physically mapped to a chromosome (in bp) using the bovine genome sequence (Btau-3.1).

Analysis of SNP statistics

Genotypes were tested for Hardy-Weinberg equilibrium (HWE) to identify possible typing errors using a chi-square test in R/SNPassoc Package (R Development Core Team). SNP not in HWE (p<0.05), monomorphic SNPs and minor allele frequency (<1%) were removed in this study. Finally, a total of 8,344 SNPs, genotype data were received on 4,522 SNP.

Extended haplotype homozygosity (EHH)

The counting algorithm of Tang et al. (2007) was implemented for identifying differential extended haplotype homozygosity regions within Hanwoo (Korean cattle). A haplotype can be identified by patterns of SNPs. Haplotype maps can be used to determine complex genetic variations of inherited diseases or complex traits. For the proportion of homozygous individuals, EHHS,, at the ith and jth SNP were calculated in two steps. First, for each SNP, EHHS, between SNP and incrementally distant flanking SNP were calculated until EHHS, k<0; this was performed for both j>i and j Second, the extended haplotype homozygosity of SNP was calculated: iES = (EHHS,) for i_j_k for the region 3’ of i (or i_j_k for the region 5’ of i). Extended haplotype homozygous regions were plotted based on the standardized log-ratio of iES within Hanwoo. We calculated a standardized integrated extended haplotype homozygsity (iES(z)) value to identify significant regions of positive selection (p = 0.0001, z = 3.5).

Evolutionary analysis of genes in signatures of selection (SoS)

We identified genes within the signature of selection and obtained orthologous genes for the four species Homo sapiens, Mus musculus, Sus scrofa, Gallus gallus from the Ensembl Compara database (Flicek et al., 2008), which reports pairwise conserved synteny relations based on nucleotide alignments. Protein sequences of the orthologous genes were aligned with ClustalW. The protein sequence alignment and the corresponding coding sequences were converted into codon alignment using the pal2nal program (http://coot.embl.de/pal2nal). We obtained the orthologous sequences information for five genes in candidate regions. If amino acid changes are selectively neutral (i.e., mutations that are neither advantageous or deleterious), they will be fixed at the same rate as synonymous mutations and ω ratio (dN/dS) = 1. ω values>1 are taken to indicate that amino acid changes are accumulating at a faster rate than is acceptable under a neutral mutation model. That is to say, the rate of amino acid changes (dN) significantly exceeds the rate of synonymous changes (dS) at the DNA level. The codon-based likelihood model proposed by Goldman and Yang (Goldman and Yang, 1994) is implemented in the program codeml of PAML package (Yang, 1997). The program is useful for estimating synonymous and non-synonymous substitution rates (dN/dS). To estimate the dN/dS values, model 0 (M0, null model) with a single dN/dS value for all branches of the tree and model 2 (M2, positive selection) with the branches of interest were assumed to have different dN/dS ratios. We obtained a log-likelihood value for each model. We tested for positive selection by comparing twice the log-likelihood difference among models (M0 vs. M2) from a chi-square distribution with n-1 df, where n is the number of branches of the phylogeny in the LRT (Yang and Bielawski, 2000). If a significant p-value is obtained, it can be concluded that the positive selection model (M2) is the favored model. Next, models of variable selective pressures among amino acid sites were used to test for the presence of sites under positive selection. The four models (M1a, M2a, M7, M8) in the CODEML program of the PAML package were tested (Lynn et al., 2005).

RESULTS AND DISCUSSION

Identification of candidate genes in the positively selected region

Figure 1 shows the plot of iES scores for one strong candidate region identified by our genome-wide scan. We extended core regions in both directions up to 1.5 cM from a core SNP (rs29012432, 41.75 cM) and annotated a subset of genes in the core region. The total distance between the first point to the left and to the right of the core SNP is from 40.93 cM to 43.87 cM of BTA3. We also detected the core SNPs that were 20.79 to 21.84 Mb of BTA22. There is a clear clustering of high values into the region where some SNPs show evidence of selection. As a result, five genes were determined as putative targets of recent artificial selection as follows: diphthine synthase 5 (DPH5), sphingosine-1-phosphate receptor 1 (S1PR1), olfactomedin 3 (OLFM3), leucine-rich repeat neuronal protein 1 (LRRN1) and protein cereblon (CRBN). A summary statistics for positively selected regions presenting the highest values of the iES analysis is shown in Table 1.
Figure 1.

Genome wide extended haplotype homozygosity (EHH) profiling to detect signature of selection in Hanwoo.

Table 1.

Summary statistics of the integrated EHHS (iES) values for selection signature in candidate genes

ChromosomeCandidate regionCloset SNP name and position (bp)iES value
BTA3DPH5 (DPH5 homolog (S. cerevisiae))rs29020061 (40,929,695)7.21
S1PR1(sphingosine-1-phosphate receptor 1)rs29018907 (41,249,954)7.46
OLFM3(olfactomedin 3)rs29018230 (41,792,758)9.82
BTA12LRRN1(Leucine-rich repeat neuronal protein 1)rs29015171 (20,796,087)12.55
CRBN (Protein cereblon)rs29017072 (21,841,651)11.71
DPH5 encodes a component of the diphthamide synthesis pathway. Diphthamide is a post-translationally modified histidine residue found only on translation elongation factor 2 (EF2). EF2 affects adipocyte differentiation in lipid and energy metabolism with differences in protein synthesis (Bluher et al., 2004). OLFM3 is an olfatomedin-related protein that interacts with myocilin (Torrado et al., 2002), which is a major cause of glaucoma and may play a role in cytoskeletal function (Stone et al., 1997). Sensory systems have undergone major evolutionary changes in mammalian lineages. Some studies suggest that a subset of olfactory genes have been positively selected (Sharon et al., 1999; Clark et al., 2003). Animals require great olfactory performances for social communication. Therefore, this result indicates that genes associated with the sensory system are determined by species specificity in terms of evolution. We also observed a signal for the selection targeting aspects of lipid metabolism. S1PR1 is one of five G protein-coupled receptors (S1PR1–5) of sphingosine-1-phosphate (S1P) (Hannun and Obeid, 2008), which is a potent lipid mediator produced from the metabolism of sphingolipid by the actions of sphingosine kinase. High density lipoprotein (HDL) is stimulated from the binding of S1P in HDL with its receptors EDG1/S1P1 and EDG3/S1P3 (Kimura et al., 2003). Regulation of lipid synthesis and degradation is important in meat animals. The human aims to decrease total cholesterol and increases the HDL fraction that is known as “good” cholesterol (Tang et al., 2001). Expression differences in EDG1 have been previously reported between a high-marbled steer group and a low-marbled steer group in musculus longissimus muscle across all ages (Sasaki et al., 2006). Recently, SNP in 5′ flanking region of EDG1 was associated with marbling in Japanese black cattle population (Yamada et al., 2009). LRRN encodes a type I transmembrane protein with unknown function and is associated with neural development (Andreae et al., 2007). CRBN directly interacts with the α1 subunit of AMP-activated protein kinase (AMPK) and reduces the activation of AMPK (Lee et al., 2011). AMPK is known to activate fatty acid oxidation in skeletal muscle by activating PPARα and PGC1 (Lee et al., 2006). We also found a gene (EDEM1, ER degradation enhancer, mannosidase alpha-like 1) related to the adipogenesis. It is located (BTA22:19.03–19.05 Mb) near selected regions of BTA22: 20.79 to 21.84 Mb. EDEM1 is one of the ER stress markers that strongly correlate with total adiposity. Recently, fatty acids also induced ER stress in some cell lines (Wei et al., 2007). It has been suggested that as a marker of obesity in humans it increases adipocyte expression (Sharma et al., 2008). Table 2 summarizes the functions of the candidate genes showing evidence for selection signatures using Gene Ontology (http://www.geneontology.org/) and KEGG pathway (http://www.genome.jp/kegg/). This observation probably reflects the signal of a partial selective sweep and may be an ongoing process in its flanking region known as “hitchhiking”.
Table 2.

Gene Ontology and KEGG pathway of the candidate genes showing evidence for selection signatures

Candidate geneGO termKEGG pathway
DPH5Peptidyl-diphthamide biosynthetic process from peptidyl-histidine(GO:0017183)-
S1PR1Angiogenesis (GO:0001525), cell adhesion (GO:0007155), G-protein coupled receptor protein signaling pathway (GO:0007186), inhibition of adenylate cyclase activity by G-protein signaling pathway (GO:0007193), brain development (GO:0007420)Neuroactive ligand-receptor interaction
OLFM3Eye photoreceptor cell development (GO:0042462)-
LRRN1Integral to membrane (GO:0016021)-
CRBNNegative regulation of protein homooligomerization (GO:0032463), negative regulation of ion transmembrane transport (GO:0034766)-

Evidence of positive selection between species

We next implemented a further test to study interspecific divergence between bovine and the other species against the five candidate genes. It is important to know how these genes have been positively selected along bovine lineage with different selective pressures. The evolutionary forces operating on particular genes use the ratio of non-synonymous (dN) to synonymous (dS) substitution. To study differences in selection pressures, we conducted likelihood ratio tests comparing a one-ratio model to a two-ratio alternative model. Table 3 shows the results of the likelihood ratio test using different evolutionary models. Only one gene showed significant acceleration in the ω-ratio on the bovine lineage. For OLFM3, the two-ratio (ω = 0.12) models detected significant positive selection in bovine lineage (Figure 2). It suggests that OLFM3 is an accelerated protein evolution driven by positive selection or a relaxation of constraints. The branch shows evidence of positive selection. To identify particular codon sites subjected to positive selection in the gene, recommended site-specific models (NSsites = 1, 2, 7, 8) implemented in the PALM program were applied. We compared the lnL values from M1a, M2a, M7 and M8. M1a and M2a are neural models with ω fixed = 1 and selection model with ω fixed = 1, respectively. Model 7 uses a β-distribution of sites between the intervals ω = 0 and ω = 1. M8 adds an extra class of sites to the M7 model, allowing for sites with ω>1. If the ω -ratios for some sites are >1, sites with ith posterior probabilities for those sites are likely to be under positive selection. However, neither model (M1a vs M2a and M7 vs M8) detected site classes as significantly favored (data not shown). In other words, no particular codon (amino acid) sites are subjected to adaptive evolution. Among candidate genes, OLFM3 is also likely to have undergone adaptive evolution in the bovine lineage. Its role could be more essential in bovine lineage because this species maintained the complete functions.
Table 3.

Likelihood estimates of different evolutionary models (Model0 vs Model2)

Gene NamedN/dSDegree of freedomx2p-value
S1PR10.0511.93NS
DPH50.0910.18NS
OLFM30.12131.47<0.001*
LRRN10.0315.34NS
CRBN0.0511.21NS

Degree of freedom is the difference in the number of parameters between evolutionary models.

x2 is twice the difference of log likelihood between models.

p-value is the probability that two models should differ in log likelihood given the degree of freedom.

Figure 2.

Phylogeny of OLFM3. Branch lengths were estimated by maximum likelihood under the free-ratio model that assumes an independent ω-value for each branch. ω-values are shown for each branch.

The approach undertaken in the present paper will allow signatures of selection to be identified for the unique high intramuscular fat (marbling) of the Hanwoo breed under very intensive artificial selection pressure in the process of breeding programs. In the past 30 years, the body weight at 18 months of age increased from 331 to 574 kg. The average annual genetic gain for carcass traits and marbling was also 4.05 kg and 0.37 grade (1 to 7 grades). The annual genetic gain was also 0.02 to 0.82 kg/yr. As a result, it was assumed that the Hanwoo breed might achieve dramatically increased genetic improvement. This suggests that although Hanwoo have experienced recent selective pressure with a short divergence time, signatures of selection have been observed with a fitness advantage during the process of an artificial breeding program. However, this study has a limitation in that low density SNP data were used for identifying highly homozygous regions. In addition, we observed the putative signatures of selection with only the Hanwoo breed. Therefore, additional biological studies are necessary to identify putative selection signatures and differences between Hanwoo and other breeds. Robust results can be clearly observed and obtained by application of this method.
  30 in total

1.  Detecting a local signature of genetic hitchhiking along a recombining chromosome.

Authors:  Yuseob Kim; Wolfgang Stephan
Journal:  Genetics       Date:  2002-02       Impact factor: 4.562

2.  Endoplasmic reticulum stress markers are associated with obesity in nondiabetic subjects.

Authors:  Neeraj K Sharma; Swapan K Das; Ashis K Mondal; Oksana G Hackney; Winston S Chu; Philip A Kern; Neda Rasouli; Horace J Spencer; Aiwei Yao-Borengasser; Steven C Elbein
Journal:  J Clin Endocrinol Metab       Date:  2008-08-26       Impact factor: 5.958

3.  AMPK activation increases fatty acid oxidation in skeletal muscle by activating PPARalpha and PGC-1.

Authors:  Woo Je Lee; Mina Kim; Hye-Sun Park; Hyoun Sik Kim; Min Jae Jeon; Ki Sook Oh; Eun Hee Koh; Jong Chul Won; Min-Seon Kim; Goo Taeg Oh; Michung Yoon; Ki-Up Lee; Joong-Yeol Park
Journal:  Biochem Biophys Res Commun       Date:  2005-12-12       Impact factor: 3.575

4.  Statistical method for testing the neutral mutation hypothesis by DNA polymorphism.

Authors:  F Tajima
Journal:  Genetics       Date:  1989-11       Impact factor: 4.562

5.  A codon-based model of nucleotide substitution for protein-coding DNA sequences.

Authors:  N Goldman; Z Yang
Journal:  Mol Biol Evol       Date:  1994-09       Impact factor: 16.240

6.  A genomics approach to the detection of positive selection in cattle: adaptive evolution of the T-cell and natural killer cell-surface protein CD2.

Authors:  David J Lynn; Abigail R Freeman; Caitriona Murray; Daniel G Bradley
Journal:  Genetics       Date:  2005-03-31       Impact factor: 4.562

7.  Identification of a gene that causes primary open angle glaucoma.

Authors:  E M Stone; J H Fingert; W L Alward; T D Nguyen; J R Polansky; S L Sunden; D Nishimura; A F Clark; A Nystuen; B E Nichols; D A Mackey; R Ritch; J W Kalenak; E R Craven; V C Sheffield
Journal:  Science       Date:  1997-01-31       Impact factor: 47.728

8.  Role of insulin action and cell size on protein expression patterns in adipocytes.

Authors:  Matthias Blüher; Leanne Wilson-Fritch; John Leszyk; Palle G Laustsen; Silvia Corvera; C Ronald Kahn
Journal:  J Biol Chem       Date:  2004-05-06       Impact factor: 5.157

9.  A genome map of divergent artificial selection between Bos taurus dairy cattle and Bos taurus beef cattle.

Authors:  B J Hayes; A J Chamberlain; S Maceachern; K Savin; H McPartlan; I MacLeod; L Sethuraman; M E Goddard
Journal:  Anim Genet       Date:  2008-12-05       Impact factor: 3.169

10.  Statistical methods for detecting molecular adaptation.

Authors: 
Journal:  Trends Ecol Evol       Date:  2000-12-01       Impact factor: 17.712

View more
  5 in total

Review 1.  An interpretive review of selective sweep studies in Bos taurus cattle populations: identification of unique and shared selection signals across breeds.

Authors:  Beatriz Gutiérrez-Gil; Juan J Arranz; Pamela Wiener
Journal:  Front Genet       Date:  2015-05-13       Impact factor: 4.599

2.  Comparative Transcriptomic Analysis of the Pituitary Gland between Cattle Breeds Differing in Growth: Yunling Cattle and Leiqiong Cattle.

Authors:  Xubin Lu; Abdelaziz Adam Idriss Arbab; Zhipeng Zhang; Yongliang Fan; Ziyin Han; Qisong Gao; Yujia Sun; Zhangping Yang
Journal:  Animals (Basel)       Date:  2020-07-25       Impact factor: 2.752

3.  Differential Gene Expression in Longissimus Dorsi Muscle of Hanwoo Steers-New Insight in Genes Involved in Marbling Development at Younger Ages.

Authors:  Sara de Las Heras-Saldana; Ki Yong Chung; Hyounju Kim; Dajeong Lim; Cedric Gondro; Julius H J van der Werf
Journal:  Genes (Basel)       Date:  2020-11-21       Impact factor: 4.096

4.  Genomic Footprints in Selected and Unselected Beef Cattle Breeds in Korea.

Authors:  Dajeong Lim; Eva M Strucken; Bong Hwan Choi; Han Ha Chai; Yong Min Cho; Gul Won Jang; Tae-Hun Kim; Cedric Gondro; Seung Hwan Lee
Journal:  PLoS One       Date:  2016-03-29       Impact factor: 3.240

5.  A Meta-Assembly of Selection Signatures in Cattle.

Authors:  Imtiaz A S Randhawa; Mehar S Khatkar; Peter C Thomson; Herman W Raadsma
Journal:  PLoS One       Date:  2016-04-05       Impact factor: 3.240

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.