Literature DB >> 26260653

Impact of Parental Bos taurus and Bos indicus Origins on Copy Number Variation in Traditional Chinese Cattle Breeds.

Liangzhi Zhang1, Shangang Jia2, Martin Plath2, Yongzhen Huang2, Congjun Li3, Chuzhao Lei2, Xin Zhao2, Hong Chen4.   

Abstract

Copy number variation (CNV) is an important component of genomic structural variation and plays a role not only in evolutionary diversification but also in domestication. Chinese cattle were derived from Bos taurus and Bos indicus, and several breeds presumably are of hybrid origin, but the evolution of CNV regions (CNVRs) has not yet been examined in this context. Here, we of CNVRs, mtDNA D-loop sequence variation, and Y-chromosomal single nucleotide polymorphisms to assess the impact of maternal and paternal B. taurus and B. indicus origins on the distribution of CNVRs in 24 Chinese domesticated bulls. We discovered 470 genome-wide CNVRs, only 72 of which were shared by all three Y-lineages (B. taurus: Y1, Y2; B. indicus: Y3), whereas 265 were shared by inferred taurine or indicine paternal lineages, and 228 when considering their maternal taurine or indicine origins. Phylogenetic analysis uncovered eight taurine/indicine hybrids, and principal component analysis on CNVs corroborated genomic exchange during hybridization. The distribution patterns of CNVRs tended to be lineage-specific, and correlation analysis revealed significant positive or negative co-occurrences of CNVRs across lineages. Our study suggests that CNVs in Chinese cattle partly result from selective breeding during domestication, but also from hybridization and introgression.
© The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Entities:  

Keywords:  CNV; Y-chromosomal SNPs; mtDNA D-loop; parental origin

Mesh:

Substances:

Year:  2015        PMID: 26260653      PMCID: PMC4558867          DOI: 10.1093/gbe/evv151

Source DB:  PubMed          Journal:  Genome Biol Evol        ISSN: 1759-6653            Impact factor:   3.416


Introduction

Cattle have a long history of domestication and selective breeding (Bradley et al. 1996). Several cattle genomes have been published in recent years (Elsik et al. 2009; Canavez et al. 2012), and so systematic investigations of genomic changes accompanying the domestication process—such as breed-specific genomic differentiation, hybridization, and introgression—have become feasible, including the identification of genome-wide single nucleotide polymorphisms (SNPs) (Decker et al. 2009; Gibbs et al. 2009; Jansen et al. 2013; McTavish et al. 2013) and copy number variations (CNVs) (Liu et al. 2010; Pollinger et al. 2010). Genic CNV regions (CNVRs) have the potential to affect phenotypes through the disruption of gene dosages, unmasking of recessive alleles, and loss of regulatory elements or regulatory polymorphisms (Weischenfeldt et al. 2013). Furthermore, gene duplications played a role in the evolution of sexual dimorphism (Novembre and Ramachandran 2011) and facilitated gene neofunctionalization or subfunctionalization during evolutionary diversification (Adler et al. 2014). Several investigations of cattle CNVs have been conducted, focusing mostly on breed-specific patterns of CNV (Bae et al. 2010; Fadista et al. 2010; Liu et al. 2010; Hou et al. 2011; Stothard et al. 2011; Zhan et al. 2011; Bickhart et al. 2012; Jiang et al. 2012, 2013). However, modern breeding practices commonly used in the cattle industry include artificial insemination and hybridization of breeds (thereby introducing more variation in CNVs), for example, when importing foreign excellent cattle sperm for insemination of Chinese cows (Li, Xie, et al. 2013). This highlights the necessity for additional, comparative investigations of lineage-specific patterns of CNVs. Moreover, artificial selection during the domestication process or in modern breeding programs leads to rapid genomic evolution (Novembre and Ramachandran 2011; Parsch and Ellegren 2013), and it remains imperative to understand the links between domestication history and artificial selection, genomic changes (including CNVs), and phenotypic divergence in production animals (Kijas et al. 2009; Novembre and Ramachandran 2011). Selection for phenotypic traits associated with high CNV of certain genomic regions, for example, could help explain why some CNVs are present in domestic animals despite the predicted loss of redundant gene copies with time (Liu et al. 2010; Zhang et al. 2014). With 28 recognized breeds, China is particularly rich in genomic resources for research on cattle (Qiu et al. 1988). The high degree of phenotypic and genomic differences between breeds is partly due to a long domestication history and adaptation to different local/climatic conditions throughout China (Chan et al. 2010; Jansen et al. 2013). Another important source of variation is the origin of breeds from either Bos taurus (taurine) or Bos indicus (indicine cattle): Phylogeographic studies on sequence variation of maternally (mitochondrial) and paternally inherited (Y chromosomal) markers found Chinese cattle to be divided into two main clades (taurine and indicine), the ancestors of which likely originated from the Near East and the Indian subcontinent, respectively (MacHugh et al. 1997). Today, taurine cattle are dominant in northern China and indicine cattle in southern China (Chen and Qiu 1993; Lai et al. 2006; Lei et al. 2006; Jia et al. 2007, 2010). Analysis of Y-chromosome SNPs identified three haplotypes, namely Y1 (taurine origin), Y2 (taurine) and Y3 (indicine) (Götherström et al. 2005), and subsequent investigations using Y-SNPs and Y-STRs confirmed that Y2 dominated in the north (91.4%) and Y3 in the south of China (90.8%) (Li, Zhang, et al. 2013). A number of studies also suggest that hybridization and introgression of taurine and indicine cattle occurred, especially in central parts of China, but inferences were mostly made on the basis of mtDNA and Y-chromosomal information only (Lai et al. 2006; Jia et al. 2010; Li, Xie, et al. 2013). Y-chromosomes and mtDNA, however, generally lack recombination and thus, are of limited use to unravel patterns of genome evolution after hybridization and artificial selection (McTavish et al. 2013). With an increasing number of genomic data sets being published every year, genome-wide markers are increasingly utilized to analyze the evolutionary/genomic histories (sometimes including domestication effects) not only of model species such as Drosophila fruit flies (Emerson et al. 2008), humans (Novembre and Ramachandran 2011) and chimpanzee (Gatto et al. 2006), but also of nonmodel organisms (Qu et al. 2013) and increasingly domestic animals: Cattle (MacHugh et al. 1997; Gibbs et al. 2009; McTavish et al. 2013), sheep (Kijas et al. 2009), dogs (Pollinger et al. 2010), horses (McCue et al. 2012; Metzger et al. 2013), and pig (Li, Tian, et al. 2013). In cattle, SNPs have been applied to study their genomic diversity and to make inferences about their domestication history, and a recent study corroborated crossbreeding of taurine and indicine cattle in central Asia (Decker et al. 2014). However, to the best of our knowledge, breed-specific differences in CNV, and especially the evolution of CNVRs after hybridization between taurine and indicine cattle, have not yet been addressed. In our present study, we inferred the origins (taurine or indicine) of 24 Chinese bulls from 12 different breeds (supplementary table S1, Supplementary Material online) based on Y-chromosomal SNPs (Ginja et al. 2009) and mtDNA D-loop sequence variation (Jia et al. 2010) and reanalyzed a genome-wide CNV data set generated by means of microarray-based comparative genomic hybridization (array CGH) (Zhang et al. 2014). Simultaneously considering the maternal and paternal origins of those breeds allowed interpreting breed-specific differences in CNVR in light of their domestication history that involved not only prolonged artificial selection but also hybridization between distant lineages (Lai et al. 2006; Jia et al. 2010).

Materials and Methods

Sample Collection

We collected blood samples of n = 24 bulls from 12 typical and common cattle breeds throughout China (supplementary table S1 and fig. S1, Supplementary Material online): Anxi (AX), Bohaihei (BH), Chinese Holstein (HD), Jiaxian (JX), Jinnan (JN), Hainan (HN), Luxi (LC), Mongolian (MG), Nanyang (NY), Qinchuan (QQ), Wannan (WN), and Zaosheng cattle (ZS). Of these, MG, AX, and ZS stemmed from the northern range of China, HN and WN from southern parts, whereas the others came from central China (supplementary fig. S1, Supplementary Material online). BH is the only black breed and HD is the main dairy cow breed in China. For our quantitative real-time polymerase chain reaction (qPCR) approach, we additionally collected five pure Angus bulls (AG)—an introduced breed—from Shaanxi Province as reference samples of confirmed taurine origin. Sample collection was carried out in accordance with the ethical guidelines approved by the Animal Care Commission of the College of Animal Science and Technology, Northwest A & F University. Genomic DNA was extracted (Sambrook and Russell 2001) and purified from whole blood using the DNA purification kit (Plus Minipreps DNA Purification System; Promega, Beijing, China), and quantified using spectrophotometry and agarose gel electrophoresis.

Y-Chromosomal and mtDNA (D-loop) Haplotyping

We determined Y-chromosomal haplotypes of the 24 bulls according to previously published protocols (Li, Xie, et al. 2013; Li, Zhang, et al. 2013). In brief, two primer pairs were used for PCR amplification (supplementary table S2, Supplementary Material online), and after purification PCR products were Sanger-sequenced by Sangon Biotech (Shanghai, China). We distinguished Y1 from Y2- and Y3-haplotypes based on the SNP (C/A, position 423 in AY936543) of UTY-19, and Y3 from Y1 and Y2 by using the SNP (T/C, position 655 in AF241271) of ZFY-10. Maternal haplotypes of 19 bulls were determined by sequencing the entire D-loop (supplementary table S2, Supplementary Material online, for primers), and sequences were aligned using ClustalW (Larkin et al. 2007). The clades of taurine and indicine origins inferred by means of a maximum-likelihood tree following the Tamura–Nei model (Tamura and Nei 1993) with 1,000 bootstrap replicates calculated using MEGA6 (Tamura et al. 2013), and the general time reversible (GTR) substitution model (Gatto et al. 2006) in MrBayes v.3.2 (Ronquist et al. 2012). For another n = 5 individuals, no information could be retrieved as we ran out of DNA isolate as a result of the CNV analyses.

Array CGH Platform and Data Analysis

The 24 Chinese bulls were included in our CGH array approach described elsewhere (Zhang et al. 2014), and one Angus bull was used as the reference sample. CNV calling for copy number gains and losses was based on log2 signal intensity (Olshen et al. 2004): Segments with mean |log2 ratios| ≥ 0.5 and covering at least five consecutive probes were considered to represent CNVRs. Chromosome (i.e., karyogram) plotting, heat map drawing, indicator analysis (De Cáceres et al. 2012), and nonmetric multidimensional scaling (NMDS) were carried out using the R-packages ggbio, heatmap.2, indicspecies, and vegan, respectively. Principal component analysis (PCA) was performed in STAMP v.2.02 (Parks and Beiko 2010). A phylogenetic tree was constructed based on binary data (CNVR present/absent) in MrBayes v.3.2 (Ronquist et al. 2012) using the GTR substitution model (Gatto et al. 2006).

Validation of CNVRs by Using qPCR

We performed qPCR analysis to validate copy number differences detected by our CGH approach on CFX-96 Real-Time PCR Detection System (Bio-Rad). The primers were designed by using Beacon Designer (PREMIER Biosoft, USA) or from the previous paper (Fadista et al. 2010; Zhang et al. 2014) (supplementary table S5, Supplementary Material online). Two control primers (Primer1 and Primer7, in supplementary table S5, Supplementary Material online), which served as an internal standard, were coamplified with the corresponding test primers. PCR reaction was done in a volume of 20 µl containing 20 ng of genomic DNA, 0.4 µM of each primer, and SYBR Premix Ex Taq II reagents (TaKaRa Biotechnology, Dalian, China). Three replicate reactions were performed for primer pairs, and a comparative CT method was used to calculate the copy number. The ΔΔCt values were determined by comparing test samples and Angus reference (two-copy state) with internal control. Finally, the relative copy number for each sample was calculated as 2−ΔΔCt (Zhang et al. 2014).

Results

Identifying Y-Chromosomal and mtDNA Lineages

Distinguishing taurine from indicine cattle based on morphological characteristics can lead to misclassifications, and Y-chromosomal SNPs in combination with mitochondrial sequence variation are widely accepted for exploring the paternal and maternal origins of cattle (Ginja et al. 2009; Jia et al. 2010; Li, Zhang, et al. 2013). By combining SNPs of the Y-chromosomal UTY and ZFY genes, we could unambiguously assign n = 24 bulls to two taurine Y-chromosomal haplotypes (Y1, n = 2; Y2, n = 13) and an indicine haplotype (Y3, n = 9; supplementary table S2, Supplementary Material online). Likewise, we analyzed the maternal origins of the studied bulls in a Bayesian phylogenetic approach, based on Sanger-sequenced mtDNA (D-loop) amplicons, which divided the samples into two major clades (i.e., taurine, n = 8; indicine, n = 11; fig. 1a). Combining information from both analyses on the taurine/indicine clustering, we inferred 11 samples to have unambiguous taurine or indicine origin, that is, both parents had the same origins (supplementary table S2, Supplementary Material online). In eight samples from five central Chinese breeds (BH, JN, JX, NY, and QQ), however, we found a mismatch between paternal and maternal information and classified those bulls as hybrids (supplementary table S2, Supplementary Material online). In summary, we confirmed northern breeds to be of taurine origin, southern breeds to be of indicine origin, whereas central Chinese breeds appear to be of hybrid origin (supplementary fig. S1, Supplementary Material online).
F

Phylogenetic trees of Chinese bulls derived from (a) mitochondrial (D-loop) sequence variation and (b) CNVR information (present/absent). Putative hybrids of taurine and indicine cattle (based on the D-loop and Y-chromosomal SNPs) are marked with “*”. Posterior probabilities/bootstrap values are shown if >0.5 or >50.

Phylogenetic trees of Chinese bulls derived from (a) mitochondrial (D-loop) sequence variation and (b) CNVR information (present/absent). Putative hybrids of taurine and indicine cattle (based on the D-loop and Y-chromosomal SNPs) are marked with “*”. Posterior probabilities/bootstrap values are shown if >0.5 or >50.

CNVRs in Different Clades of Cattle

Emerson et al. (2008) demonstrated that the genome-wide distribution of CNVs varied significantly among genomic regions of Drosophila melanogaster. In our present study, we identified 356 CNVRs in the n = 24 examined bulls that could be mapped to chromosomes 1 through 29, the X-chromosome and the mitochondrion, amounting to 38.8 Mb (i.e., 1.33% of the 2,918.1 Mb Btau_4.0 cattle genome [Elsik et al. 2009]; fig. 2). When including the 114 unassigned sequences, the total number of 470 discovered CNVRs (62.1 Mb) corresponds to 2.13% of the cattle genome. We inferred 314 sequence losses, 112 gains, and 44 cases of combined losses and gains within the same CNVR (table 1 and supplementary table S3, Supplementary Material online). Eighty-two CNVRs were unique, that is, found in only one individual, whereas 388 CNVRs were shared by at least two individuals, 48 of which had a frequency ≥0.5 (table 1).
F

Distribution of different CNVRs across chromosomes (excluding the mitochondrion) in the cattle genome. Y2-individuals are shown above the chromosomes in green (gains), red (losses) and dark blue (both), whereas below chromosomes, information for Y3-individuals (same colors as for Y2) and Y1-bulls (purple: gains, yellow: losses, black: both) is given.

Table 1

Summary Statistics for CNV in Chinese Bulls (n = 24)

Paternal HaplotypeSample Size (n)CountUniqueGainsLossesBothTotal Length (kb)
Y1 (taurine)269 (37.0)55 (37.5)30 (15.0)38 (19.0)1 (0.5)10,497 (152)
Y2 (taurine)13337 (25.9)87 (6.7)90 (6.9)218 (16.8)29 (2.2)44,785 (133)
Y3 (indicine)9251 (27.9)47 (5.2)65 (7.2)178 (19.8)8 (0.9)32,895 (131)
Total24470 (19.6)82 (3.4)112 (4.7)314 (13.1)44 (1.8)62,073 (132)

Note.—Numbers in parentheses are normalized by sample size or CNVR counts.

Distribution of different CNVRs across chromosomes (excluding the mitochondrion) in the cattle genome. Y2-individuals are shown above the chromosomes in green (gains), red (losses) and dark blue (both), whereas below chromosomes, information for Y3-individuals (same colors as for Y2) and Y1-bulls (purple: gains, yellow: losses, black: both) is given. Summary Statistics for CNV in Chinese Bulls (n = 24) Note.—Numbers in parentheses are normalized by sample size or CNVR counts. We found a significant difference in the composition of the 469 CNVRs (excluding one CNVR in mtDNA) in our n = 24 samples (Cochran’s Q-test: Q23, 468 = 135.4, P = 2.2 × 10−16). When considering the three Y-chromosomal haplotypes, CNVRs were more frequent in Y2- and Y3-haplotypes than in Y1-bulls (table 1 and supplementary fig. S2, Supplementary Material online). We compared the extent of CNVR-sharing and found that 105 CNVRs were shared by Y1 (105/125; 84%) and Y2 (105/390; 27%), 254 by Y2 (254/390; 65%) and Y3 (254/324; 78%), 83 by Y1 (83/125; 66%) and Y3 (83/324; 26%), and 72 by all three Y-haplotypes (supplementary fig. S2, Supplementary Material online). We considered the paternal (and maternal) taurine/indicine origins of the examined bulls and found 265 (and 228) CNVRs to be shared by taurine and indicine cattle, 145 (and 60) to be unique to taurine and 59 (and 166) to indicine cattle (supplementary fig. S2 and , Supplementary Material online).

Phylogenetic Analysis Based on CNVRs

To shed light on the potential evolutionary contributions of CNVs to the formation of major cattle breeds in China, we conducted Bayesian phylogenetic analysis of the different breeds based on CNVR information. Individuals of the same breed clustered closely in the phylogenetic tree (fig. 1b), which was corroborated by sample correlation plot analysis (supplementary fig. S3, Supplementary Material online). A geographic pattern emerged in which the northern (taurine) samples (MG and AX cattle) clustered together and were separated from others; a similar pattern was apparent for a southern cluster (WN, HN, and LC breeds) of indicine decent (fig. 1b and supplementary fig. S3, Supplementary Material online). Analysis of variance (ANOVA) confirmed that the grouping of taurine and indicine cattle was statistically significant; that is, ANOVA yielded significant results for the comparisons of Y1 + Y2 versus Y3 (F1, 468 = 6.49, P = 0.011), and Y1- versus Y2- versus Y3-haplotypes (F2, 468 = 6.65, P = 0.0013). The position of putative hybrid samples in the CNVR phylogenetic tree (fig. 1b) was partly in accordance with their inferred maternal (e.g., BH803 in the taurine clade and NY9172 in the indicine clade) or paternal origins (e.g., JN2 and QQ63307 in the taurine clade). In the eight hybrids, six samples clustered in the taurine clade of the CNVR-based tree, but in the indicine clade in the mtDNA (D-loop)-based tree. This suggests that introgression of indicine into taurine cattle in central Chinese breeds was mainly female-mediated, involving indicine cows and taurine bulls.

PCA to Detect Hybridization and Introgression

In another approach to explore the effects of hybridization among breeds, we performed PCA using CNVR information. Plotting the samples according to their paternal Y-chromosomal or maternal mt-haplotype uncovered intermediate positions of putative hybrid individuals (marked with * in fig. 3) between the taurine and indicine clusters. The central Chinese QQ breed, for example, has recently received foreign (taurine) sperm during artificial insemination (Li, Xie, et al. 2013), which explains why QQ63307 (with an indicine maternal background) had a large number of Y1-specific CNVRs and, as an exception among the hybrid bulls, was placed in the Y1/Y2 cluster (figs. 1b and 3 and supplementary fig. S4, Supplementary Material online). It is interesting to note that JN16 with indicine parents (maternal type: indicine; paternal type: Y3) clustered in the taurine group. Overall, our results were in good agreement with our a priori predictions regarding the distinctness of northern and southern breeds as well as ongoing hybridization and introgression in central parts of China and, more generally, validate the use of CNVR information to study the domestication history and evolution of cattle breeds.
F

PCA of CNVRs. (a) Plotting split by Y-chromosomal haplotypes, and (b) split into maternal groups of taurine (T) and indicine (I) origins. NA, information not available.

PCA of CNVRs. (a) Plotting split by Y-chromosomal haplotypes, and (b) split into maternal groups of taurine (T) and indicine (I) origins. NA, information not available.

NMDS to Identify Lineage-Specific CNVRs

To examine the contribution of lineage-specific CNVRs to the observed differentiation between breeds in more detail, we applied a powerful statistical approach derived from community ecology, namely NMDS (Dixon 2003), which visualizes similarity between individuals in complex data sets. Congruent with the results from phylogenetic analyses and PCA, samples were generally divided into two groups of taurine and indicine cattle (supplementary fig. S4 b, Supplementary Material online). We used indicator analysis (De Cáceres et al. 2012) to detect CNVRs that were statistically more abundant in groups than expected by chance (table 2 and supplementary table S4, Supplementary Material online) and thus identified 63 indicator CNVRs. Twenty of these corresponded to specific paternal Y-haplotypes, 24 to the paternal taurine/indicine grouping, 9 to maternal taurine/indicine haplotypes, and 40 to CNVR-clades of taurine and indicine origin as inferred in our Bayesian phylogenetic analysis (fig. 1b; P < 0.05 in all cases). Finally, rerunning the NMDS using only indicator CNVRs resulted in a much stronger clustering (supplementary fig. S4, Supplementary Material online).
Table 2

Lineage-Specific CNVRs in Paternal (Y-Chromosomal) and Maternal (mtDNA) Haplotypes

CNVRsFrequency in Groups
Significance (Indicator Analysis)
Y1Y2PTPIMTMICTCI
CNVR680.000.310.270.780.000.640.250.88Y3*, PI*, MI**, CI**
CNVR990.000.000.000.330.000.180.000.38Y3*, PI*, CI*
CNVR1620.000.000.000.330.000.180.000.38Y3*, PI*, CI*
CNVR3550.000.000.000.330.000.180.000.38Y3*, PI*, CI*
CNVR4690.000.000.000.440.250.180.190.13Y3*, PI**
CNVR2130.000.000.000.330.130.180.060.25Y3*, PI*
CNVR3080.000.000.000.440.000.270.060.38Y3*, PI*
CNVR4680.000.000.000.330.130.180.060.25Y3*, PI*
CNVR4090.000.310.270.890.250.550.310.88Y3**, PI**, CI*
CNVR100.000.150.130.780.380.360.250.63Y3**, PI**
CNVR3260.000.540.470.780.380.640.381.00Y2 + Y3*, CI**
CNVR1170.000.920.801.000.750.820.811.00Y2 + Y3**
CNVR1260.000.850.730.780.750.730.631.00Y2 + Y3*
CNVR561.000.770.800.220.880.270.880.00Y1 + Y2**, PT*, CT***
CNVR2091.000.620.670.000.380.270.560.13Y1 + Y2***, PT**
CNVR3951.000.380.470.110.500.180.500.00Y1*, CT*
CNVR4161.000.380.470.000.500.270.440.00Y1**, PT*, CT*
CNVR411.000.310.400.000.380.090.380.00Y1**
CNVR1721.000.230.330.110.380.090.380.00Y1*
CNVR4401.000.380.470.220.630.360.500.13Y1*
CNVR960.500.380.400.220.630.180.500.00CT*
CNVR1650.500.460.470.110.380.270.500.00CT*
CNVR120.500.770.730.110.750.270.750.00PT**, CT**
CNVR1230.500.380.400.000.130.270.380.00PT*
CNVR2000.000.540.470.000.380.180.310.25PT*
CNVR2271.000.920.930.440.750.640.880.50PT*
CNVR2240.000.230.200.670.000.550.130.88PI*, MI*, CI***
CNVR760.000.080.070.560.130.270.060.63PI*, CI**
CNVR3890.500.000.070.560.000.360.060.63PI**, CI**
CNVR3220.000.080.070.670.130.360.190.50PI**
CNVR260.500.310.330.780.380.450.380.75PI*
CNVR1460.000.000.000.330.130.090.060.25PI*
CNVR3790.500.620.601.000.750.730.690.88PI*
CNVR750.500.460.470.890.380.820.441.00MI*, CI**
CNVR1700.500.540.530.890.380.910.501.00MI**, CI*
CNVR270.000.310.270.560.000.450.310.50MI*
CNVR600.000.380.330.330.000.450.190.63MI*
CNVR1990.000.230.200.440.000.360.190.50MI*
CNVR3380.500.310.330.440.000.450.310.50MI*
CNVR4210.000.310.270.220.000.450.130.50MI*
CNVR2280.000.310.270.670.130.550.131.00CI***
CNVR2540.000.150.130.330.000.360.000.63CI***
CNVR3660.000.150.130.560.000.360.060.75CI***
CNVR3990.000.460.400.670.250.550.251.00CI***
CNVR4050.000.380.330.780.250.550.251.00CI***
CNVR620.000.080.070.330.000.270.000.50CI**
CNVR950.000.460.400.780.380.550.311.00CI**
CNVR3070.000.080.070.440.000.270.000.63CI**
CNVR3310.000.080.070.440.000.270.000.63CI**
CNVR200.000.080.070.220.000.180.000.38CI*
CNVR230.500.690.670.780.750.640.561.00CI*
CNVR380.000.150.130.110.000.270.000.38CI*
CNVR460.000.150.130.110.000.270.000.38CI*
CNVR470.000.080.070.220.000.180.000.38CI*
CNVR690.000.150.130.330.000.270.060.50CI*
CNVR2210.000.080.070.220.000.270.000.38CI*
CNVR3230.000.080.070.220.000.180.000.38CI*
CNVR3441.000.380.470.890.380.730.441.00CI*
CNVR3710.000.230.200.220.130.360.060.50CI*
CNVR3970.000.380.330.560.380.450.250.75CI*
CNVR4070.500.620.600.890.880.640.561.00CI*
CNVR4380.000.380.330.560.250.450.250.75CI*
CNVR4610.000.150.130.330.130.270.060.50CI*

Note.—Y-haplotypes indicate paternal taurine (PT; Y1, Y2) or indicine origins (PI; Y3). Mt-haplotypes indicate maternal taurine (MT) or indicine (MI) origins. Also given are inferred CNVR-clades for taurine (CT) or indicine bulls (CI).

*P < 0.05, **P < 0.01, ***P < 0.001.

Lineage-Specific CNVRs in Paternal (Y-Chromosomal) and Maternal (mtDNA) Haplotypes Note.—Y-haplotypes indicate paternal taurine (PT; Y1, Y2) or indicine origins (PI; Y3). Mt-haplotypes indicate maternal taurine (MT) or indicine (MI) origins. Also given are inferred CNVR-clades for taurine (CT) or indicine bulls (CI). *P < 0.05, **P < 0.01, ***P < 0.001.

Validation of Selected CNVRs by Quantitative Real-Time PCR

Five CNVRs were selected to validate our CGH results. CNVR117 and CNVR209 were chosen as examples of lineage-specific CNVRs, whereby CNVR117 and CNVR226 had already been examined in previous studies, and data were reanalyzed in this study (Fadista et al. 2010; Zhang et al. 2014), and CNVR217 and CNVR283 were chosen as examples of CNVRs with low and high frequencies of occurrence, respectively (table 2). We designed five primer pairs for qPCR amplification, and two primer pairs for our internal reference (supplementary table S5, Supplementary Material online). In all five assays, qPCR confirmed the patterns predicted from our prior analyses on CNV (see above), demonstrating the reliability of our results (supplementary table S5, Supplementary Material online). We selected CNVR117 for additional validation of its specificity to bulls showing the Y1-haplotype (see results from our indicator analysis) using a larger sample size (n = 29) including n = 5 Angus bulls of known taurine (Y1) origin (supplementary table S2, Supplementary Material online), and found all Y1-bulls to shown more copy numbers than Y2 or Y3-bulls (supplementary table S6 and fig. S5, Supplementary Material online). This result underscores the validity of using CNVR117 to distinguish Y1 from Y2 or Y3-bulls.

Co-Occurrence of Different CNVRs

We investigated positive (Spearman’s correlation coefficient rs ≥ 0.6, P < 0.01) and negative links (rs ≤ −0.6, P < 0.01) between different CNVRs, as statistically significant links may be indicative of correlated selection on different genomic regions in the different breeds, as well as random genetic drift and subsequent breed-specific accumulation of certain CNVR combinations. Significant positive and negative links decreased strongly at CNVRs with prevalence greater than 3/24 (i.e., CNVRs present in more than 3 of the 24 bulls; supplementary table S7 and fig. S6, Supplementary Material online), which likely reflects our sample of n = 1–3 individuals per breed and implies that most “links” are driven by breed-specific CNVR patterns (i.e., the contribution of single or few breeds led to the statistically significant results). Likewise, some breed/group-specific CNVRs were not found in other breeds/groups, resulting in negative links. However, we identified 20 positive and 6 negative links for CNVRs with a prevalence ≥0.5 (supplementary table S8 and fig. S6, Supplementary Material online), which cannot be explained by the contributions of specific CNVR combinations in single breeds. These links partly reflect different origins from taurine or indicine ancestors; for example, all CNVRs with a frequency ≥0.5 that showed negative links were connected to CNVR56, which was specific to taurine cattle (see also table 2).

Discussion

Strictly paternally (Y-chromosomal) (Cai et al. 2006; Li, Xie, et al. 2013) and maternally inherited (mitochondrial) markers (Lai et al. 2006; Lei et al. 2006; Jia et al. 2010), both of which largely lack recombination, are well established to infer the paternal and maternal origins of cattle, respectively. Besides the two genes analyzed here, previous studies used sequence variation of the Y-chromosomal genes USP9Y, ZFY, DDX3Y, and UTY to determine the paternal origins of cattle breeds (Götherström et al. 2005; Li, Xie, et al. 2013; Li, Zhang, et al. 2013); however, our present study demonstrates that SNPs of the two genes ZFY and UTY were sufficient to unambiguously assign the 24 bulls studied herein to three paternal haplotypes (Y1, Y2, and Y3)—in several cases (like Angus bulls) matching our a priori predictions. A previous study reported breed-specific differences in mtDNA D-loop sequence variation using a much larger data set of Chinese cattle and detected four taurine (T1A, T2, T3, and T5) and two indicine haplotypes (I1 and I2) (Jia et al. 2010). Our present study made use of the discriminatory power of D-loop sequence variation to determine the maternal origins of breeds, and our combined analysis not only identified two clades of taurine and indicine origins but also confirmed hybrid status of eight bulls stemming from Central China (Lei et al. 2006). Crossbreeding of taurine and indicine in Chinese cattle breeds occurred historically (Chen and Qiu 1993) and is likely to increase in the future as regional interconnectivity increases. Hybridization is expected to introduce additional variation into cattle genomes, for example, due to recombination and assortment, and our different analyses on CNVRs (like PCA and NMDS) support this view, as a considerable degree of overlap between taurine and indicine groups suggests admixture of B. taurus and B. indicus. Y-chromosomal and mtDNA markers are of limited use for reconstructing the evolutionary and/or domestication history of a given species as they do not allow tracing complex patterns of genome evolution (Ginja et al. 2009). Therefore, traditional methods, based on Y-chromosomal and mtDNA sequence variation, failed to unravel evolutionary processes shaping cattle genomes, although those markers are invaluable to detect crossbreeding of cattle breeds (Cai et al. 2006; Lai et al. 2006; Ginja et al. 2010). Our present study exemplifies the utility of genome-wide CNVRs, determined through whole-genome CGH arrays, to study the domestication history and genome evolution of cattle breeds, and our results are in good agreement with the results of a recent study using genome-wide SNPs (Decker et al. 2014). Given financial and time constraints of population-wide cattle genome sequencing, the method presented in this study offers a convenient and useful novel approach that provides a high number of polymorphic positions over the entire genome and thus represents an elegant alternative to other approaches based on SNPs, or microsatellite length polymorphisms. Our strategy of combining the analysis of CNVRs with analyses of the paternal and maternal origins of cattle enabled us to trace signatures of mosaic genomic evolution (O’Brien et al. 2014). For example, an interesting finding in our present study was that one bull (JN16), which was inferred to be of indicine paternal and maternal origin, was placed into the taurine clade when analyzing CNVRs (fig. 1b). This finding suggests that complex crossbreeding and backcrossing after hybridization of the two lineages—possibly in combination with selection for preferable phenotypic traits associated with CNVRs—may affect patterns of genomic introgression. Based on our observations of lineage-specific distribution patterns and (positive and negative) co-occurrences of different CNVRs, we argue that our findings at least in part reflect selection during the domestication history of cattle. Signatures of domestication and artificial selection have left their imprints in the cattle genome, as determined through SNPs (Kijas et al. 2009; Habier et al. 2010). The Y1-haplotype is nowadays dominant in north-central Europe, and the Y2-haplotype prevails in central Europe (Edwards et al. 2011), whereas in Chinese breeds, Y2 and Y3 dominate in the north and south, respectively (Li, Zhang, et al. 2013). A previous study found the Y1-haplotype to originate from European aurochs, specifically, from introgression after hybridization with local domestic cattle (Götherström et al. 2005). Our results indicate that some CNVRs tend to be specific to (or to occur at a higher frequency in) haplotype lineages or taurine/indicine groups, which could be interpreted as a signature of divergent selective regimes during their domestication history, even though genetic drift and random fixation of CNVRs certainly also need to be considered. For example, using a larger sample size we confirmed that CNVR117 was specific of Y1-bulls. European aurochs, from which the Y1-haplotype is derived, may have conferred higher copy numbers of CNVR117 compared with the Y2 and Y3 groups, and future studies will need to elaborate on the question of how this variation affects phenotypic traits and whether and how selection for those traits may be responsible for higher copy numbers of CNVR117 in Y1-cattle. Linking genomic structural variation to phenotypic differences and physiological performance remains a major challenge, and so answering the question of whether and how artificial selection may have affected CNVR evolution during the domestication of cattle will greatly benefit from future studies trying to link CNV to individual’s development, physiology, behavior, and morphology.

Supplementary Material

Supplementary figures S1–S6 and tables S1–S8 are available at Genome Biology and Evolution online (http://www.gbe.oxfordjournals.org/).
  51 in total

1.  The evolution of tropical adaptation: comparing taurine and zebu cattle.

Authors:  E K F Chan; S H Nagaraj; A Reverter
Journal:  Anim Genet       Date:  2010-10       Impact factor: 3.169

2.  A new insight into cattle's maternal origin in six Asian countries.

Authors:  Shangang Jia; Yan Zhou; Chuzhao Lei; Ru Yao; Zhiying Zhang; Xingtang Fang; Hong Chen
Journal:  J Genet Genomics       Date:  2010-03       Impact factor: 4.275

3.  Genetic diversity and origin of Chinese cattle revealed by mtDNA D-loop sequence variation.

Authors:  Song-Jia Lai; Yi-Ping Liu; Yan-Xing Liu; Xue-Wei Li; Yong-Gang Yao
Journal:  Mol Phylogenet Evol       Date:  2005-07-28       Impact factor: 4.286

4.  New World cattle show ancestry from multiple independent domestication events.

Authors:  Emily Jane McTavish; Jared E Decker; Robert D Schnabel; Jeremy F Taylor; David M Hillis
Journal:  Proc Natl Acad Sci U S A       Date:  2013-03-25       Impact factor: 11.205

5.  Mitochondrial diversity and the origins of African and European cattle.

Authors:  D G Bradley; D E MacHugh; P Cunningham; R T Loftus
Journal:  Proc Natl Acad Sci U S A       Date:  1996-05-14       Impact factor: 11.205

6.  Y chromosome diversity and paternal origin of Chinese cattle.

Authors:  Ran Li; Wen-Mei Xie; Zhen-Hua Chang; Shao-Qiang Wang; Rui-Hua Dang; Xian-Yong Lan; Hong Chen; Chu-Zhao Lei
Journal:  Mol Biol Rep       Date:  2013-09-25       Impact factor: 2.316

7.  Y chromosome haplotype analysis in Portuguese cattle breeds using SNPs and STRs.

Authors:  Catarina Ginja; Luís Telo da Gama; Maria Cecilia T Penedo
Journal:  J Hered       Date:  2008-10-01       Impact factor: 2.645

8.  Genomic analyses identify distinct patterns of selection in domesticated pigs and Tibetan wild boars.

Authors:  Mingzhou Li; Shilin Tian; Long Jin; Guangyu Zhou; Ying Li; Yuan Zhang; Tao Wang; Carol K L Yeung; Lei Chen; Jideng Ma; Jinbo Zhang; Anan Jiang; Ji Li; Chaowei Zhou; Jie Zhang; Yingkai Liu; Xiaoqing Sun; Hongwei Zhao; Zexiong Niu; Pinger Lou; Lingjin Xian; Xiaoyong Shen; Shaoqing Liu; Shunhua Zhang; Mingwang Zhang; Li Zhu; Surong Shuai; Lin Bai; Guoqing Tang; Haifeng Liu; Yanzhi Jiang; Miaomiao Mai; Jian Xiao; Xun Wang; Qi Zhou; Zhiquan Wang; Paul Stothard; Ming Xue; Xiaolian Gao; Zonggang Luo; Yiren Gu; Hongmei Zhu; Xiaoxiang Hu; Yaofeng Zhao; Graham S Plastow; Jinyong Wang; Zhi Jiang; Kui Li; Ning Li; Xuewei Li; Ruiqiang Li
Journal:  Nat Genet       Date:  2013-10-27       Impact factor: 38.330

9.  MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space.

Authors:  Fredrik Ronquist; Maxim Teslenko; Paul van der Mark; Daniel L Ayres; Aaron Darling; Sebastian Höhna; Bret Larget; Liang Liu; Marc A Suchard; John P Huelsenbeck
Journal:  Syst Biol       Date:  2012-02-22       Impact factor: 15.683

10.  Assessing the applicability of the GTR nucleotide substitution model through simulations.

Authors:  Laurent Gatto; Daniele Catanzaro; Michel C Milinkovitch
Journal:  Evol Bioinform Online       Date:  2007-02-04       Impact factor: 1.625

View more
  5 in total

1.  Y-chromosome polymorphisms of the domestic Bactrian camel in China.

Authors:  Huiling Chen; Zhanjun Ren; Jiping Zhao; Chengdong Zhang; Xuejiao Yang
Journal:  J Genet       Date:  2018-03       Impact factor: 1.166

2.  Genome-wide analysis reveals differential selection involved with copy number variation in diverse Chinese Cattle.

Authors:  Liu Yang; Lingyang Xu; Bo Zhu; Hong Niu; Wengang Zhang; Jian Miao; Xinping Shi; Ming Zhang; Yan Chen; Lupei Zhang; Xue Gao; Huijiang Gao; Li Li; George E Liu; Junya Li
Journal:  Sci Rep       Date:  2017-10-30       Impact factor: 4.379

3.  Distribution and Functionality of Copy Number Variation across European Cattle Populations.

Authors:  Maulik Upadhyay; Vinicus H da Silva; Hendrik-Jan Megens; Marleen H P W Visker; Paolo Ajmone-Marsan; Valentin A Bâlteanu; Susana Dunner; Jose F Garcia; Catarina Ginja; Juha Kantanen; Martien A M Groenen; Richard P M A Crooijmans
Journal:  Front Genet       Date:  2017-08-23       Impact factor: 4.599

4.  Maternal genetic and phylogenetic characteristics of domesticated cattle in northwestern China.

Authors:  Yuan Cai; Ting Jiao; Zhaomin Lei; Li Liu; Shengguo Zhao
Journal:  PLoS One       Date:  2018-12-27       Impact factor: 3.240

5.  Genome-wide patterns of copy number variation in the Chinese yak genome.

Authors:  Xiao Zhang; Kun Wang; Lizhong Wang; Yongzhi Yang; Zhengqiang Ni; Xiuyue Xie; Xuemin Shao; Jin Han; Dongshi Wan; Qiang Qiu
Journal:  BMC Genomics       Date:  2016-05-20       Impact factor: 3.969

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.