
Identification of Copy Number Variations and Genetic Diversity in Italian Insular Sheep Breeds.

Rosalia Di Gerlando1, Salvatore Mastrangelo1, Marco Tolone1, Ilaria Rizzuto1, Anna Maria Sutera2, Angelo Moscarelli1, Baldassare Portolano1, Maria Teresa Sardina1.   

Abstract

Copy number variants (CNVs) are one of the major contributors to genetic diversity and phenotypic variation in livestock. The aim of this work is to identify CNVs and perform, for the first time, a CNV-based population genetics analysis of five Italian sheep breeds (Barbaresca, Comisana, Pinzirita, Sarda, and Valle del Belìce). We identified 10,207 CNVs with an average length of 1.81 Mb. The breeds showed similar mean numbers of CNVs per sample, ranging from 20 (Sarda) to 27 (Comisana). A total of 365 CNV regions (CNVRs) were determined. The total length of the CNVRs varied among breeds from 2.4 Mb to 124.1 Mb. The highest number of shared CNVRs was between Comisana and Pinzirita, and only one CNVR was shared among all breeds. Our results indicated that segregating CNVs express a certain degree of diversity across all breeds. Despite the low-to-moderate genetic differentiation among breeds, the different approaches used to disclose the genetic relationships showed that the five breeds tend to cluster in distinct groups, similar to previous studies based on single-nucleotide polymorphism markers. Gene enrichment was described for the 37 CNVRs selected as the top 10%. Out of 181 total genes, 67 were uncharacterized loci. Gene Ontology analysis showed that several of the genes are involved in lipid metabolism, immune response, and the olfactory pathway. Our results corroborated previous studies and showed that CNVs represent valuable molecular resources for separating populations and could be further used to explore the function and evolutionary aspects of the sheep genome.

Keywords:  copy number variations; genetic diversity; sheep breed

Year:  2022        PMID: 35049839      PMCID: PMC8773107          DOI: 10.3390/ani12020217

Source DB:  PubMed          Journal:  Animals (Basel)        ISSN: 2076-2615            Impact factor:   2.752


1. Introduction

Copy number variants (CNVs) are DNA segments widely dispersed in mammalian genomes, including deletions, duplications, and insertions, that range from 1 kb to several Mb and vary in copy number compared with a reference genome [1]. According to the number of copies of the segment, CNVs can be classified as deletions (or losses) and duplications (or gains). CNVs involving large genomic regions can affect gene structure and gene dosage, which, in turn, has an impact on gene expression. In fact, these structural variations are one of the major contributors to genetic diversity and phenotypic variation in many species, including sheep [2,3,4,5], cattle [6,7,8,9], pigs [10], and dogs [11,12,13]. Although CNVs have been mapped in most species, their use as markers for population genetics studies has been proposed in only a few livestock species, such as cattle [8,14,15,16], sheep [17], goat [18], and turkey [19]. Examining genetic diversity in local breeds is important because it could help evaluate the evolutionary processes that lead to divergence and differences between and within them. Therefore, understanding the multiple components of functional breed diversity has important implications for breed management and genetic improvement practices, especially in breeds that are locally adapted and have not undergone intense artificial selection [6]. Because microsatellites and single-nucleotide polymorphisms (SNPs) have been used to examine population structure and genetic diversity in order to obtain information on the origin, history, and adaptation of breeds [20,21], the use of CNVs could also be relevant. Several authors found CNVRs harboring annotated genes related to expressed phenotypes caused by the specific evolutionary history of the populations [8,15,19,22,23]. Moreover, because the linkage disequilibrium (LD) patterns of CNVs are less well known, CNV-based population genomics results could offer additional insights for functional and evolutionary studies in livestock [18].
The aim of this work was to identify CNVs and perform, for the first time, a CNV-based population genetics analysis of five insular Italian sheep breeds. This study could offer new insights into the genomic architecture of local sheep and facilitate our understanding of the evolution and subsequent selection within the sheep genome.

2. Materials and Methods

Blood samples were collected by private and official veterinarians of the local health authorities in the context of sanitary programs. All the procedures were in agreement with the recommendations of the European Union Directive 2010/63/EU, to ensure appropriate animal care.

2.1. Sampling and Genotyping

A total of 667 individuals from five Italian sheep breeds, Barbaresca (BARB, n = 30), Comisana (COM, n = 72), Pinzirita (PIN, n = 77), Sarda (SAR, n = 30), and Valle del Belìce (VDB, n = 468), were sampled for this study. These breeds differ in both phenotypic traits (coat color, body size, and weight) and production traits (e.g., milk production), and show excellent adaptability to the local environments [24,25]. The COM, PIN, SAR, and VDB breeds are reared for milk production, while BARB is a dual-purpose breed. In particular, the PIN is an ancient local breed; owing to its good adaptive traits and hardiness, it is reared for milk production on farms located in marginal areas and represents an important genetic resource for present and future needs [26]. The COM is one of the most important breeds and is mostly reared in Central and Southern Italy. The breed is valued for its high milk yield; it is generally completely white, and its face is brick-red with a white frontal stripe. The SAR, a hornless breed with white fleece selected since the 1930s for milk production, represents nearly the totality of the insular population, with about 3 million sheep reared [27,28]. The VDB is the most important dairy sheep in Sicily; it is likely that this breed derives from the PIN breed, which it resembles in the horned trait of males, crossed with the COM breed, which it resembles in coat color (i.e., white with a red head) and milk production. Subsequently, the cross between these two breeds was likely crossed with the SAR breed [24,25,26,27,28,29]. The BARB is an ancient dual-purpose sheep with a long and pendulous tail, reared in a very restricted area of central Sicily under a semi-extensive farming system. The breed seems to originate from crosses between Tunisian Barbary sheep from North Africa and the PIN breed and is, at present, highly endangered [30]. In Italy, the BARB and the Laticauda are the only two fat-tail sheep breeds [31].
Genotyping was performed using the Illumina OvineSNP50K BeadChip v2 array containing 54,241 SNPs. The positions of the SNPs on the chromosomes were determined from the ovine Oar_v3.1 genome assembly. All the 667 genotyped individuals passed the quality control criteria of call rate > 98%. Unmapped SNPs and sex chromosomes were excluded from the analysis, leaving 52,413 markers for CNV mapping.
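The marker and sample filters described in this paragraph can be sketched as follows. This is a minimal illustration, not the GenomeStudio/SVS workflow used in the study: the record layout, the chromosome labels (including the "0" label for unmapped SNPs), and the function names are assumptions made for the example. Only the 98% call-rate threshold and the restriction to the 26 autosomes come from the text.

```python
# Sheep autosomes are numbered 1-26; anything else (e.g. "X", or a
# hypothetical "0" label for unmapped SNPs) is excluded from CNV mapping.
AUTOSOMES = {str(i) for i in range(1, 27)}


def filter_markers(markers):
    """Keep SNPs mapped to an autosome.

    markers: list of (snp_id, chrom) tuples (hypothetical layout).
    """
    return [(snp, chrom) for snp, chrom in markers if chrom in AUTOSOMES]


def filter_samples(call_rates, threshold=0.98):
    """Keep samples whose call rate exceeds the threshold."""
    return [sample for sample, rate in call_rates.items() if rate > threshold]


# Toy inputs, invented for illustration only.
toy_markers = [("snp1", "1"), ("snp2", "26"), ("snp3", "X"), ("snp4", "0")]
toy_call_rates = {"s1": 0.995, "s2": 0.970}

kept_markers = filter_markers(toy_markers)
kept_samples = filter_samples(toy_call_rates)
removed_in_study = 54_241 - 52_413  # markers dropped according to the text
```

With the counts given in the text, the marker filter corresponds to 1,828 SNPs being set aside before CNV mapping.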

2.2. CNV and CNVR Detection

The optimal segmenting (CNAM) module of SVS 8.7.0 (Golden Helix Inc., Bozeman, MT, USA; www.goldenhelix.com (accessed on 3 July 2021)) was used to identify CNVs using the univariate approach, which segments each sample independently [32]. We imported the Log R Ratio (LRR) values for each SNP from GenomeStudio 2.0 software (Illumina Inc., San Diego, CA, USA) into SVS 8.7.0. Quality assurance of the LRR data and filtering of outlier samples were performed using SVS software, as described by Pinto et al. [33]. Individuals were screened for their GC content, which is correlated with long-range waviness of LRR values. Samples that were outliers for waviness [34] were detected with SVS 8.7.0 and removed. The CNVRs were determined by aggregating the overlapping CNVs identified in at least two detections across all samples within each breed [35]. Overlaps were identified with the BEDTools software [36]. CNVRs were treated as individual loci, and only those identified in at least five individuals within each breed were retained, to reduce false positives within the dataset. The Venn diagram web tool (https://bioinformatics.psb.ugent.be/webtools/Venn/ (accessed on 3 July 2021)) was used to create a Venn diagram showing the overlap between CNVRs identified in different breeds.
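The aggregation of overlapping CNVs into CNVRs can be illustrated with a minimal sketch. It is not the SVS/BEDTools pipeline used in the study: it merges overlapping intervals in plain Python and applies a single support threshold, and the tuple layout, function name, and toy coordinates are assumptions made for illustration. The default of five supporting individuals follows the per-breed filter stated in the text.

```python
from collections import defaultdict


def merge_cnvs_into_cnvrs(cnvs, min_individuals=5):
    """Merge overlapping CNV calls into CNV regions (CNVRs).

    cnvs: iterable of (chrom, start, end, sample_id) tuples (hypothetical layout).
    Returns a list of (chrom, start, end, n_individuals) for regions
    supported by at least `min_individuals` distinct samples.
    """
    by_chrom = defaultdict(list)
    for chrom, start, end, sample in cnvs:
        by_chrom[chrom].append((start, end, sample))

    cnvrs = []
    for chrom in sorted(by_chrom):
        calls = sorted(by_chrom[chrom])
        cur_start, cur_end, samples = calls[0][0], calls[0][1], {calls[0][2]}
        for start, end, sample in calls[1:]:
            if start <= cur_end:
                # The call overlaps the growing region: extend it.
                cur_end = max(cur_end, end)
                samples.add(sample)
            else:
                # Gap found: close the current region and start a new one.
                if len(samples) >= min_individuals:
                    cnvrs.append((chrom, cur_start, cur_end, len(samples)))
                cur_start, cur_end, samples = start, end, {sample}
        if len(samples) >= min_individuals:
            cnvrs.append((chrom, cur_start, cur_end, len(samples)))
    return cnvrs


# Toy calls on one chromosome, invented for illustration only.
toy_calls = [
    (2, 100, 500, "s1"),
    (2, 400, 900, "s2"),
    (2, 450, 700, "s3"),
    (2, 5000, 6000, "s1"),
]
toy_cnvrs = merge_cnvs_into_cnvrs(toy_calls, min_individuals=2)
```

In the toy data, the first three calls collapse into one region carried by three samples, while the isolated call is dropped because only one sample supports it. Running the function once per breed would mirror the within-breed aggregation described above.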

2.3. Comparison of CNVRs between Breeds

Three input files were constructed and used to analyze the genetic relationships among breeds [15]. The first contained 427 animals with the presence (“1”) or absence (“0”) of each CNVR locus (n = 365). The second dataset included presence/absence data of the CNVR loci in each of the five sheep breeds, while the third dataset contained information on the CNVR locus frequencies in each breed. Different approaches were used to disclose the population structure and diversification of these five breeds. The GenAlEx 6.5 software [37] was used to calculate the pairwise population PhiPT values (analogous to Wright’s Fst index) by means of an AMOVA using 9999 permutations [15]. The PAST 3.22 software [38] was used to perform the Principal Component Analysis (PCA) of pairwise individual genetic distances and a Hierarchical Clustering Analysis (HCA) using the Euclidean distance measure and UPGMA as the clustering method. Moreover, a heatmap analysis was conducted using the top 10% of CNVRs based on their variance among the five breeds.
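The three input tables and the variance-based selection can be sketched in plain Python. This is an illustrative reconstruction under stated assumptions, not the GenAlEx/PAST workflow: the input layout, the function names, the use of population variance, and the rounding-up rule for the top 10% are all choices made here. Rounding up is at least consistent with the 37 CNVRs reported for 365 loci.

```python
import math
from statistics import pvariance


def build_datasets(carriers, breed_of, cnvr_ids):
    """Return the three tables described in the text.

    carriers: dict sample -> set of CNVR ids carried (hypothetical layout).
    breed_of: dict sample -> breed label.
    cnvr_ids: ordered list of CNVR ids (the columns).
    """
    # 1) Individual-level presence (1) / absence (0) of each CNVR.
    individual = {s: [int(c in carriers[s]) for c in cnvr_ids] for s in carriers}

    members = {}
    for sample, breed in breed_of.items():
        members.setdefault(breed, []).append(sample)

    # 3) Breed-level frequency: share of individuals carrying each CNVR.
    frequency = {
        breed: [
            sum(individual[s][j] for s in samples) / len(samples)
            for j in range(len(cnvr_ids))
        ]
        for breed, samples in members.items()
    }
    # 2) Breed-level presence/absence, derived from the frequencies.
    presence = {b: [int(f > 0) for f in freqs] for b, freqs in frequency.items()}
    return individual, presence, frequency


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def top_variance_cnvrs(frequency, cnvr_ids, fraction=0.10):
    """Rank CNVRs by the variance of their frequency across breeds."""
    breeds = list(frequency)
    ranked = sorted(
        range(len(cnvr_ids)),
        key=lambda j: pvariance([frequency[b][j] for b in breeds]),
        reverse=True,
    )
    keep = math.ceil(fraction * len(cnvr_ids))
    return [cnvr_ids[j] for j in ranked[:keep]]


# Toy data, invented for illustration only.
carriers = {"a1": {"r1", "r2"}, "a2": {"r1"}, "b1": {"r3"}, "b2": set()}
breed_of = {"a1": "A", "a2": "A", "b1": "B", "b2": "B"}
ids = ["r1", "r2", "r3"]
individual, presence, frequency = build_datasets(carriers, breed_of, ids)
distance_ab = euclidean(frequency["A"], frequency["B"])
selected = top_variance_cnvrs(frequency, ids)
```

The breed-by-breed Euclidean distances computed this way are the kind of input a UPGMA routine would take; the clustering step itself is left out of the sketch.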

2.4. Gene Content and Functional Annotation

The gene content of CNVRs was assessed based on Oar_v3.1 in the Genome Data Viewer browser from the US National Center for Biotechnology Information (NCBI) database. Gene ontology analysis was performed with PANTHER Classification System v16.0 [39] using Bonferroni correction at a significance level of 0.05. To investigate the biological function and phenotypes that are known to be affected by each identified gene, we conducted a comprehensive literature search, including information from other species.
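As a reminder of what the Bonferroni step does, the sketch below multiplies each raw p-value by the number of tests and compares the result with the 0.05 level named in the text. It does not call PANTHER, and the p-values are invented for illustration; the function name is hypothetical.

```python
def bonferroni(p_values, alpha=0.05):
    """Return (adjusted p-value, significant?) for each raw p-value."""
    n_tests = len(p_values)
    adjusted = [min(1.0, p * n_tests) for p in p_values]
    return [(p_adj, p_adj < alpha) for p_adj in adjusted]


# Hypothetical raw p-values for three GO terms.
toy_results = bonferroni([0.001, 0.02, 0.5])
toy_flags = [significant for _, significant in toy_results]
```

In this toy case only the first term stays significant: a raw p-value of 0.02 becomes 0.06 once three tests are accounted for.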

3. Results

3.1. CNVs and CNVRs Detection

The quality control performed with SVS 8.7.0 allowed for the identification of 240 outlier individuals. Therefore, the final dataset used for analyses comprised a total of 427 individuals (BARB (n = 19), COM (n = 43), PIN (n = 47), SAR (n = 24), and VDB (n = 294)). The total number of CNVs (Table S1) called across the 26 autosomal chromosomes was 10,207 and varied in terms of number and size among the breeds (Table 1 and Figure 1).
Table 1

Summary of CNVs identified in each breed.

Breed | N. Samples | N. CNVs | CNVs per Sample, Min–Max (Average) | Loss | Gain | Min Length (bp) | Max Length (bp) | Mean Length (bp)
BARB | 19 | 431 | 15–33 (23) | 328 | 103 | 19,028 | 2,499,938 | 222,990
COM | 43 | 1172 | 18–39 (27) | 957 | 215 | 19,041 | 3,660,245 | 344,790
PIN | 47 | 1216 | 11–43 (26) | 963 | 253 | 23,587 | 4,399,691 | 399,121
SAR | 24 | 481 | 12–28 (20) | 335 | 146 | 23,587 | 3,692,295 | 272,509
VDB | 294 | 6907 | 11–52 (24) | 5469 | 1438 | 13,128 | 14,995,713 | 569,560
Total | 427 | 10,207 | 11–52 (24) | 8052 | 2155 | 13,128 | 14,995,713 | 1,808,970

Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).
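The totals in Table 1 can be re-derived from the per-breed rows with a few lines of arithmetic. The sketch below only re-adds the printed values and is not a re-analysis of the raw calls. Under that assumption, the count columns add up to the printed totals, and the printed total mean length (1,808,970 bp) coincides with the plain sum of the five per-breed means, whereas weighting each breed mean by its number of CNVs gives roughly 0.49 Mb.

```python
# Values transcribed from Table 1:
# breed -> (samples, CNVs, losses, gains, mean CNV length in bp)
table1 = {
    "BARB": (19, 431, 328, 103, 222_990),
    "COM": (43, 1172, 957, 215, 344_790),
    "PIN": (47, 1216, 963, 253, 399_121),
    "SAR": (24, 481, 335, 146, 272_509),
    "VDB": (294, 6907, 5469, 1438, 569_560),
}

# Column-wise sums over the five breeds.
n_samples, n_cnvs, n_loss, n_gain, sum_of_means = (
    sum(column) for column in zip(*table1.values())
)

# Mean length over all calls, obtained by weighting each breed mean
# by that breed's number of CNVs.
weighted_mean_bp = sum(n * mean for _, n, _, _, mean in table1.values()) / n_cnvs

# Share of all CNVs that are losses.
loss_share = n_loss / n_cnvs
```

The same sums also show that losses make up close to 79% of all calls.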

Figure 1

CNV count for 26 autosomes across five breeds. Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).

The breeds showed similar mean numbers of CNVs per sample, ranging from 20 (SAR) to 27 (COM). BARB showed the lowest mean CNV length, while VDB had the highest. Among the identified CNVs, 8052 were deletions (losses) and 2155 were duplications (gains). CNVs ranged from 13,128 bp to 14.99 Mb in size, with an average length of 1.81 Mb. The highest number of CNVs (n = 715) was found on chromosome 2 in the VDB, while no CNVs were identified on chromosomes 13, 22, 24, and 26 in the BARB and SAR breeds. Descriptive statistics of the CNVRs identified in the five sheep breeds are reported in Table 2. A total of 1240 CNVRs were obtained across all breeds, with 960 losses and 280 gains.
Table 2

Summary of CNVRs identified in each breed.

Breed | N. CNVRs | Loss | Gain | Min Length (bp) | Max Length (bp) | Mean Length (bp)
BARB | 83 | 61 | 22 | 42,405 | 2,013,519 | 178,021
COM | 195 | 159 | 36 | 19,322 | 3,295,789 | 324,599
PIN | 186 | 147 | 39 | 23,587 | 2,962,879 | 347,976
SAR | 89 | 61 | 28 | 43,456 | 1,954,981 | 196,644
VDB | 687 | 532 | 155 | 14,264 | 11,305,268 | 434,543
Total | 1240 | 960 | 280 | 14,264 | 11,305,268 | 1,481,783

Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).

A total of 365 CNVRs (Table S2) were determined by aggregating the overlapping CNVs identified across all samples and present in at least five individuals of the same breed. In particular, 16, 67, 60, 23, and 349 CNVRs were identified in the BARB, COM, PIN, SAR, and VDB breeds, respectively. The total length of the CNVRs varied among breeds from 2.4 Mb (in BARB) to 124.1 Mb (in VDB). A comparison of the CNVRs among breeds is shown in the Venn diagram (Figure 2). The highest number of shared CNVRs was between COM and PIN (n = 23), followed by those shared between COM and VDB (n = 15). Only one CNVR was shared among all breeds.
Figure 2

Venn diagram representing common and unique CNVRs found among the five breeds. Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).
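The sharing counts behind a Venn diagram reduce to set operations on the per-breed CNVR lists. The sketch below uses invented CNVR identifiers and hypothetical function names, so it shows the mechanics only. Note that the per-breed counts in the text (16 + 67 + 60 + 23 + 349 = 515) exceed the 365 distinct CNVRs because a shared CNVR is counted once in each breed that carries it.

```python
from itertools import combinations


def pairwise_shared(cnvrs_by_breed):
    """Count CNVRs present in both breeds of every pair."""
    return {
        (a, b): len(cnvrs_by_breed[a] & cnvrs_by_breed[b])
        for a, b in combinations(sorted(cnvrs_by_breed), 2)
    }


def shared_by_all(cnvrs_by_breed):
    """Return the CNVRs present in every breed."""
    return set.intersection(*cnvrs_by_breed.values())


# Toy CNVR sets, invented for illustration only.
toy_sets = {
    "COM": {"CNVR_a", "CNVR_b", "CNVR_c"},
    "PIN": {"CNVR_a", "CNVR_b"},
    "SAR": {"CNVR_a", "CNVR_z"},
}
shared_pairs = pairwise_shared(toy_sets)
core = shared_by_all(toy_sets)
per_breed_sum = 16 + 67 + 60 + 23 + 349  # counts reported in the text
```

These are plain pairwise intersections; a Venn diagram region usually reports CNVRs found in exactly that combination of breeds, so the two kinds of counts can differ.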

3.2. CNVR Genetic Diversity Analyses

In the PCA (Figure 3), the first two components (PC1 and PC2) explained 8.4% and 6.6% of the total variance, respectively. The BARB individuals formed the most compact cluster, and the SAR breed was positioned in the same area. The VDB individuals formed a more dispersed cluster. Finally, COM and PIN were positioned slightly apart from the other breeds (bottom right area of Figure 3). A cluster dendrogram per breed is depicted in Figure 4. The results show two distinct branches separating the three Sicilian dairy sheep breeds (COM, PIN, and VDB) from the SAR and the dual-purpose BARB. Finally, we performed a cluster heatmap analysis using the top 10% of CNVRs (Table S3), ranked by their variance among the five breeds (Figure 5). This final result arranged the breeds in agreement with the analyses reported above.
Figure 3

Principal components (PC) analysis for the genetic differentiations among sheep breeds using PC1 and PC2. Barbaresca (BARB ∙), Comisana (COM ∙), Pinzirita (PIN ▫), Sarda (SAR ◊), and Valle del Belìce (VDB ▪).

Figure 4

Dendrogram cluster analysis based on frequency of CNVRs in the five sheep breeds. Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).

Figure 5

Heatmap analysis based on hierarchical cluster using top 10% of CNVRs in the five sheep breeds. Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).

To evaluate the contribution of CNVRs to population differentiation, we estimated the pairwise PhiPT genetic distances among the five breeds (Table 3). The Analysis of Molecular Variance (AMOVA) based on PhiPT values indicated that most of the genetic diversity occurred within populations (88%), while the variability among populations contributed 12% (Table S4). The value of PhiPT varies between 0 (no population differentiation) and 1 (full differentiation). The PhiPT distances between breeds were statistically significant, with p-value < 0.0001 based on 9999 permutations. The highest value was estimated between BARB and PIN, and the lowest between PIN and COM.
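For reference, PhiPT in an AMOVA is the share of the total estimated variance that lies among populations. The symbols $V_{AP}$ (variance among populations) and $V_{WP}$ (variance within populations) are notation chosen here, not taken from the paper. Plugging in the variance percentages reported in this paragraph gives the overall value those percentages imply:

```latex
\Phi_{PT} = \frac{V_{AP}}{V_{AP} + V_{WP}}, \qquad
\Phi_{PT} \approx \frac{0.12}{0.12 + 0.88} = 0.12
```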
Table 3

Pairwise PhiPT genetic distances among the five breeds.

Breed | BARB | COM | PIN | SAR | VDB
BARB | 0.000
COM | 0.255 | 0.000
PIN | 0.264 | 0.077 | 0.000
SAR | 0.220 | 0.196 | 0.203 | 0.000
VDB | 0.097 | 0.126 | 0.114 | 0.084 | 0.000

Barbaresca (BARB), Comisana (COM), Pinzirita (PIN), Sarda (SAR), and Valle del Belìce (VDB).
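The most and least differentiated pairs can be read off Table 3 programmatically. The sketch below simply transcribes the lower triangle of the table; the variable names are chosen here for illustration.

```python
breeds = ["BARB", "COM", "PIN", "SAR", "VDB"]

# Lower triangle of Table 3 (pairwise PhiPT), one row per breed.
lower = [
    [0.000],
    [0.255, 0.000],
    [0.264, 0.077, 0.000],
    [0.220, 0.196, 0.203, 0.000],
    [0.097, 0.126, 0.114, 0.084, 0.000],
]

# Flatten the off-diagonal cells into {(breed_1, breed_2): PhiPT}.
pairs = {
    (breeds[j], breeds[i]): lower[i][j]
    for i in range(len(breeds))
    for j in range(i)
}

most_differentiated = max(pairs, key=pairs.get)
least_differentiated = min(pairs, key=pairs.get)
```

The two extremes match the pairs named in the text: BARB and PIN at 0.264, and COM and PIN at 0.077.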

3.3. Gene Enrichment and Functional Annotations of CNVRs

Gene enrichment was described for the 37 CNVRs selected as the top 10%, 29 of which encompassed genes. CNVR_309, located on chromosome 19, contained the greatest number of genes (n = 49) (Table S3). Out of 181 total genes, 67 were uncharacterized loci. Only CNVR_39, located on chromosome 2 and containing the LRP1B (Low-Density Lipoprotein Receptor-Related Protein 1B) gene, was common to all breeds. Based on PANTHER analysis, the enriched GO terms included biological processes (biological regulation, cellular and metabolic processes, response to stimulus, and immune system process), molecular functions (binding, catalytic activity, molecular function regulator, and transporter activity), and cellular component terms (cellular anatomical entity, intracellular, and protein-containing complex) (Table 4).
Table 4

The gene ontology (GO) in the CNVRs identified in the five sheep breeds.

Accession Number | Biological Process | Gene Symbol
GO:0065007 | Biological regulation | DEAF1, HRAS, HNRNPF, PIDD1, TFAP2D, EPS8L2, EEFSEC, PAIP1, PNPLA2, KLF15, EPHB3, TXNRD3, ZXDC, FXYD4, CNBP, PLXNA1, GATA2, KHDRBS2, RUVBL1, CREB1, PDGFA, STIM1, IRF7, SYT4, SENP6, STIM1, TBL1XR1
GO:0009987 | Cellular process | DEAF1, PSMD13, HRAS, EFCC1, HNRNPF, SUN1, PIDD1, TFAP2D, EPS8L2, MCM2, EEFSEC, MYO6, CHCHD6, METTL21A, NTNG2, PAIP1, PNPLA2, TMEM80, NNT, KLF15, EPHB3, FAM20C, SEC61A1, PKP3, GET4, SLC25A22, RPN1, RRM1, PKP3, TXNRD, ZXDC, RAB7A, ISY1, COPG1, FXYD4, RTKN2, CNBP, PLXNA1, GATA2, KHDRBS2, MYO6, RUVBL1, ANO9, CREB1, PDGFA, CHCHD6, STIM1, INTS1, IRF7, SYT4, SENP6, STIM1, SHROOM3, TBL1XR1, EFCC1, IQSEC1
GO:0008152 | Metabolic process | DEAF1, PSMD13, EFCC1, HNRNPF, PIDD1, TFAP2D, MCM2, EEFSEC, METTL21A, PAIP1, PNPLA2, NNT, KLF15, EPHB3, FAM20C, SEC61A1, SLC25A22, RPN1, RRM1, ZXDC, METTL21A, ISY1, CNBP, GATA2, KHDRBS2, RUVBL1, CREB1, PDGFA, INTS1, IRF7
GO:0050896 | Response to stimulus | HRAS, PIDD1, EPS8L2, NLRP6, MCM2, CHCHD6, NTNG2, EPHB3, PLXNA1, PDGFA, CHCHD6, SYT4
GO:0002376 | Immune system process | ANO9, SIGGIR, PKP3, STIM1, PLXNA1, IFITM3, IFITN5, NNT, CREB1, IRF7

Accession Number | Molecular Function | Gene Symbol
GO:0005488 | Binding | DEAF1, HRAS, EFCC1, HNRNPF, SUN1, TFAP2D, EPS8L2, MCM2, EEFSEC, MYO6, PAIP1, NNT, ACAD9, KLF15, SEC61A1, PKP3, RRM1, PKP3, RTKN2, CNBP, GATA2, KHDRBS2, MYO6, CREB1, PDGFA, STIM1, IRF7, POLR2L, SYT4, STIM1, SHROOM3, EFCC1
GO:0003824 | Catalytic activity | HRAS, PIDD1, MGLL, MCM2, MYO6, METTL21A, PNPLA2, B4GALNT4, NNT, ACAD9, EPHB3, FAM20C, ALDH1L1, RRM1, TXNRD3, MYO6, RUVBL1, POLR2L, SENP6
GO:0098772 | Molecular function regulator | DEAF1, EFCC1, TFAP2D, EPS8L2, KLF15, ZXDC, FXYD4, GATA2, CREB1, STIM1, IRF7, TBL1XR1
GO:0005215 | Transporter activity | SEC61A1, SLC25A22, FXYD4, ANO9, STIM1

Accession Number | Cellular Component | Gene Symbol
GO:0110165 | Cellular anatomical entity | DEAF1, PSMD13, ABTB1, HRAS, EFCC1, HNRNPF, SUN1, PIDD1, MGLL, LRP1B, STBD1, TFAP2D, EPS8L2, MCM2, MYO6, CHCHD6, METTL21A, NTNG2, PNPLA2, TMEM80, NNT, ACAD9, KLF15, EPHB3, FAM20C, SEC61A1, PKP3, GET4, RPN1, RRM1, TXNRD3, NUP210, ZXDC, CD151, RAB7A, RAB43, ISY1, COPG1, RTKN2, CNBP, IFITM5, PLXNA1, GATA2, KHDRBS2, RUVBL1, ANO9, CREB1, PDGFA, SLC41A3, INTS1, IRF7, SYT4, SENP6, STIM1, SHROOM3, TBL1XR1, ADAP1
GO:0005622 | Intracellular | DEAF1, PSMD13, ABTB1, EFCC1, HNRNPF, SUN1, PIDD1, TFAP2D, MCM2, MYO6, CHCHD6, METTL21A, PNPLA2, NNT, ACAD9, KLF15, FAM20C, SEC61A1, PKP3, GET4, RPN1, RRM1, TXNRD3, NUP210, ZXDC, RAB7A, RAB43, ISY1, COPG1, RTKN2, CNBP, GATA2, KHDRBS2, RUVBL1, CREB1, STIM1, INTS1, IRF7, SYT4, SENP6, SHROOM3, TBL1XR1, ADAP1
GO:0032991 | Protein-containing complex | PSMD13, ABTB1, HNRNPF, SUN1, MCM2, TMEM80, EPHB3, SEC61A1, GET4, RPN1, RRM1, NUP210, ISY1, COPG1, PLXNA1, RUVBL1, CREB1, INTS1, TBL1XR1

4. Discussion

In general, several studies of genetic diversity have been conducted in Italian sheep breeds [40,41], particularly in insular breeds, using molecular markers such as microsatellites [24] and SNPs [30], while knowledge of their characterization and genetic variation based on CNVs is limited. In this work, we studied the genomic variability of five Italian sheep breeds based on CNV and CNVR information. The numbers of CNVs and CNVRs identified in this study are not directly comparable with those of previously reported studies because of differences in the algorithms, technologies, filtering criteria, and numbers of samples and breeds tested [42]. Although SNP arrays are now a standard method, they vary in density and number of markers, ranging from 50K [2,3,4] to 600K [43,44,45]; for example, Ma et al. [44], analyzing 48 Chinese sheep with the 600K SNP array, identified a higher number of CNVRs (1296) with a smaller size (about 96 kb) compared with ours. Furthermore, Ma et al. [3] reported 111 CNVRs from 160 Chinese sheep with an average size of 123.78 kb using the 50K SNP array. Loss CNVRs were more prevalent than gain CNVRs, in agreement with previous studies [3,4,46]. The number of detected CNVRs is probably underestimated because of the SNP density of the 50K array. This drawback could probably be avoided by using the Ovine HD SNP array, which could provide higher resolution and sensitivity for CNV detection and population analysis than the low-density SNP array. Thus, array density is an important factor that affects CNV discovery and, therefore, the use of CNVs in population genetic analyses [17]. Different approaches were used to disclose population structure and differentiation among the five breeds; in general, the CNVR characterization and genetic diversity analyses demonstrate that the five breeds tend to cluster in distinct groups.
The genetic distances obtained from CNVRs using PhiPT values (Table 3) indicated the greatest differentiation between BARB and PIN and the lowest between COM and PIN. These last two breeds also shared the highest number of CNVRs (n = 23) (Figure 2). Similar results for COM and PIN were found in previous studies using microsatellite markers [24], SNP arrays [30], and whole-genome resequencing data [47]. The PCA generated from the identified CNVRs showed less variation among the five breeds than those based on thousands of SNPs [25,30]. For example, CNVRs could not distinguish the BARB from the other Sicilian breeds and showed a shared area between COM and PIN. Similar results were also reported in cattle [48] and horse [49]. Bickhart et al. [50] performed an MDS analysis based on CNV genotypes and compared it with the plot based on SNPs in a study on taurine and zebuine cattle, showing that the separation and clustering of the taurine cattle using CNVs were not superior to those based on SNPs; the authors suggested that CNV genotyping still has room for improvement. In fact, compared with SNPs, CNVs suffer from small sample sizes and are difficult to genotype, which makes it hard to use them for fine clustering, especially within a group [48]. Long-term adaptation to different environmental conditions or different selection schemes increases the presence of specific copies of genes and, therefore, variation in CNVs among breeds [19,51]. Our results showed a low level of differentiation among the five breeds, due to breeding practices, similar environmental conditions, gene flow, and shared ancestral components. All these factors led to an increase in shared CNVRs among breeds. The dendrogram per breed (Figure 4) showed two main groups, one with VDB, COM, and PIN (with COM and PIN closer) and the other formed by BARB and SAR. This result reflects the higher number of CNVRs shared among VDB, COM, and PIN, which were lacking in BARB and SAR.
Therefore, the obtained results suggest that CNVs/CNVRs represent a valuable molecular resource for separating populations and could be further used to explore the function and evolutionary aspects of the sheep genome. Out of the 37 genomic regions (Table S3), 29 CNVRs encompassed 181 genes, some of which were related to lipid metabolism, immune response, olfactory receptors, and other biological processes. The protein encoded by the LRP1B gene (within CNVR_59, common to all breeds) participates in a wide range of physiological processes, including the regulation of lipid metabolism, neurodevelopment, and the transport of nutrients and vitamins [52,53], as well as in cell proliferation, making it a potential candidate gene for the supernumerary nipple phenotype [54]. The KHDRBS2 gene, within CNVR_310, the region with the highest variance, has been associated with fertility and reproductive traits in cattle [55], goat [56], and sheep [57]. Moreover, the FILIP1 gene (CNVR_185), reported by Salehian-Dehkordi et al. [57], has been linked with fertility traits. The ACAD9 gene encodes a recently identified acyl-CoA dehydrogenase that demonstrates maximum activity with unsaturated long-chain acyl-CoAs [58]. We found several genes involved in lipid metabolism (AGMO, PNPLA2, SIRT3, KLF15, MGLL, ACAD9, and COPG1) [59,60,61,62,63] and immune response (ANO9, SIGGIR, PKP3, STIM1, PLXNA1, IFITM3, IFITN5, NNT, CREB1, and IRF7) [64,65,66,67,68,69]. The genes LOC101123149 and LOC101123408 are olfactory receptors. Olfactory receptors are interesting candidates for the physiological requirements of domestic animals [70] and for feed efficiency, as their expression has been detected in the gut and may be related to feed intake. Olfactory receptors in the gut may serve as sensors of chemical or nutritional status and may have a role in nutrient absorption or digestive function [71].
Finally, two candidate genes mapped within CNVRs, TBL1XR1 and NNT, were identified only in the Barbaresca breed. TBL1XR1 has been reported as a candidate gene for backfat thickness in cattle [72], and NNT as a candidate gene related to the heat stress response [73]. Among the five breeds involved in this study, the Barbaresca is the only sheep with a fat tail [30,31]. The fat tail is considered an adaptive response of animals to a hazardous environment and a means of facing future climate changes. Fat depots act as an energy reserve that allows sheep to survive extreme environments and conditions such as prolonged droughts, cold, and food scarcity [74,75,76,77]. Therefore, these genes are consistent with the phenotypic characteristics of the breed. All the aforementioned genes lie within loss CNVRs, except FILIP1, which is found in a gain CNVR.

5. Conclusions

In this work, we detected the CNVs and CNVRs in five sheep breeds and performed a CNV-based population genetics analysis. The present study provides a broader CNV map and is the first population genetic analysis in Italian sheep breeds conducted using CNVs. The number of CNVRs identified is not directly comparable with those previously reported because of technical differences in the methods used. Our results indicated that segregating CNVs express a certain degree of diversity across all breeds. CNV genetic markers may not be compatible with current population analyses, because they violate the classical population genetics assumptions based on the infinite allele model and the infinite site model for SNPs. However, despite the low genetic differentiation among the five sheep breeds involved in this study, the clustering methods based on CNVRs arranged the animals in groups according to the breed they belong to. Therefore, our results corroborated previous studies and showed that CNVs represent a valuable molecular resource for separating populations and could be further used to explore the function and evolutionary aspects of the sheep genome.
References (10 of 65 listed)

Review 1.  Structural variation in the human genome.

Authors:  Lars Feuk; Andrew R Carson; Stephen W Scherer
Journal:  Nat Rev Genet       Date:  2006-02       Impact factor: 53.242

2.  A genome-wide association study reveals candidate genes for the supernumerary nipple phenotype in sheep (Ovis aries).

Authors:  W-F Peng; S-S Xu; X Ren; F-H Lv; X-L Xie; Y-X Zhao; M Zhang; Z-Q Shen; Y-L Ren; L Gao; M Shen; J Kantanen; M-H Li
Journal:  Anim Genet       Date:  2017-07-12       Impact factor: 3.169

3.  Genetic variants in SIRT3 transcriptional regulatory region affect promoter activity and fat deposition in three cattle breeds.

Authors:  Linsheng Gui; Jieyun Hong; Sayed Haidar Abbas Raza; Linsen Zan
Journal:  Mol Cell Probes       Date:  2016-12-12       Impact factor: 2.365

4.  Genome-wide association study for milk somatic cell score in holstein cattle using copy number variation as markers.

Authors:  M Durán Aguilar; S I Román Ponce; F J Ruiz López; E González Padilla; C G Vásquez Peláez; A Bagnato; M G Strillacci
Journal:  J Anim Breed Genet       Date:  2016-08-30       Impact factor: 2.380

5.  Diet and the evolution of human amylase gene copy number variation.

Authors:  George H Perry; Nathaniel J Dominy; Katrina G Claw; Arthur S Lee; Heike Fiegler; Richard Redon; John Werner; Fernando A Villanea; Joanna L Mountain; Rajeev Misra; Nigel P Carter; Charles Lee; Anne C Stone
Journal:  Nat Genet       Date:  2007-09-09       Impact factor: 38.330

6.  Diversity of copy number variation in the worldwide goat population.

Authors:  Mei Liu; Yang Zhou; Benjamin D Rosen; Curtis P Van Tassell; Alessandra Stella; Gwenola Tosser-Klopp; Rachel Rupp; Isabelle Palhière; Licia Colli; Brian Sayre; Paola Crepaldi; Lingzhao Fang; Gábor Mészáros; Hong Chen; George E Liu
Journal:  Heredity (Edinb)       Date:  2018-11-06       Impact factor: 3.821

7.  GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research--an update.

Authors:  Rod Peakall; Peter E Smouse
Journal:  Bioinformatics       Date:  2012-07-20       Impact factor: 6.937

8.  Analysis of cattle olfactory subgenome: the first detail study on the characteristics of the complete olfactory receptor repertoire of a ruminant.

Authors:  Kyooyeol Lee; Dinh Truong Nguyen; Minkyeung Choi; Se-Yeoun Cha; Jin-Hoi Kim; Hailu Dadi; Han Geuk Seo; Kunho Seo; Taehoon Chun; Chankyu Park
Journal:  BMC Genomics       Date:  2013-09-02       Impact factor: 3.969

9.  Genome wide linkage disequilibrium and genetic structure in Sicilian dairy sheep breeds.

Authors:  Salvatore Mastrangelo; Rosalia Di Gerlando; Marco Tolone; Lina Tortorici; Maria Teresa Sardina; Baldassare Portolano
Journal:  BMC Genet       Date:  2014-10-10       Impact factor: 2.797

10.  Genome-Wide Association Study in Mexican Holstein Cattle Reveals Novel Quantitative Trait Loci Regions and Confirms Mapped Loci for Resistance to Bovine Tuberculosis.

Authors:  Sara González-Ruiz; Maria G Strillacci; Marina Durán-Aguilar; Germinal J Cantó-Alarcón; Sara E Herrera-Rodríguez; Alessandro Bagnato; Luis F Guzmán; Feliciano Milián-Suazo; Sergio I Román-Ponce
Journal:  Animals (Basel)       Date:  2019-08-30       Impact factor: 2.752
