Literature DB >> 34841617

Signatures of selection are present in the genome of two close autochthonous cattle breeds raised in the North of Italy and mainly distinguished for their coat colours.

Francesca Bertolini1, Giulia Moscatelli2, Giuseppina Schiavo2, Samuele Bovo2, Anisa Ribani2, Mohamad Ballan2, Massimo Bonacini3, Marco Prandi3, Stefania Dall'Olio2, Luca Fontanesi2.   

Abstract

Autochthonous cattle breeds are genetic resources that, in many cases, have been fixed for inheritable exterior phenotypes useful to understand the genetic mechanisms affecting these breed-specific traits. Reggiana and Modenese are two closely related autochthonous cattle breeds mainly raised in the production area of the well-known Protected Designation of Origin Parmigiano-Reggiano cheese, in the North of Italy. These breeds can be mainly distinguished for their standard coat colour: solid red in Reggiana and solid white with pale shades of grey in Modenese. In this study we genotyped with the GeneSeek GGP Bovine 150k single nucleotide polymorphism (SNP) chip almost half of the extant cattle populations of Reggiana (n = 1109 and Modenese (n = 326) and used genome-wide information in comparative FST analyses to detect signatures of selection that diverge between these two autochthonous breeds. The two breeds could be clearly distinguished using multidimensional scaling plots and admixture analysis. Considering the top 0.0005% FST values, a total of 64 markers were detected in the single-marker analysis. The top FST value was detected for the melanocortin 1 receptor (MC1R) gene mutation, which determines the red coat colour of the Reggiana breed. Another coat colour gene, agouti signalling protein (ASIP), emerged amongst this list of top SNPs. These results were also confirmed with the window-based analyses, which included 0.5-Mb or 1-Mb genome regions. As variability affecting ASIP has been associated with white coat colour in sheep and goats, these results highlighted this gene as a strong candidate affecting coat colour in Modenese breed. This study demonstrates how population genomic approaches designed to take advantage from the diversity between local genetic resources could provide interesting hints to explain exterior traits not yet completely investigated in cattle.
© 2021 The Authors. Journal of Animal Breeding and Genetics published by John Wiley & Sons Ltd.

Entities:  

Keywords:  zzm321990Bos tauruszzm321990; coat colour; genetic resource; genome; population genomics

Mesh:

Year:  2021        PMID: 34841617      PMCID: PMC9300179          DOI: 10.1111/jbg.12659

Source DB:  PubMed          Journal:  J Anim Breed Genet        ISSN: 0931-2668            Impact factor:   3.271


INTRODUCTION

Autochthonous cattle breeds constitute important genetic resources, in many cases unexploited or poorly characterized. Many local breeds have been developed by the combined action of several factors and events mainly driven by the recent interplay between economic, social and environmental conditions that contributed to define their genetic history (e.g. Felius et al., 2014). Directional selection, that finally shaped the current genetic pools, fixed or almost fixed inheritable exterior phenotypes that could be considered breed‐specific traits useful to distinguish different breeds. One of the main exterior phenotypes that characterize different breeds is coat colour. Two autochthonous cattle breeds, Reggiana and Modenese, are raised mainly in the production area of the well‐known Protected Designation of Origin (PDO) Parmigiano‐Reggiano cheese, in the North of Italy. Reggiana and Modenese are historically considered the ancestral cattle populations from that this cheese has been originated. Their names derive from the two geographically close provinces of Reggio Emilia and Modena, located in the Emilia‐Romagna region where they have been mainly originated and where most of the farms raising Reggiana and Modenese cattle are now localized. The Herd Book of these two breeds were officially recognized in 1962 (Reggiana) and in 1957 (Modenese). In recent history, between the 1980’ and in first years of the 2000, both breeds have experienced a progressing decline of the population size that reached a minimum of approximately 500 Reggiana heads and 260 Modenese heads. Since then, the population size has been increasing. The recovery of these two breeds can be mainly attributed to the development of two mono‐breed branded Parmigiano‐Reggiano cheeses that can be produced from milk of only Reggiana cows or from milk of only Modenese cows. The market prize of these niche products is higher than the undifferentiated Parmigiano‐Reggiano cheese obtained by cosmopolitan breeds, hereby compensating the reduced milk yield that characterize these two breeds (Fontanesi, 2009; Gandini et al., 2007; Petrera et al., 2016; Russo et al., 2007). In 2020, Reggiana and Modenese accounted a total of approximately 2800 cows (raised in approximately 100 farms) and 500 cows (raised in approximately 60 farms), respectively. Reggiana and Modenese can be distinguished according to their standard coat colour and muzzle colour (Figure 1): a classical red coat colour, indicated with the term “fromentino,” over the whole body, with pink or pale muzzle colour are the main pigmentation features of the Reggiana cattle; white coat colour of the body with some pale grey shades and a black muzzle with a depigmented inverted “V” are the main characteristics of Modenese cattle, also known as Bianca Val Padana (Bianca = White) that is the second name of this breed, which derives from its coat colour.
FIGURE 1

Pictures of Reggiana (a) and Modenese (b) sires [Colour figure can be viewed at wileyonlinelibrary.com]

Pictures of Reggiana (a) and Modenese (b) sires [Colour figure can be viewed at wileyonlinelibrary.com] A few studies that investigated DNA markers in candidate genes and at the genome‐wide level were carried out in both breeds, using relatively small sample sizes. Here, candidate gene markers were used in association studies with production traits in Reggiana sires (Fontanesi et al., 2015) and to compare the frequency of relevant alleles affecting milk production traits amongst breeds, which included Reggiana and Modenese cattle (Fontanesi et al., 2007; Scotti et al., 2010). Polymorphisms in three coat colour‐affecting genes have been analysed in these breeds to identify markers useful to authenticate the breed origin of the mono‐breed Parmigiano‐Reggiano cheeses (Russo et al., 2007; Fontanesi et al., 2010; Fontanesi et al., 2012, 2010). Here, the melanocortin 1 receptor (MC1R) allele caused by a frameshift mutation (allele e) has been indicated to determine the red coat colour in Reggiana cattle (Klungland et al., 1995; Russo et al., 2007 ). A genome‐wide association study for exterior traits has been recently carried out in Reggiana (Bovo et al., 2021) and genome‐wide analyses of signature of selection and population genomic parameters have been carried out in Reggiana against a few other cosmopolitan and local cattle breeds (Bertolini et al., 2015,2018,2020; Mastrangelo, Ciani, et al., 2018; Mastrangelo et al., 2016). Modenese breed was analysed using single nucleotide polymorphism (SNP) information in a comparative analysis with Holstein and Maremmana breeds (Catillo et al., 2018). Genetic relationships amongst Italian cattle breeds that included SNP chip data from few Reggiana and Modenese cattle indicated that the two breeds are closely related compared to several other breeds (Mastrangelo, Sardina, et al., 2018). Considering the limited information that is available in terms of genomic differences between these two geographically close breeds, in this study we genotyped almost half of the extant cattle populations of Reggiana and Modenese breeds. We then used genome‐wide information in comparative analyses to detect signatures of selection that might be derived by the divergent directional selection and genetic drifts that have contributed to shape the genome structures of these two autochthonous cattle breeds.

MATERIALS AND METHODS

Ethic statement

Animal samples used in this study were collected following the recommendation of directive 2010/632.1.

Animals and genotyping data

A total of 1435 cattle included in this study (Reggiana, n = 1109; Modenese, n = 326) were genotyped with the GeneSeek GGP Bovine 150k SNP chip following the manufacturer's protocol. PLINK software v. 1.9 (Chang et al., 2015) was used to filter genotyping data. Only markers with minor allele frequency (MAF) <0.01 across the two breeds, with a call rate >90% in each breed, mapped in unique positions in the autosomes of the ARS‐UCD1.2 cattle genome version were retained. The Herd Book of the Reggiana breed (ANABORARE, 2020) considers that the Reggiana cattle should have the homozygous recessive genotype e/e at the MC1R gene, which causes the red coat colour of the breed (Bovo et al., 2021; Klungland et al., 1995; Russo et al., 2007). For this reason, further filtering was applied, retaining for this study only Reggiana cattle with the e/e genotype that could be retrieved from the GeneSeek GGP Bovine 150 k SNP chip. After filtering, the data set accounted for 1109 Reggiana samples (98 samples were excluded because they were heterozygous E/e at the MC1R gene and 2 samples did not pass the quality criteria for genotyping), 326 Modenese samples and a total of 128,574 markers.

Population genomic analyses

Observed and expected heterozygosity (HO and HE, respectively) were calculated with PLINK v. 1.9 (Chang et al., 2015). Inbreeding coefficient of an individual (I) relative to the subpopulation (S) (FIS), fixation index (FST) and inbreeding coefficient of an individual (I) relative to the total (T) population (FIT) were calculated with VCFtools software (Danecek et al., 2011). Linkage disequilibrium (LD) was measured using r for all SNP pairs of each chromosome using PLINK v. 1.9 (Chang et al., 2015) and within‐breed LD decay was estimated using bins of 10 kb. Plots were generated in R v.3.5.1. (R Core Team, 2018) with the ggplot2 package (Wickam, 2016). Recent and historical effective population size (Ne) were estimated using the SNeP software (Barbato et al., 2015), using the maximum distance between SNP to be analysed of 10 Mb and the binwidth of 100 kb for the calculation of LD. To perform population structure analyses, pruning of SNPs in high LD was carried out of PLINK 1.9 (Chang et al., 2015) with the‐‐indep‐pairwise command (options: window size of 50 kb, step size of 10 and r threshold of 0.1). A total of 14,131 SNPs was retained (average of 487 ± 166 SNPs for each chromosome). Multidimensional scaling (MDS) analysis was performed using a matrix of genome‐wide identity‐by‐state (IBS) pairwise distances as implemented in PLINK 1.9 (Chang et al., 2015). Population stratification was evaluated with the ADMIXTURE v3.1 software (Alexander et al., 2009). Analyses were preformed considering the number of subpopulations (K) that ranged from 1 to 39 and retaining the cross‐validation error (CV) for each K.

FST analyses, gene annotation and haploblock analysis

The fixation index FST is a measure of population genetic differentiation. FST provides information on the genomic variation at a locus between populations relative to that within populations (Wright, 1951). FST is based on the measure of differences in allele frequencies between populations, which capture loci that are differentially fixed as results of differential selection. High FST values indicate local positive selection, which is a characteristic of genomic regions that have gone through differential selection, whereas low FST values suggest negative or neutral selection. Therefore, FST is particularly suitable to detect signatures of selection between breeds as it is less affected by genetic drift than other methods proposed to detect signatures of selection across livestock breeds (e.g. Zhao et al., 2015). Wright's FST for each SNP in the pairwise comparison between Reggiana and Modenese populations was calculated with PLINK 1.9 (Chang et al., 2015) using the method of Weir and Cockerham (1984). Overall averaged FST was calculated considering all SNPs in the pairwise comparisons. Signatures of selection were determined using pairwise FST analyses using two approaches: (i) single‐marker pairwise FST analysis and (ii) averaged genome window FST comparative analysis. The SNP with the top 0.0005% FST (99.95th percentile; top 64 SNPs) defined the threshold to detect signatures of selection. In the window‐based approach, 4,987 windows of 1 Mb with a step of 500 kb, were tested by computing an average FST based of SNPs overlapping the window. A total of 9,953 windows of 500 kb with a step of 250 kb were also tested. Analyses, based on method of Weir and Cockerham (1984) were performed with VCFtools (Danecek et al., 2011). All windows that contained at least four SNPs were then retained. The top mean FST (mFST) 20 windows were considered in these analyses (99.6th and 99.8th percentiles for the 1‐Mb and 0.5‐Mb window‐based analyses, respectively). Annotation of the genome regions including the SNPs and windows that trespassed the defined threshold was retrieved from the Bos taurus genome assembly version ARS‐UCD1.2 and considering a region of ± 200 kb around the detected SNPs or considering the overlapping or partially overlapping windows. Genes were retrieved using Ensembl Biomart tool (http://www.ensembl.org/biomart/martview/) and then evaluated considering their functional roles according to an extensive literature search. Functional gene enrichment analysis of genes closed to the SNPs detected in the single‐marker FST analysis was performed with Enrichr (Chen et al., 2013). Over‐representation analysis run over the Gene Ontology v.2021 (http://geneontology.org), KEGG v.2021 (http://www.kegg.jp/) and GWAS catalogue v.2019 (https://www.ebi.ac.uk/gwas/) human libraries. Terms with an adjusted p‐value <0.05 and at least two input genes were retained as statistically over‐represented. Haplotype block analysis of the MC1R and ASIP gene regions was performed using the software Haploview v. 4.2 (Barret et al., 2005) using default options.

RESULTS

Population genomic parameters and structures of the two cattle breeds

Table 1 summarises some basic population genomic parameters calculated in the two cattle breeds. The average within‐breed MAF was higher in the Reggiana (mean and standard deviation: 0.271 ± 0.147) breed than in the Modenese breed (0.257 ± 0.151). The MAF distribution (Figure S1) confirmed the highest number of SNPs (n = 8085) with the lowest MAF values (ranging from 0.01 to 0.05) that was detected in the Modenese breed than in Reggiana (n = 6,649). Within‐breed HO and HE heterozygosity were lower in Modenese than in Reggiana (Table 1) reflecting the other SNP‐based information reported above.
TABLE 1

Population genomic parameters calculated in the Reggiana and Modenese cattle breeds

BreedNo. of animalsAverage MAF1 HO 2 HE 3 FIS 4 FIT 5
Reggiana11090.271 ( ± 0.147)0.3590.3600.0030.068
Modenese3260.257 ( ± 0.151)0.3540.349−0.0170.050

1Minor allele frequency and standard deviation in brackets.

2Observed heterozygosity.

3Expected heterozygosity.

4Inbreeding coefficient of an individual (I) relative to the subpopulation (S).

5Inbreeding coefficient of an individual (I) relative to the total (T) population.

Population genomic parameters calculated in the Reggiana and Modenese cattle breeds 1Minor allele frequency and standard deviation in brackets. 2Observed heterozygosity. 3Expected heterozygosity. 4Inbreeding coefficient of an individual (I) relative to the subpopulation (S). 5Inbreeding coefficient of an individual (I) relative to the total (T) population. Figure 2 reports the two‐dimensional MDS‐plots obtained using genome information from the Reggiana and Modenese breeds. The two breeds are separated by the first three coordinates into two compact and distinguishable clouds.
FIGURE 2

Multidimensional scaling plots produced using genotyping information from each investigated cattle of the Reggiana (red dots) and Modenese (blue dots) breeds. Different components (C) are considered in the plots [Colour figure can be viewed at wileyonlinelibrary.com]

Multidimensional scaling plots produced using genotyping information from each investigated cattle of the Reggiana (red dots) and Modenese (blue dots) breeds. Different components (C) are considered in the plots [Colour figure can be viewed at wileyonlinelibrary.com] The results of the ADMIXTURE analysis are showed in Figure 3. Despite a high number of K was considered, the minimum value of CV error was not detected. However, the higher decrease in K is observed with K = 2, and after that the K values remains quite constant (Figure 3a). The population stratification at K = 2 (Figure 3b) is consistent with the clusters detected by the MDS analysis. If a higher K is considered (e.g. K = 4; Figure 3c), a higher level of stratification of the Reggiana breed could be observed in contrast with the Modenese breed that tended to be more homogeneous.
FIGURE 3

Results of the ADMIXTURE analysis. a) Cross validation (CV) error with K from 1 to 39. b) Plot distribution with K = 2. c) Plot distribution with K = 4. For the last two plots, putative subpopulations (therefore, 2 for K = 2 and 4 for K = 4) are labelled with a different colour [Colour figure can be viewed at wileyonlinelibrary.com]

Results of the ADMIXTURE analysis. a) Cross validation (CV) error with K from 1 to 39. b) Plot distribution with K = 2. c) Plot distribution with K = 4. For the last two plots, putative subpopulations (therefore, 2 for K = 2 and 4 for K = 4) are labelled with a different colour [Colour figure can be viewed at wileyonlinelibrary.com] Linkage disequilibrium was higher in the Modenese breed than in the Reggiana breed as it was evident from the LD decay (Figure 4a) and the average LD calculated for all autosomes in the two breeds (Figure 4b and Table S1). This information reflected the lower Ne value obtained in the Modenese breed than in the Reggiana breed (120 vs. 215, respectively) as also evidenced from its progressive decline plot over the past generations (Figure 4c). The similar trend in LD values estimated for each chromosome in the two breeds, which confirmed the general higher LD values in the Modenese than in the Reggiana breed, also evidenced that the SNPs present in the chip might largely affect the LD structure in the two cattle breeds.
FIGURE 4

Population genomic parameters represented in the Reggiana (red points and lines) and in Modenese (blue points and lines) breeds: (a) linkage disequilibrium (LD) decay over distance; (b) average LD calculated for all autosomes; (c) effective population size (Ne) over the past generations [Colour figure can be viewed at wileyonlinelibrary.com]

Population genomic parameters represented in the Reggiana (red points and lines) and in Modenese (blue points and lines) breeds: (a) linkage disequilibrium (LD) decay over distance; (b) average LD calculated for all autosomes; (c) effective population size (Ne) over the past generations [Colour figure can be viewed at wileyonlinelibrary.com]

FST derived signatures of selection between the two breeds

The global averaged FST value across all SNPs obtained comparing Reggiana and Modenese was 0.066. Figure 5 reports the Manhattan plots obtained in the single‐marker (a) and window‐based (b and c) FST analyses that compared genomic information of the Reggiana and Modenese breeds.
FIGURE 5

Manhattan plots obtained in the single‐marker (a) and window‐based FST analyses using windows of 0.5 Mb (b) or windows of 1 Mb (c), in which the y axis reports the mean FST values (mFST). In the single‐marker analysis, the top 20 markers have been annotated, including two markers within the top 64 list, which are close to genes (in blue) that have been also contained in windows detected with the window‐based approaches. The regions detected with the window‐based approaches (b and c) are annotated with the genes close to SNPs reported in the single‐marker analysis. The two main coat colour genes are indicated in red. A few genes in the ASIP region on BTA13 identified in the window‐based analyses are annotated. The threshold lines are defined according to the percentiles reported in Materials and methods [Colour figure can be viewed at wileyonlinelibrary.com]

Manhattan plots obtained in the single‐marker (a) and window‐based FST analyses using windows of 0.5 Mb (b) or windows of 1 Mb (c), in which the y axis reports the mean FST values (mFST). In the single‐marker analysis, the top 20 markers have been annotated, including two markers within the top 64 list, which are close to genes (in blue) that have been also contained in windows detected with the window‐based approaches. The regions detected with the window‐based approaches (b and c) are annotated with the genes close to SNPs reported in the single‐marker analysis. The two main coat colour genes are indicated in red. A few genes in the ASIP region on BTA13 identified in the window‐based analyses are annotated. The threshold lines are defined according to the percentiles reported in Materials and methods [Colour figure can be viewed at wileyonlinelibrary.com] Table 2 reports the top 20 markers with the highest FST values, which ranged from 0.977 to 0.637. The full set of markers (n = 64) trespassing the 99.95th percentile threshold is reported in Table S2. Table 3 includes the top 20 0.5 Mb genome windows and Table S3 contains information on the top 20 1 Mb genome windows identified using the two applied window‐based analyses, respectively. Averaged FST values in these windows ranged from 0.344 to 0.222 in the top (99.8th percentile) 0.5‐Mb genome windows and from 0.255 to 0.163 in the top (99.6th percentile) 1‐Mb genome windows. The drastic drop of FST values from the single‐marker to the window‐based analyses might indicate that the two breeds could be distinguished by a few highly separated loci that experienced a rapid decay of LD apart from informative short genome regions, diluting the FST values in the window‐based approaches.
TABLE 2

Top 20 markers identified in the single‐marker FST analysis between the two breeds

Markers1 BTA2 Position3 FST Closest gene (bp)4
MC1R1814,705,6450.977 MC1R (0)*
BovineHD0700013748745,833,4000.783 PPP2CA (9618)*
ARS‐BFGL‐NGS−1141402121,791,0540.773 FURIN (0)
ARS‐BFGL‐NGS−281541826,500,8400.752 GOT2 (53,100)
BovineHD06000333816112,511,2160.736 LDB2 (357,854)
BovineHD13000182971363,480,2540.717 EIF2S2 (0), ASIP (182,542)*
BTA−78954‐no‐rs745,800,2750.705 TCF7 (0)*
ARS‐BFGL‐NGS−5505945,545,4190.702 IKZF1 (0)
ARS‐BFGL‐NGS−5595745,766,6950.694 TCF7 (2218)*
ARS‐BFGL‐NGS−73679745,729,8370.692 TCF7 (39,076)*
BTA−86548‐no‐rs1116,591,3220.671 RASPGRP3 (476,598)
BovineHD25000071202524,908,0140.665 IL4R (0)
BovineHD24000151792453,014,5830.661 DCC (0)
BovineHD0600009128631,158,9860.652 GRID2 (0)*
BovineHD0600009122631,135,4820.647 GRID2 (0)*
ARS‐BFGL‐NGS−350811446,102,1330.647 SAMD12 (36,424), EXT1 (62,323)
BovineHD21000067522122,531,2470.645 SLC28A1 (0)
ARS‐BFGL‐NGS−20141745,691,0370.639 VDAC1 (0)*
BovineHD0500003920512,981,3580.638 TMTC2 (405,765)
ARS‐BFGL‐NGS−16203399,840,4800.637 RAD54L (0)

The markers are ranked according to the FST value. All 99.95th percentile markers are reported in Table S2.

1Marker name in the GeneSeek GGP Bovine 150 k SNP chip.

2 Bos taurus chromosome.

3Position of the marker in the ARS‐UCD1.2 cattle genome version.

4Distance in bp of the marker with the indicated gene is reported within the bracket. When the marker overlaps the gene, a value equal to 0 bp is indicated. The star symbol indicates those genes that are also included in the top 0.5 and/or 1 Mb windows in the window‐based FST analyses (see also Figure 5).

TABLE 3

The top 20 0.5 Mb genome windows identified in the FST analysis between the two breeds

BTA1 Bin start2 Bin end3 No. of SNPs4 Average FST 5 Genes6
1814,500,00115,000,000180.344 CPNE7, DPEP1, CHMP1A, CDK10, SPATA2L, VPS9D1, ZNF276, FANCA, SPIRE2, TCF25, MC1R, TUBB3, DEF8, DBNDD1, GAS8, U1, SHCBP1, VPS35
745,750,00146,250,000150.328 TCF7, SKP1, PPP2CA, CDKL3, UBE2B, CDKN2AIPNL, JADE2, SAR1B, U6, SEC24A, CAMLG
2213,250,00113,750,000150.310 ENTPD3, RPL14, ZNF619, ZNF621, 7SK, U6
61500,000190.287 U6, APELA
745,500,00146,000,000190.286 C7H5orf15, VDAC1, TCF7, SKP1, PPP2CA, CDKL3, UBE2B, CDKN2AIPNL
1814,250,00114,750,000170.281 CDH15, SLC22A31, ANKRD11, SPG7, RPL13, CPNE7, DPEP1, CHMP1A, CDK10, SPATA2L, VPS9D1, ZNF276, FANCA, SPIRE2, TCF25, MC1R, TUBB3, DEF8
1642,250,00142,750,000100.276 UBIAD1, MTOR, ANGPTL7, EXOSC10, SRM, MASP2, TARDBP
893,250,00193,750,000210.259 SMC2
893,000,00193,500,000230.257
1363,250,00163,750,000210.257 CHMP4B, PXMP4, E2F1, ZNF341, NECAB3, RALY, EIF2S2, ASIP, AHCY
1644,250,00144,750,00060.256 U1, GPR157, CA6, ENO1, U6
112,750,0013,250,000280.249 CNNM4, CNNM3, ANKRD23, ANKRD39, SEMA4C, COX5B, ACTR1B
151500,000220.243
630,000,00130,500,000220.238 PDLIM5, 7SK, HPGDS, SMARCAD1
635,000,00135,500,000280.234 SNCA
1815,500,00116,000,000220.227 NETO2, ITFG1, U6, PHKB
15250,001750,000230.225
694,250,00194,750,000150.224 U6, ANTXR2
665,250,00165,750,000300.224 COX7B2, H4C14, GABRA4, GABRB1
1644,000,00144,500,000130.222 SPSB1, H6PD, U1, GPR157, CA6

The windows are ranked according to the average FST value.

1 Bos taurus chromosome.

2Start position of the genome window in the ARS‐UCD1.2 cattle genome version.

3End position of the genome window in the ARS‐UCD1.2 cattle genome version.

4Number of SNPs included in the 0.5 Mb genome window.

5Average FST value based on SNPs included in the genome window.

6Genes annotated in the reported genome window (ARS‐UCD1.2 cattle genome version).

Top 20 markers identified in the single‐marker FST analysis between the two breeds The markers are ranked according to the FST value. All 99.95th percentile markers are reported in Table S2. 1Marker name in the GeneSeek GGP Bovine 150 k SNP chip. 2 Bos taurus chromosome. 3Position of the marker in the ARS‐UCD1.2 cattle genome version. 4Distance in bp of the marker with the indicated gene is reported within the bracket. When the marker overlaps the gene, a value equal to 0 bp is indicated. The star symbol indicates those genes that are also included in the top 0.5 and/or 1 Mb windows in the window‐based FST analyses (see also Figure 5). The top 20 0.5 Mb genome windows identified in the FST analysis between the two breeds The windows are ranked according to the average FST value. 1 Bos taurus chromosome. 2Start position of the genome window in the ARS‐UCD1.2 cattle genome version. 3End position of the genome window in the ARS‐UCD1.2 cattle genome version. 4Number of SNPs included in the 0.5 Mb genome window. 5Average FST value based on SNPs included in the genome window. 6Genes annotated in the reported genome window (ARS‐UCD1.2 cattle genome version). The 64 markers were distributed in 23 different autosomes (Table S2). Some of them were also captured in the window‐based analyses: eight markers encompassing three chromosomes overlapped the top 0.5‐Mb windows and 15 markers encompassing five chromosomes overlapped the top 1‐Mb windows (Table S2). The top marker (FST = 0.977) was the frameshift mutation in the MC1R gene that causes the e allele at the Extension locus and that determines the classical red coat colour of the Reggiana cattle (Bovo et al., 2021; Klungland et al., 1995; Russo et al., 2007). This result was due to the fact that Reggiana cattle selected for this study were fixed for this recessive MC1R allele whereas Modenese cattle were almost fixed for an alternative allele (only one Modenese animal carried the e allele). This polymorphic site, located on BTA18, was in the first top genome window of the 0.5 Mb analysis and in the third and fourth top sliding windows in the 1 Mb analysis. The haploblock structure of this region in Reggiana cattle indicated that a relatively low level of LD is present in the MC1R gene region in both Reggiana and Modenese breeds, with some LD blocks only upstream or downstream this gene (Figure S2). The second top polymorphism in the single‐marker analysis was localized on BTA7 about 9.6 kb from the protein phosphatase 2 catalytic subunit alpha (PPP2CA) gene (Table 2). Additional four markers in the same chromosome region (Table 2), that contributed to the second and fifth highest averaged FST values in the 0.5‐Mb genome window analysis (Table 3), were within or close to the transcription factor 7 (TCF7) and voltage dependent anion channel 1 (VDAC1) genes. Other three top 99.95th percentile markers, that were also contained in top genome windows considering both the 0.5‐Mb and 1‐Mb size, were located on BTA13 within or very close to the eukaryotic translation initiation factor 2 subunit beta (EIF2S2) and agouti signalling protein (ASIP) genes (Table 2 and Table 3; Table S2). ASIP is the gene that determines the Agouti locus, which affects coat colour in many mammalian species (Searle, 1968). The LD structure of this region in Modenese cattle showed a haplotype block in the correspondence of the RALY heterogeneous nuclear ribonucleoprotein (RALY) and eukaryotic translation initiation factor 2 subunit beta (EIF2S2) genes, which are upstream the ASIP gene (Figure S3). The Reggiana breed had a main haploblock in the correspondence of the itchy E3 ubiquitin protein ligase (ITCH) gene, which is downstream the ASIP gene (Figure S3). Additional top SNPs on BTA5, BTA6 and BTA24 were also included in top genome windows detected with the 0.5 and/or 1 Mb window‐approaches. Markers on BTA5 were within or close to the myosin heavy chain 9 (MYH9) gene, SNPs on BTA6 were within the glutamate ionotropic receptor delta type subunit 2 (GRID2) gene and the marker on BTA24 was in a desert region where the closest genes are cadherin 2 (CDH2) and cadherin related 23 (CDH23) (Table 3, Table S2 and Table S3). Gene enrichment analysis returned significant results only for the Human GWAS catalogue. Amongst the 12 over‐represented phenotypes (Table S4), nine were related to pigmentation (e.g. skin colour, skin pigmentation, skin ageing, freckles and tanning processes) and involved the ASIP, ERBB4, EIF2S2 and MC1R genes. Melanogenesis was the most over‐represented KEGG pathway (adjusted p‐value = 0.14; ASIP, MC1R and TCF7 genes) whereas the regulation of tyrosine phosphorylation of STAT protein (GO:0,042,509; adjusted p‐value = 0.15; ERBB4, GHR and PPP2CA genes) was the most over‐represented GO Biological Process.

DISCUSSION

Reggiana and Modenese are considered two iconic breeds that are part of the history of the livestock production sector of the North of Italy from which the well‐known PDO Parmigiano‐Reggiano cheese was originated. Genomic population parameters calculated for the two breeds are in agreement to those that usually describe the small population size of many autochthonous breeds. The geographical closeness of the two breeds (both were developed in close provinces of the North of Italy), as it could be expected, also resulted in a relatively high genetic closeness when Reggiana and Modenese were analysed together with many other Italian cattle breeds (Mastrangelo, Sardina, et al., 2018). Despite this closeness, Reggiana and Modenese cattle were clearly distinguished using genomic information obtained with the GeneSeek GGP Bovine 150k SNP chip. Admixture patterns and MDS‐plots were able to separate all animals belonging to the two breeds. Genomic differences could be derived by the combined action of divergent artificial directional selection and genetic drift followed by genetic isolation due to the use of different male and female genetic stocks. A pairwise comparison between these two breeds, shown in Figure 5, and the subsequent genetic investigation highlighted that the most relevant genomic differences may affect coat colour genes. Gene enrichment analysis confirmed that pigmentation and related traits explained the most relevant differences that emerged between these two breeds. This is of particular interest as these genomic differences might be eventually masked or under‐rated if comparisons would have included more breeds in averaged FST analyses, which are the commonly used methodologies for these types of investigations (Munoz et al., 2019; Bovo et al., 2020). Both Reggiana and Modenese breeds originally derived from unspecialized triple purpose cattle (work‐dairy‐beef); therefore, one of the main drivers that contributed to separate them and that represents the main characterizing phenotype is their different coat colour. Solid red (Reggiana) and solid white with grey shades (Modenese) are the colours that define the standards of these two breeds. This phenotype is the most relevant descriptor used to admit animals in one or the other breed herd book. There is a story telling tradition that suggests that the selection for different coat colours in Reggiana and Modenese would derive from the ancient rivalry between the two close towns (i.e. Reggio Emilia and Modena) from which the two breeds took their names. The most relevant signature of selection that differentiated Reggiana and Modenese breeds was determined at the e allele of the MC1R gene, which causes the classical red (fromentino) coat colour of the Reggiana cattle (Bovo et al., 2021; Klungland et al., 1995; Russo et al., 2007). The high FST value (almost equal to 1) reached by the causative mutation at the MC1R allele dropped in the window‐based analyses, as it was averaged across all SNPs included in 0.5 Mb or 1 Mb. The relatively low LD that is present in the MC1R gene region of BTA18 in the Reggiana indicated an ancient origin of the fixed e allele in this breed and that more haplotypes or haplotype blocks containing this causative mutation were present in Reggiana cattle. The strong selection pressure that fixed (or almost fixed) this Extension allele, therefore, did not result in an extended fixation of several other close SNPs on BTA18. Instead, the genetic determinism of the white coat colour with some pale grey shades of the Modenese cattle, has not been unravelled yet. This type of coat colour, also described in several other taurine breeds, have not been fully clarified in any other populations as well. Modenese breed is almost fixed for the wild type allele at the MC1R gene, as also reported in a previous study (Russo et al., 2007). According to the classical epistatic interaction between the Extension and Agouti loci, wild type alleles at the MC1R gene would give the possibility to express mutated alleles at the Agouti locus (Searle, 1968). Therefore, it is quite remarkable that a strong FST signal between Reggiana and Modenese was detected in the ASIP gene region on BTA13. The signal was not as high as it was observed for the MC1R gene region even if it was confirmed using single‐marker and the two window‐based approaches. The relatively lower FST value of this region if compared to that of the MC1R gene region could be due to the lack of the causative mutation(s) in the SNP chip and/or to the masking effect of the mutated MC1R allele in Reggiana that would epistatically cover the mutated allele(s) at the ASIP gene. Mutated ASIP alleles could be also present in the Reggiana breed but at lower frequency than in Modenese breed, reducing in this way allele frequency differences between the two breeds and, in turn the FST values in this BTA13 region. The LD analysis in Reggiana and Modenese indicated high LD in different regions of the BTA13 that includes ASIP, suggesting the presence of different haplotype structures in the two breeds, with different potential regulatory effects over ASIP that we could hypothesize (which should be, however, demonstrated). Variability at the ASIP gene determined by copy number variations (CNVs), probably with regulatory effects on gene expression, has been already associated with the white coat colour in different sheep and goat breeds (Norris & Whan, 2008; Fontanesi et al., 2009, 2011). Therefore, it would be possible to speculate that, in Modenese breed, CNVs or other regulatory mutations affecting ASIP gene could determine a similar phenotypic effect on coat colour as already observed in the other two ruminant species (i.e. sheep and goat). Recently, Trigo et al. (2021) reported that in Nellore cattle (Bos indicus), which are selected for white coat colour, a structural variant affecting the ASIP gene expression is associated with darker coat pigmentation on specific parts of the body. Few studies have investigated variability in the Bos taurus ASIP gene. Royo et al (2005) did not report any variability in the ASIP coding region of cattle from six Spanish and three French brown breeds. None of the ASIP polymorphisms reported in Korean cattle were associated to any coat colours (Do et al., 2007). Girardot et al. (2006) reported in Normande cattle an insertion in a regulatory region of the ASIP gene that was suggested to be implicated in the brindle coat colour pattern of the breed. It will be important to characterize the ASIP gene in Modenese cattle, including all regulatory regions and surrounding genes, to disentangle its expected effect on coat colour that it could be possible to predict from the results of this study. To further support the hypothesised role of the ASIP gene (or of close genes on BTA13) in determining the white coat colour of the Modenese cattle, no signatures of selection were determined in genomic regions harbouring other genes affecting coat colour in cattle. This would exclude the involvement of other well‐known genes determining white patterns, like the v‐kit Hardy‐Zuckerman 4 feline sarcoma viral oncogene homolog (KIT) gene on BTA6 and the microphthalmia‐associated transcription factor (MITF) gene on BTA22 (Fontanesi, Scotti, et al., 2010; Fontanesi, Tazzoli, et al., 2012, 2010), or genes diluting coat colours, like the premelanosome protein (PMEL) gene on BTA5, which was recently shown to dilute the coat colour in Reggiana cattle (Bovo et al., 2021). Other signatures of selection were evident from the FST comparative analyses between the two breeds. These signatures might be caused by genetic drift that would be subsequently due to the constrains generated by the use of sires and dams that could assure the requested coat colour phenotype needed to register the animals to their herd books. These genomic differences could contribute to further differentiate these two breeds for some production performances or other phenotypic traits, but their effect should be demonstrated using other approaches. The use of other pairwise methods to detect signature of selection (e.g. Bertolini et al., 2020) could also identify additional genomic regions that might be involved in differentiating these breeds or that could provide more complete genomic pictures of the results of the genetic drift, bottleneck and admixture with other breeds or populations that have probably contributed to shape the current genetic pool of these two cattle autochthonous breeds.

CONCLUSIONS

Population genomic analyses applied to compare the genome architecture of two closely related cattle genetic resources (Reggiana and Modenese) allow the identification of hints that could explain their main phenotypic differences. Signatures of selection were evidenced in two genome regions encompassing major coat colour‐affecting genes. One region on BTA18, including the MC1R gene, whose role in determining the red coat colour of Reggiana was already well established, could provide a proof of concept for the general interpretation of the results obtained in a region of BTA13, which includes the ASIP gene. Whole genome resequencing of Reggiana and Modenese is currently planned. This will help to characterise variability in the ASIP gene and investigate their association with the white coat colour in Modenese breed. The identification of Modenese specific DNA markers could be useful to develop methods to authenticate the origin of the milk and thus of the cheese declared to be derived from this breed. This study demonstrates how population genomic approaches designed to take advantage from the diversity between local genetic resources could provide interesting information to explain exterior traits not yet completely investigated in cattle.

CONFLICT OF INTEREST

The authors declare they do not have any competing interests. Data reported in this work can be shared after signature of an agreement on their use with University of Bologna.

AUTHOR CONTRIBUTION

L.F. designed the study, interpreted the results and obtained funding. L.F., F.B. and G.M. wrote the paper. A.R. and G.M. performed the wet lab work. G.M., F.B., G.S., S.B. and M. Ba. conducted bioinformatic analyses. M. Bo. and M.P. provided samples and data. S.B., G.S., S.D. and M. Ba. contributed to data interpretation. All authors read and approved the submitted version. Supplementary Material Click here for additional data file.
  29 in total

1.  Haploview: analysis and visualization of LD and haplotype maps.

Authors:  J C Barrett; B Fry; J Maller; M J Daly
Journal:  Bioinformatics       Date:  2004-08-05       Impact factor: 6.937

2.  Preselection statistics and Random Forest classification identify population informative single nucleotide polymorphisms in cosmopolitan and autochthonous cattle breeds.

Authors:  F Bertolini; G Galimberti; G Schiavo; S Mastrangelo; R Di Gerlando; M G Strillacci; A Bagnato; B Portolano; L Fontanesi
Journal:  Animal       Date:  2017-06-23       Impact factor: 3.240

3.  Coat colours in the Massese sheep breed are associated with mutations in the agouti signalling protein (ASIP) and melanocortin 1 receptor (MC1R) genes.

Authors:  L Fontanesi; S Dall'Olio; F Beretti; B Portolano; V Russo
Journal:  Animal       Date:  2011-01       Impact factor: 3.240

4.  Genome-wide identification of runs of homozygosity islands and associated genes in local dairy cattle breeds.

Authors:  S Mastrangelo; M T Sardina; M Tolone; R Di Gerlando; A M Sutera; L Fontanesi; B Portolano
Journal:  Animal       Date:  2018-03-26       Impact factor: 3.240

5.  The coding sequence of the ASIP gene is identical in nine wild-type coloured cattle breeds.

Authors:  L J Royo; I Alvarez; I Fernández; J J Arranz; E Gómez; F Goyache
Journal:  J Anim Breed Genet       Date:  2005-10       Impact factor: 2.380

6.  Copy number variation and missense mutations of the agouti signaling protein (ASIP) gene in goat breeds with different coat colors.

Authors:  L Fontanesi; F Beretti; V Riggio; E Gómez González; S Dall'Olio; R Davoli; V Russo; B Portolano
Journal:  Cytogenet Genome Res       Date:  2009-12-16       Impact factor: 1.636

7.  Genetic heterogeneity at the bovine KIT gene in cattle breeds carrying different putative alleles at the spotting locus.

Authors:  L Fontanesi; M Tazzoli; V Russo; J Beever
Journal:  Anim Genet       Date:  2009-11-26       Impact factor: 3.169

8.  A gene duplication affecting expression of the ovine ASIP gene is responsible for white and black sheep.

Authors:  Belinda J Norris; Vicki A Whan
Journal:  Genome Res       Date:  2008-05-20       Impact factor: 9.043

9.  Second-generation PLINK: rising to the challenge of larger and richer datasets.

Authors:  Christopher C Chang; Carson C Chow; Laurent Cam Tellier; Shashaank Vattikuti; Shaun M Purcell; James J Lee
Journal:  Gigascience       Date:  2015-02-25       Impact factor: 6.524

10.  Signatures of selection are present in the genome of two close autochthonous cattle breeds raised in the North of Italy and mainly distinguished for their coat colours.

Authors:  Francesca Bertolini; Giulia Moscatelli; Giuseppina Schiavo; Samuele Bovo; Anisa Ribani; Mohamad Ballan; Massimo Bonacini; Marco Prandi; Stefania Dall'Olio; Luca Fontanesi
Journal:  J Anim Breed Genet       Date:  2021-11-28       Impact factor: 3.271

View more
  2 in total

1.  Signatures of selection are present in the genome of two close autochthonous cattle breeds raised in the North of Italy and mainly distinguished for their coat colours.

Authors:  Francesca Bertolini; Giulia Moscatelli; Giuseppina Schiavo; Samuele Bovo; Anisa Ribani; Mohamad Ballan; Massimo Bonacini; Marco Prandi; Stefania Dall'Olio; Luca Fontanesi
Journal:  J Anim Breed Genet       Date:  2021-11-28       Impact factor: 3.271

2.  Seven Shades of Grey: A Follow-Up Study on the Molecular Basis of Coat Colour in Indicine Grey Cattle Using Genome-Wide SNP Data.

Authors:  Gabriele Senczuk; Vincenzo Landi; Salvatore Mastrangelo; Christian Persichilli; Fabio Pilla; Elena Ciani
Journal:  Genes (Basel)       Date:  2022-09-07       Impact factor: 4.141

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.