Literature DB >> 30776288

Linkage disequilibrium and past effective population size in native Tunisian cattle.

Slim Ben Jemaa1, Nejia Thamri2, Sofiane Mnara3, Emmanuelle Rebours4, Dominique Rocha4, Mekki Boussaha4.   

Abstract

To carry out effective genome-wide association studies, information about linkage disequilibrium (LD) is essential. Here, we used medium-density SNP chips to provide estimates of LD in native Tunisian cattle. The two measures of LD that were used, mean r2 and D', decreased from 0.26 to 0.05 and from 0.73 to 0.40, respectively, when the distance between markers increased from less than 20 Kb to 200 Kb. The decay in LD over physical distance occurred at a faster rate than that reported for European and other indigenous breeds, and reached background levels at less than 500 Kb distance. This is consistent with the absence of strong selective pressure within the Tunisian population and suggests that, in order to be effective, any potential genome-wide association mapping studies will need to use chips with higher marker density. An analysis of effective population size (Ne) based on LD data showed a decline in past Ne, with a sudden drop starting about eight generations ago. This finding, combined with the high levels of recent inbreeding revealed by runs of homozygosity (ROH) analysis, indicate that this population is endangered and may be in urgent need of a conservation plan that includes a well-designed genetic management program.

Entities:  

Year:  2019        PMID: 30776288      PMCID: PMC6428135          DOI: 10.1590/1678-4685-GMB-2017-0342

Source DB:  PubMed          Journal:  Genet Mol Biol        ISSN: 1415-4757            Impact factor:   1.771


Introduction

Native Tunisian cattle are small animals that are well adapted to the harsh environmental conditions that characterize their rearing area, such as reduced food resources, elevated temperatures during the hot season, and an abundance of parasites and pathogens. They are currently endangered by the result of extensive crossbreeding with imported breeds (Djemali ). Local cattle in Tunisia belong to three main ecotypes, defined according to coat color. These are the Blonde du Cap Bon (BLCAP) (white), the Brune de l’Atlas Grise (BRATG) (grey), and the Brune de l’Atlas Fauve (BRATF) (tawny). Native Tunisian cattle have not been subjected to artificial selection, and are in need of breeding programs to improve their production performance. Nowadays, the availability of high-throughput genotyping technologies makes it possible to use genetic markers to map genes that are responsible for economically important traits. Such information can then be incorporated into breeding programs. Before one can efficiently map these genes, it is, however, necessary to first determine the marker density that is compatible with the distances across which linkage disequilibrium (LD) extends in the population of interest. LD refers to the non-random association of alleles at different loci in gametes. LD patterns vary substantially among populations and can arise both through physical linkage between loci and, more generally, through a variety of evolutionary forces that structure a genome, such as genetic drift, admixture, and natural selection. By characterizing the pattern of LD within a population, we are able to determine the amount of markers needed to optimize the mapping resolution of genetic variants related to economic traits. Furthermore, LD is used routinely for the study of the demographic history of populations in many species, such as humans (Reich ; McEvoy ), cattle (Khatkar ; Porto-Neto ), and sheep (McRae ; Al-Mamun ). It can also be used to infer recent and historical effective population size (Ne). Ne is defined as the size of an “ideal” population – a hypothetical population that is panmictic, large enough that sampling error is negligible (thereby preventing genetic drift), with an equal sex ratio, and without migration, mutation, or selection – that would experience the effects of inbreeding or drift to the same degree as observed in the actual breeding population (Falconer and Mackay, 1996). Ne is an important parameter for monitoring the genetic health of endangered populations as it reflects the risk of extinction that is posed by genetic drift (which drives the loss of alleles in endangered populations). Here, we characterized LD in the genome of local Tunisian cattle by analyzing 87 individuals using the Illumina BovineSNP50 chip assay. We then used LD estimates to infer changes in the past effective population size of native Tunisian cattle over time. In addition, to get an overview of population inbreeding, we present a genome-wide characterization of runs of homozygosity (ROH) in the Tunisian individuals.

Materials and Methods

Animal sampling

We identified for sampling 40 individuals of indigenous Tunisian cattle; these originated from both the northwestern and central parts of Tunisia, which are two of the main regions in the country where local cattle breeds are found. Animal sampling was carried out by a multi-institutional team representing the Livestock and Pasture Office (OEP), the Office of Silvopastoral Development of the North-West (ODESYPANO), the National Gene Bank (BNG), and the National Institute of Agronomic Research of Tunisia (INRAT). Since most native cattle have been extensively crossed with imported breeds, significant efforts were made to select purebred individuals. Selection was performed based on morphological criteria and targeted isolated mountainous regions where no historical artificial insemination activity was recorded. Our sample included 25 BRATF and 15 BRATG individuals. Blood samples were collected in EDTA Vacutainer tubes under procedures approved by the Tunisian Veterinary Authority.

DNA extraction and genotyping assays

DNA was extracted using the phenol-chloroform protocol (Cheng ). DNA quantity and quality were evaluated using a Nanodrop instrument, and DNA samples were then genotyped on the BovineSNP50 BeadChip Ver. 2 (Illumina, San Diego, CA, USA) at the Labogena core facility (Jouy-en-Josas, France) using standard operating procedures, as recommended by the manufacturer (http://www.illumina.com).

Marker quality control and selection

Overall, we obtained genotyping data from all 40 animals for 52,274 SNPs. Quality control of these data was performed using PLINK software v1.9 (Purcell ). Samples with a genotyping rate of <90% across autosomal SNPs and markers with a call rate <90% were discarded. Using these criteria, we excluded one BRATF individual and 2,232 SNPs from further analysis. An exact test for Hardy-Weinberg equilibrium (p <0.01) was then carried out on the remaining SNPs using PLINK, which resulted in the retention of 49,362 SNPs for subsequent analyses. We combined these newly generated genotypes with those already available for 48 individuals of Tunisian cattle: 18 BRATF, 15 BRATG, and 15 BLCAP (Ben Jemaa ). Thus, our final data set was composed of 48,458 SNPs genotyped for 87 Tunisian individuals. To evaluate the effect of minor allele frequencies (MAFs) on LD, we estimated the LD score using three different MAF thresholds: 0.01 (MAF 0.01), 0.05 (MAF 0.05), and 0.1 (MAF 0.1). The application of these three thresholds resulted in the retention of 42,699, 38,078, and 33,146 SNPs, respectively. In order to facilitate comparisons among the three datasets, we reduced the size of the MAF 0.01 and MAF 0.05 datasets to 33,333 and 33,509 SNPs, respectively.Marker coverage and SNP density per chromosome are indicated for each MAF threshold in Table S1, and the distribution of allele frequencies for each MAF cutoff value is illustrated in Figure S1.

Detection of runs of homozygosity

To assess levels of inbreeding in our 87 individuals, we examined our genetic data for runs of homozygosity (ROH). These were detected in sliding windows of 20 SNPs using PLINK. No more than five missing calls and one heterozygous SNP were allowed in each window. The minimum length of a ROH segment was set to 1 Mb, and a ROH was declared if it contained at least 20 SNPs. The minimum required SNP density was one SNP per 200 kb and the maximum gap allowed between any two consecutive SNPs was 1000 kb. ROHs were classified into seven categories based on length (1 to 4.999 Mb, 5 to 9.999 Mb, 10 to 14.999 Mb, 15 to 19.999 Mb, 20 to 24.999 Mb, 25 to 49.999 Mb, >50 Mb).

Estimation of linkage disequilibrium

We used the genotyping data from these 87 individuals to estimate linkage disequilibrium in our sample. For this, we used Haploview v4.2 (Barrett ) to generate two standard descriptive LD parameters, D’ and r, for all syntenic SNP pairs that were less than 50 Mb apart. All values of r2 between syntenic markers were corrected for sample size using the following equation: where n is the number of haplotypes in the sample (Villa-Angulo ). To better understand how LD estimates were related to physical distances between syntenic marker pairs, we used R software (https://www.r-project.org/) to analyze LD decay over the following windows of genetic distance (Kb): 0-19; 20-39; 40-59; 60-99; 100-199; 200-499; 500-999; 1,000-1,999; 2,000-4999; 5,000-9,999; 10,000-11,999; 12,000-14,999; 15,000-16,999; 17,000-19,999; 20,000-29,999; 30,000-50,000. Average r2 and D’ values were determined and plotted for all combined autosomes for each distance window and for each MAF cutoff value. LD parameters and the percentage of SNP pairs with r2 values >0.2 or 0.8 were also calculated between adjacent SNPs within each chromosome. We further determined the amount of background linkage disequilibrium by calculating r2 and D’ for non-syntenic SNPs. This was done by randomly selecting 1,700 SNPs (~5% of the markers) distributed over all chromosomes.

Estimation of effective population size

We used SNeP software (Barbato ) to estimate Ne and its change over time using the observed extent of LD calculated for the three MAF thresholds. SNeP estimates the historic effective population size based on the following equation (Corbin ). where N T(t) is the effective population size estimated t generations ago, f(ct) is the mapping function used to estimate the recombination rate (ct) t generations ago, r2adj is the LD estimation, adjusted for sampling bias (r2adj=r2-(1/2N), where N is the population sample size), and “a” is a constant that accounts for mutation. Inter-marker distances used in our NT(t) estimates ranged from 0.02 to 50 Mb. We also set a to 1 when f(ct) was calculated using the Sved and Feldman (1973) 0 approximation: f(ct) = d(1-d/2), where d is the linkage distance between SNPs, which is proportional to the physical distance, δ (δ=10-8*d).

Results

Detection of ROHs in the Tunisian cattle genome

A total of 3,313 ROHs were identified in the 87 Tunisian cattle individuals. The average number of ROHs per individual was 38 ± 12.6 (range: 16 to 81). Most ROHs (85%) were between 1 and 5 Mb long. Chromosome BTA01 had the highest number of ROHs (305), while BTA28 had the least (39) (Figure S2). Overall, the number of ROHs per chromosome tended to decrease with chromosome length (correlation between chromosome length and the number of ROHs was ~92%). Among the 87 individuals, 36 (~41%) had a total sum of ROHs >100 Mb, while 15 individuals (~ 17%) had a total sum of ROHs >250 Mb (Figure S3). Among these, 14 individuals (~93%) had at least one ROH that was more than 25 Mb long (Table S2). Approximately 5.4%, on average, of the entire genome of Tunisian cattle was covered by ROH segments. Chromosome BTA25 showed the highest coverage by ROHs (12.4%) while BTA01 had the least (1.6%) (Figure S2).

Characterization of Linkage Disequilibrium

The extent of LD was first evaluated for all syntenic SNP pairs for all chromosomes combined. Table 1 shows the values estimated for the decay of r2, r2 corrected for sample size, and D’ for all 29 autosomes; this information is displayed graphically in Figure 1. Correction for sample size resulted in a slight decrease (~7% on average) in r2 estimates for markers that were less than 500 Kb apart. Overall, LD parameters declined rapidly over short inter-marker distances. For example, the estimated r2 values averaged over the three MAF thresholds decreased from 0.26 for syntenic SNPs less than 20 Kb apart to 0.05 for those that were less than 200 Kb apart. However, LD decay was slower over longer physical distances. For example, mean D’ dropped from 0.33 for SNPs separated by 200-499 Kb to 0.28 for those that were more than 30 Mb apart.
Table 1

Average D’ and r2 estimates by distance class.

Distance_class (Kb)Number of pairsAverage D’ (sd)Average r2 (sd)Average r2 corrected (sd)
MAF0.01MAF0.05MAF0.1MAF0.01MAF0.05MAF0.1MAF0.01MAF0.05MAF0.1MAF0.01MAF0.05MAF0.1
0-195465515800.738(0.34)0.729(0.33)0.718(0.33)0.227(0.3)0.254(0.29)0.291(0.31)0.223(0.299)0.250(0.29)0.287(0.30)
20-3911 88812 26412 0740.684(0.35)0.651(0.34)0.630(0.34)0.152(0.22)0.179(0.24)0.203(0.25)0.147(0.226)0.174(0.23)0.199(0.25)
40-599 5569 7389 5710.605(0.35)0.567(0.34)0.533(0.34)0.108(0.18)0.127(0.19))0.142(0.2)0.103(0.179)0.122(0.19)0.137(0.19)
60-9919 31919 75919 3980.540(0.36)0.486(0.34)0.447(0.32)0.076(0.14)0.087(0.14)0.096(0.15)0.071(0.137)0.081(0.14)0.091(0.15)
100-19947 35348 23347 3110.460(0.35)0.397(0.31)0.349(0.28)0.046(0.09)0.051(0.090.055(0.1)0.040(0.090)0.046(0.09)0.050(0.09)
200-499139 280141 276138 2330.402(0.34)0.328(0.290.277(0.24)0.027(0.05)0.028(0.05)0.029(0.05)0.021(0.047)0.023(0.04)0.024(0.04)
500-999228 259231 215226 8520.383(0.330.306(0.28)0.254(0.23)0.021(0.03)0.022(0.03)0.023(0.030.016(0.033)0.017(0.03)0.017(0.03)
1 000-1 999447 448453 482442 9030.374(0.33)0.297(0.27)0.245(0.22)0.020(0.03)0.021(0.03)0.021(0.03)0.014(0.029)0.015(0.02)0.015(0.02)
2 000-4 9991 290 5931 303 8731 274 3100.366(0.33)0.289(0.27)0.238(0.22)0.018(0.03)0.019(0.03)0.020(0.03)0.013(0.027)0.014(0.02)0.014(0.02)
5 000-9 9992 010 3702 027 5581 982 1590.358(0.33)0.280(0.27)0.229(0.21)0.017(0.02)0.018(0.02)0.018(0.02)0.011(0.025)0.012(0.02)0.013(0.02)
10 000-11 999760 396765 970749 8640.354(0.33)0.275(0.26)0.225(0.21)0.016(0.02)0.017(0.02)0.017(0.02)0.011(0.023)0.012(0.02)0.012(0.02)
12 000-14 9991 100 1431 106 0241 081 4840.352(0.33)0.273(0.26)0.223(0.210.016(0.02)0.017(0.02)0.017(0.02)0.010(0.023)0.011(0.02)0.011(0.02)
15 000-16 999708 390711 382695 3920.349(0.33)0.270(0.26)0.221(0.21)0.016(0.02)0.017(0.02)0.017(0.02)0.010(0.022)0.011(0.02)0.011(0.02)
17 000-19 9991 027 4511 033 8651 011 2270.348(0.33)0.269(0.26)0.220(0.21)0.016(0.02)0.016(0.020.017(0.02)0.010(0.022)0.011(0.02)0.011(0.02)
20 000-29 9993 100 8693 115 8083 048 0490.345(0.33)0.265(0.26)0.217(0.21)0.015(0.02)0.016(0.02)0.016(0.02)0.009(0.022)0.010(0.02)0.010(0.02)
30 000-49 9994 710 1894 739 6684 629 7390.341(0.33)0.263(0.26)0.214(0.2)0.015(0.02)0.015(0.02)0.016(0.02)0.009(0.021)0.010(0.02)0.010(0.02)
Non-syntenic SNPs1 444 1511 444 1511 444 1510.342(0.32)0.258(0.25)0.214(0.2)0.0145(0.02)0.015(0.02)0.015(0.02)0.008(0.020)0.009(0.02)0.009(0.02)
Figure 1

Average LD decay pooled over all autosomes for three different minor allele frequency (MAF) thresholds in the Tunisian cattle population: (a) D’ , (b) r (as estimated by Haploview) and r2 corrected for sample size (r2_cor).

The effect of MAF on the extent of LD among syntenic SNPs was studied using three different MAF thresholds: 1, 5, and 10% (Figure 1 and Table 1). Overall, we analyzed 17,056,201 SNP pairs for MAF=0.01, 17,164,817 for MAF=0.05, and 16,813,297 SNP pairs for MAF=0.1. It seemed that the MAF had a strong effect on mean r2 over short physical distances (≤ 200 Kb), while its effect on D’ was more pronounced over long distances (Figure 1 and Table 1). For instance, within the range 0-19 Kb, mean r2 increased by 28% passing from MAF=0.01 to MAF=0.1, while the increase was only 8% within the 500-999 Kb interval. The range of D’ and r2 values for SNP pairs more than 500 Kb apart were similar to those observed for non-syntenic markers, for which around 79% of D’ values were lower than 0.4 and 94% of r2 values were lower than 0.05 (Figure S4). The distribution of r2 values between adjacent SNP pairs, combined over all autosomes, is shown in Figure 2. Most adjacent markers (~80%) had an average r2 value <0.2 while only 2% were in strong or complete LD (0.8 BTA 19, 23, 25, 26, 28, and 29 having the lowest LD values and BTA 1, 6, and 8 having the highest ones. Per-chromosome values ranged from 0.106 ± 0.17 for BTA28 to 0.158 ± 0.228 for BTA06 (r2) and from 0.494 ± 0.33 for BTA23 to 0.158 ± 0.22 for BTA06 (D’). Unsurprisingly, chromosomes with the lowest LD values tended to have the lowest proportion of marker pairs with strong and moderate LD.
Figure 2

Distribution of r2 values between adjacent SNP pairs pooled over all autosomes.

Table 2

Mean r and D’ between adjacent SNPs on each chromosome in Tunisian native cattle and percentage of marker pairs with r >0.2 or 0.8.

BTAIntermarker average distance (Kb)Average D’ ± sdAverage r2 ± sd% r2 >0.2% r2 >0.8
171.47 ± 64.510.583 ± 0.350.153 ± 0.2121.612.82
276.82 ± 78.740.573 ± 0.350.143 ± 0.2019.812.64
374.60 ± 71.880.574 ± 0.350.141 ± 0.2020.172.22
475.29 ± 64.980.572 ± 0.350.134 ± 0.2018.572.27
586.16 ± 86.860.553 ± 0.350.127 ± 0.1917.442.25
671.51 ± 79.120.583 ± 0.350.158 ± 0.2221.933.40
777.68 ± 97.710.564 ± 0.350.144 ± 0.2019.822.68
873.45 ± 63.080.579 ± 0.350.147 ± 0.2120.472.65
981.52 ± 79.840.57 ± 0.350.141 ± 0.2018.223.02
1074.76 ± 110.100.555 ± 0.350.140 ± 0.2019.742.89
1175.18 ± 72.590.552 ± 0.350.143 ± 0.2119.813.24
1283.46 ± 135.710.551 ± 0.350.127 ± 0.1918.271.74
1372.71 ± 65.710.563 ± 0.350.131 ± 0.1817.871.77
1471.59 ± 66.720.563 ± 0.350.142 ± 0.2019.502.44
1577.33 ± 70.820.558 ± 0.340.130 ± 0.1917.712.08
1674.79 ± 78.750.565 ± 0.350.139 ± 0.2020.402.05
1773.13 ± 77.880.547 ± 0.350.128 ± 0.1918.001.95
1878.10 ± 81.840.549 ± 0.350.136 ± 0.2019.662.26
1969.24 ± 63.140.529 ± 0.350.116 ± 0.1816.631.34
2070.29 ± 60.830.559 ± 0.350.134 ± 0.2018.192.29
2177.96 ± 86.180.554 ± 0.350.139 ± 0.1921.371.76
2273.13 ± 64.580.549 ± 0.350.126 ± 0.1918.061.76
2373.63 ± 75.660.494 ± 0.340.108 ± 0.1714.461.23
2474.13 ± 65.680.541 ± 0.350.129 ± 0.1916.702.22
2568.39 ± 56.420.506 ± 0.340.122 ± 0.1717.581.01
2672.89 ± 61.700.521 ± 0.350.109 ± 0.1713.471.38
2771.65 ± 71.040.529 ± 0.350.122 ± 0.1916.381.53
2872.90 ± 62.370.51 ± 0.340.106 ± 0.1713.721.37
2976.08 ± 78.600.526 ± 0.350.110 ± 0.1714.020.99
Mean74.82 ± 75.620.551 ± 0.350.132 ± 0.1918.262.11

Effective population size

Average effective population size over the past 64 generations was estimated from the mean r2 values for the 29 bovine autosomes using the three MAF thresholds (Table 3). Figure 3 illustrates an example for MAF 0.1. Similar Ne estimates were found between the MAF 0.1 and the MAF 0.05 datasets. However, between these two datasets and the MAF 0.01 dataset, we observed discrepancies that increased over generations. In general, we observed a continual decrease in Ne across generations, with the average value falling from 1410 to 51 after 64 generations. It appeared that Ne declined slowly between 64 and 32 generations ago, and then this decline underwent a sudden acceleration from eight generations ago to the present, when Ne dropped from an average of 232 to an average of 191 (Table 3).
Table 3

Average estimated effective population size truncated at 64 generations for three different MAF thresholds.

Generations agoNe_MAF0.1Ne_MAF0.05Ne_MAF0.01Mean Ne
150515251
275757876
3989810199
4124124127125
5158159163160
7189190195191
8229230237232
11284284292287
14352354366357
18447450463453
24579577597584
32775770799781
451016102310531031
641393139714401410
Figure 3

Average-estimated effective population size in the Tunisian cattle population, plotted against average generations in the past, truncated at 64 generations. Estimated effective population size was plotted on a log scale.

Discussion

In the present study, we first identified runs of homozygosity in a sample of 87 Tunisian cattle individuals. Characterization of ROHs, particularly their size and frequency, provides information about relatedness within a population. Our findings showed that more than one sixth of the individuals in our sample had a total sum of ROHs >250 Mb, with almost all individuals having at least one ROH >25 Mb. Because long ROH fragments are indicators of recent inbreeding, we can hypothesize that consanguineous mating is quite common in the local livestock farming system. This is particularly problematic as long autozygous ROHs are generally associated with an increased incidence of strongly deleterious mutations as well as mildly deleterious variants (Szpiech ). However, the mean number of ROHs per individual observed for native Tunisian cattle (38 ± 12.6) was low compared to that found in other European cattle breeds. For instance, Ferencakovic reported that Brown Swiss, Tyrol Grey, and Pinzgauer cattle had, on average, 94.76 ± 14.55, 70.86 ± 9.51, and 59.96 ± 9.91 ROHs per individual, respectively. This most likely reflects differences in the history of breed formation between European breeds and Tunisian cattle, with the former originating from a more recent bottleneck and subjected to a more pronounced selection pressure. Next, we aimed to assess the extent of genome-wide LD and to infer the effective population size in the native Tunisian cattle population using medium-density SNP chips and two of the most popular metrics for quantifying LD: D’ and r2. Our results revealed a gradual decline in these two parameters over increasing inter-marker distances, with a steeper rate of decay for r2 than for D’. Furthermore, we observed an inflation in estimates of D’ (which was more noticeable over long inter-marker distances) with a decrease in the MAF threshold used. This finding has previously been reported in other studies (e.g., Khatkar ; Bohmanova ; Espigolan ) and might be explained by the fact that, in the formula for D’, the denominator is set to the minimal product of SNP allele frequencies and will be lower when the difference between allele frequencies is higher, thus leading to over-estimation of D’. Khatkar reported that the observed inflation in D’ estimates with an increase in the proportion of rare alleles is mainly due to the fact that rare alleles are, in general, younger than common alleles, and hence may still be in LD. Instead, mean values of r2 dropped noticeably as the MAF threshold was lowered at all inter-marker distances. Correction for sample size resulted in only a small change in r2 estimates at short inter-marker distances, indicating that 87 individuals were sufficient to estimate r2 with acceptable reliability. However, this number may be insufficient for a correct estimate of D’, because small samples may fail to sample rare fourth gametes which, therefore, can inflate D’ (Khatkar ). Indeed, Bohmanova reported that a minimal sample size of 55 animals was required for accurate estimation of r2, while 444 animals were required for D’, and other studies have likewise reported that D’ is more sensitive to sample size variations than r (e.g., Ardlie ; Du ). We therefore considered our estimates of r2 to be more reliable than the D’ estimates. In order to further reduce the effect of sampling bias on r estimates in our study, it would be reasonable to consider only values obtained using a MAF threshold of 0.1. This is because LD around common alleles (those with a moderate-to-high frequency) can be measured with a modest sample size (80 ± 100 chromosomes) to a precision within 10 ± 20% of the asymptotic limit (Reich ). Around 74% of the SNPs in the MAF 0.1 dataset had a moderate-to-high MAF (>0.2) (Figure S1), making the r2 estimates that were based on this dataset robust over short inter-marker distances. Differences in LD were observed between chromosomes and might result from variations in chromosome length, with longer chromosomes having higher LD than shorter ones. Indeed, we found a high correlation (~78%) between r2 value and chromosome length. In agreement with our results, Stapley found that LD extended further on the macrochromosomes than on microchromosomes in the zebra finch genome. Another possible explanation for inter-chromosomal variations in LD might be the difference in total sum of ROHs observed between high- and low-LD chromosomes (Table S3). This hypothesis is corroborated by the high positive correlation (~82%) found between LD and total sum of ROHs per chromosome. Similarly, studies of LD in humans have reported that ROHs are more common in regions with high linkage disequilibrium and a low recombination rate (e.g., Gibson ). Next, we compared our results with those obtained from other taurine breeds using a similar SNP panel. We found that LD decay in the Tunisian population was faster than that reported by Biegelmeyer for a sample of 391 Braford and 2079 Hereford cattle, as well as that published by Edea for Ethiopian local cattle (Figure 4). Qanbari found that, in German Holstein cattle, mean r2 decreased from 0.30 between marker pairs separated by less than 25 Kb to 0.12 for those separated by 100-200 Kb; this decrease was slower than that observed in our sample (0.29 to 0.05 for Tunisian native cattle over the same change in physical distance). Instead, in reporting LD estimates for a sample of 887 North American Holstein bulls, Bohmanova found that mean r2 for markers 40-60 kb apart was 0.20, which is higher than that found in our study (0.14) for the same inter-marker distance. However, it is difficult to compare our results directly with these previous studies due to differences in the scale of sampling, given that the level of LD decreases with increasing sample size (Hill and Robertson, 1968). When we restricted our comparisons to studies with smaller sample sizes, we found that the mean r value for adjacent markers in the present study (pooled over all autosomes; mean = 0.132, range: 0.106 to 0.158) was lower than those reported by Flury for 128 indigenous Swiss cattle (mean r = 0.24, range: 0.19 to 0.26), by Mastrangelo for 76 Italian Modicana individuals (mean r = 0.200, range: 0.168 to 0.234), and by Beghain for 30 Blonde d’Aquitaine bulls (mean r = 0.205, range: 0.172 to 0.241). In summary, although our results are based on a smaller sample size, they paint a coherent picture: the lower level of genome-wide LD and its faster decay in Tunisian cattle compared to other breeds is consistent with the absence of strong selective pressure and with the exclusive use of natural reproductive methods within this population (unlike Holstein, Hereford, or Braford breeds). This hypothesis is further corroborated by the fact that mean r2 values between adjacent SNPs for the artificially unselected Italian breed Cinisara were similar to those reported in our study (mean r = 0.168, range: 0.138 to 0.184; Mastrangelo ).
Figure 4

Decay in linkage disequilibrium (r2) over increasing inter-marker distance in Tunisian, Hereford, Braford, and local Ethiopian cattle.

We used average r2 estimates (combined over all autosomes) at different inter-marker distances to infer past effective population sizes of the local Tunisian cattle population. Short inter-marker distances reflect the effective population size over historical periods of time while long inter-marker distances give information on population size in the immediate past (Hill, 1981). Using a MAF threshold of 0.01 tends to increasingly overestimate Ne across generations, while a MAF threshold of 0.05 is less subject to erroneous estimates of effective population size. Our study revealed that the Tunisian cattle population has undergone a sudden, substantial drop in Ne, starting from eight generations ago. If we assume a generation time for local cattle of 5 years, this period corresponds to the mid-1970s, and indeed, it was reported that the numbers of local and cross-bred cattle in Tunisia fell by 32% between 1975 and 1990 because of the massive introduction of European breeds (Kayouli, 2001). In addition, it is worth noting that, due to the small sample size in our study, it is likely that the values we found for recent Ne underestimate the true Ne in the whole native Tunisian cattle population; according to Corbin , when Ne is assumed to be variable, a severe underestimation occurs in recent Ne estimates. Therefore, the inferences for recent Ne in the present study should be considered cautiously due to the sampling bias. Taken together, the pattern of ROHs and the low levels of long-range LD found in the present study, combined with the relatively high levels of genetic diversity reported in a previous study (Ben Jemaa ) may indicate that native Tunisian cattle have not been subjected to strong genetic drift. We thus hypothesize that there has been a significant drop in the effective population size of Tunisian cattle over successive generations, but with higher real Ne values than those found in our study (especially regarding recent Ne estimates). The fact that the recent effective population size that was inferred here for local Tunisian cattle was small implies that this population is endangered. According to Franklin (1980), populations with Ne <50 are considered to be at immediate risk of extinction. Other authors have gone even further, suggesting that a threshold of Ne = 50 is too small to ensure long-term population survival. For instance, Meuwissen (2009) reported that a threshold of Ne = 100 would be necessary to ensure the long-term viability of an animal population. This is because such small populations are exposed to loss of genetic variation through high levels of both inbreeding and genetic drift, which can substantially increase the extinction probability of populations in changing environments. In conclusion, this work presents the first assessment of ROH patterns and LD distribution at the genome-wide level in the native Tunisian cattle population. Compared to European breeds, we found that Tunisian cattle had less LD, and that it decayed faster with physical distance between markers. Therefore, a higher SNP density might be necessary to carry out effective genome-wide association mapping within this population. We provide evidence of a rapid decline in the effective population size in Tunisian cattle, associated with high levels of recent inbreeding within a significant proportion of individuals. These two observations indicate an urgent need to establish a conservation plan that includes a well-designed genetic management program for this population.
  5 in total

1.  Population demographic history and population structure for Pakistani Nili-Ravi breeding bulls based on SNP genotyping to identify genomic regions associated with male effects for milk yield and body weight.

Authors:  Saher Islam; Umesh K Reddy; Purushothaman Natarajan; Venkata Lakshmi Abburi; Amna Arshad Bajwa; Muhammad Imran; Muhammad Yasir Zahoor; Muhammad Abdullah; Aamir Mehmood Bukhari; Sajid Iqbal; Kamran Ashraf; Asif Nadeem; Habibur Rehman; Imran Rashid; Wasim Shehzad
Journal:  PLoS One       Date:  2020-11-24       Impact factor: 3.240

2.  Runs of homozygosity analysis of South African sheep breeds from various production systems investigated using OvineSNP50k data.

Authors:  E F Dzomba; M Chimonyo; R Pierneef; F C Muchadeyi
Journal:  BMC Genomics       Date:  2021-01-06       Impact factor: 3.969

3.  Estimation of Linkage Disequilibrium and Effective Population Size in Three Italian Autochthonous Beef Breeds.

Authors:  Maria Chiara Fabbri; Christos Dadousis; Riccardo Bozzi
Journal:  Animals (Basel)       Date:  2020-06-14       Impact factor: 2.752

4.  Genome-wide scan for selection signatures reveals novel insights into the adaptive capacity in local North African cattle.

Authors:  Slim Ben-Jemaa; Salvatore Mastrangelo; Seung-Hwan Lee; Jun Heon Lee; Mekki Boussaha
Journal:  Sci Rep       Date:  2020-11-10       Impact factor: 4.379

5.  Linkage Disequilibrium and Effective Population Size of Buffalo Populations of Iran, Turkey, Pakistan, and Egypt Using a Medium Density SNP Array.

Authors:  Shirin Rahimmadar; Mokhtar Ghaffari; Mahdi Mokhber; John L Williams
Journal:  Front Genet       Date:  2021-12-07       Impact factor: 4.599

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.