Literature DB >> 32455573

Genome-Wide Patterns of Homozygosity and Relevant Characterizations on the Population Structure in Piétrain Pigs.

Huiwen Zhan1, Saixian Zhang1, Kaili Zhang1, Xia Peng1, Shengsong Xie1, Xinyun Li1, Shuhong Zhao1, Yunlong Ma1.   

Abstract

Investigating the patterns of homozygosity, linkage disequilibrium, effective population size and inbreeding coefficients in livestock contributes to our understanding of the genetic diversity and evolutionary history. Here we used Illumina PorcineSNP50 Bead Chip to identify the runs of homozygosity (ROH) and estimate the linkage disequilibrium (LD) across the whole genome, and then predict the effective population size. In addition, we calculated the inbreeding coefficients based on ROH in 305 Piétrain pigs and compared its effect with the other two types of inbreeding coefficients obtained by different calculation methods. A total of 23,434 ROHs were detected, and the average length of ROH per individual was about 507.27 Mb. There was no regularity on how those runs of homozygosity distributed in genome. The comparisons of different categories suggested that the formation of long ROH was probably related with recent inbreeding events. Although the density of genes located in ROH core regions is lower than that in the other genomic regions, most of them are related with Piétrain commercial traits like meat qualities. Overall, the results provide insight into the way in which ROH is produced and the identified ROH core regions can be used to map the genes associated with commercial traits in domestic animals.

Entities:  

Keywords:  Piétrain pigs; ROH islands; effective population size; inbreeding coefficients

Mesh:

Year:  2020        PMID: 32455573      PMCID: PMC7291003          DOI: 10.3390/genes11050577

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

The Piétrain pig originated in Belgium. It was founded by crossing local swine with Berkshire, English Large White and Bayeux pig and then was exported to other countries after 1950s. Piétrain pig has medium size, well-developed muscles and white skin with black spots. It is unavoidable to create close inbreeding during breed formation [1]. Whereas those strong inbreeding and selection shaped the Piétrain’s important characteristics like high lean percentages [2]. Currently, this breed is popular around the world, and it is commonly used as terminal sire in the commercial crossing system. Runs of homozygosity (ROH) are consecutive homozygous segments. The autozygous regions exist due to parents descending the same haplotypes to their progeny [3]. In the process of evolution, multiple factors including mutation, migration, random genetic drift and selection pressure make genotype frequency change, which consequently contributes to change the characterization of runs of homozygosity. In animal genetics, the occurrence of homozygous segments is influenced by the locations of recessive-disease, population history as well as consanguinity levels [4]. The size of effective population (Ne) can also reflect the patterns of runs of homozygosity indirectly. As an important parameter of population genetics, it provides an important reference for genome architecture research. At present, compared with other existing methods, the whole genome inbreeding coefficient based on ROH (F) is considered to be the most accurate calculation method for predicting the inbreeding level of each individual [5,6]. In addition, the inbreeding coefficient estimated from ROH can be divided into recent and ancestral inbreeding events [5]. Therefore, ROH is applied to predict inbreeding effects in animal genetics [7,8,9]. In theory, the variations productized by an individual would transmit to its offspring, and then they may share identical genomic segments [4]. Therefore, ROH can be used to optimize mating schemes for minimizing inbreeding in livestock breeding [10,11]. In addition, ROH is also used to detect associations between genotype and phenotype, and to identify regions associated with traits of economic interest in farm animals [11,12,13]. In general, ROH patterns in farm animals are aligned with their breeding history. There was a significant correlation between shared short ROH and genomic regions under selection in a population [11]. Accordingly, those ROH regions are probably the results of natural selection and long-term breeding programs. In theory, the genomic regions that were shaped by selection often show a significant reduction in nucleotide diversity and increase in homozygosity around the causal mutation [4]. Therefore, the genomic regions underlying artificial selection in the process of animal breeding are often connected with the high-frequency ROH regions [13]. It suggested that the position of runs of homozygosity may imply the genomic segments that have been subjected to selective pressure [14,15]. Previous ROH studies focused on human and discussed the association between homozygous and complex diseases like stomach cancer, heart diseases, schizophrenia and albinism [16,17,18,19]. The autozygosity of deleterious recessive allele can reflect the cause of recessive-disease effectively [4]. With the development of high-density Chip and sequencing technology, it becomes easier to do homozygous studies. In addition, it benefits from the development of ROH detection methods [20]. Accordingly, researches in the causes and patterns of homozygosity have received more attention in recent years. The present study is based on the detection of runs of homozygosity, investigating the genomic distribution and features of ROH in Piétrain pig using 50k single nucleotide polymorphism (SNP) GeneChip data. We further investigated the biologic significance of high frequency ROH islands. As expected, we find that ROH islands are associated with some important economic traits, such as meat quality. In addition, the whole genome inbreeding coefficient based on ROH is assessed for prediction ability through comparison with traditional pedigree inbreeding coefficient and other genomic inbreeding coefficients. Overall, the results from this study can provide insights into the pattern of runs of homozygosity and direct its application in animal breeding.

2. Materials and Methods

2.1. Data Preparation

376 Piétrain pigs that belonged to the same generation without generation overlapping were used. All research involving animals was conducted under protocols (No. 5 proclaim of the Standing Committee of Hubei People’s Congress) approved by the Standing Committee of Hubei People’s Congress and the ethics committee of Huazhong Agricultural University in China. Pigs were genotyped for 50,264 SNPs spread over all chromosomes using Illumina PorcineSNP50 Bead Chip. Only SNPs on autosomes were considered in the following ROH analyses. All analyses were based on Sus scrofa (Pig) 11.1 reference genome version. The next step was to quality-control using PLINK software and R programming. The filtering parameters were as follows: (ⅰ) SNPs with a call rate less than 95% were excluded, (ⅱ) individuals with a call rate less than 95% were discarded, (ⅲ) SNPs with minor allele frequency less than 5% were removed and (ⅳ) the deviated markers (p < 0.1 × 10−3) were excluded based on Hardy–Weinberg test. Finally, we performed a relationship test and removed high-related individuals.

2.2. Linkage Disequilibrium and Effective Population Size

The effective population size (Ne) was defined as the number of individuals that harbor the same distribution of allele frequencies under random genetic drift in an idealized population [21]. In theory, the prediction of Ne can be calculated through the linkage disequilibrium (LD) scores. With the availability of high-density chip in pigs, it was feasible to calculate the LD scores between pairwise markers. In this analysis, the LD scores (r) between pairwise SNPs was calculated using PLINK v1.07. The effective population sizes (Ne) were calculated according to the formula [22,23]: where c represents the linkage map distance in Morgans [24,25,26]. The physical distance to genetic distance conversion rates were calculated based on total physical chromosome length stated on Sscrofa11.1 (https://www.ncbi.nlm.nih.gov/assembly/GCF_000003025.6) and each chromosome’s genetic length from the National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/genome/gdv/?org=sus-scrofa). In addition, E(r was the expected value of r for the distance c. In addition, where it was the time corresponding to c, we recruited. In addition, Ne(t) reflects the effective population size at t generations ago. A series of distance bin between SNPs were given as follows: 0.1, 0.2, 0.5, 1, 2, 5 cM, allowing 5% floating range of SNPs distance, to compute Ne(t) before 500, 250, 100, 50, 25, 10 generations. The whole process was performed for SNPs on autosomes [27].

2.3. Identification of Runs of Homozygosity

Runs of homozygosity were identified for each available sample separately using PLINK v1.07 which provides a sliding window to detect autozygous fragments [28]. The minimum ROH length was set at 1 Mb to exclude quite common short fragments deriving from LD. To avoid the false discovery rate, the minimum number of SNPs that contained in each ROH was set to 10 which derived from the density of chip. Up to one missing genotype and one heterozygous locus were allowed in a sliding window. In addition, if two SNPs within a segment are too far apart from more than 1 Mb, that segment were split in two [16]. The detected runs of homozygosity were classified by physical length as follows: 1 to <5 MB, 5 to <10 MB and ≥10 MB [7,13]. For each category, the total number of ROH in each chromosome was count for all the available samples. To explore the distribution of ROH across the whole genome, overlapping regions were obtained from ROH pools with all sampled individuals using PLINK software. To a certain overlapping region, the percentage of individuals containing that region among the whole population was calculated for every overlapping region. The empirical P-values were generated by ranking of the percentage values from high to low. Next, we defined core regions as ROH islands in the top 1% of the subjects.

2.4. Inbreeding Coefficient

To assess the accuracy of prediction inbreeding effects using ROH, three types of inbreeding coefficients were calculated in this study, including Pedigree-based inbreeding coefficients (F), ROH-based inbreeding coefficients (F), and inbreeding coefficients based on the excess of homozygosity (F). Pedigree-based inbreeding coefficients (F) were calculated for Piétrain pigs, which were born between 2014 and 2017. The pedigree data were recorded by KFNetsKing software. The total record contains 3834 individuals from 126 litters, including 305 experimental samples. They were descended from 34 parent stock (PS) sires and 126 PS sows. Furthermore, there were a total of 29 grandparent (GP) sires, 93 GP sows, 22 great-grandparent (GGP) sires, 59 GGP sows, 10 great-great-grandparent (GGGP) sires and 23 GGGP sows in record. Among 126 litters records, 118 litters have half sibling litter records in this population (Table S1). In order to avoid the inaccuracy caused by the absence of pedigree information, the samples without pedigree records or only having one or two generation records were excluded from analysis sets. Finally, a total of 921 samples were left to calculate F with pedigree depth ranging from 3 to 4 generations. Pedigree-based inbreeding coefficients were estimated by the pedigreemm package in R (https://cran.r-project.org/web/packages/pedigreemm/index.html). Genomic inbreeding coefficients (F) were calculated for 305 Piétrain pigs with genotype information using the following formula: In which O is the number of observed autosomal homozygous genotypes for individual i and E is the number of expected autosomal homozygous genotypes for individual i, L is the total number of genotyped autosomal SNPs. Those parameters were calculated in PLINK v1.07 [28]. Genomic inbreeding coefficient based on ROHs was provided by McQuillan et al. (2008) [29], defined as the length of the autosome covered by ROHs divided by the overall length of the autosome. The formula is: where ∑L is the total length of ROHs, and L is the total length of the autosomal genome which is 2478,445 kb in Sus scrofa (Pig). Spearman’s correlation coefficients were calculated between all above inbreeding coefficients through pairwise comparison using SAS.

2.5. Functional Annotation for Core Regions of ROH

The core regions of ROH were annotated to identify the known genes in those regions using the genomic database in Ensembl (http://ensemble.org). The core regions of ROH were also matched with previously reported pig Quantitative Trait Locus (QTL) obtained from pig Quantitative Trait Locus Database (Pig QTLdb) (https://www.animalgenome.org/cgi-bin/QTLdb/SS/index). In addition, Gene Ontology (GO) terms and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways annotations were performed using the list of genes detected in core regions of ROH in The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.8 tool.

3. Results

3.1. Characteristics of Genotypic Data

To investigate the pattern of homozygosity, 376 Piétrain pigs were used in chip analysis. After quality control, 4 samples were removed for low genotyping, 1788 SNPs were filtered in missingness test, 2179 markers were excluded based on HWE test and 11,616 SNPs failed in the minor allele frequency test. In addition, we calculated relationship coefficient using identity-by-descent implemented in PLINK v1.07 and 71 samples were excluded (PI_HAT ≥ 0.5). Finally, a total of 305 individuals with 35,549 SNPs were available for further analysis.

3.2. The Patterns of Linkage Disequilibrium and Past Effective Population Size

In this analysis, the LD scores of all pairwise SNPs within a physical distance 10 Mb on the same chromosome were calculated. As shown in Figure 1, the autosomes have similar downward trend. In addition, it can be clearly observed that the mean r values decreased gradually with the increase of physical distance.
Figure 1

The tendencies of linkage disequilibrium decay across autosomes. The x-axis indicates the diverse classes of distance between SNPs pairs up to 1.0 Mb in each chromosome and the total autosomes, the y-axis indicates the proportion of varied r classes.

According to previous studies, the effective population size of commercial pigs has gradually decreased under intensive artificial selection [30,31]. Shin et al. found that the effective population size has reduced by 99.6% through 10,000 generations in Korean Yorkshire [27]. To investigate the tendency in effective population size of Piétrain pigs from 500 generations ago to present, we set a series of bins of interval distances of pairwise SNPs. Then, a vector of LD scores—mean r2 for each bin could be calculated. Based on the mean r values, we estimated effective population size before 500, 250, 100, 50, 25, 10 generations ago. Ne at 500 generations ago was about 309 individuals which corresponds to the interval distances of pairwise SNPs about 0.1 cM, and then undergoes a narrowing process. While up to 10 generations ago, Ne has reduced to 46 individuals—a reduction of 85.1% (Table 1).
Table 1

The effective population size over generations for Piétrain pigs estimated from linkage disequilibrium data.

Chr.cM/Mblinkage Disequilibrium (r2) of Different Interval Distance (Kb)Estimated Ne in n Generations Ago
100200500100020005000500250100502510
10.53650.44720.39560.31880.24130.16660.0932576.1356.0199.1146.5116.590.7
20.86910.44640.39760.32100.24740.17210.0977356.8218.0121.787.569.253.1
30.97230.44860.40180.32480.24930.17520.1000316.1191.4106.977.460.546.3
41.00000.44820.39800.32290.24880.17530.1004307.8189.1104.875.558.844.8
51.45240.45180.40190.32640.25170.17660.1014208.9128.171.051.240.130.5
60.96900.44770.39600.32040.24570.17180.0989318.3196.7109.479.262.247.0
71.28690.44950.40180.32530.24950.17490.1006238.0144.680.658.445.834.7
80.94030.44800.39990.32390.24750.17310.0991327.7199.5111.080.963.548.3
90.98930.44620.39890.32380.24760.17470.1006313.6190.4105.576.859.745.2
101.85510.45080.40120.32490.24900.17550.1010164.2100.656.040.631.724.0
111.07470.45040.40150.32480.24940.17460.0999283.9173.496.770.055.041.9
121.82420.44820.39940.32290.24860.17440.1006168.7103.157.541.432.424.5
130.60670.45050.40310.32550.24900.17420.0991502.6305.1170.8124.397.674.9
140.73310.45150.39990.32170.24650.17140.0982414.3255.9143.8104.282.462.6
150.90210.44700.40060.32420.24850.17470.1005342.9207.4115.583.865.549.6
161.16500.44980.40150.32470.25030.17580.1003262.5160.089.364.350.338.5
171.53970.44880.39970.32340.24870.17400.0998199.4121.967.949.038.529.3
181.18590.44760.39800.32220.24760.17330.1003260.1159.588.764.150.337.8
Total1.10570.44880.39980.32340.24810.17380.0995309.0188.9105.476.460.045.8

3.3. Identification and Classification of ROH

In this study, a total of 23,434 ROH regions were identified from 305 available individuals of Piétrain pigs. We found that the total physical length of ROH on 18 autosomes per individual was about 507.267 Mb on average. The density of three size categories ROH was presented in Figure 2. The average length of ROH was 8.195 Mb, and the size of ROH between 1 Mb and 5 Mb accounts for the majority of the total ROH (50.994%). The distribution of total length of ROH in each chromosome was presented in Figure 3A. The total ROH length in chromosome 1 was about 30,553.10 Mb, which was the longest one among all chromosomes. Similarly, the mean length of 13.325 Mb in chromosome 1 was also the largest among all chromosomes. ROHs were classified into three size classes according to the physical length: short (1 to <5 Mb), intermediate (5 to <10 Mb) and long (≥10 Mb). The length of short ROH was 122.216 Mb per individual, the intermediate ROH 144.041 Mb and the long ROH 363.227 Mb. Correspondingly, the proportion of each class to the total number of ROHs was measured: the short ROHs account for 11,950 (50.994%), the intermediate ROHs account for 6269 (26.752%) and the long ROHs account for 5215 (22.254%). The distribution of ROH regions on autosomes was shown in Figure 3B. The greatest number of ROHs across genome was located in chromosome 1 with 2293 regions.
Figure 2

Density estimates of runs of homozygosity (ROH) in three size categories. The x-axis indicates the size of ROH (Kb), the y-axis indicates the value of density.

Figure 3

Distribution of ROH in each chromosome. (A) Total length of ROH on 18 autosomes. (B) Counts of three classes of ROH on 18 autosomes.

In this analysis, 1574 overlapping genomic regions were obtained from ROH pools. Moreover, 16 ROH regions whose coverage rate are in the top 1% were selected as the core regions. Among them, there are 6 regions in chromosome 1, 5 regions in chromosome 8 and 3 regions in chromosome 6, while chromosome 15 and chromosome 18 harbored 1 region, respectively. To further investigate these core regions, we highlighted the distribution of ROHs frequencies, allele frequencies and the gene diversity in Figure 4. For each 100Kb window, we calculated the number of individuals that were homozygous in the window and plot the distribution of homozygous SNPs frequency in chromosomes 1, 6 and 8 by black line. The distribution of the core regions was consistent with the corresponding allele frequencies. Compared with other genomic regions, there were a smaller number of genes in these regions. However, we observed several genes which were related to Piétrain specific economic traits around ROH regions, such as MRAP2 which was related to energy homeostasis machinery, and CLGN which was involved in the control of spermatogenesis and infertility.
Figure 4

The distribution of ROH core regions in chromosome 1, 8 and 6, the allele frequency and gene diversity in each SNP among the corresponding regions. Blue horizontal lines represent distribution of ROH of per individual. Black line represents frequency of homozygous SNPs averaged in each step of 100Kb. Green line represents gene diversity per SNP.

3.4. Comparison of Different Inbreeding Coefficients

Pedigree-based inbreeding coefficients (F) were estimated using 921 individuals with pedigree depth ranging from 3 to 4 generations. In addition, the genomic inbreeding coefficients (F and F) were calculated using 305 individuals with genotype data. The mean value of F based on the total length of ROH was 0.205. More specifically, the mean of inbreeding coefficient based on short ROH (1 to <5 MB) was 0.049, the mean based on the intermediate ROH (5 to <10 MB) was 0.058 and long ROH (≥10 MB) 0.147. To further investigate the inbreeding coefficients obtained by different calculation methods, we performed pairwise comparisons between F, F and F in samples with both pedigree information and chip data. Among them, F and F had a high correlation of 0.949 (p-value = 1.908e-06) as shown in Figure 5A, whereas neither F nor F was highly correlated with F: r = 0.161 (p-value = 0.452) between F and F, r = 0.197 (p-value = 0.357) between F and F, respectively (Figure S1). These results suggested computational methods and principles applied in estimating pedigree-based inbreeding coefficients and genomic inbreeding coefficients were distinctive, which also affected the accuracy of calculation, especially in the absence or lack of pedigree information. More specifically, we expected the different categories of ROHs may reflect different population demography. According to previous studies, long ROH was related to more recent parental relatedness, which can reflect the degree of inbreeding effect [8,11]. To confirm this assumption, we have investigated the relationship between the inbreeding coefficients based on three categories of ROHs and F. The results presented in Figure 5B were consistent with previous assumption, the genomic inbreeding coefficients based on the long ROH was strongly correlated with F (r = 0.910, p-value = 2.614e-06). By contrast, neither short ROH nor intermediate ROH has high correlation with F (r = 0.038 for short ROH with p-value = 0.860 and r = 0.499 for intermediate ROH with p-value = 0.014, respectively), as shown in Figure 5C,D. The results suggested that long ROH may arise from recent inbreeding, whereas short and intermediate ROH have different mechanisms.
Figure 5

Relationship between ROH-based inbreeding coefficients and homozygosity-based inbreeding coefficients. (A) shows F and F (total) had a high correlation (r = 0.949, p-value = 1.908e-06). (B) shows F (long ROH) also was strongly correlated with F (r = 0.910, p-value = 2.614e-06). (C) shows F and F (short) had a low correlation (r = 0.038, p-value = 0.860). (D) shows F and F (intermediate) had a low correlation (r = 0.499, p-value = 0.014).

3.5. Functional Annotation for Core Regions of ROH

In this analysis, a total of 16 core ROH regions were investigated and sought for QTLs related to important production traits in pig QTLdb (Table 2). To further investigate the potential biologic significance of those core regions of ROH, the genes that located in the core regions and in the 500 Kb windows around the ROH were annotated. The window size was determined by the LD decay. There were 127 genes around core ROH regions (Table S2). Among those genes, we noticed some of them were related to biologic processes and some of them were even associated with economical traits, including SLC35F1, B3GAT2, DCBLD1, LMBRD1, OMA1, MRAP2 and CLGN (Table 3).
Table 2

A list of quantitative trait locus overlapping with the core runs of homozygosity (ROH) regions in Piétrain pig.

CategoryTraitQTL ID
Meat and Carcass AnatomyQTL:28022,QTL:8699,QTL:154422,QTL:29582,QTL:10245
Fat compositionQTL:31377,QTL:31364,QTL:31383,QTL:32083,QTL:18547,QTL:101788,QTL:101825,QTL:31356,QTL:31374,QTL:101552,QTL:18546,QTL:31372,QTL:31361,QTL:31368,QTL:18548,QTL:101865,QTL:31360,QTL:6435,QTL:31369,QTL:18549,QTL:101898,QTL:31370
fatnessQTL:22293,QTL:139226
Meat colorQTL:36140,QTL:36138
pHQTL:23051,QTL:18667,QTL:5871
textureQTL:23050,QTL:7819,QTL:7885,QTL:3864,QTL:23052
ChemistryQTL:9042
Health blood parametersQTL:29669,QTL:29665,QTL:29663,QTL:29668,QTL:29670,QTL:29666,QTL:6549,QTL:6318,QTL:29667
Immune capacityQTL:127811
Exterior QTL:124729,QTL:17665,QTL:32743,QTL:16383,QTL:125736,QTL:64738,QTL:64740,QTL:64686
Production feed conversionQTL:27977
feed intakeQTL:31324
GrowthQTL:22971
Reproduction Litter traitsQTL:31861,QTL:122507,QTL:64741,QTL:18327
Reproductive organsQTL:18371,QTL:126623,QTL:126620
Reproductive traitsQTL:7662,QTL:18326,QTL:18329,QTL:18328,QTL:18325
Table 3

A list of candidate genes overlapping with the core ROH regions in Piétrain pig.

Chr.Position (Kb)Length (Kb)ROH Mapping Frequency1GenesGene Function
1 53078–53113;35.052;76.38%; MRAP2 regulate the energy homeostasis machinery [32]
1 50245–50798;553.125;73.14%; B3GAT2 implicated in cellular migration and adhesion in the nervous system [33]
1 50245–50798;553.125;73.14%; LMBRD1 participated in compound integral component of membrane [34]
1 43257–450112;1754.52;72.17%; SLC35F1 play count role of control heat rate and cardiovascular disease [35]
1 43257–45012;1754.52;72.17%; DCBLD1 participated in compound integral component of membrane [36]
6 153543–153633;90.199;69.90%; OMA1 participated in compound integral component of membrane [37]
8 86943–87935;86188–86896;991.802;708.867;68.28%;66.67%; CLGN important modulator in spermatogenesis and infertility [38]

1 ROH mapping frequency represents the percentage of individuals containing that region among the whole population.

4. Discussion

Runs of homozygosity are contiguous homozygous genotype fragments in the genome that are present in an individual since the identical gametes are inherited from the parents. In previous studies, Bosse et al. studied genetic patterns of ROH from several breeds (n = 52) through 60K SNPchip and the smallest population (Hampshire) only contained 2 individuals [10]. Purfield et al. used multiple breeds of cattle to determine diversity of ROH, including Angus (n = 39), Belgian Blue (n = 38), Hereford (n = 40) and Simmental (n = 58) using 50K SNP [13]. It was reliable that using a larger sample size to investigate the genomic characterization in Piétrain pigs in this study. Overall, runs of homozygosity are widespread in Piétrain genome, but there was no predictable position distribution. All detected ROH were divided by physical length into three categories and the short ROH which accounted for the largest proportion (50.994%) was considered to be caused by LD [39]. We estimated r at various distances to represent LD degree. The mean value of r calculated for adjacent SNPs in this analysis was 0.557 which was similar to the result that Stratz et al. (2014) estimated in Piétrain pigs [40]. In addition, based on the value of r, the mean LD extent can be extended to a distance of 5 Mb, suggesting that LD can help form patterns of the short ROH. As for intermediate and long ROHs that exceed the range of LD extent, they should were broken in the process of recombination, but did not, suggesting that they are supposed to be shaped by other factors [4]. To some extent, the factors like founder effects, genetic drift further increase the difficulty of exploring the ROH formation mechanism. In population genetics, the decline rate of linkage disequilibrium and the linkage map distance can be used to infer population history and recent effective population size [41]. First, we calculate Ne under the assumption that 1 cM approximate to 1 Mb in physical distance following the previous study [31], and the result under this approximation was show in Table S3. In theory, this assumption may bias the value of Ne and affect the estimation accuracy, because different chromosomes have diverse linkage conditions due to their distinct physical and genetic distance. Corbin et al. proposed that the assumption that 1 cM approximate to 1 Mb in physical distance would result in lack of consistency for Ne estimated [41]. Therefore, the physical distance to genetic distance conversion rates were recalculated based on physical chromosome length and chromosome’s genetic length in this study. We found that each chromosome possesses specific linkage condition, while the decline tendency was identical. In addition, Corbin et al. presented that estimation of Ne would be very sensitive and the result would be unreliable when generations <7. Hence, our estimation of Ne process terminated at 10 generations. We observed that Ne decreased gradually over generations, implying the decline of genetic variations in the long-term evolutionary process. Combining with the prevalence of ROH and numerous short ROH in this analysis, it suggested that the decrease of effective population size, as a signal of decline in genetic diversity, can also reflect high levels of homozygosity indirectly. It is universally considered that inbreeding was an important factor affecting the pattern of homozygosity. In conventional breeding without available marker data, the inbreeding coefficient, F, was estimated through pedigree record, following a principle that an individual transmits an allele to an offspring with 50% probability all the time. However, in the transport process, meiosis was full of randomness with a large stochastic variance. Furthermore, in realistic breeding, it’s common that pedigree record was incomplete or missing. Therefore, it was difficult to trace back the relationship between ancestors and distant parents, which has also become a vital factor affecting the accuracy of inbreeding estimate. This inaccuracy led to the weak correlation between pedigree-based inbreeding coefficients and genomic inbreeding coefficients as we observed. In recent years, the inbreeding coefficient based on ROH has received more attention, and it was considered to be an accurate estimator to control the probability of inbreeding [5,42]. Compared with F, another genomic inbreeding estimator F takes all of the homozygote into account, including separate single locus and homozygous regions. In that case, it will confound all of the history events in experimental population. At the same time, it will increase errors and biases caused by homozygotes generated by random recombination. On the contrary, F could exclude accidental errors though removing single homozygous and quite short segments. In addition, it has been used in animal genetic analysis [7]. For example, Zhang et al. employed ROH to calculated inbreeding coefficients in cattle population and investigated the distribution of functional variants [11]. Bosse et al. remarked on the impact of demography and recombination on the formation of ROH to master genome inbreeding [10]. Mastrangelo et al. utilized F to improve mating systems in Italian sheep breeds [6]. Furthermore, based on several recent researches, the short ROH and intermediate ROH were more potentially result from LD and ancient inbreeding, whereas long ROH could represent the level of recent inbreeding more appropriately by excluding obstacle of LD and random effects [8,43,44]. However, the size of boundary between classes was specific to different populations and breeds. Thus, it is more feasible to use total ROH instead of long ROH to estimate level of inbreeding. In this study, we identified 16 ROH core regions where homozygous rate are pretty high for a specific population, many previous studies revealed that those homozygous segments had a major impact on gene mapping and disease [45,46,47]. Through sequence alignment in Pig QTLdb and gene annotation, we noticed that most of the segments harbored genes which could regulate important biologic functions of Piétrain and most of them may have an impact in economic traits. In this analysis, those regions were mostly linked with meat and carcass traits in Pig QTLdb, for example, QTL:22,293 harbored in one core region of ROH was related with average backfat thickness. In addition, a great part of them was relevant to production traits, like QTL:27977, a location associated with feed conversion ratio. In addition, there were several QTLs associated with reproduction traits, such as QTL:18325, a location expression in sperm motility. In addition, some of the core ROH regions also harbored QTLs related to exterior, like QTL:32743, associated with breed’s coat color. Among them, we noticed several functional related genes in basic biologic processes, such as SLC35F1, a gene played a key role in controlling heat rate; B3GAT2, was involved in cellular migration and adhesion in the nervous system; DCBLD1, LMBRD1 and OMA1, all participated in compound integral component of membrane. In addition, we also noticed some genes which are related with Piétrain economical traits: MRAP2 can regulate the energy homeostasis machinery; CLGN, an important modulator in spermatogenesis and infertility. As terminal sires in pig industry, Piétrain pig was subjected to high-intensity artificial selection for a long time. Moreover, these observations are consistent with the expectation that homozygosity is associated with some biologic function and important economic traits affected by positive selection. After long-term positive selection, especially more recent artificial selection, regulatory regions that control the important economic traits would be highly homogenous because of directional selection. In other words, those specific regions are easier to be exposed in ROH core regions. Those observed phenomena indicate that positive selection is a significant force shaping patterns of homozygosity in the process of evolution, and the high-frequency ROH can contribute to exploit potential target genes of human-imposed selection.

5. Conclusions

In summary, this study investigated the patterns of runs of homozygosity, LD, effective population size and inbreeding coefficients, which provided a clear understanding of how diversity evolved in Piétrain pigs. The decrease of effective population size can mirror artificial selection in breeding process and the recession of genetic diversity. This phenomenon also proves the formation of ROH. Moreover, several genes associated with production traits were found overlapping with ROH regions. The high relationship between inbreeding coefficients (F) based on homozygous genotypes and the inbreeding coefficients (F) based on ROH exposes that the proportion of ROH has an instruction sense in assessing the population’s level of inbreeding. Thus, using genomic information to detect ROH is a powerful method for estimating population inbreeding, especially in assessing the level of inbreeding where pedigrees records were incomplete or missing. Furthermore, the categories of ROHs may be related with the generations on the time frame, short ROH more reflects ancient inbreeding, while long ROH involves in recent inbreeding. Overall, the investigation of ROH patterns provides a helpful reference for selection and assortative mating in modern inbreeding and points an original direction for research of population genetic structure.
  42 in total

1.  Cancer: new-age tumour suppressors.

Authors:  Allan Balmain
Journal:  Nature       Date:  2002-05-16       Impact factor: 49.962

2.  Extent of linkage disequilibrium and effective population size in Finnish Landrace and Finnish Yorkshire pig breeds.

Authors:  P Uimari; M Tapio
Journal:  J Anim Sci       Date:  2010-10-29       Impact factor: 3.159

3.  Investigations on the pattern of linkage disequilibrium and selection signatures in the genomes of German Piétrain pigs.

Authors:  P Stratz; K Wimmers; T H E Meuwissen; J Bennewitz
Journal:  J Anim Breed Genet       Date:  2014-07-21       Impact factor: 2.380

4.  Genomic patterns of homozygosity in worldwide human populations.

Authors:  Trevor J Pemberton; Devin Absher; Marcus W Feldman; Richard M Myers; Noah A Rosenberg; Jun Z Li
Journal:  Am J Hum Genet       Date:  2012-08-10       Impact factor: 11.025

5.  Identification of a putative lysosomal cobalamin exporter altered in the cblF defect of vitamin B12 metabolism.

Authors:  Frank Rutsch; Susann Gailus; Isabelle R Miousse; Terttu Suormala; Corinne Sagné; Mohammad Reza Toliat; Gudrun Nürnberg; Tanja Wittkampf; Insa Buers; Azita Sharifi; Martin Stucki; Christian Becker; Matthias Baumgartner; Horst Robenek; Thorsten Marquardt; Wolfgang Höhne; Bruno Gasnier; David S Rosenblatt; Brian Fowler; Peter Nürnberg
Journal:  Nat Genet       Date:  2009-01-11       Impact factor: 38.330

6.  A second generation human haplotype map of over 3.1 million SNPs.

Authors:  Kelly A Frazer; Dennis G Ballinger; David R Cox; David A Hinds; Laura L Stuve; Richard A Gibbs; John W Belmont; Andrew Boudreau; Paul Hardenbol; Suzanne M Leal; Shiran Pasternak; David A Wheeler; Thomas D Willis; Fuli Yu; Huanming Yang; Changqing Zeng; Yang Gao; Haoran Hu; Weitao Hu; Chaohua Li; Wei Lin; Siqi Liu; Hao Pan; Xiaoli Tang; Jian Wang; Wei Wang; Jun Yu; Bo Zhang; Qingrun Zhang; Hongbin Zhao; Hui Zhao; Jun Zhou; Stacey B Gabriel; Rachel Barry; Brendan Blumenstiel; Amy Camargo; Matthew Defelice; Maura Faggart; Mary Goyette; Supriya Gupta; Jamie Moore; Huy Nguyen; Robert C Onofrio; Melissa Parkin; Jessica Roy; Erich Stahl; Ellen Winchester; Liuda Ziaugra; David Altshuler; Yan Shen; Zhijian Yao; Wei Huang; Xun Chu; Yungang He; Li Jin; Yangfan Liu; Yayun Shen; Weiwei Sun; Haifeng Wang; Yi Wang; Ying Wang; Xiaoyan Xiong; Liang Xu; Mary M Y Waye; Stephen K W Tsui; Hong Xue; J Tze-Fei Wong; Luana M Galver; Jian-Bing Fan; Kevin Gunderson; Sarah S Murray; Arnold R Oliphant; Mark S Chee; Alexandre Montpetit; Fanny Chagnon; Vincent Ferretti; Martin Leboeuf; Jean-François Olivier; Michael S Phillips; Stéphanie Roumy; Clémentine Sallée; Andrei Verner; Thomas J Hudson; Pui-Yan Kwok; Dongmei Cai; Daniel C Koboldt; Raymond D Miller; Ludmila Pawlikowska; Patricia Taillon-Miller; Ming Xiao; Lap-Chee Tsui; William Mak; You Qiang Song; Paul K H Tam; Yusuke Nakamura; Takahisa Kawaguchi; Takuya Kitamoto; Takashi Morizono; Atsushi Nagashima; Yozo Ohnishi; Akihiro Sekine; Toshihiro Tanaka; Tatsuhiko Tsunoda; Panos Deloukas; Christine P Bird; Marcos Delgado; Emmanouil T Dermitzakis; Rhian Gwilliam; Sarah Hunt; Jonathan Morrison; Don Powell; Barbara E Stranger; Pamela Whittaker; David R Bentley; Mark J Daly; Paul I W de Bakker; Jeff Barrett; Yves R Chretien; Julian Maller; Steve McCarroll; Nick Patterson; Itsik Pe'er; Alkes Price; Shaun Purcell; Daniel J Richter; Pardis Sabeti; Richa Saxena; Stephen F Schaffner; Pak C Sham; Patrick Varilly; David Altshuler; Lincoln D Stein; Lalitha Krishnan; Albert Vernon Smith; Marcela K Tello-Ruiz; Gudmundur A Thorisson; Aravinda Chakravarti; Peter E Chen; David J Cutler; Carl S Kashuk; Shin Lin; Gonçalo R Abecasis; Weihua Guan; Yun Li; Heather M Munro; Zhaohui Steve Qin; Daryl J Thomas; Gilean McVean; Adam Auton; Leonardo Bottolo; Niall Cardin; Susana Eyheramendy; Colin Freeman; Jonathan Marchini; Simon Myers; Chris Spencer; Matthew Stephens; Peter Donnelly; Lon R Cardon; Geraldine Clarke; David M Evans; Andrew P Morris; Bruce S Weir; Tatsuhiko Tsunoda; James C Mullikin; Stephen T Sherry; Michael Feolo; Andrew Skol; Houcan Zhang; Changqing Zeng; Hui Zhao; Ichiro Matsuda; Yoshimitsu Fukushima; Darryl R Macer; Eiko Suda; Charles N Rotimi; Clement A Adebamowo; Ike Ajayi; Toyin Aniagwu; Patricia A Marshall; Chibuzor Nkwodimmah; Charmaine D M Royal; Mark F Leppert; Missy Dixon; Andy Peiffer; Renzong Qiu; Alastair Kent; Kazuto Kato; Norio Niikawa; Isaac F Adewole; Bartha M Knoppers; Morris W Foster; Ellen Wright Clayton; Jessica Watkin; Richard A Gibbs; John W Belmont; Donna Muzny; Lynne Nazareth; Erica Sodergren; George M Weinstock; David A Wheeler; Imtaz Yakub; Stacey B Gabriel; Robert C Onofrio; Daniel J Richter; Liuda Ziaugra; Bruce W Birren; Mark J Daly; David Altshuler; Richard K Wilson; Lucinda L Fulton; Jane Rogers; John Burton; Nigel P Carter; Christopher M Clee; Mark Griffiths; Matthew C Jones; Kirsten McLay; Robert W Plumb; Mark T Ross; Sarah K Sims; David L Willey; Zhu Chen; Hua Han; Le Kang; Martin Godbout; John C Wallenburg; Paul L'Archevêque; Guy Bellemare; Koji Saeki; Hongguang Wang; Daochang An; Hongbo Fu; Qing Li; Zhen Wang; Renwu Wang; Arthur L Holden; Lisa D Brooks; Jean E McEwen; Mark S Guyer; Vivian Ota Wang; Jane L Peterson; Michael Shi; Jack Spiegel; Lawrence M Sung; Lynn F Zacharia; Francis S Collins; Karen Kennedy; Ruth Jamieson; John Stewart
Journal:  Nature       Date:  2007-10-18       Impact factor: 49.962

7.  Risk factors for non-fatal myocardial infarction in young South Asian adults.

Authors:  J Ismail; T H Jafar; F H Jafary; F White; A M Faruqui; N Chaturvedi
Journal:  Heart       Date:  2004-03       Impact factor: 5.994

8.  Assessment of runs of homozygosity islands and estimates of genomic inbreeding in Gyr (Bos indicus) dairy cattle.

Authors:  Elisa Peripolli; Nedenia Bonvino Stafuzza; Danísio Prado Munari; André Luís Ferreira Lima; Renato Irgang; Marco Antonio Machado; João Cláudio do Carmo Panetto; Ricardo Vieira Ventura; Fernando Baldi; Marcos Vinícius Gualberto Barbosa da Silva
Journal:  BMC Genomics       Date:  2018-01-09       Impact factor: 3.969

9.  The distribution of runs of homozygosity and selection signatures in six commercial meat sheep breeds.

Authors:  Deirdre C Purfield; Sinead McParland; Eamon Wall; Donagh P Berry
Journal:  PLoS One       Date:  2017-05-02       Impact factor: 3.240

10.  SNP genotypes reveal breed substructure, selection signatures and highly inbred regions in Piétrain pigs.

Authors:  W Gorssen; R Meyermans; N Buys; S Janssens
Journal:  Anim Genet       Date:  2019-12-06       Impact factor: 3.169

View more
  3 in total

1.  Genome-wide detection of genetic structure and runs of homozygosity analysis in Anhui indigenous and Western commercial pig breeds using PorcineSNP80k data.

Authors:  Yao Jiang; Xiaojin Li; Jiali Liu; Wei Zhang; Mei Zhou; Jieru Wang; Linqing Liu; Shiguang Su; Fuping Zhao; Hongquan Chen; Chonglong Wang
Journal:  BMC Genomics       Date:  2022-05-17       Impact factor: 4.547

2.  Investigation of the Genetic Architecture of Pigs Subjected to Breeding Intensification.

Authors:  Anatoly Kolosov; Lyubov Getmantseva; Maria Kolosova; Timofey Romanets; Nekruz Bakoev; Elena Romanets; Ilona Bakoeva; Olga Kostyunina; Yuri Prytkov; Olga Tretiakova; Siroj Bakoev
Journal:  Genes (Basel)       Date:  2022-01-22       Impact factor: 4.096

3.  High-density single nucleotide polymorphism chip-based conservation genetic analysis of indigenous pig breeds from Shandong Province, China.

Authors:  Yanping Wang; Xueyan Zhao; Cheng Wang; Wenwen Wang; Qin Zhang; Ying Wu; Jiying Wang
Journal:  Anim Biosci       Date:  2020-10-19
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.