Literature DB >> 33995477

Genome-Wide Detection of Runs of Homozygosity in Laiwu Pigs Revealed by Sequencing Data.

Yifei Fang1, Xinyu Hao1, Zhong Xu1, Hao Sun1, Qingbo Zhao1, Rui Cao1, Zhe Zhang2, Peipei Ma1, Yanxiao Sun3, Zengmin Qi3, Qingkui Wei3, Qishan Wang2, Yuchun Pan2.   

Abstract

Laiwu pigs, distinguished by their high intramuscular fat of 7-9%, is an indigenous pig breed of China, and recent studies also found that Laiwu pigs showed high resistance to Porcine circovirus type 2. However, with the introduction of commercial varieties, the population of Laiwu pigs has declined, and some lineages have disappeared, which could result in inbreeding. Runs of homozygosity (ROH) can be used as a good measure of individual inbreeding status and is also normally used to detect selection signatures so as to map the candidate genes associated with economically important traits. In this study, we used data from Genotyping by Genome Reducing and Sequencing to investigate the number, length, coverage, and distribution patterns of ROH in 93 Chinese Laiwu pigs and identified genomic regions with a high ROH frequency. The average inbreeding coefficient calculated by pedigree was 0.021, whereas that estimated by all detected ROH segments was 0.133. Covering 13.4% of the whole genome, a total of 7,508 ROH segments longer than 1 Mb were detected, whose average length was 3.76 Mb, and short segments (1-5 Mb) dominated. For individuals, the coverage was in the range between 0.56 and 36.86%. For chromosomes, SSC6 had the largest number (n = 688), and the number of ROH in SSC12 was the lowest (n = 215). Thirteen ROH islands were detected in our study, and 86 genes were found within those regions. Some of these genes were correlated with economically important traits, such as meat quality (ECI1, LRP12, NDUFA4L2, GIL1, and LYZ), immunity capacity (IL23A, STAT2, STAT6, TBK1, IFNG, and ITH2), production (DCSTAMP, RDH16, and GDF11), and reproduction (ODF1 and CDK2). A total of six significant Gene Ontology terms and nine significant Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways were identified, most of which were correlated with disease resistance and biosynthesis processes, and one KEGG pathway was related to lipid metabolism. In addition, we aligned all of the ROH islands to the pig quantitative trait loci (QTL) database and finally found eight QTL related to the intramuscular fat trait. These results may help us understand the characteristics of Laiwu pigs and provide insight for future breeding strategies.
Copyright © 2021 Fang, Hao, Xu, Sun, Zhao, Cao, Zhang, Ma, Sun, Qi, Wei, Wang and Pan.

Entities:  

Keywords:  Laiwu pig; candidate genes; inbreeding coefficient; runs of homozygosity; selection

Year:  2021        PMID: 33995477      PMCID: PMC8116706          DOI: 10.3389/fgene.2021.629966

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Laiwu pigs, a precious Chinese indigenous pig breed, are mainly distributed in Shandong Province, China. They are distinguished by a good meat quality, especially a high intramuscular fat (IMF) content of 9–12%, compared with the major commercial lean pig breeds (Yang et al., 2015). Recent studies have also found that Laiwu pigs showed high resistance to certain infectious diseases, such as Porcine circovirus type 2 (PCV2) (Li et al., 2016). These breed-specific features of Laiwu pigs are the consequences of natural and human-mediated selection. Therefore, detection of selection signatures across Laiwu pigs’ genome may help us to reveal the genetic mechanism underlying the process of IMF deposition, as well as the genes involved in resistance to PCV2. In addition, during the last 40 years, with the introduction of western pig breeds such as Duroc and Large White pigs, the Laiwu pig population has shrunk, and some lineages have disappeared. This caused a decrease in both the nucleotide diversity and additive genetic variance in the Laiwu pig populations and led to inbreeding, which can undermine fitness traits due to the expression of highly homozygous detrimental recessive alleles (Howard et al., 2017). Thus, monitoring the genetic inbreeding levels within populations is of great importance in the genetic protection and improvement of Laiwu pigs. The classical method for estimating the inbreeding level of a population is by calculating the inbreeding coefficient through pedigree-based information (Wright, 1922), but pedigree errors are not rare in livestock populations (Ron et al., 1996). Therefore, several more efficient alternative methods based on genome-wide sequencing data have been applied to calculate inbreeding coefficients, including ROH-based inbreeding coefficients (FROH) (McQuillan et al., 2008), which has recently become a popular method and has been successfully used in studying genomic inbreeding for various kinds of livestock populations (Ferencakovic et al., 2011; Szmatoła et al., 2019; Zhan et al., 2020). Runs of homozygosity (ROH) are defined as continuous homozygous DNA segments in diploid genomes, which are a result of the inheritance of identical haplotypes under the condition that both parents descended from a common ancestor (Gibson et al., 2006). Inbreeding is one of the main reasons for the formation of ROH (Curik et al., 2014), indicating that ROH can be used as a good measure of individual inbreeding status. Meanwhile, it has been found that both ROH number and length can provide insights into genome-wide inbreeding levels (Purfield et al., 2012), and the length of the identified ROH segments could also reflect the number of generations since inbreeding occurred (McQuillan et al., 2008). In addition to inbreeding, other population events can also result in the formation of ROH, including genetic drift, population bottleneck, and selection (Frankham, 1996). Thus, the population history, structure, and germplasm features could be revealed by the identification and characterization of ROH (MacLeod et al., 2013; Peripolli et al., 2017). It has been found that the distribution pattern of ROH is not random across a genome (Bosse et al., 2012), and genomic regions with the highest frequency of ROH are named “ROH islands” or “ROH hotspots” (Nothnagel et al., 2010; Pemberton et al., 2012a). ROH islands are observed to be shared among individuals, which is probably a ramification of selection, as genomic regions harboring selection signatures usually overlap with ROH islands (Zhang et al., 2015). As positive directional selection can further reduce genetic diversity and increase homozygosity, thus contributing to the formation of ROH islands, genomic regions under selection are more likely to overlap with ROH islands than the rest of the genome (Pemberton et al., 2012a). Therefore, the analysis of ROH islands is commonly used to reveal putative genomic regions under selection and to disclose the genetic background of economically important traits among livestock populations (Zhang et al., 2015; Xiao et al., 2017; Onzima et al., 2018). In recent years, ROH detection has been used commonly in pigs, but few studies on ROH analysis of Laiwu pigs can be found. The purposes of this study were to detect and characterize the ROH patterns in Laiwu pig populations using the data obtained by the Genotyping by Genome Reducing and Sequencing (GGRS) (Chen et al., 2013) method, as well as to investigate ROH islands that could contain candidate genes related to the characterized traits. Furthermore, we aimed to compare inbreeding coefficients estimated using ROH, pedigree, and single-nucleotide polymorphism (SNP) information.

Materials and Methods

DNA Sampling and Sequencing

According to the genealogical records, 93 unrelated or distantly related Laiwu pigs (including 17 male pigs and 76 female pigs) were selected from the National Conservation Farm for Laiwu Pigs in Shandong Province, China. DNA samples were extracted from their ear tissues with commercial kits (Lifefeng Biotech, Co., Ltd., Shanghai, China) and then genotyped using the GGRS method (Chen et al., 2013). The GGRS method includes the following steps: First, the DNA sequencing libraries were prepared. Next, they were sequenced using paired-end (2 × 150 bp) sequencing methodology with an Illumina HiSeq2000 sequencer. Quality control for this procedure was conducted by filtering raw reads, as reads with base average quality scores less than 30 (an error rate of base calling of 0.1%) were excluded using NGS QC Toolkit v2.3 (Patel and Jain, 2012). The other remaining reads were then aligned to the pig genome reference (Sscrofa11.1) by BWA software (Li and Durbin, 2010). Aligned reads were used for further SNP identification and genotyping, SAMTOOLS v1.4 was used with default settings in this procedure, and missing SNP genotypes were imputed with BEAGLE v4.1 (Browning and Browning, 2016). After imputation, quality control was adopted again by applying the following criteria: all SNPs were detected in more than 30% of the samples and had no less than six-fold sequencing depth on average, as well as a minor allele frequency (MAF) ≥5%. In addition, reads mapped to the mitochondrial and sex chromosomes were excluded due to the high rates of discordance during the process of inheritance.

Estimation of Genetic Diversity

We calculated the observed heterozygosity (Ho) and expected heterozygosity (He) by PLINK v1.9 (Purcell et al., 2007). According to the classical theory of quantitative genetics proposed by Falconer D.S., if a population always keeps the same amounts of offspring for each family during the process of breed conservation, the Ne can be estimated using the following formula (Falconer, 1962): in which Ne is the effective population size, and Nm and Nf are the number of male and female animals in this population, respectively.

Runs of Homozygosity Detection

ROH for each animal were estimated using “- homozyg” of PLINK v1.9 (Purcell et al., 2007) with the default parameter: –homozyg-snp 100, –homozyg-window-snp 50, –homozyg-window-missing 5, and so on. We used a sliding window consisting of an appointed number of SNPs to scan through each sample’s whole genome to locate the homozygous regions. One Mb was set as an ROH length threshold to exclude short homozygous segments induced by LD effects. Other criteria applied for the identification of ROH were as follows (Peripolli et al., 2018): (1) 50 SNPs were contained in each sliding window; (2) an ROH consisted of no less than 100 consecutive SNPs; (3) the density should be higher than one SNP per 50 kb; (4) SNPs with missing genotypes less than five and with a heterozygous genotype less than one were allowed in an ROH due to genotyping error. All homozygous segments filtered were classified into three length classes: 1–5, 5–10, and >10 Mb, identified as ROH1–5 Mb, ROH5–10 Mb, and ROH>10 Mb, respectively.

Inbreeding Coefficients

We used several different methods to calculate inbreeding coefficients according to the pedigree and genomic information, and these inbreeding coefficients were further compared by Pearson’s correlation analysis. The R package “pedigree” was used to estimate the inbreeding coefficients based on pedigree (FPED) for all individuals (Coster and Coster, 2010). Inbreeding coefficients were calculated according to the genomic information, which includes those estimated from ROH (FROH) (McQuillan et al., 2008) and from SNP information (FSNP). FROH was calculated according to the following formula: where LROH is defined as the total length of the genome covered by all ROH segments for each individual, and Lauto is the length of the sequenced genome, which equaled 2.26 Gb in this research. Following the study of Xu et al. (2019), we divided FROH into four types of estimates based on the length of the observed ROH: FROH_all (>1 Mb), FROH1–5 Mb (1–5 Mb), FROH5–10 Mb (5–10 Mb), and FROH>10 Mb (>10 Mb). It has been reported that a lower limit of ROH length of 1 Mb has a genetic distance of 1 cM approximately (Zanella et al., 2016), which allows us to use the FROH _all (>1 Mb) to trace the ancestral inbreeding events that occurred 50 generations ago. Similarly, the corresponding spans for the last three FROH estimates are 10 to 50 generations, 5 to 10 generations, and 5 generations. GCTA software was used to obtain the three estimates of inbreeding coefficients based on SNP (FSNP1, FSNP2, and FSNP3) with the option “-ibc” (Yang et al., 2011): they were derived based on the variance of the additive genotypes (VanRaden, 2008), the degree of homozygous excess (Wright, 1948), and the correlation between the uniting gametes (Wright, 1922). Discussed hereafter are the formulae used: In which Yi is the amount of the reference allele copies at the i-th SNP locus, pi refers to the frequency of the allele at the i-th SNP locus for this individual, hi is calculated as: hi = 2pi(1−pi) and n is the amount of SNPs.

Detection of Runs of Homozygosity Islands and Candidate Genes

To detect the ROH islands, we calculated the frequency of occurrences within the ROH regions of each SNP across the individuals and made a Manhattan figure by plotting these values in conformity with the position of each SNP on chromosomes. The SNPs in the top 1% of the frequency of occurrence were selected as a hint of a potential ROH island (Pemberton et al., 2012b). A chain of adjoining selected SNPs can merge to integrate an ROH island. Genes that were fully or partially contained by these ROH islands were identified and annotated using the University of California, Santa Cruz Genome Browser[1]. Further analysis of gene functions was performed using the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway and Gene Ontology (GO) enrichment analyses by DAVID 6.8[2], and terms with a p-value greater than 0.05 were filtered.

Results

DNA Sequencing and Genetic Diversity

A total of 369 million clean reads were obtained, and each individual pig had approximately 4.75 million clean reads. The average sequencing coverage of the entire genome was 2.5%, and the average sequencing depth was 8×. With the invalid data filtered out, we finally obtained 182,021 informative SNPs. The sequenced SNPs were distributed evenly along the genome (Figure 1), and the average density was approximately 1 SNP per 10.3 Kb. We mapped these SNPs to the Ensemble pig gene annotation set (Ensemble release 92) and found that 91,064 SNPs were within gene regions, including 3,218 SNPs within exons and 9,817 within untranslated regions. The result shows that approximately 50% SNPs were within gene regions, but only 1.7% of SNPs were within exons. This result is unexpected. GGRS libraries were not designed to target polymorphisms within genes, so this proportion seems relatively high. Similar results have been reported in Jinhua pigs (Xu et al., 2019). The average Ho and He of this Laiwu pig population were 0.27 and 0.38, respectively. The value of Ho is lower than the result reported before (0.284), whereas He is higher (0.281) (Wang et al., 2021). In addition, the Ne of this population was 52.6, which suggests it is quite likely that inbreeding has happened.
FIGURE 1

Distribution of the sequenced SNPs on all chromosomes. The y-axis represents chromosomes, and the x-axis represents the corresponding chromosomal position (Mb). Different colors of each 200-kb genome block denote the number of SNPs.

Distribution of the sequenced SNPs on all chromosomes. The y-axis represents chromosomes, and the x-axis represents the corresponding chromosomal position (Mb). Different colors of each 200-kb genome block denote the number of SNPs.

Inbreeding Coefficients Estimate

The inbreeding coefficient of each individual estimation based on pedigree, ROH of different lengths, and the SNPs are shown in Table 1. The average inbreeding coefficient estimates obtained from different lengths of ROH segments follow this order, FROH>10 Mb < FROH5–10 Mb < FROH1–5 Mb. Compared with them, FPED, the average inbreeding coefficient calculated from the pedigree, was the lowest (0.021).
TABLE 1

Descriptive statistics for the eight types of inbreeding coefficient estimates calculated from the pedigree (FPED), the identified ROH (FROH_ all, FROH1–5 Mb, FROH5–10 Mb, and FROH > 10 Mb), and the SNP information (FSNP1, FSNP2, and FSNP3).

Inbreeding coefficientMeanMinMaxSD
FPED0.0210.0000.2500.040
FROH1–5 Mb0.0650.0030.1230.027
FROH5–10 Mb0.0390.0020.1090.023
FROH>10 Mb0.0380.0050.1570.028
FROH–all0.1330.0060.3690.071
FSNP10.2760.0520.7920.129
FSNP20.2760.2300.5130.276
FSNP30.1540.1410.5540.123
Descriptive statistics for the eight types of inbreeding coefficient estimates calculated from the pedigree (FPED), the identified ROH (FROH_ all, FROH1–5 Mb, FROH5–10 Mb, and FROH > 10 Mb), and the SNP information (FSNP1, FSNP2, and FSNP3). The correlation coefficients among all of the results estimated by different methods are shown in Table 2. The correlation between inbreeding coefficients estimated by the pedigree and the genomic data was weak (varied from −0.188 to 0.055). The correlation among all of the estimates calculated based on the ROH information was strong. FROH_ was strongly correlated with the other ROH-based estimates, and the correlation coefficients between FROH_ all and FROH1–5 Mb were the highest (0.910), whereas the weakest correlation among the ROH-based estimates was between F5–10 Mb and F>10 Mb, which was 0.593. The correlation among all the values calculated by the SNPs was strong and statistically significant. However, the correlations between the estimates based on the SNPs and ROH data were relatively strong, except for FSNP2, which was significantly correlated with all of the ROH-based inbreeding coefficients.
TABLE 2

Correlation coefficients (lower panel) among eight types of inbreeding coefficient estimates (FPED, FROH1–5 Mb, FROH5–10 Mb, FROH 10 Mb, FROH_ all, FSNP1, FSNP2, and FSNP3).

CorrelationFPEDFROH1–5 MbFROH5–10 MbFROH > 10 MbFROH_allFSNP1FSNP2FSNP3
FPED1
FROH1–5 Mb0.0291
FROH5–10 Mb–0.0950.701**1
FROH>10 Mb0.0350.645**0.593**1
FROH_all0.0550.910**0.810**0.870**1
FSNP1–0.0510.1060.1700.1080.1291
FSNP2–0.1880.447**0.364**0.325**0.422**0.508**1
FSNP3–0.1440.335**0.3170.2600.332**0.842**0.892**1
Correlation coefficients (lower panel) among eight types of inbreeding coefficient estimates (FPED, FROH1–5 Mb, FROH5–10 Mb, FROH 10 Mb, FROH_ all, FSNP1, FSNP2, and FSNP3).

Genomic Distribution of Runs of Homozygosity

In our study, 7,508 ROH segments were found in 93 individuals, with an average length of 3.76 Mb. The segment with the longest segment (45.15 Mb) was SSC15, which contains 978 SNPs. In contrast, the shortest segment (1 Mb) only consisted of 265 SNPs (Figure 2A). In total, all ROH segments covered 13.4% of the entire genome.
FIGURE 2

Distribution of the ROH. (A) Length distribution of the ROH. X-axis represents the length of the ROH (Mb) using a base-10 log scale. Y-axis represents the number of ROH for different ROH lengths. (B) Bars and the red dotted line represent the number of ROH and the ROH coverage, respectively, on each chromosome.

Distribution of the ROH. (A) Length distribution of the ROH. X-axis represents the length of the ROH (Mb) using a base-10 log scale. Y-axis represents the number of ROH for different ROH lengths. (B) Bars and the red dotted line represent the number of ROH and the ROH coverage, respectively, on each chromosome. The statistics of ROH number and length classified by ROH length are shown in Table 3. Among the three classes of ROH segments, the majority were short ROH segments (1–5 Mb), accounting for 78.46% of the total number of ROH segments, and it also had the highest genome coverage of 48.43% of the total length of all ROH segments. ROH5–10 Mb had a lower quantity of 1,197 (15.94% of the total ROH number), and it made up 28.81% of the total ROH length. The number of long ROH segments was the lowest, only 5.57%, but it still had a coverage ratio of 22.76%.
TABLE 3

Descriptive statistics for ROH of different lengths (Mb): 1–5, 5–10, >10, and >1.

ROH length (Mb)ROH numberPercent (%)Mean length (Mb)Standard deviationGenome coverage (%)
1–55,89378.492.321.056.49
5–101,19715.946.791.293.86
>104185.5715.386.063.05
Total (>1)7,5081003.763.7113.40
Descriptive statistics for ROH of different lengths (Mb): 1–5, 5–10, >10, and >1. For individuals, high variability was observed in both the number of ROH segments and the genome coverage; the former varied from 6 to 166, and the latter varied from 0.56 to 36.86%. For chromosomes, the number and coverage of ROH on each chromosome are shown in Figure 2B. These results suggest that SSC6 has the highest number of ROH segments of 688, and its coverage is 16.078%. In contrast, ROH segments in SSC12 have the lowest number of 215, and its coverage ratio is also the lowest (8.56%). SSC4 has the highest coverage ratio of 19.95%.

Detection of Runs of Homozygosity Islands

To identify the genomic regions with high ROH frequency, we first calculated the frequency of SNP occurrence in the ROH and selected the top 1% among those with the highest frequency (present in at least 33.3% of the samples) (Figure 3). There were four adjacent SNPs on SSC4 at ∼35 Mb with the highest frequency of 60.2%, which is similar to the findings of Wang et al. (2021), but no gene has been mapped in this region according to the reference genome (Sscrofa11.1). Neighboring selected SNPs that constitute regions with the highest frequency of ROH are called ROH islands. The genomic regions of the ROH islands detected in Laiwu pigs, distributed unevenly on different chromosomes with lengths varying from 0.031 to 14.4 Mb, are presented in Table 4. The longest ROH island was found to be on SSC6 and consisted of 320 contiguous SNPs, and the longest ROH cod-spot was on SSC4, containing 1,177 contagious SNPs.
FIGURE 3

Frequency of occurrences of each SNP within ROH regions among all of the individuals. Horizontal red line represents the 33% threshold. Genes related to meat quality have been identified.

TABLE 4

List of ROH islands observed in this Laiwu pig population.

ChromosomeStart (bp)End (bp)Length (bp)Number of SNPsNumber of genes
2125893724126531130637406261
33941424441549236213499215917
432235908458766351364072743819
519810174342275911441741732035
5189877841915368616590280
5184349151851760082685120
680696814807284463163270
9138352276139076532724256900
10607104496507935843689094513
11606460206633010656840862309
1325594502749224189774194
175124010151391328151227160
175032694850888804561856421
Frequency of occurrences of each SNP within ROH regions among all of the individuals. Horizontal red line represents the 33% threshold. Genes related to meat quality have been identified. List of ROH islands observed in this Laiwu pig population.

Candidate Genes Within Runs of Homozygosity Islands

Thirteen ROH islands were detected in our study, and there were 86 genes within those regions according to the University of California, Santa Cruz database (Supplementary Table 1). The position and number of the constituent SNPs and genes in those ROH islands are listed in Table 4. SSC5 has the most islands, and the longest island, which contains 320 SNPs and 35 genes, is also found between 19,810,174 and 34,227,591 bp. In some ROH islands that contain fewer contagious SNPs, no gene was detected. Candidate genes within all of the ROH islands detected were correlated with many economically important traits, such as meat quality (ECI1, LRP12, NDUFA4L2, GIL1, and LYZ), immunity capacity (IL23A, STAT2, STAT6, TBK1, IFNG, and ITH2), production (DCSTAMP, RDH16, and GDF11), and reproduction (ODF1 and CDK2). All genes found were further subjected to GO and KEGG enrichment analyses. Six significant GO terms and nine significant KEGG pathways are shown in Figure 4 and Supplementary Table 2. Most terms and pathways were correlated with disease resistance and biosynthesis processes, and one KEGG pathway was related to lipid metabolism. In addition, we aligned all of these ROH islands to the pig quantitative trait loci (QTL) database and finally found eight QTLs related to the IMF trait.
FIGURE 4

Bar plot of GO terms and KEGG pathways enriched (p < 0.05) based on ROH islands. Orange bars show enriched GO terms, and green bars show KEGG pathways.

Bar plot of GO terms and KEGG pathways enriched (p < 0.05) based on ROH islands. Orange bars show enriched GO terms, and green bars show KEGG pathways.

Discussion

Characteristics of Runs of Homozygosity

In our study, the number of small ROH segments was the highest, whereas the number of medium and large segments was much lower, especially large segments (>10 Mb). Small ROH can reflect ancestry inbreeding history, whereas large segments are usually formed by recent inbreeding. These results indicated that both ancient and recent inbreeding events might affect this Laiwu population, but recent inbreeding was not frequent. However, it should be noted that the length of ROH may not always reflect the generations from inbreeding events that took place, as randomness exists during the formation of ROH, which can be affected by the dynamic rates of recombination, as well as the stochastic nature during gamete formation (Liu et al., 2010). Kirin et al. (2010) reported that short segments (<4 Mb) could also indicate the consequence of a decreased population size and bottleneck event. The variability of the number of ROH among individuals was high, which coincides with the results of studies on cattle and sheep (Purfield et al., 2017; Peripolli et al., 2018). ROH number and coverage also differ from chromosome to chromosome, but the number of ROH tends to increase as the length of the chromosome increases. It has been reported that ROH are easy to form and inherit on some chromosomes but relatively difficult on other chromosomes, suggesting that some parts of the genome are less supportive of maintaining the accumulation of IBD than others (Nandolo et al., 2018). This phenomenon could be related to the selection. Chromosomes with high ROH coverage may suggest that they have been affected by positive selection, which will increase the accumulation of advantageous alleles and the neutral alleles genetically linked with them (Smith and Haigh, 1974). In contrast, the chromosome with low ROH coverage could indicate purifying selection, which decreases the number of homozygous deleterious alleles (Ferenčaković et al., 2013). In our study, some genomic regions of SSC4 and SSC12, which have the highest coverage and lowest coverage, respectively, may have been under natural or artificial selection. The SNPs used to detect ROH regions were obtained from GGRS, a high-throughput reduced representation sequencing method in our study. It is a cost-effective approach and is designed for outbreed population, making it suitable for the population of Laiwu pigs in our research. However, the challenge is that the distribution of the data generated by it is unbalanced among individuals, which often causes a large portion of missing genotypes, which would impact subsequent analysis. To address this issue, imputation is inevitably used, and it has been included in the GGRS protocol. After imputation, we have also taken quality control measures, including filtering out the SNPs with a MAF < 0.05. This filter is necessary to ensure the precision of imputation (Wang et al., 2015). There might be concerns about the necessity and appropriateness of imputation and the subsequent filtration with MAF, but several researchers have conducted studies that may help address such concerns. Imputed data have been used to detect ROH as found in several published works (Power et al., 2014; Chitneedi et al., 2017). It has been reported that using imputed data for ROH detection can produce comparable results with using the data that are restricted to genotyped SNPs only (Keller et al., 2012); Zhang et al. (2018) have also conducted the simulation to study the reliability of the use of imputed data in ROH detection and found that using imputed data with an average missing genotype rate of 0.7 could give a comparable amount of ROH detected compared with using data with no missing genotypes. Admittedly, removing monomorphic SNPs may reduce the chance to detect ROH, but it helps to exclude false-positive results. It has been reported that removing SNPs with low MAF (<0.05) before the analysis can minimize the trade-off between the exclusion of non-autozygous ROHs and the cost of missing shorter autozygous ROHs (Howrigan et al., 2011). To date, few studies have been done comparing the performance of software used for ROH detection and the ways of determining appropriate parameters (Howrigan et al., 2011). Actually, a standard methodology for ROH detection is still ambiguous (Biscarini et al., 2020). In our study, we used PLINK software with default parameters to detect ROH; the same method has been applied in several published works (Peripolli et al., 2018; Xu et al., 2019).

Runs of Homozygosity as a Tool of Inbreeding Estimation

The traditional method of estimating inbreeding coefficients is based on pedigree, and it is not as efficient as the method based on genomic information because errors usually exist in pedigree records. FPED only indicates the expectation of homozygosis, whereas FROH and FSNP can reflect the realized homozygous loci (Visscher et al., 2006). In our study, the value of FPED was much lower than FROH, which may be due to errors in the pedigree information. This may also be due to an inadequate pedigree, as FROH could suggest an inbreeding history 50 generations ago, but our pedigree only contains 8–10 generations of records. Some events, such as selection and mutation, were not involved in the method of estimating inbreeding coefficients, so the value of FPED was likely to have a bias (Ceballos et al., 2018). In addition, our sequencing and genotyping method used reduced sequencing, which only covered 2–3% of the whole genome, so we may have overestimated the homozygosity of some regions and produced false-positive results during the process of identifying the ROH segments. Overestimating the number of ROH would also lead to an overestimation of inbreeding coefficients based on ROH. In our study, the correlation between FPED and FROH or FPED and FSNP was insignificant, which may be due to the inaccuracy of the pedigree. It has been reported that the correlation between FPED and FROH would become stronger as the length of the ROH increase, as long ROH segments can reflect a recent inbreeding situation, which coincides with the trend found in our results (Magi et al., 2014). The correlation among the four indexes calculated by ROH was strong, especially between FROH_all and FROH1–5 Mb because the length of most ROH segments ranged from 1 to 5 Mb. The correlation between FSNP and FROH was relatively strong, which may because the identifying of ROH was based on the results of SNP genotyping. FSNP2 correlates most strongly with FROH because they both directly reflect homozygosity.

Analysis of Candidate Genes Within Runs of Homozygosity Islands

We identified and annotated several candidate genes contained by ROH islands and found they were related to various breeding traits, including meat quality, immunity, production, reproduction, etc. Because Laiwu pigs are characterized for high IMF contents, several candidate genes detected in our study were reported to be associated with lipid deposition and meat quality. ECI1 expression was observed to have a positive correlation with IMF content (Lu et al., 2017). LRP12 was reported to have highly significant associations with gluteus medius saturated fatty acid content in Duroc pigs (Melo et al., 2013). NDUFA4L2 can affect metabolic enzyme activities, which has a direct impact on meat quality (Liu et al., 2016). GIL1 negatively correlates with fat accumulation, and LYZ has been proven to be related to conductivity 24, pH1, and drip loss of porcine meat (Srikanchai et al., 2010). Some candidate genes relating to immunity were identified, such as IL23A, STAT2, STAT6, TBK1, and IFNG. It is noteworthy that ITH2 was reported to be one of the putative PCV2 Cap-interacting host proteins by coimmunoprecipitation combined with liquid chromatography–mass spectrometry approach (Zhou et al., 2020), which may explain the strong ability of PCV2 resistance of Laiwu pigs. Some genes are related to production traits. DCSTAMP can affect feed gain ratio (Tao et al., 2017), and RDH16 is also correlated with feed efficiency (Zhao et al., 2016). GDF11 was proven to be the candidate gene involved in growth and body size in micro pigs (Son et al., 2020) and can raise the number of ribs in knock-out mice (Zhang et al., 2016). ODF1, participating in spermatogenesis and CDK2, affects the meiosis of oocytes, affecting the reproduction traits of pigs. WIF1 could be a crucial mutation associated with ear size (Liang et al., 2019), which can correlate with the big ears of Laiwu pigs. According to the Pig QTL database, several QTLs related to meat quality, immunity, the reproduction process, and other traits overlapping with the detected ROH islands are listed. In particular, some QTLs involved in the process of IMF deposition have already been reported: ID = 639 (Bink et al., 2000), ID = 145 (Gerbens et al., 2001), ID = 150, ID = 151, ID = 299 (Ovilo et al., 2002a,b), ID = 313 (de Koning et al., 1999), ID = 639 (Bink et al., 2000), and ID = 4169 (Ovilo et al., 2000).

Conclusion

In summary, this study investigated the patterns of ROH and calculated inbreeding coefficients in 93 Laiwu pigs, which revealed how diversity evolved in the Laiwu population. We found that this population has been affected by historical inbreeding events, but recent inbreeding events are not frequent. Moreover, we identified 13 ROH islands putatively under natural and/or artificial selection harboring genes with molecular functions related to characteristic traits. We found that most genes contained by ROH islands were associated with disease resistance and biosynthesis processes. In addition, several genes related to economically important traits, such as meat quality, were also detected. These results may help us understand the characteristics of Laiwu pigs and provide insight for future breeding strategies.

Data Availability Statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author Contributions

YP and QWe designed the experiment. YF, XH, ZX, and HS performed the experiment. QZ, PM, and ZZ developed some analysis software used in this study. YF wrote the manuscript with RC’s kind help. All authors have read and approved the final manuscript.

Conflict of Interest

YS, ZQ, and QWe were employed by Breeding Company of Pigs, Laiwu, China. The remaining author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  8 in total

1.  Genome-wide detection of genetic structure and runs of homozygosity analysis in Anhui indigenous and Western commercial pig breeds using PorcineSNP80k data.

Authors:  Yao Jiang; Xiaojin Li; Jiali Liu; Wei Zhang; Mei Zhou; Jieru Wang; Linqing Liu; Shiguang Su; Fuping Zhao; Hongquan Chen; Chonglong Wang
Journal:  BMC Genomics       Date:  2022-05-17       Impact factor: 4.547

2.  Genome-Wide Association Studies, Runs of Homozygosity Analysis, and Copy Number Variation Detection to Identify Reproduction-Related Genes in Bama Xiang Pigs.

Authors:  Jiayuan Mo; Yujie Lu; Siran Zhu; Lingli Feng; Wenjing Qi; Xingfa Chen; Bingkun Xie; Baojian Chen; Ganqiu Lan; Jing Liang
Journal:  Front Vet Sci       Date:  2022-05-31

3.  Whole-genome sequence analysis unveils different origins of European and Asiatic mouflon and domestication-related genes in sheep.

Authors:  Ze-Hui Chen; Ya-Xi Xu; Xing-Long Xie; Dong-Feng Wang; Diana Aguilar-Gómez; Guang-Jian Liu; Xin Li; Ali Esmailizadeh; Vahideh Rezaei; Juha Kantanen; Innokentyi Ammosov; Maryam Nosrati; Kathiravan Periasamy; David W Coltman; Johannes A Lenstra; Rasmus Nielsen; Meng-Hua Li
Journal:  Commun Biol       Date:  2021-11-18

4.  Assessment of Heterozygosity and Genome-Wide Analysis of Heterozygosity Regions in Two Duroc Pig Populations.

Authors:  Donglin Ruan; Jie Yang; Zhanwei Zhuang; Rongrong Ding; Jinyan Huang; Jianping Quan; Ting Gu; Linjun Hong; Enqin Zheng; Zicong Li; Gengyuan Cai; Xiaopeng Wang; Zhenfang Wu
Journal:  Front Genet       Date:  2022-01-27       Impact factor: 4.599

5.  Comparative Transcriptomic Profiles of Differentiated Adipocytes Provide Insights into Adipogenesis Mechanisms of Subcutaneous and Intramuscular Fat Tissues in Pigs.

Authors:  Pan Zhang; Bo Zhang; Peng Shang; Yu Fu; Ruixue Nie; Yangzom Chamba; Hao Zhang
Journal:  Cells       Date:  2022-01-31       Impact factor: 6.600

6.  Sequencing Reveals Population Structure and Selection Signatures for Reproductive Traits in Yunnan Semi-Fine Wool Sheep (Ovis aries).

Authors:  Yi Guo; Jiachong Liang; Chunrong Lv; Yi Wang; Guoquan Wu; Xiangdong Ding; Guobo Quan
Journal:  Front Genet       Date:  2022-03-07       Impact factor: 4.599

7.  Simultaneous testing of rule- and model-based approaches for runs of homozygosity detection opens up a window into genomic footprints of selection in pigs.

Authors:  Jan Berghöfer; Nadia Khaveh; Stefan Mundlos; Julia Metzger
Journal:  BMC Genomics       Date:  2022-08-06       Impact factor: 4.547

8.  Genetic diversity and population structure of Tongcheng pigs in China using whole-genome SNP chip.

Authors:  Jiao Yuan; Xiang Zhou; Guoqiang Xu; Sanping Xu; Bang Liu
Journal:  Front Genet       Date:  2022-08-25       Impact factor: 4.772

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.