Literature DB >> 35360858

Variants Within Genes EDIL3 and ADGRB3 are Associated With Divergent Fecal Egg Counts in Katahdin Sheep at Weaning.

Gabrielle M Becker1, Joan M Burke2, Ronald M Lewis3, James E Miller4, James L M Morgan5, Benjamin D Rosen6, Curtis P Van Tassell6, David R Notter7, Brenda M Murdoch1.   

Abstract

Gastrointestinal nematodes (GIN) pose a severe threat to sheep production worldwide. Anthelmintic drug resistance coupled with growing concern regarding potential environmental effects of drug use have demonstrated the necessity of implementing other methods of GIN control. The aim of this study was to test for genetic variants associated with resistance or susceptibility to GIN in Katahdin sheep to improve the current understanding of the genetic mechanisms responsible for host response to GIN. Linear regression and case-control genome-wide association studies were conducted with high-density genotype data and cube-root transformed weaning fecal egg counts (tFEC) of 583 Katahdin sheep. The case-control GWAS identified two significant SNPs (P-values 1.49e-08 to 1.01e-08) within introns of the gene adhesion G protein-coupled receptor B3 (ADGRB3) associated with lower fecal egg counts. With linear regression, four significant SNPs (P-values 7.82e-08 to 3.34e-08) were identified within the first intron of the gene EGF-like repeats and discoidin domains 3 (EDIL3). These identified SNPs were in very high linkage disequilibrium (r 2 of 0.996-1), and animals with alternate homozygous genotypes had significantly higher median weaning tFEC phenotypes compared to all other genotypes. Significant SNPs were queried through public databases to identify putative transcription factor binding site (TFBS) and potential lncRNA differences between reference and alternate alleles. Changes in TFBS were predicted at two SNPs, and one significant SNP was found to be within a predicted lncRNA sequence with greater than 90% similarity to a known lncRNA in the bovine genome. The gene EDIL3 has been described in other species for its roles in the inhibition and resolution of inflammation. Potential changes of EDIL3 expression mediated through lncRNA expression and/or transcription factor binding may impact the overall immune response and reduce the ability of Katahdin sheep to control GIN infection. This study lays the foundation for further research of EDIL3 and ADGRB3 towards understanding genetic mechanisms of susceptibility to GIN, and suggests these SNPs may contribute to genetic strategies for improving parasite resistance traits in sheep.
Copyright © 2022 Becker, Burke, Lewis, Miller, Morgan, Rosen, Van Tassell, Notter and Murdoch.

Entities:  

Keywords:  GWAS; gastrointestinal nematodes; hair sheep; helminth; parasites

Year:  2022        PMID: 35360858      PMCID: PMC8960952          DOI: 10.3389/fgene.2022.817319

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


Introduction

Gastrointestinal nematodes (GIN) are arguably considered to be one of the greatest health and economic threats to small ruminant production worldwide (Karrow et al., 2014; Escribano et al., 2019; Hassan et al., 2019; Chitneedi et al., 2020). GIN infection contributes to substantial economic losses in both meat and dairy production (Charlier et al., 2020). Symptoms of infection vary depending on the species present, but may include diarrhea, anemia or hypoproteinemia; in extreme cases, high parasite burden can lead to animal death even within the prepatent period of infection (Zajac 2006; Emery et al., 2016). GIN can have substantial repercussions on animal production even when infections are not fatal, as animals experience weight loss and/or poor growth and fail to meet their production potential (Haehling et al., 2020). The lifecycles of GIN parasites are similar and can be divided between a free-living environmental stage and a parasitic stage within the specific gastrointestinal niche. Sheep that are susceptible to GIN may harbor thousands of worms and contribute to the continued contamination of parasite eggs into the environment (McRae et al., 2015); depending on the helminth species, individual female worms may produce between 100–15,000 eggs per day (Roeber et al., 2013; Emery et al., 2016). Lambs tend to be the most susceptible to severe disease from GIN infection, as their immune systems are not fully mature and have not been previously exposed to parasite antigen (Miller and Horohov 2006; Greer and Hamie 2016). Over time and exposure, sheep tend to become more tolerant of GIN infection (McRae et al., 2015; Benavides et al., 2016). For this reason, young animals are frequently used when examining potential genetic differences in susceptibility and resistance to GIN parasites. Hair sheep are generally considered to be more tolerant of GIN infection than conventional wool breeds and are capable of maintaining higher packed cell volumes and lower fecal egg counts (FEC) during parasite challenge (Wildeus 1997; Notter et al., 2003; Vanimisetti et al., 2004; González-Garduño et al., 2013). The Katahdin are a composite breed of hair sheep that were founded through crosses of the St. Croix Caribbean hair sheep to temperate wool breeds such as the Suffolk and Wiltshire Horn (Wildeus 1997). Similar to the St. Croix, Katahdin sheep have been described as having increased parasite resistance traits in comparison to wool breeds (Burke and Miller 2004; Vanimisetti et al., 2004; Ngere et al., 2018). Since their development in the 1950s, the Katahdin have become an economically important breed in the United States (Thorne et al., 2021). Antiparasitic drugs have been the classic approach to help control GIN infection in sheep since their inception. This approach has been impacted in recent years by the growing incidence of anthelmintic drug resistance among GIN parasites (Papadopoulos et al., 2012; Ploeger and Everts 2018; Bosco et al., 2020; Stewart et al., 2020). Selective breeding for parasite resistance is one method that can be utilized to both lessen reliance on pharmaceutical use and improve sheep health and production (McRae et al., 2014; Keane et al., 2018; Aboshady et al., 2020). Parasite resistance can be estimated through several indicator traits, including FEC of parasite eggs per gram of feces, anemia scoring through FAMACHA (FAffa MAlan CHArt) or packed cell volume and measurement of immunoglobulin A (IgA) activity (Davies et al., 2006; Shaw et al., 2012; Ngere et al., 2018; Naeem et al., 2020); of these, FEC is the most common (Ngere et al., 2018). Parasite resistance based on FEC phenotypes has been estimated to be a moderately heritable trait (h of 0.2–0.3), although there are differences in parasite resistance both between and within breeds (Riggio et al., 2013; Brown and Fogarty 2016; Keane et al., 2018). The current literature suggests that resistance or susceptibility to GIN is likely a polygenic trait controlled by many genes, with each contributing a relatively small overall effect (Kemper et al., 2011; McRae et al., 2014). Genetic markers in genes involved with the innate and adaptive immune responses, including major histocompatibility complex (MHC) and interferon-γ genes, cytokine signaling, mucin secretion and hemostasis pathways have been previously reported and reviewed (Benavides et al., 2016; Al Kalaldeh et al., 2019; Ahbara et al., 2021; González-Garduño et al., 2021). However, it is unlikely that associated markers identified in one breed will be equally applicable to other breeds of sheep, due in part to the differences in linkage disequilibrium (LD) and allele frequencies between breeds (Benavides et al., 2015). The aim of this study was to further the current understanding of the genetic basis which may influence parasite susceptibility in Katahdin sheep. To accomplish this, a genome-wide association study (GWAS) was performed with high-density (HD) genotype data and weaning FEC of 583 Katahdin sheep. The reference DNA sequences associated with significant SNPs were queried through public functional databases for putative transcription factor binding sites (TFBSs) and sequence identity to known non-coding RNA. These predictive analyses were conducted in order to evaluate possible functional consequences of significant SNPs towards building a better understanding of the potential genetic mechanisms that differentiate susceptible from resistant sheep.

Materials and Methods

Fecal Egg Count Data Collection and Preparation

A total of 583 Katahdins (n = 275 male, n = 308 female) were used in the current study. To ensure representative exposure to GIN, animals were sampled from 20 different Katahdin flocks located in 12 states across the US (AR, GA, ID, IN, MO, NY, OH, OR, TX, VA, WI and WV) (Supplementary Figure S1). All participating flocks were enrolled in the National Sheep Improvement Program (NSIP) and producers consented to perform animal sampling for research purposes. Animals were not treated with anthelmintics within 30 days of FEC sampling. Sampled animals were born in 2019 (n = 282), 2018 (n = 209) or 2017 or earlier (n = 92). Stool samples were collected directly from the rectum of animals at weaning (70.24 ± 11.04 days of age). Weaning FEC were quantified by a certified parasitology laboratory using the modified McMaster technique (Louisiana State University School of Veterinary Medicine and the Virginia-Maryland College of Veterinary Medicine). From previous work, the GIN infections present on most farms used in the current study were expected to be mixed populations of H. contortus, Trichostrongylus spp., and to a lesser extent, Teladorsagia spp., Cooperia spp. and Oesophagostomum spp (Notter et al., 2017). As observed in other studies, the FEC data of this study were not normally distributed (Woolaston and Piper 1996; Doyle and Eady 2001; Pollott and Greeff 2004; Kemper et al., 2011; Al Kalaldeh et al., 2019). Normalcy of FEC data was assessed with the Shapiro-Wilks test, kurtosis and skewness analyses conducted in R (data not shown). As a result, weaning FEC were cube-root transformed to normalize the distribution. Average weaning transformed FEC (tFEC) for all samples was 10.18; the minimum and mode weaning tFEC values were 0 (not detected, n = 119), the maximum was 38 (n = 1) and the median was 8 (n = 23) (Supplementary Figure S2).

Genotyping and Genome-Wide Association Analyses

Blood samples were collected by the participating sheep producers and stored on blood cards until genotyping. Sample DNA were extracted from blood cards and genotyped with the high-density (HD) Illumina 600 K SNP BeadChip (Illumina Inc., San Diego, CA, USA) comprised of 606,006 markers. All DNA extraction and genotyping were conducted at Neogen Corporation—GeneSeek Operations, Lincoln, NE, USA. DNA was extracted by Neogen using the MagMAX™ DNA Multi-Sample Ultra Kit (Thermo Fisher Scientific Cat. A25598). All samples had a genotypic call rate (CR) > 90%: genotype markers with GenCall score <0.15 were initially excluded, followed by non-autosomal markers, markers with CR < 90%, minor allele frequency <1% or Hardy-Weinberg Equilibrium P-value < 1.0e-06. Redundant markers were filtered such that the marker with the highest CR per genomic location was retained for analyses. Following quality control, 505,914 high-quality autosomal markers and 583 samples were used for analyses. Association analyses were conducted through SNP and Variation SuiteTM v8.9.0 (Golden Helix, Inc., Bozeman, MT, ww.goldenhelix.com) using the Efficient Mixed-Model eXpedited (EMMAX) (Kang et al., 2010) with additive, dominant and recessive inheritance models as linear regression (LR) or case-control (CC), with cases defined as animals with undetected FEC (n = 119). In all tests a genomic relationship matrix was fitted as a random effect and variables with significant relationship to tFEC were fitted as fixed effects. These variables included birth month, birth/rear type (single birth/single reared, twin birth/twin reared, triplet birth/triplet reared, quadruplet birth/quadruplet or triplet reared, quadruplet or triplet birth/twin or single reared) and flock; year was also significant (Supplementary Figure S2) but was not included due to multicollinearity with other variables. Genome-wide significance was determined by adaptive permutation testing through plink v1.07 with the flags --assoc and --aperm. Each SNP underwent a minimum of 10 permutations and up to 1,000,000 permutations (Purcell et al., 2007, https://zzz.bwh.harvard.edu/plink/perm.shtml#aperm). Other parameters included alpha (0), beta (0.001), intercept (1) and slope (0.001). Permutation testing achieved a minimum empirical P-value of 1.00e-06; therefore, genome-wide significance was set to P-values < 1.00e-06. Any significant SNPs identified were further analyzed with the Kruskal-Wallis test (weaning tFEC ∼ SNP) and visualized with the package ggpubr in R version 3.6.3 (Hollander and Wolfe 1973; R Core Team 2020; Kassambara 2020).

Linkage Disequilibrium

The linkage disequilibrium (LD) of significant SNPs identified in the LR recessive model were evaluated. LD was estimated using the composite haplotype method in the SNP and Variation SuiteTM v8.9.0 software (Weir 1996; Golden Helix, Inc., Bozeman, MT, ww.goldenhelix.com). LD was reported using the r statistic as it is thought to be more robust than the D’ statistic and is generally more favored in the context of association studies (Ardlie et al., 2002; Kijas et al., 2014).

Non-Coding RNA Prediction Analysis

The significant SNPs positioned within RNA-seq track reads observed in NCBI were investigated against known RNA sequences. The 1,000 bp surrounding each significant SNP in the reference genome (Oar_rambouillet_v1.0) were queried through RNAcentral with the SNP reference and alternative allele (The RNAcentral Consortium, 2019). Due to the position of the SNP within the RNA-seq reads, rs416102123 was at position 501, rs413712238 was at position 729 and rs417983470 was at position 224 of the respective query sequences.

Transcription Factor Binding Site Prediction

The search tool ConSite was used to analyze input sequences against a TFBS profile collection drawn from the JASPAR database (Sandelin et al., 2004; Fornes et al., 2020). For each significant SNP identified from the LR recessive GWAS, the reference and the alternate alleles were queried along with the 15 nucleotides immediately 3′ and 5′ of the SNP in the reference genome assembly (Oar_rambouillet_v1.0), for a total input of 31 bp. The scores of each putative TFBS were examined to identify potential differences between the reference and alternate allele for each SNP.

Results

Genome-Wide Association Analyses

Genome-wide association analyses were performed with HD genotypes and weaning tFEC data of Katahdin sheep. The LR and CC analyses used a mixed-model GWAS to investigate additive, dominant and recessive inheritance models. Quantile-quantile (QQ) plots were constructed for each model to visualize the deviation of observed P-values from expected P-values. The QQ plots of the recessive models indicated the best control of type I/type II error (Supplementary Figure S3). With the LR recessive inheritance model, four significant SNPs were identified within the first intron of the gene epidermal growth factor (EGF)-like repeats and discoidin domains 3 (EDIL3), also known as developmental endothelial locus-1 (DEL-1) (Figure 1). All four significant SNPs were near to one another, with a minimum distance of 6.3 kbp and maximum distance of 30.8 kbp between SNPs. The SNPs rs405327900 and rs413712238 had the smallest and identical P-values (P-value = 3.34e-08), followed by rs417983470 (P-value = 6.66e-08) and rs416102123 (P-value = 7.82e-08). The alternate alleles of significant SNPs identified were observed at a frequency of 35% in this study population. The proportion of weaning tFEC phenotypic variance explained by significant SNPs were 5.4%, 5.4%, 5.2%, and 5.1%, respectively (Table 1).
FIGURE 1

Results of GWAS of Katahdin sheep with weaning tFEC. Manhattan plot from recessive EMMAX model with genome-wide significance threshold defined by P-values <1.0e-06 (black line).

TABLE 1

Significant SNPs from linear regression and case-control GWAS. Table displaying information for significant SNPs identified in linear regression EMMAX GWAS with weaning tFEC phenotypes and recessive, dominant or additive inheritance models and case-control EMMAX GWAS with a recessive inheritance model. A, additive; CC, case-control; Chr, chromosome; D, dominant; FDR, false discovery rate; LR, linear regression; MAF, minor allele frequency; PVE, proportion of variance explained; R, recessive.

ModelChrrs NumberPosition bp P-ValueMAFPVEGene
UnadjustedFDR
LR-R5rs40532790088,129,8573.34e-080.01690.35320.0539 EDIL3
5rs41371223888,154,4113.34e-080.00840.35110.0539
5rs41798347088,160,6896.66e-080.01120.35110.0516
5rs41610212388,144,9427.82e-080.00990.35000.0511
LR-A2rs42876870014,076,3091.49e-080.00750.02140.0567 PALM2AKAP2
LR-D2rs42876870014,076,3091.49e-080.00750.02140.0567 PALM2AKAP2
10rs4173806326,660,6352.32e-070.05880.36710.047524 kpb 5′ of PCDH17
CC-R9rs4168819895,046,4851.01e-080.00510.42180.0580 ADGRB3
9rs4216577775,325,6355.81e-080.01470.47860.0521
Results of GWAS of Katahdin sheep with weaning tFEC. Manhattan plot from recessive EMMAX model with genome-wide significance threshold defined by P-values <1.0e-06 (black line). Significant SNPs from linear regression and case-control GWAS. Table displaying information for significant SNPs identified in linear regression EMMAX GWAS with weaning tFEC phenotypes and recessive, dominant or additive inheritance models and case-control EMMAX GWAS with a recessive inheritance model. A, additive; CC, case-control; Chr, chromosome; D, dominant; FDR, false discovery rate; LR, linear regression; MAF, minor allele frequency; PVE, proportion of variance explained; R, recessive. The results of the LR additive and dominant GWAS models were similar. Both of these models identified the significant SNP rs428768700 with P-value = 1.49e-08 and 5.7% proportion of variance explained. This SNP is positioned within the first intron of the PALM2 and AKAP2 fusion gene (PALM2AKAP2). The annotation of this gene is unclear, as PALM2 and AKAP2 have been previously annotated as separate genes (NCBI Resource Coordinators 2016). No animals were homozygous for the alternate allele at this marker: 25 animals were heterozygous and the remaining 558 animals were homozygous for the reference allele. The dominant model identified an additional significant SNP, rs417380632, with P-value = 2.32e-07 and 4.8% proportion of variance explained. This SNP is located approximately 24 kbp before the start of the gene protocadherin 17 (PCDH17) (Table 1). The recessive CC model identified two significant SNPs on chromosome 9: rs416881989 with P-value = 1.01E-08 and 5.8% proportion of variance explained and rs421657777 with P-value = 5.81E-08 and 5.2% proportion of variance explained (Table 1). These markers are located 279 kbp apart and are within the gene adhesion G protein-coupled receptor B3 (ADGRB3). Marker rs421657777 is positioned within intron 17 and marker rs416881989 is positioned within intron 26 of ADGRB3. Although G protein-coupled receptors are involved in many physiological processes (Suchý et al., 2020), little is known about the specific role of adhesion G protein-coupled receptors in the sheep immune response. Sheep with homozygous alternate genotypes (AA) at these markers had significantly lower tFEC compared to sheep with heterozygous and homozygous references genotypes (Figure 2).
FIGURE 2

Significant SNP genotypes and weaning tFEC phenotypes from the case-control GWAS. (A) Kruskal-Wallis results for SNP rs416881989, (B) Kruskal-Wallis results for SNP rs421657777. The mean tFEC value is represented by the blue line.

Significant SNP genotypes and weaning tFEC phenotypes from the case-control GWAS. (A) Kruskal-Wallis results for SNP rs416881989, (B) Kruskal-Wallis results for SNP rs421657777. The mean tFEC value is represented by the blue line. Analysis of LD between the four significant SNPs identified in the LR recessive GWAS revealed that SNPs were in near to perfect LD (r = 0.996–1). The LD between SNPs rs405327900, rs413712238 and rs416102123 was r = 1, indicating perfect LD, while the LD of rs417983470 to all others was r = 0.996, indicating high LD. Given the strength of correlations between these SNPs, the frequencies at which these alleles occurred were determined within the sample population. The majority of samples were either homozygous for the alternate alleles (11.8% of samples), homozygous for the reference alleles (41.9% of samples) or were entirely heterozygous (46.1% of samples) at all four SNPs (Table 2). One sample was excluded from frequency calculations as it possessed alternate alleles at three SNPs and a heterozygous genotype at SNP rs417983470. Additionally, eight samples had missing genotypes inferred to be placed into an allele distribution.
TABLE 2

Summary of significant SNP genotype frequencies (linear regression GWAS, recessive) in Katahdin study population. The majority of animals examined (n = 582) had either entirely homozygous alternate, entirely homozygous reference or entirely heterozygous genotypes at the four significant SNPs. One animal is not represented in the genotype frequency table as it possessed homozygous alternate genotypes with the exception of a single heterozygous genotype at SNP rs417983470. A further eight animals had missing genotypes inferred in order to be placed into allele distributions.

SNP Genotypers405327900 (A/C)rs416102123 (G/A)rs413712238 (A/G)rs417983470 (A/G)Frequency
Homozygous AlternateCCAAGGGG0.118
Homozygous ReferenceAAGGAAAA0.419
HeterozygousACGAAGAG0.461
Summary of significant SNP genotype frequencies (linear regression GWAS, recessive) in Katahdin study population. The majority of animals examined (n = 582) had either entirely homozygous alternate, entirely homozygous reference or entirely heterozygous genotypes at the four significant SNPs. One animal is not represented in the genotype frequency table as it possessed homozygous alternate genotypes with the exception of a single heterozygous genotype at SNP rs417983470. A further eight animals had missing genotypes inferred in order to be placed into allele distributions.

Kruskal-Wallis Test

The Kruskal-Wallis test was used as a non-parametric analysis of variance to examine weaning tFEC phenotypes between genotypes of significant SNPs. Due to LD (r = 1), SNPs rs405327900, rs416102123 and rs413712238 all exhibited identical Kruskal-Wallis test results, with overall p = 0.00018. The homozygous alternate genotypes (CC, AA, GG) were significantly different when compared to heterozygous (AC, GA, AG) (p = 2.3e-05) and homozygous reference genotypes (AA, GG, AA) (p = 0.00091), respectively. Again based on the Kruskal-Wallis test, with an overall p = 0.00023, the homozygous alternate genotype (GG) for SNP rs417983470 significantly differed from the heterozygous (AG) (p = 3.1e-05) and homozygous reference (AA) (p = 0.001) genotypes. For all four SNPs, the homozygous alternate genotype was found to have a significantly higher median weaning tFEC than all other genotypes (Figure 3).
FIGURE 3

Significant SNP genotypes and weaning tFEC phenotypes from the linear regression recessive GWAS. (A) Kruskal-Wallis results for SNP rs405327900 represents rs416102123 and rs413712238 as well due to perfect LD, (B) Kruskal-Wallis results for SNP rs417983470. The mean tFEC value is represented by the blue line.

Significant SNP genotypes and weaning tFEC phenotypes from the linear regression recessive GWAS. (A) Kruskal-Wallis results for SNP rs405327900 represents rs416102123 and rs413712238 as well due to perfect LD, (B) Kruskal-Wallis results for SNP rs417983470. The mean tFEC value is represented by the blue line. Three of the significant SNPs (rs413712238, rs417983470, rs416102123) were located within a region with previously reported RNA-seq reads (NCBI Genome Data Viewer: RNA-seq intron-spanning reads, aggregate (filtered), NCBI Ovis aries Anotation Release 103—log base 2 scaled) (Figure 4). These SNPs were investigated for similarity to known non-coding RNA sequences curated by the RNAcentral database. This analysis identified that SNP rs413712238 was located within a target RNA sequence which achieved an identity score >90% (Figure 5A). The sequence identified is that of NONBTAT023693.2, a lncRNA 311 nucleotides in length in the Bos taurus genome (Bos_taurus_UMD_3.1/bosTau6 assembly). The reference allele query matched at 289 positions and the alternate allele query matched at 290 positions of the target RNA sequence (Figure 5B).
FIGURE 4

Genomic context of significant SNPs within gene EDIL3. Figure displays image from the National Center for Biotechnology Information (NCBI) Genome Data Viewer tool. Genomic context is displayed for the Oar_rambouillet_v1.0 genome assembly. The SNPs rs417983470, rs413712238 and rs416102123 are within NCBI Ovis aries Annotation Release 103 RNA-seq reads. Relevant linkage disequilibrium (LD), long non-coding RNA (lncRNA) query and transcription factor binding site (TFBS) prediction results are noted.

FIGURE 5

RNAcentral results for SNP rs413712238. (A) Table contains a summary of RNAcentral query results in which query SNP sequence matches with ≥90% identity score to known RNA sequence. SNP rs413712238 matched with bovine lncRNA (UMD_3.1 reference genome assembly). (B) Full sequence of lncRNA NONBTAT023693.2 (Bos taurus) vs. the query reference and alternate allele sequences for rs413712238. The location of SNP rs413712238 is given in red.

Genomic context of significant SNPs within gene EDIL3. Figure displays image from the National Center for Biotechnology Information (NCBI) Genome Data Viewer tool. Genomic context is displayed for the Oar_rambouillet_v1.0 genome assembly. The SNPs rs417983470, rs413712238 and rs416102123 are within NCBI Ovis aries Annotation Release 103 RNA-seq reads. Relevant linkage disequilibrium (LD), long non-coding RNA (lncRNA) query and transcription factor binding site (TFBS) prediction results are noted. RNAcentral results for SNP rs413712238. (A) Table contains a summary of RNAcentral query results in which query SNP sequence matches with ≥90% identity score to known RNA sequence. SNP rs413712238 matched with bovine lncRNA (UMD_3.1 reference genome assembly). (B) Full sequence of lncRNA NONBTAT023693.2 (Bos taurus) vs. the query reference and alternate allele sequences for rs413712238. The location of SNP rs413712238 is given in red. To investigate the potential consequences of EDIL3 intronic variants, significant SNPs were examined for putative TFBSs. The SNPs rs405327900, rs413712238 and rs416102123 were located within predicted TFBS sequences. TFBS matrix score differences between alternate and reference alleles were observed for two of the three SNPs. The SNP rs405327900 was predicted within the transcription factors T-box transcription factor T (TBXT, also known as brachyury) and proto-oncogene c-Fos (c-Fos). Whereas the alternate allele introduced a TFBS for c-Fos which was not present at the reference allele, the binding score prediction for TBXT was greater with the reference allele compared to the alternate allele. Regarding SNP rs416102123, a sequence of 10 nucleotides matched with transcription factors belonging to the Rel/NF-κB family: Rel class, proto-oncogene c-Rel (c-Rel) and transcription factor p65 (also known as RelA). These TFBS were predicted to have greater scores with the alternate allele vs. reference allele. A TFBS for interferon regulatory factor 1 (IRF1) was identified with rs416102123 reference and lost with alternate allele sequence (Table 3, Supplementary Figure S4). Matrix scores of all predicted TFBS results are reported in Supplementary Table S1.
TABLE 3

Results from ConSite transcription factor binding site (TFBS) analysis showing differences between reference and alternate allele sequences. SNPs rs405327900 and rs416102123 had predicted TFBS score differences between their reference and alternate alleles. (*) denotes transcription factor is only present in alternate allele sequence, (^) denotes transcription factor is only present in reference allele sequence.

Transcription Factor Score difference Alt vs. Ref
rs405327900
 TBXT/Brachyury−2.308
 c-Fos*+8.141
rs416102123
 REL class+0.148
 c-Rel+0.844
 p65/RelA+1.310
 IRF1^−10.652
Results from ConSite transcription factor binding site (TFBS) analysis showing differences between reference and alternate allele sequences. SNPs rs405327900 and rs416102123 had predicted TFBS score differences between their reference and alternate alleles. (*) denotes transcription factor is only present in alternate allele sequence, (^) denotes transcription factor is only present in reference allele sequence.

Discussion

This GWAS is the first to identify significant associations with the gene EDIL3 and sheep response to GIN infection. This study capitalized on the increased resolution of the HD genotype array to detect associations within a relatively large sample size. Several prior studies have reported quantitative trait loci (QTL) within sequence near EDIL3, including Sallé et al. (2012), Kemper et al. (2011), Becker et al. (2020). These studies reported SNPs of significance or suggestive significance positioned 3.7 Mb 5′ (rs401461177), 7.1 Mb 5′ (rs415558729) and 8.7 Mb 3′ (rs55632043) of EDIL3, respectively. None of the aforementioned studies tested the four SNPs significant in the present study, as these SNPs are unique to the HD array and could not be investigated in studies that utilize 50K array data. The gene EDIL3 encodes the protein epidermal growth factor (EGF)-like repeat and discoidin I-like domain-containing protein 3, a secreted glycoprotein which acts as an integrin ligand and has non-redundant roles in multiple stages of the immune response, including myelopoiesis, anti-inflammatory regulation of neutrophil infiltration and resolution of inflammation (Mitroulis et al., 2017; Chen et al., 2018; Hajishengallis and Chavakis 2019; Li et al., 2021). Endothelial cells secrete EDIL3 to limit neutrophil recruitment to sites of infection and restrain the initiation of inflammation. This is accomplished by interfering with the interaction of lymphocyte function-associated antigen-1 (LFA-1) integrin on leukocytes with intercellular adhesin molecule-1 (ICAM-1) on the surface of vascular endothelial cells (Shin et al., 2013; Li et al., 2021). It is also thought to inhibit ICAM-1-dependent chemokine release (CXCL2 and CCL3) by neutrophils, and EDIL3 is thought to be involved in downstream processes of inflammation through binding to the αvβ3 integrin on the macrophage and phosphatidylserine on the apoptotic neutrophil cell to mediate efferocytosis and inflammation resolution (Hanayama et al., 2004; Kourtzelis et al., 2019; Li et al., 2021). EDIL3 is reciprocally regulated with the proinflammatory cytokine IL-17 (Shin et al., 2013; Choi et al., 2015; Yan et al., 2018; Li et al., 2021). IL-17 is produced by Th17 cells to promote recruitment of macrophages and neutrophils to aggravate chronic inflammation (McRae et al., 2015; Sehrawat and Rouse, 2017). IL-17 functions with TNF-α to enhance expression of neutrophil-attracting chemokines (CXCL1, CXCL2, CXCL5), which leads to an increase in leukocyte transmigration as well as CXCR2-dependent neutrophil transmigration in vivo (Griffin et al., 2012). Considering these functions, EDIL3 protein could have potential consequences on the initiation, sustainment and/or resolution of inflammation during GIN parasite infection in sheep. To investigate the potential role of EDIL3 markers associated with tFEC phenotypes, the reference genome regions encompassing significant SNPs were investigated for differences between predicted TFBS or ncRNA sequences of reference vs. alternate SNP alleles. Differences in predicted putative TFBS were identified at two SNPs within the first intron of EDIL3. These findings are of interest, as transcription mediated by RNA-polymerase II may be activated or repressed by the presence of transcription factors binding in specific regions of DNA (Sandelin et al., 2004). The SNP rs405327900 was predicted within a c-Fos TFBS and SNP rs416102123 was predicted within REL class/c-Rel/p65 TFBS and IRF1 TFBS. The current literature contains well-documented roles for these transcription factors within the immune system (Taki et al., 1997; Foletta et al., 1998; Pahl 1999; Taniguchi et al., 2001; Vallabhapurapu and Karin, 2009; Forero et al., 2019). Fos proteins enhance DNA-binding activity through the formation of stable heterodimers with Jun proteins (Hess et al., 2004) and members of the Fos and Jun protein families are components of the activator protein-1 (AP-1) transcription factor complex (Hess et al., 2004; Ray et al., 2006; Szalóki et al., 2015). The transcription factors p65 and c-Rel are members of the Rel/NF-κB family (Pahl 1999). These transcription factors are thought to be important in many processes, including: inducing expression of cytokines and chemokines, proteins involved in antigen presentation, cell adhesion molecules, genes involved in stress response, apoptosis and angiogenesis (Ballard et al., 1988; Roebuck 1999; Shaulian and Karin 2002; Eferl and Wagner, 2003; Herrmann et al., 2003; Liang et al., 2004; Gao et al., 2006; Bunting et al., 2007; Son et al., 2008; Reuter et al., 2010). Although these transcription factors do not have confirmed roles within EDIL3 gene regulation, transcription factor AP-1 has been predicted to target the human EDIL3 gene according to the MotifMap Predicted Transcription Factor Targets database (Xie et al., 2009; Rouillard et al., 2016). Importantly, NF-κB and AP-1 are involved in pro-inflammatory pathways. Neutrophil recruitment by bronchial epithelial cells was found to be regulated by NF-κB and AP-1 transcription factors through expression of proinflammatory cytokines (Desaki et al., 2000). Differential activation and promoter binding of AP-1 and NF-κB have been associated with both IL-8 and ICAM-1 gene expression, which regulate transendothelial migration of neutrophils (Roebuck 1999; Desaki et al., 2000; Reuter et al., 2010). The transcription factor interferon regulatory factor 1 (IRF1) is involved in both innate and adaptive immunity (Taniguchi et al., 2001; Honda and Taniguchi 2006; Kano et al., 2008). IRF1 is involved in induction of type 1 interferon genes as well as the activation of interferon-stimulated genes (Henderson et al., 1997). It has been found that T cells from mice lacking IRF1 fail to mount a Th1 response, and instead undergo exclusive Th2 differentiation in vitro (Taki et al., 1997). Long non-coding RNA (lncRNA) expression has been found to correlate with the expression of nearby genes (Ebisuya et al., 2008; Cabili et al., 2011; Engreitz et al., 2016) and lncRNA may regulate miRNA function, thereby influencing gene expression (Ballantyne et al., 2016). lncRNAs play major roles in gene regulation as well as many other biological processes, and the deregulation of lncRNA have been indicated in disease processes (Wu et al., 2014; Dhanoa et al., 2018; Jiang et al., 2019). In this study, the reference sequence surrounding SNP rs413712238 matched with a high identity to a lncRNA described within the bovine genome. Further work is necessary to confirm the presence of lncRNA transcript within ovine EDIL3 and to explore the potential functional implications. The significant SNPs identified in this study were in high LD (r = 0.996–1) and genotypes were observed at high frequencies in the study population. Animals that possessed the homozygous alternate genotype at each significant SNP were likely more susceptible to GIN, as indicated by significantly higher median weaning tFEC phenotypes. These animals were predicted to match the lncRNA NONBTAT023693.2 with a greater identity score, possess a TFBS for c-Fos and lack a TFBS for IRF1. Animals with either homozygous reference or heterozygous genotypes were likely less susceptible to GIN infection when compared to animals with homozygous alternate genotypes. Animals that possessed homozygous reference genotypes at each significant SNP had lower predicted identity to NONBTAT023693.2 lncRNA, lacked c-Fos TFBS and possessed IRF1 TFBS. Animals that were heterozygous at significant SNPs possessed a combination of reference and alternate allele genetic attributes, but were phenotypically similar to animals with only reference alleles. These results suggest that the presence of c-Fos TFBS on both strands within EDIL3, either in conjunction with or independent of the presence of intronic lncRNA, might influence gene expression towards a more susceptible immune response to GIN. Conversely, the presence of IRF1 TFBS on one or both strands may influence a less susceptible response to GIN. Based on these results, the expression of EDIL3 appears to contribute to GIN susceptibility in Katahdin sheep, and this expression may be mediated by allelic variants within TFBS and/or lncRNA sequence within the first intron (Figure 4). The evidence presented here supports the hypothesis that EDIL3-mediated susceptibility to GIN is recessively inherited. The prediction of c-Fos TFBS within rs405327900 sequence of more susceptible animals and IRF1 TFBS exclusively within rs416102123 sequence of less susceptible animals suggests opposing roles for these transcription factors within the context of EDIL3 gene regulation. Both c-Fos and IRF1 have been known to promote or suppress expression of target genes, and it is unclear how changes in EDIL3 expression may function in the context of GIN infection in sheep. Upregulation of Th17-associated genes has been associated with both resistance and susceptibility to GIN in sheep (McRae et al., 2015). The Th17 response has been linked with the inability to control L3 colonization, adult worm infection and egg production during infection by T. circumcincta in Blackface lambs (Gossner et al., 2012). Lambs of St. Croix and Barbados Blackbelly descent mounted a stronger Th17 response during H. contortus challenge compared with more susceptible composite wool lambs (MacKinnon et al., 2009), and genetic loci related to Th17 genes were recently associated with Florida Native sheep resistance to H. contortus (Estrada-Reyes et al., 2019). It may be hypothesized that suppression of EDIL3 gene transcription by either c-Fos or IRF1 may have respectively harmful or beneficial effects on GIN infection through reciprocal upregulation of IL-17 and the Th17 response. The precise effects of a Th17 response likely depends upon the breed of sheep and predominant parasite species involved in the infection. Additionally, deficiency of EDIL3 in mice has been shown to activate transforming growth factor β (TGF-β); upregulation of TGF-β may be associated with an effective immune response to GIN in mammalian hosts (Ahbara et al., 2021), and earlier expression of TGF-β and IL10 have been associated with resistant compared to susceptible Morada Nova sheep (Toscano et al., 2019). Further research is necessary to understand what involvement EDIL3 may have on the immune response to GIN in Katahdin sheep. In this study, four SNPs were identified within the first intron of EDIL3 that were significantly associated with weaning tFEC in Katahdin sheep. Furthermore, animals with alternate homozygous genotypes at significant SNPs were found to have significantly greater median weaning tFEC phenotypes, indicating that these genotypes incur greater risk for susceptibility to GIN. Significant SNPs may have functional consequences through altered TFBS and/or lncRNA sequence, thereby affecting EDIL3 gene expression. Further work is needed to elucidate causative variants and precise functional mechanisms as well as to confirm the presence of predicted TFBS and lncRNA. To the best of our knowledge, this is the first study to associate the immune gene EDIL3 with response to parasites in sheep.
  93 in total

1.  ConSite: web-based prediction of regulatory elements using cross-species comparison.

Authors:  Albin Sandelin; Wyeth W Wasserman; Boris Lenhard
Journal:  Nucleic Acids Res       Date:  2004-07-01       Impact factor: 16.971

Review 2.  IRFs: master regulators of signalling by Toll-like receptors and cytosolic pattern-recognition receptors.

Authors:  Kenya Honda; Tadatsugu Taniguchi
Journal:  Nat Rev Immunol       Date:  2006-09       Impact factor: 53.106

3.  Differential Activation of the Transcription Factor IRF1 Underlies the Distinct Immune Responses Elicited by Type I and Type III Interferons.

Authors:  Adriana Forero; Snehal Ozarkar; Hongchuan Li; Chia Heng Lee; Emily A Hemann; Marija S Nadjsombati; Matthew R Hendricks; Lomon So; Richard Green; Chandra N Roy; Saumendra N Sarkar; Jakob von Moltke; Stephen K Anderson; Michael Gale; Ram Savan
Journal:  Immunity       Date:  2019-08-27       Impact factor: 31.745

Review 4.  AP-1 as a regulator of cell life and death.

Authors:  Eitan Shaulian; Michael Karin
Journal:  Nat Cell Biol       Date:  2002-05       Impact factor: 28.824

5.  Salivary IgA: a suitable measure of immunity to gastrointestinal nematodes in sheep.

Authors:  R J Shaw; C A Morris; M Wheeler; M Tate; I A Sutherland
Journal:  Vet Parasitol       Date:  2011-11-20       Impact factor: 2.738

Review 6.  Immunological aspects of nematode parasite control in sheep.

Authors:  J E Miller; D W Horohov
Journal:  J Anim Sci       Date:  2006-04       Impact factor: 3.159

7.  Expression of developmental endothelial locus-1 in a subset of macrophages for engulfment of apoptotic cells.

Authors:  Rikinari Hanayama; Masato Tanaka; Keiko Miwa; Shigekazu Nagata
Journal:  J Immunol       Date:  2004-03-15       Impact factor: 5.422

8.  Alarming levels of anthelmintic resistance against gastrointestinal nematodes in sheep in the Netherlands.

Authors:  H W Ploeger; R R Everts
Journal:  Vet Parasitol       Date:  2018-09-17       Impact factor: 2.738

Review 9.  The host immune response to gastrointestinal nematode infection in sheep.

Authors:  K M McRae; M J Stear; B Good; O M Keane
Journal:  Parasite Immunol       Date:  2015-12       Impact factor: 2.280

Review 10.  Evolution of the sheep industry and genetic research in the United States: opportunities for convergence in the twenty-first century.

Authors:  J W Thorne; B M Murdoch; B A Freking; R R Redden; T W Murphy; J B Taylor; H D Blackburn
Journal:  Anim Genet       Date:  2021-05-06       Impact factor: 3.169

View more
  1 in total

1.  Single Nucleotide Polymorphism Effects on Lamb Fecal Egg Count Estimated Breeding Values in Progeny-Tested Katahdin Sires.

Authors:  David R Notter; Marzieh Heidaritabar; Joan M Burke; Masoud Shirali; Brenda M Murdoch; James L M Morgan; Gota Morota; Tad S Sonstegard; Gabrielle M Becker; Gordon L Spangler; Michael D MacNeil; James E Miller
Journal:  Front Genet       Date:  2022-05-03       Impact factor: 4.772

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.