Literature DB >> 32183688

Identification of the ABCC4, IER3, and CBFA2T2 candidate genes for resistance to paratuberculosis from sequence-based GWAS in Holstein and Normande dairy cattle.

Marie-Pierre Sanchez1, Raphaël Guatteo2, Aurore Davergne3, Judikael Saout4, Cécile Grohs4, Marie-Christine Deloche4,5, Sébastien Taussat4,5, Sébastien Fritz4,5, Mekki Boussaha4, Philippe Blanquefort6, Arnaud Delafosse7, Alain Joly8, Laurent Schibler5, Christine Fourichon2, Didier Boichard4.   

Abstract

BACKGROUND: Bovine paratuberculosis is a contagious disease, caused by Mycobacterium avium subsp. paratuberculosis (MAP), with adverse effects on animal welfare and serious economic consequences. Published results on host genetic resistance to MAP are inconsistent, mainly because of difficulties in characterizing the infection status of cows. The objectives of this study were to identify quantitative trait loci (QTL) for resistance to MAP in Holstein and Normande cows with an accurately defined status for MAP.
RESULTS: From MAP-infected herds, cows without clinical signs of disease were subjected to at least four repeated serum ELISA and fecal PCR tests over time to determine both infected and non-infected statuses. Clinical cases were confirmed using PCR. Only cows that had concordant results for all tests were included in further analyses. Positive and control cows were matched within herd according to their birth date to ensure a same level of exposure to MAP. Cows with accurate phenotypes, i.e. unaffected (control) or affected (clinical or non-clinical cases), were genotyped with the Illumina BovineSNP50 BeadChip. Genotypes were imputed to whole-genome sequences using the 1000 Bull Genomes reference population (run6). A genome-wide association study (GWAS) of MAP status of 1644 Holstein and 649 Normande cows, using either two (controls versus cases) or three classes of phenotype (controls, non-clinical and clinical cases), revealed three regions, on Bos taurus (BTA) chromosomes 12, 13, and 23, presenting significant effects in Holstein cows, while only one of those was identified in Normande cows (BTA23). The most significant effect was found on BTA13, in a short 8.5-kb region. Conditional analyses revealed that only one causal variant may be responsible for the effects observed on each chromosome with the ABCC4 (BTA12), CBFA2T2 (BTA13), and IER3 (BTA23) genes as good functional candidates.
CONCLUSIONS: A sequence-based GWAS on cows for which resistance to MAP was accurately defined, was able to identify candidate variants located in genes that were functionally related to resistance to MAP; these explained up to 28% of the genetic variance of the trait. These results are very encouraging for efforts towards implementation of a breeding strategy aimed at improving resistance to paratuberculosis in Holstein cows.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32183688      PMCID: PMC7077142          DOI: 10.1186/s12711-020-00535-9

Source DB:  PubMed          Journal:  Genet Sel Evol        ISSN: 0999-193X            Impact factor:   4.297


Background

Paratuberculosis, also referred to as Johne’s disease (JD), is an infectious, contagious, and incurable disease caused by Mycobacterium avium subsp. paratuberculosis (MAP). Worldwide, no country is known to be free of MAP; it primarily affects domestic ruminants, and a high bovine herd prevalence (up to 50%) has been reported in Europe [1]. In cattle, while adult-to-calf transmission is possible in utero or via contaminated colostrum or milk, the main route of transmission occurs via the uptake of MAP-infected feces by calves. After a transient phase of shedding, calves undergo a latency phase that can last for several months or years. Subclinical symptoms of the disease include weight loss and reduced milk production together with a humoral immune response and fecal shedding; because of this, subclinical cases may continue to contaminate their environment for years. Clinical cases finally develop chronic diarrhea and severe emaciation, ending in death. Thus, JD has substantial adverse effects on animal welfare as well as a detrimental economic impact on the cattle sector [2]. There are no effective treatment protocols and vaccination (which provides only partial protection) is limited because it can induce a false positive reaction at the intradermal test used for tuberculosis detection [3]. For this reason, several countries have initiated programs for the control of JD with the objective of reducing MAP contamination, mainly by testing animals, culling seropositive animals and preventing the exposure of calves, which are the most susceptible animals in infected farms [4]. However, these programs have a high cost and a limited efficiency in clearing MAP mainly because of the disease’s long periods of latency (from infection to seropositive conversion or fecal shedding) and incubation (from infection to clinical symptoms) [5]. These periods also vary greatly among individuals, as does the amount of bacteria shed by infected cattle, which suggests the existence of individual variability in resistance to MAP. As recently reviewed by Brito et al. [6], various studies have shown the role of host genetics in resistance to MAP and have estimated heritability values for this trait between 0.03 and 0.27. In addition, numerous genome-wide association studies (GWAS) have been conducted using genotyping of single-nucleotide polymorphisms (SNPs) at low [7], medium [8-14], or high [6, 15, 16] densities as well as whole-genome sequences [17, 18] imputed from data of the 1000 Bull Genomes project [19]. These GWAS analyses have led to the identification of quantitative trait loci (QTL) associated with resistance/susceptibility to MAP on all Bos taurus (BTA) autosomes, with the most frequently identified QTL located on BTA1, 6, 7, and 23. Unfortunately, the results of genetic analyses (heritability estimates and QTL identification) differ quite a bit among published studies. These inconsistencies may arise from multiple factors such as differences in the population analyzed, disease prevalence in the population, the statistical model applied, and the definition used for the MAP-resistant phenotype. The studies mentioned above generally defined phenotypes based on a single measurement of serum ELISA, milk ELISA, MAP culture in feces or tissue, or PCR on feces, which may result in an incorrect diagnosis due to (i) the lack of sensitivity of these tests [20], (ii) the lack of concordance between ELISA and fecal culture tests [21], (iii) the potential misidentification of unexposed individuals as resistant, and (iv) variability in longitudinal serological and fecal shedding patterns among animals [5]. To avoid these drawbacks, the present work was designed as a case–control study based on confirmed phenotypes. Only confirmed positive or negative individuals were included, based on the criteria previously defined from a longitudinal study of MAP fecal shedding and serological patterns in dairy cattle [5]. To avoid a putative bias of non-exposure, the negative animals were born in the same herd and in the same period as the positive animals. In total, 1644 Holstein and 649 Normande cows with relevant and accurate contrasted phenotypic profiles (control and non-clinical/clinical cases) were genotyped with a medium-density SNP chip (50 K SNP); whole-genome sequences (WGS) were subsequently imputed using the 1000 Bull Genomes reference population (run6) [19]. Here, we present the results obtained from GWAS analyses conducted in both Holstein and Normande cows on the imputed WGS datasets.

Methods

Animals and phenotypes

Animals of certified parentage were recruited from 2034 French herds enrolled in paratuberculosis control plans. From these herds, mainly located in the northwestern region of France (Fig. 1), cows were recruited based on their serological status. Serum samples were analyzed by ELISA (Idexx, Montpellier, France, or Idvet, Montpellier, France). Based on a prior study [5], tests were considered positive if the S/P value (sample optical density over positive control optical density, as a percentage) was 90% or higher, and negative if S/P was lower than 45% (Idexx) or 60% (Idvet). For each cow, at least two ELISA tests were carried out, with at least 8 months between the tests. Strict conditions had to be met for negative cows to be included as controls. They had to be at least 48 months old, to exclude animals still in the latency period. In addition, only cows born in the same herd and the same period (± 1.5 months) as affected cows were retained, in order to maximize the potential for exposure to MAP. All animals with conflicting results were removed from the analysis. All cows that presented concordant serological statuses, either positive or negative, were then subjected to additional standardized ELISA and PCR analyses in a single laboratory (Laboratoire Départemental du Bas-Rhin, LD67, Strasbourg, France) with the same test kits throughout the experiment (Idexx for serology; Adiagene (Saint Brieuc, France) for PCR MAP quantification in the feces); an animal was only retained if its subclinical status or control phenotype was confirmed. Cows were considered to be shedders if the qPCR threshold cycle (Ct) was 35 or less, and non-shedders if it was 40 or more. All samples with intermediate results (35 < Ct < 40) were considered to be of uncertain status and were excluded from the study. Regarding the Elisa test, the threshold of 45 was used. Both tests had to provide concordant results. Based on these requirements, 1465 control cows were confirmed and 658 were eliminated; and 1138 subclinical cases were confirmed and 580 were eliminated. In addition, clinical cases (245 Holstein and 74 Normande) were identified by clinical diagnosis and confirmed by the presence of MAP in feces. Finally, following this rigorous protocol, the statuses of 2453 cows (1759 Holstein and 694 Normande), for which well conserved blood samples were available for DNA extraction, were confirmed as positive non-clinical or clinical cases or negative controls. These phenotypes were coded in subsequent analyses in one of two ways: (i) 0/1/2 for controls, non-clinical cases, and clinical cases, respectively, and (ii) 0/1 for controls and cases (non-clinical and clinical), respectively.
Fig. 1

Locations of herds enrolled in control plans and number of cows recruited

Locations of herds enrolled in control plans and number of cows recruited

Genotyping and imputation to whole-genome sequences

Animals with confirmed phenotypes were genotyped using the BovineSNP50 (50 K, Illumina Inc., San Diego) Beadchip. After quality control, 43,801 autosomal SNPs were retained. The filters used were those applied in the French national evaluation system [22]: individual call rate higher than 95%, SNP call rate higher than 90%, minor allele frequency (MAF) higher than 1% in at least one major French dairy cattle breed, and genotype frequencies in Hardy–Weinberg equilibrium with P > 10−4. Animals that were incompatible with their parents (when all genotypes were available) were excluded. In the end, 2293 cows (1644 Holstein and 649 Normande) were kept for the genetic study (Table 1).
Table 1

Number of Holstein and Normande cows with confirmed phenotypes

PhenotypeCriteriaNumber of HolsteinNumber of NormandeTotal number
Control (0)≥ 2 ELISA S/P < 45%*8382331071
1 Idexx ELISA S/P < 45%
1 qPCR Ct ≥ 40
Non-clinical case (1)≥ 2 ELISA S/P ≥ 90%577347924
1 Idexx ELISA S/P > 45%
1 qPCR Ct ≤ 35
Clinical case (2)Clinical signs and presence of MAP in feces22969298
Total16446492293

*2 ELISA performed at the regional level, thresholds were 45% with Idexx and 60% with Idvet test

Number of Holstein and Normande cows with confirmed phenotypes *2 ELISA performed at the regional level, thresholds were 45% with Idexx and 60% with Idvet test Then, the 50 K SNP genotypes were imputed to whole-genome sequences (WGS). A two-step approach was applied in order to improve the accuracy of imputed genotypes of the WGS variants [23]: from 50 to 777 K high-density (HD) SNPs using FImpute software [24], and then from imputed HD SNPs to WGS, using the Minimac software [25]. In spite of a longer computing time, Minimac was preferred to FImpute for the imputation of WGS because it infers allele dosages in addition to the best-guess genotypes. Compared to best-guess genotypes, allele dosages are expected to be more correlated to the true genotypes [26] and to lead to better targeting of causative mutations in GWAS analyses [27]. Imputations from 50 K to the HD SNP level were performed using within-breed reference sets of 776 Holstein and 546 Normande bulls that were genotyped with the Illumina BovineHD BeadChip (Illumina Inc., San Diego, CA) [28]. WGS variants were imputed from HD SNP genotypes using WGS of 2333 Bos taurus animals, from the 6th run of the 1000 Bull Genomes Project [26, 29]. These animals represent 51 cattle breeds and include 544 Holstein and 44 Normande individuals. In the Normande breed, the number of animals with whole-genome sequences was rather limited but we identified most of them as influential ancestor bulls with a high cumulated contribution to the breed (74%). We applied the protocol described in [30] and produced 23,781,173 and 23,610,986 autosomal variants for the Holstein and Normande cows, respectively. The precision of imputation from HD SNP to sequence level was assessed using the coefficient of determination (R2) calculated with the Minimac software [25]. In order to remove variants with low imputation accuracies, only variants with an R2 value higher than 30% and a MAF higher than 1% were retained for further association analyses, i.e. 7,914,731 and 7,673,760 variants for Holstein and Normande, respectively, with a mean R2 close to 81% (R2 classes between 30 and 80% included a limited number of variants).

Whole-genome sequence association analyses

We performed single-trait association analyses between all variants and the MAP resistance/susceptibility phenotypes (0/1/2 and 0/1). All association analyses were performed using the mlma option of the GCTA software (version 1.24), which applies a mixed linear model that includes the variant to be tested [31]. The categorical nature of the phenotype (2 or 3 classes) was not accounted for in this model assuming a normal distribution. where is the vector of individual phenotypes; is the overall mean; is the additive fixed effect of the variant to be tested for association; is the vector of predicted allele dosages, varying between 0 and 2; is the vector of random polygenic effects, with the genomic relationship matrix (GRM) that is calculated by using the HD SNP genotypes [32], and the polygenic variance that is estimated based on the null model and then fixed while testing for the association between each variant and the trait of interest; and is the vector of random residual effects, with the identity matrix and the residual variance. No other fixed effect was included in the model. Indeed, because of the experimental design, it was not possible to test the usual factors of variation such as herd effect (only a few selected records retained per herd) or effects of age or season. In order to account for multiple testing, we estimated the number of independent chromosome segments by applying the formula of Goddard [33] , with , the effective population size and , the length of the genome in Morgans. We assumed that five SNPs were needed to saturate and summarize each segment and we applied the Bonferroni correction to the thresholds by considering 50,000 independent tests. Therefore, the 5% genome-wide threshold of significance corresponded to a nominal P value of 10−6 (−log10(P) = 6). After identification of the lead variant in a given region, variants with significant effects that were located less than 500 kbp apart were grouped in the same QTL region. Then, the boundaries of QTL regions were determined by considering the positions of variants that were included in the upper third of the peak. The process was repeated with the next top variants. When two consecutive QTL regions had overlapping confidence intervals, they were grouped in a unique QTL region. Variants that were located within QTL regions were annotated using Variant Effect Predictor (VEP) [34] based on the UMD3.1 bovine reference genome assembly.

Conditional association analyses

In order to determine if the presence of multiple significant variants in a genomic region was a reflection of distinct causal mutations or due to linkage disequilibrium (LD) with a single causal mutation, conditional analyses were carried out in the most significant QTL regions using the cojo option of GCTA [35]. Conditional association analyses were performed by including in the model the most significant variant as a fixed effect and by testing all variants that were not in strong LD with the conditional variant (r2 < 0.9).

Results

GWAS results

Genetic and residual variances were estimated using the genomic relationship matrices calculated from HD genotypes. Heritability estimates for phenotypes 0/1 or 0/1/2 were similar and high for both Normande and Holstein cows; they ranged from 0.51 to 0.57 depending on the phenotype and on the breed (Table 2).
Table 2

Variance and heritability values estimated from the genomic relationship matrix calculated using HD genotypes

BreedPhenotypeGenetic varianceResidual varianceh2 (SE)a
Normande0/10.120.120.51 (0.15)
0/1/20.210.200.52 (0.15)
Holstein0/10.140.100.57 (0.09)
0/1/20.290.210.57 (0.09)

aNote that these heritability estimates are likely to be strongly overestimated, due to the selection of extreme phenotypes (see Discussion)

Variance and heritability values estimated from the genomic relationship matrix calculated using HD genotypes aNote that these heritability estimates are likely to be strongly overestimated, due to the selection of extreme phenotypes (see Discussion) Then, we tested the significance of effects for more than 7 million variants in Normande and Holstein cows for both the 0/1 and 0/1/2 phenotypes. In total, 2446 variants (SNPs or indels) presented significant effects (−log10(P) ≥ 6) in at least one analysis (Fig. 2). In Normande cows, we identified 41 variants with significant effects for the 0/1 phenotype, but no variant reached the level of significance for the 0/1/2 phenotype. In Holstein, 1566 and 1719 variants had significant effects for phenotypes 0/1 and 0/1/2, respectively. Variants with significant effects were located on BTA23 in Normande cows and on BTA12, 13, and 23 in Holstein cows (Fig. 2). In the Holstein breed, variants that exhibited the most significant effects were located on BTA13, with a −log10(P) value up to 38.1 for the 0/1/2 phenotype.
Fig. 2

−log10(P) values plotted against the position of variants on Bos taurus (BTA) autosomes for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows

−log10(P) values plotted against the position of variants on Bos taurus (BTA) autosomes for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows All variants with significant effects were grouped into four, eleven, and nine QTL for the Normande 0/1, Holstein 0/1, and Holstein 0/1/2 analyses, respectively (Table 3). The four QTL found in Normande cows for the 0/1 phenotype were located between 24.8 and 32.6 Mbp on BTA23 (Fig. 3). As shown in Fig. 3, the effects of this region were also very close to significance for the 0/1/2 phenotype in the same breed (−log10(P) = 5.99). In neighboring regions on BTA23, we also identified four QTL for the 0/1 phenotype (23.5–28.8 Mbp) and three QTL for the 0/1/2 phenotype (25.1–28.4 Mbp) in Holstein cows.
Table 3

Description of the QTL identified in GWAS analyses for the 0/1 or 0/1/2 phenotypes of Normande and Holstein cows

BreedPhenotypeQTLBTAConfidence intervalTOP variant (resistance allele)
Start (bp*)End (bp*)Number of significant variantsNumber of genes−log10(P)IDbp*Freq.EffectSEGeneFunctional annotation
Normande0/112324,804,03425,602,017736.4rs13451743125,575,7320.43− 0.170.03BOLA-DRB2Intron
Normande0/122326,952,21427,814,2321266.4rs11003150926,977,6080.37− 0.150.03NOTCH4Synonymous
Normande0/132328,576,81228,671,569727.0rs20851712428,605,2430.31− 0.170.03TRIM15Intron
Normande0/142332,128,35132,606,3312506.9rs10894202632,551,8690.27− 0.170.03Intergenic
Holstein0/111268,908,05669,745,89126829.4rs37881290369,314,3300.81− 0.160.03Intergenic
Holstein0/121269,774,74671,240,801281110.8rs4316123270,723,0870.88− 0.190.03Intergenic
Holstein0/131272,272,36772,416,27314408.5rs4304734872,351,5830.77− 0.150.03Intergenic
Holstein0/141275,283,61575,380,545607.7rs4310447475,379,7700.90− 0.180.03Intergenic
Holstein0/151276,760,85177,323,51121518.5rs37856858377,285,1890.76− 0.140.02Intergenic
Holstein0/161279,096,88080,067,268617.5rs37948845280,064,1500.84− 0.140.03UBAC2Intron
Holstein0/171363,497,96063,506,53220018.5rs10957020963,502,5660.91− 0.340.04Intergenic
Holstein0/182323,529,49123,976,4323917.1rs21037292623,956,9870.34− 0.110.02PKHD1Intron
Holstein0/192325,115,93325,582,10136038.8rs20918323625,181,6610.66− 0.120.02ELOVL5Intron
Holstein0/1102326,665,22027,629,951114288.2rs47036529326,733,1040.69− 0.170.03ENSBTAG00000023541Intron
Holstein0/1112327,630,24828,817,6101357.4rs10953904328,085,4100.81− 0.130.02IER3Upstream
Holstein0/1/211267,954,91269,097,4273106.9rs11005182168,949,6750.67− 0.210.04Intergenic
Holstein0/1/221269,162,59870,723,08726738.7rs4166708570,127,5190.91− 0.280.05ABCC4Intron
Holstein0/1/231271,239,31771,240,801306.7rs38200773771,240,8010.74− 0.250.05Intergenic
Holstein0/1/241272,272,36772,415,27112507.4rs46364461872,356,3810.95− 0.560.10Intergenic
Holstein0/1/251276,760,85177,323,5116517.0rs38358331677,253,5890.76− 0.170.03Intergenic
Holstein0/1/261363,497,96063,506,53220038.1rs10957020963,502,5660.91− 0.710.05Intergenic
Holstein0/1/272325,119,80825,602,06745638.9rs21065510425,554,2480.83− 0.240.04Intergenic
Holstein0/1/282326,680,41827,629,951345289.0rs47036529326,733,1040.69− 0.260.04ENSBTAG00000023541Intron
Holstein0/1/292327,759,54228,398,451937.4rs20928476228,012,2990.84− 0.210.04Intergenic

* Positions of variants on UMD3.1

Fig. 3

-log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 23 for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows

Description of the QTL identified in GWAS analyses for the 0/1 or 0/1/2 phenotypes of Normande and Holstein cows * Positions of variants on UMD3.1 -log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 23 for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows However, in Holsteins, the most significant effects were found on BTA13 (Fig. 4) and to a lesser extent on BTA12 (Fig. 5). The QTL detected on BTA13 was located in a very narrow 8-kpb region, and the variant showing the most significant effects (rs109570209) was located at 63,502,566 bp in both the 0/1 and 0/1/2 analyses. Moreover, although no significant QTL was observed in this region in Normande cows, the analysis revealed a narrow peak that was located exactly in the same region, and the variants with the highest −log10(P) values were located at 63,502,649 and 63,502,566 bp (Fig. 4). Finally, in Holstein cows, the remaining QTL were located on BTA12. For the 0/1 phenotype, six distinct QTL were located between 68.9 and 80.1 Mbp, while for the 0/1/2 phenotype five QTL were detected between 67.9 and 77.3 Mbp (Fig. 5).
Fig. 4

−log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 13 for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows

Fig. 5

−log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 12 for the 0/1 and 0/1/2 phenotypes of Holstein cows

−log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 13 for the 0/1 and 0/1/2 phenotypes of Normande and Holstein cows −log10(P) values plotted against the position of variants on Bos taurus (BTA) autosome 12 for the 0/1 and 0/1/2 phenotypes of Holstein cows It should be noted that these results were based on the UMD3.1 bovine reference genome assembly. In this assembly, the QTL regions on BTA12 and 23 are of limited quality and the corresponding number of QTL may be overestimated. Thus, we realigned these regions on the most-recent ARS-UCD1.2 assembly, with the results presented in Fig. 6. For the QTL on BTA13 and 23, we found few changes, i.e. the most-recent assembly was very similar to the UMD3.1 assembly for the variants located in these regions. In contrast, on BTA12, the analysis with the most recent assembly detected a smaller number of QTL (e.g., three for the 0/1/2 phenotype in Holstein versus five with the UMD3.1 assembly) located within a shorter interval (5 Mbp with ARS-UCD1.2 versus 9.4 Mbp with the UMD3.1 assembly).
Fig. 6

−log10(P) values plotted against the position of variants on UMD3.1 and ARS-UCD1.2 assemblies of Bos taurus (BTA) autosomes 12, 13, and 23 for the 0/1/2 phenotype of Holstein cows

−log10(P) values plotted against the position of variants on UMD3.1 and ARS-UCD1.2 assemblies of Bos taurus (BTA) autosomes 12, 13, and 23 for the 0/1/2 phenotype of Holstein cows

Functional annotations

Functional annotations revealed that 74% of the 1868 distinct variants located within the confidence intervals of the QTL were intergenic (65% for 0/1 in Normande, 81% for 0/1 in Holstein, and 70% for 0/1/2 in Holstein) (Table 4). Therefore, the remaining variants were located in genes, mainly in introns (16%) and upstream regions (7%), and more rarely in downstream regions (1.4%), exons (0.5% missense and 0.5% synonymous), 3′ UTR regions (0.3%), and 5′ UTR regions (0.1%).
Table 4

Functional annotations of variants with significant effects (−log10(P) ≥ 6) located within confidence intervals of the QTL

Functional annotationNormande 0/1Holstein 0/1Holstein 0/1/2Total distinct
Intergenic2611899301384
Intronic11232218303
Upstream234126130
Downstream082426
3′ UTR0255
5′ UTR0022
Synonymous1189
Missense0199
Total40146713221868
Functional annotations of variants with significant effects (−log10(P) ≥ 6) located within confidence intervals of the QTL Depending on the breed and the phenotype analyzed, the QTL linked with MAP resistance/susceptibility had confidence intervals that ranged in size from 1.5 kbp to 1.6 Mbp, and contained between 3 and 456 variants with significant effects. A minority of the QTL detected were located entirely in intergenic regions (one QTL in the Normande 0/1 analysis, on BTA23; three QTL in the Holstein 0/1 analysis, two on BTA12 and one on BTA13; and four QTL in the Holstein 0/1/2 analysis, three on BTA12 and one on BTA13), while all other QTL contained variants located in 1 to 28 distinct genes. All the QTL found on BTA23 in the three analyses were located in the major histocompatibility complex (MHC) region, which is known to be particularly gene-rich. In each of the three analyses, the QTL that contained the largest number of genes were located around 27 Mbp. Here, within an interval of 1 Mbp, 26 and 28 genes were detected in Normande and Holstein cows, respectively. Although a large number of genes was located on BTA23, we identified a limited number of positional candidate genes within the confidence intervals of all of the QTL that we detected: 31, 42, and 38 in the Normande 0/1, Holstein 0/1, and Holstein 0/1/2 analyses, respectively (Fig. 7). The majority of these genes (29) were located on BTA23 and found in all three analyses. Two other genes, also located on BTA23, were shared in the 0/1 analyses of both breeds, while nine genes (4 on BTA12 and 5 on BTA23) appeared to be specific to the Holstein breed only (0/1 and 0/1/2 phenotypes), and two genes were identified in only one analysis (Holstein 0/1).
Fig. 7

Number of overlapping candidate genes located in confidence intervals of QTL among Normande 0/1, Holstein 0/1, and Holstein 0/1/2 analyses and lists of corresponding genes

Number of overlapping candidate genes located in confidence intervals of QTL among Normande 0/1, Holstein 0/1, and Holstein 0/1/2 analyses and lists of corresponding genes Therefore, we were able to identify positional candidate genes in all QTL regions except the QTL located on BTA13, i.e. the region that presented the most significant effects in Holstein cows. The confidence interval of this QTL, which was very narrow, contained only intergenic variants. However, two genes are located in the upstream region (SNTA1 at 63,408,786–63,490,256 bp) and in the downstream region (CBFA2T2 at 63,632,327–63,670,697 bp) of this QTL. The results discussed so far were obtained following the removal of variants with poor imputation accuracy or low MAF. When, instead, we considered all the variants of this region, including those imputed with poor accuracy (R2 < 0.3) and/or with a MAF < 0.01, we obtained another picture of the GWAS results. Figure 8 presents these results for the 0/1/2 phenotype in both Normande and Holstein cows. In this region, variants with the highest imputation accuracies were located in the vicinity of the BovineSNP50 variant (rs110002750 at 63,500,701 bp) in the intergenic region between the SNTA1 and CBFA2T2 genes. The closest variants that were located in genes were imputed with a very low accuracy and were therefore excluded from the GWAS analysis. However, their significance levels were similar to those found for the intergenic variants with the most significant effects. Therefore, it is likely that the confidence interval of this QTL also includes these variants with low imputation R2, and is larger than initially indicated (~ 70 kb vs 1.5 kb), meaning that this QTL probably also comprises SNTA1 and CBFA2T2.
Fig. 8

−log10(P) (dots) and R2 (line) values, calculated by Minimac, plotted against the position of variants in the [63,480,000-63,550,000] interval on Bos taurus (BTA) autosome 13 for the 0/1/2 phenotype of Normande and Holstein cows. Intergenic variants in blue, Bovine SNP50 variant in red, SNTA1 genic variants in yellow, and CBFA2T2 genic variants in purple

−log10(P) (dots) and R2 (line) values, calculated by Minimac, plotted against the position of variants in the [63,480,000-63,550,000] interval on Bos taurus (BTA) autosome 13 for the 0/1/2 phenotype of Normande and Holstein cows. Intergenic variants in blue, Bovine SNP50 variant in red, SNTA1 genic variants in yellow, and CBFA2T2 genic variants in purple

Conditional GWAS

In order to distinguish multiple true causal mutations in a QTL region from those caused by LD, the variants with the most significant effects in each region, namely the top variants in Table 3 (11, 1, and 10 distinct variants for BTA12, 13, and 23, respectively), were tested in conditional analyses. Each of the top variants was individually included in the mixed model (1) as a fixed effect in addition to the variant to be tested in analyses of Holstein cows for both the 0/1 and 0/1/2 phenotypes (see Additional file 1: Figure S1). Of the 11 variants added in the model for BTA12, only one removed the effects of all other variants tested on this chromosome, and only for the 0/1/2 phenotype. This variant was located at 70,127,519 bp (rs41667085) in an intronic region of the ABCC4 gene, which encodes the multidrug resistance-associated protein 4, and was the variant on BTA12 that presented the most significant effect on the 0/1/2 phenotype (−log10(P) = 8.7). Interestingly, for the 0/1 phenotype, it ranked 62nd in the peak (−log10(P) = 10.3); instead, the variant with the most significant effect on this phenotype was located at 70,723,087 bp (−log10(P) = 10.8). For the 0/1 phenotype, none of the 11 variants tested in the conditional analyses completely removed the peak. However, two variants located in the ABCC4 gene, at 70,053,503 and 70,052,385 bp, had significant effects on the 0/1 phenotype and ranked 8th and 9th, respectively, in the QTL peak. Thus, this gene is a strong candidate also for the 0/1 phenotype. Conditional analyses of BTA13 included both the variant located at 63,502,566 bp (rs109570209), which had the most-significant effects on both the 0/1 and the 0/1/2 phenotypes, but also the neighboring variants located at 63,502,649 (rs380877320) and 63,500,701 (rs110002750), that had effects that were closest to the level of significance in Normande cows. None of these intergenic variants completely removed the signal for either of the two phenotypes. Of the ten variants included in conditional analyses of BTA23, five completely removed the peak for both phenotypes. Three were located in genes, at 25,181,661 bp (rs209183236) in ELOVL5 (intron), 26,733,104 bp (rs470365293) in ENSBTAG00000023541 (intron), and 28,085,410 bp (rs109539043) in IER3 (upstream region). The two others were located in intergenic regions at 25,554,248 bp (rs210655104) and 28,012,299 bp (rs209284762).

Discussion

Individual diagnostic tests for paratuberculosis are known to lack sensitivity and specificity, and these characteristics have been put forward as explanations for the low heritability of MAP-related traits that has been reported in the literature. Since the study of Settles et al. in 2009 [9] to the most recent publication in 2019, 13 GWAS analyses have been conducted for traits linked with bovine paratuberculosis [6-18]. These studies examined various numbers of animals and SNPs, and all succeeded in identifying genomic regions associated with MAP resistance/susceptibility. However, when taken together the results were generally poorly concordant. As mentioned before, many factors are likely contributing to these discrepancies, but one of the main ones, in particular, is the difficulty of properly characterizing the MAP status of cows due to (1) the long latency and incubation periods of the disease and (2) the lack of sensitivity of and concordance between milk ELISA and fecal culture. In our study, we put a great deal of effort into defining accurate phenotypes. Control cows were chosen from affected herds and among those born in the same month as confirmed cases, this strategy being designed to increase the probability of their exposure to MAP. In addition, control cows were tested at least four times, as successive tests are known to be often inconsistent [5], and all cows had to be old enough for the latency period to be over, to limit the risk of false negative results. These constraints led to the exclusion of many animals from the study. Affected animals were selected from those that had previously been tested positive and their status was confirmed here with both ELISA and PCR tests; therefore, in order to include an affected cow for further analysis, we required both blood and fecal samples, and the lack of samples for many clinical cases resulted in their exclusion from the study. Because of all these conditions, only a small proportion (~ 5%) of all potential animals were included in the analysis. All confirmation tests were performed in a single laboratory with the same ELISA and PCR testing protocols throughout in order to make the results more reliable and comparable. Consequently, we are confident that the phenotypes used here accurately reflected the true status of the cows regarding MAP infection: not affected or affected with or without clinical signs. The case–control design modified the distribution of the phenotypes and concentrated on the most extreme animals. In addition, it contributed to a much more balanced distribution of phenotypes than in the overall population of affected herds. Altogether, these choices resulted in heritability estimates that were much higher (around 50%) than those previously reported in the literature (3 to 27%) [6]. By removing the intermediate phenotypes, which were considered to be of uncertain status, our study likely overestimated heritability coefficients through the selection of individuals with extreme phenotypes. However, a part of our higher estimate can be explained by our focus on better-defined and thus likely more-heritable phenotypes. Nevertheless, these high estimates must be considered as specific to this selected study group, which strongly deviates from the general population, and are likely to be biased by this kind of selective genotyping. All cows were genotyped with the Illumina SNP50 BeadChip. We intentionally did not use a low-density chip in order to reduce the potential impact of the reduced accuracy of imputation to 50 K. Imputation to HD, using 788 Holstein and 551 Normande key ancestors as a reference, is known to be very accurate [28]. We were then able to take advantage of the work of the 1000 Bull Genomes project to impute genotypes of the cows at the WGS level to directly identify candidate variants for resistance to paratuberculosis in Holstein and Normande cows. Our original design was intended to be the same in both Holstein and Normande breeds, with a full dataset for approximately 1500 cows in each breed. However, Holstein and Normande cows represent 64% and 9% of French herds, respectively, and it was much more difficult to recruit Normande than Holstein cows. For this reason, the final dataset was much smaller in the Normande (649) than in the Holstein (1644) breed, which clearly affected detection power; with a predefined significance threshold (−log10(P) ≥ 6), we identified more QTL regions with more significant effects in Holstein than in Normande cows. In the Holstein analysis, we detected 9 (0/1/2 phenotype) and 11 (0/1 phenotype) QTL, located on BTA12, 13, and 23, while only four significant QTL (0/1 phenotype), all located on BTA23, reached the significance level in the Normande analysis. Nevertheless, by analyzing both breeds independently, we were able to confirm the effects of the QTL located on BTA13 (with significant effects in Holstein and close to the significance threshold in Normande) and BTA23 (significant effects in both breeds). Another strategy would be to validate the three Holstein QTL in the Normande breed. Accordingly, the Bonferroni correction is limited to the number of candidate SNPs tested in the Normande breed. With this approach, the QTL on BTA13 would be validated (−log10(P) = 4.7) whereas the QTL on BTA12 would remain non-significant. However, additional data are needed in the Normande breed to increase the size of the population examined and improve the balance between numbers of case and control phenotypes. By arbitrarily defining QTL regions in 1-Mbp-intervals, we identified several QTL on BTA12 and 23. However, for each of these chromosomes, conditional GWAS led to the identification of variants that each explained all the effects detected on BTA12 (rs41667085 in the ABCC4 gene) and on BTA23 (rs209183236 in ELOVL5, rs4703655293 in ENSBTAG00000023541, rs109539043 in IER3, and rs210655104 and rs209284762 in intergenic regions). These results suggest that the identified variants could be the causative variants, or at least in strong LD with the causative variants. On both BTA12 and 23, we identified a large region that contains variants with significant effects. It could be due to the long-range LD that exists in these regions and/or, as shown by large segments with low imputation accuracy in QTL regions on BTA12 and 23, to local misassemblies of the bovine reference genome assembly UMD3.1. Moreover, for some QTL, conditional analyses failed to remove the peak. This could indicate that several causal mutations that affect resistance to paratuberculosis are located in this region. Indeed, it is important to note that all these results were obtained using the UMD3.1 reference genome assembly, as for run6 of the 1000 Bull Genomes project, which was used for WGS imputation. When we compared the results obtained from different genome assemblies (ARS-UCD1.2 versus UMD3.1), the QTL regions on BTA13 and 23 were relatively well conserved, and we did not observe major changes for the variants located in these peaks. Instead, the QTL identified on BTA12 (a region of limited quality in UMD3.1) were located within a narrower peak in the ARS-UCD1.2 genome assembly (5 Mbp, versus 9.4 Mbp with UMD3.1 for the 0/1/2 phenotype), which was more consistent with the results that we obtained from the conditional analyses (a single QTL in the region). Notably, although this study estimated relatively high heritability values for resistance to paratuberculosis, we found only a limited number of genomic regions associated with this trait in both breeds. Nevertheless, in Holstein cows, when we analyzed only the QTL with the most significant effects on each chromosome, the cumulative effects of the detected QTL explained 16 and 28% of the genetic variance of the 0/1 and 0/1/2 phenotypes, respectively. For both phenotypes, the QTL that explained the largest phenotypic variance was located on BTA13 and was responsible for 8% (0/1 phenotype) and 16% (0/1/2 phenotype) of the genetic variance, despite the fact that the resistance allele was present at a high frequency (0.91). Other QTL individually explained between 2 and 6% of the total phenotypic variance. Several previous GWAS analyses have reported QTL on BTA23 that are associated with resistance to MAP, in the vicinity of the MHC [7–9, 16, 18], while only one previous study, a meta-analysis conducted in US Holsteins based on 50 K SNP genotypes, found QTL at ~ 70 Mbp on BTA12 and ~ 65 Mbp on BTA13 [8]. McGovern et al. [18], who performed a GWAS on imputed whole-genome sequences, identified two SNPs associated with the humoral response to MAP on BTA13, at 62,037,755 bp and 66,373,805 bp, i.e. on either side of but quite far from the QTL we detected (63,497,960–63,506,532 bp) and outside its confidence interval. In this study, we did not find the QTL detected on the other chromosomes and, in particular, those located on BTA1, 6 or 7, which were the most commonly detected QTL for resistance to paratuberculosis in Holstein cows [6–12, 15, 16, 18, 36]. In each of the QTL regions detected here, we were able to identify genes of interest, some of which have been previously associated with traits related to the intestine or with responses to infection in mice, rats, or humans. Both ABCC4 and CBFA2T2 have been associated with intestinal inflammation and abnormal intestinal morphology (mucosa, goblet cell, enteroendocrine cell, or epithelium). Of all the genes located in the vicinity of the MHC on BTA23, C4A, ENSBTAG00000006864, and IER3 have been linked to abnormal intestinal morphology or physiology, whereas PKHD1, ENSBTAG00000013919, ENSBTAG00000038397, and SKIV2L are involved in bowel diseases (Crohn’s disease, ulcerative colitis, syndromic diarrhea, gastrointestinal ulcer, or tricho-hepato-enteric-syndrome). Some of these genes have also been associated with an abnormal response to infection (C4A, ENSBTAG00000006864, and IER3), increased/decreased susceptibility to bacterial infection (C4A and ENSBTAG00000006864), or induced colitis (ABCC4 and IER3). As bovine paratuberculosis is an enteric disease caused by the MAP bacterium leading to granulomatous enteritis, all of these genes are good functional candidates to explain inter-individual differences in resistance/susceptibility to paratuberculosis. In the QTL regions detected on BTA12, 13, and 23, the best candidates appear to be, respectively, ABCC4, CBFA2T2, and IER3 because they contain (ABCC4 and IER3) or are the closest (CBFA2T2) to the variants with the most significant effects. In addition, we note here that our analyses of control/case (0/1) and control/non-clinical case/clinical case (0/1/2) phenotypes did not lead to exactly the same results. Although heritability estimates of both phenotypes were very similar in both breeds, there were differences in the GWAS results. In Holstein and Normande cows, depending on the phenotype, the number and identity of QTL detected differed (larger number for the 0/1 phenotype), as well as the functional candidate genes, probably reflecting different biological functions that contribute to each phenotype. For example, analysis of the 0/1 phenotype led to the detection of more QTL on BTA23 in both breeds and on BTA12 in Holsteins. In addition, on BTA12, QTL effects were more significant for the 0/1 phenotype than for the 0/1/2 phenotype. In contrast, the effects detected on BTA13 were much more significant for the 0/1/2 phenotype (−log10(P) = 38.1) than for the 0/1 phenotype (−log10(P) = 18.5). Since all the cows in this study were born in the same herd and within the same period as the infected cows, and therefore probably exposed to MAP, the control/case phenotype, which was mainly associated with the QTL located on BTA12 and 23, should reflect a cow’s ability to be resistant to MAP. Instead, distinguishing between non-clinical and clinical cases reflects the potential for a MAP-infected animal to postpone manifestation of clinical signs of the disease. Thus, these results appear to be concordant with the best functional candidate genes found in each QTL region, with the ABCC4 (BTA12) and IERC (BTA23) genes appearing to be directly involved in responses to infection, whereas the CBFA2T2 gene (BTA13) was previously found to be associated with intestinal inflammation or abnormal morphology in mice.

Conclusions

In this study, we demonstrate that a focus on the most accurate phenotypes increases heritability estimates. These accurate phenotypes, combined with genotypes imputed to the whole-genome sequence level, made it possible to identify three chromosomal regions with important effects on resistance/susceptibility to MAP. In each of these regions, we were able to pinpoint one candidate gene that could be functionally related to MAP infection (ABCC4, CBFA2T2, and IER3) with candidate variants that could be either causal variants or in strong LD with the causal variants. Due to the large percentage of genetic variance explained, these QTL merit inclusion in future genomic evaluations with an appropriate weight. However, these QTL do not explain all of the relevant genetic variance, and the best model may very well end up containing QTL and markers from throughout the whole genome. Our results for the Holstein breed are very encouraging for strategies that aim at implementing selection for improved resistance to MAP. In the Normande breed, the study design was too limited to allow us to draw any general conclusions; efforts are underway to enlarge the sample pool in order to increase detection power. Additional file 1: Figure S1. −log(P-value) plotted against the position of variants detected by GWAS (in grey) and conditional GWAS (GWAS_COJO; in blue).
  34 in total

1.  GCTA: a tool for genome-wide complex trait analysis.

Authors:  Jian Yang; S Hong Lee; Michael E Goddard; Peter M Visscher
Journal:  Am J Hum Genet       Date:  2010-12-17       Impact factor: 11.025

2.  Candidate genes associated with the heritable humoral response to Mycobacterium avium ssp. paratuberculosis in dairy cows have factors in common with gastrointestinal diseases in humans.

Authors:  S P McGovern; D C Purfield; S C Ring; T R Carthy; D A Graham; D P Berry
Journal:  J Dairy Sci       Date:  2019-03-07       Impact factor: 4.034

3.  Identification of loci associated with susceptibility to subspecies () tissue infection in cattle.

Authors:  J N Kiser; S N White; K A Johnson; J L Hoff; J F Taylor; H L Neibergs
Journal:  J Anim Sci       Date:  2017-03       Impact factor: 3.159

Review 4.  A review of prevalences of paratuberculosis in farmed animals in Europe.

Authors:  Søren Saxmose Nielsen; Nils Toft
Journal:  Prev Vet Med       Date:  2008-09-24       Impact factor: 2.670

Review 5.  Ante mortem diagnosis of paratuberculosis: a review of accuracies of ELISA, interferon-gamma assay and faecal culture techniques.

Authors:  S S Nielsen; N Toft
Journal:  Vet Microbiol       Date:  2008-01-03       Impact factor: 3.293

6.  Meta-analysis of two genome-wide association studies of bovine paratuberculosis.

Authors:  Giulietta Minozzi; John L Williams; Alessandra Stella; Francesco Strozzi; Mario Luini; Matthew L Settles; Jeremy F Taylor; Robert H Whitlock; Ricardo Zanella; Holly L Neibergs
Journal:  PLoS One       Date:  2012-03-02       Impact factor: 3.240

7.  Longitudinal data collection of Mycobacterium avium subspecies Paratuberculosis infections in dairy herds: the value of precise field data.

Authors:  Ynte H Schukken; Robert H Whitlock; Dave Wolfgang; Yrjo Grohn; Annabelle Beaver; JoAnn VanKessel; Mike Zurakowski; Rebecca Mitchell
Journal:  Vet Res       Date:  2015-06-19       Impact factor: 3.683

8.  Consequences of splitting whole-genome sequencing effort over multiple breeds on imputation accuracy.

Authors:  Aniek C Bouwman; Roel F Veerkamp
Journal:  BMC Genet       Date:  2014-10-03       Impact factor: 2.797

9.  Genome-wide association study of susceptibility to infection by Mycobacterium avium subspecies paratuberculosis in Holstein cattle.

Authors:  Fazli Alpay; Yalda Zare; Mamat H Kamalludin; Xixia Huang; Xianwei Shi; George E Shook; Michael T Collins; Brian W Kirkpatrick
Journal:  PLoS One       Date:  2014-12-04       Impact factor: 3.240

10.  Evaluation of the accuracy of imputed sequence variant genotypes and their utility for causal variant detection in cattle.

Authors:  Hubert Pausch; Iona M MacLeod; Ruedi Fries; Reiner Emmerling; Phil J Bowman; Hans D Daetwyler; Michael E Goddard
Journal:  Genet Sel Evol       Date:  2017-02-21       Impact factor: 4.297

View more
  9 in total

1.  Identification of loci associated with susceptibility to Mycobacterium avium subsp. paratuberculosis infection in Holstein cattle using combinations of diagnostic tests and imputed whole-genome sequence data.

Authors:  Maria Canive; Oscar González-Recio; Almudena Fernández; Patricia Vázquez; Gerard Badia-Bringué; José Luis Lavín; Joseba M Garrido; Ramón A Juste; Marta Alonso-Hearn
Journal:  PLoS One       Date:  2021-08-27       Impact factor: 3.240

2.  Increased accuracy of genomic predictions for growth under chronic thermal stress in rainbow trout by prioritizing variants from GWAS using imputed sequence data.

Authors:  Grazyella M Yoshida; José M Yáñez
Journal:  Evol Appl       Date:  2021-05-18       Impact factor: 4.929

3.  Genome-wide association analysis identified both RNA-seq and DNA variants associated to paratuberculosis in Canadian Holstein cattle 'in vitro' experimentally infected macrophages.

Authors:  Olivier Ariel; Jean-Simon Brouard; Andrew Marete; Filippo Miglior; Eveline Ibeagha-Awemu; Nathalie Bissonnette
Journal:  BMC Genomics       Date:  2021-03-07       Impact factor: 3.969

4.  Identification of Candidate Variants Associated With Bone Weight Using Whole Genome Sequence in Beef Cattle.

Authors:  Qunhao Niu; Tianliu Zhang; Ling Xu; Tianzhen Wang; Zezhao Wang; Bo Zhu; Xue Gao; Yan Chen; Lupei Zhang; Huijiang Gao; Junya Li; Lingyang Xu
Journal:  Front Genet       Date:  2021-11-29       Impact factor: 4.599

5.  Genome-Wide Association Study of Body Conformation Traits by Whole Genome Sequencing in Dazu Black Goats.

Authors:  Bowen Gu; Ruifan Sun; Xingqiang Fang; Jipan Zhang; Zhongquan Zhao; Deli Huang; Yuanping Zhao; Yongju Zhao
Journal:  Animals (Basel)       Date:  2022-02-23       Impact factor: 2.752

6.  Genomic prediction with whole-genome sequence data in intensely selected pig lines.

Authors:  Roger Ros-Freixedes; Martin Johnsson; Andrew Whalen; Ching-Yi Chen; Bruno D Valente; William O Herring; Gregor Gorjanc; John M Hickey
Journal:  Genet Sel Evol       Date:  2022-09-24       Impact factor: 5.100

Review 7.  Genome-wide association studies for the identification of cattle susceptible and resilient to paratuberculosis.

Authors:  Marta Alonso-Hearn; Gerard Badia-Bringué; Maria Canive
Journal:  Front Vet Sci       Date:  2022-09-12

8.  New insights into the genetic resistance to paratuberculosis in Holstein cattle via single-step genomic evaluation.

Authors:  Marie-Pierre Sanchez; Thierry Tribout; Sébastien Fritz; Raphaël Guatteo; Christine Fourichon; Laurent Schibler; Arnaud Delafosse; Didier Boichard
Journal:  Genet Sel Evol       Date:  2022-10-15       Impact factor: 5.100

9.  Combining Random Forests and a Signal Detection Method Leads to the Robust Detection of Genotype-Phenotype Associations.

Authors:  Faisal Ramzan; Mehmet Gültas; Hendrik Bertram; David Cavero; Armin Otto Schmitt
Journal:  Genes (Basel)       Date:  2020-08-05       Impact factor: 4.096

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.