| Literature DB >> 22164200 |
Colleen A Fisher1, Eric K Bhattarai, Jason B Osterstock, Scot E Dowd, Paul M Seabury, Meenu Vikram, Robert H Whitlock, Ynte H Schukken, Robert D Schnabel, Jeremy F Taylor, James E Womack, Christopher M Seabury.
Abstract
Members of the Toll-like receptor (TLR) gene family occupy key roles in the mammalian innate immune system by functioning as sentries for the detection of invading pathogens, thereafter provoking host innate immune responses. We utilized a custom next-generation sequencing approach and allele-specific genotyping assays to detect and validate 280 biallelic variants across all 10 bovine TLR genes, including 71 nonsynonymous single nucleotide polymorphisms (SNPs) and one putative nonsense SNP. Bayesian haplotype reconstructions and median joining networks revealed haplotype sharing between Bos taurus taurus and Bos taurus indicus breeds at every locus, and specialized beef and dairy breeds could not be differentiated despite an average polymorphism density of 1 marker/158 bp. Collectively, 160 tagSNPs and two tag insertion-deletion mutations (indels) were sufficient to predict 100% of the variation at 280 variable sites for both Bos subspecies and their hybrids, whereas 118 tagSNPs and 1 tagIndel predictively captured 100% of the variation at 235 variable sites for B. t. taurus. Polyphen and SIFT analyses of amino acid (AA) replacements encoded by bovine TLR SNPs indicated that up to 32% of the AA substitutions were expected to impact protein function. Classical and newly developed tests of diversity provide strong support for balancing selection operating on TLR3 and TLR8, and purifying selection acting on TLR10. An investigation of the persistence and continuity of linkage disequilibrium (r2≥0.50) between adjacent variable sites also supported the presence of selection acting on TLR3 and TLR8. A case-control study employing validated variants from bovine TLR genes recognizing bacterial ligands revealed six SNPs potentially eliciting small effects on susceptibility to Mycobacterium avium spp paratuberculosis infection in dairy cattle. The results of this study will broadly impact domestic cattle research by providing the necessary foundation to explore several avenues of bovine translational genomics, and the potential for marker-assisted vaccination.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22164200 PMCID: PMC3227585 DOI: 10.1371/journal.pone.0027744
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Relationship between minor allele frequencies estimated from pyrosequencing and allele-specific genotyping of 96 individuals from 31 breeds.
| Bovine Gene | Total 454 SNPs | Overall Correlation (r) | Overall RSQ (r2) |
|
| 4 | 0.998 | 0.996 |
|
| 44 | 0.935 | 0.874 |
|
| 39 | 0.958 | 0.918 |
|
| 28 | 0.948 | 0.898 |
|
| 39 | 0.942 | 0.887 |
|
| 15 | 0.879 | 0.773 |
|
| 15 | 0.959 | 0.920 |
|
| 13 | 0.877 | 0.769 |
|
| 22 | 0.975 | 0.950 |
|
| 25 | 0.749 | 0.561 |
| Totals/Avg | 244 | 0.922 | 0.855 |
Total SNPs detected via pyrosequencing.
P<0.05 for all TLR genes.
RSQ is the squared correlation coefficient (r2).
Figure 1For validated bovine TLR SNPs detected via pyrosequencing (n = 244), a regression analysis was performed for pyrosequencing allele frequency (AF) estimates corresponding to the true minor alleles.
(<0.5), as defined by allele-specific genotyping assays, and minor AFs (MAFs) directly ascertained by genotyping (n = 96 elite sires; 31 breeds). The true minor alleles (<0.5) were correctly identified for 236/244 (97%) SNPs via pyrosequencing. This analysis provided strong statistical evidence (P = 0.999846; Ho: slope = 1) for little or no bias in the pyrosequencing-based estimates of allele frequency.
Summary data for validated polymorphisms detected in the bovine TLR gene family.
| BovineGene | BTA Assign | TotalHaps | Sires Phased | MAF≤0.10 | Avg r2 all | Avg r2
| Valid.SNPs | HapSNPs | Valid.Indels | Valid.nsSNPs | RegionSize | QTL orAssoc. |
|
| BTA6 | 8 | 547 | 3 | 0.24 | 0.49 | 5 | 5 | 0 | 2 | 2.184 | Q |
|
| BTA17 | 38 | 532 | 38 | 0.19 | 0.24 | 44 | 41 | 1 | 20 | 3.224 | Q, A |
|
| BTA27 | 40 | 78 | 20 | 0.29 | 0.57 | 56 | 52 | 0 | 3 | 9.469 | A |
|
| BTA8 | 29 | 532 | 23 | 0.10 | 0.08 | 28 | 27 | 0 | 7 | 3.470 | Q, A |
|
| BTA16 | 29 | 526 | 29 | 0.20 | 0.31 | 43 | 41 | 3 | 9 | 5.334 | No |
|
| BTA6 | 20 | 526 | 13 | 0.09 | 0.12 | 15 | 15 | 0 | 6 | 2.327 | Q, A |
|
| BTAX | 9 | 96 | 7 | 0.28 | 0.28 | 15 | 15 | 0 | 1 | 4.285 | Q |
|
| BTAX | 6 | 96 | 1 | 0.70 | 0.69 | 13 | 13 | 0 | 8 | 3.702 | Q |
|
| BTA22 | 20 | 545 | 9 | 0.27 | 0.29 | 22 | 22 | 0 | 3 | 5.033 | Q |
|
| BTA6 | 43 | 524 | 34 | 0.27 | 0.15 | 35 | 35 | 0 | 13 | 3.859 | Q |
|
|
|
|
|
|
|
|
|
|
|
|
BTA assignments based on NCBI Refseq (Btau5.2).
Total number of haplotypes predicted from all validated markers and best pair reconstructions [31] with probabilities≥0.90.
Number of cattle exhibiting best pair phase probabilities≥0.90. BTAX haplotypes were directly ascertained. 96 animals were genotyped for TLR3, TLR7, and TLR8. For all other loci, 549 animals were genotyped.
Number of polymorphisms with minor allele frequencies≤0.10.
Average intragenic linkage disequilibrium (r2) values estimated for adjacent SNP and indel sites for all cattle or for B. t. taurus (B.t.t.).
Number of putative SNPs validated as polymorphic.
Number of validated SNPs incorporated into discrete haplotypes.
Number of putative indels validated as polymorphic.
Number of nonsynonymous SNPs validated as polymorphic, including the putative TLR5 nonsense SNP.
Size of the genic region. Kb = Kilobase.
Bovine health-related QTL overlapping or proximal to investigated gene (Q), or intragenic variation associated (A) with disease susceptibility in case-control studies [19]-[27], [46].
Tentative association in this study.
Figure 2Median joining (MJ) haplotype networks for bovine TLR3 using haplotypes predicted for all cattle (n = 96 AI sires, 31 breeds).
Because MJ networks require the absence of recombination [66], each network represents intragenic regions of elevated LD. Haplotypes predicted for B. t. taurus, B. t. indicus and hybrids (termed “composites”) are color coded. Numbers indicate SNP positions in numerical order (see Table S2 for SNP information). Node sizes are proportional to haplotype frequency, and all branch lengths are drawn to scale. Alphabetized letters at nodes represent the breed distribution of each haplotype (Table S4). Median vectors are indicated as “mv”.
Figure 3Median joining (MJ) haplotype network for bovine TLR8 using haplotypes directly ascertained for all cattle (n = 96 AI sires, 31 breeds).
Haplotypes observed for B. t. taurus, B. t. indicus and hybrids (termed “composites”) are color coded. Numbers indicate SNP positions in numerical order (see Table S2 for SNP information). Node sizes are proportional to haplotype frequency, and all branch lengths are drawn to scale. Alphabetized letters at nodes represent the breed distribution of each haplotype (Table S4).
Figure 4Median joining (MJ) haplotype network for bovine TLR10 using haplotypes predicted for all cattle (n = 96 AI sires, 31 breeds; 48 Purebred Angus; 405 Holstein cattle).
Haplotypes predicted for B. t. taurus, B. t. indicus and hybrids (termed “composites”) are color coded. Numbers indicate SNP positions in numerical order (see Table S2 for SNP information). Node sizes are proportional to haplotype frequency, and all branch lengths are drawn to scale. Alphabetized letters at nodes represent the breed distribution of each haplotype (Table S4). Notably, given the complexity of the network, only nodes representing≥10 cattle are labeled (A-F), which collectively represents>93% of the cattle meeting the phase requirements (n = 524 cattle with best-pair probabilities≥0.90). Median vectors are indicated as “mv”.
Summary data for 22 nonsynonymous SNPs and one putative nonsense SNP predicted to impact protein function.
| Bovine Gene | SNP | dbSNP ID | GenBank Protein ID | AA Subst. | Protein Domain | PolyPhen Result | SIFT Result | SNP Freq |
|
| G>T |
| NP_776622.1 | W119L | LRR_TYP1 | PrD | AF | 0.008 |
| T>A |
| NP_776622.1 | F227L | NCP | PsD | T | 0.015 | |
| C>T |
| NP_776622.1 | T311M | NCP | PrD | AF | 0.006 | |
| C>T |
| NP_776622.1 | S485F | LRR_TYP2 | PrD | AF | 0.015 | |
| G>A |
| NP_776622.1 | R563H | LRRCT | B | AF | 0.066 | |
| G>C |
| NP_776622.1 | E738Q | TIR | PsD | AF | 0.001 | |
|
| G>A |
| NP_001008664.1 | G426S | LRR8 | PsD | AF | 0.058 |
| G>T |
| NP_001008664.1 | S664I | LRRCT | PsD | T | 0.432 | |
|
| A>C |
| NP_776623.5 | N151T | LRR3 | PsD | T | 0.009 |
| A>G |
| NP_776623.5 | K381R | LRR6 | B | AF | 0.005 | |
| A>G |
| NP_776623.5 | H587R | LRRCT | PrD | AF | 0.003 | |
|
| C>T |
| NP_001035591.1 | R125* | NCP | PsD | ND | 0.053 |
| G>A |
| NP_001035591.1 | R262H | NCP | PrD | T | 0.004 | |
| C>G |
| NP_001035591.1 | F643L | NCP | B | AF | 0.003 | |
|
| T>G |
| NP_001001159.1 | L43R | NCP | PrD | AF | 0.003 |
| A>G |
| NP_001001159.1 | R87G | LRR1 | B | AF | 0.017 | |
| T>A |
| NP_001001159.1 | F494I | LRR5 | PrD | AF | 0.024 | |
|
| A>G |
| NP_001028933.1 | N439S | NCP | PrD | AF | 0.021 |
|
| G>A |
| ABQ52584.1 | S477N | NCP | B | AF | 0.562 |
| A>C |
| ABQ52584.1 | K903T | TIR | PsD | AF | 0.010 | |
|
| G>A |
| NP_001070386.1 | R18H | SigPep | PsD | T | 0.018 |
| C>G |
| NP_001070386.1 | I134M | LRR3 | B | AF | 0.013 | |
| A>C |
| NP_001070386.1 | K753T | TIR | PsD | AF | 0.010 |
SNPs with “rs” numbers were previously described [23]-[25], [30], [59] and validated in this study.
Amino acid (AA) substitutions predicted from corresponding SNPs, GenBank Proteins, and previous studies [23]-[25], [30], [60].
Protein domain locations predicted by SMART (http://smart.embl-heidelberg.de/). Only confidently predicted domains are depicted (NCP = no confident prediction; LRRs are named in order of prediction).
Results from PolyPhen and SIFT [38]-[39]. Results other than “Benign (B)” or “Tolerated (T)” are predicted to be Possibly Damaging (PsD), Probably Damaging (PrD), or Affect Protein Function (AF). SIFT could not be used (ND) to model the TLR5 putative nonsense SNP.
Observed frequency of nonsynonymous SNP allele across all 31 cattle breeds.
The frequency of this SNP in U.S. dairy cattle (n = 405, 3 Herds) was 0.068.
Summary data for tests of selection across all members of the bovine TLR gene family.
| Gene | SiresPhased | Tajima's | Coalescent | Sires Phased | Tajima's | Coalescent |
|
| 95 (99%) | 0.55535 |
| 64 (98%) | 1.49328 |
|
|
| 92 (96%) | 0.51385 |
| 64 (98%) | -0.06547 |
|
|
| 78 (81%) | 2.35965 |
| 54 (83%) | 3.63792 |
|
|
| 83 (86%) | 2.12744 |
| 59 (91%) | 3.59176 |
|
|
| 94 (98%) | 2.07897 |
| 63 (97%) | 2.65634 |
|
|
| 89 (93%) | -0.83191 |
| 64 (98%) | 0.93683 |
|
|
| 86 (90%) | 0.69344 |
| 59 (91%) | 0.44166 |
|
|
| 91 (95%) | 0.16727 |
| 65 (100%) | -0.71248 |
|
|
| 96 (100%) | -0.19828 |
| 65 (100%) | -0.17037 |
|
|
| 96 (100%) | 3.53957 |
| 65 (100%) | 3.28763 |
|
|
| 95 (99%) | 1.15800 |
| 64 (98%) | 1.26794 |
|
|
| 92 (96%) | -0.29809 |
| 61 (94%) | -1.78285 |
|
Number and proportion of cattle from the sequencing discovery panel with best-pair phase probabilities≥0.90 for all cattle (n = 96), and for B. t. taurus cattle (n = 65).
Tajima's D statistic [43] for all cattle and for B. t. taurus breeds.
Significance levels were estimated by coalescent simulation using 10,000 replicates [67]. All bolded loci were also significant (P<0.05) via application of the beta distribution [67].
Phased variation within TLR3 Network 1 and TLR3 Network 2.
Significant after correction for multiple tests (α / n locus-specific tests; α = 0.05).
Significant after adding in the best-pairs of haplotypes for taurine sires with probabilities<0.90 and correction for multiple testing (α = 0.05).
Figure 5Relationship between the number of validated SNPs and SNP diversity here denoted as the effective number of SNPs across all 10 TLR loci in A) all cattle, and B) taurine cattle.
The linear regressions and estimated 95% confidence intervals are shown in each panel.
Summary statistics for single marker association tests with risk of Mycobacterium avium spp paratuberculosis infection.
| 95% Confidence Interval | ||||||
| Marker | dbSNP ID | Model | Odds Ratio |
| Lower Bound | Upper Bound |
|
|
| Recessive | 3.20 | 0.032 | 1.11 | 9.24 |
|
|
| Recessive | 3.21 | 0.031 | 1.11 | 9.25 |
|
|
| Recessive | 2.51 | 0.020 | 1.15 | 5.48 |
|
|
| Additive | 0.27 | 0.026 | 0.09 | 0.86 |
|
|
| Additive | 2.58 | 0.012 | 1.23 | 5.43 |
|
|
| Additive | 0.53 | 0.041 | 0.29 | 0.97 |
95% Confidence interval for odds ratio.
Odds ratio adjusted for the effect of birth year.
P-value not corrected for multiple comparisons.
P-value marginal (0.053) after locus-specific FDR correction [45] (http://sdmproject.com/utilities/?show=FDR).
P-value<0.05 after locus-specific FDR correction [45] (http://sdmproject.com/utilities/?show=FDR). TLR1, TLR6, and TLR10 were considered a single locus for multiple test correction.