| Literature DB >> 19527513 |
Tim Downing1, David J Lynn, Sarah Connell, Andrew T Lloyd, A K Bhuiyan, Pradeepa Silva, A N Naqvi, Rahamame Sanfo, Racine-Samba Sow, Baitsi Podisi, Olivier Hanotte, Cliona O'Farrelly, Daniel G Bradley.
Abstract
BACKGROUND: The comparative analysis of genome sequences emerging for several avian species with the fully sequenced chicken genome enables the genome-wide investigation of selective processes in functionally important chicken genes. In particular, because of pathogenic challenges it is expected that genes involved in the chicken immune system are subject to particularly strong adaptive pressure. Signatures of selection detected by inter-species comparison may then be investigated at the population level in global chicken populations to highlight potentially relevant functional polymorphisms.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19527513 PMCID: PMC3224688 DOI: 10.1186/1471-2148-9-136
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Gene structure of IL-4Rα. Exons are shown in green, introns in grey and amplicon regions by the red arrows. The UTRs are shown in blue, the leader sequence in red, unknown regions in black and promoter sequence is shown in beige. The numbers shown represent the base positions in relation to the GenBank entries for the mRNA and CDS.
Generated PAML parameters for free-ratio (M1) and significant site-specific test (M2a, M1a; M7, M8) results for IL-4Rα.
| Model | Parameters | Likelihood | 2 | P value | |
|---|---|---|---|---|---|
| M1 | -2907.434066 | See additional file | - | - | |
| M2a | -2798.740466 | 102.747134 | 4.88 × 10-23 | ||
| M1a | -2850.114033 | ||||
| M81 | -2798.740987 | 102.747656 | 4.88 × 10-23 | ||
| M7 | -2850.114815 | ||||
1 Bayes Empirical Bayes analysis suggests 28 sites where P(ω > 1) > 95.0%. 2ΔML is twice the difference of the likelihoods of the variable minus the neutral model. The number of degrees of freedom is 2 for the site-specific model LRTs.
Sites potentially under selection according to BEB analysis of PAML M8 results for the most frequent haplotype.
| Base position | P | aA | S.E. | P( | Exon | Bases | SNP alleles and amino acids | |
|---|---|---|---|---|---|---|---|---|
| 4429–31 | 3 | T | 9.983 | 0.878 | 0.998 | 1 | ACA | V, F: CCA (P); B: GCA (A) |
| 4435–37 | 5 | F | 10.002 | 0.772 | 1.000 | 1 | TTT | Chicken, C: CTT (L); R, V, F: TTC (F); B TTG (F) |
| 4477–79 | 19 | L | 9.996 | 0.807 | 0.999 | 1 | CTG | V, F: CGC (R); B: CCA (P) |
| 7453–56 | 23 | V | 9.951 | 1.029 | 0.995 | 2 | GTT | V, F: TTT (F); B: CTT (L) |
| 7534–36 | 50 | E | 10.003 | 0.77 | 1.000 | 2 | GAA | V, F: CCA (P); B: CGA (R) |
| 7582–84 | 66 | L | 10.000 | 0.786 | 1.000 | 2 | CTT | V, F: TTT (F); B: AAT (N) |
| 7594–5, 8394 | 70 | R | 9.984 | 0.871 | 0.998 | 2, 3 | AGA | V: TCA (S); F: ATA (M) |
| 9583–85 | 125 | T | 9.999 | 0.788 | 1.000 | 4 | ACT | C, B: GCT (A); V, F: TCT (S) |
| 9631–33 | 141 | L | 9.771 | 1.636 | 0.976 | 4 | TTG | C, G, B: CTG (L); V, F: ATG (M) |
| 9646–48 | 146 | S | 9.995 | 0.811 | 0.999 | 4 | AGC | V, F: CGC (R); B: GGC (G) |
| 9715–17 | 169 | Q | 9.972 | 0.933 | 0.997 | 4 | CAA | V, F: CGC (R); B: CCC (P) |
| 9721–23 | 171 | E | 9.970 | 0.942 | 0.997 | 4 | GAA | V, F: GCA (A); B: GGA (G) |
| 12367–69 | 418 | M | 9.661 | 1.904 | 0.964 | 9 | ATG | V: CTG (L); B: GTG (V); F: TTG (L) |
| 12628–30 | 509 | A | 9.895 | 1.253 | 0.989 | 9 | GCA | V, F: GTA (V) |
| 12631–33 | 510 | R | 9.966 | 0.963 | 0.996 | 9 | AGA | V: AGT (S); B: AGG (R); F: AGC (S) |
| 12652–54 | 517 | H | 9.995 | 0.811 | 0.999 | 9 | CAC | Chicken, R: CAT (H); RJF, F, V: CAA (Q); B: AAC (N) |
| 12661–63^ | 520 | P | 9.110 | 2.54 | 0.930 | 9 | CCT | Chicken, R, RJF: CTT (L) |
| 12742–44 * | 547 | I | 9.619 | 2.097 | 0.954 | 9 | ATA | R, C: TTA (L) |
| 12823–25 | 574 | H | 9.941 | 1.07 | 0.994 | 9 | CAT | V, F: CAC (H); B, F: CAT (H) |
| 12844–46 | 581 | V | 9.976 | 0.914 | 0.997 | 9 | GTG | V, F: ATG (M); B: CTG (L) |
| 12871–73 | 590 | G | 9.580 | 2.076 | 0.956 | 9 | GGC | Chicken, RJF, B, F, V: AGC (S) |
| 12955–57 * | 618 | E | 9.622 | 2.092 | 0.955 | 9 | GAG | V, F, B: GCG (A) |
| 12979–81 | 626 | S | 9.579 | 2.078 | 0.956 | 9 | AGC | V: CGC (R); F: GGC (G) |
| 12985–87 | 628 | E | 9.934 | 1.101 | 0.993 | 9 | GAA | Chicken: GAG (E); RJF, R, G, C: GAC (D) |
| 13042–44 | 647 | A | 9.661 | 1.902 | 0.964 | 9 | GCC | V, B, F: GTC (V) |
| 13078–80 | 659 | N | 9.727 | 1.746 | 0.971 | 9 | AAT | V, F: AAA (K); B: AAC (N) |
| 13096–98 | 665 | R | 9.981 | 0.889 | 0.998 | 9 | CGA | Chicken: CAA (Q), TGA (stop); R, G, C: AGA (R): RJF, F: ATA (M); V: ACA (T); B: AAA (K) |
| 13123–25 | 674 | S | 9.950 | 1.033 | 0.994 | 9 | TCT | V: TGT (C); B: TTT (F); F: TAT (Y) |
| 13138–40 | 679 | A | 9.994 | 0.82 | 0.999 | 9 | GCA | V: GGC (G); B: GTG (V); F: GGT (G) |
P is the amino acid position. S.E. is the standard error for ω. B stands for bamboo partridge, F for grey francolin, V for green JF, C for Ceylon JF, R for red JF, G for grey JF and RJF for the genome sequence. * Significant in analysis with divergent sample FJ542675 only. ^Almost significant.
Frequencies and predicted functional impacts of chicken nonsynonymous SNPs on the IL-4Rα protein product compared to the red JF genome sequence
| Base Positions | Amino Acid | Prediction | Score | Certainty | N2 | Outcome | ||
|---|---|---|---|---|---|---|---|---|
| Position | Red JF | M.A. | ||||||
| 4435–37 | 5 | F | L | neutral | 0.316 | 3 | 111 | not significant |
| 4450–52 | 10 | T | A | neutral | 0.104 | 7 | 1 | neutral |
| 9622–24 | 138 | N | H | neutral | 0.320 | 3 | 1 | not significant |
| 12661–63 | 520 | L | P | neutral | 0.413 | 1 | 102 | not significant |
| 12871–73 | 590 | S | G | neutral | 0.329 | 3 | 173 | not significant |
| 12883–85 | 594 | L | R | neutral | 0.270 | 4 | 174 | not significant |
| 13096–98 | 665 | M | R | neutral | 0.495 | 0 | 172 | not significant |
| 13096–98 | 665 | M | Q | neutral | 0.119 | 7 | 7 | neutral |
| 13096–98 | 6651 | R | Q | neutral | 0.510 | 5 | 7 | not significant |
| 13096–98 | 6651 | R | stop | - | - | - | 1 | deleterious |
| 13111–13 | 670 | S | Y | neutral | 0.036 | 9 | 163 | neutral |
| 13111–13 | 670 | S | F | neutral | 0.061 | 8 | 17 | neutral |
| 13111–13 | 6701 | Y | F | neutral | 0.023 | 9 | 17 | neutral |
| 13177–79 | 692 | T | S | neutral | 0.037 | 9 | 157 | neutral |
| 13177–79 | 692 | T | N | neutral | 0.060 | 8 | 23 | neutral |
| 13177–79 | 6921 | S | N | neutral | 0.028 | 9 | 23 | neutral |
M.A. is the minor allele(s), in some cases this is the most frequent in the chicken samples. Amino acid sites 5, 10, 524 and 669 are polymorphic in the outgroup samples as well. 1 Polymorphic within chicken samples only. 2 N is the number of observed samples with the M.A. Substitutions where the PMut certainty values ≤ 6 did not have statistical support for the predicted change.
Figure 2Median-joining network of chicken haplotypes for all SNPs. Populations are denoted in the legend. Branch lengths are proportional to the number of mutational differences between haplotypes. The outgroup samples are represented by the colourless nodes. Branch lengths are considerably reduced in order to show the details of the chicken population network. V represents the green JF sequences; F the grey francolin; B the bamboo partridge; G the grey JF; C the Ceylon JF; R the red JF; and RJF the genome sequence.
Figure 3Median-joining network of chicken haplotypes for nonsynonymous SNPs. Populations are denoted in the legend. Branch lengths are proportional to the number of mutational differences between haplotypes. The outgroup samples are represented by the colourless nodes. V represents the green JF sequences; F the grey francolin; B the bamboo partridge; G the grey JF; C the Ceylon JF; R1 and R2 the red JF; and RJF the genome sequence.
SNP data, summary statistics and tests of neutrality
| Test | N1 | S2 | H3 | Tajima's | Fu & Li's | Fay & Wu's | Fu's | ||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| All | 90 | 100 | 115 | 0.990 | 5.19 | 3.37 | 1.69 | 1.22 | 1.86 | -21.40 | -34.06 |
| < | |||||||||||
| Asia | 30 | 95 | 51 | 0.993 | 5.37 | 3.89 | 1.32 | 1.25 | 1.68 | -19.45 | -16.06 |
| Africa | 40 | 86 | 53 | 0.983 | 4.72 | 3.36 | 1.36 | 1.11 | 1.55 | -27.34 | -10.13 |
| Broilers | 20 | 79 | 23 | 0.944 | 5.14 | 3.51 | 1.69 | 2.15 | 2.47 | -10.85 | 1.38 |
1 Chickens sampled. 2 SNPs. 3 Haplotypes. 4 Haplotype diversity. 5 Mean pairwise differences per kb. 6 Watterson's estimator per kb. Only p values generated by simulations < 0.05 are given; p > 0.05 are denoted "ns". The resequenced region length is 5,298 bp.
Recombination at IL-4Rα according the percentage GC content, Hudson's R and Rand Kelly's Zper kb from DnaSP
| GC content (%) | ||||||||
|---|---|---|---|---|---|---|---|---|
| Total | Coding | Non-coding | All | Asia | Africa | Broilers | ||
| 44.5 | 46.3 | 43.8 | 33.60 | 35 | 27 | 21 | 17 | 66.13 |
1 Coalescent p < 0.001 for all, p < 0.001 for Asia, p = 0.004 for Africa and p = 0.013 for broilers. 2 Coalescent p = 0.017.
McDonald-Kreitman tests between the chicken populations and the red JF genome sequence and the outgroup samples.
| Test | Type | Intraspecific | Interspecific | P value |
|---|---|---|---|---|
| Chicken & genome sequence | Silent | 6 | 93 | 0.0016 |
| Nonsynonymous | 6 | 10 | ||
| Chicken & 6 outgroups | Silent | 6 | 447 | 0.0403 |
| Nonsynonymous | 4 | 71 | ||