| Literature DB >> 31034467 |
Saber Qanbari1,2, Carl-Johan Rubin3, Khurram Maqbool4, Steffen Weigend5,6, Annett Weigend5, Johannes Geibel1,6, Susanne Kerje3, Christine Wurmser7, Andrew Townsend Peterson8, I Lehr Brisbin9, Ruedi Preisinger10, Ruedi Fries7, Henner Simianer1,6, Leif Andersson3,4,11.
Abstract
We carried out whole genome resequencing of 127 chicken including red jungle fowl and multiple populations of commercial broilers and layers to perform a systematic screening of adaptive changes in modern chicken (Gallus gallus domesticus). We uncovered >21 million high quality SNPs of which 34% are newly detected variants. This panel comprises >115,000 predicted amino-acid altering substitutions as well as 1,100 SNPs predicted to be stop-gain or -loss, several of which reach high frequencies. Signatures of selection were investigated both through analyses of fixation and differentiation to reveal selective sweeps that may have had prominent roles during domestication and breed development. Contrasting wild and domestic chicken we confirmed selection at the BCO2 and TSHR loci and identified 34 putative sweeps co-localized with ALX1, KITLG, EPGR, IGF1, DLK1, JPT2, CRAMP1, and GLI3, among others. Analysis of enrichment between groups of wild vs. commercials and broilers vs. layers revealed a further panel of candidate genes including CORIN, SKIV2L2 implicated in pigmentation and LEPR, MEGF10 and SPEF2, suggestive of production-oriented selection. SNPs with marked allele frequency differences between wild and domestic chicken showed a highly significant deficiency in the proportion of amino-acid altering mutations (P<2.5×10-6). The results contribute to the understanding of major genetic changes that took place during the evolution of modern chickens and in poultry breeding.Entities:
Mesh:
Year: 2019 PMID: 31034467 PMCID: PMC6508745 DOI: 10.1371/journal.pgen.1007989
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Summary statistics for chicken whole genome resequencing.
| Population | Code | Sequencing | N | Depth | nSNPs | |
|---|---|---|---|---|---|---|
| Red jungle fowl (Thailand) | RJFt | Individual | 25 | 11.0 | 17,422,645 | 0.40 ± 0.18 |
| Red jungle fowl (India) | RJFi | Individual | 10 | 2.5 | 9,470,039 | 0.13 ± 0.07 |
| Broiler A | BRA | Individual | 20 | 11.5 | 12,355,756 | 0.34 ± 0.16 |
| Broiler B | BRB | Individual | 20 | 11.9 | 11,525,631 | 0.32 ± 0.16 |
| Pooled Broiler D | BRpD | Pooled | 25 | 40.0 | 10,498,251 | - |
| White layer | WL | Individual | 25 | 8.1 | 7,638,111 | 0.15 ± 0.12 |
| Brown layer | BL | Individual | 25 | 7.6 | 8,812,787 | 0.20 ± 0.14 |
| Rhode White (layer) | RWp | Pooled | 48 | 30.0 | 8,614,223 | - |
1Averaged over number of samples when sequenced individually.
2Number of polymorphic sites within population.
3Average ± standard deviation of nucleotide diversity estimated in 40kb windows.
Fig 1Analysis of SNP diversity.
(A) Comparison of the minor allele frequency spectrum of coding sequences in RJFt and commercial populations. (B) Visualization of the distribution of population-specific and group-specific variants detected from individual sequencing only. Each triangle represents the number (103) of variants exclusively segregating or detected in the corresponding population and overlapping sections denote group-specific variants. (C) Heatmap of the allele frequency distribution of population-specific variants. (D) Principal component analysis of chicken populations. Populations are coded as RJFt = red jungle fowl (Thailand), RJFi = red jungle fowl (India), BL = Brown layer, WL = White layer, RWp = Rhode-White pool, BRA = Broiler line A, BRB = Broiler line B and BRpD = Broiler line pool D, BRs = three commercial broiler lines (BRA, BRB and BRpD). show a substantially smaller proportion of rare alleles that can be attributed to the smaller effective population size caused by recent selective breeding leading to loss of rare alleles.
A list of candidate genes harbouring missense mutations with ΔAF > 0.7 in two contrasts ‘RJFs vs. Coms’ and ‘BRs vs. LRs’.
| Wild | Layers | Broilers | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Chr_Pos | Ref | Alt | RJFt | RJFi | BL | WL | RWp | BRA | BRB | BRpD | ΔRJFsComs | ΔRJFsBRs | ΔRJFsLRs | ΔBRsLRs | AA | SIFT | Functionb | |
| 2_50876521 | T | C | 0.06 | 0.00 | 1.00 | 0.00 | 1.00 | 0.98 | 0.95 | 0.84 | 0.64 | 0.89 | 0.25 | K/R | Tolerated_low_confidence (1) | Wing development | ||
| 4_67124162 | T | A | 0.12 | 0.06 | 1.00 | 1.00 | 1.00 | 0.85 | 0.98 | 0.93 | 0.91 | 0.83 | 0.08 | S/T | Tolerated (0.4) | Pigmentation | ||
| 10_12832456 | G | A | 0.06 | 0.00 | 0.54 | 0.88 | 1.00 | 0.9 | 0.8 | 0.83 | 0.77 | 0.82 | 0.04 | V/M | Tolerated (0.17) | Wing development. | ||
| Z_10819221 | G | A | 0.03 | NA | 1.00 | 0.88 | 1.00 | 0.84 | 1.00 | 0.21 | 0.93 | 0.65 | 0.28 | S/N | Tolerated (0.36) | Chicken feathering | ||
| Z_16761767 | C | G | 0.71 | NA | 0.00 | 0.00 | NA | 0.00 | 0.00 | 0.00 | 0.71 | 0.71 | 0.00 | P/A | Tolerated_low_confidence (0.14) | Melanocyte regeneration | ||
| 8_28464218 | T | C | 0.56 | NA | 0.71 | 1.00 | 0.87 | 0.00 | 0.07 | 0.35 | 0.06 | 0.30 | 0.42 | C/R | Deleterious (0.01) | Leptin receptor | ||
| 9_23584350 | A | C | 0.10 | NA | 0.61 | 1.00 | 0.84 | 0.02 | 0.07 | 0.00 | 0.32 | 0.72 | 0.07 | I/L | NA | Delayed puberty | ||
| 27_1768503 | C | G | 0.14 | NA | 0.94 | 0.84 | 0.58 | 0.05 | 0.00 | 0.00 | 0.26 | 0.65 | 0.12 | G/A | Tolerated (1) | Osteoporosis | ||
| Z_56413219 | T | C | 0.20 | NA | 0.00 | 0.00 | 0.00 | 0.81 | 0.90 | 0.78 | 0.21 | 0.20 | 0.63 | I/T | Tolerated (0.82) | Muscle generation | ||
aFrequencies of variant alleles (Alt) are reported for RJFt = red jungle fowl (Thailand), RJFi = red jungle fowl (India), BL = Brown layer, WL = White layer, RWp, Rhode and White pool, BRA = Broiler line A, BRB = Broiler line B and BRpD = Broiler line pool D. AA = Amino acid substitutions, SIFT<0.05 indicates a likely deleterious missense mutation. Sites with ΔAF>0.7 in two contrasts of ‘RJFs vs. Coms’ and ‘BRs vs. LRs’ are in bold.
Pairwise F between populations.
| RJFi | BRA | BRB | BL | WL | |
|---|---|---|---|---|---|
| 0.27 ± 0.12 | 0.18 ± 0.10 | 0.20 ± 0.11 | 0.30 ± 0.14 | 0.38 ± 0.14 | |
| 0.34 ± 0.14 | 0.37 ± 0.15 | 0.51 ± 0.19 | 0.57 ± 0.20 | ||
| 0.09 ± 0.09 | 0.30 ± 0.16 | 0.37 ± 0.17 | |||
| 0.32 ± 0.17 | 0.39 ± 0.18 | ||||
| 0.53 ± 0.21 |
Pairwise F values ± (standard deviation) are reported for RJFt = red jungle fowl (Thailand), RJFi = red jungle fowl (India), BL = Brown layer, WL = White layer, RWp, Rhode and White pool, BRA = Broiler line A, BRB = Broiler line B and BRpD = Broiler line pool D.
List of putative selective sweeps revealed by F analysis.
| CHR | BIN_START | BIN_END | nSNPs | CONTRAST | GENE | ||
|---|---|---|---|---|---|---|---|
| 1 | 8600001 | 8640000 | 342 | 0.51 | 6.28 | RJFs/Coms | Gene desert |
| 1 | 32480001 | 32520000 | 647 | 0.75 | 7.22 | RJFs/LRs,RJFs/Coms | Gene desert |
| 1 | 55380001 | 55420000 | 636 | 0.77 | 10.59 | RJFs/Coms,RJFs/BRs,RJFs/LRs | |
| 1 | 55900001 | 55940000 | 762 | 0.59 | 7.51 | RJFs/Coms,RJFs/BRs,RJFs/LRs | |
| 1 | 102160001 | 102200000 | 489 | 0.50 | 6.10 | RJFs/Coms | Gene desert |
| 1 | 127560001 | 127600000 | 388 | 0.61 | 6.46 | RJFs/BRs,RJFs/Coms | Gene desert |
| 1 | 142020001 | 142060000 | 581 | 0.50 | 6.00 | RJFs/Coms | Gene desert |
| 2 | 24680001 | 24720000 | 1108 | 0.55 | 6.89 | RJFs/Coms | |
| 2 | 25800001 | 25840000 | 471 | 0.68 | 7.39 | RJFs/BRs,RJFs/Coms,BRs/LRs,RJFs/LRs | Gene desert |
| 2 | 27840001 | 27880000 | 248 | 0.76 | 6.46 | BRs/LRs,RJFs/LRs | Gene desert |
| 2 | 50840001 | 50880000 | 271 | 0.58 | 6.04 | RJFs/BRs | |
| 2 | 70840001 | 70880000 | 533 | 0.78 | 7.63 | RJFs/LRs,RJFs/Coms | |
| 2 | 73280001 | 73320000 | 583 | 0.59 | 6.07 | RJFs/BRs | |
| 2 | 82440001 | 82480000 | 540 | 0.51 | 6.21 | RJFs/Coms,RJFs/LRs,RJFs/BRs | Gene desert |
| 2 | 119220001 | 119260000 | 422 | 0.68 | 9.15 | RJFs/Coms,RJFs/BRs,RJFs/LRs | Gene desert |
| 2 | 143420001 | 143460000 | 459 | 0.59 | 7.52 | RJFs/Coms,RJFs/LRs | Gene desert |
| 3 | 29400001 | 29440000 | 568 | 0.86 | 7.53 | BRs/LRs,RJFs/Coms | |
| 4 | 56940001 | 56980000 | 832 | 0.52 | 6.34 | RJFs/Coms | |
| 4 | 71740001 | 71780000 | 313 | 0.61 | 6.34 | RJFs/BRs,RJFs/Coms | Gene desert |
| 5 | 29920001 | 29960000 | 604 | 0.56 | 7.14 | RJFs/Coms,RJFs/BRs, | Gene desert |
| 5 | 31100001 | 31140000 | 512 | 0.51 | 6.25 | RJFs/Coms,RJFs/BRs,BRs/LRs | Gene desert |
| 5 | 32300001 | 32340000 | 593 | 0.54 | 6.68 | RJFs/Coms,RJFs/BRs | Gene desert |
| 5 | 48920001 | 49400000 | 355 | 0.79 | 6.73 | BRs/LRs | |
| 9 | 12400001 | 12440000 | 477 | 0.55 | 6.92 | RJFs/Coms | |
| 10 | 6400001 | 6440000 | 836 | 0.54 | 6.78 | RJFs/Coms,RJFs/LRs | |
| 11 | 80001 | 140000 | 535 | 0.78 | 6.60 | BRs/LRs,RJFs/LRs | |
| 14 | 13700001 | 13740000 | 1798 | 0.71 | 9.65 | RJFs/Coms,RJFs/BRs,RJFs/LRs | |
| 14 | 14280001 | 14320000 | 1877 | 0.64 | 8.40 | RJFs/Coms,RJFs/BRs,RJFs/LRs | |
| 15 | 2980001 | 3020000 | 314 | 0.71 | 6.69 | RJFs/LRs,RJFs/Coms | |
| 24 | 6140001 | 6180000 | 510 | 0.65 | 7.00 | RJFs/BRs,RJFs/Coms | |
| 26 | 120001 | 160000 | 185 | 0.63 | 6.68 | RJFs/BRs |
a All genes residing in the top differentiated windows, some windows carry more than one gene as indicated here.
Fig 2Genome-wide visualization of candidate selective sweeps.
Each dot represents a 40 kb window in steps of 20 kb along the genome. (A) ZF scores in different contrasts of chicken populations. Candidate genes are indicated for each signal. Signals marked by a star represent regions lacking annotated genes. (B) Manhattan plot of the Z|ΔPi| scores between two RJF and four commercial populations. (C) High resolution illustration of putative sweeps on GGA14. The heatmap visualizes the region as F values among multiple populations and groups, where the genes or known elements overlapping the candidate sweeps are indicated underneath.
Fig 3A graphical illustration of regions with extremely low nucleotide diversity across populations on GGA1 and GGA2.
In panel A, two regions of high homozygosity are centered over ALX1 and KITLG on GGA1. Red and green rectangles, respectively display the chromosomal positions of ALX1 and KITLG. Panel B visualizes an extensive putative sweep on GGA2 overlapping the EGFR locus. Each dot represents a 40 kb window. The standard errors of ZPi-scores in each window across scans are smoothed over the region in grey. Nucleotide diversity was estimated for RJFs (two red jungle fowl populations), Coms (four commercial lines), BRs (the two commercial broiler lines, BRA and BRB), LRs (two layer populations, BL and WL) and ALL (all six populations of RJFs and commercials).
Fig 4Analysis of enrichment for different categories of SNPs.
Panel A and B represent the contrasts ‘RJFs vs. Coms’ and ‘BRs vs. LRs’, respectively. UpDwStream stands for sites residing 5 kb up- and downstream of genes. The black line represents the total number of SNPs in each ΔAF bin and colored lines represent log2 fold changes of the observed SNP count for each category in each bin against the expected SNP count.