| Literature DB >> 20502703 |
Hiromi Sawai1, Hie Lim Kim, Kaori Kuno, Sayaka Suzuki, Hideo Gotoh, Masaru Takada, Naoyuki Takahata, Yoko Satta, Fumihito Akishinonomiya.
Abstract
It is postulated that chickens (Gallus gallus domesticus) became domesticated from wild junglefowls in Southeast Asia nearly 10,000 years ago. Based on 19 individual samples covering various chicken breeds, red junglefowl (G. g. gallus), and green junglefowl (G. varius), we address the origin of domestic chickens, the relative roles of ancestral polymorphisms and introgression, and the effects of artificial selection on the domestic chicken genome. DNA sequences from 30 introns at 25 nuclear loci are determined for both diploid chromosomes from a majority of samples. The phylogenetic analysis shows that the DNA sequences of chickens, red and green junglefowls formed reciprocally monophyletic clusters. The Markov chain Monte Carlo simulation further reveals that domestic chickens diverged from red junglefowl 58,000+/-16,000 years ago, well before the archeological dating of domestication, and that their common ancestor in turn diverged from green junglefowl 3.6 million years ago. Several shared haplotypes nonetheless found between green junglefowl and chickens are attributed to recent unidirectional introgression of chickens into green junglefowl. Shared haplotypes are more frequently found between red junglefowl and chickens, which are attributed to both introgression and ancestral polymorphisms. Within each chicken breed, there is an excess of homozygosity, but there is no significant reduction in the nucleotide diversity. Phenotypic modifications of chicken breeds as a result of artificial selection appear to stem from ancestral polymorphisms at a limited number of genetic loci.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20502703 PMCID: PMC2873279 DOI: 10.1371/journal.pone.0010639
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Species and breeds used in this study, sampling locations and years, and sample designations.
| Species | Latin name | Breed | Sample ID |
| Place of collection (Year of collection) |
| Domestic chickens |
| Shamo | SHAMO | 1 | Okukuji, Japan (2004) |
| White Leghorn | WL | 1 | NILGS | ||
| Koshamo | KOSHA151, 152, 153, 154 | 4 | Kagoshima, Tokushima, Japan (1994) | ||
| Ukokkei (Silky) | UKO37, 38, 39, 40 | 4 | Yamagata, Niigata, Okinawa, Japan (1994) | ||
| Red junglefowl |
| RJF41, 45, 56, 58 | 4 | Palemberg, Indonesia (1994) | |
| Green junglefowl |
| GJF301, 302, 303, 304 | 4 | Jakarta, Indonesia (1993) | |
| Quail |
| QUAIL | 1 | NILGS, Tsukuba, Japan (2004) |
n = the number of individuals sampled per species or breed.
NILGS = National Institute of Livestock and Grassland Science.
Chromosomal locations and nucleotide lengths of the introns sequenced in this study.
| Intron | Locus (abbreviation) | Length (bp) | Chromosome | GeneID |
| 1 | Adenylate kinase ( | 376 | 17 | D00251 |
| 2 | Annexin V, intron 5 ( | 821 | 4 | 428767 |
| 3 | Annexin V, intron 7 ( | 416 | 4 | 428767 |
| 4 | Crystalline, beta A1 ( | 801 | 19 | 396499 |
| 5 | Hemoglobin, alpha D ( | 315 | 14 | 416651 |
| 6 | Creatine kinase, brain ( | 529 | 5 | 396248 |
| 7 | Actin, beta, intron 2 ( | 485 | 2 | X00182 |
| 8 | Actin, beta, intron 3 ( | 312 | 2 | X00182 |
| 9 | Transcriptional repressor-delta EF1 ( | 665 | 2 | 396029 |
| 10 | Fatty acid synthase ( | 1335 | 18 | 396061 |
| 11 | Growth hormone 1 ( | 449 | 27 | 378781 |
| 12 | Glyceraldehyde-3-phosphate dehydrogenase ( | 286 | 1 | 374193 |
| 13 | Heat shock 108-kDa protein 1 ( | 698 | 1 | 374163 |
| 14 | Interleukin 8 ( | 571 | 4 | 396495 |
| 15 | Ribosomal protein-coding gene L37A ( | 1121 | 7 | 769981 |
| 16 | Ribosomal protein-coding gene L5 ( | 613 | 8 | 395269 |
| 17 | Ribosomal protein-coding gene L7A, intron 3 ( | 358 | 17 | 417158 |
| 18 | Ribosomal protein-coding gene L7A, intron 4 ( | 351 | 17 | 417158 |
| 19 | Luteinizing hormone/choriogonadotropin receptor ( | 513 | 3 | 395776 |
| 20 | Myosin light chain ( | 477 | 7 | 396470 |
| 21 | Nicotinic acetylcholine receptor, gamma-subunit ( | 708 | 9 | 429151 |
| 22 | Opsin intron 1 ( | 528 | 19 | 396377 |
| 23 | Opsin intron 3 ( | 227 | 19 | 396377 |
| 24 | Opsin intron 4 ( | 381 | 19 | 396377 |
| 25 | Rhodopsin visual pigment intron 4 ( | 879 | 12 | 396486 |
| 26 | Ribosomal protein-encoding gene L30 ( | 1006 | 2 | 425416 |
| 27 | Transforming growth factor-beta 2 ( | 561 | 3 | 421352 |
| 28 | Vimentin ( | 625 | 2 | 420519 |
| 29 | Spindlin on Z ( | 957 | Z | 395344 |
| 30 | Clathrin heavy chain ( | 676 | 19 | 395272 |
The genome data is build 2.1. For three cases (intron 1, 7, and 8) of which Gene ID is not available, accession numbers are indicated.
Figure 1The average-difference tree based on 1,000 concatenated sequences of randomly selected diploid sequences.
The proportion supporting a cluster is shown at each node as the realization of that cluster in the 1,000 individual trees. The TURKEY and QUAIL sequences are used as outgroups. The boxed RedDB indicates that the sequences were taken from the database of red junglefowl. Significant contributions of the domestic chicken genome to this database sequence are evident. The scale shown below the figure is a branch length corresponding to a per-site number of substitutions of 0.01 (1%). Abbreviations for samples are as follows. SHAMO: Shamo, WL: White leghorn, UKO: ukokkei, KOSHA: Koshamo, RJF: red junglefowl, GJF: green junglefowl.
Figure 2Nucleotide divergence among mtDNA D-loop (y axis) and nuclear intron sequences (x axis).
The multiple-hit substitutions were corrected by the method of Jukes and Cantor [23] and Kimura [24]. Group A and A′ comprise comparisons of QUAIL and TURKEY, respectively vs. Gallus samples. Group B comprises comparisons of GJFs vs. RJFs and chickens. Group C represents inter- and intra-group comparisons of RJFs and chickens. The dotted line indicates d mt = d nuc. Each individual data point represents the comparison of a single species pair. Abbreviations for samples are the same as in Figure 1.
Number of haplotypes (H) and the percent nucleotide diversity (π) in the 30 introns.
| Introns | Chickens | RJF | GJF | ||||||||||||
|
| p | O( |
| E( |
| p | O( |
| E( |
| p | O( |
| E( | |
| 1 | 18 | 0.61 | 6 | 7 | 5.5 | 8 | 0.82 | 1 | 4 | 4.5 | 8 | 0.37 | 1 | 5 | 3.2 |
| 2 | 17 | 0.67 | 2 | 8 | 7.9 | 8 | 1.05 | 2 | 3 | 6.4 | 8 | 0.73 | 2 | 3 | 5.7 |
| 3 | 18 | 0.66 | 2 | 8 | 6.1 | 8 | 0.66 | 2 | 3 | 4.4 | 8 | 0.95 | 0 | 5 | 5 |
| 4 | 10 | 0.30 | 3 | 3 | 4.5 | 6 | 0.26 | 0 | 4 | 3.5 | – | – | – | – | – |
| 5 | 18 | 0.47 | 7 | 2 | 3.9 | 8 | 0 | 4 | 1 | 1 | 8 | 0.33 | 2 | 3 | 2.6 |
| 6 | 18 | 0.25 | 3 | 6 | 4.1 | 8 | 0.23 | 1 | 4 | 3.1 | 8 | 0.13 | 1 | 3 | 2.4 |
| 7 | 18 | 0.33 | 5 | 5 | 4.6 | 8 | 0.27 | 0 | 5 | 3.2 | 8 | 0.56 | 2 | 5 | 4.4 |
| 8 | 16 | 1.14 | 6 | 6 | 6.5 | 8 | 0.85 | 0 | 5 | 4.3 | 8 | 0.90 | 2 | 4 | 4.4 |
| 9 | 16 | 0.71 | 4 | 6 | 7.4 | 8 | 0.82 | 0 | 4 | 5.5 | 8 | 0.79 | 1 | 4 | 5.5 |
| 10 | 18 | 1.26 | 4 | 9 | 12.7 | 8 | 1.06 | 1 | 5 | 7.1 | 8 | 1.37 | 0 | 7 | 7.4 |
| 11 | 18 | 0.65 | 5 | 10 | 6.3 | 8 | 0.69 | 1 | 5 | 4.6 | 8 | 0.40 | 2 | 3 | 3.6 |
| 12 | 18 | 0.50 | 5 | 6 | 4.2 | 8 | 0.83 | 2 | 3 | 4.1 | 8 | 1.65 | 2 | 4 | 5.3 |
| 13 | 16 | 1.26 | 4 | 7 | 9.8 | 8 | 1.63 | 2 | 5 | 6.8 | 8 | 1.79 | 0 | 6 | 7 |
| 14 | 18 | 0.72 | 3 | 8 | 7.5 | 8 | 0.43 | 2 | 3 | 4.2 | 8 | 1.17 | 2 | 4 | 6 |
| 15 | 16 | 0.42 | 5 | 6 | 6.7 | 8 | 0.65 | 3 | 3 | 5.7 | 8 | 1.93 | 2 | 4 | 7.3 |
| 16 | 18 | 0.58 | 4 | 6 | 7 | 8 | 0.40 | 2 | 3 | 4.2 | 8 | 0.67 | 2 | 5 | 5.1 |
| 17 | 18 | 0.49 | 5 | 8 | 4.8 | 8 | 0.65 | 2 | 4 | 4.1 | 8 | 1.93 | 0 | 7 | 6.1 |
| 18 | 18 | 0.67 | 7 | 8 | 5.6 | 8 | 0.28 | 3 | 3 | 2.7 | – | – | – | – | – |
| 19 | 18 | 0.82 | 5 | 5 | 7.5 | 8 | 0.56 | 1 | 4 | 4.4 | 8 | 1.00 | 1 | 5 | 5.5 |
| 20 | 18 | 0.85 | 6 | 8 | 7 | 8 | 0.31 | 2 | 4 | 3.4 | 8 | 1.23 | 1 | 6 | 5.7 |
| 21 | 16 | 0.40 | 2 | 9 | 5.9 | 8 | 0.24 | 4 | 2 | 3.6 | 8 | 0.71 | 1 | 5 | 5.5 |
| 22 | 16 | 0.72 | 7 | 6 | 6.8 | 8 | 0.89 | 1 | 3 | 5.3 | 8 | 1.17 | 2 | 3 | 5.8 |
| 23 | 18 | 0.86 | 5 | 6 | 5.1 | 8 | 0.87 | 0 | 3 | 3.8 | 8 | 0.57 | 2 | 3 | 3.1 |
| 24 | 18 | 0.49 | 6 | 6 | 5 | 8 | 1.90 | 3 | 4 | 6.1 | 8 | 1.90 | 1 | 4 | 6.1 |
| 25 | 18 | 0.57 | 4 | 9 | 8.2 | 8 | 0.48 | 1 | 6 | 5.2 | 8 | 0.60 | 2 | 6 | 5.6 |
| 26 | 16 | 1.09 | 2 | 10 | 9.5 | 8 | 1.08 | 1 | 4 | 6.3 | 8 | 1.77 | 0 | 8 | 7.1 |
| 27 | 18 | 0.12 | 8 | 4 | 2.9 | 8 | 0.33 | 2 | 3 | 3.7 | 8 | 0.71 | 0 | 4 | 5.1 |
| 28 | 18 | 1.22 | 4 | 6 | 9.8 | 8 | 0.27 | 2 | 2 | 3.5 | 8 | 0.63 | 0 | 5 | 5 |
| 29 | 13 | 0.08 | – | 3 | 2.7 | 7 | 0 | 0 | 1 | 1 | 8 | 0.15 | – | 4 | 2.8 |
| 30 | 18 | 0.47 | 7 | 5 | 6.6 | 8 | 0.36 | 1 | 5 | 4.2 | 8 | 0.86 | 2 | 4 | 5.7 |
n = the number of sampled chromosomes.
O(F) = the observed number of homozygotes.
E(H) = the expected number of haplotypes or alleles based on πL and Ewens's sampling theory [34], where L is the length of an intron, which is given in Table 2.
A dash indicates that the sequences were not obtained in this study.
Figure 3The proportion of shared haplotypes at 30 introns.
The 30 intron numbers in Table 2 are shown on the left. Different colors indicate different patterns of shared haplotypes as specified by the key on the right margin. CHICKENs in this Figure include only UKO and KOSHA samples. Abbreviations for samples are the same as in Figure 1.
Figure 4‘Structure’ graph for individual proportions of shared ancestry.
Ancestry was estimated from diploid intron sequences in the samples from four KOSHAs, four UKOs, four RJFs, and four GJFs. Predefined populations are shown on the abscissa. Different colors within each block indicate different ancestries. Abbreviations for samples are the same as in Figure 1
Figure 5The three species and demographic parameters as estimated by MCMC and TSL methods.
With the TSL method, only pairs of species are taken into account at once, such that one of the three consecutive subscripts of a demographic parameter is shown in parentheses in the text. The symbols of θ and τ stand for the nucleotide divergence due to the extent of polymorphism and the species divergence, respectively. The subscript G, R, C means the three species, green junglefowl, red junglefowl and domestic chickens, respectively.
Comparison of means ± standard deviations between MCMC (Markov chain and Monte Carlo) and TSL (two species likelihood) estimates (%) based on 26 intron sequences with and without GJF303 and 304.
| Estimates | MCMC | TSL [median] | |
| With GJF303 and 304 | |||
| θC | 0.24±0.06 [0.13, 0.37] | – | |
| θR | 0.10±0.03 [0.05, 0.16] | – | |
| θG | 0.58±0.08 [0.43, 0.75] | – | |
| θCR | 1.60±0.14 [1.34, 1.90] | 0.58±0.16 [0.58] | |
| θRCG | 1.94±0.30 [1.43, 2.58] | θCG = 1.10±0.25 [1.07],θRG = 1.06±0.22 [1.04] | |
| τCR | 0.01±0.003 [0.007, 0.02] | 0.13±0.08 [0.12] | |
| τRCG | 0.12±0.01 [0.10, 0.14] | τCG = 0.31±0.15 [0.32],τRG = 0.31±0.13 [0.33] | |
| Without GJF303 and 304 | |||
| θC | 0.26±0.06 [0.15, 0.39] | – | |
| θR | 0.11±0.03 [0.06, 0.18] | – | |
| θG | 0.34±0.06 [0.25, 0.46] | – | |
| θCR | 0.71±0.14 [0.47, 1.00] | 0.57±0.16 [0.58] | |
| θRCG | 1.61±0.15 [1.34, 1.92] | θCG = 0.80±0.10 [0.80],θRG = 0.86±0.08 [0.86] | |
| τCR | 0.01±0.003 [0.008, 0.02] | 0.13±0.08 [0.12] | |
| τRCG | 0.63±0.06 [0.52, 0.74] | τCG = 0.59±0.03 [0.58],τRG = 0.55±0.02 [0.55] | |
The symbols of θ and τ stand for the nucleotide divergence due to the extent of polymorphism and the species divergence, respectively. θ = 4 N eµg and τ = t sµ, where g is the generation time in units of years, N e is the effective population size, t s is the species divergence time and µ is the per-year nucleotide substitution rate.
Scale and shape parameters for gamma distributions of prior MCMC analyses are given in Table S2, and the result for set 1 is shown here.
Two different pairs of species were used for the TSL method. The estimates for the pair of chickens and green junglefowl are shown as θCG and τCG, and those for the pair of red and green junglefowls are indicated as θRG and τRG.