| Literature DB >> 27215397 |
Hyeonsoo Jeong1, Kwondo Kim2,3, Kelsey Caetano-Anollés4, Heebal Kim2,5, Byung-Ki Kim6, Jun-Koo Yi6, Jae-Jung Ha6, Seoae Cho3, Dong Yep Oh6.
Abstract
Chicken, Gallus gallus, is a valuable species both as a food source and as a model organism for scientific research. Here, we sequenced the genome of Gyeongbuk Araucana, a rare chicken breed with unique phenotypic characteristics including flight ability, large body size, and laying blue-shelled eggs, to identify its genomic features. We generated genomes of Gyeongbuk Araucana, Leghorn, and Korean Native Chicken at a total of 33.5, 35.82, and 33.23 coverage depth, respectively. Along with the genomes of 12 Chinese breeds, we identified genomic variants of 16.3 million SNVs and 2.3 million InDels in mapped regions. Additionally, through assembly of unmapped reads and selective sweep, we identified candidate genes that fall into heart, vasculature and muscle development and body growth categories, which provided insight into Gyeongbuk Araucana's phenotypic traits. Finally, genetic variation based on the transposable element insertion pattern was investigated to elucidate the features of transposable elements related to blue egg shell formation. This study presents results of the first genomic study on the Gyeongbuk Araucana breed; it has potential to serve as an invaluable resource for future research on the genomic characteristics of this chicken breed as well as others.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27215397 PMCID: PMC4877703 DOI: 10.1038/srep26484
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gyeongbuk Araucana chicken.
(A) blue egg shell of Gyeongbuk Araucana. (B) general appearance of Gyeongbuk Araucana chicken breed.
Results of genetic variants annotation using Snpeff.
| Gyeongbuk Araucana | Leghorn | Korean native Chicken | ||||
|---|---|---|---|---|---|---|
| SNP | INDEL | SNP | INDEL | SNP | INDEL | |
| Region type | ||||||
| Downstream | 641519 | 112867 | 655710 | 114991 | 1007516 | 152666 |
| Exon | 96037 | 2051 | 98252 | 2017 | 150948 | 2340 |
| Intergenic | 3827999 | 712665 | 3923139 | 726943 | 6084648 | 973939 |
| Intron | 3418642 | 633210 | 3501287 | 646854 | 5369842 | 867038 |
| Splice site acceptor | 145 | 136 | 147 | 129 | 226 | 152 |
| Splice site donor | 184 | 152 | 184 | 143 | 289 | 166 |
| Splice site region | 12613 | 2580 | 12810 | 2611 | 19885 | 3377 |
| Upstream | 656713 | 106253 | 668732 | 107865 | 1014086 | 141518 |
| UTR 3′ | 75671 | 17685 | 77469 | 18094 | 120027 | 23795 |
| UTR 5′ | 15091 | 1965 | 14885 | 1883 | 20808 | 2056 |
| Functional class | ||||||
| Missense | 25167 | — | 25851 | — | 40408 | — |
| Nonsense | 175 | — | 173 | — | 293 | — |
| Silent | 69784 | — | 71254 | — | 108748 | — |
| Total | 7124664 | 1312246 | 7304278 | 1339702 | 11275071 | 1794984 |
A summary of results for other chicken breeds is shown in Table S2.
Figure 2Population stratification of various chicken breeds.
(A) Results of principal component analysis (PCA) of 15 chicken breeds. PC1 (x-axis) versus PC2 (y-axis). PC1 and PC2 represent 12.85% and 11.08% of the total variance in the PCA, respectively. (B) Linkage disequilibrium (LD) pattern of GA, KNC, and LH.
Figure 3Manhattan plot of distribution of pairwise nucleotide variation (log2 (θ ratio)) and genetic differentiation (Weir and Cockerham’s FST).
Results of transposable elements variants annotation using Snpeff.
| Region Type | Gyeongbuk Araucana | Leghorn | Korean native Chicken |
|---|---|---|---|
| Downstream | 2067 | 4174 | 14619 |
| Exon | 11 | 28 | 48 |
| Intergenic | 14059 | 30101 | 128283 |
| Intron | 7890 | 18947 | 91577 |
| None | 17 | 21 | 36 |
| Splice site acceptor | 4 | 13 | 33 |
| Splice site donor | 7 | 14 | 29 |
| Splice site region | 44 | 81 | 288 |
| Upstream | 2688 | 5169 | 14997 |
| UTR 3′ | 156 | 292 | 1298 |
| UTR 5′ | 108 | 207 | 236 |
| Total | 22033 | 48797 | 216315 |
A summary of results for other chicken breeds is shown in Table S4.
Figure 4Results of analyses using transposable element variants (TEVs).
(A) A circular plot for distribution of TEVs on the genome of GA. Four types of TE are represented by different colors (LTR : green, LINE : red, SINE: orange, and DNA: blue). (B) A clustering tree based on LTR presence patterns for 28 chicken genomes. Three groups for LH, GA, and KNC are well-defined. The neighbor-joining tree using whole genome variants and other TE types (LINE, SINE, and DNA) are shown in Figure S4.