| Literature DB >> 34329477 |
Kejun Wang1,2, Haifei Hu3, Yadong Tian1,2, Jingyi Li4, Armin Scheben5, Chenxi Zhang1,2, Yiyi Li1,2, Junfeng Wu1,2, Lan Yang1,2, Xuewei Fan1,2, Guirong Sun1,2, Donghua Li1,2, Yanhua Zhang1,2, Ruili Han1,2, Ruirui Jiang1,2, Hetian Huang1,2, Fengbin Yan2, Yanbin Wang2, Zhuanjian Li1,2, Guoxi Li1,2, Xiaojun Liu1,2, Wenting Li1,2, David Edwards3, Xiangtao Kang1,2.
Abstract
Domestication and breeding have reshaped the genomic architecture of chicken, but the retention and loss of genomic elements during these evolutionary processes remain unclear. We present the first chicken pan-genome constructed using 664 individuals, which identified an additional approximately 66.5-Mb sequences that are absent from the reference genome (GRCg6a). The constructed pan-genome encoded 20,491 predicated protein-coding genes, of which higher expression levels are observed in conserved genes relative to dispensable genes. Presence/absence variation (PAV) analyses demonstrated that gene PAV in chicken was shaped by selection, genetic drift, and hybridization. PAV-based genome-wide association studies identified numerous candidate mutations related to growth, carcass composition, meat quality, or physiological traits. Among them, a deletion in the promoter region of IGF2BP1 affecting chicken body size is reported, which is supported by functional studies and extra samples. This is the first time to report the causal variant of chicken body size quantitative trait locus located at chromosome 27 which was repeatedly reported. Therefore, the chicken pan-genome is a useful resource for biological discovery and breeding. It improves our understanding of chicken genome diversity and provides materials to unveil the evolution history of chicken domestication.Entities:
Keywords: zzm321990 IGF2BP1zzm321990 ; body size; chicken; major gene; pan-genome
Mesh:
Year: 2021 PMID: 34329477 PMCID: PMC8557422 DOI: 10.1093/molbev/msab231
Source DB: PubMed Journal: Mol Biol Evol ISSN: 0737-4038 Impact factor: 16.240
Fig. 1.Pan-genome of chicken. (a) Geographical distribution of samples used for pan-genome construction. (b) Pan-genome gene classification. (c) Word cloud of the GO enrichment of biological process for variable genes. (d) Pan-genome modeling. The pan-genome modeling shows no more dramatic increases when the number of accession genomes is over 220, indicating that selected individuals were sufficient to capture the majority of PAVs within Gallus gallus. Upper and lower lines represent the pan-genome number and core-genome number, respectively.
Fig. 2.Distribution of gene PAV. (a) The heatmap shows the PAV of variable genes within wild relatives, native breeds, and commercial breeds. (b) The principal component analysis of chicken breeds based on gene PAV. Wild: wild relatives (red jungle fowls); Native, native breeds; commercial breeds consist of two broiler breeds (BRA and BRB) and two layer breeds (BL and WL). (c) Neighbor-Joining phylogenetic tree constructed based on gene PAV matrix.
Fig. 3.Change of PAV frequency in promoter region during breeding and PAV-based GWAS. Scatter plots showing gene occurrence frequencies in Native breeds and Com (commercial) breeds for 0–1 kb (a), 1–2 kb (b), and 2–3 kb (c) upstream promoter regions, respectively. Manhattan plots showing significant promoter region PAVs associated with 151 traits for 0–1 kb (d), 1–2 kb (e), and 2–3 kb (f) upstream promoter regions. All association analysis result was plotted according to the physical location and P-value, with each dot representing an association analysis result. The upper and lower dashed lines represent the significant and suggestive thresholds, respectively. CW1, claw weight; CR, the ratio of claw weight to body weight; DPW, double pinion weight; SEW, semi-evisceration weight.
Fig. 4.Structure and frequency of the three alleles in IGF2BP1 promoter region. (a) Genomic structure of three alleles in IGF2BP1 promoter region in relation to evolutionarily conserved elements (77 vertebrates basewise PhyloP conservation score). Variant alleles in the promoter region of IGF2BP1 include wild type (W) and two mutant alleles (L1 and L2). The conserved elements are indicated by red arrows. Asp-F, 2k-F, and Asp-R are the PCR primers for the identification of the allelic type. (b) Allelic frequency of IGF2BP1 promoter region in the validated population by allelic-specific PCR genotyping. PCR product sizes of W, L1 and L2 are 2345, 290 and 791 bp, respectively. The gel shows the six genotypes derived from the combinations of the three alleles.
Fig. 5.Single-marker genotype association of IGF2BP1 promoter region in the validated Gushi×Anak F2 population with 734 individuals. Eight representative association events were included and others were showed in supplementary figure S10, Supplementary Material online. The number in the bracket is the proportion of phenotype variance explained by IGF2BP1 loci. CW1, claw weight; CR, the ratio of claw weight to body weight; SL12, shank length; BBL12, breast bone length; DPW, double pinion weight; SEW, semi-evisceration weight; CW, carcass weight; LW, leg weight. All traits were phenotyped at 12 weeks of age.
Fig. 6.Comparison of transcriptional activity and expression among three IGF2BP1 genotypes. (a) Comparison of transcriptional activity among different IGF2BP1 promoter region in chicken DF-1 cells. Left shows the constructions of the inserted fragment into the pGL3-Basic plasmid. Significance of two-tailed Student’s t-test: **P < 0.01; ***P < 0.001. (b) Comparison of mRNA expression of IGF2BP1 between L1L1 (Ross 308) and WW (Gushi) chickens in five tissues at 6 weeks of age. Breast, breast muscle; Leg, leg muscle. P-values were calculated using a two-tailed Student’s t-test. (c) Comparison of mRNA expression of IGF2BP1 between L1L1, L2L2, and WW in an IGF2BP1 genotype segregating population at 3 weeks of age.