| Literature DB >> 34009300 |
Quan-Kuan Shen1,2,3, Min-Sheng Peng1,2,3, Adeniyi C Adeola1,2,4, Ling Kui5, Shengchang Duan6, Yong-Wang Miao7, Nada M Eltayeb8, Jacqueline K Lichoti9, Newton O Otecko1,2,3, Maria Giuseppina Strillacci10, Erica Gorla10, Alessandro Bagnato10, Olaogun S Charles11, Oscar J Sanke12, Philip M Dawuda13, Agboola O Okeyoyin14, John Musina15, Peter Njoroge15, Bernard Agwanda15, Szilvia Kusza16, Hojjat Asadollahpour Nanaei17, Rana Pedar17, Ming-Min Xu1,2,3, Yuan Du6, Lotanna M Nneji1,2, Robert W Murphy18, Ming-Shan Wang19,20, Ali Esmailizadeh1,17, Yang Dong21,22,23, Sheila C Ommeh15,24, Ya-Ping Zhang1,2,3,25,26.
Abstract
Domestication of the helmeted guinea fowl (HGF; Numida meleagris) in Africa remains elusive. Here we report a high-quality de novo genome assembly for domestic HGF generated by long- and short-reads sequencing together with optical and chromatin interaction mapping. Using this assembly as the reference, we performed population genomic analyses for newly sequenced whole-genomes for 129 birds from Africa, Asia, and Europe, including domestic animals (n = 89), wild progenitors (n = 34), and their closely related wild species (n = 6). Our results reveal domestication of HGF in West Africa around 1,300-5,500 years ago. Scanning for selective signals characterized the functional genes in behavior and locomotion changes involved in domestication of HGF. The pleiotropy and linkage in genes affecting plumage color and fertility were revealed in the recent breeding of Italian domestic HGF. In addition to presenting a missing piece to the jigsaw puzzle of domestication in poultry, our study provides valuable genetic resources for researchers and breeders to improve production in this species.Entities:
Keywords: Africa; breed; domestication; genome; guineafowl; selection
Mesh:
Year: 2021 PMID: 34009300 PMCID: PMC8214406 DOI: 10.1093/gbe/evab090
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.Genome architecture of the HGFv1 assembly. (A) Circos plot of HGFv1 assembly, repeat content, gene density, and GC content (%). The 1–16 scaffolds, representing 90% of the total length of 1,127 scaffolds, are shown as the outermost tracks. The repeat content is indicated by the gray line. Regions within the gene density of more than ten genes are shown as red spikes, whereas those with five to ten genes are indicated by orange spikes. Green spikes represent regions with fewer than five genes. Regions of the genome with GC content higher than average (37.54%) are shown in light blue. All data were plotted in 100-kb windows. (B) Dot plots showing synteny between the HGFv1 assembly and chicken reference genome GRCg6a (GCA_000002315.5). The sky-blue dots represent the accordant alignments, whereas the firebrick dots are reverse adjustments.
Quality Metrics for HGFv1 Assembly Compared with Previous Poultry Genome Assemblies
| Genome Assembly | Largest Contig (Mb)a | N50 Contigs (Mb)a | Largest Scaffold (Mb)a | N50 Scaffolds (Mb)a | N gaps (Mb)b | Buscoc | LAId |
|---|---|---|---|---|---|---|---|
| 42.03 | 14.17 | 199.08 | 98.08 | 5.38 | C: 94.5% [S: 93.3%, D: 1.2%], F: 3.1%, M: 2.4% | 10.29 | |
| 1.66 | 0.23 | 194.43 | 97.48 | 20.12 | C: 94.7% [S: 93.7%, D: 1.0%], F: 3.1%, M: 2.2% | N/A | |
| 65.77 | 17.49 | 197.6 | 91.31 | 9.78 | C: 91.1% [S: 90.0%, D: 1.1%], F: 5.4%, M: 3.5% | 7.44 | |
| 0.54 | 0.03 | 202.84 | 76.12 | 2.91 | C: 87.2% [S: 85.9%, D: 1.3%], F: 6.7%, M: 6.1% | 7.09 | |
| 0.26 | 0.02 | 190.65 | 59 | 35.29 | C: 90.5% [S: 89.5%, D: 1.0%], F: 5.4%, M: 4.1% | N/A | |
| 0.25 | 0.02 | 94.47 | 14.23 | 20.83 | C: 93.8% [S: 92.9%, D: 0.9%], F: 4.0%, M: 2.2% | N/A | |
| 5.28 | 0.55 | 175.65 | 82.19 | 10.39 | C: 94.6% [S: 93.5%, D: 1.1%], F: 3.2%, M: 2.2% | N/A | |
| 0.2 | 0.02 | 24.05 | 5.2 | 35.37 | C: 93.4% [S: 92.7%, D: 0.7%], F: 3.9%, M: 2.7% | N/A |
Statistics were calculated by stats.sh script contained in BBMap (v. 38.45).
Sum of all “N” nucleotides in the genome assembly.
Busco (v. 3.0.2) assessment according to Aves obd9 (n = 4,915). C: Complete; S: complete and single copy; D: complete and duplicated; F: fragmented; M: missing.
Evaluation of assembly continuity for repetitive sequences via the LAI. “N/A” means that intact LTR-RT content and total LTR sequence content are too low to calculate LAI.
Fig. 2.Geographic distribution, phylogeny, and population structure. (A) Geographic distribution of guinea fowl samples showing on the world map (downloaded from http://bzdt.ch.mnr.gov.cn). Phylogenetic tree of 89 domestic HGF, 34 wild HGF, and 6 wild relatives (as the outgroup) constructed using 8,662,417 SNPs with FastTree. The red solid dots indicate nodes with more than 85% of 100 bootstraps. (B) PCA for HGF and the outgroup based on 8,662,417 biallelic SNPs. (C) PCA for HGF without the outgroup based on 4,695,945 SNPs, because the SNPs only in the outgroup were excluded. (D) Model-based clustering analysis with K = 4, 5, and 6. The lowest cross-validation error value was observed when K = 5.
Fig. 3.Genetic diversity, genetic load, LD, and runs of homozygosity. (A) Nucleotide diversity (Pi). (B) The counts of singletons. (C) Genetic load in exonic region. (D) Decay of LD. (E) Runs of homozygosity.
Fig. 4.Inference of demographic history. (A) The effective population changes through the time estimated by using SMC++. (B) The demographic model inferred with momi2.
Selective Genes Associated to the HGF Domestication and Subsequent Breeding
| GO ID | Term | Significant Gene | Fisher’s Exact Test |
|---|---|---|---|
| GO:0004890 | GABA-A receptor activity | 0.0038 | |
| GO:0016300 | tRNA (uracil) methyltransferase activity |
| 0.0055 |
| GO:0032452 | Histone demethylase activity |
| 0.0055 |
| GO:0071558 | Histone demethylase activity (H3-K27 specific) |
| 0.0055 |
| GO:0005230 | Extracellular ligand-gated ion channel activity | 0.0056 | |
| GO:0016917 | GABA receptor activity | 0.0058 | |
| GO:0008381 | Mechanosensitive ion channel activity |
| 0.011 |
| GO:0030507 | Spectrin binding |
| 0.011 |
| GO:0015276 | Ligand-gated ion channel activity | 0.0148 | |
| GO:0022834 | Ligand-gated channel activity | 0.0148 | |
| GO:0004525 | Ribonuclease III activity |
| 0.0164 |
| GO:0004859 | Phospholipase inhibitor activity |
| 0.0164 |
| GO:0032296 | Double-stranded RNA-specific ribonuclease activity |
| 0.0164 |
| GO:0032451 | Demethylase activity |
| 0.0164 |
| GO:0055102 | Lipase inhibitor activity |
| 0.0164 |
| GO:0022836 | Gated channel activity | 0.0203 | |
| GO:0015349 | Thyroid hormone transmembrane transporter activity |
| 0.0218 |
| GO:0016175 | Superoxide-generating NADPH oxidase activity |
| 0.0218 |
| GO:0050664 | Oxidoreductase activity, acting on NAD(P)H, oxygen as acceptor |
| 0.0272 |
| GO:0008175 | tRNA methyltransferase activity |
| 0.0326 |
| GO:0015020 | Glucuronosyltransferase activity |
| 0.0326 |
| GO:0016891 | Endoribonuclease activity, producing 5′-phosphomonoesters |
| 0.0326 |
| GO:0004383 | Guanylate cyclase activity |
| 0.0379 |
| GO:0016893 | Endonuclease activity, active with either ribo- or deoxyribonucleic acids and producing 5′-phosphomonoesters |
| 0.0432 |
| GO:0008092 | Cytoskeletal protein binding | 0.0448 | |
| GO:0004521 | Endoribonuclease activity |
| 0.0485 |
| GO:0030983 | Mismatched DNA binding |
| 0.0485 |
Fig. 5.Scan of selective signals in the domestic Italian breed Camosciata. (A) The PBS statistic was constructed using the breed Camosciata as target, the breed Selvatica as control and the grouped wild HGF background. (B) The Pi-Ratio was calculated from the ratio of nucleotide diversity for the grouped wild HGF background to that for the breed Camosciata. All dots represent the sliding window of 10 kb with 10 kb step size. The windows with values over top 1% quantile for both two statistics were overlapped to identify selective genes which were noted with cross labels. (C) The genes located in the LD block of scaffold 1:7,845,520–17,375,296. The selective genes were in red. (D) The Camosciata breed with cream white plumage (mutation) and the Selvatica breed with wild plumage type. The Camosciata was selected from a small breeding flock originated in France and brought to Italy in 1922 and differs from the solid white variety due to the pigmented skin of the neck and the visible white spotting of the plumage absent in the solid white birds. The photos were taken at Az. Agricola E. Oggioni in Italy and provided by Maria Giuseppina Strillacci and Erica Gorla. (E) The mutation p.Trp218Gly in exon 1 of TYR gene. The alignment of protein sequences was shown. This mutation was indicated by star label. The neighboring histidine residues which were essential for TYR function via binding to copper ions were noted with arrows.
GO-Enriched Selective Genes in the Domestic Italian Breed Camosciata
| GO ID | Term | Significant Genes | Fisher’s Exact Test |
|---|---|---|---|
| GO:0008502 | Melatonin receptor activity |
| 0.028 |
| GO:0005496 | Steroid binding |
| 0.029 |
| GO:0030060 | L-malate dehydrogenase activity |
| 0.041 |
| GO:0003824 | Catalytic activity | 0.046 |