Pelin Sahlén, Liu Yanhu, Jinrui Xu, Eniko Kubinyi, Guo-Dong Wang, Peter Savolainen.
Abstract
Research on the genetics of domestication most often focuses on protein-coding exons. However, exons cover only a minor part (1-2%) of the canine genome, and functional mutations may also be located beyond the exome, in regulatory regions. Therefore, a large proportion of the phenotypic differences between dogs and wolves may remain genetically unexplained. In this study, we identified variants with high allele frequency differences between wolves and dogs (i.e., highly differentiated variants) across the canine genome and investigated their potential functionality. We found that the enrichment of highly differentiated variants was substantially higher in promoters than in exons and that such variants were also enriched in enhancers. Several enriched pathways were identified, including oxytocin signaling, carbohydrate digestion and absorption, cancer risk, and facial and body features, many of which reflect phenotypes of potential importance during domestication, including phenotypes of the domestication syndrome. The results highlight the importance of regulatory mutations during dog domestication and motivate the functional annotation of the noncoding part of the canine genome.
Keywords: cis-regulatory regions; domestication; epigenetics
Year: 2021 PMID: 33929504 PMCID: PMC8086526 DOI: 10.1093/gbe/evab076
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
(a) Fold enrichment of variants in coding exonic sequences across different FST bins (y axis). (b) Human phenotypes enriched for genes carrying at least one missense exonic variant with an FST value greater than or equal to 0.5. All P-values are FDR-corrected and an FDR threshold of 0.05 is used. The size of the blue dots is proportional to the FDR. The human phenotype ontology database (https://hpo.jax.org/app/) was used for the enrichment analyses because no such functional annotation exists for the canine genes. The tree is built by hierarchical clustering of a distance matrix in which the distance between two gene sets is based on the number of genes in their intersection and union, that is, on the genes shared between the sets and those unique to each.
Enrichment of highly differentiated variants between wolves and dogs in regulatory elements: open chromatin regions assayed by ATAC-seq in multiple tissues, liver enhancers assayed by ChIP-seq against H3K27Ac, and promoters of protein-coding genes.
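As an illustration of what a fold enrichment per FST bin can mean, the sketch below uses one common definition: the fraction of variants in a bin that fall inside an annotated region, divided by the fraction of all variants that fall inside that region. This definition, the function name `fold_enrichment`, and the `(fst, in_region)` input layout are assumptions made for illustration; the record above does not state how the study computed its values.

```python
def fold_enrichment(variants, fst_min, fst_max=float("inf")):
    """Fold enrichment of an annotation among variants in one FST bin.

    variants: iterable of (fst, in_region) pairs, where in_region is True
              if the variant overlaps the annotation (e.g. a promoter).
    The bin is half-open: fst_min <= fst < fst_max (an assumption).
    Returns None when the ratio is undefined.
    """
    variants = list(variants)
    in_bin = [(fst, hit) for fst, hit in variants if fst_min <= fst < fst_max]
    if not variants or not in_bin:
        return None

    # Background: share of all variants that overlap the annotation.
    background = sum(1 for _, hit in variants if hit) / len(variants)
    if background == 0:
        return None

    # Observed: share of the bin's variants that overlap the annotation.
    observed = sum(1 for _, hit in in_bin if hit) / len(in_bin)
    return observed / background


# Toy data (invented for illustration, not taken from the study).
toy = [(0.9, True), (0.8, True), (0.6, False), (0.55, False)]
toy += [(0.1, False)] * 6
print(fold_enrichment(toy, 0.5))
```

A value above 1 would indicate that the annotation is over-represented among variants in that bin relative to the genome-wide background.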
(a) KEGG pathways (KEGG is a database of manually drawn pathway maps) and (b) human phenotypes (HPO) enriched for promoters regulated by regions with highly differentiated variants between wolves and dogs. An FDR threshold of 0.01 is used and only the first 30 terms are shown. The size of the blue dots is proportional to the FDR value. The tree is built by hierarchical clustering of a distance matrix in which the distance between two gene sets is based on the number of genes in their intersection and union, that is, on the genes shared between the sets and those unique to each.
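The tree construction described in the captions can be sketched as follows. The captions say only that the distance depends on the intersection and union of two gene sets, so the Jaccard distance (1 minus intersection size over union size) and average linkage used here are assumptions, as are all function names and the toy gene sets.

```python
from itertools import combinations


def jaccard_distance(a, b):
    """1 - |A n B| / |A u B|; two empty sets are treated as identical."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        return 0.0
    return 1.0 - len(a & b) / len(union)


def distance_matrix(gene_sets):
    """Pairwise distances between named gene sets (dict: name -> set)."""
    names = list(gene_sets)
    n = len(names)
    dist = [[0.0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        d = jaccard_distance(gene_sets[names[i]], gene_sets[names[j]])
        dist[i][j] = dist[j][i] = d
    return names, dist


def average_linkage_merges(names, dist):
    """Naive agglomerative clustering; returns merges in the order made."""
    clusters = {i: [i] for i in range(len(names))}
    merges = []
    while len(clusters) > 1:
        best = None
        for a, b in combinations(sorted(clusters), 2):
            # Average distance over all cross-cluster pairs.
            pairs = [(i, j) for i in clusters[a] for j in clusters[b]]
            d = sum(dist[i][j] for i, j in pairs) / len(pairs)
            if best is None or d < best[0]:
                best = (d, a, b)
        d, a, b = best
        left = sorted(names[i] for i in clusters[a])
        right = sorted(names[i] for i in clusters[b])
        merges.append((left, right, d))
        clusters[a] = clusters[a] + clusters.pop(b)
    return merges


# Hypothetical enriched terms and gene identifiers, for illustration only.
toy_sets = {
    "term_A": {"G1", "G2", "G3"},
    "term_B": {"G2", "G3", "G4"},
    "term_C": {"G7", "G8"},
}
names, dist = distance_matrix(toy_sets)
for left, right, d in average_linkage_merges(names, dist):
    print(left, "+", right, "at distance", d)
```

Terms that share many genes merge early and sit close together in the tree, while terms with no genes in common join only at the maximum distance of 1.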