| Literature DB >> 32071365 |
Isabel Álvarez1, Iván Fernández1, Amadou Traoré2, Lucía Pérez-Pardal3, Nuria A Menéndez-Arias1, Félix Goyache4.
Abstract
The Djallonké (West African Dwarf) sheep is a small-sized haired sheep resulting from a costly evolutionary process of natural adaptation to the harsh environment of West Africa including trypanosome challenge. However, genomic studies carried out in this sheep are scant. In this research, genomic data of 184 Djallonké sheep (and 12 Burkina-Sahel sheep as an outgroup) generated using medium-density SNP Chips were analyzed. Three different statistics (iHS, XP-EHH and nSL) were applied to identify candidate selection sweep regions spanning genes putatively associated with adaptation of sheep to the West African environment. A total of 207 candidate selection sweep regions were defined. Gene-annotation enrichment and functional annotation analyses allowed to identify three statistically significant functional clusters involving 12 candidate genes. Genes included in Functional Clusters associated to selection signatures were mainly related to metabolic response to stress, including regulation of oxidative and metabolic stress and thermotolerance. The bovine chromosomal areas carrying QTLs for cattle trypanotolerance were compared with the regions on which the orthologous functional candidate cattle genes were located. The importance of cattle BTA4 for trypanotolerant response might have been conserved between species. The current research provides new insights on the genomic basis for adaptation and highlights the importance of obtaining information from non-cosmopolite livestock populations managed in harsh environments.Entities:
Mesh:
Year: 2020 PMID: 32071365 PMCID: PMC7028950 DOI: 10.1038/s41598-020-59839-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Plots showing cross-validation errors for each K tested using the program Admixture v1.23, with the lowest error for K = 4 (Plot A), and dispersion of the Djallonké and Burkina–Sahel individuals according the two first factors computed via PCA (Plots B, C and D). Factors 1 and 2 explained 3.2% and 2.4% of the total variance, respectively. Contour plots illustrate the 75% confidence region of the relationships among the individuals assigned to each Cluster identified using Admixture v1.23 for K = 3 (Plot B), K = 4 (Plot C) and K = 5 (Plot D). The location of most Burkina-Sahel individuals is highlighted Furthermore, barplots of individual ancestries estimated using Admixture v1.23 for K = 3, K = 4 and K = 5 are shown (Plot E).
Figure 2Manhattan plots summarizing results corresponding to iHS (Plot A), XP-EHH (Plot B) and nSL (Plot C) analyses. SNPs with normalized values higher than |2| are highlighted in green.
Summary of the candidate selection sweep regions identified per ovine chromosome (OAR).
| OAR | N | Length | n |
|---|---|---|---|
| 1 | 18 | 3549100 | 67 (10)a |
| 2 | 34 | 6581006 | 44 (34) |
| 3 | 27 | 5892878 | 105 (19) |
| 4 | 16 | 2450404 | 45 (12) |
| 5 | 8 | 1845327 | 27 (10) |
| 6 | 10 | 2590704 | 43 (5) |
| 7 | 4 | 521393 | 11 (1) |
| 8 | 9 | 1097043 | 17 (5) |
| 9 | 5 | 983744 | 17 (2) |
| 10 | 4 | 1077755 | 16 (9) |
| 11 | 2 | 116626 | 2 (2) |
| 12 | 7 | 822135 | 16 (4) |
| 13 | 7 | 960058 | 18 (3) |
| 14 | 1 | 57444 | 2 (0) |
| 15 | 6 | 914153 | 21 (3) |
| 16 | 6 | 181219 | 7 (3) |
| 17 | 4 | 578619 | 9 (4) |
| 18 | 10 | 1669082 | 27 (7) |
| 19 | 2 | 153579 | 3 (1) |
| 20 | |||
| 22 | 3 | 486195 | 7 (2) |
| 23 | 17 | 1407368 | 34 (11) |
| 24 | 2 | 286693 | 3 (2) |
| 25 | 1 | 107671 | 2 (2) |
| 26 | 4 | 627507 | 12 (2) |
| Totals | 207 | 34957703 | 555 (153) |
The number of selection sweeps identified (N), the sum of their lengths (in bp) and the sum of the SNPs defining them (n) are given per chromosome.
asum of the SNPs identified by two different EHH-based statistics.
Significantly enriched functional term clusters (enrichment score higher than 1.3) for genes identified within the 75 kb regions flanking the candidate selection sweeps identified using genome-wide iHS, XP-EHH and nSL statistics following DAVID analysis.
| Functional Cluster (enrichment score) | Category | Term and Description | Genes | P-Value | Fold Enrichment |
|---|---|---|---|---|---|
| Cluster 1 (3.24) | INTERPRO | IPR014760:Serum albumin, N-terminal | ENSOARG00000013782 ( | 8.61E-05 | 38.40 |
| INTERPRO | IPR000264:ALB/AFP/VDB | ENSOARG00000013782 ( | 8.61E-05 | 38.40 | |
| INTERPRO | IPR020857:Serum albumin, conserved site | ENSOARG00000013782 ( | 8.61E-05 | 38.40 | |
| SMART | SM00103:ALBUMIN | ENSOARG00000013782 ( | 8.81E-05 | 37.89 | |
| INTERPRO | IPR020858:Serum albumin-like | ENSOARG00000013782 ( | 1.70E-04 | 32.00 | |
| PIR_SUPERFAMILY | PIRSF002520:serum albumin | ENSOARG00000013782 ( | 1.19E-03 | 47.29 | |
| INTERPRO | IPR021177:Serum albumin/Alpha-fetoprotein | ENSOARG00000013782 ( | 2.51E-03 | 36.00 | |
| GOTERM_BP_DIRECT | GO:0006810~transport | ENSOARG00000013782 ( | 4.33E-01 | 2.04 | |
| Cluster 2 (1.39) | INTERPRO | IPR023415:Low-density lipoprotein (LDL) receptor class A, conserved site | ENSOARG00000003950 ( | 1.70E-02 | 12.47 |
| SMART | SM00192:LDLa | ENSOARG00000003950 ( | 5.20E-02 | 8.08 | |
| INTERPRO | IPR002172:Low-density lipoprotein (LDL) receptor class A repeat | ENSOARG00000003950 ( | 7.59E-02 | 4.04 | |
| Cluster 3 (1.31) | INTERPRO | IPR023415:Low-density lipoprotein (LDL) receptor class A, conserved site | ENSOARG00000002144 ( | 3.83E-02 | 5.33 |
| SMART | SM00192:LDLa | ENSOARG00000002144 ( | 5.10E-02 | 4.74 | |
| INTERPRO | IPR002172:Low-density lipoprotein (LDL) receptor class A repeat | ENSOARG00000002144 ( | 5.96E-02 | 4.46 |
Figure 3Circular map, created using the ShinyCircos[45] software, summarizing information on Functional Clusters identified using selection sweeps in Djallonké sheep. Chromosomes are shown in the outermost circle. The innermost circles show the distribution of selection sweeps within a chromosome. Differences in height among selection sweeps inform on the number of SNPs defining them. Candidate genes forming the significantly enriched functional term clusters are indicated besides their genomic localization. At the center of the map, links among candidate genes belonging to the same functional cluster are illustrated using the same color.
Description of the sheep genes included in the three functional clusters displaying enrichment score higher than 1.3.
| Functional | Gene | Sheep gene information | Orthologous cattle gene information | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Cluster | name | description | EnsemblID | OAR | start | end | Cattle EnsemblID | BTA | start | end |
| Cluster 1 | albumin [Source:NCBI gene;Acc:443393] | ENSOARG00000013782 | 6 | 88136611 | 88159187 | ENSBTAG00000017121 | 6 | 90232762 | 90251126 | |
| GC, vitamin D binding protein [Source:HGNC Symbol;Acc:HGNC:4187] | ENSOARG00000012835 | 6 | 86619919 | 86657661 | ENSBTAG00000013718 | 6 | 88695940 | 88739180 | ||
| alpha fetoprotein [Source:HGNC Symbol;Acc:HGNC:317] | ENSOARG00000013966 | 6 | 88166794 | 88190190 | ENSBTAG00000017131 | 6 | 90258976 | 90280522 | ||
| afamin [Source:HGNC Symbol;Acc:HGNC:316] | ENSOARG00000014129 | 6 | 88198267 | 88224250 | ENSBTAG00000047833 | 6 | 90291078 | 90312046 | ||
| Cluster 2 | melanocortin 2 receptor [Source:NCBI gene;Acc:443482] | ENSOARG00000003950 | 23 | 43893855 | 43894748 | ENSBTAG00000011038 | 24 | 44006264 | 44024553 | |
| calcium and integrin binding 1 [Source:NCBI gene;Acc:100141300] | ENSOARG00000012128 | 18 | 20723490 | 20727112 | ENSBTAG00000021275 | 21 | 22043354 | 22046525 | ||
| Ovis aries melanocortin 5 receptor (MC5R), mRNA. [Source:RefSeq mRNA;Acc:NM_001078656] | ENSOARG00000002239 | 23 | 43867867 | 43870160 | ENSBTAG00000009143 | 24 | 43983880 | 43984857 | ||
| caveolin 1 [Source:NCBI gene;Acc:445468] | ENSOARG00000001337 | 4 | 51786039 | 51820791 | ENSBTAG00000017869 | 4 | 52173110 | 52208687 | ||
| Cluster 3 | low density lipoprotein receptor class A domain containing 4 [Source:HGNC Symbol;Acc:HGNC:1224] | ENSOARG00000002144 | 23 | 43646788 | 43801055 | ENSBTAG00000009139 | 24 | 43861727 | 43919241 | |
| LDL receptor related protein 11 [Source:VGNC Symbol;Acc:VGNC:30988] | ENSOARG00000002898 | 8 | 73814860 | 73836141 | ENSBTAG00000032007 | 9 | 88115145 | 88183314 | ||
| complement factor I [Source:NCBI gene;Acc:101119307] | ENSOARG00000005291 | 6 | 15708801 | 15753015 | ENSBTAG00000034501 | 6 | 16765034 | 16815352 | ||
| very low density lipoprotein receptor [Source:HGNC Symbol;Acc:HGNC:12698] | ENSOARG00000013303 | 2 | 70468444 | 70486178 | ENSBTAG00000018517 | 8 | 42109679 | 42141155 | ||
Besides the gene name and description, the following information is provided: the identification of the gene retrieved from the Ensembl Genes 91 database, the ovine chromosome (OAR) on which the gene is located and the positions (in bp) of start and end of the gene within the chromosome. The same information is given for the orthologous cattle genes.