| Literature DB >> 35456409 |
Min Wang1,2,3,4,5, Yu Liu1,2,3,4,5, Xiaokun Bi1,2,3,4,5, Hongying Ma6, Guorong Zeng7, Jintu Guo7, Minghao Guo7, Yao Ling1, Chunjiang Zhao1,2,3,4,5.
Abstract
In the present study, genome-wide CNVs were detected in a total of 301 samples from 10 Chinese indigenous horse breeds using the Illumina Equine SNP70 Bead Array, and the candidate genes related to adaptability to high temperature and humidity in Jinjiang horses were identified and validated. We determined a total of 577 CNVs ranging in size from 1.06 Kb to 2023.07 Kb on the 31 pairs of autosomes. By aggregating the overlapping CNVs for each breed, a total of 495 CNVRs were detected in the 10 Chinese horse breeds. As many as 211 breed-specific CNVRs were determined, of which 64 were found in the Jinjiang horse population. By removing repetitive CNV regions between breeds, a total of 239 CNVRs were identified in the Chinese indigenous horse breeds including 102 losses, 133 gains and 4 of both events (losses and gains in the same region), in which 131 CNVRs were novel and only detected in the present study compared with previous studies. The total detected CNVR length was 41.74 Mb, accounting for 1.83% of the total length of equine autosomal chromosomes. The coverage of CNVRs on each chromosome varied from 0.47% to 15.68%, with the highest coverage on ECA 12, but the highest number of CNVRs was detected on ECA1 and ECA24. A total of 229 genes overlapping with CNVRs were detected in the Jinjiang horse population, which is an indigenous horse breed unique to the southeastern coast of China exhibiting adaptability to high temperature and humidity. The functional annotation of these genes showed significant relation to cellular heat acclimation and immunity. The expression levels of the candidate genes were validated by heat shock treatment of various durations on fibroblasts of horses. The results show that the expression levels of HSPA1A were significantly increased among the different heat shock durations. The expression level of NFKBIA and SOCS4 declined from the beginning of heat shock to 2 h after heat shock and then showed a gradual increase until it reached the highest value at 6 h and 10 h of heat shock, respectively. Breed-specific CNVRs of Chinese indigenous horse breeds were revealed in the present study, and the results facilitate mapping CNVs on the whole genome and also provide valuable insights into the molecular mechanisms of adaptation to high temperature and humidity in the Jinjiang horse.Entities:
Keywords: Chinese indigenous horse breeds; Jinjiang horse; copy number variants; heat adaptation
Mesh:
Year: 2022 PMID: 35456409 PMCID: PMC9033042 DOI: 10.3390/genes13040603
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Sample information of ten Chinese indigenous horse breeds.
| Breed Name | Number of Original Samples | Number of Samples after Quality Control | Region | Group | Climate Type |
|---|---|---|---|---|---|
| Kazakh | 17 | 15 | Xinjiang | Kazakh Horse Type | Temperate continental climate |
| Inner | 23 | 20 | Inner Mongolia | Mongolian Horse Type | Temperate continental climate |
| Daan | 26 | 25 | Jilin Province | Mongolian Horse Type | Temperate monsoon climate |
| Chakouyi | 34 | 30 | Gansu Province | Hequ Horse Type | Alpine climate |
| Naqu | 29 | 27 | Tibet | Tibetan Horse Type | Alpine climate |
| Jinjiang | 57 | 55 | Fujian Province | Southwest Horse Type | Subtropical maritime monsoon climate |
| Zhaotong | 26 | 25 | Yunnan Province | Southwest Horse Type | Subtropical monsoon climate |
| Tengchong | 22 | 22 | Yunnan Province | Southwest Horse Type | Subtropical monsoon climate |
| Lijiang | 31 | 28 | Yunnan Province | Southwest Horse Type | Subtropical monsoon climate |
| Baise | 36 | 35 | Guangxi | Southwest Horse Type | Subtropical monsoon climate |
| Total | 301 | 282 | - | - | - |
Statistical results of CNV identification *.
| Breed Name | Sample Size | Number of CNVs | Average Number of Individual CNVs | Average Length of CNVs (Kb) | Length Range of CNVs (Kb) |
|---|---|---|---|---|---|
| Baise | 35 | 64 (24) | 1.83 (0.69) | 270.21 | 3.95~1166.66 |
| Chakouyi | 30 | 54 (23) | 1.80 (0.77) | 209.09 | 2.69~1166.66 |
| Daan | 25 | 34 (10) | 1.36 (0.40) | 307.60 | 23.50~2023.07 |
| Inner_Mongolian | 20 | 37 (8) | 1.85 (0.40) | 253.25 | 2.69~1269.78 |
| Jinjiang | 55 | 134 (72) | 2.44 (1.31) | 204.90 | 2.69~1993.59 |
| Kazakh | 15 | 20 (4) | 1.33 (0.27) | 263.03 | 13.92~1432.70 |
| Lijiang | 28 | 74 (36) | 2.64 (1.29) | 305.76 | 2.69~1886.77 |
| Naqu | 27 | 48 (11) | 1.78 (0.41) | 303.24 | 19.78~1607.80 |
| Tengchong | 22 | 45 (16) | 2.05 (0.73) | 275.73 | 19.30~1993.59 |
| Zhaotong | 25 | 67 (24) | 2.68 (0.96) | 226.83 | 1.06~1578.11 |
| Total | 282 | 577 (228) | 2.05 (0.81) | 261.96 | 1.06~2023.07 |
* The numbers in parentheses are breed-specific CNV counts.
Statistical results of CNVR identification *.
| Breed Name | Sample Size | CNVRs | Gain | Loss | Mixed | Average Number of |
|---|---|---|---|---|---|---|
| Baise | 35 | 52 (21) | 36 (11) | 14 (9) | 2 (1) | 1.49 (0.60) |
| Chakouyi | 30 | 50 (24) | 32 (10) | 17 (13) | 1 (1) | 1.67 (0.80) |
| Daan | 25 | 29 (9) | 19 (3) | 10 (6) | 0 (0) | 1.16 (0.36) |
| Inner_Mongolian | 20 | 33 (8) | 22 (1) | 11 (7) | 0 (0) | 1.65 (0.40) |
| Jinjiang | 55 | 113 (64) | 79 (38) | 33 (25) | 1 (1) | 2.05 (1.15) |
| Kazakh | 15 | 19 (6) | 15 (5) | 4 (1) | 0 (0) | 1.27 (0.40) |
| Lijiang | 28 | 63 (31) | 38 (12) | 23 (18) | 2 (1) | 2.25 (1.11) |
| Naqu | 27 | 42 (11) | 26 (3) | 15 (8) | 1 (0) | 1.56 (0.41) |
| Tengchong | 22 | 38 (15) | 30 (9) | 7 (5) | 1 (1) | 1.73 (0.68) |
| Zhaotong | 25 | 56 (22) | 38 (13) | 17 (9) | 1 (0) | 2.24 (0.88) |
| Total | 282 | 495 (211) | 335 (105) | 151 (101) | 9 (5) | 1.76 (0.75) |
* The numbers in parentheses are breed-specific CNVR counts.
Figure 1Breed-specific CNVR analysis across the ten horse breeds. The lower left bar chart shows the total amount of CNVRs contained in each original CNVR dataset (classified by horse breeds). In the lower right chart, the first 10 dots indicate the corresponding horse breeds on the left, and their breed-specific CNVR numbers are shown in the upper bar chart. The shared CNVRs between or among breeds are indicated with the vertical lines connecting the dots which represent the breeds on the left, and the numbers of CNVRs shared by the connected breeds are shown in the upper bar chart.
Descriptive statistics of CNVRs on equine autosomes.
| Chr | Length of Chromosomes (Mb) | Number of CNVRs | Length of CNVRs (bp) | Percentage (%) | Average Length of CNVRs (bp) |
|---|---|---|---|---|---|
| 1 | 188.26 | 34 | 6,227,042 | 3.31% | 183,148.29 |
| 2 | 121.35 | 16 | 1,984,086 | 1.64% | 124,005.38 |
| 3 | 121.35 | 17 | 2,359,505 | 1.94% | 138,794.41 |
| 4 | 109.46 | 24 | 2,907,700 | 2.66% | 121,154.17 |
| 5 | 96.76 | 7 | 458,305 | 0.47% | 65,472.14 |
| 6 | 87.23 | 8 | 556,343 | 0.64% | 69,542.88 |
| 7 | 100.79 | 9 | 1,237,197 | 1.23% | 137,466.33 |
| 8 | 97.56 | 9 | 1,463,869 | 1.50% | 162,652.11 |
| 9 | 85.79 | 4 | 753,364 | 0.88% | 188,341.00 |
| 10 | 85.16 | 8 | 1,117,790 | 1.31% | 139,723.75 |
| 11 | 61.68 | 3 | 348,307 | 0.56% | 116,102.33 |
| 12 | 36.99 | 11 | 5,799,518 | 15.68% | 527,228.91 |
| 13 | 43.78 | 2 | 434,027 | 0.99% | 217,013.5 |
| 14 | 94.6 | 4 | 653,789 | 0.69% | 163,447.25 |
| 15 | 92.85 | 7 | 979,123 | 1.05% | 139,874.71 |
| 16 | 88.96 | 2 | 701,010 | 0.79% | 350,505.00 |
| 17 | 80.72 | 7 | 832,567 | 1.03% | 118,938.14 |
| 18 | 82.64 | 14 | 2,861,413 | 3.46% | 204,386.64 |
| 19 | 62.68 | 1 | 1,128,766 | 1.80% | 1,128,766.00 |
| 20 | 65.34 | 6 | 698,764 | 1.07% | 116,460.67 |
| 21 | 58.98 | 4 | 658,611 | 1.12% | 164,652.75 |
| 22 | 50.93 | 2 | 973,451 | 1.91% | 486,725.5 |
| 23 | 55.56 | 3 | 504,860 | 0.91% | 168,286.67 |
| 24 | 48.29 | 3 | 520,059 | 1.08% | 173,353.00 |
| 25 | 40.28 | 4 | 1,067,167 | 2.65% | 266,791.75 |
| 26 | 43.15 | 13 | 2,429,584 | 5.63% | 186,891.08 |
| 27 | 40.25 | 4 | 602,367 | 1.50% | 150,591.75 |
| 28 | 47.35 | 2 | 310,645 | 0.66% | 155,322.5 |
| 29 | 34.78 | 4 | 222,270 | 0.64% | 55,567.5 |
| 30 | 31.4 | 2 | 198,567 | 0.63% | 99,283.5 |
| 31 | 26 | 5 | 750,488 | 2.89% | 150,097.6 |
| Total | 2280.92 | 239 | 41,740,554 | 1.83% | 174,646.67 |
Figure 2Statistics of CNVRs on equine autosomes. (a) Map of CNVRs in the horse genome. Red, blue and green represent gain, loss and both (gain and loss), respectively. (b) Size range distribution of the CNVRs detected. (c) Scale distribution of the CNVRs detected.
Comparison of CNVRs identified in this study with those identified in ten previous studies.
| Study | Platform | Breed | Sample | CNVR Count | CNVR Range | Genome Enrichment % | Reference Genome | Overlapped CNVR Count with the Present Study |
|---|---|---|---|---|---|---|---|---|
| Doan et al. (2012) | Array CGH | 15 | 16 | 775 | 0.2–3.5 | 3.7 | EquCab 2.0 | 22 |
| Metzger et al. (2013) | Illumina Equine 70 K SNP BeadChip | 17 | 717 | 50 | 0.5–0.9 | 1.7–22.0 | EquCab 2.0 | 28 |
| Dupuis et al. (2013) | Illumina Equine 70 K SNP BeadChip | 4 | 447 | 478 | 0.1–2.7 | 2.3 | EquCab 2.0 | 24 |
| Ghosh et al. (2014) | Array CGH | 16 | 38 | 258 | 1–2.5 | 1.15 | EquCab 2.0 | 19 |
| Wang et al. (2014) | Array CGH | 6 | 6 | 353 | 6.1–0.5 | 0.61 | EquCab 2.0 | 11 |
| Kader et al. (2016) | Illumina Equine 70 K SNP BeadChip | 3 | 96 | 122 | 0.2–2.2 | 0.8 | EquCab 2.0 | 14 |
| Ghosh et al. (2016) | Array CGH | NA | 63 | 245 | NA | NA | EquCab 2.0 | 20 |
| Schurink et al. (2018) | Axiom Equine Genotyping Array (670,796 SNPs) | 1 | 222 | 5350 | 0.12–1.03 | 11.2 | EquCab 2.0 | 22 |
| Solé et al. (2019) | Axiom Equine Genotyping Array (670,796 SNPs) | 8 | 1755 | 939 | 1–21.3 | NA | EquCab 2.0 | 80 |
| Corbi-Botto et al. (2019) | Illumina GGP Equine 70 K | 1 | 24 | 87 | 0.5–2 | 0.6 | EquCab 2.0 | 10 |
| Present study | Illumina Equine 70 K SNP BeadChip | 10 | 300 | 239 | 1.06–2.44 | 1.83 | EquCab 3.0 | - |
Functional enrichment analysis of CNVR-overlapping genes in the Jinjiang horse.
| Category | ID | Term | Counts | Genes | |
|---|---|---|---|---|---|
| KEGG | ecb05134 | Legionellosis | 4 | 3.33 × 10−3 |
|
| KEGG | ecb03040 | Spliceosome | 5 | 5.81 × 10−3 |
|
| KEGG | ecb04064 | NF-kappa B signaling pathway | 4 | 1.02 × 10−2 |
|
| KEGG | ecb05145 | Toxoplasmosis | 4 | 1.45 × 10−2 |
|
| KEGG | ecb05166 | Human T-cell leukemia virus 1 infection | 5 | 1.86 × 10−2 |
|
| GO_BP | GO:0051092 | Positive regulation of NF-kappa B transcription factor activity | 6 | 1.82 × 10−3 |
|
| GO_BP | GO:0032757 | Positive regulation of interleukin-8 production | 3 | 1.25 × 10−2 |
|
| GO_BP | GO:0007274 | Neuromuscular synaptic transmission | 3 | 1.25 × 10−2 |
|
| GO_BP | GO:0002876 | Positive regulation of chronic inflammatory response to antigenic stimulus | 2 | 1.31 × 10−2 |
|
| GO_MF | GO:0003676 | Nucleic acid binding | 14 | 1.35 × 10−2 |
|
| GO_MF | GO:0004872 | Receptor activity | 6 | 1.48 × 10−2 |
|
| GO_MF | GO:0005102 | Receptor binding | 7 | 3.02 × 10−2 | |
| GO_MF | GO:0031072 | Heat shock protein binding | 3 | 3.16 × 10−2 |
|
| GO_CC | GO:0048471 | Perinuclear region of cytoplasm | 11 | 9.40 × 10−3 |
|
| GO_CC | GO:0005887 | Integral component of plasma membrane | 18 | 1.42 × 10−2 |
|
| GO_CC | GO:0005913 | Cell–cell adherens junction | 7 | 2.24 × 10−2 |
Figure 3Statistics of GO terms and pathways. (a) Histogram of the top 30 GO terms. The ordinate shows the enriched GO term; the abscissa shows the number of genes in the term. Orange, green and blue indicate biological processes, cellular components and molecular functions, respectively. (b) Bubble diagram of the top 20 pathways. The pathway names are shown in the legend on the left. The abscissa is the enrichment factor, which represents the ratio of the proportion of genes annotated to a pathway in a differential gene to the proportion of genes in all genes annotated to that pathway. The larger the enrichment factor, the more significant the level of enrichment of the differentially expressed genes in this pathway. The size and color of the dots represent the number of enriched genes and the magnitude of significance, respectively.
Figure 4Expression levels of the four candidate genes at different heat shock durations. Means without a common superscript are significantly different (p < 0.05) from others.