Literature DB >> 28222154

A non-synonymous SNP with the allele frequency correlated with the altitude may contribute to the hypoxia adaptation of Tibetan chicken.

Sichen Li1, Diyan Li1, Xiaoling Zhao1, Yan Wang1, Huadong Yin1, Lanyun Zhou1, Chengling Zhong1, Qing Zhu1.   

Abstract

The hypoxia adaptation to high altitudes is of considerable interest in the biological sciences. As a breed with adaptability to highland environments, the Tibetan chicken (Gallus gallus domestics), provides a biological model to search for genetic differences between high and lowland chickens. To address mechanisms of hypoxia adaptability at high altitudes for the Tibetan chicken, we focused on the Endothelial PAS domain protein 1 (EPAS1), a key regulatory factor in hypoxia responses. Detected were polymorphisms of EPAS1 exons in 157 Tibetan chickens from 8 populations and 139 lowland chickens from 7 breeds. We then designed 15 pairs of primers to amplify exon sequences by Sanger sequencing methods. Six single nucleotide polymorphisms (SNPs) were detected, including 2 missense mutations (SNP3 rs316126786 and SNP5 rs740389732) and 4 synonymous mutations (SNP1 rs315040213, SNP4 rs739281102, SNP6 rs739010166, and SNP2 rs14330062). There were negative correlations between altitude and mutant allele frequencies for both SNP6 (rs739010166, r = 0.758, p<0.001) and SNP3 (rs316126786, r = 0.844, P<0.001). We also aligned the EPAS1 protein with ortholog proteins from diverse vertebrates and focused that SNP3 (Y333C) was a conserved site among species. Also, SNP3 (Y333C) occurred in a well-defined protein domain Per-AhR-Arnt-Sim (PAS domain). These results imply that SNP3 (Y333C) is the most likely casual mutation for the high-altitude adaption in Tibetan chicken. These variations of EPAS1 provide new insights into the gene's function.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28222154      PMCID: PMC5319789          DOI: 10.1371/journal.pone.0172211

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Low oxygen is one of the more severe environmental challenges for animals living in high-altitude regions such as the Tibetan Plateau, where the average altitude is above 4000 m. The partial pressure of oxygen at 4000 m is approximately 50% of that at sea-level, and the hypoxia imposes severe constraints on aerobic metabolism and leading to high-altitude illness[1, 2][3] and the mechanisms of hypoxic adaptation [4]. The Tibetan chicken (Gallus gallus; TC) is an aboriginal breed found in the Qinghai-Tibetan Plateau (QTP), where the altitude range from 2200 to 4100 m. They are smaller, mature at older ages and lay smaller eggs than breed at lower altitude (Table A in S1 File). Also, TC has higher red blood cells numbers, blood oxygen affinity, and hemoglobin concentrations [5], and loser mean corpuscular volume. Hatchability at the high altitude is greater for TC [6] than that of lowland chickens [7] and they are is less sensitive to pulmonary hypertension syndrome (PHS) and hypocapnia [8, 9]. It is obviously that TC process unique physiological properties and there provide a model for studying genetic mechanisms of hypoxia adaptation. Endothelial PAS domain protein 1 (EPAS1) (also known as hypoxia inducible factor-2a (HIF-2a)), belongs to a member of the transcription factor family characterized by a basic helix-loop-helix (bHLH) and by a (Per-AhR-Arnt-Sim) PAS domain. Thus, it plays a prominent role in the process of adaptation to hypoxia and in the promotion of livability during hypoxia stress [10, 11]. EPAS1 is a prime regulator during chronic hypoxia and directly regulates key genes such as Erythropoietin (EPO), and vascular endothelial growth factor (VEGF) [12]. Also, EPAS1 has been implicated processes such as erythropoiesis, iron homeostasis, pulmonary hypertension and vascular permeability [13, 14]. Genome-wide scans and whole genome re-sequencing analyses to identify selection signatures of high-altitude adaptation in vertebrates have been performed across a range of species, including humans[15], dogs [16, 17], grey wolves [18], yaks [19], Tibetan antelopes [20], and cattle [21]. In the humans, dogs, and grey wolf convergent evolution for EPAS1 gene has been noted [15-18]. Because of an association between the allele frequency of the SNPs near or in EPAS1 and the altitude. Other studies have focused on the effects of mitochondrial DNA polymorphisms in adaptation of TC to high-altitude. MT-CO3 [22] and ATP-6 [23] are candidate genes of hypoxia adaptation. Genes involved in the Ca2+ signaling pathway might be related to the adaptation of TC to hypoxic conditions at high altitude[24], and genome re-sequencing, also identified unique hypoxia adaptation of TC, especially genes related to development of the cardiovascular and respiratory systems, DNA repairs, responses to radiation, inflammation, and immune responses [24]. In the present study, we hypothesis that the sequence variations in the EPAS1 gene may contribute to the high altitude adaptation in TC. Here, we compared on the polymorphisms of EPAS1 coding sequence in populations that included 157 Tibetan chickens and 139 lowland chickens to determine whether there were genetic differences between high and low altitude chickens in EPAS1 gene.

Materials and methods

Sampling

In total, 157 Tibetan chickens (TC) were collected from 8 different locations in Qinghai, Tibet, Yunnan, and Sichuan provinces. They included 4 Tibet areas (Shigatse (RKZ), Lhoka (SN), Lhasa (LS), and Nyingchi (LZ)), 2 Sichuan areas (Garze (GZ) and Aba (AB)), 1 Yunnan area (Diqing (DQ)), and 1 Qinghai area (Haibei (QH)). Also, 139 lowland chickens (LC) from 7 different Chinese native chicken breeds were sampled (below 1800 m). Details of sample number, breed, altitude, latitude, and longitude are provided in Table 1. A sample of 1 mL blood from each individual was obtained from the brachial vein. No animal was injured in during capture and blood collection. The protocol was approved by the Committee of the Care and Use of Laboratory Animals of the State-level Animal Experimental Teaching Demonstration Center of Sichuan Agricultural University (Approval ID: DKY-S20143136).
Table 1

Altitude, longitude and latitude of the sampling locations for the 8 Tibetan and 7 Lowland chicken populations.

BreedSampling locationPopulationSample SizeAltitude (m)Longitude (E)Latitude (N)
Tibetan
Shigatse,TibetShigatse (RKZ)30390089°60'28°92'
Lhoka,TibetLhoka (SN)14370090°03'28°27'
Lhasa,TibetLhasa (LS)24365091°01'29°26'
Garze,SichuanGarze (GZ)16339099°22'28°34'
Aba,SichuanAba (AB)253300102°33'31°27'
Diqing,YunnanDiqing (DQ)13328099°53'28°08'
Haibei,QinghaiHaibei (QH)72700100°86'36°85'
Nyingchi,TibetNyingchi (LZ)20240094°26'29°25'
Lowland
Emei blackEmei,SichuanEmei (EM)231800103°41'29°49'
ShimianYaan,SichuanShimian (SM)141120102°13'29°40'
Jiuyuan blackWanyuan,SichuanJiuyuan (JY)25900108°21'31°84'
Pengxian yellowYaan,SichuanPengxian (PX)12600102°98'29°98'
Blue-shelled DongxiangYaan,SichuanDongxiang(LK)14600102°98'29°98'
Muchuan black-boneMuchuan,SichuanMuchuan (MC)20540103°90'29°02'
WenchangWenchang,HainanWenchang(WC)3110110°87'19°72'

DNA extraction, PCR amplification, and genotyping

DNA was extracted by the phenol-chloroform method [25]. The high altitude DNA pool was constructed from 50 samples, each containing 100 ng DNA from randomly selected TC. The LC pool was constructed in the same way. These two gene pools were used to detect SNPs. After blasting chicken EPAS1 mRNA sequence (Ensembl accession ENSGALT00000016253) in the chicken genome, we designed 15 primer pairs (Table 2) to amplify the 15 exons using Primer V6.0 software [26]. PCR was performed in 25 μL of reaction volume. The ingredient contain 50 ng DNA template, 1×buffer (including 1500 μmol L-1 Mg2Cl2, 200 μmol L-1 dNTPs, and 1.5 U of Taq DNA polymerase) and 1 μmol L-1 of each primer. Besides, the PCR procedure was as following: initial denaturation for 5 min at 9500, followed by 35 cycles of 95°C for 30 s; annealing at prescribed annealing temperature for 30 s; and primer extension at 72°C for 45 s. The final extension was performed at 72°C for 7 min. PCR products were sequenced on an ABI 3730 DNA sequencer. Then, the SNPs were determined and pairs of 4 primers (P1, P7, P12, and P14) were used for individual genotyping. PCR procedures and systems were the same as above.
Table 2

Primer information for detecting SNPs in coding regions.

NamesSequences (5'-3')Sizes of product (bp)Annealing temperatures (°C)Target regionPurpose
P1F1:CAAGGGGTCTGAAGGCTGAT53455Exon1Identification of SNP and individual genotyping
R2:TGGGAAAACAAAACAAAAACTG
P2F:GTTGCCTTGTTATTGGATGTGAT47356Exon2Identification of SNP
R:GGGGGAGTTGTCTTTTCTTGAT
P3F:CCCCCACTAATTGTCTCCTG39555Exon3Identification of SNP
R:TCCACCTGTTTTAACTTGATGAA
P4F:GATGGTCTCTGCAGCTATGTTAC42255Exon4Identification of SNP
R:AAGTGTGAGGAGGGCAAGTGT
P5F:GAGTCGGTTTTTCGTTCACAA44355Exon5Identification of SNP
R:AGCAGGGGTACCATTTTCTCTAA
P6F:ACTGTGGTTTTAATCCCTTTTGTT26355Exon6Identification of SNP
R:CTACTGGTTGTGGCTTATTATTTG
P7F:ATTGGCCACCCTGTTGATG57955Exon7Identification of SNP and individual genotyping
R:AGAAGAAAAAGGCTATGGTGTAAT
P8F:GGAACTGTGCCTTGCCTGTCA53057.8Exon8Identification of SNP
R:GCCTGGTTCTGCCCTCATTCA
P9F:AGGGCAGAACCAGGCTTTTT42055Exon9Identification of SNP
R:TGGGATTTGCATAGAGAACATAAC
P10F:ACTGTAGTTGTCTTGGCTTCTTAT43855Exon10Identification of SNP
R:AGGCTGCTGCTTCTATTTGT
P11F:AGCTGGCCACTTTCCTTTAC74055Exon11Identification of SNP
R:AGCCATGCCTGTCTCCTT
P12F:GTGGGATGACTGACAAAAGGAA39755Exon12Identification of SNP and individual genotyping
R:TGAGGTGGGTTAATGACAGATGG
P13F:GAGTGGGGTAATCATAAGAA43455Exon13Identification of SNP
R:ATTTGCCTCCCACATAAG
P14F:TGTGCAGGCATTTCAGAG44355Exon14Identification of SNP and individual genotyping
R:TGCCAACATTTTTCAGC
P15F:GGTATAAATAGAAGAGGGAAGAGA50355Exon15Identification of SNP
R:ATAAAGGGAGGGTAATCAAACT

1.2 F and R are the abbreviation of forward and reverse primer, respectively.

1.2 F and R are the abbreviation of forward and reverse primer, respectively.

Data analysis

The sequences were edited and aligned by DNAstar Software (DNAstar Inc. Madison, WI, USA). Sequence variations were identified using MEAG5 [27]. The complete EPAS1 sequence of Red Jungle Fowl (G.g gallus, Ensemble accession number: ENSGALG00000010005) was used as reference for determining the variable sites of chicken. Hardy-Weinberg equilibrium, pairwise linkage disequilibrium (LD), and association analysis were conducted by Haploview software (version 4.2, http://www.broad.mit.edu/mpg/haploview/). We calculated the nucleotide diversity (Pi) and average number of nucleotide differences (K) for the TC and LC populations by DNAspV5 [28]. The Fixation index (F) between the high (≥3300m) and low altitude groups (≤600m) were calculated by Arlequin 3.5 [29]. Differences of allele frequencies between TC and LC groups were analyzed using Pearson’ Chi-square test, odds ratio (OR), 95% confidence intervals (95% CLs), and MAF (Minor allele frequencies) were calculated. Correlation coefficients between the altitude and frequencies of mutant alleles were calculated. All these parameters were calculated by SPSS software, Version 22 [30] and P < 0.05 was considered as statistical significance.

Protein secondary structure prediction and variable sites analysis

The protein domains of EPAS1 were predicted by the Pfam web service [31]. The EPAS1 ortholog proteins of 16 vertebrates were retrieved from Ensembl [32]. Multiple sequence alignments were performed by Clustal X (V2.1) [33] and displayed by Jalview (v2.8) [34]. To find the best model for a Maximum Likelihood (ML) based tree, we use Prottest V3.4 to compare all models and chose the ML tree as starting topology. Then we used BEAST V1.8.3 to generate the trees by the best model. To find the best fit model, we use Prottest V3.4 [35] to exam all the models and chose the Maximum likelihood tree as starting topology. Then BEAST 2 [36] were used to generate the trees by the best model. The protein sequence of chicken EPAS1 was searched in Ensembl database (ENSGALP00000016234).

Results

Sequence variations in EPAS1 gene

The length of EPAS1 exon sequences was 2730 bp (Ensembl accession numbers ENSGALT00000016253; no insertion/deletions were detected). A total of 6 single-nucleotide polymorphisms (SNPs) were found in the exon of EPAS1 gene from the 15 chicken populations (Table 3 and Fig 1). There were four synonymous mutations (SNP1: rs315040213, SNP2: rs14330062, SNP4: rs739281102, and SNP6: rs739010166) and two non-synonymous substitutions (SNP3: rs316126786 and SNP5: rs740389732).
Table 3

Four synonymous mutations and two non-synonymous mutations in EPAS1.

SNP locus1Genomic location2Nucleotide mutationsAmino acid mutations
Reference alleleMutation alleleReferenceMutation
SNP1 rs315040213Chr3:25981719 (+)CTTyrTyr
SNP2 rs14330062Chr3:25976150 (+)TCThrThr
SNP3 rs316126786Chr3:25976203 (+)AGTyrCys
SNP4 rs739281102Chr3:25980904 (+)AGArgArg
SNP5 rs740389732Chr3:25981687 (+)AGMetVal
SNP6 rs739010166Chr3:25981719 (+)CTThrThr

1 SNP loci are exported from dbSNP.

2The genomic location of EPAS1 mutation is based on Ensembl Gallus gallus version 84.4 (galgal4) Chromosome 3.

Fig 1

The sequencing images of 6 SNPs in the coding region of chicken EPAS1 gene.

The position and base substitutions in coding regions refer to the sequence of ensemble ENSGALT00000016253.

The sequencing images of 6 SNPs in the coding region of chicken EPAS1 gene.

The position and base substitutions in coding regions refer to the sequence of ensemble ENSGALT00000016253. 1 SNP loci are exported from dbSNP. 2The genomic location of EPAS1 mutation is based on Ensembl Gallus gallus version 84.4 (galgal4) Chromosome 3.

SNP genotypes

Genotype frequencies are presented in Table 4 and Fig 2. To detect SNP sites which had putative associations with a high-altitude environment, we focused on genotypes which difference (P < 0.05) between TC and LC. Excluding SNP2 (P = 0.062), SNP4 (P = 0.108, OR = 0.623, 0.408 ≤ 95% CI ≤ 0.952), and SNP5 (P = 0.119, OR = 0.675, 0.437 ≤ 95% CI ≤ 1.045), the other SNPs, which were the SNP1 (P = 0.001, OR = 1.826, 1.246 ≤ 95% CI ≤ 2.676), SNP3 (P = 2.16E-06, OR = 0.257, 0.142 ≤ 95% CI ≤ 0.464), and SNP6 (P = 3.76E-07, OR = 0.212, 95% CI = 0.120–0.374) were different between TC and LC groups. Results of odds ratios shared that the probability of the ability to adapt hypoxia for chickens with a C allele at SNP1 polymorphic site was more than 1.826 times that of chickens with T allele. The odds ratios of allele A at SNP3 and allele C at SNP6 were less 0.257 and 0.212 fold than chickens with G and T alleles, respectively.
Table 4

Genotype of the EPAS1 gene SNPs.

SNP locusGenotypesp value in Pearson Chi-square test
SNP1 rs315040213CCCTTT0.001
TC174 (47.10%)370 (44.50%)13 (8.20%)
LC294 (67.60%)36 (25.80%)9 (6.40%)
SNP2 rs14330062TTTCCC0.062
TC152 (96.80%)5 (3.10%)0 (0%)
LC139 (100%)0 (0%)0 (0%)
SNP3 rs316126786AAAGGG2.16E-06
TC142 (90.40%)14 (8.90%)1 (0.60%)
LC94 (67.60%)42 (30.20%)3 (2.10%)
SNP4 rs739281102AAAGGG0.108
TC119 (75.70%)30 (19.10%)8 (5%)
LC90 (64.70%)38 (27.30%)11 (7.90%)
SNP5 rs740389732AAAGGG0.119
TC122 (77.70%)26 (16.50%)9 (5.70%)
LC94 (67.60%)36 (25.80%)9 (6.40%)
SNP6 rs739010166CCCTTT3.76E-07
TC141 (89.80%)15 (9.50%)1 (0.60%)
LC91 (65.40%)37 (26.60%)11 (7.90%)

1TC and

2LC abbreviations for Tibetan and lowland chickens, respectively.

3Values represent the genotype number and percentage of the population.

Fig 2

All samples genotypes.

These genotypes were derived from Sanger sequencing of 157 Tibetan and 139 Lowland chickens and were marked as red block (homozygous reference, abbreviated as ref), yellow block (heterozygote, abbreviated as het), and green block (homozygote alternative, abbreviated as alt). N: non-synonymous; S: synonymous.1The mutation sites of EPAS1. 2The abbreviations of sample locations, shown are in Table 1.

All samples genotypes.

These genotypes were derived from Sanger sequencing of 157 Tibetan and 139 Lowland chickens and were marked as red block (homozygous reference, abbreviated as ref), yellow block (heterozygote, abbreviated as het), and green block (homozygote alternative, abbreviated as alt). N: non-synonymous; S: synonymous.1The mutation sites of EPAS1. 2The abbreviations of sample locations, shown are in Table 1. 1TC and 2LC abbreviations for Tibetan and lowland chickens, respectively. 3Values represent the genotype number and percentage of the population.

Correlation coefficients between altitudes and allele frequencies of the EPAS1 gene

Table 5 contains the observed allelic frequencies of each polymorphic site for each sample from the 15 populations (Allele and genotype frequencies of all SNPs were shown in Tables B, C, and D in S1 File.). We found that SNP2 was unique to TC with the allele frequency of 1.5%. The other five SNPs (SNP1, SNP4 and SNP6, SNP3, and SNP5) were shared by TC and LC. Using Pearson chi-square test and Fisher exact test except the SNP5 (P = 0.077), the other four SNPs whose allele frequencies were significantly different between TC and LC, implying that these four SNPs were associated with high-altitude adaptation. Notably, The non-synonymous substitutions SNP3 (P = 1.94E-06) was significantly different between TC and LC. There were significant negative linear correlations between altitude and the frequency of mutant allele at SNP6 (rs739010166, r = 0.758, p<0.001) and SNP3 (rs316126786, r = 0.844, P<0.001). There was no correlation for the other SNPs.
Table 5

Allele frequency, correlation and odds ratio of mutation loci in the EPAS1 gene.

SNP sitesAllele distributionp value in Pearson Chi-square testOR195% CI2MAF3RPearson rP-value8
AlleleTC5LC6
SNP1 rs315040213C218 (69.4%)7224 (80.5%)0.0011.8261.246-~2.6760.253370.2310.480.07
T96 (30.6%)54 (19.4%)
SNP2 rs14330062T5 (1.5%)0 (0%)0.034NA4NA-NA0.00840.1320.3630.184
C309 (98.4%)278 (100%)
SNP3 rs316126786A298 (94.9%)230 (82.7%)1.94E-060.2570.142–0.4640.10810.7120.8440.000*
G16 (5%)48 (17.2%)
SNP4 rs739281102A268 (85.3%)218 (78.4%)0.0280.6230.408–0.9520.1790.2230.4720.076
G46 (14.6%)60 (21.5%)
SNP5 rs740389732A270 (85.9%)224 (80.5%)0.0770.6750.437–1.0450.16550.170.4130.126
G44 (14%)54 (19.4%)
SNP6 rs739010166C297 (94.5%)219 (78.7%)9.53E-090.2120.120–0.3740.12830.5750.7580.001*
T17 (5.4%)59 (21.2%)

1OR represents odds ratio

2Cl represents confidence interval

3MAF represents the minor allele frequency.

4 NA means cannot be calculated.

5 TC abbreviations for Tibetan chicken.

6 LC abbreviations for Lowland chicken.

7 Values represent the allele frequency and percentage in the population.

8P-value was calculated by Pearson r.

*represent significant level P<0.05.

1OR represents odds ratio 2Cl represents confidence interval 3MAF represents the minor allele frequency. 4 NA means cannot be calculated. 5 TC abbreviations for Tibetan chicken. 6 LC abbreviations for Lowland chicken. 7 Values represent the allele frequency and percentage in the population. 8P-value was calculated by Pearson r. *represent significant level P<0.05. There was a striking change of allele frequency along with the altitude elevation in SNP3. As shown in Fig 3A, the high mutant allele frequencies of SNP3 (Y333C) in the populations WC (27.4%, n = 31, 10 m), LK (25%, n = 14, 600 m), and PX (20.8%, n = 12, 600 m) were found. Then, the frequencies declined in the populations JY (14%, n = 25, 900 m), SM (10.7%, n = 14, 1120 m), and EM (8.7%, n = 23, 1800 m). And they dropped the populations QH (6.7%, n = 15, 2700 m) and DQ (8.3%, n = 12, 3280 m). Moreover, when the altitude was more than 3,400 m, the mutation essentially disappeared, as shown in the populations GZ (0%, n = 16, 3390 m), LS (0%, n = 24, 3650 m), SN (0%, n = 14, 3700 m), and RKZ (0%, n = 30, 3900 m). The same tendency was found in SNP6 (Fig 3C).
Fig 3

The correlation analysis between frequencies and the altitude for SNP3 and SNP6.

(A) Plot of the correlation analysis between frequencies of the SNP3 and the altitudes among fifteen population, “r” represents the correlation coefficient (r = 0.844, P<0.001). (B) The Fst of 15 different populations from TC and LC were calculated based on SNP3 of EPAS1 gene. (C) Plot of the correlation analysis between frequencies of the SNP6 and the altitudes among fifteen chicken population, “r” represents the correlation coefficient (r = 0.578, P = 0.001). (D) The Fst of 15 different populations from TC and LC were calculated based on SNP6 of EPAS1 gene.

The correlation analysis between frequencies and the altitude for SNP3 and SNP6.

(A) Plot of the correlation analysis between frequencies of the SNP3 and the altitudes among fifteen population, “r” represents the correlation coefficient (r = 0.844, P<0.001). (B) The Fst of 15 different populations from TC and LC were calculated based on SNP3 of EPAS1 gene. (C) Plot of the correlation analysis between frequencies of the SNP6 and the altitudes among fifteen chicken population, “r” represents the correlation coefficient (r = 0.578, P = 0.001). (D) The Fst of 15 different populations from TC and LC were calculated based on SNP6 of EPAS1 gene. These results suggested that the frequencies of SNP3 and SNP6 in EPAS1 decreased with the elevation of altitude.

The Hardy-Weinberg equilibrium, nucleotide diversity, and population differentiation of the EPAS1

To test whether all of the SNPs at EPAS1 loci were in equilibrium in each population, the Hardy-Weinberg equilibrium (HWE) tests was performed on each SNP locus (Table 6). Except for SNP4 and SNP5, all alleles in TC were in accordance with Hardy-Weinberg equilibrium (P>0.05). In contrast, there was a departure from Hardy-Weinberg equilibrium (P < 0.05) for LC in SNP1, SNP4, SNP5, and SNP6. Notably, SNP3 in both populations was in accordance with Hardy-Weinberg equilibrium. Besides, there were no significant diferences between TC and LC for nucleotide diversity (Pi) and the average number of nucleotide differences (K) (Table E in S1 File). Then we calculated the LD block of Tibetan chicken and Lowland chicken and we found close linkage between SNP3 and SNP6 in TC (Figure A in S1 File). Besides, the observed heterozygosity of all SNPs was from 0.03 to 0.44 in TC and from 0 to 0.30 in LC (Table 6). The minor allele frequencies (MAF) of all mutations were more than 0.005 (Table 5).
Table 6

The Hardy-Weinberg equilibrium (HWE) tests of the candidate SNPs in EPAS1 in TC and the LC.

SNP locusTC4LC5FstP-value3
χ2Pearson's pHo1He2χ2Pearson's pHoHe
SNP1 rs3150402130.3960.5280.445860.425884.1410.0418*0.258990.314160.18057<0.000**
SNP2 rs143300620.0010.8390.031850.0314401000.012520.16031
SNP3 rs3161267860.9550.3280.089170.097030.4610.4970.302160.286730.22956<0.000**
SNP4 rs7392811028.7350.0031**0.191080.250875.1430.023*0.273380.339710.02240.07038
SNP5 rs74038973215.360.00090**0.165610.241754.1410.041*0.258990.314160.028050.02933
SNP6 rs7390101660.7070.4000.095540.102755.780.016*0.266190.335580.27826<0.000**

*Represent significant P-value of less than 0.05.

** represent significant P-value of less than 0.001.

1.2 Ho and He represent the observed heterozygosity and expected heterozygosity.

3 The P-value is calculated by Fst value.

4 TC abbreviations for Tibetan chickens.

5 LC abbreviations for Lowland chickens.

*Represent significant P-value of less than 0.05. ** represent significant P-value of less than 0.001. 1.2 Ho and He represent the observed heterozygosity and expected heterozygosity. 3 The P-value is calculated by Fst value. 4 TC abbreviations for Tibetan chickens. 5 LC abbreviations for Lowland chickens. To fully understand the population differentiation that occurred between TC and LC, we analyzed the distribution of alleles at the EPAS1 for the high altitude (≥3300m) and low altitude groups (≤600m). The Fixation index (F) was calculated to evaluate the population differentiation degree. As shown in Table 6, the F values for SNP1 (0.181, P < 0.001), SNP3 (0.229, P < 0.001), and SNP6 (0.278, P < 0.001) for two groups reflected remarkable population differentiation.

Protein secondary structure prediction and non-synonymous sites analysis

SNP3 (Y333C) was located in a well-defined protein domain (PAS domain, Fig 4). Using the crystal structure model of PAS domain for human EPAS1, we found that mutation SNP3 in chicken (Y333C) occurred in a beta sheet which may affect the thermodynamic stability of the domain. To evaluate the its functional, we aligned the mutant EPAS1 protein with its ortholog proteins of diverse vertebrates (Fig 4) and found that the amino acid mutation (Y333C) were conserved among species. By contrast, M765V was not found in other species. Because mutant functional effects showed that Y333C was deleterious, while M765V were tolerated implied that Y333C was the most likely casual mutation for the EPAS1 in LC but not TC.
Fig 4

EPAS1 mutation in coding regions.

Structural and evolutionary analysis of the 4 amino acid variants. The protein coordinate is based on Ensembl ID ENSGALP00000016234. The upper layer displayed the Pfam domains of the protein. The orthologous proteins from 17 different vertebrates were aligned with the residues shown as green. Cow, ENSBTAP00000004836; pig, ENSSSCP00000009011; human, ENSP00000406137; dog, ENSCAFP00000003819; elephant, ENSLAFP00000010336; mouse, ENSMUSP00000024954; opossum, ENSMODP00000001136; lizard, ENSACAP00000004025; xenopus, ENSXETP00000031612; zebrafish, ENSDARP00000074832; wallaby, ENSMEUT00000004049; horse, ENSECAT00000015683; cat, ENSFCAT00000013633; rat, ENSRNOT00000034991; panda, ENSAMET00000006660; chimpanzee, ENSPTRT00000022124. The ML tree is shown in the left area. (PAS) Per-Arnt-Sim; (HIF) hypoxia-inducible factor; (ODDD) O2-dependent degradation domain; (TAD) C-terminal transactivation domain.

EPAS1 mutation in coding regions.

Structural and evolutionary analysis of the 4 amino acid variants. The protein coordinate is based on Ensembl ID ENSGALP00000016234. The upper layer displayed the Pfam domains of the protein. The orthologous proteins from 17 different vertebrates were aligned with the residues shown as green. Cow, ENSBTAP00000004836; pig, ENSSSCP00000009011; human, ENSP00000406137; dog, ENSCAFP00000003819; elephant, ENSLAFP00000010336; mouse, ENSMUSP00000024954; opossum, ENSMODP00000001136; lizard, ENSACAP00000004025; xenopus, ENSXETP00000031612; zebrafish, ENSDARP00000074832; wallaby, ENSMEUT00000004049; horse, ENSECAT00000015683; cat, ENSFCAT00000013633; rat, ENSRNOT00000034991; panda, ENSAMET00000006660; chimpanzee, ENSPTRT00000022124. The ML tree is shown in the left area. (PAS) Per-Arnt-Sim; (HIF) hypoxia-inducible factor; (ODDD) O2-dependent degradation domain; (TAD) C-terminal transactivation domain.

Discussion

Long-term exposure to hypoxia reduced metabolic activity, retarded development, and increased embryonic mortality in animal fetes [37] and chronic hypoxia decreased growth of developing chick embryos [38, 39]. The Tibetan chicken, a Chinese native chicken breed, adapts itself well to the low pressure and high altitude plateau environment [7, 40]. Here, we investigated specific SNPs in EPAS1 gene and discuss their possible contributions to the high-altitude adaptation of the Tibetan chicken. Amino acid replacement at key sites can difference protein function [41]. EPAS1 protein has an important role in regulating transcriptional responses to hypoxia in physiological and pathologic conditions including up-regulating the expression of erythropoietin (EPO) [42], Thus, it is not surprising that Mutations at EPAS1 are associated with hematologic phenotypes [43]. Here, we found 4 synonymous mutations and 2 non-synonymous substitutions which resulted in amino acid mutations of Tyr to Cys at SNP3 and Met to Val at SNP5, respectively. Except for SNP5, the other 5 SNPs were significantly different between TC and LC for genotypic frequencies. Both genotypic and allelic results for SNP1, SNP3, and SNP6 revealed significant differences between two chicken groups. Except for SNP4 and SNP5, all the alleles in the TC were in accordance with Hardy-Weinberg equilibrium. While for LC there was a significant departure from Hardy-Weinberg equilibrium (P < 0.05) in SNP1, SNP4, SNP5, and SNP6. These results implied an association of SNP1 and SNP6 might be correlate with high-altitude adaptation. Also there is linkage between SNP3 and SNP6 in TC (Figure A in S1 File). The Fst values for SNP1, SNP3, and SNP6 of EPAS1 between the high altitude group (HIC≥3300m) and low altitude group (LOC ≤ 600m) reflected remarkable population differentiation. Overall, both the Fst value and the LD block demonstrate a remarkable population differentiation between the TC and LC. We deduce reasons why the previous study[24] did not find SNP3 are that compared to other SNPs, SNP3 may have low frequency in the TC population and sample size, only 40 Tibetan chickens from 3 different areas were involved. A correlation coefficient is a number that quantifies some type of association between random variables and observed data [44]. Pearson correlation between altitude and allele frequency showed that in TC there was a significant relationship in synonymous mutation SNP6 and non-synonymous mutation SNP3. The frequencies of SNP3 (Y333C) and SNP6 were high in LC but decreased along with the elevation of altitude. The same relationship also found for the EPAS1 gene in cattle [45]. There was an association of double variants in the oxygen degradation domain (ODDD) of EPAS1 in Angus cattle with High-altitude pulmonary hypertension (HAPH). These mutations were prevalent in lowland cattle, but they disappeared for the animals located at high altitudes[45]. Like Angus cattle, the mutation we found was located in a well-defined domain and quite conserved. Human EPAS1 is a basic helix-loop-helix transcription factor that contains a Per-AhR-Arnt-Sim (PAS) domain and shares 48% homology [46, 47]. The transcriptional response to hypoxia appears in several physiological and pathologic conditions, including up-regulation in the expression of erythropoietin (EPO) [48]. Mutations at EPAS1 in humans are associated with hematologic phenotypes [43]. HIF proteins are constantly degraded with sufficient oxygen availability, but one released under hypoxia condition. HIFs are protected by inhibition of oxygen-dependent hydroxylation of specific residues in the oxygen-dependent degradation domain under hypoxic condition. This prevents interactions with the von Hippel-Lindau ubiquitin ligase complex and proteasome destruction. HIF2a is found in all human tissues and its effects on downstream cascades include regulation of angioenic factors VEGF and TGFa, and cell permeability and stimulation of erythropoietic and glycolytic protein [46, 49, 50]. Clinical studies found that several gain-of-function mutations that occurred in the primary hydroxylation site (Pro531) at EPAS1 caused erythroctosis and pulmonary hypertension [51].While, the unique EPAS1 mutation in the Tibetan people were associated with lower hemoglobin concentrations [52, 53], suggesting that it played a loss-of-function role to high blood viscosity and cardiovascular disorders. As erythrocytosis is a common symptom of chronic mountain sickness which can lead to high blood viscosity and cardiovascular disorders, a decrease of hemoglobin level may provide a protective mechanism for Tibetan people [43]. Nonetheless, it is worth noting that all EPAS1 variations detected preciously in the Tibetan population are in introns [52, 53]. Transgenic mice carrying a G536W variant in EPAS1 developed high right ventricular systolic pressure and medial hypertrophy of pulmonary arteries in addition to erythrocytosis under normoxic condition [54]. The key amino acid mutation G305S was identified at EAPS1 PAS domain in dogs was predicted to be damaging, which is also likely to cause the loss of function of EPAS1 and it is also associated with blood flow resistance [16]. Recently, the missense mutation Q597L was identified in EPAS1 in the Tibetan Cashmere goat, which occurred next to the hypoxia-inducible factor-1 (HIF-1) domain, and was exclusively enriched in the high-altitude population [55]. Thus, HIF2a dysregulation is well recognized in association with pulmonary hypertension, frequency in context of erythrocytosis, with or without severe hypoxemia. Here, we found a non-synonymous mutation SNP3 (Y333C) in PAS domain which is similar to the mutation in dogs and Angus cattle. In order to adapt to the same selection pressure, mutations that produce an adaptive change in one species may preclude possibilities in other species because of differences in genetic background and experiments involving resurrected ancestral proteins have revealed continuous effects of historical substitutions [56]. Although the function of mutation (SNP3 Y333C) in chickens is unknown, based on the previously studies, we inferred that SNP3 (Y333C) might also be harmful to chickens at high altitudes. The lower oxygen and extreme environment selectively swept out mutation SNP3 (Y333C). Taking together, these results indicate that EAPS1 gene may play a key role in the adaptation of the Tibetan chicken to a high-altitude environment at the molecular level. The adaptation patterns however, may differ in human, dog, grey wolf, and Tibetan Cashmere goat. In conclusion, genetic analysis in the present study for EPAS1 gene in TC and LC populations indicated that the non-synonymous SNP3 (Y333C) may have critical role in the high-altitude adaptation of TC. Although the adaptation patterns may not be the same in humans and dogs, these novel variations of EPAS1 could put new insights into the gene function.

The LD Block, production performance, allele frequency and nucleotide diversities.

Figure A. The LD block of Tibetan chicken. TC means Tibetan chicken. Table A. The partial production performance of Tibetan and 7 lowland chicken breeds. The production performance data of 8 breeds of chicken is based on poultry genetic resources in China. Table B. Allele and genotype frequencies of the SNP1 rs315040213 and SNP2 rs14330062 in the EPAS1. “C” represents the reference allele and “T” represents the mutant allele. Numbers represent allele/genotype frequency, with the figures in brackets representing the number of individuals with each genotype. Table C. Allele and genotype frequencies of the SNP3 rs316126786 and SNP4 rs739281102 in the EPAS1. “A” represents the reference allele and “G” represents the mutant allele. Numbers represent allele/genotype frequency, with the figures in brackets representing the number of individuals with each genotype. Table D. Allele and genotype frequencies of the SNP5 rs740389732 and SNP6 rs739010166 in the EPAS1. “A” represents the reference allele and “G” represents the mutant allele. Numbers represent allele/genotype frequency, with the figures in brackets representing the number of individuals with each genotype. “C” represents the reference allele and “T” represents the mutant allele. Numbers represent allele/genotype frequency, with the figures in brackets representing the number of individuals with each genotype. Table E. Nucleotide diversities of EPAS1 gene in Tibetan chicken and Lowland chicken. 1Pi is the abbreviation of nucleotide diversity. 2K is the abbreviation of average number of nucleotide differences.3TC means Tibetan chicken.4LC means Lowland chicken. (DOCX) Click here for additional data file.
  46 in total

1.  Natural selection on EPAS1 (HIF2alpha) associated with low hemoglobin concentration in Tibetan highlanders.

Authors:  Cynthia M Beall; Gianpiero L Cavalleri; Libin Deng; Robert C Elston; Yang Gao; Jo Knight; Chaohua Li; Jiang Chuan Li; Yu Liang; Mark McCormack; Hugh E Montgomery; Hao Pan; Peter A Robbins; Kevin V Shianna; Siu Cheung Tam; Ngodrop Tsering; Krishna R Veeramah; Wei Wang; Puchung Wangdui; Michael E Weale; Yaomin Xu; Zhe Xu; Ling Yang; M Justin Zaman; Changqing Zeng; Li Zhang; Xianglong Zhang; Pingcuo Zhaxi; Yong Tang Zheng
Journal:  Proc Natl Acad Sci U S A       Date:  2010-06-07       Impact factor: 11.205

Review 2.  Hypoxia-inducible factors in physiology and medicine.

Authors:  Gregg L Semenza
Journal:  Cell       Date:  2012-02-03       Impact factor: 41.582

3.  Sequencing of 50 human exomes reveals adaptation to high altitude.

Authors:  Xin Yi; Yu Liang; Emilia Huerta-Sanchez; Xin Jin; Zha Xi Ping Cuo; John E Pool; Xun Xu; Hui Jiang; Nicolas Vinckenbosch; Thorfinn Sand Korneliussen; Hancheng Zheng; Tao Liu; Weiming He; Kui Li; Ruibang Luo; Xifang Nie; Honglong Wu; Meiru Zhao; Hongzhi Cao; Jing Zou; Ying Shan; Shuzheng Li; Qi Yang; Peixiang Ni; Geng Tian; Junming Xu; Xiao Liu; Tao Jiang; Renhua Wu; Guangyu Zhou; Meifang Tang; Junjie Qin; Tong Wang; Shuijian Feng; Guohong Li; Jiangbai Luosang; Wei Wang; Fang Chen; Yading Wang; Xiaoguang Zheng; Zhuo Li; Zhuoma Bianba; Ge Yang; Xinping Wang; Shuhui Tang; Guoyi Gao; Yong Chen; Zhen Luo; Lamu Gusang; Zheng Cao; Qinghui Zhang; Weihan Ouyang; Xiaoli Ren; Huiqing Liang; Huisong Zheng; Yebo Huang; Jingxiang Li; Lars Bolund; Karsten Kristiansen; Yingrui Li; Yong Zhang; Xiuqing Zhang; Ruiqiang Li; Songgang Li; Huanming Yang; Rasmus Nielsen; Jun Wang; Jian Wang
Journal:  Science       Date:  2010-07-02       Impact factor: 47.728

Review 4.  Hypoxia: adapting to high altitude by mutating EPAS-1, the gene encoding HIF-2α.

Authors:  Martha C Tissot van Patot; Max Gassmann
Journal:  High Alt Med Biol       Date:  2011       Impact factor: 1.981

5.  Jalview Version 2--a multiple sequence alignment editor and analysis workbench.

Authors:  Andrew M Waterhouse; James B Procter; David M A Martin; Michèle Clamp; Geoffrey J Barton
Journal:  Bioinformatics       Date:  2009-01-16       Impact factor: 6.937

6.  Effects of long-term, high-altitude hypoxemia on ovine fetal cardiac output and blood flow distribution.

Authors:  M Kamitomo; J G Alonso; T Okai; L D Longo; R D Gilbert
Journal:  Am J Obstet Gynecol       Date:  1993-09       Impact factor: 8.661

7.  Ensembl 2016.

Authors:  Andrew Yates; Wasiu Akanni; M Ridwan Amode; Daniel Barrell; Konstantinos Billis; Denise Carvalho-Silva; Carla Cummins; Peter Clapham; Stephen Fitzgerald; Laurent Gil; Carlos García Girón; Leo Gordon; Thibaut Hourlier; Sarah E Hunt; Sophie H Janacek; Nathan Johnson; Thomas Juettemann; Stephen Keenan; Ilias Lavidas; Fergal J Martin; Thomas Maurel; William McLaren; Daniel N Murphy; Rishi Nag; Michael Nuhn; Anne Parker; Mateus Patricio; Miguel Pignatelli; Matthew Rahtz; Harpreet Singh Riat; Daniel Sheppard; Kieron Taylor; Anja Thormann; Alessandro Vullo; Steven P Wilder; Amonida Zadissa; Ewan Birney; Jennifer Harrow; Matthieu Muffato; Emily Perry; Magali Ruffier; Giulietta Spudich; Stephen J Trevanion; Fiona Cunningham; Bronwen L Aken; Daniel R Zerbino; Paul Flicek
Journal:  Nucleic Acids Res       Date:  2015-12-19       Impact factor: 16.971

Review 8.  Biology of hypoxia-inducible factor-2alpha in development and disease.

Authors:  S A Patel; M C Simon
Journal:  Cell Death Differ       Date:  2008-02-15       Impact factor: 15.828

9.  Whole-genome sequencing of six dog breeds from continuous altitudes reveals adaptation to high-altitude hypoxia.

Authors:  Xiao Gou; Zhen Wang; Ning Li; Feng Qiu; Ze Xu; Dawei Yan; Shuli Yang; Jia Jia; Xiaoyan Kong; Zehui Wei; Shaoxiong Lu; Linsheng Lian; Changxin Wu; Xueyan Wang; Guozhi Li; Teng Ma; Qiang Jiang; Xue Zhao; Jiaqiang Yang; Baohong Liu; Dongkai Wei; Hong Li; Jianfa Yang; Yulin Yan; Guiying Zhao; Xinxing Dong; Mingli Li; Weidong Deng; Jing Leng; Chaochun Wei; Chuan Wang; Huaming Mao; Hao Zhang; Guohui Ding; Yixue Li
Journal:  Genome Res       Date:  2014-04-10       Impact factor: 9.043

10.  Genetic convergence in the adaptation of dogs and humans to the high-altitude environment of the tibetan plateau.

Authors:  Guo-Dong Wang; Ruo-Xi Fan; Weiwei Zhai; Fei Liu; Lu Wang; Li Zhong; Hong Wu; He-Chuan Yang; Shi-Fang Wu; Chun-Ling Zhu; Yan Li; Yun Gao; Ri-Li Ge; Chung-I Wu; Ya-Ping Zhang
Journal:  Genome Biol Evol       Date:  2014-08       Impact factor: 3.416

View more
  1 in total

1.  Tibial dyschondroplasia is highly associated with suppression of tibial angiogenesis through regulating the HIF-1α/VEGF/VEGFR signaling pathway in chickens.

Authors:  Shu-Cheng Huang; Mujeeb Ur Rehman; Yan-Fang Lan; Gang Qiu; Hui Zhang; Muhammad Kashif Iqbal; Hou-Qiang Luo; Khalid Mehmood; Li-Hong Zhang; Jia-Kui Li
Journal:  Sci Rep       Date:  2017-08-22       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.