Literature DB >> 31239472

Genomic analysis reveals variant association with high altitude adaptation in native chickens.

Hamed Kharrati-Koopaee1, Esmaeil Ebrahimie2,3,4,5, Mohammad Dadpasand6, Ali Niazi1, Ali Esmailizadeh7,8.   

Abstract

Native chickens are endangered genetic resources that are kept by farmers for different purposes. Native chickens distributed in a wide range of altitudes, have developed adaptive mechanisms to deal with hypoxia. For the first time, we report variants associated with high-altitude adaptation in Iranian native chickens by whole genome sequencing of lowland and highland chickens. We found that these adaptive variants are involved in DNA repair, organs development, immune response and histone binding. Amazingly, signature selection analysis demonstrated that differential variants are adaptive in response to hypoxia and are not due to other evolutionary pressures. Cellular component analysis of variants showed that mitochondrion is the most important organelle for hypoxia adaptation. A total of 50 variants was detected in mtDNA for highland and lowland chickens. High-altitude associated with variant discovery highlighted the importance of COX3, a gene involved in cell respiration, in hypoxia adaptation. The results of study suggest that MIR6644-2 is involved in hypoxia and high-altitude adaptations by regulation of embryo development. Finally, 3877 novel SNVs including the mtDNA ones, were submitted to EBI (PRJEB24944). Whole-genome sequencing and variant discovery of native chickens provided novel insights about adaptation mechanisms and highlights the importance of valuable genomic variants in chickens.

Entities:  

Mesh:

Substances:

Year:  2019        PMID: 31239472      PMCID: PMC6592930          DOI: 10.1038/s41598-019-45661-7

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


Introduction

Chickens (Gallus gallus domesticus) are, domestically, considered as widespread birds. Chickens have been domesticated in India and South-East of Asia[1]. They are used for different purposes including decoration, religions, cock fighting, and food production[2]. Village chickens are also considered as important sources of food in Iran. Therefore, farmers keep them and consequently, their morphology, behavior, physiology and adaptation to local environment conditions have been remarkably changed[3]. Iran is a large country that contains a special range of altitudes and climates. Therefore, there is a remarkable genetic diversity of indigenous chickens in Iran[4]. However, there is no previous report on Iranian native chickens at the genomic level. The next generation sequencing (NGS) has become a tool to describe genomic features[5]. Investigating native chickens at the genomic level can clarify the genomic basis of their differences and the specific traits of indigenous chickens can also be precisely explored[6]. Usually, hypoxia occurs in high-attitude and pathological conditions. Hypoxia refers to the lack of oxygen[7], and the adaptation to hypoxia is a complex process contains biological pathways and gene networks[8]. Thereby, understanding genetic factors that underlie adaptation to high-altitude conditions could lead to a new source of knowledge in order to understand the adaptation process[9]. There are studies carried out in order to identify the genetic factors for high-altitude adaptation of Tibetan chickens, Yak, Tibetan pigs and Tibetan people[10]. Results of these studies show that hypoxia is involved in cerebral edema, tumorigenesis, myocardial ischemia and diabetes in human being, and some genes including Endothelin 1 (EDN1), Erythropoietin (EPO) and Aldosterone synthase (CYP11B2) are reported as the key genes of hypoxia adaptation[11-14]. Recent evidences indicate that highland chickens are adapted based on factors including the small body size, the ability to outstand foraging, high hatchability, large organs (liver, heart and lungs), and higher hemoglobin concentration. Furthermore, the positive selection of highland populations is related to the cardiovascular and respiratory system development, responses to the radiation, inflammation, DNA repair, and the immune responses[15]. Recently, one of the most significant discussions about high-altitude condition is known as the mitochondrion organelle, because there is a close association between the lack of oxygen and cell respiration. In a previous study, nine single nucleotide polymorphisms (SNP) of MT-COI (mitochondria-cytochrome c oxidase subunit I) gene were detected for the high-altitude adaptation, but none of them was a missense mutation[16]. In the study herein, the whole genome sequencing of highland chickens (Isfahan, Altitude = 2087 m) and lowland chickens (Mazandaran, Altitude = 54 m) was carried out in order to discover genomic variants, associated with high-altitude adaptation.

Results

Quality control

Results of the quality control indicate that short reads were sequenced based on an appropriate form. Trimming was carried out according to the Phred score and the nucleotide contribution. Removing at least ten primary bases, we trimmed 3′ side of short reads in order to minimize mapping errors. In addition, 5% of reads that contained the lowest Phred scores were also removed.

Phenotypic data, phylogenetic and population structure analysis

Results of principle component analysis indicated that six components (combinations of traits) characterized 92 percent of the total variance; therefore, these components were selected in order to be utilized in the discriminate analysis (Table 1). Results of the discriminate analysis showed that collected lowland and highland samples can be classified into two separated groups, phenotypically. The accuracy was estimated to be 75% (Table 2).
Table 1

Principle component analysis of phenotypic traits for discriminate analysis and clustering the highland and lowland populations.

Component123456
Eigenvalue6.452.642.071.140.790.69
Proportion0.430.170.130.070.050.04
Cumulative0.430.600.740.820.870.92

Twenty-four quantitative traits were measured on 16 highland and lowland chickens by our investigation. In order to reduce the measurement error, all chickens were mature while recording and the same recording protocol was also utilized in order to investigate the phenotypic traits of all the birds. Results of PCA show that six components of traits characterized 92 percent of total variance. These six components were selected for the discriminate analysis and classification of the studied population based on phenotypic traits.

Table 2

The classification of highland and lowland chickens based on the discriminate analysis.

Put into groupTrue group
LowlandHighland
Lowland82
Highland24
Total sample106
Correct sample84
Proportion0.800.66
Proportion correct0.75

Results of PCA (Table 1) were used for the discriminate analysis. Based on our findings, collected lowland and highland samples can be classified phenotypically into two separated groups. For instance, eight of ten lowland chickens are classified in the lowland chicken group, correctly. Similarly, two of six highland chickens are classified in the highland group. Also, the total accuracy of classification is estimated to be 75%.

Principle component analysis of phenotypic traits for discriminate analysis and clustering the highland and lowland populations. Twenty-four quantitative traits were measured on 16 highland and lowland chickens by our investigation. In order to reduce the measurement error, all chickens were mature while recording and the same recording protocol was also utilized in order to investigate the phenotypic traits of all the birds. Results of PCA show that six components of traits characterized 92 percent of total variance. These six components were selected for the discriminate analysis and classification of the studied population based on phenotypic traits. The classification of highland and lowland chickens based on the discriminate analysis. Results of PCA (Table 1) were used for the discriminate analysis. Based on our findings, collected lowland and highland samples can be classified phenotypically into two separated groups. For instance, eight of ten lowland chickens are classified in the lowland chicken group, correctly. Similarly, two of six highland chickens are classified in the highland group. Also, the total accuracy of classification is estimated to be 75%. Nei’s genetic distance between highland and lowland populations was estimated to be 3 percent. Thus, results of phylogenic analysis revealed that there was no significant genetic distance between highland and lowland populations. The analysis of genetic population structure was carried out by multiple correspondence analysis (MCA) based on 10000 SNVs genotypes. The first two components of variants were 29.5% and 26.5%, respectively. Results indicated that highland chickens were classified closely; while lowland chickens were classified together. Consequently, highland and lowland chickens could be classified into two groups (Fig. 1).
Figure 1

Classification of the highland and lowland chickens according to the results of multiple corresponding analyses (H: highland and L: lowland). The information of 10000 SNVs genotypes were utilized in order to classify the highland and lowland chickens based on multiple corresponding analysis by R program and Ca package. Results show that highland and lowland chickens can be divided into two groups based on their geographical distribution.

Classification of the highland and lowland chickens according to the results of multiple corresponding analyses (H: highland and L: lowland). The information of 10000 SNVs genotypes were utilized in order to classify the highland and lowland chickens based on multiple corresponding analysis by R program and Ca package. Results show that highland and lowland chickens can be divided into two groups based on their geographical distribution.

Mapping, variants discovery and statistical analysis

The mapping efficiency versus reference genome was estimated between 94–97% for samples (Table 3). Parameters were optimized in order to decrease false variants for the variants discovery, (See Supplementary Table S1 for optimization process). The process of optimization was carried out in three steps, and parameters of the step 3 were selected in order to be utilized in the variants detection.
Table 3

The summary of short reads alignments against reference genome for highland and lowland native chickens.

SamplesTotal readsMapped reads%Reads in Pair%Coverage before trimmingCoverage after trimming
Lowland7246232696.1392.269.008.10
Lowland7042138897.0093.538.707.88
Lowland7285023296.1492.039.107.93
Lowland6242484696.0192.087.706.82
Lowland6480404496.5993.018.107.27
Highland6910525496.5493.078.607.55
Highland7183722096.9193.619.007.96
Highland7489441695.3491.459.308.08
Highland6749517494.3590.758.407.26
Highland7031473495.5892.418.707.73

After trimming, the short reads of highland and lowland chickens were mapped against the standard reference genome of chicken (Gallus gallus, Ensembl-release 84). Mapping efficiency was estimated between 94–97% for samples, and the coverage analysis was carried out based on the lander and waterman equation.

The summary of short reads alignments against reference genome for highland and lowland native chickens. After trimming, the short reads of highland and lowland chickens were mapped against the standard reference genome of chicken (Gallus gallus, Ensembl-release 84). Mapping efficiency was estimated between 94–97% for samples, and the coverage analysis was carried out based on the lander and waterman equation. We found that more than 20 million variations were generated, including single nucleotide variant (SNV), multi nucleotide variant (MNV), insertion, deletion and replacement (See Supplementary Table S2). Results of the statistical analysis indicated that there was a significant (P < 0.001) difference of variant distribution between highland and lowland samples (Table 4). Thus, we can realize that harsh environmental conditions, including the lack of oxygen, impacted on the variant distribution. In addition, the Chi-square test was carried out for each chromosome, separately. Most of chromosomes had significant differences in harboring associated variants with altitude alteration; therefore, those chromosomes that do not have significant differences are given in Table 5, including mtDNA, W, 4, 11, and 32.
Table 4

Fisher exact test for the identification of differences in variants distribution between highland and lowland samples.

VariantsP-value
SNV***0.0012
MNV***0.0001
Insertion***0.0001
Deletion***0.0001
Replacement***0.0002

(P < 0.001).

In order to analyze differences of variants distribution, the input file of variants was constructed based on the IDEG6 website instruction and statistical analysis was performed for each variant between highland and lowland samples, separately. Results indicated that variants distributions between highland and lowland were significant, and a possible explanation might also be defined as: differences in variants distribution can be affected by environmental conditions including high- altitude condition.

Tablee 5

Chi-square test to analyze the differential distributed variations between highland and lowland chickens for chromosomes (non-significant chromosomes are presented).

Variants and chromosomes
SexMaleSNVMNVInsertionDeletionReplacement
5,Mt9, 11, 12, 18, 19, 25, 328, 10, 13, 19 21, 25, Mt9, 10, 16, 18, 22, 256, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 20, 22, 24, 26, 27, Mt
FemaleMt11, 32Mt3, 4, 11, W

Mt: Mitochondria DNA.

In addition to the statistical distribution analysis of each variant (Table 4), we analyzed the differential distributed variations between highland and lowland chickens for each chromosome incorporating the sex of birds in this study. We found that the distribution of variants was significant for the most of the chromosomes; therefore, to summarize, non-significant chromosomes were only presented in Table 5.

Fisher exact test for the identification of differences in variants distribution between highland and lowland samples. (P < 0.001). In order to analyze differences of variants distribution, the input file of variants was constructed based on the IDEG6 website instruction and statistical analysis was performed for each variant between highland and lowland samples, separately. Results indicated that variants distributions between highland and lowland were significant, and a possible explanation might also be defined as: differences in variants distribution can be affected by environmental conditions including high- altitude condition. Chi-square test to analyze the differential distributed variations between highland and lowland chickens for chromosomes (non-significant chromosomes are presented). Mt: Mitochondria DNA. In addition to the statistical distribution analysis of each variant (Table 4), we analyzed the differential distributed variations between highland and lowland chickens for each chromosome incorporating the sex of birds in this study. We found that the distribution of variants was significant for the most of the chromosomes; therefore, to summarize, non-significant chromosomes were only presented in Table 5.

Differential variants of highland and lowland ecotypes

In order to remove the common variants between highland and lowland chickens, total variants of those samples were compared. Differential variants were also kept in order to be utilized in further analyses (Table S3). Finally, the frequency thresholds of 50% and 100% were determined in order to carry out comparisons between the groups of males and females (Table S4). The threshold frequency is considered as the percentage of samples that contain variants. Generally, 97610 and 17024 variants were detected for both males and females as the differential variants of highland and lowland chickens, respectively. For the first time, 16623 and 3024 variants were identified as new variants in males and females. Results of filtering in the male birds, based on the overlap, showed that 614 and 96995 variants were located in coding and non-coding regions. We found that 202 out of 614 coding variants will lead to amino acid changes. It was observed that fewer coding and non-coding variants were detected among female birds (Table 6).
Table 6

Classification of differential variants in different types of categories.

SexTotal variantsNovel variantsCoding variantsNone coding variantsAmino acid changes
Male976101662361496995202
Female1702430241901683451

Numerous differential variants were produced in this investigation (Table S4). Logically, filtration is highly required for the classification of differential variants in order to have a better understanding about their functions by available annotations. For example, known variants annotation was used to identify the novel variants. CDS (coding region) annotation option was selected for the recognition of coding variants in order to find out which variants were located on coding and non-coding regions. Coding variants were collected for functional consequences and amino acid changes analysis. Therefore, reference genome, CDS, and mRNA annotations were used to analyze the amino acid changes. In addition, Standard genetic code was selected in CLC genomics Genomic Workbench (8.5.1) for amino acid change analysis.

Novel variants: variants were reported for the first time.

Coding variants: variants were located in coding regions.

Amino acid changes: variations can change the protein sequence.

Classification of differential variants in different types of categories. Numerous differential variants were produced in this investigation (Table S4). Logically, filtration is highly required for the classification of differential variants in order to have a better understanding about their functions by available annotations. For example, known variants annotation was used to identify the novel variants. CDS (coding region) annotation option was selected for the recognition of coding variants in order to find out which variants were located on coding and non-coding regions. Coding variants were collected for functional consequences and amino acid changes analysis. Therefore, reference genome, CDS, and mRNA annotations were used to analyze the amino acid changes. In addition, Standard genetic code was selected in CLC genomics Genomic Workbench (8.5.1) for amino acid change analysis. Novel variants: variants were reported for the first time. Coding variants: variants were located in coding regions. Amino acid changes: variations can change the protein sequence. Totally, more than 30 GO terms were reported for differential variants in females. Considering the molecular functions, the most frequency of GO terms belonged to 5′-3′ exodeoxyribonuclease activity and histone binding, respectively. Furthermore, candidate genes of GO terms were suggested including DCLRE1C and ATAD2. It was also found that considering cellular components, the mitochondrion was identified as the most important organelle for hypoxia condition differing between highland and lowland ecotypes. Results of biological process analysis showed that most of the GO terms were related to the DNA repair processes including double-strand break repair and inter strand cross-link repair. Results of gene ontology enrichment among males indicated that several candidate genes such as TATDN1, POLQ and DCLRE1A were associated with DNA repair. It was observed in the biological process analysis that there were several Go terms, including pericardium morphogenesis and thalamus development. There were also several biological pathways suggested for DNA repair and biosynthetic (e.g., DNA biosynthetic process; double-strand break repair; DNA repair; and inter strand cross-link repair). The NK T cell differentiation was reported as a biological pathway of immune response in the hypoxia condition. To see more results of gene ontology enrichments analyses in males and females, refer to the Tables 7 and 8.
Table 7

Gene ontology enrichments analysis for differential variants – Male birds.

GOGO termDescription and frequencyOverlapping genesChrType
Molecular function0004536Deoxyribonuclease activity (5%)TATDN1 (Asp190Glu)2SNV
0044822Poly (A) RNA binding (9%)RARS2 (Gln274Lys)3SNV
0003785Actin monomer binding (5%) COBLL1 7SNV
0003887DNA-directed DNA polymerase activity (14%)POLQ (Thr514Ala)1SNV
00515755′-deoxyribose-5phosphate lyase activity (10%)
0015410Manganese-transporting ATPase activity (9%)ATP2C1 (Asp383Glu)2SNV
0034185Apolipoprotein binding (5%)LRP6 (Thr522Ser)1SNV
0005041Low-density lipoprotein receptor activity (5%)
00353125′-3′ exodeoxyribonuclease activity (14%) DCLRE1A 6SNV
0030553cGMP binding (10%) CNGA3 1SNV
0004582Dolichyl - phosphate beta-D-mannosyltransferase activity (14%) ALG5 1SNV
Cellular component0000799Nuclear condensin complex (6%) NCAPD3 24SNV
0005901Caveola (9%)LRP6 (Thr522Ser)1SNV
0000139Golgi membrane (11%) GOSR1 19SNV
0005801Cis-Golgi network (6%)
0005887Integral to plasma membrane (29%) SLC38A6 5MNV
0070419Nonhomologous end joining complex (9%) PRKDC 2SNV
0005654Nucleoplasm (12%)
0000784Nuclear chromosome, Telomeric region (6%) DCLRE1A 6SNV
003068690S preribosome (6%) UTP20 1SNV
0070382Exocytic vesicle (6%) SYTL2 1SNV
Biological process0006303Double-strand break repair via nonhomologous end joining (4%) DCLRE1A 6SNV
0006874Cellular calcium ion homeostasis (12%)ATP13A4 (lu589Asp)
0003344Pericardium morphogenesis (6%)LRP6 (Thr522Ser)1SNV
0021794Thalamus development (5%)
0030901Midbrain development (6%)
0021987Cerebral cortex development (7%)
0060059Embryonic retina morphogenesis in camera-type eye (4%)
0060325Face morphogenesis (1%)
0071897DNA biosynthetic process (1%) POLQ 1SNV
0006302Double-strand break repair (18%)
2000042Negative regulation of double-strand break repair via homologous recombination (4%)
0001865NK T cell differentiation (2%)BMX (Arg122Lys)1SNV
0006281DNA repair (25%) PRKDC 2SNV
0036297Inter strand cross-link repair (4%) DCLRE1A 6SNV
Table 8

Gene ontology enrichments analysis for differential variants – Female birds.

GOGo termDescription and frequencyOverlapping genesChrType
Molecular function0005198Structural molecule activity (7%)PSMD13 (Ile118Thr)5SNV
00353125′-3′ exodeoxyribonuclease activity (14%) DCLRE1C 1Deletion
0005049Nuclear export signal receptor activity (3%) RANBP17 13SNV
0008536Ran GTPase binding (7%)
0042393Histone binding (13%) ATAD2 2SNV
0008453Alanine-glyoxylate transaminase activity (14%)AGXT2 (Ala436Gly)ZSNV
0030170Pyridoxal phosphate binding (12%)
0004222Metalloendopeptidase activity (9%)UQCRC1 (Arg211Gln)12SNV
0050839Cell adhesion molecule binding (8%)CD200 (His135Asn)1SNV
0004872Receptor activity (13%)
Cellular component0070545PeBoW complex (3%) PES1 15Deletion
0005750Mitochondrial respiratory chain complex III (9%)UQCRC1 (Arg211Gln)12SNV
0022624Proteasome accessory complex (9%)PSMD13 (Ile118Thr)5SNV
0008541Proteasome regulatory particle (6%)
0070419Nonhomologous end joining complex (12%) DCLRE1C 1Deletion
0000784Nuclear chromosome, telomeric region (13%)
0005769Early endosome (2%) ZFYVE9 8SNV
0031410Cytoplasmic vesicle (17%)
0005840Ribosome (12%)HSPA14 (Leu397Va)1SNV
0005913Cell-cell adherens junction (17%)CD200 (His135Asn)1SNV
Biological process0006835Dicarboxylic acid transport (3%) SLC13A2 19SNV
0000463Maturation of LSU-rRNA from tricistronic rRNA transcript (SSU-rRNA, 5.8S rRNA, LSU-rRNA) (7%) PES1 15Deletion
0006611Protein export from nucleus (8%) RANBP17 13SNV
0006303Double-strand break repair via non-homologous end joining (9%) DCLRE1C 1Deletion
0036297Inter strand cross-link repair (3%)
0031848Protection from non-homologous end joining at telomere (4%)
0090305Nucleic acid phosphodiester bond hydrolysis (15%)
0007156Homophilic cell adhesion (8%) CD200 His135Asn 1SNV
0007157Heterophilic cell-cell adhesion (8%)
0008037Cell recognition (5%)
0006511Ubiquitin-dependent protein catabolic process (15%) PSMD13 Ile118Thr 5SNV
0043248Proteasome assembly (3%)
0009060Aerobic respiration (9%)UQCRC1 (Arg211Gln)12SNV
0006122Mitochondrial electron transport, ubiquinol to cytochrome c (3%)

Results of the GO analysis are given in Tables 7 and 8. The output of amino acid change analysis for differential variants was used for gene ontology enrichment analyses in three parts, including biological process, molecular function and cellular component. The gene ontology (GO) association file, which includes gene names and associated gene ontology terms, were downloaded from the gene ontology consortium (http://geneontology.org/) and imported to CLC Genomic Workbench (8.5.1). The significance level of GO analysis was also determined as 0.01.

Gene ontology enrichments analysis for differential variants – Male birds. Gene ontology enrichments analysis for differential variants – Female birds. Results of the GO analysis are given in Tables 7 and 8. The output of amino acid change analysis for differential variants was used for gene ontology enrichment analyses in three parts, including biological process, molecular function and cellular component. The gene ontology (GO) association file, which includes gene names and associated gene ontology terms, were downloaded from the gene ontology consortium (http://geneontology.org/) and imported to CLC Genomic Workbench (8.5.1). The significance level of GO analysis was also determined as 0.01.

Identification of potential selective sweeps between the highland and lowland Chickens

The process of signature selection analysis (SSA) is carried out in this study in order to identify those genomic regions that were under the selection pressure. The main goal of SSA was to demonstrate that differential variants are actually adaptive in response to hypoxia and are not due to other evolutionary pressures or mechanisms of genetic changes. According to the extremely high FST (top 5%), it was found that 95 biological categories and 858 candidate genes were reported from different databases including gene ontology consortium (GO), human phenotype ontology (HP), Kyoto encyclopedia of genes and genomes (KEEG), miRbase (mi), transcription factor database (TF) and the genomic landscape of the population differentiation (FST) among highland and lowland chickens is shown in Fig. 2.
Figure 2

Genomic landscape of the population differentiation in highland and lowland chickens. In current study signature selection analysis was carried out for confirming the results of differential variant analysis. Therefore, for each population of highland and lowland chicken differentiation (FST) was calculated for each SNV as described by Weir and Cockerham (1984) and sex chromosomes were removed. The average FST values of SNVs in each window were calculated and genomic landscape of the population differentiation between highland and lowland chicken was drawn by qqman package and R program (version 3.2.2). Based on extremely high Fst (top 5%) 95 biological categories and 858 candidate genes are reported to identify genomic regions which are under selection pressure.

Genomic landscape of the population differentiation in highland and lowland chickens. In current study signature selection analysis was carried out for confirming the results of differential variant analysis. Therefore, for each population of highland and lowland chicken differentiation (FST) was calculated for each SNV as described by Weir and Cockerham (1984) and sex chromosomes were removed. The average FST values of SNVs in each window were calculated and genomic landscape of the population differentiation between highland and lowland chicken was drawn by qqman package and R program (version 3.2.2). Based on extremely high Fst (top 5%) 95 biological categories and 858 candidate genes are reported to identify genomic regions which are under selection pressure. According to the results of SSA, it was found that several candidate genes were associated with the hypoxia condition. Several candidate genes including MRE11A, DDIAS, PRIM2, DNASE, and MMS22L were suggested as examples of the DNA response to UV radiation and DNA repair. More importantly, it should be noted that there were several biological categories enriched by SSA including immune response (TRAT1, BMX, CXCR4), protein hemostasis (PSMD13), and mitochondrion function (COX7A2, MTO1, ME3, MICU3, HCCS). Table 9 shows a summary of biological categories and candidate genes. Additionally, the results of SSA are completely provided in the Supplementary Dataset 11.
Table 9

The summary results of signature selection analysis (biological categories and candidate genes).

TypeIDCategoryGene enrichments and functionsP-value
BPGO:0044763Single-organism cellular process

MRE11A (double-strand break repair protein)

MMS22L (DNA repair protein)

TIMM10 (translocase of inner mitochondrial membrane 10 homolog)

0.03
CCGO:0071944Cell periphery

TRAT1 (T cell receptor associated transmembrane adaptor 1)

CXCR4(C-X-C chemokine receptor type 4)

0.4E-02
GO:0005886Plasma membraneBMX (NK T cell differentiation)
TFTF:M00809_0Factor: FOX;motif: KWTTGTTTRTTTW

PRIM2 (DNA primase)

MRPS11(28S ribosomal protein S11, mitochondrial)

MRPL46 (39S ribosomal protein L46, mitochondrial)

MELANOMA (Gallus gallus mitochondrial ribosomal protein L28 (MRPL28), nuclear gene encoding mitochondrial protein)

0.2E-03
TF:M00747_1Factor: IRF-1; motif: TTCACTTDNASE2 (deoxyribonuclease-2-beta precursor)0.2E-02
TF:M03559_0Factor: Pit-1; motif: NNWWATTCAT

DDIAS (Induced by UV radiation. DNA damage-induced apoptosis suppressor

DLD (dihydrolipoyl dehydrogenase, mitochondrial)

HCCS (cytochrome c-type heme lyase)

PSMG3 (proteasome assembly chaperone 3)

0.8E-06
TF:M00432_0Factor: TTF1; motif: ASTCAAGTRKBDNF (Brain-derived neurotrophic factor)0.3E-02
TF:M00206_0Factor: HNF-1; motif: DGTTAATKAWTNACCAM

COX7A2 (cytochrome c oxidase subunit 7A2)

MTO1 (protein MTO1 homolog, mitochondrial)

SERPINB10 (Heterochromatin-associated protein MENT. Chromatin DNA binding)

0.6E-02
TF:M00410_0Factor: SOX9; motif: NNNNAACAATRGNNME3(malic enzyme 3, NADP(+)-dependent, mitochondrial)0.6E-02
TF:M00042_0Factor: SOX5; motif: NNAACAATNNDARS2 (aspartyl-tRNA synthetase 2, mitochondrial)0.02
TF:M01292_1Factor: HOXA13; motif: ATAAMAPRELID1 (PRELI domain-containing protein 1, mitochondrial)0.1E-06
miMI:bta-miR-22-5pMI:bta-miR-22-5pMICU3 (mitochondrial calcium uptake family member 3)0.04

In this study, signature selection analysis was carried out for confirming the results of differential variants analysis. Therefore, we can demonstrate that the differential variants are actually adaptive in response to hypoxia and are not due to other evolutionary pressures or mechanisms of genetic change. Based on our results, 95 biological categories and 858 candidate genes were reported by signature selection analysis. We found that the results of signature selection analysis are similar to differential variant analysis. For example, DNA repair (MRE11A, MMS22L) and immune response (BMX, TRAT1, CXCR4). The summary results of signature analysis are presented in Table 9. In this way, all results are prepared completely in supplementary dataset 11.

The summary results of signature selection analysis (biological categories and candidate genes). MRE11A (double-strand break repair protein) MMS22L (DNA repair protein) TIMM10 (translocase of inner mitochondrial membrane 10 homolog) TRAT1 (T cell receptor associated transmembrane adaptor 1) CXCR4(C-X-C chemokine receptor type 4) PRIM2 (DNA primase) MRPS11(28S ribosomal protein S11, mitochondrial) MRPL46 (39S ribosomal protein L46, mitochondrial) MELANOMA (Gallus gallus mitochondrial ribosomal protein L28 (MRPL28), nuclear gene encoding mitochondrial protein) DDIAS (Induced by UV radiation. DNA damage-induced apoptosis suppressor DLD (dihydrolipoyl dehydrogenase, mitochondrial) HCCS (cytochrome c-type heme lyase) PSMG3 (proteasome assembly chaperone 3) COX7A2 (cytochrome c oxidase subunit 7A2) MTO1 (protein MTO1 homolog, mitochondrial) SERPINB10 (Heterochromatin-associated protein MENT. Chromatin DNA binding) In this study, signature selection analysis was carried out for confirming the results of differential variants analysis. Therefore, we can demonstrate that the differential variants are actually adaptive in response to hypoxia and are not due to other evolutionary pressures or mechanisms of genetic change. Based on our results, 95 biological categories and 858 candidate genes were reported by signature selection analysis. We found that the results of signature selection analysis are similar to differential variant analysis. For example, DNA repair (MRE11A, MMS22L) and immune response (BMX, TRAT1, CXCR4). The summary results of signature analysis are presented in Table 9. In this way, all results are prepared completely in supplementary dataset 11.

Common variants between highland and lowland chickens

Generally, in both males and females, 53780 variants were shared between highland and lowland chickens; 17835 new variants were also reported for the first time. It was found that 516 out of 53780 variants could lead to changes in amino acid sequences (Supplementary Table S5). Results of gene ontology enrichment analysis of detected variants, in females and males, indicated that GO terms were related to the cell survival. For example, cell proliferation, cell growth (ROS1), digestion (PRSS3), chromosome organization (BRCA2), RNA processing (TDRD9), telomere capping (POT1), and cytokinesis (CEP55). In order to see more details of common variants, refer to the Supplementary text and Supplementary Tables S6–S7.

mtDNA variants

In general, for both highland and lowland chickens, 50 variants were detected in mtDNA (10 samples), and most of them were considered as the novel variants (Table 10). One significant gene ontology (GO term: 1902600) was observed in mtDNA variants. It was reported that, for highland chickens, hydrogen ion transmembrane transport was a biological pathway in males. It is involved in the directed movement of hydrogen ion (proton) across a membrane. Noticeably, COX3 gene (cytochrome c oxidase III) was considered in this biological pathway.
Table 10

Classification of mtDNA variation in highland and lowland chickens

PopulationSNVInsertionReplacementTotalNew variantsCoding variantsAmino acid changes
Highland-male21112323145
Highland-female9009841
Lowland-male8008862
Lowland-female1000101072

In this study, mtDNA variants were analyzed, separately. After variants calling, all mtDNA variants including SNV, insertion and replacement were obtained from highland and lowland chickens, separately. Filtrations were also carried out same as what described for differential variants. We recognized that highland chickens had more variants than lowland chickens (32 versus 18 variants). It might be explained that more variants in highland chickens were created in order to adapt to the high-altitude conditions.

Novel variants: variants were reported for the first time.

Coding variants: variants were located on coding regions.

Amino acid changes: The protein sequence can be changed by the variations.

Classification of mtDNA variation in highland and lowland chickens In this study, mtDNA variants were analyzed, separately. After variants calling, all mtDNA variants including SNV, insertion and replacement were obtained from highland and lowland chickens, separately. Filtrations were also carried out same as what described for differential variants. We recognized that highland chickens had more variants than lowland chickens (32 versus 18 variants). It might be explained that more variants in highland chickens were created in order to adapt to the high-altitude conditions. Novel variants: variants were reported for the first time. Coding variants: variants were located on coding regions. Amino acid changes: The protein sequence can be changed by the variations.

Validations of novel differential SNVs between highland and lowland ecotypes and mtDNA variants

Results of the validation indicated that 19 out of 32 novel variants were validated in mtDNA. Five variations of regions including 5928, 6758, 8070, 8330, 11378 had the most validating percentage of mtDNA (Table 11). Results of the validation of novel differential SNVs showed that 19 out of 3845 variants were validated in five new samples (Table 11).
Table 11

Validation of novel differential SNVs and mtDNA variants in highland chickens.

ChrRegionGenotypeValidation %Variant type
MtDNA164C/T20Single nucleotide variant (SNV)
307C/T20
443C/T20
5928 C/A 40
6758 T/C 40
6800T/C20
7530C/G20
8070 T/C 40
9533A/G20
10072A/G20
11378C/T 40
11963C/T20
16586A/G20
253T/C20
258C/T20
4580G/A20
8330 T/C 40
11378 C/T 40
12094T/C20
11254679A/G20
1103830963C/T20
1 2704537 G/A 60
1165925917G/T20
189054158C/T20
1180719232A/G20
1138527429G/A20
24877832C/T20
256043229C/T20
249059638T/C20
333701810A/G20
365865344A/C20
476983077A/G20
62014167A/C20
717112008C/G20
88248431A/T20
92581777C/T20
12816375A/G20
178030611G/A20

Validations were performed for two groups of detected variants in this study. Frist, novel differential SNVs between highland and lowland chickens, and second, mtDNA variations in highland chickens. Thus, other five whole genomes of highland samples (Isfahan) were sequenced, separately. The whole genome sequencing, trimming and variant detection were carried out based on provided information in material and method sections. A total of 3845 SNVs was reported as new variants for differential variants analyses, and 32 variations were detected in mtDNA for highland samples. The region and chromosomes of each SNV were evaluated in the new five samples by R program (www.R-project.org) in order to validate the variants. The calculation of validation’s percentage was based on samples that had variants. Finally, 19 variants were validated for mtDNA and novel differential variants.

Validation of novel differential SNVs and mtDNA variants in highland chickens. Validations were performed for two groups of detected variants in this study. Frist, novel differential SNVs between highland and lowland chickens, and second, mtDNA variations in highland chickens. Thus, other five whole genomes of highland samples (Isfahan) were sequenced, separately. The whole genome sequencing, trimming and variant detection were carried out based on provided information in material and method sections. A total of 3845 SNVs was reported as new variants for differential variants analyses, and 32 variations were detected in mtDNA for highland samples. The region and chromosomes of each SNV were evaluated in the new five samples by R program (www.R-project.org) in order to validate the variants. The calculation of validation’s percentage was based on samples that had variants. Finally, 19 variants were validated for mtDNA and novel differential variants.

The analysis of genomic variants enrichment for the differential variant between highland and lowland chickens

The chromosome that had the most standard frequency in females was known as the chromosome 27 (Fig. 3). In fact, 234 variants were recognized on chromosome 27; however, 100 variants (43% of variants) were located in the range of 2692–1 × 106 base pairs (16% of the chromosome length) (Fig. 4a). Findings illustrated that 40 ENSGALT (Ensembl transcript) IDs were available in this region, and five Ensembl gene IDs were also detected by BioMart tools (Table 12). We found that MIR6644-2 gene contributed in the gastrulation. Gastrulation is recognized as one of the most important steps in animals’ embryonic development. Furthermore, the single layered blastula develops to a multilayered organization, called gastrula, during the gastrulation[17].
Figure 3

Standard frequency of differential variants between highland and lowland chicken for the selection of chromosomes that have the highest standard frequency. Dividing variants numbers by the chromosome length (Mb), standard frequencies of differential variants were estimated for each chromosome, then chromosomes with high-density of variants were selected (marked with asterisk) in order to be utilized for further analysis. Chromosomes 30–33 are not available in the genome reference.

Figure 4

Cumulative frequency graph of variant numbers for chromosomes 27 (a) and 2 (b) in female and male chickens for genomic variant enrichment analysis. Chromosomes 2 and 27 (Fig. 2) were selected in order to draw the cumulative frequencies graph of variants along the chromosome. It is shown by Figure (a) that the most variants enrichments are available in the range of 2692–1 × 106 base pairs (separated with horizontal line) of chromosome 27. A total of 234 variants was recognized on chromosome 27, and 100 (42.72%) variants were also located in the range of 2692–1 × 106 region. Figure (b) indicates that 17891 variants were recognized on chromosome 2, but 7600 variants (43%) were located in the range of 41 × 106–81 × 106 (27% of chromosome length) base pairs. Finally, these regions with the highest density of variation were selected for the identification of candidate genes.

Table 12

Genomic variants enrichment analysis for the identification of candidate gene associated with hypoxia in males and females.

SexGene IDBiotypeFunctionProduct nameGene nameUniprot ID
FemaleENSGALG00000028220Protein codingstructural constituent of cytoskeletonKeratin LOC107055272 E1C3X3
ENSGALG00000026257Protein codingstructural constituent of cytoskeletonKeratin LOC107055272 E1BXQ5
ENSGALG00000028852miRNAGastrulationMIR6644-2 MIR6644-2 No protein
ENSGALG00000026447Protein codingCytoskeleton organizationKeratin LOC428303 E1C3N8
ENSGALG00000027740miRNAUnknownUnknownUnknownNo protein
MaleENSGALG00000011733Protein codingimmune responseC-C chemokine receptor type 2 CCR2 F1NPK8
ENSGALG00000011734Protein codingchemotaxisC-C chemokine receptor 8 like CCR8L F1NPK7
ENSGALG00000011735Protein codingchemokine receptor activitychemokine XC receptor 1 CCXCR1-L Q702H5
ENSGALG00000011808Protein codingmacrophage receptor activityC-C chemokine receptor type 9 CCR9 F1NMB1
ENSGALG00000012671Protein codingHormone activityProlactin Prl A0A0B5L5F3
ENSGALG00000012700Protein codingubiquitin-protein transferase activityRing finger protein 182 RNF182 E1BZ35
ENSGALG00000012702Protein codingChromatin bindingProtein Jumonji JARID2 F1NWZ0
ENSGALG00000012732Protein codingprotein phosphatase regulator activityPhosphatase and actin regulator PHACTR1 F1NWB1
ENSGALG00000012748Protein codingElongation of very long chain fatty acids protein 2fatty acid elongase activity ELOVL2 E1BYE9

Identification of the genomic regions, which had the highest density of differential variant between highland and lowland chickens, is considered as the main purpose of genomic variants enrichment analysis. The candidate genes located in the regions of genome, which had the highest density of variants, can be observed in this table. We found that the most important functions that can be considered in low oxygen conditions are Immune response, chromatin binding, and gastrulation. In addition, MIR6644-2 was reported as the candidate gene in order to develop the gastrulation process, and was not implicated in the previous investigations about high-altitude adaptation of chickens.

Standard frequency of differential variants between highland and lowland chicken for the selection of chromosomes that have the highest standard frequency. Dividing variants numbers by the chromosome length (Mb), standard frequencies of differential variants were estimated for each chromosome, then chromosomes with high-density of variants were selected (marked with asterisk) in order to be utilized for further analysis. Chromosomes 30–33 are not available in the genome reference. Cumulative frequency graph of variant numbers for chromosomes 27 (a) and 2 (b) in female and male chickens for genomic variant enrichment analysis. Chromosomes 2 and 27 (Fig. 2) were selected in order to draw the cumulative frequencies graph of variants along the chromosome. It is shown by Figure (a) that the most variants enrichments are available in the range of 2692–1 × 106 base pairs (separated with horizontal line) of chromosome 27. A total of 234 variants was recognized on chromosome 27, and 100 (42.72%) variants were also located in the range of 2692–1 × 106 region. Figure (b) indicates that 17891 variants were recognized on chromosome 2, but 7600 variants (43%) were located in the range of 41 × 106–81 × 106 (27% of chromosome length) base pairs. Finally, these regions with the highest density of variation were selected for the identification of candidate genes. Genomic variants enrichment analysis for the identification of candidate gene associated with hypoxia in males and females. Identification of the genomic regions, which had the highest density of differential variant between highland and lowland chickens, is considered as the main purpose of genomic variants enrichment analysis. The candidate genes located in the regions of genome, which had the highest density of variants, can be observed in this table. We found that the most important functions that can be considered in low oxygen conditions are Immune response, chromatin binding, and gastrulation. In addition, MIR6644-2 was reported as the candidate gene in order to develop the gastrulation process, and was not implicated in the previous investigations about high-altitude adaptation of chickens. The most standard frequency of variants of chromosome 2 in males was estimated to be 119.60 (Fig. 3). A total of 17891 variants was detected on chromosome 2, but 7600 variants (43%) were located in the range of 41 × 106–81 × 106 (27% of chromosome length) base pairs; however, this region, which had a high density of variation, was selected for further analysis (Fig. 4b). It was shown by results of UCSC and the table browser analysis that 412 ENSGALT IDs were available for the selected region and consequently, nine Ensemble gene IDs were identified by BioMart tools (Table 12). We found that some functions were involved in the adaptation to hypoxia including chemotaxis, immune response, inflammatory response, and positive regulations of monocyte chemotaxis. Other results indicated that Jumonji protein might have an important role in the adaptation to hypoxia. Reviewing the studies, we found that Jumonji protein family would contribute in the wide range of chromatin regulation, genes expression, and signaling pathways[18]. While the Jumonji is known as the DNA binding and transcriptional repressor, this protein interacts with the Polycomb repressive complex 2 (PRC2) in humans, which plays a critical role in the regulation of gene expression during the embryonic development (www.ncbi.nlm.nih.gov/gene/3720). This study indicates that PRL gene can play a very important role in the condition of lack of oxygen. It has been illustrated that the PRL gene is associated with reproductive traits; specially egg production traits[19]. In fact, PRL gene is a multifunctional hormone that impacts on multiple physiological processes including cardiovascular system; however, they have been poorly described[20].

The possible gene network of differential variants

Results of the analysis of gene network indicate that seven differential candidate genes contribute in the regulation of carcinogenesis. It is observed that most of the candidate genes induce carcinogenesis, especially BMX and LRP6 (Fig. 5). However, HSPA14 and DCLRE1A genes had an inhibition impact on the tumorigenesis. See the Supplementary Table S8 for more details about underlying references of relationships. Figure 6 shows the cellular location of candidate gene. For example, LRP6 is considered as the receptor; therefore, it is shown in the membrane cell. In addition, BMX and HSPA14 were activated in cytoplasm, but other candidate genes were in the nucleus. This information could be useful in order to explore the contribution of genes in biological pathways. For instance, DCLRE1A and PRKDC have a critical role in the process of DNA repair[21]. Therefore, their location can also be observed in nucleus. These results could be helpful in order to understand the process of diseases creation and select the appropriate drug design method, based on the cellular location of key proteins in carcinogenesis.
Figure 5

The network analysis for differential candidate gene in hypoxia condition for carcinogenesis. The high-dose UV radiation can lead to DNA damage and tumorigenesis in high-altitude condition; thus, gene networks analyses of key genes involved in the high-altitude adaptation could provide new information about carcinogenesis. A total of 27 candidate genes was identified for differential variants in males and females in this investigation (Tables 7 and 8). Chicken Ensembl gene identifications were converted into human orthologs in order to construct an informative network and finally, results showed that seven candidate genes were enriched in carcinogenesis network and had a regulatory role. Specifically, BMX and LRP6 induce the carcinogenesis and HSPA14, and DCLRE1A inhibits that.

Figure 6

The cellular location of the key proteins in the regulation of carcinogenesis. This figure shows the cellular location of the key proteins in three parts of the cell (Membrane, Nucleus and Cytoplasm). LRP6 is known as receptor, thus cellular location of this protein is shown in the membrane. Similarly, other proteins including DCLRE1A contribute in the DNA repair process. Consequently, it is illustrated in nucleus. These results can be useful for better understanding of the diseases creation process and the selection of appropriate drug design methods, based on cellular locations of key proteins in carcinogenesis.

The network analysis for differential candidate gene in hypoxia condition for carcinogenesis. The high-dose UV radiation can lead to DNA damage and tumorigenesis in high-altitude condition; thus, gene networks analyses of key genes involved in the high-altitude adaptation could provide new information about carcinogenesis. A total of 27 candidate genes was identified for differential variants in males and females in this investigation (Tables 7 and 8). Chicken Ensembl gene identifications were converted into human orthologs in order to construct an informative network and finally, results showed that seven candidate genes were enriched in carcinogenesis network and had a regulatory role. Specifically, BMX and LRP6 induce the carcinogenesis and HSPA14, and DCLRE1A inhibits that. The cellular location of the key proteins in the regulation of carcinogenesis. This figure shows the cellular location of the key proteins in three parts of the cell (Membrane, Nucleus and Cytoplasm). LRP6 is known as receptor, thus cellular location of this protein is shown in the membrane. Similarly, other proteins including DCLRE1A contribute in the DNA repair process. Consequently, it is illustrated in nucleus. These results can be useful for better understanding of the diseases creation process and the selection of appropriate drug design methods, based on cellular locations of key proteins in carcinogenesis.

The submission of novel variants to European Bioinformatics Information (EBI)

Many SNVs are detected as the novel variants in this investigation. Novel variants were submitted to EBI database with the accession number of PRJEB24944. See Table S9 for more details. Discovered variants can be utilized for future breeding programs. The analysis of data with hierarchical pattern recognition algorithms can be ranked, thus the best combination of variants distinguishing highland and lowland chickens will also be found[22,23].

Discussion

Full genome sequencing of highland and lowland chicken ecotypes are utilized in this investigation in order to unravel molecular mechanisms of adaptation to the condition of lack of oxygen. Recently, researchers are more willing to study the mtDNA variation and ATP production in low oxygen conditions. According to our findings, hydrogen ion transmembrane transports in mitochondrial inner membrane, as a biological pathway involves in mitochondrial respiratory, and COX3 candidate gene (cytochrome c oxidase III or MT-CO3) was also detected for this biological pathway. It is the terminal enzyme of the respiratory chain of mitochondria, which is involved in the transfer of electrons from reduced cytochrome c to the molecular oxygen[24]. Results of this study corroborate the findings of Sun et al.[25] who investigated the association between MT-CO3 (mitochondrially encoded cytochrome c oxidase subunit III) gene and high-altitude adaptation in Tibetan chickens. They demonstrated that eight SNPs were available (single nucleotide polymorphisms), and five of them were shared by Tibetan chickens and lowland chickens. Results showed that highland chickens had more mtDNA variants than lowland chickens (32 versus 18 variants). It is a reasonable approach to consider that more variants in highland chickens were created in order to be adapted to high altitude conditions. Amazingly, there is a severe sex bias in mitochondrial nucleotide variation, as male highland chickens had more variants than female highland chicken (a ratio of almost 3:1). A possible explanation for this might be that female and male chickens are different in sexual traits such as, fertility, gaining and consequently, energy consumption. In this way, Camus et al.[26] indicated that mtDNA variation can affect the regulation of healthy traits and contribute in sex-related differences. Furthermore, the interactions between mtDNA and nuclear DNA involved in sex-specific transcript responses[27]. In addition, we have detected novel mtDNA variations for the first time. Thus, our findings provided new potential molecular genetics for the genetic adaptation to high-altitude conditions. Logically, further examinations will be required in order to describe the main role of novel variants in low oxygen conditions. We found that there is a non-mitochondrial variant (chromosome 12), which is involved in the cell respiration. Mitochondrial respiratory complex chain III contributes in the ATP synthesis, electron transport, and metabolism. It is a complex protein located in the mitochondrial inner membrane[28]. Here, UQCRC1 gene was reported in order to encode ubiquinol cytochrome c reductase or cytochrome bc1 complex protein. This might be explained by the fact that the activity of cytochrome c oxidase will decrease in the low oxygen condition and will also lead to the less activity of bc1 compound. Logically, the generation of reduced ubiquinol will be decreased[29]. Figure 7 shows the protein structure of UQCRC1.
Figure 7

UQCRC1 gene encodes ubiquinol cytochrome c reductase or cytochrome bc1 complex protein. It is a complex protein which is located on the mitochondrial inner membrane and involves in the cell respiration. The variants locations are shown by the yellow circle in the whole structure. Purple and green atoms show the reference and variants structures, respectively.

UQCRC1 gene encodes ubiquinol cytochrome c reductase or cytochrome bc1 complex protein. It is a complex protein which is located on the mitochondrial inner membrane and involves in the cell respiration. The variants locations are shown by the yellow circle in the whole structure. Purple and green atoms show the reference and variants structures, respectively. Additionally, the proteasomes and protein translation tools are identified as interesting results of the cellular components analysis. There are evidence suggesting an association between hypoxic condition and protein hemostasis. Low oxygen condition affects the gene expression profiles in cell, in this way, transcription and translation process are changeable. Also, protein synthesis is one of the most energy-consuming process in cell related to normal oxygen concentration[30,31]. Thereby, cell reduces the messenger RNA (mRNA) rate to save energy metabolism while continuing the production of the proteins associated with survival factors[32]. Gene ontology enrichment and genomic variants enrichment analysis, including immune response, histone banding, and organ development, also achieved the same results. Direct evidences were found on the association between solar radiation and immune response about the hypoxia condition in this investigation. Thus, it was recognized that NK (natural killer) T cell differentiation was considered as the immune response. BMX gene contributes in the NK T cell differentiation. BMX gene encodes cytoplasmic tyrosine-protein kinase in human being[33]. It plays a critical role in the induced interleukin-6 (IL6), adaptation of different cell systems against stress, growth, and differentiation of hematopoietic cells[34]. Findings of this study are consistent with those of McNamee et al.[35], which indicate that the differentiation of T cells could be stimulated by hypoxia and high altitude conditions. Also, Zhang et al.[15] showed that there is an association between the high altitude condition and inflammation responses. Protein structures and the variant location of BMX are shown in Fig. 8. Similarly, results of the analysis of genomic variants enrichment indicated that chemokine receptors have a critical role in the immune response. Chemokines are defined as small protein molecules produced by cells of the immune system. They contribute in the migration of immune cells to an infection site. Furthermore, chemokine receptors are involved in monocyte infiltration of inflammatory responses against tumors (Uniprot: Q8QG57).
Figure 8

BMX gene encodes cytoplasmic tyrosine-protein kinase and contributes in the NK T cell differentiation. It plays a critical role in the induced interleukin-6 (IL6), adaptation of different cell systems against stress, growth, and differentiation of hematopoietic cell. Therefore, BMX gene was considered as the immune response in hypoxia conditions. The variants locations are shown by the yellow circle in the whole structure. Purple and green atoms also show the reference and variants structures, respectively.

BMX gene encodes cytoplasmic tyrosine-protein kinase and contributes in the NK T cell differentiation. It plays a critical role in the induced interleukin-6 (IL6), adaptation of different cell systems against stress, growth, and differentiation of hematopoietic cell. Therefore, BMX gene was considered as the immune response in hypoxia conditions. The variants locations are shown by the yellow circle in the whole structure. Purple and green atoms also show the reference and variants structures, respectively. Several studies indicated that epigenetic regulations in cells might be caused by the hypoxia[36-38]. Results of gene ontology showed that histone binding might be associated with the hypoxia condition. Furthermore, we suggested that ATAD2 gene can be considered for the histone binding process. ATAD2 gene belongs to the ATPase family and is involved in the chromatin binding and histone binding (UniProt: Q6PL18). This finding is in accordance with that of Zhang et al.[15], which indicated that there was an association between histone binding and high altitude conditions. In addition, the analysis of genomic variants enrichment confirmed that Jumonji protein and JARID2 can contribute in the hypoxia adaptation process by chromatin bindings. It has been demonstrated that, JARID2 gene plays an essential role in embryonic and organ developments including heart development, and neural tube fusion (UniProt: Q92833). Other interesting findings of the gene ontology enrichment analysis are considered as organ developments including brain development and pericardium morphogenesis. A strong relationship between high altitude condition and organs development has been reported by Zhang and Burggren[39]. For instance, highland chickens have smaller organs including heart, liver and lungs[15]. Our study presents evidences in order to prove that brain development has a critical role in the adaptation to high altitude conditions. It may be explained by the fact that brain is the most sensitive organ to the lack of oxygen, thus cerebral anoxia induces neuronal cell death and apoptosis and will also lead to hypoxic brain injury[40]. We found that there are gene ontology terms which impact on the brain development in highland chickens including mid brain development, cerebral cortex development, and thalamus development. It is recognized that pericardium morphogenesis was effective in high altitude conditions. Similarly, Patterson and Zhang[41] demonstrated that normal heart formation and maturation in fetus depend on low oxygen stress, but further abnormal low oxygen condition has a negative influence on the structure, function, and gene expression in the fetal heart tissue. This is the evidence which is in accordance with our results. Therefore, LRP6 gene was considered as a candidate gene for organs development in high altitude conditions. LRP6 gene encodes low-density lipoprotein receptor-related protein 6 that has included co-receptor groups in the canonical Wnt pathway (Wnt/β-catenin pathway). It is considered as a very important regulator of tissue development and hemostasis[42]. Figure 9 shows the protein structure and variant location of LRP6.
Figure 9

LRP6 gene was considered as a candidate gene for organs development in high altitude conditions. LRP6 gene encodes low-density lipoprotein receptor-related protein 6 and is a key regulator of tissue development and hemostasis. The yellow circle in the whole structure shows variants locations. The reference and variants structures are also shown by purple and green atoms, respectively.

LRP6 gene was considered as a candidate gene for organs development in high altitude conditions. LRP6 gene encodes low-density lipoprotein receptor-related protein 6 and is a key regulator of tissue development and hemostasis. The yellow circle in the whole structure shows variants locations. The reference and variants structures are also shown by purple and green atoms, respectively. Other interesting GO terms contained in high-altitude adaptation pathways- including DNA repair, double strand break repair, DNA biosynthetic process and inter strand cross-link repair activity- can be found in this study. One possible explanation might be defined as: high-dose UV radiation can lead to DNA damage, cell apoptosis, skin cancer and tissue injury of mammals[43,44] in high-altitude conditions. In this study, we reported DCLRE1C gene as a candidate gene for DNA repair process. In humans, DCLRE1C gene encodes a single-strand nuclear protein that has a specific 5′-3′ exonuclease activity. Interestingly, this protein has main functions in the regulation of the cell cycle in response to DNA damage[21]. Amazingly, it should be noted that comparable results were obtained by SSA and differential variants analysis. The results of differential variants analysis indicated that adaptive variants were involved in processes of DNA repair, organs development, immune response, epigenetic regulation, mitochondrion function (cell respiration), and protein hemostasis. Interestingly, similar results were reported by SSA and it is also recognized that both analysis show the similar gene’s functions that contribute in response to hypoxia. For instance, cell respiration and mitochondrion function were considered as the most important results of differential variants analysis in low oxygen condition. Thus, UQCRC1 (mitochondrial respiratory chain complex III) was suggested as the candidate gene of cell respiration. Also, several candidate genes associated with cell respiration and mitochondrion function, including COX7A2, ME3 and HCC, were reported by SSA. According to the results of SSA, it is recognized that adaptive variants in differential variant analysis are due to the adaptation process. Table 13 shows a summary of the comparison of results of SSA and differential variants analysis.
Table 13

The comparison results of signature selection and differential variant analysis.

CategoriesGene nameDescriptionSSADVA
DNA repairMMS22LDNA repair protein*
PRIM2DNA primase*
MRE11ADouble-strand break repair protein*
DNASE2Deoxyribonuclease-2-beta precursor*
DDIASInduced by UV radiation. DNA damage-induced apoptosis suppressor*
DCLRE1A5′-3′ exodeoxyribonuclease activity*
TATDN1Deoxyribonuclease activity*
DCLRE1ADouble-strand break repair*
COBLL1DNA biosynthetic process*
POLQDouble-strand break repair*
PRKDCDNA repair*
DCLRE1AInter strand cross-link repair*
Immune responseTRAT1T cell receptor associated transmembrane adaptor 1**
BMXBMX non-receptor tyrosine kinase*
BMXNK T cell differentiation*
CXCR4C-X-C chemokine receptor type 4*
Organ developmentBDNFBrain-derived neurotrophic factor*
LRP6Cerebral cortex development*
LRP6Midbrain development*
LRP6Thalamus development*
Mitochondrion function (cell respiration)COX7A2Cytochrome c oxidase subunit 7A2*
MTO1Protein MTO1 homolog, mitochondrial*
ME3Malic enzyme 3, mitochondrial*
DARS2aspartyl-tRNA synthetase 2, mitochondrial*
PRELID1PRELI domain-containing protein 1, mitochondrial*
TIMM10Translocase of inner mitochondrial membrane*
MRPS1128S ribosomal protein S11, mitochondrial*
MRPL4639S ribosomal protein L46, mitochondrial*
MELANOMAGallus gallus mitochondrial ribosomal nuclear gene encoding mitochondrial protein*
DLDdihydrolipoyl dehydrogenase, mitochondrial*
MICU3Mitochondrial calcium uptake family ember 3*
HCCSCytochrome c-type hemelyase*
UQCRC1Mitochondrial electron transport*
UQCRC1Mitochondrial respiratory chain complex III*
Epigenetic regulationATAD2Histone binding *
SERPINB10Heterochromatin-associated protein MENT. Chromatin DNA binding *
Protein hemostasisPSMD13Proteasome accessory complex *
PSMD13Proteasome regulatory particle *
PSMD13Proteasome assembly *
PSMG3Proteasome assembly chaperone 3 *

The comparable results were obtained by signature selection and differential variants analysis. The results of differential variants analysis indicated that adaptive variants were involved in DNA repair, organs development, immune response, epigenetic regulation, mitochondrion function (cell respiration) and protein hemostasis. Interestingly, the similar results were reported by signature selection analysis and it is recognized that both analysis show the similar gene’s functions that contribute in response to hypoxia. SSA: signature selection analysis. DVA: differential variant analysis.

The comparison results of signature selection and differential variant analysis. The comparable results were obtained by signature selection and differential variants analysis. The results of differential variants analysis indicated that adaptive variants were involved in DNA repair, organs development, immune response, epigenetic regulation, mitochondrion function (cell respiration) and protein hemostasis. Interestingly, the similar results were reported by signature selection analysis and it is recognized that both analysis show the similar gene’s functions that contribute in response to hypoxia. SSA: signature selection analysis. DVA: differential variant analysis. Results of the analysis of genomic variants enrichment showed that miRNAs can be considered in the high-altitude adaptation. Clearly, miRNAs involve in the regulation of different pathways in various cell types; therefore, they can provide new evidence about hypoxia-related gene regulations[45]. Several studies showed that an association between hypoxia and hypoxia-inducible factor (HIF) transcription factor is available, and will lead to the expression of a different type of genes such as miRNAs[46,47]. It was recognized that MIR6644-2 was reported as the candidate gene in the high-altitude adaptation process by our genomic variants enrichment analysis. Despite there was no previous study about MIR6644-2 gene and high-altitude adaptation; however, this has been proved that it has a main role in the gastrulation[48]. This is recognized as one of the most interesting discussions about hypoxia and gastrulation, that chronic hypoxia alters the physiological and morphological pathways of developing animal embryos[49,50]. In this study, we carried out the gene network analysis in order to identify that candidate genes were enriched in human diseases. Although the biological pathways in animals are not in a good accordance with human being, there are many treatment systems and drugs developed in animal models, especially chicken models for human genetic and disease[51-53]. We found that the gene network was associated with the carcinogenesis. This result may be explained by the fact that, UV radiation in highlands leads to the DNA damage. Similarly, there is an association between cancers and DNA damage. Interestingly, the network analysis indicated that DCLRE1C and PRKDC proteins can inhibit the tumorigenesis. It is because the DCLRE1C and PRKDC proteins involve in DNA repair process[21]. In addition, PRKDC protein has a critical role in the development of immune system by maturing B and T cells[54]. This finding supports the previous research which showed the DNA-PKcs-deficient (encoded by PRKDC gene) associated with immune deficient, radio sensitivity and tumor development in mice[54,55]. We found that PES1would induce tumorigenesis. It is a possible explanation that PES1 encodes pescadillo ribosomal biogenesis factor 1, which is a member of the PeBoW complex and has a main role in regulating the cell cycle. Therefore, the abnormal mitosis and carcinogenesis might be caused by the disruption of PeBoW complex. These findings further support the idea that PES1 may involve in the promotion of the malignant phenotypes of colon cancer cells and the gastric cancer[56,57]. Also, it was found that LRP6 would lead to the cancer disease. Cell receptors are considered as biomarkers in cancer researches[58]. There are several studies attempt to explain the role of LRP6 receptor in tumorigenesis including triple negative breast cancer and human hepatocellular carcinoma[59-63]. Considering the Wnt signaling pathway (Wnt/β-catenin pathway) and normal cell cycle regulation, LRP6 is a member of the LDL receptor family[63]. HSPA14 was also reported as another candidate gene in carcinogenesis, and HSPA14 is recognized as the heat shock protein family (member 14). Also, it is known as the HSP70-4 or HSP70L1. Previous studies indicated that HSPs involves in the wide range of tumor[64-67]. Similarly, Rerole.et.al.[68] showed that HSP70 are expressed abnormally as well as it causes tumor growth and metastasis in rodents. See the Supplementary text for more information about the discussion part.

Conclusions

This is a comparative genomic study of Iranian native chicken ecotypes in order to identify the genomic region associated with hypoxia and low oxygen condition. The detection of candidate genes in this study would provide new insights about hypoxia. These adaptive genes were related to important functions including DNA repair, organ development, immune response, and histone binding. The comparable results were obtained by signature selection and differential variant analysis, in this way we demonstrate that the differential variants are actually adaptive in response to hypoxia and are not due to other evolutionary pressures or mechanisms of genetic change. We have also found that variations in mtDNA, including COX3 gene, play an important role in the process of adaptation. It was identified that several candidate genes were associated with carcinogenesis. This is the first investigation that sequences the whole genomes of highland and lowland native chickens; therefore, many novel variants are discovered and submitted to EBI (PRJEB24944). Moreover, MIR6644-2 gene is considered as a candidate gene which might impact on the adaptation process to hypoxia by the regulation of embryo development, and was not implicated in previous investigations of high-altitude adaptation among chickens. Therefore, this paper can provide new information about the adaptation process. The signature of native chicken, which ecotypes to high altitude and low oxygen at the genomic level, is produced for better understanding about hypoxia adaptation mechanisms.

Materials and Methods

The summary of sampling and whole genome data analyses are depicted in Fig. 10.
Figure 10

The Summary of sampling and data analysis for whole-genome sequencing of native chicken ecotypes and variant discovery associated with high-altitude adaptation (the same colors of vectors show the similar analysis).

The Summary of sampling and data analysis for whole-genome sequencing of native chicken ecotypes and variant discovery associated with high-altitude adaptation (the same colors of vectors show the similar analysis).

Ethical approval statements

This investigation is in accordance with relevant guidelines and regulations of Shiraz University. All experimental protocols were approved by Institute of Biotechnology at Shiraz University. The whole procedure of blood sampling was approved by the Department of Animal Science at Shiraz University (Permit number: 93-192). No birds were slaughtered and harmed.

Blood sampling and DNA extractions

Blood samples were collected from Isfahan (highland, altitude = 2087 m) and Mazandaran (lowland, altitude = 54 m) provinces. Ten samples including three males and two females of the highland, and two males and three females of the lowland were collected. See Fig. 11 for more details about sampling locations. Two mL of blood was obtained from the birds’ wing vein. Using salting out protocol, the total DNA was isolated from the whole blood[69]. The process of testing DNA samples’ quality was carried out by the agarose gel (2%) and NanoDrop spectrophotometer, and high-quality DNA samples were also utilized for the subsequent whole genome resequencing.
Figure 11

Sampling locations of highland and lowland chickens in Isfahan and Mazandaran provinces, located at different altitudes in Iran. Ten blood samples, including three males and two females from highland and two males and three females from lowland, were collected in order to be utilized in the whole genome sequencing. This figure was downloaded from an open source website https://commons.wikimedia.org. The direct URL is https://upload.wikimedia.org/wikipedia/commons/9/99/Iran_topo_en.jpg. The link to the license is https://creativecommons.org/licenses/by-sa/4.0/deed.en. Outer areas from the original figure were cropped, therefore “Caspian Sea” was rewritten in the above of figure based on the information of original figure. In addition, the locations of sampling were added to the figure by yellow vectors.

Sampling locations of highland and lowland chickens in Isfahan and Mazandaran provinces, located at different altitudes in Iran. Ten blood samples, including three males and two females from highland and two males and three females from lowland, were collected in order to be utilized in the whole genome sequencing. This figure was downloaded from an open source website https://commons.wikimedia.org. The direct URL is https://upload.wikimedia.org/wikipedia/commons/9/99/Iran_topo_en.jpg. The link to the license is https://creativecommons.org/licenses/by-sa/4.0/deed.en. Outer areas from the original figure were cropped, therefore “Caspian Sea” was rewritten in the above of figure based on the information of original figure. In addition, the locations of sampling were added to the figure by yellow vectors.

Phenotypic data and statistical analysis

Classification of highland and lowland chickens, based on phenotypic traits, was the main purpose of the statistical analysis. Twenty-four quantitative traits were recorded for 16 chickens of highland and lowland. To reduce measurement error, all chickens were matured while recording. The same recording protocol was also used in order to measure phenotypic traits of all the birds. Traits included the body weight (gr), neck length, body size (between waist and pectoral circumference), shank length, body size (between waist and abdominal circumference), wing length, tail length, femur length, crown length, crown height, back cape length, body size (between pectoral and cloaca circumference), head height (height from head to floor), wings length (open wings), number of toes, number of spur (nail of the foot), beak length, head circumference with a crown and a double wattles, toes length (right, middle, left), wattle height, wattle width, pectoral width, femur diameter and shank diameter. Utilizing the principle component analysis and discriminate analysis in Minitab software, we analyzed these traits (version: 17)[70].

Whole genome sequencing

The construction of genomic libraries was based on the Illumina standard genome library preparation pipeline. Using Hiseq. 2000 platform, whole genomes were sequenced and the length of provided paired-end short reads was also125 bp.

Quality control and trimming

The function of quality control in CLC Genomic Workbench (8.5.1)[71] was used by the following parameters for each sample: length distribution, GC content, ambiguous base content, Phred score, nucleotide contribution, enrich 5 mers, and duplicate sequences[72]. The adaptor sequences were removed by Illumina Company; thereby, trimming was carried out based on other parameters.

Reference genome, mapping, locally realignments and coverage analysis

The reference genome and annotations were downloaded from the Ensembl database. Annotations included gene annotations and variants (ftp://ftp.ensembl.org/pub/relase-84/fasta/gallus_gallus). Mapping was carried out in CLC Genomics Workbench (8.5.1)[71] based on the following parameters: masking mode = no masking, mismatch cost = 2, cost of insertions and deletions = linear gap cost, insertion cost = 3, deletion cost = 3, length fraction = 0.7, similarity fraction = 0.8, global alignment = no, Auto-detect paired distances = yes, and non-specific match handling = ignore. Finally, local realignment was used according to the following parameters: realign-unaligned ends = yes and multi-pass realignment = 2[73]. The analysis of coverage was also utilized in order to identify the frequency of a base’s coverage in the reference genome by the mapped read of a given sequencing[74]. Here, the analyses of coverage were estimated for all highland and lowland samples before and after the trimming process. Also, Lander/Waterman equation was used for the calculation of the coverage[75].where, C stands for coverage, G is the haploid genome length (bp), L is the read length and N is the number of reads.

Variant calling and statistical analysis

The variant detections algorithm was used by CLC genomics workbench (8.5.1)[71]. First, parameters were optimized for variant calling; therefore, some parameters were changed and tested for the optimization. Decreasing false variants was the main purpose of optimization (Supplementary Table S1). The ploidy level is fixed in chickens (2n = 78). Therefore, fixed ploidy algorithm[76] was used for variant calling based on following parameters: required variant probability (%) = 95.0, ignore positions with coverage above = 50,000, restrict calling to target regions = not set, ignore broken pairs = yes, ignore non-specific matches = reads, minimum coverage = 10, minimum count = 2, minimum frequency (%) = 30, base quality filter = Yes, neighborhood radius = 15, minimum central quality = 30, minimum neighborhood quality = 25, remove pyro-error variants = yes, in homo-polymer regions with minimum length = 3-with frequency below = 0.8[72,76]. We used the IDEG6 website for the statistical analysis[77]. Using Fisher exact test, we analyzed differences in variants distribution. The statistical analysis was carried out for each variant between highland and lowland samples, separately. Also, Chi-square test was established in order to analyze the differential distributed variations between highland and lowland chickens for each chromosome incorporating the sex of birds in the statistical analysis.

Phylogenetic and structural population analysis

Any change in mitochondrial DNA (mtDNA) can be utilized as the phylogenic analysis, because mtDNA are considered as the conserved sequences. A total of 48 SNVs (single nucleotide variations) was identified in mtDNA using CLC Genomic Workbench (8.5.1)[71], 480 SNVs were also genotyped (see results part) for ten samples. The phylogenic analysis was carried out based on 480 SNVs genotypes, Nei’s genetic distance matrix, and UPGMA method by PopGene software (version 1.32)[78]. The population structural analysis was performed using multiple correspondence analysis (MCA) by R program (version: 3.2.2)[79] and the “Ca” package[80]) in order to classify the highland and lowland chickens. We used the multiple correspondence analyses for qualitative data, while the principal component analysis (PCA) was used for quantitative data. MCA is another tool of multivariate methods that allows the analysis of systematic patterns of variations with categorical data[81]. 1000 SNVs genotypes were collected randomly from each highland and lowland chickens and a total of 10000 SNVs genotypes was used for MCA analysis.

Differential variants

Comparing variants, filter variants and gene ontology enrichment analysis

After variants calling, variations of highland chickens were compared against the reads of lowland chickens as a control tool in order to remove the common variation between lowland and highland samples. Also, a comparison was carried out between male and female birds of highland and lowland, separately. We considered five frequency threshold percentages (0, 25, 50, 75 and 100) in order to be utilized for the frequency threshold optimization. The threshold frequency is considered as the percentage of samples that have variants. For instance, determining it to 50% indicates that at least 50% of samples, which were selected as the inputs, must contain a given variant in order to be reported in the output. The differential variants were filtered based on known variants for the identification of novel variants. CDS (coding region) annotation was selected for overlap comparisons in order to understand which variants were located on coding and non-coding regions. Coding variants were collected for functional consequences and amino acid changes analysis. Reference genome, CDS and mRNA annotations were used for the amino acid change analysis. In addition, synonymous variants and CDS regions that had no variants were also filtered. In order to carry out the amino acid change analysis, standard genetic code was selected in CLC genomics[82]. The file of gene ontology (GO) association, which includes the gene names and associated gene ontology terms, was downloaded from the gene ontology consortium (http://geneontology.org/) and imported to CLC Genomic Workbench (8.5.1)[71]. The output of amino acid change analysis was utilized for the analyses of gene ontology enrichment of the biological process, molecular function, and cellular component. The significance level of GO analysis was determined to be 0.01.

Signature selection analysis

It should be noted that SSA was carried out in order to confirm the results of differential variant analysis in the current study. Considering descriptions of Weir and Cockerham[83], (FST) was calculated for SNVs of each population of highland and lowland chicken’s differentiation and sex chromosomes were removed. Sliding window analyses were carried out with a window size of 100 kb and a step size of 50 kb. Consequently, average FST values of SNVs in each window were calculated; also, the genomic landscape of the population differentiation between highland and lowland chicken was drawn by qqman pakage and R program (version 3.2.2)[79]. Windows with the high 5% FST values were retrieved by using the Variant Effect Predictor online tool (http://www.ensembl.org). Also, the process of gene enrichment analysis was carried out by using g: profiler (http://biit.cs.ut.ee/gprofiler/index.cgi,). Benjamini–Hochberg FDR (false discovery rate) was used in order to correct P values (Ming-Shan et al., Nosrati et al.)[84,85]. The summary of SSA are depicted in the online Supplementary Fig. S1. Finally, it should be noted that as it was previously described in CLC Genomic Workbench (8.5.1), all parameters of SNV detection existed in the command line tools were determined[71].

Common variants discovery between highland and lowland chickens

The detection of common variants, shared between highland and lowland chickens is the main purpose of common variants discovery. See Fig. 12 for more details about common variants detection. 100% was determined for the frequency threshold. In the other words, all differential variants were removed completely, and common variants were kept for further analysis. The process of filtering and gene ontology enrichment analyses was same as what described for differential variants.
Figure 12

The process of common variants detection in highland and lowland native chicken ecotypes. This figure shows that how differential variants were removed between highland and lowland chickens. All comparisons of native chicken ecotype were carried out based on the sex of birds in each step. The frequency threshold was determined to be 100 percent in order to collect common variants between highland and lowland chickens.

The process of common variants detection in highland and lowland native chicken ecotypes. This figure shows that how differential variants were removed between highland and lowland chickens. All comparisons of native chicken ecotype were carried out based on the sex of birds in each step. The frequency threshold was determined to be 100 percent in order to collect common variants between highland and lowland chickens.

mtDNA variants

Ignoring the mtDNA variants in discovery variant projects is very difficult. The analyses of mtDNA variants were carried out separately in this investigation. After variants detection, all mtDNA variants were obtained in highland and lowland chickens, separately. Filtrations and gene ontology enrichments were carried out same as what described for differential variants.

Linking variants to protein structure

Results of gene ontology enrichment analyses were applied for variants visualization. Some variants impact on a protein’s amino acid composition. Therefore, the protein’s three-dimensional structure was downloaded by CLC Genomic Workbench (8.5.1)[71] from the protein data bank (www.rcsb.org) in order to show the variants location on the protein structure. Variants visualization was carried out for important differential candidate genes of our investigation’s results, or the ones that have their structures in the protein database structure.

Developing a new approach for genomic variants enrichment analysis

The identification of genomic regions that contained the highest density of differential variant between highland and lowland chickens, including both males and females, is the main purpose of genomic variants enrichment analysis. In this article, we proposed a plan to analyze the genomic variants enrichment based on the following steps. First, we estimated each chromosome’s standard frequencies of differential variants using the division of variants number by the chromosome length (Mb). Second, the chromosome that had the most standard frequency was selected, and the cumulative frequencies graph of variants was drawn along the chromosome by the R program (version 3.2.2)[79]. Third, chromosome region, which had the most cumulative frequency, was identified and ENSGALT (Ensembl transcript) IDs were also detected by UCSC and Table browser (www.genome.ucsc.edu/cgi-bin/hgTables) and finally, all detected ENSGALT IDs were converted into the ENSGAL gene ID by BioMart (www.ensembl.org/biomart).

Gene network analysis

According to all candidate genes identified in gene ontology enrichment analyses for differential variants, we carried out the gene network analysis. A total of 27 candidate genes was suggested for differential variants analysis among males and females (Tables 7 and 8). In order to build an informative gene network, chicken Ensembl gene identifications were converted into human orthologs[72]. Based on the following parameters, gene network analysis was carried out by Studio pathway web (Elsevier) for the identification of upstream neighbor networks for diseases: Min overlap = 2, and P value < 0.05[86].

Validations

Variants discovery projects produce numerous variations. Thereby, validating the variants is highly required. Here, validations were carried out for two groups of detected variants. First, novel differential SNVs between highland and lowland chickens, and second, mtDNA variations in highland chickens. Thus, the sequencing of other five whole genomes of highland samples (Isfahan) was carried out, separately. The whole genome sequencing, trimming, and variant detection were performed according to the information provided in the material and method sections. A total of 3845 SNVs was reported as new variants for differential variants analyses and 32 variations were also detected in mtDNA for highland samples (see result part). Each SNV’s regions and chromosomes were evaluated in the new five samples by R program in order to validate the variants[79]. The percentage of validation was calculated based on those samples that had variants. Supplementary information Dataset 1 Dataset 2 Dataset 3 Dataset 4 Dataset 5 Dataset 6 Dataset 7 Dataset 8 Dataset 9 Dataset 10 Dataset 11
  73 in total

Review 1.  The role of heat shock proteins in cancer.

Authors:  Georgios D Lianos; George A Alexiou; Alberto Mangano; Alessandro Mangano; Stefano Rausei; Luigi Boni; Gianlorenzo Dionigi; Dimitrios H Roukos
Journal:  Cancer Lett       Date:  2015-02-23       Impact factor: 8.679

2.  DNA damage after acute exposure of mice skin to physiological doses of UVB and UVA light.

Authors:  Alena Rajnochová Svobodová; Adéla Galandáková; Jarmila Sianská; Dalibor Doležal; Radka Lichnovská; Jitka Ulrichová; Jitka Vostálová
Journal:  Arch Dermatol Res       Date:  2012-01-24       Impact factor: 3.017

3.  Identification of the key regulating genes of diminished ovarian reserve (DOR) by network and gene ontology analysis.

Authors:  Maryam Pashaiasl; Mansour Ebrahimi; Esmaeil Ebrahimie
Journal:  Mol Biol Rep       Date:  2016-06-20       Impact factor: 2.316

4.  Prolactin protects cardiomyocytes against intermittent hypoxia-induced cell damage by the modulation of signaling pathways related to cardiac hypertrophy and proliferation.

Authors:  Dennis Jine-Yuan Hsieh; Chih-Yang Huang; Peiying Pai; Shyi-Gang P Wang; Ying-Lan Tsai; Chia-Ning Li; Wei-Wen Kuo; Chih-Yang Huang
Journal:  Int J Cardiol       Date:  2014-11-27       Impact factor: 4.164

Review 5.  Hypoxia and hypoxia-inducible factors as regulators of T cell development, differentiation, and function.

Authors:  Eóin N McNamee; Darlynn Korns Johnson; Dirk Homann; Eric T Clambey
Journal:  Immunol Res       Date:  2013-03       Impact factor: 2.829

6.  BMX, a novel nonreceptor tyrosine kinase gene of the BTK/ITK/TEC/TXK family located in chromosome Xp22.2.

Authors:  L Tamagnone; I Lahtinen; T Mustonen; K Virtaneva; F Francis; F Muscatelli; R Alitalo; C I Smith; C Larsson; K Alitalo
Journal:  Oncogene       Date:  1994-12       Impact factor: 9.867

Review 7.  Prolactin (PRL) and prolactin receptor (PRLR) genes and their role in poultry production traits.

Authors:  Anna Wilkanowska; Artur Mazurowski; Sławomir Mroczkowski; Dariusz Kokoszyński
Journal:  Folia Biol (Krakow)       Date:  2014       Impact factor: 0.432

8.  Hypoxia impairs primordial germ cell migration in zebrafish (Danio rerio) embryos.

Authors:  Kwok Hong Lo; Michelle Nga Yu Hui; Richard Man Kit Yu; Rudolf Shiu Sun Wu; Shuk Han Cheng
Journal:  PLoS One       Date:  2011-09-08       Impact factor: 3.240

9.  Upregulation of heat shock proteins (HSPA12A, HSP90B1, HSPA4, HSPA5 and HSPA6) in tumour tissues is associated with poor outcomes from HBV-related early-stage hepatocellular carcinoma.

Authors:  Zongguo Yang; Liping Zhuang; Peter Szatmary; Li Wen; Hua Sun; Yunfei Lu; Qingnian Xu; Xiaorong Chen
Journal:  Int J Med Sci       Date:  2015-02-15       Impact factor: 3.738

10.  Integration of machine learning and meta-analysis identifies the transcriptomic bio-signature of mastitis disease in cattle.

Authors:  Somayeh Sharifi; Abbas Pakdel; Mansour Ebrahimi; James M Reecy; Samaneh Fazeli Farsani; Esmaeil Ebrahimie
Journal:  PLoS One       Date:  2018-02-22       Impact factor: 3.240

View more
  8 in total

1.  Using weighted gene co-expression network analysis (WGCNA) to identify the hub genes related to hypoxic adaptation in yak (Bos grunniens).

Authors:  Qi Bao; Xiaolan Zhang; Pengjia Bao; Chunnian Liang; Xian Guo; Min Chu; Ping Yan
Journal:  Genes Genomics       Date:  2021-08-02       Impact factor: 1.839

2.  Landscape genomics of the streamside salamander: Implications for species management in the face of environmental change.

Authors:  Marc A Beer; Rachael A Kane; Steven J Micheletti; Christopher P Kozakiewicz; Andrew Storfer
Journal:  Evol Appl       Date:  2022-01-25       Impact factor: 5.183

3.  Transcriptome resequencing data for rock pigeon (Columba livia).

Authors:  Hamed Kharrati-Koopaee; Ali Esmailizadeh; Fatemeh Sabahi
Journal:  BMC Res Notes       Date:  2022-03-29

4.  Genetic diversity and signatures of selection for heat tolerance and immune response in Iranian native chickens.

Authors:  Hojjat Asadollahpour Nanaei; Hamed Kharrati-Koopaee; Ali Esmailizadeh
Journal:  BMC Genomics       Date:  2022-03-22       Impact factor: 3.969

5.  Comparative analysis of long noncoding RNA and mRNA expression provides insights into adaptation to hypoxia in Tibetan sheep.

Authors:  Fan Wang; Jianbin Liu; Qiaoying Zeng; Deqing Zhuoga
Journal:  Sci Rep       Date:  2022-04-21       Impact factor: 4.996

6.  Key miRNAs and Genes in the High-Altitude Adaptation of Tibetan Chickens.

Authors:  Binlong Chen; Diyan Li; Bo Ran; Pu Zhang; Tao Wang
Journal:  Front Vet Sci       Date:  2022-07-14

7.  Gene network analysis to determine the effect of hypoxia-associated genes on brain damages and tumorigenesis using an avian model.

Authors:  Hamed Kharrati-Koopaee; Esmaeil Ebrahimie; Mohammad Dadpasand; Ali Niazi; Rugang Tian; Ali Esmailizadeh
Journal:  J Genet Eng Biotechnol       Date:  2021-07-08

8.  Identifying Candidate Genes for Hypoxia Adaptation of Tibet Chicken Embryos by Selection Signature Analyses and RNA Sequencing.

Authors:  Xiayi Liu; Xiaochen Wang; Jing Liu; Xiangyu Wang; Haigang Bao
Journal:  Genes (Basel)       Date:  2020-07-20       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.