Literature DB >> 32485848

Genome-Wide Diversity, Population Structure and Demographic History of Dromedaries in the Central Desert of Iran.

Morteza Bitaraf Sani1, Javad Zare Harofte1, Ahmad Bitaraf1, Saeid Esmaeilkhanian2, Mohammad Hossein Banabazi3, Nader Salim4, Abbas Teimoori4, Ali Shafei Naderi1, Mohammad Ali Faghihi5, Pamela Anna Burger6, Mohammad Silawi7, Afsaneh Taghipour Sheshdeh7.   

Abstract

The development of camel husbandry for good production in a desert climate is very important, thus we need to understand the genetic basis of camels and give attention to genomic analysis. We assessed genome-wide diversity, linkage disequilibrium (LD), effective population size (Ne) and relatedness in 96 dromedaries originating from five different regions of the central desert of Iran using genotyping-by-sequencing (GBS). A total of 14,522 Single Nucleotide Polymorphisms (SNPs) with an average minor allele frequency (MAF) of 0.19 passed quality control and filtering steps. The average observed heterozygosity in the population was estimated at 0.25 ± 0.03. The mean of LD at distances shorter than 40 kb was low (r2 = 0.089 ± 0.234). The camels sampled from the central desert of Iran exhibited higher relatedness than Sudanese and lower than Arabian Peninsula dromedaries. Recent Ne of Iran's camels was estimated to be 89. Predicted Tajima's D (1.28) suggested a bottleneck or balancing selection in dromedary camels in the central desert of Iran. A general decrease in effective and census population size poses a threat for Iran's dromedaries. This report is the first SNP calling report on nearly the chromosome level and a first step towards understanding genomic diversity, population structure and demography in Iranian dromedaries.

Entities:  

Keywords:  effective population Size; genotyping-by-sequencing; linkage disequilibrium

Mesh:

Year:  2020        PMID: 32485848      PMCID: PMC7349250          DOI: 10.3390/genes11060599

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


1. Introduction

Camels have a unique morphology and physiology as they are capable of providing vital products in a desert climate, even under harsh conditions. Camelini (one- and two-humped camels) and Lamini (New World camels) are two tribes of the Camelidae family [1]. Camelids influence human endurance and thriving in the peripheral agro-natural zones of (semi-) deserts [2]. Today, camels have gained significance as sustainable livestock species with certain important properties (e.g., immunogenic and milk composition) [3]. Out of around 35 million camel heads of the world (FAO, 2019), the more significant part (95%) are dromedary [4]. There are 138,659 camels in Iran (FAO, 2019), and most of them are dromedaries producing 0.5% of total red meat in Iran [5]. Overuse of land and water resources has expanded the danger of desertification [6]. Over 84% of Iran is arid or semi-arid [7]. Thus, camel husbandry in Iran has a high potential considering changing climatic conditions as well as religious and cultural values [6]. For sustainable development of camel breeding to facilitate protein production in desert climate, we need to give special attention to the conservation and genetic improvement of the species. The main limitations that camel breeding is facing in Iran are the lack of phenotypic records and pedigrees, small herd size and missing connectedness, and genetic evaluations. Previous investigations about genetic diversity and population structure in dromedaries were confined to microsatellite markers, mitochondrial DNA (mtDNA) [8], geographical regions, e.g., Kenya [9], and Pakistan [10], and candidate gene sequencing [11]. Next-generation sequencing platforms have prepared suitable methods for population genetic analyses in local or regional populations/ breeds to investigate effective population size, breed formation, demographic history, selection and genetic drift (interpreted by linkage disequilibrium (LD)) at the whole-genome level [12]. Whole-genome sequencing revealed past demographic events and the evolutionary history of Old World camels [13]. Recently, dromedaries’ genome sequences have been assembled into chromosomes [14,15] and the whole genomes of two Iranian dromedary camels were analyzed. This genomic information could prompt genome-wide association studies (GWAS) and genomic selection in Iranian camel breeding [16], similar to populations from the Arabian Peninsula and Sudan, which showed signatures of selection in regions associated with adaptation to arid environment, dairy traits, energy homeostasis, and chondrogenesis [11]. To understand the genetic makeup and relatedness of Iranian dromedaries and to create the basis for future breeding programs and genomic selection, we assessed the genomic diversity, linkage disequilibrium and population structure of dromedary camels in different regions of the central desert of Iran.

2. Materials and Methods

2.1. Animal Resources

A total of 96 EDTA blood samples of dromedary camels from five regions in Yazd province, Iran, were collected during routine veterinary treatment: (Bafgh (n = 41), Bahabad (n = 8), Khatam (n = 17), Mehriz (n = 8) and Saghand (n = 22)) (Figure 1).
Figure 1

Yazd province, the central desert of Iran. *: samples collected places.

2.2. Genotyping-by-Sequencing (GBS) Data

The blood samples were gathered with EDTA tubes, and genomic DNA was extracted using the modified salt-out method. In this method, optimization includes utilization of separate buffers instead of buffy coat isolation, chloroform for DNA phase isolation and extraction of purified DNA, and sodium acetate for more concentrated DNA [17]. The extracted DNA was evaluated with a Nanodrop Spectrophotometer and 96 samples were genotyped-by-sequencing using two restriction enzymes, EcoR1 and HinF1, and paired-end (150 bp) sequencing on the Illumina HiSeq 2000 platform by Persian Bayangene Research and Training Center (Shiraz, Iran). The adapters were trimmed with bcl2fastq, and the read pairs with low quality were omitted (base Qphred ≤20) by using fastQC.

2.3. SNP Calling

The sequence reads were mapped to the dromedary reference genome assembly on chromosome level (GCA_000803125.3; [14]) by using the BWA-MEM algorithm of the Burrows—Wheeler Aligner (BWA; [18]). The PCR duplicates were detected by utilizing Picard tools and disregarded in downstream analyses both by GATK [19] and SAMtools [20]. SNPs were called across the GBS data using GATK.

2.4. Population Structure, Linkage Disequilibrium and Genome-Wide Diversity

Quality control (QC) steps, linkage disequilibrium (LD), genome-wide diversity (observed and expected heterozygosity), as well as admixture analyses were performed using TASSEL V5.0 [21]. Variants with MAF < 0.05 and Call Rate < 0.95 were removed. Of the 41,897 SNPs, 256 markers were monomorphic, and 27,375 markers were deleted because of low minor allele frequency (MAF < 0.05). Nei’s distances and pairwise FST among populations were calculated by using the StAMPP package in R [22]. To investigate population structure, R packages including vcfR, poppr, ape, and RColorBrewer were used for K-means clustering and discriminant analysis of principal components (DAPC). The median joining network was created in SplitsTree by using the genetic distance matrix. Relatedness among genome-wide SNPs based on the unadjusted Ajk statistic was evaluated by using TASSEL. The LD between two SNPs was evaluated using the statistics r2 (Equation (1)) [23]: where P, P, P, and P are the haplotype frequencies and P, P, P, and P are the allele frequencies, respectively. The LD decay plot was created in popLDecay [24].

2.5. Effective Population Size and Tajima’s D

Effective population size (Ne) was predicted using the pairwise LD [3]. The SNeP tool was used for predicting effective population size [25] based on the relatedness among r2, Ne, and c (Equation (1) [26]). The corrected r2 was estimated following Equation (3) [27]: where c is the recombination rate, n is the number of individuals sampled; β = 2 when the gametic phase is known and β = 1 if instead the phase is not known. Ne during t generations ago was predicted (Equation (4)): where N is the effective size t generations ago, computed as t = (2ƒ(c)) − 1 [28]; c is the recombination rate defined for a specific physical distance between markers and optimally adjusted with the mapping functions mentioned above; is the LD value adjusted for sample size; and α = {1, 2, 2.2} is a correction for the mutation rates [29]. Tajimas’s D was estimated to investigate demography or signals of selection (Equation (5)) [30]. where k is the difference between sequence i and j.

3. Results

3.1. Genomic Diversity and Linkage Disequilibrium

A total of 14,522 SNPs resulted after filtering and were used for final analysis. The largest gap between SNPs (21,088.280 kb) was located on chromosome 11. We note that this is the first SNPs-based report at chromosome-level (Table 1). Before quality control (QC), the average MAF of all SNPs was 0.08, and after filtering, it increased to 0.19 (Figure 2). Average observed heterozygosity in the Iranian central desert dromedaries was estimated at 0.25 ± 0.03.
Table 1

Genome-wide summary of SNPs.

ChrNumber of SNPsMean MAFChrNumber of SNPsMean MAF
11500.231916550.18
24120.2220510.27
3780.25213050.21
41130.2122200.33
51250.2023160.21
64130.2224840.21
73720.222511130.19
82810.2226510.24
918290.1927450.19
103530.2728300.17
1114420.2029840.25
122310.20301010.21
131670.24311950.20
1410620.1932220.25
151500.25331440.22
163780.1934550.16
173790.18353760.20
1813010.18361940.19
X7450.18
Figure 2

Minor allele frequency of SNPs.

Linkage disequilibrium (LD) was estimated for each pairwise combination of SNPs (a total of 242,265 SNP pairs). Average LD (r2 = 0.089 ± 0.234) was observed at distances shorter than 40 kb, which decreased to 0.019 ± 0.034 in 40–60 kb. The LD decay trend is shown in Figure 3. Ninety-eight percent of total pairwise combinations with r2 > 0.20 related to 0–40 kb intervals (Table 2).
Figure 3

The linkage disequilibrium (LD) decay plot.

Table 2

Pairwise linkage disequilibrium (r2) for SNPs at various distance.

DistanceNMeanSD% r2 > 0.2% r2 > 0.3
0–40 kb7,020,5690.08990.23498.0294.50
40–60 kb613,0450.01990.0340.150.52
60–100 kb735,7730.02080.0390.360.77
100–250 kb1,082,0020.01770.0300.130.66
250–500 kb789,0720.01980.0350.280.76
0.5–1 Mb650,7530.01750.0300.140.30
1–2 Mb436,2000.01810.0330.100.28
2–5 Mb808,1810.01940.03510.330.59
5–10 Mb719,1730.01880.0330.210.49
10–20 Mb809,0000.0170.0290.080.45
20–50 Mb841,8820.01870.0310.100.58
50–92 Mb204,5800.01640.0320.110.10

3.2. Cluster Analysis and Genomic Relationship

Cluster analysis of the 96 dromedary camels for five regions in the central desert of Iran has been shown in Figure 4. Because of low genetic distance, there were no determined clusters and high admixture among regions (Figure 5). Only 1.6% and 1.4% of the genetic variance was illustrated with PC1 and PC2, respectively, which demonstrated that individuals are relatively homogeneous. Nonetheless, five samples that originate from outside of Iran (Pakistan: GBS046, GBS047, GBS042, GBS057, and GBS058) diverged from the rest (Figure 4 and Figure 5). The pairwise FST statistic among five populations of Iran’s central desert camels was very low (0.002–0.011) (Table 3).
Figure 4

Camels clustered on the basis of principal components analysis using individual genotypes. The highlight box shows five samples (GBS046, GBS047, GBS042, GBS057, and GBS058) which belong to outside of Iran.

Figure 5

ADMIXTURE plots of 5 dromedary camel populations in the Iranian central desert.

Table 3

Pairwise FST among populations of Iran’s central desert camels.

BafghBahabadKhatamSaghandMehriz
Bafgh 0----
Bahabad 0.0030---
Khatam 0.0050.0040--
Saghand 0.0040.0020.0040-
Mehriz 0.0110.0100.0110.0080
Because of a limited number of bull camels and half-sib calves within herds, there were node clusters visible, although the median of relatedness among camels was very low (Ajk = −0.011 ± 0.113). There is no obvious population structure visible in the Iranian central desert dromedaries, although, in the zoom, it looks as if there was a very low separation into two groups, which, however, do not correspond to any geographical origins nor to any functional separation of the investigated dromedaries in terms of usage for milk or meat production (Figure 6).
Figure 6

SplitsTree network. Phylogenetic network calculated with SplitsTree using Neighbour-net, with a zoom to visualize the split among five regions of Iran’s central desert.

3.3. Demographic History (Effective Population Size)

Recent Ne (up to 10 generations ago) was estimated as 89 to 487, respectively (Table 4). The trend of Ne shows a great decrease of up to 90% until the last glacial period (LGP; 100,000–20,000 years ago and another bottleneck around 2000–6000 years ago (Figure 7)).
Table 4

Effective population size in dromedary camel across 113 last generations.

Generations AgoNeDist. r 2 r2SD
18918,735,0200.0107740.015864
210816,440,7420.0106430.015069
213614,311,3350.0100360.013526
315511,950,5670.0110160.016526
418610,407,0330.0108330.015233
42209,096,1090.0107190.014915
62827,360,5040.0106030.015451
73246,387,9620.0108130.014915
83965,401,4100.0106240.014839
104874,410,0260.0107590.015362
136183,561,1150.0106370.015982
167992,844,2040.0104280.01519
2110012,241,5470.0106520.014796
2512371,901,6510.0102170.014037
3516941,375,7580.0103990.014926
4421161,097,5260.0104770.014596
602883811,6690.0104420.015188
803680617,2130.0107870.015267
11353444387850.010480.01491
Figure 7

The historical demography of the dromedary camel was examined using SNeP.

4. Discussion

Guo et al., (2020) identified 2,433,732 variants in 366 camels, including the seven domestic Camelus bactrianus breeds in East Asia by using the GBS technique [31]. Bahbahani et al., (2019) reported 39,843 SNPs after QC in Sudanese dromedary camels [11]. Al Abri et al., (2017) reported ~80,000 SNPs by using WGS of 9 camels and GBS of 244 dromedary camels [32]. The different numbers of SNPS across different studies can be due to different techniques or the heterozygosity levels in the other camel groups. Heterozygosity of the Iranian central desert dromedaries was predicted as low (0.25 ± 0.30). Arabian Peninsula and Sudanese dromedaries’ where 0.560 ± 0.003 and 0.347 ± 0.003, respectively [11]. The camels sampled from the central desert of Iran have higher relationships than dromedaries from Sudan (median Ajk = − 0.007 ± 0.018) and lower ones than individuals from the Arabian Peninsula (median Ajk = 0.347 ± 0.093) [11]. Predicted Tajima’s D (1.28) suggested a bottleneck or balancing selection in dromedary camels in the central desert of Iran. The sudden decrease of camels in Iran could be the cause of a positive Tajima’s D. The number of camels in Iran declined 46% during 1960–1970, however the heterozygosity has been still maintained, most probably by balancing selection resulting in positive Tajima’s D. Balancing selection occurs when the heterozygotes have a higher fitness than the homozygote [33] and conserve genetic polymorphism. Until recently, LD patterns of many livestock species have been reported. For example, the amounts of LD at 0–30 kb distances in Angus, Charolais, and C beef cattle were 0.29, 0.22, and 0.21, respectively [34]. The mean of r2 (LD) was 0.155, 0.156, and 0.128 across all chromosomes for three sheep breeds [35]. Corbin et al. (2010) [36] reported r2 = 0.6 in 5 kb and up to 20 kb in Thoroughbred horses. The average r2 > 0.2 at 1.0–1.5 Mb intervals was predicted in two pig populations [37]. The mean of LD in Iranian central desert dromedary camels was predicted as very low compared to livestock animals because of high generation interval, lack of artificial selection, and natural reproduction. Principal component analysis suggested that Sudanese camel populations are almost homogeneous [11]. On the contrary, PCA results distinguished the Chinese Bactrian camels from the Mongolian Bactrian camels very well [31]. The results of PCA, Admixture, and SplitTree tests show that dromedary camels in the central desert of Iran are homogeneous and have a separate genetic background from outside of Iran. The trend of Ne shows a bottleneck around 2000–6000 years ago in Iranian central desert dromedary camels. This is consistent with a decline in Ne in Old World camels around 4000–5000 years ago and might be related to the domestication scenarios of dromedary and Bactrian camels [2].

5. Conclusions

Commercial SNP arrays have been developed for many livestock animals but no SNPchip is available for dromedaries. GBS based on whole-genome sequencing has solved the problem of ascertainment bias [38] as the efficiency of GBS markers is comparable to SNP arrays [39]. GBS could prove highly rewarding for predictions of the relatedness and genetic diversity of dromedaries. This research, the first report of GBS-SNPs assembled by near-chromosomes, suggested dromedary camels in the central desert of Iran are homogeneous and different from outside camels. The decrease of effective population size is a threat for Iran’s dromedary camels.
  28 in total

1.  Linkage disequilibrium in finite populations.

Authors:  W G Hill; A Robertson
Journal:  Theor Appl Genet       Date:  1968-06       Impact factor: 5.699

2.  Linkage disequilibrium between two segregating nucleotide sites under the steady flux of mutations in a finite population.

Authors:  T Ohta; M Kimura
Journal:  Genetics       Date:  1971-08       Impact factor: 4.562

3.  Linkage disequilibrium and homozygosity of chromosome segments in finite populations.

Authors:  J A Sved
Journal:  Theor Popul Biol       Date:  1971-06       Impact factor: 1.570

4.  Effect of mating structure on variation in linkage disequilibrium.

Authors:  B S Weir; W G Hill
Journal:  Genetics       Date:  1980-06       Impact factor: 4.562

5.  Genetic diversity and relationships of indigenous Kenyan camel (Camelus dromedarius) populations: implications for their classification.

Authors:  D N Mburu; J W Ochieng; S G Kuria; H Jianlin; B Kaufmann; J E O Rege; O Hanotte
Journal:  Anim Genet       Date:  2003-02       Impact factor: 3.169

6.  Potential of genotyping-by-sequencing for genomic selection in livestock populations.

Authors:  Gregor Gorjanc; Matthew A Cleveland; Ross D Houston; John M Hickey
Journal:  Genet Sel Evol       Date:  2015-03-01       Impact factor: 4.297

7.  A Near Chromosome Assembly of the Dromedary Camel Genome.

Authors:  Daniil Ruvinskiy; Denis M Larkin; Marta Farré
Journal:  Front Genet       Date:  2019-02-05       Impact factor: 4.599

8.  Genome Diversity and Signatures of Selection for Production and Performance Traits in Dromedary Camels.

Authors:  Hussain Bahbahani; Hassan H Musa; David Wragg; Eltahir S Shuiep; Faisal Almathen; Olivier Hanotte
Journal:  Front Genet       Date:  2019-09-19       Impact factor: 4.599

9.  Improving Illumina assemblies with Hi-C and long reads: An example with the North African dromedary.

Authors:  Jean P Elbers; Mark F Rogers; Polina L Perelman; Anastasia A Proskuryakova; Natalia A Serdyukova; Warren E Johnson; Petr Horin; Jukka Corander; David Murphy; Pamela A Burger
Journal:  Mol Ecol Resour       Date:  2019-05-17       Impact factor: 7.090

Review 10.  Old World camels in a modern world - a balancing act between conservation and genetic improvement.

Authors:  P A Burger; E Ciani; B Faye
Journal:  Anim Genet       Date:  2019-09-18       Impact factor: 3.169

View more
  2 in total

1.  Identification of Candidate Genes for Pigmentation in Camels Using Genotyping-by-Sequencing.

Authors:  Morteza Bitaraf Sani; Javad Zare Harofte; Mohammad Hossein Banabazi; Asim Faraz; Saeid Esmaeilkhanian; Ali Shafei Naderi; Nader Salim; Abbas Teimoori; Ahmad Bitaraf; Mohammad Zadehrahmani; Pamela Anna Burger; Nader Asadzadeh; Mohammad Silawi; Afsaneh Taghipour Sheshdeh; Behrouz Mohammad Nazari; Mohammad Ali Faghihi; Zahra Roudbari
Journal:  Animals (Basel)       Date:  2022-04-23       Impact factor: 3.231

2.  Prevalence of camel babesiosis in southeast of Iran.

Authors:  Hadi Mirahmadi; Ali Ghaderi; Shaghik Barani; Ebrahim Alijani; Ahmad Mehravaran; Reza Shafiei
Journal:  Vet Med Sci       Date:  2021-11-20
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.