| Literature DB >> 32724789 |
Nemat Hedayat-Evrigh1, Reza Khalkhali-Evrigh1, Mohammad Reza Bakhtiarizadeh2.
Abstract
The population size of Bactrian camels is smaller than dromedary, and they are distributed in cold and mountain regions and are also at the risk of extinction in some countries such as Iran. To identify and investigate the genome-wide variations, whole-genome sequencing of two Iranian Bactrian camels were performed with 37.4- and 42.6-fold coverage for the first time. Along with Iranian Bactrian camels, sequencing data from two Mongolian domestic and two wild Bactrian camels deposited in the NCBI were reanalyzed. The analysis eventuated to the identification of 4,908,998, 4,485,725, and 4,706,654 SNPs for Iranian, Mongolian domestic, and wild Bactrian camels, respectively. Also, INDEL variations ranged from 358,311 to 533,188 in all six camels. Results of variants annotation in all samples revealed that more than 88 percent of SNPs and INDELs were located in the intergenic and intronic regions. We found that 800,530 SNPs were common among all studied camels, containing 4,046 missense variants that affected 2,428 genes. Investigation of common genes among all camels containing the missense SNPs showed that there are 98 zinc finger and 4 fertility-related genes (ZP1, ZP2, ZP4, and ZPBP) in this set.Entities:
Year: 2020 PMID: 32724789 PMCID: PMC7381958 DOI: 10.1155/2020/2430846
Source DB: PubMed Journal: Int J Genomics ISSN: 2314-436X Impact factor: 2.326
Statistics of sequenced reads for Iranian, non-Iranian domestic, and wild Bactrian camels.
| Genome | Total reads | Mapped | |||
|---|---|---|---|---|---|
| Before trim | After trim | Reads | Bases | Fold coverage | |
| Bac1 | 813,158,076 | 797,679,174 | 775,967,813 | 74,469,370,345 | 37.4 |
| Bac2 | 933,365,664 | 911,654,292 | 891,056,256 | 84,834,523,451 | 42.6 |
| DC399 | 638,749,252 | 638,748,478 | 631,030,195 | 59,915,063,832 | 30.1 |
| DC408 | 703,381,722 | 703,381,078 | 695,555,934 | 65,880,723,485 | 33.1 |
| WC247 | 670,303,728 | 670,299,044 | 661,623,617 | 62,264,294,434 | 31.2 |
| WC305 | 669,110,616 | 669,109,670 | 660,574,188 | 62,389,566,081 | 31.3 |
Statistics of sequenced reads for Iranian, non-Iranian domestic, and wild Bactrian camels.
| Bac1 | Bac2 | DC399 | DC408 | WC247 | WC305 | |
|---|---|---|---|---|---|---|
| Number of SNPs | 3,917,247 | 3,677,474 | 3,080,659 | 3,222,861 | 3,656,164 | 3,637,144 |
| TS/TV | 2.16 | 2.14 | 2.21 | 2.19 | 2.26 | 2.26 |
| Heterozygote SNPs | 3,095,932 | 2,373,043 | 1,960,836 | 2,066,909 | 1,906,913 | 1,677,539 |
| SNP Het/Hom | 3.77 | 1.82 | 1.75 | 1.79 | 1.09 | 0.86 |
| Insertion | 281,033 | 262,162 | 189,259 | 207,998 | 224,440 | 232,150 |
| Deletion | 252,155 | 219,143 | 169,052 | 186,214 | 210,753 | 217,287 |
| INDEL/SNP | 0.14 | 0.13 | 0.12 | 0.12 | 0.12 | 0.12 |
Figure 1Length distribution of INDELs in genic (a) and nongenic (b) regions of all studied Bactrian camels.
Figure 2Venn diagram showing the overlap of SNPs (a) and INDELs (b) for common variants between Iranian (B1B2), non-Iranian domestic (DC), and wild Bactrian camels (WC).
Functional classification of the identified SNPs for Iranian and non-Iranian domestic and wild Bactrian camels.
| SNPs | Bac1 | Bac2 | DC399 | DC408 | WC247 | WC408 |
|---|---|---|---|---|---|---|
| Intergenic | 2,157,352 | 2,026,747 | 1,686,366 | 1,778,200 | 2,000,316 | 1,992,040 |
| Intron | 1,300,679 | 1,222,106 | 1,030,347 | 1,081,521 | 1,232,401 | 1,223,725 |
| Upstream gene | 214,053 | 198,502 | 169,131 | 170,057 | 196,542 | 197,060 |
| Downstream gene | 172,846 | 161,897 | 141,099 | 141,095 | 163,895 | 162,438 |
| Synonymous | 20,755 | 19,210 | 15,733 | 15,013 | 18,724 | 18,359 |
| Missense | 20,003 | 19,224 | 14,741 | 14,208 | 16,861 | 16,529 |
| 3′ UTR | 16,569 | 15,358 | 12,636 | 12,419 | 15,186 | 15,073 |
| nc transcript exon | 5,174 | 5,114 | 4,369 | 4,224 | 4,899 | 4,715 |
| 5′ UTR | 4,219 | 3,928 | 2,240 | 2,242 | 2,601 | 2,522 |
| nc transcript | 669 | 614 | 465 | 430 | 513 | 509 |
| Stop gained | 203 | 206 | 142 | 144 | 182 | 185 |
| Stop lost | 148 | 139 | 97 | 100 | 108 | 106 |
| Splice region | 144 | 126 | 85 | 100 | 112 | 110 |
| Start lost | 38 | 36 | 32 | 30 | 33 | 37 |
| Stop retained | 13 | 15 | 13 | 8 | 20 | 14 |
| Initiator codon | 5 | 4 | 2 | 2 | 2 | 3 |
| 5′ UTR premature start codon gain | 5 | 7 | 11 | 4 | 9 | 1 |
| Multi_annotated SNPs | 4,372 | 4,241 | 3,150 | 3,064 | 3,760 | 3,718 |
Figure 3Venn diagram created by InteractiVenn [19], showing the overlap of genes containing missense variants in all studied Bactrian camel.
Functional classification of the identified INDELs for Iranian and non-Iranian domestic and wild Bactrian camels.
| INDELs | Bac1 | Bac2 | DC399 | DC408 | WC247 | WC305 |
|---|---|---|---|---|---|---|
| Intergenic | 287,116 | 261,173 | 191,464 | 212,718 | 232,991 | 240,885 |
| Intron | 186,048 | 166,676 | 126,188 | 138,707 | 153,340 | 158,081 |
| Upstream gene | 30,102 | 26,765 | 20,054 | 21,357 | 24,305 | 25,249 |
| Downstream gene | 25,109 | 22,464 | 17,422 | 18,277 | 20,845 | 21,473 |
| 3′ UTR | 2,253 | 1,948 | 1,508 | 1,527 | 1,893 | 1,942 |
| 5′ UTR | 393 | 338 | 195 | 197 | 212 | 209 |
| Frameshift | 371 | 320 | 253 | 248 | 263 | 279 |
| nc transcript exon | 364 | 330 | 247 | 250 | 279 | 280 |
| Conservative inframe insertion | 191 | 172 | 111 | 101 | 129 | 124 |
| Conservative inframe deletion | 153 | 102 | 98 | 88 | 96 | 86 |
| Disruptive inframe insertion | 133 | 121 | 86 | 75 | 87 | 84 |
| Disruptive inframe deletion | 81 | 62 | 56 | 45 | 55 | 43 |
| Splice region | 53 | 50 | 33 | 30 | 30 | 30 |
| Bidirectional gene fusion | 1 | 0 | 0 | 0 | 0 | 0 |
| Multi_annotated INDELs | 820 | 784 | 596 | 592 | 668 | 672 |