| Literature DB >> 30235280 |
Reza Khalkhali-Evrigh1, Seyed Hasan Hafezian1, Nemat Hedayat-Evrigh2, Ayoub Farhadi1, Mohammad Reza Bakhtiarizadeh3.
Abstract
Whole genome wide identification and annotation of genetic variations in camels is in its first steps. The aim of this study was the identification of genome wide variants, functional annotations of them and enrichment analysis of affected genes using whole genome sequencing data of three dromedary camels. The genomes of two Iranian female dromedary camels that mostly used to produce meat and milk were sequenced to 41.9-fold and 38.6-fold coverage. A total of 4,727,238 single-nucleotide polymorphisms (SNPs) and 692,908 indels (insertions and deletions) were found by mapping raw reads to the dromedary reference assembly (GenBank Accession: GCA_000767585.1). In-silico functional annotation of the discovered variants in under study samples revealed that most SNPs (2,305,738; 48.78%) and indels (339,756; 49.03%) were located in intergenic regions. A comparison of the identified SNPs with those of the African camel (BioProject Accession: PRJNA269274) indicated that they had 993,474 SNPs in common. We found 15,168 non-synonymous SNPs in the shared variants of the three camels that could affect gene function and protein structure. Obtained results revealed that there were 7085, 6271 and 4688 non-synonymous SNPs among the 3436, 3058 and 2882 genes in the specific gene sets of Yazd dromedary, Trod dromedary and African dromedary, respectively. The list of genes predicted to be affected by non-synonymous variants in different individuals was subjected to gene ontology (GO) enrichment analysis.Entities:
Mesh:
Year: 2018 PMID: 30235280 PMCID: PMC6147446 DOI: 10.1371/journal.pone.0204028
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of sequenced reads for Iranian dromedaries and downloaded sample (AfD).
| Total Reads | Mapped | ||||
|---|---|---|---|---|---|
| Genome | Before trim | After trim | Reads | Bases | Fold coverage |
| 920,366,954 | 899,714,102 | 879,503,046 | 83,966,061,113 | 41.9 | |
| 843,455,144 | 826,229,484 | 804,795,688 | 77,257,841,038 | 38.6 | |
| - | 1,124,832,578 | 1,112,171,199 | 107,907,751,223 | 53.8 | |
Fig 1Overlapping and sample specific identified SNPs in Iranian dromedaries (YaD and TrD) and downloaded sample (AfD).
Summary of identified variants for Iranian dromedaries and downloaded sample (AfD).
| YaD | TrD | AfD | |
|---|---|---|---|
| 2,404,401 | 2,322,837 | 2,106,145 | |
| 2.34 | 2.34 | 2.33 | |
| 1,659,085 | 1,588,757 | 1,375,721 | |
| 2.23 | 2.16 | 1.88 | |
| 186,185 | 181,306 | 179,801 | |
| 165,244 | 160,173 | 154,473 | |
| 0.15 | 0.15 | 0.16 |
Functional annotation of discovered SNPs.
| Impact | SNPs | YaD | TrD | AfD | Shared SNPs |
|---|---|---|---|---|---|
| STOP_GAINED | 1343 | 1271 | 1217 | 486 | |
| STOP_LOST | 979 | 922 | 838 | 472 | |
| START_LOST | 124 | 111 | 98 | 51 | |
| SPILICE_SITE_ACCEPTOR | 127 | 116 | 103 | 54 | |
| SPILICE_SITE_DONOR | 200 | 188 | 158 | 93 | |
| NON_SYNONYMOUS_CODING | 36748 | 35208 | 31506 | 15168 | |
| SYNONYMOUS_STOP | 181 | 151 | 154 | 77 | |
| SYNONYMOUS_START | 1 | - | - | - | |
| NON_SYNONYMOUS_START | 10 | 11 | 9 | 5 | |
| SYNONYMOUS_CODING | 17310 | 16354 | 14780 | 7001 | |
| INTERGENIC | 1164972 | 1140766 | 1036643 | 486742 | |
| INTRAGENIC | 8 | 8 | 4 | 1 | |
| INTRON | 943845 | 899719 | 815981 | 386512 | |
| UPSTREAM | 132089 | 125375 | 112771 | 53393 | |
| DOWNSTREAM | 106464 | 102637 | 91883 | 43419 | |
| TOTAL | 2404401 | 2322837 | 2106145 | 993474 |
Functional annotation of discovered indels.
| indel | YaD | TrD | AfD |
|---|---|---|---|
| 171706 | 168050 | 165086 | |
| 140774 | 135017 | 132709 | |
| 19030 | 18577 | 17898 | |
| 15445 | 15280 | 14636 | |
| 4371 | 4446 | 3836 | |
| 55 | 53 | 58 | |
| 45 | 53 | 50 | |
| 2 | 2 | - | |
| 1 | 1 | 1 |
Fig 2Classification of identified indels by their impact on genome for YaD (A), TrD (B) and AfD (C).