Reza Khalkhali-Evrigh, Nemat Hedayat-Evrigh, Seyed Hasan Hafezian, Ayoub Farhadi, Mohammad Reza Bakhtiarizadeh.
Abstract
Transposable elements (TEs) and simple sequence repeats (SSRs) are prevalent in eukaryotic genomes, especially in mammals. Repetitive sequences make up approximately one-third of camelid genomes, so studying this part of the genome can provide deeper insight into the genome and its evolutionary path. Here, to improve our understanding of camel genome architecture, the whole genomes of two dromedaries (Yazdi and Trodi camels) were sequenced. In total, 92 and 84.3 Gb of sequence data were obtained and assembled into 137,772 and 149,997 contigs with N50 lengths of 54,626 and 54,031 bp in Yazdi and Trodi camels, respectively. Results showed that 30.58% of the Yazdi camel genome and 30.50% of the Trodi camel genome were covered by TEs. In contrast to the genomes of cattle, sheep, horse, and pig, no endogenous retrovirus-K (ERVK) elements were found in the camel genome. The distribution pattern of DNA transposons was similar in the genomes of dromedary, Bactrian camel, and cattle, whereas the patterns of the LINE, SINE, and long terminal repeat (LTR) families differed. Elements such as RTE-BovB, which belong to the LINE family, are dramatically more abundant in the cattle and sheep genomes than in the dromedary genome. However, LINE1 (L1) and LINE2 (L2) elements cover a higher percentage of the LINE family in the dromedary genome than in the cattle genome. In addition, 540,133 and 539,409 microsatellites were identified from the assembled contigs of Yazdi and Trodi dromedary camels, respectively. Across the two samples combined, di- (393,196) and tri- (65,313) nucleotide repeats accounted for about 42.5% of the microsatellites. The findings of the present study revealed that the non-repetitive content of mammalian genomes is approximately similar. Results showed that 9.1 Mb (0.47% of the whole assembled genome) of the Iranian dromedary genome is made up of SSRs.
Annotation of the repetitive content of the Iranian dromedary camel genome revealed that 9,068 and 11,544 genes contain different types of TEs and SSRs, respectively. The SSR markers identified in the present study can be used as a valuable resource for genetic diversity investigations and marker-assisted selection (MAS) in camel-breeding programs.
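The 42.5% share quoted for di- and trinucleotide repeats can be checked against the microsatellite counts in the abstract. The sketch below uses only numbers stated there; the variable names are illustrative. It suggests that the di- and tri- counts are pooled over both samples, since dividing them by a single sample's total gives a much larger share. This reading is an inference from the arithmetic, not something the abstract states explicitly.

```python
# Microsatellite counts quoted in the abstract (per sample).
ssr_total = {"Yazdi": 540_133, "Trodi": 539_409}

# Di- and trinucleotide repeat counts quoted in the abstract.
di_plus_tri = 393_196 + 65_313

# Share if the di/tri counts are pooled over both samples.
share_combined = 100 * di_plus_tri / sum(ssr_total.values())

# Share if the same counts were attributed to one sample only.
share_single = 100 * di_plus_tri / ssr_total["Yazdi"]

print(f"pooled over both samples: {share_combined:.1f}%")
print(f"relative to Yazdi alone:  {share_single:.1f}%")
```

Only the pooled denominator reproduces the reported value of about 42.5%.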
Keywords: Camelus dromedarius; breeding strategies; de novo assembly; next-generation sequencing; repetitive sequence
Year: 2019 PMID: 31404266 PMCID: PMC6675863 DOI: 10.3389/fgene.2019.00692
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Summary of the Yazdi dromedary (YaD) and Trodi dromedary (TrD) genome assemblies.
| Contigs | YaD | TrD |
|---|---|---|
| N25 (bp) | 97,128 | 95,611 |
| N50 (bp) | 54,626 | 54,031 |
| N75 (bp) | 26,694 | 26,620 |
| Longest contig (bp) | 466,683 | 604,268 |
| Average contig length (bp) | 14,101 | 12,980 |
| Number of contigs | 137,772 | 149,997 |
| Total bases (Gb) | 1.94 | 1.94 |
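For readers unfamiliar with the N25, N50, and N75 statistics in the table, the following minimal sketch shows the usual definition: sort contigs from longest to shortest and report the length of the contig at which the running total first reaches the given fraction of all assembled bases. The function name `nxx` and the toy contig lengths are illustrative assumptions; the source does not say which tool computed its statistics.

```python
def nxx(lengths, fraction=0.5):
    """Return the contig length L such that contigs of length >= L
    together contain at least `fraction` of all assembled bases."""
    if not lengths:
        raise ValueError("lengths must not be empty")
    ordered = sorted(lengths, reverse=True)
    threshold = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        if running >= threshold:
            return length


# Toy assembly (hypothetical contig lengths, not data from the study).
contigs = [100, 80, 60, 40, 20]

n25 = nxx(contigs, 0.25)
n50 = nxx(contigs, 0.50)
n75 = nxx(contigs, 0.75)
average = sum(contigs) / len(contigs)

print(n25, n50, n75, average)
```

By construction N25 >= N50 >= N75, which matches the ordering of the values reported for both assemblies.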
Summary of identified transposable elements for Iranian dromedaries, African dromedary, and Bactrian camel.
| TEs | Iranian dromedary (numbers) | Iranian dromedary (length, bp) | Iranian dromedary (%) | African dromedary (numbers) | African dromedary (%) | Bactrian camel (numbers) | Bactrian camel (%) |
|---|---|---|---|---|---|---|---|
| Alu/B1 | 0 | 0 | 0.00 | 7 | 0.00 | 0 | 0.00 |
| MIRs | 452,075 | 67,605,477 | 3.48 | 463,927 | 3.38 | 460,330 | 3.38 |
| LINE1 | 483,215 | 260,189,224 | 13.38 | 642,633 | 14.57 | 552,674 | 14.82 |
| LINE2 | 301,928 | 81,743,253 | 4.20 | 312,130 | 4.10 | 280,625 | 3.92 |
| L3/CR1 | 39,626 | 8,761,454 | 0.45 | 40,821 | 0.44 | 38,504 | 0.44 |
| RTE | 13,046 | 3,412,114 | 0.18 | – | – | 10,913 | 0.14 |
| ERVL | 82,052 | 35,068,299 | 1.80 | 80,984 | 1.72 | 76,625 | 1.63 |
| ERVL-MaLRs | 140,115 | 47,792,407 | 2.45 | 138,020 | 2.35 | 133,362 | 2.31 |
| ERV-classI | 39,386 | 13,849,285 | 0.71 | 81,938 | 1.07 | 77,025 | 1.63 |
| ERV-classII | 0 | 0 | 0.00 | 571 | 0.00 | 23,095 | 0.10 |
| hAT-Charlie | 188,154 | 36,658,424 | 1.88 | 186,819 | 1.79 | 175,700 | 1.68 |
| TcMar-Tigger | 53,998 | 14,327,829 | 0.74 | 66,902 | 0.81 | 44,496 | 0.68 |
| Reference | Present study | | | | | | |
Figure 1. Frequency of different SSR motifs across YaD, TrD, and seven other mammalian genomes.
Distribution of different classes of SSRs in YaD, TrD, and seven other mammalian genomes.
| Species | Mono | Di | Tri | Tetra | Penta | Hexa | Hepta | Octa | Total (bp) |
|---|---|---|---|---|---|---|---|---|---|
| YaD | 3,670,743 | 3,429,832 | 638,283 | 781,068 | 329,445 | 111,366 | 97,839 | 59,368 | 9,117,944 |
| TrD | 3,669,900 | 3,417,918 | 636,564 | 778,964 | 325,930 | 110,706 | 97,790 | 58,208 | 9,095,980 |
| Arabian dromedary | 4,121,293 | 3,859,778 | 685,755 | 1,390,196 | 401,970 | 129,516 | 97,986 | 89,456 | 10,775,950 |
| Bactrian camel | 4,082,129 | 3,791,158 | 615,105 | 1,147,212 | 366,725 | 125,718 | 99,785 | 92,632 | 10,320,464 |
| Alpaca | 2,065,967 | 4,102,030 | 627,525 | 1,100,084 | 348,420 | 123,546 | 152,278 | 72,032 | 8,591,882 |
| Cattle | 5,045,498 | 4,858,676 | 2,046,501 | 254,296 | 1,537,845 | 54,444 | 94,374 | 26,576 | 13,918,210 |
| Sheep | 4,657,293 | 4,889,182 | 1,770,183 | 392,980 | 1,589,625 | 62,922 | 78,764 | 57,312 | 13,498,261 |
| Horse | 3,344,142 | 3,025,388 | 577,437 | 532,720 | 180,225 | 47,934 | 64,001 | 37,848 | 7,809,695 |
| Human | 12,287,665 | 6,754,988 | 1,537,530 | 3,001,280 | 1,439,420 | 291,282 | 304,525 | 149,072 | 25,765,762 |
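The table above can be cross-checked against the abstract, which reports about 9.1 Mb of SSRs, or 0.47% of the assembled genome. The sketch below sums the per-class values for the two Iranian dromedaries and compares them with the reported totals. It assumes that every class column is a length in bp (only the total column is labeled as such) and uses the rounded assembly size of 1.94 Gb from the assembly summary, so the genome fraction is approximate.

```python
SSR_CLASSES = ["Mono", "Di", "Tri", "Tetra", "Penta", "Hexa", "Hepta", "Octa"]

# Per-class values copied from the table (assumed to be lengths in bp).
ssr_length_bp = {
    "YaD": [3_670_743, 3_429_832, 638_283, 781_068, 329_445, 111_366, 97_839, 59_368],
    "TrD": [3_669_900, 3_417_918, 636_564, 778_964, 325_930, 110_706, 97_790, 58_208],
}

# Totals as reported in the last column of the table.
reported_total_bp = {"YaD": 9_117_944, "TrD": 9_095_980}

# Assumption: rounded assembly size (1.94 Gb) from the assembly summary.
ASSEMBLY_BP = 1.94e9


def class_shares(lengths):
    """Return each class's percentage of the summed SSR length."""
    total = sum(lengths)
    return [100 * value / total for value in lengths]


for sample, lengths in ssr_length_bp.items():
    total = sum(lengths)
    genome_pct = 100 * total / ASSEMBLY_BP
    print(f"{sample}: total {total:,} bp, about {genome_pct:.2f}% of the assembly")
    for name, share in zip(SSR_CLASSES, class_shares(lengths)):
        print(f"  {name:<6}{share:6.2f}%")
```

For both samples the class values add up to the reported total, which supports reading the columns as lengths rather than counts.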
Figure 2. The content of SINE (A), LINE (B), LTR (C), and DNA transposon elements (D) in the genomes of Iranian dromedaries (dark blue), Bactrian camel (blue), and cattle (green).
Figure 3. Distribution of different SSR motifs in the Iranian dromedary genome.
The 10 most frequent SSR motifs in YaD, TrD, and seven other mammalian genomes.
| Rank | YaD | TrD | Arabian dromedary | Bactrian camel | Alpaca | Cattle | Sheep | Horse | Human |
|---|---|---|---|---|---|---|---|---|---|
| 1 | T | T | A | T | A | T | T | T | T |
| 2 | A | A | T | A | T | A | A | A | A |
| 3 | AC | AC | AC | AC | AC | TG | TG | TG | AC |
| 4 | TG | TG | TG | TG | TG | AC | AC | AC | TG |
| 5 | AT | AT | AT | AT | AT | AT | AT | TA | AT |
| 6 | TA | TA | TA | TA | TA | AGC | TA | CA | TA |
| 7 | GT | GT | GT | GT | GT | TA | CA | AT | GT |
| 8 | CA | CA | CA | CA | CA | CA | GT | GT | CA |
| 9 | TC | TC | TC | TC | TC | GT | AGC | TC | TC |
| 10 | AG | AG | AG | AG | AG | TGC | ACTGA | AG | AG |
| Share of all SSRs (%) | 78.81 | 78.83 | 79.06 | 79.02 | 74.96 | 76.15 | 74.27 | 78.54 | 77.29 |
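The ranking above lists motifs as they appear on one strand, so AC, CA, TG, and GT are counted separately even though they describe the same repeat read in a different frame or on the opposite strand. A common convention, not used in the table itself, is to collapse each motif to a canonical form: the lexicographically smallest string among all rotations of the motif and of its reverse complement. The following sketch applies that convention to the YaD column; the function names are illustrative.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(motif):
    """Return the reverse complement of a DNA motif."""
    return motif.translate(_COMPLEMENT)[::-1]


def rotations(motif):
    """Return all cyclic rotations of a motif."""
    return [motif[i:] + motif[:i] for i in range(len(motif))]


def canonical_motif(motif):
    """Collapse a motif to the smallest rotation of itself or its
    reverse complement, so equivalent repeats share one label."""
    motif = motif.upper()
    return min(rotations(motif) + rotations(reverse_complement(motif)))


# Top-10 motifs for YaD, copied from the table.
yad_top10 = ["T", "A", "AC", "TG", "AT", "TA", "GT", "CA", "TC", "AG"]

collapsed = sorted({canonical_motif(m) for m in yad_top10})
print(collapsed)
```

Under this convention the ten strand-specific entries for YaD reduce to four motif classes: A, AC, AG, and AT.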