| Literature DB >> 30833292 |
Tiina Sävilammi1, Craig R Primmer2,3, Srinidhi Varadharajan4, René Guyomard5, Yann Guiguen6, Simen R Sandve7, L Asbjørn Vøllestad4, Spiros Papakostas8, Sigbjørn Lien9.
Abstract
Salmonids represent an intriguing taxonomical group for investigating genome evolution in vertebrates due to their relatively recent last common whole genome duplication event, which occurred between 80 and 100 million years ago. Here, we report on the chromosome-level genome assembly of European grayling (Thymallus thymallus), which represents one of the earliest diverged salmonid subfamilies. To achieve this, we first generated relatively long genomic scaffolds by using a previously published draft genome assembly along with long-read sequencing data and a linkage map. We then merged those scaffolds by applying synteny evidence from the Atlantic salmon (Salmo salar) genome. Comparisons of the European grayling genome assembly to the genomes of Atlantic salmon and Northern pike (Esox lucius), the latter used as a nonduplicated outgroup, detailed aspects of the characteristic chromosome evolution process that has taken place in European grayling. While Atlantic salmon and other salmonid genomes are portrayed by the typical occurrence of numerous chromosomal fusions, European grayling chromosomes were confirmed to be fusion-free and were characterized by a relatively large proportion of paracentric and pericentric inversions. We further reported on transposable elements specific to either the European grayling or Atlantic salmon genome, on the male-specific sdY gene in the European grayling chromosome 11A, and on regions under residual tetrasomy in the homeologous European grayling chromosome pairs 9A-9B and 25A-25B. The same chromosome pairs have been observed under residual tetrasomy in Atlantic salmon and in other salmonids, suggesting that this feature has been conserved since the subfamily split.Entities:
Keywords: chromosomal structure; chromosome evolution; genomic rearrangements; karyotype evolution; retrotransposons
Mesh:
Year: 2019 PMID: 30833292 PMCID: PMC6505133 DOI: 10.1534/g3.118.200919
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Assembly properties of different stages of the European grayling genome assembly process. Stage 1 represents the assembly built using only short-read DNA sequencing data (from Varadharajan ). Stage 2 involves the outcome of the hybrid assembly process, which combined short- and long-read DNA sequencing data. Finally, stage 3 corresponds to the complete genome assembly that was produced using the linkage mapping data and synteny information with the Atlantic salmon genome. Numbers in brackets represent the percent increase/decrease over the previous stage for given statistics
| Statistic | Stage 1 | Stage 2 | Stage 3 | ||
|---|---|---|---|---|---|
| N50 | 283,328 | 390,289 | (38%) | 33,018,251 | (8,340%) |
| L50 | 1,359 | 1,030 | (-24%) | 20 | (-98.5%) |
| N90 | 38,415 | 49,679 | (29%) | 23,618,429 | (47,442%) |
| L90 | 6,620 | 5,397 | (-18%) | 40 | (-99.4%) |
| Scaffolds | 24,369 | 18,265 | (-25%) | 51 | (-99.7%) |
| Length | |||||
| Total Average sd min max | 1,468,519,221 | 1,575,987,192 | 1,485,210,005 | ||
| 60,261 | 86,285 | (43%) | 29,121,765 | (33,651%) | |
| 145,243 | 207,343 | 9,938,557 | |||
| 975 | 984 | (1%) | 6,483,087 | (658,750%) | |
| 2,502,076 | 4,048,953 | (62%) | 44,988,017 | (1,011%) | |
| Known bases | 87% | 95% | 95% | ||
| GC% | 42.7% | 42.8% | 42.7% | ||
Figure 1The European grayling genome. The tracks indicate the following information: (A) Chromosome number according to Northern pike, used as a non-WGD outgroup, and predicted chromosomal type (M = metacentric; SM = submetacentric; and T = acrocentric); (B) Predicted centromere (in purple) and telomere (in yellow) locations; (C) Proportions of the Tc1-mariner (in purple) and RTE-X elements (in yellow), which are used to predict centromere and telomere positions, respectively; (D) Gene density; (E) Female linkage map units; and (F) Female- (in red) and male-biased (in blue) sex-biased loci with the sdY gene region indicated.
Transposable element classification and abundances in the European grayling and Atlantic salmon genomes
| Transposable element | European grayling | Atlantic salmon | ||||
|---|---|---|---|---|---|---|
| Class | Order | Superfamily | base pairs | % coverage | base pairs | % coverage |
| RNA-transposons (class I) | LINE | Jockey | 133311368 | 9.0 | 228445871 | 10.2 |
| RTE | 10629973 | 0.7 | 12021858 | 0.5 | ||
| L1 | 6987652 | 0.5 | 15019466 | 0.7 | ||
| I | 1925092 | 0.1 | 2994056 | 0.1 | ||
| LTR | Gypsy | 88958008 | 6.0 | 120880877 | 5.4 | |
| ERV | 23336303 | 1.6 | 29951691 | 1.3 | ||
| Bel-Pao | 4559403 | 0.3 | 5228163 | 0.2 | ||
| Copia | 1009467 | 0.1 | 3887925 | 0.2 | ||
| SINE | tRNA | 6099534 | 0.4 | 12208774 | 0.5 | |
| Unknown | 439123 | 0.0 | 1325090 | 0.1 | ||
| DIRS | DIRS | 6352447 | 0.4 | 13786775 | 0.6 | |
| PLE | Penelope | 1392065 | 0.1 | 1728304 | 0.1 | |
| Unknown | Unknown | 102556 | 0.0 | 157518 | 0.0 | |
| DNA-transposons (class II) | TIR | Tc1-Mariner | 143125085 | 9.6 | 226246051 | 10.1 |
| hAT | 18745228 | 1.3 | 26354108 | 1.2 | ||
| CACTA | 1063048 | 0.1 | 1575430 | 0.1 | ||
| PIF-Harbinger | 606932 | 0.0 | 788870 | 0.0 | ||
| PiggyBac | 237022 | 0.0 | 555705 | 0.0 | ||
| Unknown | Unknown | 5627491 | 0.4 | 90211317 | 4.0 | |
| Sola | 148327 | 0.0 | 373468 | 0.0 | ||
| Ginger1 | 59568 | 0.0 | 66177 | 0.0 | ||
| ISL2EU | 37185 | 0.0 | 0 | 0.0 | ||
| Crypton | Crypton | 1763736 | 0.1 | 2838181 | 0.1 | |
| Maverick | Maverick | 4732 | 0.0 | 147407 | 0.0 | |
| Unknown | 247405186 | 16.7 | 374107217 | 16.7 | ||
Figure 3Comparison of repeat element abundances in the European grayling and Atlantic salmon genomes. The RepBase names of the elements that were more abundant in either of the two species and outside the 95% confidence limit are also given. Residuals are colored to indicate element class and order categories.
Figure 2Chromosomal rearrangements between the European grayling and Atlantic salmon genomes. Each box represents an Atlantic salmon chromosome depicted on the right side of the box. The corresponding European grayling chromosomes are shown in different colors on the left side. Horizontal lines represent corresponding positions between the two genomes, with sequence identity depicted in a bluered scale on the right side of the box. Purple and yellow circles depict predicted centromere and telomere positions in the European grayling chromosomes, respectively. The axes scales represent the log2-transformed abundance in base pairs +1.
Figure 4Karyotypic changes among salmonid taxa. A Bayesian chronogram tree based on mitochondrial sequence (tree obtained and edited from Shedko , doi: 10.5061/dryad.r42qf) of those salmonids that have diploid chromosome number (2n) and the number of chromosome arms (NF) available in Phillips and Ráb (2001).