| Literature DB >> 30212580 |
Kris A Christensen1,2,3, Eric B Rondeau1,2, David R Minkley2, Jong S Leong2, Cameron M Nugent4, Roy G Danzmann4, Moira M Ferguson4, Agnieszka Stadnik3, Robert H Devlin1, Robin Muzzerall5, Michael Edwards5, William S Davidson3, Ben F Koop2.
Abstract
Arctic charr have a circumpolar distribution, persevere under extreme environmental conditions, and reach ages unknown to most other salmonids. The Salvelinus genus is primarily composed of species with genomes that are structured more like the ancestral salmonid genome than most Oncorhynchus and Salmo species of sister genera. It is thought that this aspect of the genome may be important for local adaptation (due to increased recombination) and anadromy (the migration of fish from saltwater to freshwater). In this study, we describe the generation of a new genetic map, the sequencing and assembly of the Arctic charr genome (GenBank accession: GCF_002910315.2) using the newly created genetic map and a previous genetic map, and present several analyses of the Arctic charr genes and genome assembly. The newly generated genetic map consists of 8,574 unique genetic markers and is similar to previous genetic maps with the exception of three major structural differences. The N50, identified BUSCOs, repetitive DNA content, and total size of the Arctic charr assembled genome are all comparable to other assembled salmonid genomes. An analysis to identify orthologous genes revealed that a large number of orthologs could be identified between salmonids and many appear to have highly conserved gene expression profiles between species. Comparing orthologous gene expression profiles may give us a better insight into which genes are more likely to influence species specific phenotypes.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30212580 PMCID: PMC6136826 DOI: 10.1371/journal.pone.0204076
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Arctic charr genetic map crosses.
| Family | 1 | 2 | 3 | 4 | 5 | 7 | 9 | 52 | 59 | F2 | F3 |
|---|---|---|---|---|---|---|---|---|---|---|---|
| Female | Hybrid NLf x TRm | FRu x NLu | FRf | ||||||||
| Male | TRm | NLm | FRm | FRu x NLu | |||||||
| Progeny | 14 | 23 | 28 | 31 | 24 | 24 | 25 | 33 | 36 | 29 | 38 |
NL, Nauyuk Lake; TR, Tree River; FR, Fraser River (Labrador); m:male f:female u:unknown
NCBI accessions for RNA-seq analysis.
| Species | Tissue | NCBI (SRA) Accession |
|---|---|---|
| Brain | SRX2635054, SRX3379484, SRX2632051, SRX608607, SRX514235, SRX2894156 | |
| Eye | SRX2635059, SRX3379482, SRX2632050, SRX608616, SRX514236, NA | |
| Gill | SRX2635050, SRX3379485, SRX2632049, SRX608399, SRX514237, SRX2894158 | |
| Gonad | SRX2635049, SRX3379473, SRX2632041, SRX608620, SRX514271, SRX2894150 | |
| Gut | SRX2635056, SRX3379477, SRX2632048, SRX608567, SRX514238, SRX2894160 | |
| Head Kidney | SRX2635052, SRX3379472, SRX2632047, SRX608569, SRX514240, SRX2894157 | |
| Heart | SRX2635058, SRX3379476, SRX2632046, SRX608571, SRX514258, NA | |
| Hind Kidney | SRX2635053, NA, SRX2632045, SRX608574, SRX514263, SRX2894159 | |
| Liver | SRX2635048, SRX3379480, SRX2632044, SRX608575, SRX514266, SRX2894162 | |
| Muscle | SRX2635055, SRX3379475, SRX2632043, SRX608579, SRX514267, SRX2894153 | |
| Spleen | SRX2635057, SRX3379469, SRX2632038, SRX608599, SRX514270, SRX2894163 | |
| Stomach | SRX2635051, SRX3379470, SRX2632037, NA, SRX514269, SRX2894151 |
Accession numbers are ordered by species from the species column
Fig 1Circos plot of Arctic charr genome assembly.
A) Is comprised of bar charts for each chromosome with the fraction of repetitive DNA found in million bp windows with a scale from 0 to 1 (bars with scores greater than 0.65 are shown in orange). B) Displays bar charts of percent identity between homeologous regions defined by SyMap v4.2. The percent identity was found by weighting the percent identity by alignment lengths. Each bar represents a million bp window, and the y-axis starts at 75 percent and ends at 100 percent identity. Windows with score greater than 90 percent were highlighted orange. C) Ideograms of each of the Arctic charr chromosomes. The internal ribbons link homeologous (or duplicated) regions of the genome.
Fig 2Comparison of genome assemblies from selected species to the Arctic charr genome assembly.
The lighter colors represent alignments that are in forward orientation, while the darker colors represent reverse orientations. For the Arctic charr chromosomes, all numbers indicate corresponding chromosome names without the prefix of AC (and 0 for those below 10). For all other species, the chromosome numbers represent NCBI designations without letter prefixes.
Homologous chromosome arm comparison between species.
| E. lucius (pike) | S. alpinus (charr) | O. kisutch (coho) | S. salar (Atlantic) |
|---|---|---|---|
| 1.1 | 15b | 20b | |
| 1.2 | 18a | 9c | |
| 2.1 | AC15b | 3b | |
| 2.2 | 8b | 11a | |
| 3.1 | 14a | ||
| 3.2 | 3a | ||
| 4.1 | AC08a | 15a | 9b |
| 4.2 | 19b | 5a | |
| 5.1 | AC18a | 20a | 19b |
| 5.2 | AC08b | ||
| 6.1 | AC18b | 11a | 1b |
| 6.2 | 4b | 18a | |
| 7.1 | AC20b | 6a | 13b |
| 7.2 | 4b | ||
| 8.1 | AC13b | 10a | |
| 8.2 | 13a | 10a | |
| 9.1 | 20b | 2b | |
| 9.2 | AC01a | 1b | 12a |
| 10.1 | 17a | ||
| 10.2 | 14b | 14b | |
| 11.1 | AC20a | 10b | 6a |
| 11.2 | AC20d | 6b | 3b |
| 12.1 | 13a | ||
| 12.2 | 17b | 15b | |
| 13.1 | AC15a | 3a | |
| 13.2 | 8a | 20a | |
| 14.1 | AC04q.1.29a | 1c | |
| 14.2 | (AC20c, AC28a) | 11b | |
| 15.1 | 14a | 9a | |
| 15.2 | 7b | 1a | |
| 16.1 | |||
| 16.2 | 2b | ||
| 17.1 | AC01b | 1a | 12b |
| 17.2 | 5b | ||
| 18.1 | AC28b | 12a | 15a |
| 18.2 | AC14a | 6b | |
| 19.1 | AC04q.1.29b | 10b | |
| 19.2 | AC26b | 4a | 16a |
| 20.1 | AC06.1a | 13b | 5b |
| 20.2 | 2a | 2a | |
| 21.1 | AC27a | 11b | |
| 21.2 | AC14b | 18b | 19a |
| 22.1 | AC27b | 5a | 17a |
| 22.2 | 16b | 16b | |
| 23.1 | AC03a | 9a | 7b |
| 23.2 | 19a | 17b | |
| 24.1 | AC03b | 9b | 7a |
| 24.2 | 16a | 18b | |
| 25.1 | AC13c | 12b | 4a |
| 25.2 | 7a |
A comparison of Arctic charr homologous chromosome arms based on chromosome nomenclature from [14]. For each northern pike chromosome there are two chromosomes in the Salmonidae family due to an early salmonid-specific genome duplication (e.g. 1.1 and 1.2 refer to the same northern pike chromosome).
aExpected to align to Arctic charr chromosome AC05, but aligned to AC20c and AC28a.
bAppears to be a transposition error based on the current alignments.
c9b and 19a did not align with the parameters that were used, and are consequently missing in the current analysis.
Chromosomes that only match one northern pike chromosome are in bold text.
Fig 3Example of homologous chromosomes and orthologs between species.
Linear alignments between the rainbow trout and Arctic charr genome assembliess are shown in the bottom graph. The rainbow trout chromosomes correspond to the NCBI RefSeq accession from the lowest number to highest. Identified orthologous genes are plotted with their relative positions on the rainbow trout and Arctic charr chromosomes on the top graph. Genes without a known ortholog are plotted in gray.
Number of identified orthologs between Arctic charr and other species.
| Species | Orthologs |
|---|---|
| Arctic charr | 11,252 (homeologs) |
| Northern pike | 15,805 |
| Chinook salmon | 16,709 |
| Coho salmon | 17,837 |
| Rainbow trout | 19,646 |
| Atlantic salmon | 18,252 |
Fig 4Overlapping orthologs and homeologs between species.
A histogram of the number of species with a specific ortholog. If an ortholog was found in all species (including homeologs), it would be in the ‘Homeologs and Orthologs’ column. If an ortholog/homeolog was found in four species, it would be in the 4+ column and all prior columns. The ‘All Orthologs’ column contains only the overlapping orthologs (without regards to overlap with a homeolog) of all the species in this study. Likewise, the ‘All Salmonid Orthologs’ column contains the count of overlapping orthologs (and not homeologs) in only the salmonid species (Arctic charr, rainbow trout, coho salmon, Chinook salmon, and Atlantic salmon).
RNA-seq statistics.
| Species | Average Total Reads | Average Percent Mapped |
|---|---|---|
| Arctic charr | 90,405,878 | 74 |
| northern pike | 59,880,079 | 80 |
| coho salmon | 205,322,058 | 72 |
| rainbow trout | 89,299,071 | 65 |
| Atlantic salmon | 61,971,752 | 74 |
| Chinook salmon | 256,708,646 | 69 |
Fig 5Histograms of correlation coefficients found between Arctic charr orthologs and other species.
Each histogram also has a pie diagram to illustrate the frequency of a range of correlation values. Each histogram is labeled for the species that was compared to the Arctic charr orthologs (SalpH—11,035 homeologous genes, Eluc—15,428 orthologous northern pike genes, Okis—17,490 orthologous coho salmon genes, Otsh—16,320 orthologous Chinook salmon genes, Ssal—17,617 orthologous Atlantic salmon genes, Omyk—19,122 orthologous rainbow trout genes).
Fig 6Heatmap of correlation coefficients between species and tissues.
Pearson correlation coefficients were calculated between different tissues and species combinations based on log transformed gene expression values (5304 common orthologous or homeologous genes). These values were then used to cluster each tissue-species combination. Both dendrograms represent the same information and each leaf depicts one tissue and species combination. The heatmap shows the correlation coefficient between each combination. The groups at the bottom were manually added to help show how the groups were clustered. (Salp—Arctic charr, SalpH—Arctic charr homeologous gene pair, Okis—coho salmon, Otsh—Chinook salmon, Ssal—Atlantic salmon, Eluc—northern pike, Omyk—rainbow trout).
Number of SNPs identified in commercial stains of Arctic charr.
| Genetic Background | Heterozygous SNPs | Total Nucleotide Variants | SNP Frequency / kbp |
|---|---|---|---|
| Tree River | 2986470 | 4355891 | 2.03 |
| Tree River | 3322237 | 4649645 | 2.16 |
| TRm x NLf | 7222605 | 8573274 | 3.99 |
| TRm x NLf | 7413500 | 8715877 | 4.05 |
| (TRm x (TRm x NLf)f)m x (TRm xNLf)f | 5686470 | 7681273 | 3.57 |
| NLm x ((TRm x NLf)m x TRf)f | 6214612 | 8497451 | 3.95 |
| (TRm x NLf)m x NLf | 4665517 | 8227761 | 3.83 |
| TRm x ((TRm x NLf)m x TRf)f | 4688898 | 5906707 | 2.75 |