| Literature DB >> 31953426 |
Simo N Maduna1, Adam Vivian-Smith2, Ólöf Dóra Bartels Jónsdóttir3, Albert K D Imsland3,4, Cornelya F C Klütsch5, Tommi Nyman5, Hans Geir Eiken5, Snorre B Hagen6.
Abstract
The lumpfish Cyclopterus lumpus is commercially exploited in numerous areas of its range in the North Atlantic Ocean, and is important in salmonid aquaculture as a biological agent for controlling sea lice. Despite the economic importance, few genetic resources for downstream applications, such as linkage mapping, parentage analysis, marker-assisted selection (MAS), quantitative trait loci (QTL) analysis, and assessing adaptive genetic diversity are currently available for the species. Here, we identify both genome- and transcriptome-derived microsatellites loci from C. lumpus to facilitate such applications. Across 2,346 genomic contigs, we detected a total of 3,067 microsatellite loci, of which 723 were the most suitable ones for primer design. From 116,555 transcriptomic unigenes, we identified a total of 231,556 microsatellite loci, which may indicate a high coverage of the available STRs. Out of these, primer pairs could only be designed for 6,203 loci. Dinucleotide repeats accounted for 89 percent and 52 percent of the genome- and transcriptome-derived microsatellites, respectively. The genetic composition of the dominant repeat motif types showed differences from other investigated fish species. In the genome-derived microsatellites AC/GT (67.8 percent), followed by AG/CT (15.1 percent) and AT/AT (5.6 percent) were the major motifs. Transcriptome-derived microsatellites showed also most dominantly the AC/GT repeat motif (33 percent), followed by A/T (26.6 percent) and AG/CT (11 percent). Functional annotation of microsatellite-containing transcriptomic sequences showed that the majority of the expressed sequence tags encode proteins involved in cellular and metabolic processes, binding activity and catalytic reactions. Importantly, STRs linked to genes involved in immune system process, growth, locomotion and reproduction were discovered in the present study. The extensive genomic marker information reported here will facilitate molecular ecology studies, conservation initiatives and will benefit many aspects of the breeding programmes of C. lumpus.Entities:
Mesh:
Year: 2020 PMID: 31953426 PMCID: PMC6968997 DOI: 10.1038/s41598-019-57071-w
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the in silico search for STRs in the Cyclopterus lumpus genome and transcriptome, respectively. Figure in parenthesis show the total number of STR-containing post-annotation filtering.
| Search parameters | Genomic | Transcriptomic |
|---|---|---|
| Total number of sequences examined | 990,653 | 322,381 |
| Total length of examined sequences (bp) | 213,385,867 | 418,639,584 |
| Total number of unique reads (reduced by QDD) | 2,346 | 255,957 |
| Total number of identified STRs | 3,067 | 231,556 |
| Number of STR-containing sequences | 2,346 | 116,555 |
| Number of sequences containing more than 1 STR | 555 | 57,717 |
| Number of STRs present in compound formation | 680 | 38,550 |
| Number of STR-containing sequences with primers | 723 (394) | 6,203 |
Figure 1Relative frequencies of different motif length classes in (a) g-STRs and (b) EST-STRs of Cyclopterus lumpus.
Distribution of Cyclopterus lumpus g-STRs to different repeat motif length classes.
| Repeats motif | Number of repeats | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 15+ | |
| Mononucleotide | 30 | 15 | 8 | 11 | 6 | 4 | 5 | |||||
| Dinucleotide | 657 | 333 | 206 | 207 | 164 | 175 | 137 | 153 | 108 | 98 | 67 | 431 |
| Trinucleotide | 101 | 31 | 25 | 20 | 7 | 2 | 2 | 5 | 2 | 1 | 0 | 0 |
| Tetranucleotide | 26 | 9 | 6 | 1 | 1 | 1 | 2 | 1 | 0 | 0 | 1 | 3 |
| Pentanucleotide | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| Hexanucleotide | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
Figure 2Relative frequencies of the 13 most abundant non-redundant repeat motifs in (a) g-STRs and (b) EST-STRs in Cyclopterus lumpus.
Distribution of Cyclopterus lumpus EST-STRs to different repeat motif length classes.
| Repeats motif | Number of repeats | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 15+ | |
| Mononucleotide | 26,626 | 15,798 | 9,731 | 6,569 | 4,150 | 3,321 | 10,370 | |||||
| Dinucleotide | 39,548 | 19,578 | 12,715 | 10,011 | 7,856 | 5,628 | 5,661 | 1,625 | 80 | 217 | 1,250 | 16,184 |
| Trinucleotide | 13,690 | 7,555 | 4,583 | 1,337 | 98 | 858 | 760 | 600 | 469 | 268 | 214 | 468 |
| Tetranucleotide | 1,695 | 633 | 51 | 136 | 124 | 89 | 58 | 45 | 19 | 66 | 24 | 65 |
| Pentanucleotide | 176 | 31 | 32 | 25 | 6 | 7 | 7 | 6 | 4 | 2 | 1 | 3 |
| Hexanucleotide | 136 | 97 | 64 | 51 | 27 | 19 | 12 | 8 | 9 | 5 | 0 | 5 |
Figure 3Annotation of STR-containing EST unigenes in Cyclopterus lumpus according to their biological function, namely biological process (PB), cellular component (CC), and molecular function (MF) and respective subcategories.
Figure 4Shematic representation of the bioinformatic analysis used to obtain genome- and transcriptome-derived microsatellite loci for Cyclopterus lumpus.