| Literature DB >> 35459797 |
Grace A Wyngaard1, Rasmus Skern-Mauritzen2, Ketil Malde2,3, Rachel Prendergast1, Stefano Peruzzi4.
Abstract
The genome size of organisms impacts their evolution and biology and is often assumed to be characteristic of a species. Here we present the first published estimates of genome size of the ecologically and economically important ectoparasite, Lepeophtheirus salmonis (Copepoda, Caligidae). Four independent L. salmonis genome assemblies of the North Atlantic subspecies Lepeophtheirus salmonis salmonis, including two chromosome level assemblies, yield assemblies ranging from 665 to 790 Mbps. These genome assemblies are congruent in their findings, and appear very complete with Benchmarking Universal Single-Copy Orthologs analyses finding > 92% of expected genes and transcriptome datasets routinely mapping > 90% of reads. However, two cytometric techniques, flow cytometry and Feulgen image analysis densitometry, yield measurements of 1.3-1.6 Gb in the haploid genome. Interestingly, earlier cytometric measurements reported genome sizes of 939 and 567 Mbps in L. salmonis salmonis samples from Bay of Fundy and Norway, respectively. Available data thus suggest that the genome sizes of salmon lice are variable. Current understanding of eukaryotic genome dynamics suggests that the most likely explanation for such variability involves repetitive DNA, which for L. salmonis makes up ≈ 60% of the genome assemblies.Entities:
Mesh:
Year: 2022 PMID: 35459797 PMCID: PMC9033869 DOI: 10.1038/s41598-022-10585-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.996
Key statistics and sequencing platforms for salmon louse assemblies available March 2021.
| Assemblies and accession details | Assembly level | Louse origin | Size (Mbps) | N50 (Kbps) | Sequencing platform (s) | Depth | Reference |
|---|---|---|---|---|---|---|---|
| UVic_Lsal_1.0 GCA_016086655.1 | Chromosome | Pacific | 668.1 | 48,457 | Oxford Nanopore, Illumina | 45X | Database only*** |
UStir_LSAA PRJEB43242 | Chromosome | Atlantic | 632.5 609.3* | 5100 43,017** | PacBio | 107** | Database only*** |
| LSalAtl2S licebase.org | Scaffold | Atlantic | 695.4 | 478 | Illumina HiSeq, 454, Sanger fosmid end | 175X | [ |
| Atlantic female GCA_001005205.1 | Scaffold | Atlantic | 665.1 | 16 | Illumina HiSeq | 45X | Database only*** |
| Atlantic male GCA_001005235.1 | Scaffold | Atlantic | 665.3 | 13 | Illumina HiSeq | 55X | Database only*** |
| ASM18125v2 GCA_000181255.2 | Scaffold | Pacific | 790.1 | 10 | Illumina, 454 | 88X | [ |
Depths of sequencing coverage are the values associated with the assemblies and are estimated based on assembly-indicated genome sizes (i.e. ≈700 Mbps). The ASM18125v2 assembly is quite fragmented and the authors indicate that the assembly overestimates the actual size suggested to be around 600 Mbps (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA40179). Only LSalAtl2s is extensively assessed and the predicted gene set appears to be quite complete as it contains 92.4% of the expected genes in a BUSCO analysis and maps ≈ 90% of transcriptome reads[48,49].
*The UStir_LSAA assembly comprised 158 sequences in total, 15 of which were identified as chromosomes. The value denoted ‘*’ is the assembly size for these 15 chromosomes.
**Calculated based on the 632.5 Mbp assembly size.
***Data available in public repositories were included with permission from the assembly authors in adherence to the Fort Lauderdale agreement[50].
Consistency of assemblies.
| ASM18125v2 | Atlantic female | Atlantic male | UVic_Lsal_1.0 | UStir_LSAA | LSalAtl2S | |
|---|---|---|---|---|---|---|
| ASM18125v2 GCA_000181255.2 | 97.74% (0.90) | 96.90% (0.90) | 96.17% (0.97) | 97.12% (0.88) | 97.50% (0.88) | |
| Atlantic female GCA_001005205.1 | 93.77% (0.88) | 97.85% (0.98) | 93.67% (0.89) | 97.22% (0.95) | 96.97% (0.93) | |
| Atlantic male GCA_001005235.1 | 94.01% (0.88) | 98.90% (0.98) | 94.06% (0.89) | 97.36% (0.95) | 96.99% (0.93) | |
| UVic_Lsal_1.0 GCA_016086655.1 | 96.44% (0.97) | 98.31% (0.91) | 97.52% (0.91) | 98.13% (0.89) | 98.12% (0.89) | |
| UStir_LSAA PRJEB43242 | 93.48% (0.87) | 97.47% (0.97) | 96.59% (0.97) | 93.75% (0.88) | 97.74% (0.95) | |
| LSalAtl2S licebase.org | 94.56% (0.89) | 98.58% (0.97) | 97.59% (0.97) | 94.80% (0.89) | 98.57% (0.96) |
The assemblies were converted to 240 bp synthetic reads that were blasted against all other assemblies. The assemblies from which synthetic reads originated are indicated in column headings and the reference assemblies are indicated in the row headings. The results show average query cover % and (the proportion of reads that maps with > 95% identity).
Nuclear DNA contents of somatic and gametic cells of L. salmonis salmonis as measured using flow cytometry.
| Run | Origin | Stage | Tissue | N | 2C value (pg) vs. CEN | 2 | CV (%) | 2 |
|---|---|---|---|---|---|---|---|---|
| 1 | LsTromsø | Nauplii | WB | 10 | 3.08 ± 0.009 | na | 2–4 | 3.01 |
| 2 | LsTromsø | Nauplii | WB | 11 | 3.09 ± 0.005 | 3.06 ± 0.021 | 2–3 | 3.01* |
| 3 | LsTromsø | Adult ♀ | CT | 3 | 3.10 ± 0.020 | 3.12 ± 0.008 | 3–5 | 3.04* |
| Adult ♂ | CT | 6 | 3.14 ± 0.139 | 3.14 ± 0.010 | 3–5 | 3.07* | ||
| 4 | LsGulen | Adult ♀ | O | 5 | 3.36 ± 0.003 | 3.31 ± 0.003 | 2–4 | 3.26* |
| Adult ♂ | S | 5 | 1.69 ± 0.006 | 1.68 ± 0.002 | 1–3 | 1.65* | ||
| 5 | LsTromsø | Adult ♀ | CT | 5 | 3.01b ± 0.022 | na | 2–3 | 2.94 |
| LsTromsø | Adult ♂ | CT | 5 | 3.19a ± 0.016 | na | 3–4 | 3.12 | |
| Adult ♀ | CT | 11 | 3.08b ± 0.031 | na | 2–3 | 3.01 | ||
| Adult ♂ | CT | 5 | 3.19a ± 0.013 | na | 3–5 | 3.12 |
Runs 1, 2, 3, and 5 measured somatic cells of nauplii or adults of the LsTromsø strain. Run 4 measured gametic cells of the Ls Gulen laboratory strain. Run 5 compared the laboratory reared LsTromsø strain to wild caught adults from naturally infected fish reared in Tromsø. Chicken and/or human white blood cells were used as internal reference standards.
CEN Chicken erythrocyte nuclei (2C value = 2.5 pg DNA per nucleus), MNCs human mono-nucleated cells (2C value = 7.0 pg DNA per nucleus), WB whole body, CT cuticular and subcuticular tissues from cephalic region, O oocytes, S sperm, N number of individuals or number of samples in the cases of nauplii analyzed, CV coefficient of variation as a percentage of mean for target nuclei (L. salmonis data), Na not available. For naupliar stages (run 1 and 2), each sample consisted of approximately 50 nauplii.
*Average value based on the two internal standards. For Run 5, 2C values with superscripts “a” and “b” differ at P < 0.05 (two-way ANOVA).
Figure 1Representative fluorescence histograms of PI-stained somatic and gametic cells of L. salmonis salmonis. (a) Run 1, somatic cells obtained from whole body squash of nauplii; (b) run 3, somatic cells obtained from cephalic regions of adult females; (c) run 4; oocyte and sperm samples; (d) run 5, somatic cells obtained from cephalic regions of adult males and females. Samples analyzed using chicken and human cells as internal reference standards. Fluorescence (FL) peaks values on the X-axis are reported in arbitrary units expressed as Propidium Iodide (PI) fluorescence channel numbers (FL value). Values on the Y-axis (counts) refer to the number of nuclei counted per channel. CEN chicken erythrocytes nuclei, MNCs human mono-nucleated cells.
Figure 2Photomicrographs of squash preparations of somatic nuclei of cephalothorax of L. salmonis salmonis and erythrocytes of hen and trout used as standards, stained with Feulgen reaction for DNA. (a) Nuclei of erythrocytes of G. domesticus; (b) nuclei of erythrocytes of O. mykiss; (c) representative nuclei, indicated by arrows, of male L. salmonis salmonis slide 64 whose measurements are given in Table 4; (d) representative nucleus, indicated by arrow, of female L. salmonis salmonis slide 63 whose measurements are given in in Table 4; (e) nuclei of male L. salmonis salmonis slide 64 with high level of DNA compaction; (f) nuclei of male L. salmonis salmonis slide 64 with highly diffuse morphology. Scale bars represent 10 µm.
Nuclear DNA contents of somatic cells of individual adult L. salmonis as measured using Feulgen image analysis densitometry.
| Origin | Stage | Tissue | N | 2 | 2 | CV (%) | 2 |
|---|---|---|---|---|---|---|---|
| Adult ♀ | CT | 13 | 2.78 ± 0.012 | na | 1.6 | 2.74 | |
| Adult ♀ | CT | 15 | 2.64 ± 0.003 | na | 3.3 | 2.60 | |
| Adult ♀ | CT | 92 | 2.67 ± 0.003 | na | 10.4 | 2.63 | |
| Average | 2.70 ± 0.043 | na | 2.66 | ||||
| Adult ♀ | CT | 34 | 2.64 ± 0.046 | 2.60 ± 0.046 | 10.2 | 2.60 | |
| Adult ♀ | CT | 20 | 2.75 ± 0.037 | 2.71 ± 0.037 | 6.1 | 2.67 | |
| Adult ♀ | CT | 16 | 2.71 ± 0.025 | 2.67 ± 0.025 | 3.9 | 2.65 | |
| Average | 2.70 ± 0.032 | 2.66 ± 0.032 | 2.66 | ||||
| Adult ♂ | CT | 65 | 2.93 ± 0.035 | na | 9.9 | 2.89 | |
| Adult ♂ | CT | 36 | 2.86 ± 0.037 | na | 7.9 | 2.82 | |
| Adult ♂ | CT | 19 | 2.90 ± 0.054 | na | 8.3 | 2.85 | |
| Average | 2.90 ± 0.020 | na | 2.85 | ||||
| Copscook Bay, Maine | Adult ♀ | SP | 3 | 3.07 ± 0.176 | na | 9.8 | 3.00 |
Cephalothorax (CT) tissues of laboratory reared adults were obtained from the Ls Gulen and Ls1a populations. Tissue from the spine of an appendage (SP) was obtained from a wild caught Maine population. Hen (CEN) and male human mononucleated leucocytes (MNCs) were used as internal reference standards to estimate values in picograms (pg). Values based on hen were converted to gigabases (Gb). N refers to number of nuclei measured in each adult. SEM refers to standard error of the mean; CV refers to coefficient of variation of IOD values.
Figure 3Somatic nuclear DNA contents of copepods. Each species is represented by a vertical bar. Estimates based on cytometric methods, genome assemblies, and mapping are presented in orange, blue, and dotted black, respectively, for the siphonostomatoid L. salmonis salmonis. The black dotted lines depict a range of values obtained for L. salmonis salmonis using mapping. FIAD and FCM measurements depicted by green vertical lines for L. salmonis salmonis are from Gregory[53] and Jeffery[54], respectively. FIAD (orange) and genome assembly (blue) estimates are shown for the harpacticoid T. californicus[63,64]. FIAD (orange) and genome assembly (blue) estimates for the calanoid E. affinis are the two left most vertical lines in the distribution of Calanoida[65,66]. The adult somatic genome size estimates of Cyclopoida and Calanoida (orange) were mostly acquired using FIAD and are from Gregory[53]. All estimates are presented as 2C values found in the soma because some cyclopoid species and at least one calanoid species possess embryonic chromatin diminution, in which germline genome sizes (conventionally noted as 1C) substantially exceed the sizes of somatic values, thus violating the appropriateness of halving 2C estimates in the soma to obtain 1C estimates of the germline.