| Literature DB >> 31900119 |
Le Qin Choo1,2, Thijs M P Bal3, Marvin Choquet3, Irina Smolina3, Paula Ramos-Silva4, Ferdinand Marlétaz5, Martina Kopp3, Galice Hoarau3, Katja T C A Peijnenburg6,7.
Abstract
BACKGROUND: Pteropods are planktonic gastropods that are considered as bio-indicators to monitor impacts of ocean acidification on marine ecosystems. In order to gain insight into their adaptive potential to future environmental changes, it is critical to use adequate molecular tools to delimit species and population boundaries and to assess their genetic connectivity. We developed a set of target capture probes to investigate genetic variation across their large-sized genome using a population genomics approach. Target capture is less limited by DNA amount and quality than other genome-reduced representation protocols, and has the potential for application on closely related species based on probes designed from one species.Entities:
Keywords: Exon capture; Genome; Marine zooplankton; Non-model organism; Targeted sequencing
Mesh:
Year: 2020 PMID: 31900119 PMCID: PMC6942316 DOI: 10.1186/s12864-019-6372-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of draft genome statistics for Limacina bulimoides
| Assembly statistics | Value |
|---|---|
| Estimated total genome size | 4,801,432,559 bp |
| Total assembly size | 2,901,932,435 bp |
| Number of scaffolds | |
| > = 0 bp | 3,735,734 |
| > = 1000 bp | 802,059 |
| > = 5000 bp | 3890 |
| > = 10,000 bp | 116 |
| > = 25,000 bp | 6 |
| > = 50,000 bp | 3 |
| N50 | 893 bp |
| L50 | 994,289 |
| Smallest scaffold | 200 bp |
| Largest scaffold | 197,255 bp |
| Percentage of N’s | 0.3307 |
| GC content, % | 34.08 |
Summary of BUSCO analysis showing the number of metazoan near universal orthologues that could be detected in the draft genome of Limacina bulimoides
| Present in draft genome | |
|---|---|
| Complete | 296 (30.3%) |
| Complete and single-copy | 262 (26.8%) |
| Complete and duplicated | 34 (3.5%) |
| Fragmented | 292 (29.9%) |
| Missing | 390 (39.8%) |
| Total BUSCO groups searched | 978 |
Target capture efficiency statistics, averaged ± standard deviation across nine individuals, for each of five pteropod species, including raw reads, final mapped reads, % High Quality reads (reads mapping uniquely to the targets with proper pairs), % targets covered (percentage of bases across all targets covered by at least one read), average depth (sequencing depth across all targets with reads mapped)
| Species | Raw reads (× 1,000) | Final mapped reads (× 1,000) | % HQ reads | % targets covered | Average depth |
|---|---|---|---|---|---|
| 10,529 ± 3997 | 3531 ± 1548 | 33.23 ± 9.10 | 97.36 ± 0.42 | 250 ± 111 | |
| 15,508 ± 4865 | 1765 ± 521 | 11.61 ± 2.59 | 20.32 ± 1.65 | 468 ± 144 | |
| 7060 ± 2043 | 807 ± 196 | 11.93 ± 2.77 | 13.28 ± 1.96 | 431 ± 76.9 | |
| 10,346 ± 6260 | 337 ± 180 | 3.47 ± 0.56 | 12.57 ± 2.71 | 63.7 ± 26.7 | |
| 3089 ± 1126 | 66 ± 30 | 2.07 ± 0.30 | 8.21 ± 3.34 | 31.9 ± 14.9 |
Number of single nucleotide polymorphism (SNPs) recovered after various filtering stages for five species of shelled pteropods. Hard-filtering was implemented in GATK3.8 VariantFiltration using the following settings: QualByDepth <2.0, FisherStrand >60.0, RMSMappingQuality <5.0, MQRankSumTest <-5.0 and ReadPositionRankSum <-5.0. The hard-filtered SNPs were subsequently filtered to keep those with a minimum site coverage of 5x and present in at least 80% of the individuals. Other filtering options were less stringent, such as a minimum depth of 2x and site presence in at least 50% of individuals
| Hard-filtering | 80% individuals, 5x depth | 80% individuals, 2x depth | 50% individuals,5x depth | |
|---|---|---|---|---|
| 154,864 | 137,938 | 137,953 | 147,763 | |
| 44,014 | 11,948 | 12,165 | 20,518 | |
| 23,379 | 5359 | 5847 | 8487 | |
| 18,298 | 2432 | 2771 | 4613 | |
| 13,041 | 1371 | 1559 | 2092 |
Fig. 3Density of single nucleotide polymorphisms (SNPs, present in 80% of individuals) plotted against coverage for each of the five pteropod species (a: Limacina bulimoides, b: L. trochiformis, c: L. lesueurii, d: L. helicina, and e: Heliconoides inflatus). The plots were truncated at coverage = 2000x for L. bulimoides and coverage = 1000x for the other four species. Note that minimum coverage is 45x due to filtering settings of a minimum 5x depth for 9 individuals
Fig. 1Number of recovered targets plotted against average proportion of bases in each target, with at least 15x sequencing coverage averaged across nine individuals, for each for the five shelled pteropod species (a: Limacina bulimoides, b: L. trochiformis, c: L. lesueurii, d: L. helicina, and e: Heliconoides inflatus). Bars on the right of the dashed vertical line represent the number of targets where more than 90% of the bases in each target was sequenced with ≥15x depth. Note the differences in y-axes between the plots. There is no peak at one SNP for L. bulimoides (Additional file 1: Appendix S5)
Number of targets with at least one single nucleotide polymorphism (based on 80% genotyping rate, 5x depth) was calculated according to category: candidate biomineralisation genes (Biomin.), conserved pteropod orthologues (Ortholog.), mitochondrial (Mt genes), nuclear 28S, and other coding and non-coding regions for each of five pteropod species. Numbers in brackets represent the total number of targets in that category on the set of target probes designed for Limacina bulimoides
| Species | Biomin. (35) | Ortholog. (643) | Mt genes (10) | 28S (1) | Coding (2169) | Non-coding (41) | Total (2899) |
|---|---|---|---|---|---|---|---|
| 32 | 635 | 1 | 1 | 2140 | 13 | 2822 | |
| 7 | 169 | 3 | 1 | 436 | 4 | 620 | |
| 0 | 90 | 2 | 1 | 209 | 0 | 302 | |
| 0 | 52 | 3 | 1 | 121 | 0 | 177 | |
| 0 | 20 | 1 | 1 | 61 | 0 | 83 |
Fig. 2Number of single nucleotide polymorphisms (SNPs) per recovered target for the five pteropod species of the superfamily Limacinoidea (see legend), based on filtering settings of minimum presence in 80% of individuals with at least 5x read depth
Fig. 4Log-scaled number of SNPs against genetic divergence from the focal species Limacina bulimoides shows that there is a sharp reduction in the SNPs recovered with genetic distance
Collection details of specimens from five shelled pteropod species: Limacina bulimoides, L. trochiformis, L. lesueurii, L. helicina and Heliconoides inflatus. Three individuals per site were included from localities in the Atlantic and Pacific Oceans. Latitude and longitude are presented in the decimal system, with positive values indicating North and East and negative values, South and West, respectively
| Species | Location | Latitude | Longitude | n | Collection Date |
|---|---|---|---|---|---|
| South Atlantic | −18.32 | −25.08 | 3 | 18/10/2014 | |
| South Atlantic | − 24.45 | −25.05 | 3 | 21/10/2014 | |
| South Atlantic | −27.77 | −25.02 | 3 | 22/10/2014 | |
| South Atlantic | −14.67 | −25.07 | 3 | 17/10/2014 | |
| South Atlantic | −18.32 | −25.08 | 3 | 18/10/2014 | |
| North Pacific | 22.65 | − 157.69 | 3 | 03/07/2017 | |
| North Atlantic | 20.40 | −38.61 | 3 | 24/10/2012 | |
| South Atlantic | −15.30 | −25.07 | 3 | 05/11/2012 | |
| South Atlantic | −24.13 | −25.00 | 3 | 09/11/2012 | |
| South Atlantic | −40.12 | −30.92 | 3 | 26/10/2014 | |
| South Atlantic | −41.48 | −33.87 | 3 | 27/10/2014 | |
| North Pacific | 48.36 | −126.31 | 3 | 06/03/2016 | |
| North Atlantic | 25.48 | −39.00 | 3 | 22/10/2012 | |
| South Atlantic | −8.08 | −25.04 | 3 | 03/11/2012 | |
| South Atlantic | −38.08 | −39.31 | 3 | 16/11/2012 |