| Literature DB >> 28086785 |
Ketil Malde1,2, Bjørghild B Seliussen1, María Quintela1, Geir Dahle1, Francois Besnier1, Hans J Skaug1,3, Nils Øien1, Hiroko K Solvang1, Tore Haug4, Rasmus Skern-Mauritzen1, Naohisa Kanda5,6, Luis A Pastene5, Inge Jonassen2, Kevin A Glover7,8.
Abstract
BACKGROUND: In the marine environment, where there are few absolute physical barriers, contemporary contact between previously isolated species can occur across great distances, and in some cases, may be inter-oceanic. An example of this can be seen in the minke whale species complex. Antarctic minke whales are genetically and morphologically distinct from the common minke found in the north Atlantic and Pacific oceans, and the two species are estimated to have been isolated from each other for 5 million years or more. Recent atypical migrations from the southern to the northern hemisphere have been documented and fertile hybrids and back-crossed individuals between both species have also been identified. However, it is not known whether this represents a contemporary event, potentially driven by ecosystem changes in the Antarctic, or a sporadic occurrence happening over an evolutionary time-scale. We successfully used whole genome resequencing to identify a panel of diagnostic SNPs which now enable us address this evolutionary question.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28086785 PMCID: PMC5237217 DOI: 10.1186/s12864-016-3416-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Samples used in the present study
| Sample | Total | Sequenced | Genotyped |
|---|---|---|---|
| Samples for identification of markers, creation and validation of the genetic baseline | |||
|
| 95 | 36 | 95 |
|
| 127 | 36 | 127 |
|
| 95 | 36 | 95 |
|
| 15 | 11 | 11a |
| Samples to be identified against the established genetic baseline | |||
| ID 901016 (migrant, 1996) | 1 | 0 | 1 |
| ID 701550 (hybrid, 2007) [ | 1 | 0 | 1 |
| ID 1001065 + 1001065_fetus (pregnant hybrid + fetus, 2010) [ | 2 | 0 | 2 |
| ID 9601017, 1101069, 1101158, 1101205, 1401130 (Outlier/abnormal individuals from NMDR, 2009–2014)b | 5 | 0 | 5 |
aDwarfs were sequenced in individually tagged lanes and their genotypes were inferred from the sequence data
bPreviously unpublished, extracted from NMDR based on visual inspection of microsatellites
Sequencing results, showing the number of sequences produced for each pool, and the result of mapping them to the reference genome sequence
| Pool | Species | Individuals ( | Reads (millions) | Mapped (millions) | Coverage (%) |
|---|---|---|---|---|---|
| AN1 |
| 18 | 412.5 | 365.9 | 12.320 |
| AN2 | 18 | 366.5 | 328.0 | 11.044 | |
| AT1 |
| 18 | 386.4 | 352.7 | 11.873 |
| AT2 | 18 | 389.1 | 352.0 | 11.852 | |
| PA1 |
| 18 | 384.8 | 349.6 | 11.770 |
| PA2 | 18 | 380.8 | 345.0 | 11.618 | |
| DW |
| 11 | 332.6 | 306.4 | 7.150 |
Marker panel genotyping results. Each assay contains between 22 and 26 usable markers, and 15 to 24 appear to be fully diagnostic – viz., we have not observed any presence of minor alleles in the two subspecies or populations. Except for the dwarf subspecies, where the number of specimens is small, a 95% confidence upper bound for the minor allele frequency of an apparently fully diagnostic marker is between 3 and 4%
| Panel | Population 1 | Population 2 | No markers used | “Fully diagnostic” markers |
|---|---|---|---|---|
| WP1 |
|
| 26 | 24 |
| WP2 |
|
| 24 | 16 |
| WP3 |
|
| 23 | 22 |
| WP4 |
|
| 22 | 15 |
| WP5 |
|
| 24 | 21 |
Fig. 1Outline of the simulated back-crossing, generating F1 hybrids, and subsequent back-crossed generations into the Antarctic and Atlantic pure species
Classification intervals, giving the range of heterozygote marker counts leading to Fn classification under different assumptions of allele fixedness, and the probability of the classification of the specific back cross generation being correct, using a maximum likelihood model
| Diagnostic markers | Introgressing MAF = 0.05 | MAF = 0.05 | |||||||
|---|---|---|---|---|---|---|---|---|---|
| from | to | prob. correct | from | to | prob. correct | from | to | prob. correct | |
| F1 | 36 | 50 | 1.000 | 36 | 50 | 1.000 | 36 | 50 | 1.000 |
| F2 | 19 | 35 | 0.966 | 18 | 35 | 0.963 | 20 | 35 | 0.940 |
| F3 | 10 | 18 | 0.808 | 9 | 17 | 0.836 | 13 | 19 | 0.687 |
| F4 | 5 | 9 | 0.677 | 5 | 8 | 0.590 | 9 | 12 | 0.501 |
| F5 | 3 | 4 | 0.411 | 3 | 4 | 0.402 | 7 | 8 | 0.306 |
| F6 | 2 | 2 | 0.261 | 2 | 2 | 0.254 | 6 | 6 | 0.171 |
| F7 | 1 | 1 | 0.361 | 1 | 1 | 0.357 | |||
| Native | 0 | 0 | 1.000 | 0 | 0 | 1.000 | 0 | 4 | 0.479 |
Note that with minor allele frequency of 0.05, there is no number of heterozygote markers where the F7 generation is most likely to produce, and thus we will never classify a specimen as F7. The same holds for F8 and above under all assumptions of minor allele frequencies.
Fig. 2STRUCTURE barplot for the baseline individuals (N = 1101 divided into 11 categories): Inferred ancestry of individuals was calculated after averaging ten STRUCTURE runs with CLUMPP at K = 2 using a) popinfo, and b) without popinfo. ACU stands for the Atlantic B. acutorostrata and BON for the Antarctic B. bonaerensis
Fig. 3Posterior probabilities of belonging to the parental species Atlantic minke whale (ACU), B. acutorostrata, (left column) and Antarctic minke whale (BON), B. bonaerensis (right column) for the 1101 baseline individuals divided into 11 groups. Distribution of the averaged qi(open circles) and 90% CI (bars) values computed by STRUCTURE after ten runs at K = 2 using popinfo (a, b) and without popinfo (c, d)
Distribution of 457 simulated test individuals into the baseline of 1101 individuals. Correct assignment is depicted in boldface type
| Population | BON | F5B | F4B | F3B | F2B | F1 | F2A | F3A | F4A | F5A | ACU |
|---|---|---|---|---|---|---|---|---|---|---|---|
| F2B | 1 |
| |||||||||
| F3B | 9 |
| |||||||||
| F4B | 23 |
| 4 | ||||||||
| F5B | 9 |
| 6 | 1 | |||||||
| F1 |
| ||||||||||
| F2A | 1 |
| 2 | ||||||||
| F3A | 4 |
| 10 | 1 | |||||||
| F4A | 8 |
| 20 | ||||||||
| F5A | 1 | 11 |
| 6 |
Summary of classification of migrant and hybrid specimens according different methods: the original classification using microsatellites, the classification using SNP-specific markers from the B. bonaerensis (BON) vs. B. a. acutorostrata (ACU) specific assays WP1 and WP2, the classification using SNP without 5% MAF, the direct assignment using STRUCTURE and ONCOR against a simulated SNP-based baseline. Column “ID” gives the internal NMDR identification numbers
| ID | Classif microsatellites | Classif SNP | Classif SNP w/ 5% MAF | STRUCTURE classif against simulated SNP-based baseline | GeneClass direct assignment against simulated SNP-based baseline (and score, %) | ONCOR direct assignment against simulated SNP-based baseline (and probability) |
|---|---|---|---|---|---|---|
| 9601017 [ | BON | BON | BON | BON | BON (93.86) | BON (1.000) |
| 701550 [ | F1 | F1 | F1 | F1 | F1 (100) | F1 (1.000) |
| 901016a | ACU (outlier) | ACU | ACU | ACU | ACU (93.74) | ACU (1.000) |
| 1001065 [ | F1 | F1 | F1 | F1 | F1 (100) | F1 (1.000) |
| 1001065_fetus [ | F2 (F1 + ACU) | F2A | F2A | F2A | F2A (99.92) | F2A (0.999) |
| 1101069a | ACU (outlier) | ACU | ACU | ACU | ACU (93.90) | ACU (1.000) |
| 1101158a | ACU (outlier) | F2A, most likely | F3A/F2A | F2A | F2A (76.89) | F2A (1.000) |
| 1101205a | ACU (outlier) | ACU | ACU | ACU | ACU (93.90) | ACU (1.000) |
| 1401130a | ACU (outlier) | ACU | ACU | ACU | ACU (93.74) | ACU (1.000) |
aPreviously unpublished, extracted from NMDR based on visual inspection of microsatellites