| Literature DB >> 36092944 |
Artem Nedoluzhko1,2, Fedor Sharko2,3,4, Svetlana Tsygankova5, Eugenia Boulygina5, Natalia Slobodova5, Anton Teslyuk6, Jorge Galindo-Villegas7, Sergey Rastorguev2,4.
Abstract
Interspecific hybridization has occurred relatively frequently during the evolution of vertebrates. This process usually abolishes reproductive isolation between the parental species. Moreover, it results in the exchange of genetic material and can lead to hybridogenic speciation. Hybridization between species has predominately been observed at the interspecific level, whereas intergeneric hybridization is rarer. Here, using whole-genome sequencing analysis, we describe clear and reliable signals of intergeneric introgression between the three-spined stickleback (Gasterosteus aculeatus) and its distant mostly freshwater relative the nine-spined stickleback (Pungitius pungitius) that inhabit northwestern Russia. Through comparative analysis, we demonstrate that such introgression phenomena apparently take place in the moderate-salinity White Sea basin, although it is not detected in Japanese sea stickleback populations. Bioinformatical analysis of the sites influenced by introgression showed that they are located near transposable elements, whereas those in protein-coding sequences are mostly found in membrane-associated and alternative splicing-related genes.Entities:
Keywords: Gasterosteus aculeatus; Pungitius pungitius; hybridization; introgression; membrane-associated genes; stickleback; transposable elements
Year: 2022 PMID: 36092944 PMCID: PMC9452749 DOI: 10.3389/fgene.2022.863547
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.772
Nomenclature and sampling location of the three- and nine-spined stickleback (G. aculeatus and P. pungitus, respectively) specimens used in this study.
| Specimen | Library name | Species | Sampling location ( | Sampling date | Sampling positions (GPS |
|---|---|---|---|---|---|
| Gas1 | LibB3−PP |
| CS, E | 08/07/2014 | 66.298057, 33.400795 |
| Gas2 | Lib311 |
| WS | 08/07/2014 | 66.298404, 33.320415 |
| Pun1 | LibB2−P9M |
| CS, E | 07/07/2014 | 66.298057, 33.400795 |
| Pun2 | LibChu82 |
| CS, E | 07/07/2014 | 66.298057, 33.400795 |
| Pun3 | LibChu83 |
| CS, E | 07/07/2014 | 66.298057, 33.400795 |
| Pun4 | LibD1 |
| CS, upstream | 07/07/2014 | 66.296781, 33.398263 |
| Pun5 | LibChM42 |
| CS, upstream | 07/07/2014 | 66.296781, 33.398263 |
| Pun6 | LibChM72 |
| CS, upstream | 07/07/2014 | 66.296781, 33.398263 |
| Pun7 | LibB9−1 |
| Quarry near, CV | 06/07/2014 | 66.298542, 33.344362 |
| Pun8 | LibK3 |
| Quarry near, CV | 06/07/2014 | 66.298542, 33.344362 |
| Pun9 | LibK7 |
| Quarry near, CV | 06/07/2014 | 66.298542, 33.344362 |
All specimens were collected in the Republic of Karelia, Russia. Chakalovsky stream (CS), Chakalovsky village (CV), Estuary (E), White Sea (WS).
Genome mapping efficiency (ME) and breadth of coverage (BC) percentages (%) of the three fish species reference genomes.
| Specimen name | Library name | Ref. genome 1 | Ref. genome 2 | Ref. genome 3 | |||
|---|---|---|---|---|---|---|---|
|
|
|
| |||||
| ME | BC | ME | BC | ME | BC | ||
| Gas1 | LibB3-PP | 85.36 | 95.00 | 17.36 | 28.95 | 4.82 | 4.30 |
| Gas2 | Lib311 | 91.30 | 94.58 | 18.54 | 29.72 | — | — |
| Pun1 | LibB2-P9M | 17.94 | 50.55 | 77.54 | 84.71 | 3.78 | 3.74 |
| Pun2 | LibChu82 | 9.84 | 30.16 | 42.64 | 84.58 | 2.14 | 4.68 |
| Pun3 | LibChu83 | 11.50 | 29.64 | 47.34 | 84.32 | 2.49 | 4.77 |
| Pun4 | LibD1 | 17.61 | 29.17 | 75.22 | 84.57 | 3.91 | 3.76 |
| Pun5 | LibChM42 | 20.53 | 25.70 | 74.64 | 82.09 | 5.76 | 4.02 |
| Pun6 | LibChM72 | 12.72 | 28.24 | 53.46 | 84.30 | 3.11 | 4.54 |
| Pun7 | LibB9-1 | 18.64 | 28.02 | 79.66 | 84.43 | 3.98 | 3.57 |
| Pun8 | LibK3 | 21.06 | 27.74 | 79.27 | 83.34 | 4.96 | 4.46 |
| Pun9 | LibK7 | 19.42 | 28.64 | 80.15 | 84.32 | 4.21 | 4.28 |
| GasJ | Japan3 | 98.85 | 91.11 | 20.56 | 27.90 | 5.05 | 4.02 |
| PunJ | Japan9 | 17.96 | 27.81 | 17.91 | 39.45 | 3.84 | 3.41 |
Note that all quantities are expressed as percentage (%).
FIGURE 1An overview of the “alien” read rate estimation pipeline: (A) for nine-stickleback sequencing data; (B) for three-spined stickleback sequencing data.
Illumina-generated reads used in this study and Sequence Read Archive (SRA) accession numbers for the generated dataset. (G. aculeatus and P. pungitus, represent the three- and nine-spined stickleback, respectively).
| Specimen | Library name | Species | Population | Number of PE | Source | SRA accession |
|---|---|---|---|---|---|---|
| Gas1 | LibB3-PP |
| White sea | 75,948,125 | This study | SRR11611415 |
| Gas2 | Lib311 |
| White sea | 73,927,012 | This study | SRR11611416 |
| Pun1 | LibB2-P9M |
| White sea | 124,576,543 | This study | SRR11611426 |
| Pun2 | LibChu82 |
| White sea | 129,103,432 | This study | SRR11611420 |
| Pun3 | LibChu83 |
| White sea | 81,714,662 | This study | SRR11611421 |
| Pun4 | LibD1 |
| White sea | 69,716,994 | This study | SRR11611422 |
| Pun5 | LibChM42 |
| White sea | 54,187,021 | This study | SRR11611418 |
| Pun6 | LibChM72 |
| White sea | 75,794,201 | This study | SRR11611419 |
| Pun7 | LibB9-1 |
| White sea | 70,767,251 | This study | SRR11611417 |
| Pun8 | LibK3 |
| White sea | 54,073,367 | This study | SRR11611423 |
| Pun9 | LibK7 |
| White sea | 58,358,776 | This study | SRR11611424 |
| GasJ | Japan3 |
| Japan | 104,697,312 |
| DRR067872 |
| PunJ | Japan9 |
| Japan | 142,279,188 |
| DRR013346 |
PE, reads–paired-end reads.
LibB2-P9M corresponds to SH3 specimen in Nedoluzhko et al. (2021).
FIGURE 2Neighbor-joining clustering of three- and nine-spined sticklebacks based on Nei’s distances. Admixed specimen of nine-spined stickleback species is indicated by a red arrow.
FIGURE 3Analysis of transposable elements specific to the three-spined stickleback (A) Heatmap of transposable element (TE) representation. Columns of the heatmap correspond to specimens, whereas rows correspond to transposable element families present in specimens. The red color on the heatmap indicates low coverage of the corresponding transposable element, whereas the white color shows high coverage. (B) Numbers of “three-spined” TE families present in specimens from the White Sea populations. Only TEs that were fully covered by sequencing reads and had a high mapping depth were taken into account. The specimens were sorted according to the number of TE families present. White Sea stickleback populations are marked as quarry, upstream, estuary, and sea according to the sampling location.
FIGURE 4The D-statistic test indicates introgression between the White Sea three-spined stickleback, G. aculeatus specimen (Gas1) and all of the White Sea nine-spined stickleback P. pungitius specimens with respect to the Japanese nine-spined stickleback specimen (A) and the White Sea nine-spined stickleback specimen (B). Compared with the Japanese nine-spined stickleback, all of the White Sea nine-spined stickleback specimens have considerable introgression levels from three-spined stickleback. D-stat, the result of D-statistic test; SE, standard error; Z, z-score.
FIGURE 5Bar chart of normalized number of reads mapped to the “alien” genome. Three-spined stickleback specimens marked in red; nine-spined stickleback specimens marked in green. White Sea stickleback population marked as quarry, upstream, estuary, and sea, according to the sampling location.
Analysis of relative distances and intersections of introgressed loci with protein-coding genes and transposable elements (TE) versus source of introgressed loci in three-spined stickleback genome and target introgressed loci for LibB2-B9M nine-spined stickleback genome is presented.
| Source of introgression | Target of introgression | |||
|---|---|---|---|---|
| Statistic name | Introgressed loci vs. TE | Introgressed loci vs. genes | Introgressed loci vs. TE | Introgressed loci vs. genes |
| Query population (fragments) | 123,104 | 120,716 | 157,417 | 29,013 |
| Reference population (fragments) | 494,288 | 22,270 | 303,483 | 7,405 |
| Query coverage (nt.) | 15,945,795 | 15,603,801 | 365,694,254 | 21,596,797 |
| Reference coverage (nt.) | 66,062,554 | 189,993,805 | 95,089,834 | 28,967,551 |
| Relative distances Kolmogorov-Smirnov | 0 | 0.0001518632 | 0.0498455334 | 0.0020492999 |
| Relative distances ECDF deviation area | 0.02075064 | 0.001288394 | 0.0006375816 | 0.0033102375 |
| Relative distances ECDF area correlation | 0.08293058 | −0.00525788 | 0.0015108528 | 0.0130496244 |
Gene ontology (GO) analysis of introgressed stickleback genes, conducted using the Database for Annotation, Visualization and Integrated Discovery (DAVID v6.8) for functional annotation by gene ontology (GO) categories. This analysis assigned 205 genes, revealed by BLAST searching the database of three-spined stickleback genes for de novo assembly of “alien” contigs. The results are limited using a false discovery rate (FDR) value of 1%.
| Category | Term | Number of counts | % |
| Pop hits | Fold enrichment | FDR |
|---|---|---|---|---|---|---|---|
| UP_KEYWORDS | Alternative splicing | 150 | 73.2 | 1.97E-10 | 10587 | 1.422 | 2.54E-07 |
| UP_KEYWORDS | Membrane | 112 | 54.6 | 1.09E-07 | 7494 | 1.5 | 1.40E-04 |
| UP_SEQ_FEATURE | Glycosylation site: N-linked (e.g., GlcNAc) | 77 | 37.6 | 5.09E-08 | 4234 | 1.797 | 7.97E-05 |
| UP_KEYWORDS | Glycoprotein | 77 | 37.6 | 5.93E-07 | 4551 | 1.699 | 7.64E-04 |
| UP_KEYWORDS | Cell membrane | 64 | 31.2 | 1.93E-08 | 3175 | 2.024 | 2.49E-05 |
| UP_SEQ_FEATURE | Topological domain: cytoplasmic | 64 | 31.2 | 7.69E-07 | 3456 | 1.83 | 1.20E-03 |
| UP_SEQ_FEATURE | Topological domain: extracellular | 54 | 26.3 | 2.37E-06 | 2787 | 1.915 | 3.70E-03 |
| UP_KEYWORDS | Cell adhesion | 27 | 13.2 | 2.03E-12 | 479 | 5.659 | 2.62E-09 |
| UP_KEYWORDS | Calcium | 25 | 12.2 | 6.86E-06 | 877 | 2.862 | 8.83E-03 |
| UP_KEYWORDS | Cell junction | 24 | 11.8 | 2.70E-07 | 675 | 3.57 | 3.48E-04 |
| GOTERM_BP_DIRECT | Homophilic cell adhesion | 12 | 5.8 | 1.94E-06 | 158 | 6.642 | 3.08E-03 |
| GOTERM_CC_DIRECT | GO:0043204, perikaryon | 10 | 4.9 | 1.97E-06 | 106 | 8.817 | 2.56E-03 |
| UP_SEQ_FEATURE | Domain: cadherin 2 | 10 | 4.9 | 2.49E-06 | 115 | 8.594 | 3.90E-03 |
| UP_SEQ_FEATURE | Domain: cadherin 1 | 10 | 4.9 | 2.49E-06 | 115 | 8.594 | 3.90E-03 |
| INTERPRO | Cadherin | 10 | 4.9 | 5.32E-06 | 118 | 7.825 | 7.65E-03 |
| INTERPRO | Cadherin-like | 10 | 4.9 | 6.10E-06 | 120 | 7.694 | 8.78E-03 |
| UP_SEQ_FEATURE | Domain: EGF-like 2 | 9 | 4.4 | 3.23E-06 | 89 | 9.994 | 5.06E-03 |
| UP_SEQ_FEATURE | Repeat: desmoglein repeat 2 | 4 | 1.9 | 3.99E-06 | 4 | 98.833 | 6.25E-03 |
| UP_SEQ_FEATURE | Repeat: desmoglein repeat 1 | 4 | 1.9 | 3.99E-06 | 4 | 98.833 | 6.25E-03 |
| INTERPRO | Desmoglein | 4 | 1.9 | 4.89E-06 | 4 | 92.333 | 7.04E-03 |