| Literature DB >> 35177872 |
A Ciezarek1, Antonia G P Ford2, Graham J Etherington1, Nasser Kasozi3, Milan Malinsky4, Tarang K Mehta1, Luca Penso-Dolfin5, Benjamin P Ngatunga6, Asilatu Shechonge6, Rashid Tamatamah6, Wilfried Haerty1, Federica Di Palma7, Martin J Genner8, George F Turner9.
Abstract
Cichlid fish of the genus Oreochromis form the basis of the global tilapia aquaculture and fisheries industries. Broodstocks for aquaculture are often collected from wild populations, which in Africa may be from locations containing multiple Oreochromis species. However, many species are difficult to distinguish morphologically, hampering efforts to maintain good quality farmed strains. Additionally, non-native farmed tilapia populations are known to be widely distributed across Africa and to hybridize with native Oreochromis species, which themselves are important for capture fisheries. The morphological identification of these hybrids is particularly unreliable. Here, we describe the development of a single nucleotide polymorphism (SNP) genotyping panel from whole-genome resequencing data that enables targeted species identification in Tanzania. We demonstrate that an optimized panel of 96 genome-wide SNPs based on FST outliers performs comparably to whole genome resequencing in distinguishing species and identifying hybrids. We also show this panel outperforms microsatellite-based and phenotype-based classification methods. Case studies indicate several locations where introduced aquaculture species have become established in the wild, threatening native Oreochromis species. The novel SNP markers identified here represent an important resource for assessing broodstock purity in hatcheries and helping to conserve unique endemic biodiversity.Entities:
Keywords: Fisheries; Hybridization; Oreochromis; Tanzania; Tilapia
Year: 2022 PMID: 35177872 PMCID: PMC8655616 DOI: 10.1016/j.aquaculture.2021.737637
Source DB: PubMed Journal: Aquaculture ISSN: 0044-8486 Impact factor: 4.242
Fig. 1Sample locations for the three focal species within Tanzania (right panel) and Lake Albert (Uganda). The sampling location in Uganda is approximately 340 km north-west of Tanzania, as indicated in the bottom-left panel. Abbreviations in ‘Species present’: n: native; e: exotic. Abbreviations in reference notations on map: d: genome-wide sequencing used for design of SNP array; t: test individuals sequenced using SNP array, used as reference for assigning species. Shapefiles sourced from the ArcGIS Hub (continental boundaries), the ICPAC GeoPortal (Tanzania rivers) and the Humanitarian Data Exchange (Tanzania boundary).
Fig. 2a) PCA of the 118 SNPs, extracted from the full-genome SNP calls from the 100 individuals which were used for initial SNP calling to design the SNP panel. b) neighbor-joining tree of the 118 SNPs from the same 100 individuals. Samples are labelled with their morphological ID, sampling location and sample ID, separated by double underscores.
Fig. 3a-c) fastSTRUCTURE analysis, comparison between the 96 SNP set and: a) 118 SNPs; b) genome-wide SNPs; c) microsatellites. For each of a), b) and c), the top row gives the assignments according to the 96 SNP panel, the second row gives the assignment according to the dataset being compared against, and the bottom row gives single colour block assignments for each individual, for ease of comparison between datasets. d) PCA of the optimal 96 SNP panel, PC1 vs. PC2. The right-hand panel includes representative photographs of mature adults of each species (not to scale). Bars in the fastSTRUCTURE and points in the PCA are colored according to species: blue corresponds to O. leucostictus; red is O. urolepis, cyan is O. niloticus and grey is hybrid.
Comparison between species assignments between the 96 SNP panel,morphological identification and the 118 SNP, full-genome and microsatellite datasets.
| Dataset 1 | Dataset 2 | Number of individuals | Number of individuals with the same assignment | Figure |
|---|---|---|---|---|
| 96 SNP panel | 118 SNP panel | 164 | 164 (100%) | 3a |
| 96 SNP panel | 18 microsatellite array - prior cluster assignment using adegenet | 54 | 48 (89%) | 3c |
| 96 SNP panel | 1,822,719 genome-wide SNPs | 35 | 34 (97%) | 3b |
| 96 SNP panel | Morphological classification | 164 | 129 (79%) | 3a |
| Morphological classification | 18 microsatellite array - prior cluster assignment using adegenet | 54 | 32 (59%) | 3c |
| Morphological classification | 1,822,719 genome-wide SNPs | 35 | 20 (57%) | 3b |
Number of individuals of each species identified in each sampling location by the 96 SNP panel.
| Location | Hybrids | |||
|---|---|---|---|---|
| Mindu Dam | 13 | 6 | 0 | 7 |
| Kilosa | 6 | 32 | 18 | 4 |
| Kidatu resevoir | 5 | 0 | 14 | 2 |
| Lake Mansi/Lugongwe/Mindu | 20 | 0 | 0 | 0 |