| Literature DB >> 31921307 |
Redempta A Kajungiro1,2, Christos Palaiokostas1, Fernando A Lopes Pinto1, Aviti J Mmochi3, Marten Mtolera3, Ross D Houston4, Dirk Jan de Koning1.
Abstract
Understanding population structure and genetic diversity within and between local Nile tilapia lines cultured in Tanzania is important for sustainable aquaculture production. This study investigated the genetic structure and diversity among seven Nile tilapia populations in Tanzania (Karanga, Igunga, Ruhila, Fisheries Education and Training Agency, Tanzania Fisheries Research Institute, Kunduchi, and Lake Victoria). Double-digest restriction site-associated DNA (ddRAD) libraries were prepared from 140 individual fish (20 per population) and sequenced using an Illumina HiSeq 4000 resulting in the identification of 2,180 informative single nucleotide polymorphisms (SNPs). Pairwise Fst values revealed strong genetic differentiation between the closely related populations; FETA, Lake Victoria, and Igunga and those from TAFIRI and Karanga with values ranging between 0.45 and 0.55. Population structure was further evaluated using Bayesian model-based clustering (STRUCTURE) and discriminant analysis of principal components (DAPC). Admixture was detected among Karanga, Kunduchi, and Ruhila populations. A cross-validation approach (25% of individual fish from each population was considered of unknown origin) was conducted in order to test the efficiency of the SNP markers to correctly assign individual fish to the population of origin. The cross-validation procedure was repeated 10 times resulting in 77% of the tested individual fish being allocated to the correct population. Overall our results provide a new database of informative SNP markers for both conservation management and aquaculture activities of Nile tilapia strains in Tanzania.Entities:
Keywords: Nile tilapia; aquaculture; double-digest restriction site-associated DNA-sequencing; genetic diversity; population structure
Year: 2019 PMID: 31921307 PMCID: PMC6933018 DOI: 10.3389/fgene.2019.01269
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Sampling locations of fish used in the present study.
Figure 2Distribution of minor allele frequencies of double-digest restriction site-associated DNA (ddRAD)–derived single nucleotide polymorphisms (SNPs) in seven populations of Nile tilapia.
Summary of diversity parameters for the seven Nile tilapia populations.
| Population | He | Ho | Fis |
|---|---|---|---|
| Ruhila | 0.212 | 0.113 | 0.275 |
| Karanga | 0.213 | 0.104 | 0.265 |
| TAFIRI | 0.1 | 0.096 | 0.021 |
| Kunduchi | 0.214 | 0.067 | 0.557 |
| Igunga | 0.065 | 0.064 | 0.009 |
| Lake Victoria | 0.061 | 0.061 | 0.005 |
| FETA | 0.057 | 0.057 | 0.006 |
Figure 3Principal components analysis (PCA) of the population for 139 fish individual fish based on 2,180 single-nucleotide polymorphisms (SNPs). The genetic relationships among individual fish as seen when plotting the first and second principal components (PCA1 and PCA2). Each individual is represented by one dot, with its symbol color corresponding to the assigned population.
Pairwise Fst with 95% confidence intervals (CI) among the seven population: TAFIRI, Ruhila, FETA, Lake Victoria, Karanga, Igunga and Kunduchi.
| Population 1 | Population 2 | Lower CI | Upper CI | Fst |
|---|---|---|---|---|
| TAFIRI | Ruhila | 0.17890 | 0.20797 | 0.19322 |
| TAFIRI | FETA | 0.42571 | 0.47996 | 0.45256 |
| TAFIRI | Victoria | 0.40533 | 0.45925 | 0.43243 |
| TAFIRI | Karanga | 0.42979 | 0.45994 | 0.44498 |
| TAFIRI | Igunga | 0.39132 | 0.44652 | 0.41922 |
| TAFIRI | Kunduchi | 0.24015 | 0.27814 | 0.25967 |
| Ruhila | FETA | 0.24998 | 0.27900 | 0.26439 |
| Ruhila | Victoria | 0.23112 | 0.25824 | 0.24545 |
| Ruhila | Karanga | 0.19066 | 0.21102 | 0.20097 |
| Ruhila | Igunga | 0.22390 | 0.25150 | 0.23791 |
| Ruhila | Kunduchi | 0.06998 | 0.08676 | 0.07830 |
| FETA | Victoria | 0.08070 | 0.10897 | 0.09429 |
| FETA | Karanga | 0.53479 | 0.56045 | 0.54758 |
| FETA | Igunga | 0.03443 | 0.05096 | 0.04283 |
| FETA | Kunduchi | 0.10584 | 0.12052 | 0.11242 |
| Victoria | Karanga | 0.52577 | 0.55252 | 0.53849 |
| Victoria | Igunga | 0.02878 | 0.04490 | 0.03670 |
| Victoria | Kunduchi | 0.10591 | 0.11914 | 0.11226 |
| Karanga | Igunga | 0.51982 | 0.54576 | 0.53282 |
| Karanga | Kunduchi | 0.30078 | 0.32315 | 0.31192 |
| Igunga | Kunduchi | 0.08748 | 0.09959 | 0.09326 |
Figure 4STRUCTURE analysis bar plots for K = 3, 5, and 7 (admixture model) showing population structure of different Nile tilapia sub-populations. Each vertical stripe represents an individual. Each color represents the proportion of membership with regard to the each assigned to seven genetic clusters. Same color in different individual fish indicates that they belong to the same cluster.
Figure 5Discriminant analysis of principal components (DAPC) analysis with the find.clusters for 139 individual fish of the O. niloticus cultured in Tanzania. The axes represent the first two linear discriminants (LD). Squares represent groups and dots represent individual fish. Numbers represent the different populations identified by DAPC analysis.
Figure 6Confusion matrix for prediction efficiency of the single-nucleotide polymorphism (SNP) dataset using cross-validation. Four-fold cross-validation was performed where five randomly chosen animals on each population were considered of unknown origin. The entire procedure was repeated 10 times in order to minimize potential bias due to sample allocation in the training/test datasets. The diagonal contains the number of correct population assignments for the overall sum of the cross-validation scheme. Off-diagonals contain the number of erroneously population allocations for each particular case.