| Literature DB >> 31506510 |
Mochamad Syaifudin1,2, Michaël Bekaert1, John B Taggart1, Kerry L Bartie1, Stefanie Wehner1,3, Christos Palaiokostas1, M G Q Khan1,4, Sarah-Louise C Selly1, Gideon Hulata5, Helena D'Cotta6,7, Jean-Francois Baroiller6,7, Brendan J McAndrew1, David J Penman8.
Abstract
Tilapias (family Cichlidae) are of importance in aquaculture and fisheries. Hybridisation and introgression are common within tilapia genera but are difficult to analyse due to limited numbers of species-specific genetic markers. We tested the potential of double digested restriction-site associated DNA (ddRAD) sequencing for discovering single nucleotide polymorphism (SNP) markers to distinguish between 10 tilapia species. Analysis of ddRAD data revealed 1,371 shared SNPs in the de novo-based analysis and 1,204 SNPs in the reference-based analysis. Phylogenetic trees based on these two analyses were very similar. A total of 57 species-specific SNP markers were found among the samples analysed of the 10 tilapia species. Another set of 62 species-specific SNP markers was identified from a subset of four species which have often been involved in hybridisation in aquaculture: 13 for Oreochromis niloticus, 23 for O. aureus, 12 for O. mossambicus and 14 for O. u. hornorum. A panel of 24 SNPs was selected to distinguish among these four species and validated using 91 individuals. Larger numbers of SNP markers were found that could distinguish between the pairs of species within this subset. This technique offers potential for the investigation of hybridisation and introgression among tilapia species in aquaculture and in wild populations.Entities:
Mesh:
Substances:
Year: 2019 PMID: 31506510 PMCID: PMC6737103 DOI: 10.1038/s41598-019-48339-2
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Origin of tilapia samples used in ddRAD analysis, SNP validation by KASP and COI sequencing.
| Species | Acronym | Strain/Population | ddRAD | KASP | COI | Sampling origin |
|---|---|---|---|---|---|---|
| Oni | Stirling | 6 | 14 | 5 | Lake Manzala, Egypt | |
| Kpandu | 12 | 8 | 2 | Ghana | ||
| Nyinuto | 12 | 8 | 2 | Ghana | ||
| Hora | 13 | 0 | 2 | Ethiopia | ||
| Koka | 12 | 0 | 0 | Ethiopia | ||
| Metahara | 8 | 0 | 0 | Ethiopia | ||
| Omo | Stirling | 5 | 7 | 3 | Zimbabwe | |
| Natal | 10 | 8 | 1 | South Africa | ||
| Singapore | 0 | 7 | 0 | Unknown | ||
| Eastern Cape | 0 | 0 | 2 | South Africa | ||
| Western Cape | 0 | 0 | 1 | South Africa | ||
| Oau | Stirling | 5 | 8 | 1 | Lake Manzala, Egypt | |
| Ein Feskha | 10 | 15 | 2 | Israel | ||
| Oka | Stirling | 5 | 0 | 3 | Lake Malawi | |
| Oho | Israel | 5 | 16 | 2 | Tanzania | |
| Stirling | 0 | 0 | 1 | Unknown | ||
| Tzi | Stirling | 5 | 0 | 3 | Lake Manzala, Egypt | |
| Ghana | 5 | 0 | 3 | Ghana | ||
| Sga | Israel | 5 | 0 | 3 | Israel | |
| Oan | Itezhi-tezhi | 6 | 0 | 2 | Zambia | |
| Oma | Itezhi-tezhi | 4 | 0 | 2 | Zambia | |
| Sme | Ghana | 4 | 0 | 3 | Ghana | |
|
|
|
|
|
A total of 34 samples (10 O. niloticus, 6 O. mossambicus, 13 O. aureus and 5 O. u. hornorum) were analysed using both ddRADseq and KASP. The 43 samples analysed by COI sequencing comprised representatives from each of the ten-fish species (of which 29 samples were also characterised by ddRAD).
Figure 1A gene tree of tilapia species inferred from COI sequences, and rooted to T. zillii. All sequences were aligned, and the tree was constructed using RAxML. *Samples not used for ddRAD seq. **Sample also used in KASP validation.
Figure 2The number of reads, loci and markers for the de novo-based analysis (DBA) and reference genome-based analysis (RBA).
Number of species-specific SNPs identified in loci with up to five species-specific SNPs per locus.
| SNPs/locus | Species | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Oni | Omo | Oau | Oho | Oan | Oma | Oka | Sme | Sga | Tzi | |
| 1 | 1/1 | 0/0 | 0/0 | 0/0 | 0/0 | 0/0 | 1/1 | 0/0 | 3/3 | 0/3 |
| 1–2 | 1/1 | 2/4 | 0/0 | 1/1 | 0/0 | 0/0 | 1/1 | 1/3 | 6/8 | 25/39 |
| 1–3 | 4/6 | 11/16 | 0/1 | 3/6 | 1/1 | 0/0 | 2/2 | 1/7 | 11/19 | 67/101 |
| 1–4 | 10/15 | 19/29 | 1/3 | 6/8 | 1/3 | 0/0 | 4/5 | 4/12 | 21/33 | 142/214 |
| 1–5 | 19/32 | 47/63 | 7/11 | 8/11 | 2/4 | 0/1 | 5/5 | 14/26 | 45/67 | 280/424 |
Numbers shown are for DBA/RBA respectively (see text for further details).
Figure 3Physical mapping of 644 species-specific SNP markers (RBA, with up to five SNPs per ddRAD locus – see text for further details) in the reference genome.
Figure 4The discriminant ability of RAD markers. (A) Phylogenetic tree of tilapia species inferred from 1,204 shared markers from reference-based analysis (RBA) and rooted to T. zillii. All sequences were aligned, and the tree was constructed using RAxML. The best ML bipartition tree is provided with bootstrap support when higher than 75%. (B) An enlarged version of the phylogenetic tree shown, encompassing the two subspecies of O. niloticus. (C) Principal Component Analysis of the 1,204 shared markers.
Summary of diagnostic allele frequency for 24 putative diagnostic markers identified by ddRAD and tested by KASP genotyping assays in 91 individuals to distinguish among O. niloticus, O. mossambicus, O. aureus and O. u. hornorum.
| Diagnostic for | Frequency of putative diagnostic allele (%) | |||||
|---|---|---|---|---|---|---|
| Marker name | Diagnostic SNP | |||||
| A | 100* | 0 | 0 | 0 | ||
| T | 100* | 0 | 0 | 31 | ||
| A | 100* | 0 | 0 | 0 | ||
| G | 97* | 0 | 0 | 0 | ||
| A | 97* | 0 | 0 | 0 | ||
| A | 0 | 100* | 0 | 0 | ||
| C | 0 | 100* | 0 | 100 | ||
| C | 0 | 100* | 0 | 100 | ||
| A | 0 | 100* | 0 | 100 | ||
| C | 0 | 100* | 0 | 100 | ||
| T | 0 | 100* | 0 | 100 | ||
| A | 0 | 100* | 0 | 100 | ||
| A | 0 | 95* | 0 | 0 | ||
| G | 0 | 91* | 0 | 56 | ||
| C | 0 | 0 | 100* | 0 | ||
| G | 0 | 0 | 100* | 0 | ||
| T | 0 | 0 | 100* | 0 | ||
| C | 0 | 0 | 100* | 0 | ||
| A | 3 | 0 | 100* | 0 | ||
| T | 12 | 0 | 100* | 0 | ||
| T | 5 | 0 | 100* | 0 | ||
| A | 0 | 0 | 0 | 100* | ||
| G | 0 | 0 | 0 | 100* | ||
| A | 0 | 0 | 0 | 100* | ||
Refer to Supplementary Table S3 for full genotype data. *Frequencies of the putative diagnostic alleles in the target species.