| Literature DB >> 35505878 |
John Fredy Gómez Agudelo1, Vito Antonio Mastrochirico-Filho1, Carolina Heloisa de Souza Borges1, Raquel Belini Ariede1, Lieschen Valeria Guerra Lira1, Rubens Ricardo de Oliveira Neto1, Milena Vieira de Freitas1, Gustavo Adolfo Lenis Sucerquia2, Manuel Vera3, Milthon Honorio Muñoz Berrocal4, Diogo Teruo Hashimoto1.
Abstract
Tambaqui or cachama (Colossoma macropomum) is one of the most important neotropical freshwater fish used for aquaculture in South America, and its production is concentrated at low latitudes (close to the Equator, 0°), where the water temperature is warm. Therefore, understanding how selection shapes genetic variations and structure in farmed populations is of paramount importance in evolutionary biology. High-throughput sequencing to generate genome-wide data for fish species allows for elucidating the genomic basis of adaptation to local or farmed conditions and uncovering genes that control the phenotypes of interest. The present study aimed to detect genomic selection signatures and analyze the genetic variability in farmed populations of tambaqui in South America using single-nucleotide polymorphism (SNP) markers obtained with double-digest restriction site-associated DNA sequencing. Initially, 199 samples of tambaqui farmed populations from different locations (located in Brazil, Colombia, and Peru), a wild population (Amazon River, Brazil), and the base population of a breeding program (Aquaculture Center, CAUNESP, Jaboticabal, SP, Brazil) were genotyped. Observed and expected heterozygosity was 0.231-0.350 and 0.288-0.360, respectively. Significant genetic differentiation was observed using global FST analyses of SNP loci (FST = 0.064, p < 0.050). Farmed populations from Colombia and Peru that differentiated from the Brazilian populations formed distinct groups. Several regions, particularly those harboring the genes of significance to aquaculture, were identified to be under positive selection, suggesting local adaptation to stress under different farming conditions and management practices. Studies aimed at improving the knowledge of genomics of tambaqui farmed populations are essential for aquaculture to gain deeper insights into the evolutionary history of these fish and provide resources for the establishment of breeding programs.Entities:
Keywords: South American Aquaculture; genetic structure; neotropical fish; serrasalmidae; signatures of selection; stress in aquaculture
Year: 2022 PMID: 35505878 PMCID: PMC9046916 DOI: 10.1111/eva.13351
Source DB: PubMed Journal: Evol Appl ISSN: 1752-4571 Impact factor: 4.929
Parameters of genetic diversity in farmed populations of tambaqui Colossoma macropomum considering 1633 SNPs
| Sampling site |
| Na | Ho (SD) | He (SD) |
| HWE | MAF > 0.1 (%) |
|---|---|---|---|---|---|---|---|
| BR1 | 14 | 1.968 | 0.350 (0.198) | 0.351 (0.140) | 0.060 | 1623 | 1401 (85.8) |
| BR2 | 19 | 1.975 | 0.314 (0.164) | 0.349 (0.138) | 0.131 | 1614 | 1409 (86.3) |
| BR3 | 20 | 1.966 | 0.231 (0.139) | 0.345 (0.139) | 0.357 | 1527 | 1408 (86.2) |
| BR4 | 11 | 1.983 | 0.311 (0.164) | 0.358 (0.129) | 0.193 | 1632 | 1452 (88.9) |
| BR5 | 15 | 1.950 | 0.256 (0.159) | 0.342 (0.145) | 0.287 | 1600 | 1394 (85.4) |
| BR6 | 18 | 1.971 | 0.290 (0.154) | 0.345 (0.138) | 0.190 | 1616 | 1.395 (85.4) |
| BR All | 97 | 2.000 | 0.290 (0.109) | 0.368 (0.112) | 0.225 | 1195 | 1521 (93.1) |
| BRGEN | 15 | 1.985 | 0.306 (0.160) | 0.353 (0.131) | 0.170 | 1616 | 1452 (88.9) |
| COL1 | 19 | 1.944 | 0.283 (0.179) | 0.336 (0.152) | 0.190 | 1592 | 1338 (81.9) |
| COL2 | 14 | 1.897 | 0.260 (0.183) | 0.321 (0.164) | 0.234 | 1609 | 1278 (78.3) |
| COL3 | 16 | 1.985 | 0.293 (0.157) | 0.360 (0.130) | 0.222 | 1603 | 1444 (88.4) |
| COL4 | 12 | 1.878 | 0.250 (0.183) | 0.312 (0.168) | 0.247 | 1617 | 1239 (75.9) |
| COL5 | 6 | 1.775 | 0.263 (0.236) | 0.288 (0.185) | 0.188 | 1633 | 1154 (70.6) |
| COL All | 67 | 1994 | 0.274 (0.132) | 0.351 (0.134) | 0.230 | 1294 | 1410 (86.3) |
| PER | 14 | 1.954 | 0.249 (0.157) | 0.343 (0.144) | 0.325 | 1601 | 1380 (84.5) |
| WILD | 6 | 1.908 | 0.263 (0.195) | 0.337 (0.152) | 0.330 | 1633 | 1371 (84.0) |
Standard deviation values (SD) are between parentheses.
Abbreviations: FIS, inbreeding coefficient; He, expected heterozygosity; Ho, observed heterozygosity; HWE, loci in Hardy–Weinberg equilibrium after FDR‐BY correction (p‐adjusted > 0.0063);MAF, minimum allelic frequency; N, sampling size; Na, average number of alleles per loci.
Effective population size (Ne) parameter based on linkage disequilibrium, considering 1633 SNPs
| Sampling site |
| Ne | CI 95% | Jackknife CI | ∆F |
|---|---|---|---|---|---|
| BR1 | 14 | 12.1 | 11.8–12.2 | 6.6–26.5 | 0.041 |
| BR2 | 19 | 17.8 | 17.5–18.1 | 10.5–37.2 | 0.028 |
| BR3 | 20 | 19.9 | 19.6–20.4 | 10.8–55.3 | 0.025 |
| BR4 | 11 | 54.7 | 53.5–62.1 | 11.1–Inf. | 0.009 |
| BR5 | 15 | 15.1 | 15.1–15.8 | 8.2–40.5 | 0.033 |
| BR6 | 18 | 26.5 | 25.8–27.1 | 13.7–95.2 | 0.018 |
| BRGEN | 15 | 9.8 | 9.7–10.0 | 3.6–28.3 | 0.051 |
| COL1 | 19 | 14.0 | 13.6–14.0 | 9.5–21.6 | 0.036 |
| COL2 | 14 | 34.8 | 32.6–35.6 | 15.6–640.2 | 0.014 |
| COL3 | 16 | 46.9 | 45.2–49.0 | 21.0–Inf. | 0.011 |
| COL4 | 12 | 17.2 | 16.7–17.8 | 7.1–188.1 | 0.029 |
| COL5 | 6 | 75.6 | 42.6–62.6 | 7.3–Inf. | 0.006 |
| PER | 14 | 25.8 | 24.7–26.4 | 10.8–2164.4 | 0.019 |
| WILD | 6 | Inf. | Inf.–Inf. | 8.3–Inf. | Infinite |
Abbreviations: CI, confidence interval 95%; inbreeding coefficient (∆F = 1 / 2Ne);Inf.: infinite; N, number of breeders for each fish farm.
Values of population differentiation based on F ST (below the diagonal) and Ney genetic distance (above the diagonal)
| BR1 | BR2 | BR3 | BR4 | BR5 | BR6 | BRGEN | COL1 | COL2 | COL3 | COL4 | COL5 | PER | WILD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BR1 | 0 | 0.044 | 0.037 |
| 0.041 | 0.037 | 0.042 | 0.084 | 0.089 | 0.047 | 0.095 | 0.099 | 0.059 | 0.025 |
| BR2 | 0.048 | 0 | 0.035 | 0.014 | 0.040 |
| 0.046 | 0.080 | 0.082 | 0.046 | 0.086 | 0.092 | 0.058 | 0.028 |
| BR3 | 0.039 | 0.043 | 0 |
|
| 0.036 | 0.035 | 0.067 | 0.074 | 0.033 | 0.076 | 0.078 | 0.042 | 0.015 |
| BR4 |
| 0.021 |
| 0 | 0.016 |
| 0.014 | 0.051 | 0.053 | 0.018 | 0.059 | 0.058 | 0.030 |
|
| BR5 | 0.043 | 0.048 |
| 0.020 | 0 | 0.039 | 0.041 | 0.070 | 0.077 | 0.036 | 0.080 | 0.086 | 0.044 | 0.024 |
| BR6 | 0.037 |
| 0.041 |
| 0.044 | 0 | 0.042 | 0.080 | 0.083 | 0.044 | 0.085 | 0.092 | 0.056 | 0.026 |
| BRGEN | 0.049 | 0.058 | 0.044 | 0.020 | 0.050 | 0.050 | 0 | 0.051 | 0.044 | 0.023 | 0.048 | 0.054 | 0.032 |
|
| COL1 | 0.097 | 0.092 | 0.077 | 0.065 | 0.083 | 0.091 | 0.057 | 0 | 0.040 | 0.009 | 0.049 | 0.051 |
| 0.040 |
| COL2 | 0.095 | 0.091 | 0.079 | 0.067 | 0.088 | 0.094 | 0.048 | 0.042 | 0 | 0.019 |
|
| 0.033 | 0.046 |
| COL3 | 0.055 | 0.058 | 0.040 | 0.028 | 0.048 | 0.056 | 0.032 | 0.016 | 0.022 | 0 | 0.023 | 0.023 |
|
|
| COL4 | 0.102 | 0.099 | 0.084 | 0.071 | 0.091 | 0.098 | 0.055 | 0.057 |
| 0.028 | 0 |
| 0.039 | 0.042 |
| COL5 | 0.107 | 0.106 | 0.086 | 0.072 | 0.092 | 0.109 | 0.057 | 0.050 |
| 0.029 |
| 0 | 0.036 | 0.048 |
| PER | 0.072 | 0.065 | 0.049 | 0.039 | 0.058 | 0.066 | 0.039 |
| 0.032 |
| 0.046 | 0.039 | 0 |
|
| WILD | 0.029 | 0.039 |
|
| 0.036 | 0.033 |
| 0.050 | 0.048 |
| 0.047 | 0.057 |
| 0 |
All values were significant (p‐value < 0.05) except those shown in bold letters.
FIGURE 1Heatmap showing differentiation between tambaqui stocks from South America based on F ST
FIGURE 2Population genetic analysis of tambaqui stocks from South America. Multidimensional scaling (MDS) analysis resulted from 199 individuals. Individuals were plotted according to their coordinates on the first two components (C1 and C2)
FIGURE 3Plot of discriminant analysis of principal components (DAPC) showing relationship of 199 individual fish color coded by sample site, representing the structure between tambaqui stocks from South America. Twenty five PCs were retained using a‐score optimization
FIGURE 4Analysis of genetic structure of tambaqui stocks from South America. Genetic structure was analyzed approaching K = 2 according to Delta K statistics. Each vertical bar represents an individual. Populations are separated by vertical white bars. The color proportions of each bar correspond to the estimated fractions of association of the individuals in each of the clusters
Analysis of molecular variance (AMOVA) of tambaqui stocks Colossoma macropomum
|
| Variance component | % Variation | |
|---|---|---|---|
| Model I—Structure (K = 2) | |||
| Among groups ( | 0.033* | 10.33 | 3.31 |
| Among populations within groups ( | 0.035* | 10.60 | 3.40 |
| Within populations ( | 0.067* | 290.96 | 93.29 |
| Model II—DAPC analysis | |||
| Among groups ( | 0.035* | 10.93 | 3.51 |
| Among populations within groups ( | 0.030* | 9.14 | 2.94 |
| Within populations ( | 0.064* | 290.96 | 93.55 |
For Model I, the populations were grouped following population units (K) described by structure analyses (K = 2), while for model II, the populations were grouped by DAPC results. *p‐value > 0.01.
Summary of outliers identified by the three utilized methods
| SNP ID | BayeScan | Arlequin (H) | Arlequin (nH) | Blastn | |||||
|---|---|---|---|---|---|---|---|---|---|
| Log10(PO) |
|
| Obs |
| Obs |
| Gene | Annotations | |
| cmj_57835:205 | 1.833 | 0.014 | 0.255 | 0.361 | 0.008 | 0.291 | 0.005 |
| I,ST |
| cmj_49626:177 | 1.000 | 0.052 | 0.211 | 0.380 | 0.011 | 0.301 | 0.002 |
| ST |
| cmj_139241:10 | 1.269 | 0.033 | 0.240 | – | – | – | – |
| I |
| cmj_40107:214 | – | – | – | 0.413 | 0.001 | 0.335 | 0.001 |
| ST |
| cmj_115873:85 | – | – | – | 0.430 | 0.003 | 0.354 | 0.002 |
| ST |
| cmj_13492:21 | – | – | – | 0.427 | 0.005 | 0.301 | 0.002 |
| ST |
| cmj_158265:204 | 0.395 | 0.006 | 0.347 | 0.0003 |
| ST | |||
| cmj_3210:95 | 0.371 | 0.006 | 0.296 | 0.004 |
| S | |||
| cmj_185105:197 | – | – | – | 0.397 | 0.006 | 0.304 | 0.004 |
| ST |
| cmj_47461:171 | 0.383 | 0.007 | 0.312 | 0.0007 |
| I | |||
| cmj_34560:96 | – | – | – | 0.361 | 0.007 | 0.283 | 0.006 |
| S |
| cmj_44265:124 | – | – | – | 0.389 | 0.008 | 0.311 | 0.004 |
| I, ST |
| cmj_97663:203 | – | – | – | 0.391 | 0.008 | 0.311 | 0.004 |
| I |
| cmj_85247:50 | 0.352 | 0.009 | 0.280 | 0.008 |
| ST | |||
| cmj_919:262 | – | – | – | 0.335 | 0.010 | 0.281 | 0.007 |
| I |
Abbreviations: H, hierarchical island model. Blastn annotations: I, genes related to defense response against pathogens; nH, finite model; S, genes related to sensory system; ST, genes related to stress responses.