| Literature DB >> 34588599 |
Vito A Mastrochirico-Filho1, Raquel B Ariede1, Milena V Freitas1, Carolina H S Borges1, Lieschen V G Lira1, Natália J Mendes1, John F G Agudelo1, Pablo Cáceres2, Milthon H M Berrocal3, Gustavo A L Sucerquia4, Fabio Porto-Foresti1, José M Yáñez2, Diogo T Hashimoto5.
Abstract
Scarce genomic resources have limited the development of breeding programs for serrasalmid fish Colossoma macropomum (tambaqui) and Piaractus mesopotamicus (pacu), the key native freshwater fish species produced in South America. The main objectives of this study were to design a dense SNP array for this fish group and to validate its performance on farmed populations from several locations in South America. Using multiple approaches based on different populations of tambaqui and pacu, a final list of 29,575 and 29,612 putative SNPs was selected, respectively, to print an Axiom AFFYMETRIX (THERMOFISHER) SerraSNP array. After validation, 74.17% (n = 21,963) and 71.25% (n = 21,072) of SNPs were classified as polymorphic variants in pacu and tambaqui, respectively. Most of the SNPs segregated within each population ranging from 14,199 to 19,856 in pacu; and from 15,075 to 20,380 in tambaqui. Our results indicate high levels of genetic diversity and clustered samples according to their hatchery origin. The developed SerraSNP array represents a valuable genomic tool approaching in-depth genetic studies for these species.Entities:
Mesh:
Year: 2021 PMID: 34588599 PMCID: PMC8481427 DOI: 10.1038/s41598-021-98885-x
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
SNPs classification according the Axiom quality control criteria (p-convert).
| Category | Number of SNPs | |
|---|---|---|
| Pacu | Tambaqui | |
| Recommended | 99,682 | 42,851 |
| Neutral | 18,308 | 11,857 |
| Not_recommended | 12,413 | 27,140 |
| Total | 130,403 | 81,848 |
SNP performance in the Axiom array, according to the method of SNP discovery in each species.
| Species | Method | Putative SNPs | Selected SNPs | Polymorphic SNPs | Conversion rate | MAF |
|---|---|---|---|---|---|---|
| Pacu | RNA-Seq | 80,004 | 8052 | 5842 | 0.72 | 0.254 |
| RADseq | 43,355 | 17,932 | 13,345 | 0.74 | 0.175 | |
| ddRADseq | 7044 | 3628 | 2776 | 0.76 | 0.160 | |
| Total | 130,403 | 29,612 | 21,963 | 0.74 | 0.203 | |
| Tambaqui | RNA-Seq | 18,282 | 1446 | 480 | 0.33 | 0.234 |
| GBS | 6803 | 2200 | 1888 | 0.85 | 0.237 | |
| ddRADseq | 56,763 | 25,929 | 18,704 | 0.72 | 0.260 | |
| Total | 81,848 | 29,575 | 21,072 | 0.71 | 0.247 |
Summary of the SNP classification according to their performance in the Axiom SerraSNP array of each species.
| Category | Pacu | Tambaqui | ||
|---|---|---|---|---|
| Number of markers | % | Number of markers | % | |
| PolyHighResolution | 16,780 | 56.67 | 19,661 | 66.48 |
| NoMinorHom | 5183 | 17.50 | 1411 | 4.77 |
| MonoHighResolution | 1771 | 5.98 | 1468 | 4.96 |
| CallRateBelowThreshold | 1166 | 3.94 | 1443 | 4.88 |
| OffTargetVariant | 570 | 1.92 | 550 | 1.86 |
| Other | 4142 | 13.99 | 5042 | 17.05 |
| Total | 29,612 | 100.00 | 29,575 | 100.00 |
Descriptive population genetic estimates for the sampled pacu and tambaqui populations included in the validation of the array.
| Species | Population | n | SNPs MAF > 0.01 | Average MAF | Ho | He |
|---|---|---|---|---|---|---|
| Pacu | Hatchery 1 | 42 | 19,608 | 0.221 | 0.306 | 0.307 |
| Hatchery 2 | 21 | 18,461 | 0.214 | 0.307 | 0.294 | |
| Hatchery 3 | 13 | 14,199 | 0.235 | 0.402 | 0.313 | |
| Hatchery 4 | 18 | 19,856 | 0.210 | 0.285 | 0.292 | |
| All populations | 94 | 21,963 | 0.203 | 0.276 | 0.283 | |
| Tambaqui | Brazil—combined | 28 | 20,380 | 0.266 | 0.362 | 0.356 |
| Hatchery 5 | 9 | 19,572 | 0.267 | 0.39 | 0.357 | |
| Hatchery 6 | 9 | 17,172 | 0.268 | 0.426 | 0.352 | |
| Hatchery 7 | 10 | 19,078 | 0.274 | 0.379 | 0.365 | |
| Colombia—Hatchery 8 | 15 | 15,649 | 0.247 | 0.351 | 0.331 | |
| Peru—Hatchery 9 | 15 | 15,075 | 0.248 | 0.353 | 0.329 | |
| All populations | 58 | 21,072 | 0.247 | 0.306 | 0.338 | |
| Pirapitinga | Wild | 10 | 3042 | 0.278 | 0.454 | 0.348 |
Figure 1SNP distribution according to the MAF values of each species. 21,963 validated SNPs (74.17%) in pacu (blue) and 21,072 validated SNPs (71.25%) in tambaqui (green) were considered. The validation of SNPs was carried out in 94 pacu and in 58 tambaqui individuals, respectively.
Classification of SNPs from RNA-Seq included in the array, considering the annotation of 4093 SNPs from pacu and 294 SNPs from tambaqui.
| Category | Number of SNPs | |
|---|---|---|
| Pacu (%) | Tambaqui (%) | |
| Intergenic | 2047 (48.3) | 208 (69.8) |
| UTR 3′ prime | 1202 (28.4) | 17 (5.7) |
| UTR 5′ prime | 284 (6.7) | 14 (4.7) |
| Synonymous SNP | 490 (11.6) | 59 (19.8) |
| Missense SNP | 215 (5.0) | – |
| Total of effects | 4238 (100) | 298 (100) |
Percentage values are between parenthesis.
Figure 2DAPC analysis from hatcheries of pacu (Hatchery 1–4) (a) and tambaqui (Hatchery 5–9) (b) to demonstrate the genetic structure using the SNPs. DAPC was performed adopting an optimum number of principal components (PC = 10) calculated using the α-score function of the Adegenet software.
Figure 3Linkage disequilibrium (LD) by SNP pruning between markers among all the sampled pacu and tambaqui populations (Hatchery 1–9). 21,963 validated SNPs (74.17%) in pacu and 21,072 validated SNPs (71.25%) in tambaqui were considered.
Details about the sampling and methods used for SNP discovery for each species.
| Species | Source | Method | Samples | Origin |
|---|---|---|---|---|
| Tambaqui | Hatchery 1—broodstock | ddRADseqa | 11 | South Brazil |
| Hatchery 2—broodstock | ddRADseqa | 17 | South Brazil | |
| Hatchery 3—broodstock | ddRADseqa | 17 | South Brazil | |
| Hatchery 4—broodstock | ddRADseqa | 15 | North Brazil | |
| Hatchery 5—broodstock | ddRADseqa | 18 | North Brazil | |
| Hatchery 6—broodstock | ddRADseqa | 21 | North Brazil | |
| Hatchery 7—broodstock | ddRADseqa | 15 | Colombia | |
| Hatchery 8—broodstock | ddRADseqa | 16 | Colombia | |
| Hatchery 9—broodstock | ddRADseqa | 31 | Peru | |
| Hatchery 10—broodstock | ddRADseqa | 8 | Peru | |
| Breeding nucleus (Caunesp)—broodstock | ddRADseqa | 29 | South Brazil | |
| Breeding nucleus (Caunesp)—20 full-sib families | ddRADseqb | 600 | South Brazil | |
| Nunes et al.[ | GBS | 124 | North Brazil | |
| Ariede et al.[ | RNA-Seq | 10 | South Brazil | |
| Gomes et al.[ | RNA-Seq | 8 | North Brazil | |
| Pacu | Mastrochirico-Filho et al.[ | RADseq | 400 | South Brazil |
| Breeding nucleus (Caunesp)—3 full-sib families | ddRADseqa | 100 | South Brazil | |
| Mastrochirico-Filho et al.[ | RNA-Seq | 36 | South Brazil |
aEnzyme combination I (SphI and MluCI).
b3 families used the enzyme combination I (SphI and MluCI) and 17 families used the combination II (NlaIII and MluCI).