| Literature DB >> 28387238 |
José de Ribamar da Silva Nunes1,2,3, Shikai Liu2, Fábio Pértille1, Caio Augusto Perazza4, Priscilla Marqui Schmidt Villela1, Vera Maria Fonseca de Almeida-Val5,6, Alexandre Wagner Silva Hilsdorf4, Zhanjiang Liu2, Luiz Lehmann Coutinho1.
Abstract
Colossoma macropomum, or tambaqui, is the largest native Characiform species found in the Amazon and Orinoco river basins, yet few resources for genetic studies and the genetic improvement of tambaqui exist. In this study, we identified a large number of single-nucleotide polymorphisms (SNPs) for tambaqui and constructed a high-resolution genetic linkage map from a full-sib family of 124 individuals and their parents using the genotyping by sequencing method. In all, 68,584 SNPs were initially identified using minimum minor allele frequency (MAF) of 5%. Filtering parameters were used to select high-quality markers for linkage analysis. We selected 7,734 SNPs for linkage mapping, resulting in 27 linkage groups with a minimum logarithm of odds (LOD) of 8 and maximum recombination fraction of 0.35. The final genetic map contains 7,192 successfully mapped markers that span a total of 2,811 cM, with an average marker interval of 0.39 cM. Comparative genomic analysis between tambaqui and zebrafish revealed variable levels of genomic conservation across the 27 linkage groups which allowed for functional SNP annotations. The large-scale SNP discovery obtained here, allowed us to build a high-density linkage map in tambaqui, which will be useful to enhance genetic studies that can be applied in breeding programs.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28387238 PMCID: PMC5384230 DOI: 10.1038/srep46112
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Summary of the GBS reads before and after of trimmed.
| Reads | Total bases (bp) | Trimmed Reads | Trimmed bases | Sequencing depth | |
|---|---|---|---|---|---|
| Total number | 352,425,578 | 35,242,557,800 | 285,194,978 | 27,663,912,870 | 8.04X |
| Average per parent | 3,870,142 | 387,014,200 | 3,264,068 | 316,614,596 | 8.60X |
| Average per offspring | 2,779,720 | 277,972,000 | 2,247,313 | 217,989,361 | 8.03X |
Identification of SNPs from the tambaqui (Colossoma macropomum).
| SNP | Nucleotide substitution | SNP number | Proportion |
|---|---|---|---|
| Transitions | A-G | 28,744 | 0.42 |
| C-T | 15,288 | 0.22 | |
| — | 44,032 | 0.64 | |
| Transversions | A-C | 9,767 | 0.14 |
| A-T | 7,963 | 0.12 | |
| C-G | 3,483 | 0.05 | |
| G-T | 3,339 | 0.05 | |
| — | 24,552 | 0.36 | |
| Total | 68,584 | 1.00 |
Figure 1Graphical presentation of the tambaqui genetic linkage map.
The 27 linkage groups are represented by vertical bars while the horizontal lines represent markers. Genetic distances between adjacent markers are shown on the side ruler in centiMorgan (cM).
Summary of genetic linkage map of tambaqui.
| Linkage group | No. of mapped markers | Genetic size (cM) | Average marker interval | Max Gap (cM) |
|---|---|---|---|---|
| LG1 | 269 | 71.0 | 0.26 | 4.85 |
| LG2 | 362 | 145.7 | 0.40 | 8.04 |
| LG3 | 316 | 119.2 | 0.38 | 4.54 |
| LG4 | 287 | 83.1 | 0.29 | 3.39 |
| LG5 | 243 | 106.0 | 0.44 | 10.62 |
| LG6 | 316 | 110.2 | 0.35 | 7.89 |
| LG7 | 242 | 97.8 | 0.40 | 4.72 |
| LG8 | 328 | 125.1 | 0.38 | 8.67 |
| LG9 | 362 | 122.2 | 0.34 | 5.19 |
| LG10 | 331 | 123.2 | 0.37 | 5.59 |
| LG11 | 210 | 93.3 | 0.44 | 4.11 |
| LG12 | 315 | 101.7 | 0.32 | 14.79 |
| LG13 | 261 | 113.7 | 0.44 | 3.12 |
| LG14 | 86 | 71.5 | 0.83 | 3.85 |
| LG15 | 274 | 96.6 | 0.35 | 3.86 |
| LG16 | 279 | 101.3 | 0.36 | 2.85 |
| LG17 | 238 | 117.4 | 0.49 | 6.81 |
| LG18 | 361 | 88.8 | 0.25 | 3.12 |
| LG19 | 305 | 118.5 | 0.39 | 6.67 |
| LG20 | 258 | 114.4 | 0.44 | 2.08 |
| LG21 | 233 | 77.6 | 0.33 | 3.31 |
| LG22 | 271 | 127.5 | 0.47 | 4.22 |
| LG23 | 238 | 94.6 | 0.40 | 3.48 |
| LG24 | 269 | 91.6 | 0.34 | 3.18 |
| LG25 | 120 | 70.9 | 0.59 | 4.00 |
| LG26 | 193 | 99.1 | 0.51 | 3.08 |
| LG27 | 225 | 129.2 | 0.57 | 6.42 |
| Total | 7192 | 2810.9 | 0.39 | 5.27 |
Figure 2Patterns of marker distribution along the tambaqui linkage groups.
The dotted box indicates regions with higher marker density. The red lines indicate gaps without markers.
Figure 3Comparison between the tambaqui linkage map and the zebrafish genome.
Syntenic links between tambaqui linkage map and zebrafish genome were made with the Circos software (A). Plot of the percentage of SNPs that are shared between tambaqui linkage groups and zebrafish chromosomes (B). In synteny comparison, the connecting links are color coded according to the zebrafish chromosomes arc to their tambaqui linkage-group corresponding.
Annotation of SNPs for each linkage group of tambaqui against the genes from zebrafish genome (ENSEMBL release 84).
| LG | VP | Variants Consequences | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| IN | DW | UP | SY | MI | IT | SP | NC | OT | ||
| 1 | 52 | 13 | 8 | 8 | 6 | 6 | 4 | 2 | 2 | 3 |
| 2 | 46 | 16 | 3 | 5 | 1 | 3 | 6 | 5 | 6 | 1 |
| 3 | 46 | 16 | 4 | 5 | 11 | 5 | 3 | 1 | — | 2 |
| 4 | 41 | 15 | 4 | 3 | 2 | 2 | 7 | — | 6 | 2 |
| 5 | 36 | 12 | 5 | 3 | 4 | 1 | 4 | 2 | 3 | 2 |
| 6 | 56 | 27 | 3 | 6 | 4 | 4 | 7 | 1 | 3 | 1 |
| 7 | 46 | 14 | 7 | 6 | 2 | 5 | 7 | 4 | — | 1 |
| 8 | 61 | 24 | 9 | 3 | 8 | 5 | 4 | 2 | 2 | 3 |
| 9 | 79 | 27 | 13 | 6 | 8 | 11 | 6 | 2 | 3 | 2 |
| 10 | 55 | 16 | 3 | 8 | 4 | 6 | 8 | 3 | 5 | 2 |
| 11 | 33 | 13 | 5 | 4 | 6 | 1 | 3 | — | 1 | 1 |
| 12 | 49 | 22 | 5 | 2 | 3 | 1 | 9 | — | 5 | 1 |
| 13 | 40 | 20 | 9 | 3 | 1 | 3 | 3 | — | 1 | 1 |
| 14 | 24 | 9 | 3 | 1 | 5 | 2 | 1 | — | 2 | — |
| 15 | 69 | 18 | 9 | 7 | 12 | 6 | 11 | — | 2 | 4 |
| 16 | 46 | 12 | 6 | 6 | 8 | 2 | 3 | — | 2 | 6 |
| 17 | 33 | 13 | 5 | 2 | 2 | 3 | 2 | 2 | 3 | 1 |
| 18 | 58 | 20 | 13 | 9 | 5 | 5 | 3 | 1 | 1 | 1 |
| 19 | 53 | 15 | 7 | 7 | 9 | 4 | 4 | 2 | 1 | 3 |
| 20 | 38 | 17 | 2 | 6 | 4 | 2 | 4 | 2 | — | 1 |
| 21 | 47 | 14 | 7 | 5 | 8 | 4 | 4 | 2 | 1 | 2 |
| 22 | 50 | 24 | 7 | 7 | 4 | 2 | 4 | 1 | 2 | 3 |
| 23 | 51 | 24 | 4 | 5 | 5 | 3 | 3 | 4 | 3 | 1 |
| 24 | 49 | 19 | 9 | 6 | 3 | 3 | 7 | — | 1 | — |
| 25 | 20 | 7 | 3 | 3 | 1 | 2 | 2 | — | 2 | — |
| 26 | 25 | 13 | 3 | 3 | — | — | 5 | — | — | 1 |
| 27 | 34 | 11 | 4 | 5 | 7 | 1 | 4 | 1 | 1 | 1 |
| All | 1237 | 450 | 159 | 133 | 136 | 90 | 128 | 36 | 58 | 44 |
LG: linkage group; VP: variants processed; IN: intron variant; DW: downstream gene variant; UP: upstream gene variant; SY: synonymous variant; MI: missense variant IT: intergenic variant; SP: splice region variant; NC: non-coding transcript variant; OT: others variants.
Figure 4Effect of coverage depth on allele frequency of tambaqui (Colossoma macropomum) genotyped with GBS.