| Literature DB >> 34356058 |
Zhuoying Weng1, Yang Yang1, Xi Wang1, Lina Wu1, Sijie Hua1, Hanfei Zhang1, Zining Meng1,2.
Abstract
Pedigree information is necessary for the maintenance of diversity for wild and captive populations. Accurate pedigree is determined by molecular marker-based parentage analysis, which may be influenced by the polymorphism and number of markers, integrity of samples, relatedness of parents, or different analysis programs. Here, we described the first development of 208 single nucleotide polymorphisms (SNPs) and 11 microsatellites for giant grouper (Epinephelus lanceolatus) taking advantage of Genotyping-by-sequencing (GBS), and compared the power of SNPs and microsatellites for parentage and relatedness analysis, based on a mixed family composed of 4 candidate females, 4 candidate males and 289 offspring. CERVUS, PAPA and COLONY were used for mutually verification. We found that SNPs had a better potential for relatedness estimation, exclusion of non-parentage and individual identification than microsatellites, and > 98% accuracy of parentage assignment could be achieved by 100 polymorphic SNPs (MAF cut-off < 0.4) or 10 polymorphic microsatellites (mean Ho = 0.821, mean PIC = 0.651). This study provides a reference for the development of molecular markers for parentage analysis taking advantage of next-generation sequencing, and contributes to the molecular breeding, fishery management and population conservation.Entities:
Keywords: aquaculture; genotyping-by-sequencing; microsatellites; parentage assignment; relatedness analysis; single nucleotide polymorphisms
Mesh:
Year: 2021 PMID: 34356058 PMCID: PMC8304347 DOI: 10.3390/genes12071042
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Genetic characterization and combined non-exclusion probability for parent of microsatellites and SNPs.
| Molecular Marker |
| MAF Cut-Off | Proportion of Loci Typed | Ho | He | PIC | NE-1P | NE-PP |
|---|---|---|---|---|---|---|---|---|
| Microsatellites | 11 | - | 1 | 0.794 | 0.672 | 0.623 | 0.0289 | 2.13 × 10−5 |
| 10 | - | 1 | 0.821 | 0.698 | 0.651 | 0.0316 | 2.97 × 10−5 | |
| 9 | - | 1 | 0.820 | 0.709 | 0.661 | 0.0395 | 6.69 × 10−5 | |
| 8 | - | 1 | 0.834 | 0.720 | 0.673 | 0.0500 | 1.48 × 10−4 | |
| 7 | - | 1 | 0.852 | 0.734 | 0.687 | 0.0637 | 3.40 × 10−4 | |
| 6 | - | 1 | 0.871 | 0.743 | 0.697 | 0.0854 | 8.50 × 10−4 | |
| 5 | - | 1 | 0.904 | 0.752 | 0.708 | 0.118 | 2.25 × 10−3 | |
| 4 | - | 1 | 0.911 | 0.761 | 0.719 | 0.166 | 6.16 × 10−3 | |
| 3 | - | 1 | 0.912 | 0.771 | 0.732 | 0.240 | 0.0179 | |
| 2 | - | 1 | 0.914 | 0.790 | 0.757 | 0.348 | 0.0524 | |
| 1 | - | 1 | 0.903 | 0.807 | 0.778 | 0.559 | 0.201 | |
| SNPs | 208 | >0.1 | 0.941 | 0.401 | 0.395 | 0.309 | 6.53 × 10−9 | 1.15 × 10−25 |
| 137 | >0.2 | 0.937 | 0.481 | 0.478 | 0.362 | 5.00 × 10−7 | 9.36 × 10−20 | |
| 123 | >0.3 | 0.935 | 0.495 | 0.491 | 0.370 | 1.40 × 10−7 | 3.89 × 10−18 | |
| 100 | >0.4 | 0.936 | 0.500 | 0.498 | 0.373 | 1.91 × 10−6 | 5.24 × 10−15 | |
| 91 | >0.425 | 0.936 | 0.501 | 0.499 | 0.374 | 5.93 × 10−6 | 9.70 × 10−14 | |
| 78 | >0.450 | 0.936 | 0.501 | 0.500 | 0.374 | 3.18 × 10−5 | 6.80 × 10−12 | |
| 44 | >0.475 | 0.939 | 0.496 | 0.501 | 0.375 | 2.83 × 10−3 | 4.90 × 10−7 |
Legend: n, the loci number; He, expected heterozygosity; Ho, observed heterozygosity; PIC, polymorphic information content; NE-1P, average non-exclusion probability for one candidate parent when both parents were unknown; NE-PP, average non-exclusion probability for a candidate parent pair when both parents were known.
Figure 1Combined non-exclusion probability for identity based on microsatellites (a) or SNPs (b). NE-I refers to the combined non-exclusion probability for identity of unrelated individual pairs, NE-SIB refers to the combined non-exclusion probability for identity of full-sibling pairs.
Figure 2Accuracy of parentage assignment using differently sized microsatellite (a) and SNP (b) panels in different analysis programs. The accuracy was defined as the consistency derived from different marker panels compared with ‘standard pedigree’.
Figure 3Density plots of relatedness values for simulated pairs of known relatedness (unrelated, half-sibling, full-sibling and parent–offspring) based on given population allele frequency of different marker panels: (a) the microsatellite panel SSR1 composed of 1 microsatellite locus; (b) the microsatellite panel SSR11 composed of 11 microsatellite loci; (c) the SNP panel SNP44 composed of 44 SNP loci (MAF cut-off > 0.475); (d) the SNP panel SNP208 composed of 208 SNP loci (MAF cut-off > 0.1).
Figure 4The distribution of relatedness value (r) of candidate parents calculated by wang estimator based on 208 SNPs or 11 microsatellites.