| Literature DB >> 30819089 |
Abstract
BACKGROUND: The accurate determination of parent-progeny relationships within both in situ natural populations and ex situ genetic resource collections can greatly enhance plant breeding/domestication efforts and support plant genetic resource conservation strategies. Although a range of parentage analysis tools are available, none are designed to infer such relationships using genome-wide single nucleotide polymorphism (SNP) data in the complete absence of guiding information, such as generational groups, partial pedigrees, or genders. The R package ('apparent') developed and presented here addresses this gap.Entities:
Keywords: Parentage analysis; Pedigree inference; R package; SNP
Mesh:
Year: 2019 PMID: 30819089 PMCID: PMC6396488 DOI: 10.1186/s12859-019-2662-3
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1The ‘apparent’ analysis plots. For a given population, a simple gap analysis separates true triads from spurious relationships. (a) Gower Dissimilarities (GDij|k) are plotted for all possible parent-offspring combinations in the population, enabling an inspection of gap size and all subsequent hypothesis testing. (b) For each significant parent-offspring association from the dyad analysis, distribution plots of mean GDi(1...j)|k values (GDM) and their standard deviation in units of GDi|k (GDCV) help visualize the analysis. In this particular example, A. arguta cv. ‘#74–32’ was correctly identified as a parent of offspring 10 despite the absence of the other parent (cv. ‘Chang Bai Mountain 5’) from the population and the confounding presence of two full-sibs (offspring 11 and 12)
Fig. 2Influence of the number of SNP loci on error rates, accuracy, and computation time. For each set of loci sampled, the performance of the ‘apparent’ package was evaluated in terms of error rates (Types I and II) and accuracy. The times required to successfully complete the analyses were also recorded and reveal a surprising insensitivity to the number of markers used. Note that the percentage of markers usable by ‘apparent’ for the analysis (i.e. parental homozygous SNPs) is quite stable
Comparison of the ‘apparent’ R package to five currently available tools for parentage analysis, based on main features, ease of use, and available functions
| Package/Tool | Main features | Ease of use | Available functionse | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Marker Typea | Method of Analysisb | Required classesc | OSd | Parameter-ization | Input | Output | PM | PP | PR | SR | FP | EP | IC | EC | |
| Apparent | Co | GR/GS | none | R | Easy | Easy | Easy | X | X | X | X | ||||
| hsphase ( | Co | GR/GS | none | R | Easy | Easy | Complex | X | X | ||||||
| MasterBayes ( | Co/Do | BA/ML | Pa/Off | R | Complex | Easy | Easy | X | X | X | X | X | X | ||
| ParentOffspring | Co | GR/GS | Mo/Fa/Off | R | Easy | Easy | Easy | X | |||||||
| Solomon | Co | BA | Pa/Off | R | Moderate | Easy | Moderate | X | X | X | X | X | |||
| Cervus | Co/Do | ML | Pa/Off | W | Moderate | Easy | Moderate | X | X | X | X | X | |||
aCo = Co-dominant, Do = Dominant
bBA = Bayesian approach; GR = Genetic relatedness, GS = Genetic similarity, ML = Maximum likelihood
cnone = no a priori information required for individuals, Pa/Off = each individual must be classified as either a Parent or an Offspring [2 classes], Fa/Mo/Off = each individual must be uniquely classified as a potential Mother, Father, or Offspring [3 classes]
dOperational System. R = R package (OS independent), W = Windows
eFollowing the typology developed by Jones et al. (2010): PM = Paternity/maternity analysis, PP = Parent pair allocation, PR = Parental reconstruction analysis, SR = Sib-ship Reconstruction, FP = Full probability analysis, EP = Ability to calculate exclusion probabilities, IC = Ability to assign statistical confidence to individual parent-offspring pairs, EC = Ability to assess experiment-wide statistical confidence of parent-offspring assignments
Summary of results comparing the performance of ‘apparent’ to five other parentage analysis tools in identifying the pairs of parents of 15 A. arguta offspring in a population of 77 individuals
| Package/Tool | Analysis with no guiding information provided | Analysis guided with 2 genotypic classes: adults vs. offspring | Citation | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Error I (%) | Error II (%) | Accuracy (%) | Time (min) | Error I (%) | Error II (%) | Accuracy (%) | Time (min) | ||
| apparent | 0.0 | 0.0 | 100.0 | 18.4 | 0.0 | 0.0 | 100.0 | 3.1 | – |
| hsphase ( | 31.0 | 5.7 | 17.4 | 0.1 | 31.0 | 5.7 | 17.4 | 0.1 | Ferdosi et al. (2014) [ |
| MasterBayes ( | 15.6 | 22.1 | 48.2 | 2.3 | 0.0 | 0.0 | 100.0 | 0.3 | Hadfield et al. (2006) [ |
| ParentOffspringa | 44.4 | 0.0 | 55.6 | 260.4 | 6.3 | 0.0 | 93.8 | 28.9 | Abdel-Haleem et al. (2013) [ |
| Solomon | 97.7 | 0.9 | 2.3 | 401.4 | 97.5 | 1.0 | 2.6 | 43.1 | Christie et al. (2013) [ |
| Cervus | 74.6 | 0.0 | 50.9 | 11.9 | 0.0 | 0.0 | 100.0 | 5.3 | Kalinowski et al. (2007) [ |
aThe results from ParentOffspring are approximate because the package requires an a priori declaration of parental pairs for a given set of offspring (see main text)