| Literature DB >> 28105800 |
Newton O Otecko1, Min-Sheng Peng1, He-Chuan Yang2, Ya-Ping Zhang3, Guo-Dong Wang4.
Abstract
Quality deficiencies in single nucleotide polymorphism (SNP) analyses have important implications. We used missingness rates to investigate the quality of a recently published dataset containing 424 mitochondrial, 211 Y chromosomal, and 160 432 autosomal SNPs generated by a semicustom Illumina SNP array from 5 392 dogs and 14 grey wolves. Overall, the individual missingness rate for mitochondrial SNPs was ~43.8%, with 980 (18.1%) individuals completely missing mitochondrial SNP genotyping (missingness rate=1). In males, the genotype missingness rate was ~28.8% for Y chromosomal SNPs, with 374 males recording rates above 0.96. These 374 males also exhibited completely failed mitochondrial SNPs genotyping, indicative of a batch effect. Individual missingness rates for autosomal markers were greater than zero, but less than 0.5. Neither mitochondrial nor Y chromosomal SNPs achieved complete genotyping (locus missingness rate=0), whereas 5.9% of autosomal SNPs had a locus missingness rate=1. The high missingness rates and possible batch effect show that caution and rigorous measures are vital when genotyping and analyzing SNP array data for domestic animals. Further improvements of these arrays will be helpful to future studies.Entities:
Keywords: Autosomal; Dog; Mitochondrial; SNP array; Y chromosomal
Mesh:
Year: 2016 PMID: 28105800 PMCID: PMC5359323 DOI: 10.13918/j.issn.2095-8137.2016.6.356
Source DB: PubMed Journal: Zool Res ISSN: 2095-8137
Figure 1Individual missingness rates (iMR) for mitochondrial and Y chromosomal marker types
Comparison of individual missingness rates (iMR) across different breed categories
| Overall iMR, mean (standard deviation) | |||
| Mitochondrial | Y chromosomal | Autosomal | |
| Pure breeds | 0.206 1 (0.403 4) | 0.164 6 (0.365 5) | 0.028 2 (0.059 9) |
| Mixed breeds | 0.018 1 (0.132 8) | 0.012 5 (0.104 9) | 0.003 2 (0.014 9) |
| Village dogs | 0.011 2 (0.085 0) | 0.008 (0.053 3) | 0.004 1 (0.013 6) |
| Grey wolves | 1 | 0.966 8 | 0.130 3 (0.017 0) |
| ANOVA P value | 0.000 1 | 0.000 1 | 0.000 1 |
Figure 2Box plot showing the individual missingness rates (iMR) for mt: mitochondrial (n=1 387), Y: Y chromosomal (n=659), and Aut: autosomal (n=2 744) marker types according to gender for the 0
Comparison of individual missingness rates (iMR) for 374 males with likely batch effect versus remaining males
| Mean MR (Standard deviation) | ||||||
| Mitochondrial SNPs | Y chromosomal SNPs | Autosomal SNPs | ||||
| In* | 1 (0) | 0.001 | 0.982 (0.013) | 0.001 | 0.135 (0.069) | 0.001 |
| Out* | 0.001 (0.003) | 0.002 (0.007) | 0.002 (0.003) | |||
*In=374 male individuals with likely batch effect (mitochondrial SNPs MR=1 and Y chromosomal SNPs MR>0.96), Out=other individuals (n=2 288).
Summary of locus missingness rates (lMR) for mitochondrial, Y chromosomal, and autosomal SNPs
| Genotyping marker, No. SNPs (%) | Total | |||
| Mitochondrial SNPs | Autosomal SNPs | Y chromosomal SNPs | ||
| SNP MR | ||||
| MR=0 | 0 | 9 486 (5.9) | 0 | 9 486 (5.9) |
| 0<MR<0.1 | 0 | 134 747 (84.0) | 4 (1.9) | 134 751 (83.7) |
| 0.1=<MR<0.2 | 421 (99.3) | 15 456 (9.6) | 207 (98.1) | 16 084 (10.0) |
| 0.2=<MR<0.3 | 3 (0.7) | 743 (0.5) | 0 | 746 (0.5) |
| Total | 424 (100.0) | 160 432 (100.0) | 211 (100.0) | 161 067 (100.0) |
Figure 3Box plot showing the locus missingness rates (lMR) for mitochondrial, Y chromosomal, and autosomal SNPs MR>0