| Literature DB >> 28253278 |
Sajjad Ahmad Khan1,2, Sadaf Manzoor2, Amjad Ali2, Dost Muhammad Khan1, Umair Khalil1.
Abstract
Gene-mapping studies, regularly, rely on examination for Mendelian transmission of marker alleles in a pedigree as a way of screening for genotyping errors and mutations. For analysis of family data sets, it is, usually, necessary to resolve or remove the genotyping errors prior to consideration. At the Center of Inherited Disease Research (CIDR), to deal with their large-scale data flow, they formalized their data cleaning approach in a set of rules based on PedCheck output. We scrutinize via carefully designed simulations that how well CIDR's data cleaning rules work in practice. We found that genotype errors in siblings are detected more often than in parents for less polymorphic SNPs and vice versa for more polymorphic SNPs. Through computer simulations, we conclude that some of the CIDR's rules work poorly in some circumstances, and we suggest a set of modified data cleaning rules that may work better than CIDR's rules.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28253278 PMCID: PMC5333839 DOI: 10.1371/journal.pone.0172807
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
CIDR's rules for removing Mendelian inconsistencies.
| Situation in a Nuclear Family | Error Messages | Actions | Short Rule Name |
|---|---|---|---|
| 1 parent is inconsistent with 1 child | ERROR: Child 01 and Mother are inconsistent | Zero out the child’s genotype | 1P1C:C0 |
| OR | |||
| ERROR: Child 01 and Father are inconsistent. | |||
| 1 parent is inconsistent with 2+ children | ERROR: Child 01 and Mother are inconsistent | Zero out the specific parent genotype | 1P2+C:P0 |
| AND | |||
| ERROR: Child 02 and Mother are inconsistent | |||
| OR | |||
| ERROR: Child 01 and Father are inconsistent | |||
| AND | |||
| ERROR: Child 02 and Father are inconsistent. | |||
| 2 parents are inconsistent with 1 child | ERROR: Child 01 is consistent with each parent separately, but not as a pair | Zero out the child’s genotype | 2P1C:C0 |
| OR | |||
| ERROR: Child 01 and Mother are inconsistent | |||
| AND | |||
| ERROR: Child 01 and Father are inconsistent. | |||
| 2 parents are inconsistent with 2+ children | ERROR: Child 01 is consistent with each parent separately, but not as a pair | Zero out the genotypes of the whole nuclear family | 2P2+C:W0 |
| AND | |||
| ERROR: Child 02 is consistent with each parent separately, but not as a pair. | |||
| OR | |||
| ERROR: Child 01 and Father are inconsistent. | |||
| AND | |||
| ERROR: Child 02 and Mother are inconsistent. |
aLet P = parent, C = child, and W = whole family, then we name CIDR’s rules as 1. 1P1C:C0, 2. 1P2+C:P0, 3. 2P1C:C0, 4. 2P2+C:W0, where C0 = zero out the child’s genotype, P0 = zero out the specific parent genotype, and W0 = zero out the genotypes of the whole nuclear family.
Fig 1Percent of time a genotyping error is detected.
Fig 2Percent of time each rule is applied.
Fig 3Percent of time each rule is applied correctly.