| Literature DB >> 27980636 |
Sahir R Bhatnagar1, Celia M T Greenwood2, Aurélie Labbe3.
Abstract
A statistical departure from Mendel's law of segregation is known as transmission ratio distortion. Although well documented in many other organisms, the extent of transmission ratio distortion and its influence in the human genome remains incomplete. Using Genetic Analysis Workshop 19 whole genome sequence data from 20 large Mexican American pedigrees, our goal was to identify potentially distorted regions in the genome using family-based association methods such as the transmission disequilibrium test, the pedigree disequilibrium test, and the family-based association test. Preliminary results showed an unusually high number of transmission ratio distortion signals identified by the transmission disequilibrium test, but this phenomenon could not be replicated by the pedigree disequilibrium test or family-based association test. Applying these tests to different subsets of the data, we found the transmission disequilibrium test to be very sensitive to imputed genotypes. Regression analysis of transmission ratio distortion test p values controlling for minor allele frequency and quality control checks showed that Hardy Weinberg p values are associated with this inflation. Although the transmission disequilibrium test appears confounded by imputation of single nucleotide polymorphisms, the pedigree disequilibrium test and family-based association test seem to offer more robust alternatives when searching for transmission ratio distortion loci in whole genome sequence data from extended families.Entities:
Year: 2016 PMID: 27980636 PMCID: PMC5133486 DOI: 10.1186/s12919-016-0030-0
Source DB: PubMed Journal: BMC Proc ISSN: 1753-6561
Fig. 1Manhattan plots for TDT (a, d, g), PDT (b, e, h) and FBAT (c, f, i) for all data (a, b, c), the sequenced subset (d, e, f), and the nuclear subset (g, h, i). Everyone has been coded as “affected” because we are evaluating evidence for general population TRD. A threshold line was set at 10−3 for comparability with the regression analysis. The effect of imputation on the test statistic can be seen by comparing across rows, while the impact of the different test statistics can be compared across columns. The y-axis varies across the plots
Fig. 2Q-Q plots of TDT, PDT, and FBAT p values for all (red) and sequenced (blue) subsets of the data. Both axes are plotted on the - log10 scale. The y-axis varies across the plots
Contribution of HW disequilibrium to p value distribution, evaluated by regression analysis. Significant F tests (p < 0.05) demonstrating evidence of the contribution of HW disequilibrium to the p value distributions are shown in bold font
| Allb, c | Sequenceda, c | Nuclearb, c | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Method | TDT | PDT | FBAT | TDT | PDT | FBAT | TDT | PDT | FBAT |
| Regression model of - log10 | |||||||||
|
| 4.4 × | 3.6 × | 1.5 × | 0.0018 | 3.6 × | 2 × | 2.9 × | 8.26 × | 3.5 × |
| # SNPse | 6.1 × | 5.6 × | 2.1 × | 3.2 × | 3 × | 8 × | 3.6 × | 2.1 × | 5 × |
| Regression models of - log10 | |||||||||
|
| 0.002 | 8.4 × | 8.8 × | 0.0015 |
| 0.006 | 0.014 | 0.01 | 0.01 |
| # SNPse | 8 × | 517 | 870 | 1777 | 11 | 19 | 3.3 × | 53 | 54 |
| F-test | |||||||||
| All SNPs |
| 0.16 | 0.07 |
| 0.74 | 0.69 |
| 0.89 | 0.68 |
|
|
| 0.84 | 0.38 | 0.10 | NAf | 0.82 |
| 0.49 | 0.50 |
aFull model is given by Eq. (1)
bFull model is given by Eq. (2)
cReduced model excludes
dTest to see if there is a significant difference between the full model and the reduced model. Numbers presented correspond to pvalues of the F test where the null hypothesis is
eThe number of informative SNPs
fNot enough data points to fit the model
Fig. 3Number of subjects, by chromosome, where the imputed dosage genotypes were closer to an integer value at all markers than the indicated threshold