| Literature DB >> 19951892 |
Courtney Gray-McGuire1, Murielle Bochud, Robert Goodloe, Robert C Elston.
Abstract
With the trend in molecular epidemiology towards both genome-wide association studies and complex modelling, the need for large sample sizes to detect small effects and to allow for the estimation of many parameters within a model continues to increase. Unfortunately, most methods of association analysis have been restricted to either a family-based or a case-control design, resulting in the lack of synthesis of data from multiple studies. Transmission disequilibrium-type methods for detecting linkage disequilibrium from family data were developed as an effective way of preventing the detection of association due to population stratification. Because these methods condition on parental genotype, however, they have precluded the joint analysis of family and case-control data, although methods for case-control data may not protect against population stratification and do not allow for familial correlations. We present here an extension of a family-based association analysis method for continuous traits that will simultaneously test for, and if necessary control for, population stratification. We further extend this method to analyse binary traits (and therefore family and case-control data together) and accurately to estimate genetic effects in the population, even when using an ascertained family sample. Finally, we present the power of this binary extension for both family-only and joint family and case-control data, and demonstrate the accuracy of the association parameter and variance components in an ascertained family sample.Entities:
Mesh:
Year: 2009 PMID: 19951892 PMCID: PMC2874328 DOI: 10.1186/1479-7364-4-1-2
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 4.639
Summary of TDT-type methods and their respective features
| Incorporation of: | ||||||||
|---|---|---|---|---|---|---|---|---|
| Method/Reference | Missing parents | Multiple alleles | Parental phenotypes | Quantitative traits | Extended pedigrees | Different family structures | Multiple markers | Covariates |
| Curtis (1997)S1 | * | * | ||||||
| S-TDT (Spielman and Ewens, 1998)S2 | * | * | * | |||||
| DAT (Boehnke and Langefeld, 1998)S3 | * | * | ||||||
| SDT (Horvath and Laird, 1998)S4 | * | * | * | |||||
| NFS (Whittemore and Tu, 2000)S5 | * | * | * | * | * | |||
| TRANSMIT (Clayton, 1999)S6 | * | * | * | * | ||||
| RC-TDT (Knapp, 1999)S7 | * | |||||||
| 1-TDT (Sun | * | * | ||||||
| Martin et al. (1997)S9 | * | * | * | |||||
| George et al. (1999)S10 | * | * | * | * | * | |||
| P-TDT (Abecasis et al., 2000)S11 | * | * | * | * | * | |||
| Bickeboller and Clerget-Darpoux(1995)S12 | * | |||||||
| Spielman and Ewens (1996)S13 | * | |||||||
| Purcell | * | * | * | |||||
| TDT(max) (Morris, 1997)S15 | * | |||||||
| Lazzeroni and Lange (1998)S16 | * | * | ||||||
| Monks and Kaplan(2000)S17 | * | * | * | * | ||||
| Xiong | * | * | ||||||
| Fan and Jung(2002)S19 | * | * | * | * | ||||
| TDT(Q1) - TDT(Q5) (Allison, 1997)S20, | * | * | ||||||
| Rabinowitz (1997)S21 | * | * | * | |||||
| Allison | * | * | * | |||||
| Sun | * | * | * | * | ||||
| Schaid and Rowlands (2000)S24 | * | * | * | |||||
| Waldman | * | * | ||||||
| Sinsheimer | * | * | * | * | * | |||
| Kistner and Weinberg (2004)S27 | * | * | * | |||||
| QTDT (Abecasis | * | * | * | * | ||||
| Zhu and Elston(2001)S29 | * | * | * | * | ||||
| PDT (Martin, 2000)S30 | * | * | ||||||
| Goring and Terwilliger(2000)S31 | * | * | ||||||
| Clayton and Jones(1999)S32 | * | * | ||||||
| ETDT (Sham and Curtis, 1995)S33 | * | |||||||
| TDT-EX (Cleves | * | * | ||||||
| Fulker (1999)S35 | * | * | * | |||||
| Fan | * | * | * | |||||
Table S1 References
S1. Curtis D. (1997), 'Use of siblings as controls in case-control association studies', Ann. Hum. Genet. vol. 61, No. 4, July, pp. 319-333.
S2. Spielman R.S. and Ewens W.J. (1998), 'A sibship test for linkage in the presence of association: The sib transmission/disequilibrium test', Am. J. Hum. Genet. vol. 62, No. 2, February, pp. 450-458.
S3. Boehnke M. and Langefeld C.D. (1998), 'Genetic association mapping based on discordant sib pairs: The discordant-alleles test', Am. J. Hum. Gene. vol. 62, No. 4, April, pp. 950-961.
S4. Horvath S. and Laird N.M. (1998), 'A discordant-sibship test for disequilibrium and linkage: No need for parental data', Am. J. Hum. Genet. vol. 63, No. 6, December, pp. 1886-1897.
S5. Whittemore A.S. and Tu I.P. (2000), 'Detection of disease genes by use of family data. I. Likelihood-based theory', Am. J. Hum. Genet. vol. 66, No. 4, April, pp. 1328-1340.
S6. Clayton D. (1999), 'A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission', Am. J. Hum. Genet. vol. 65, No. 4, October, pp. 1170-1177.
S7. Knapp M. (1999), 'The transmission/disequilibrium test and parental genotype reconstruction: The reconstruction-combined transmission/disequilibrium test', Am. J. Hum. Genet. vol. 64, No. 3, March, pp. 861-870.
S8. Sun F., Flanders W., Yang Q. and Khoury M. (1999), 'Transmission disequilbrium test (TDT) when only one parent is available: The 1-TDT', Am. J. Epidemiol. vol. 150, pp. 97-104.
S9. Martin E.R., Kaplan N.L. and Weir B.S. (1997), 'Tests for linkage and association in nuclear families', Am. J. Hum. Genet. Vol 61, No. 2, August, pp. 439-448.
S10. George V., Tiwari H.K., Zhu X. and Elston R.C. (1999), 'A test of transmission/disequilibrium for quantitative traits in pedigree data, by multiple regression', Am. J. Hum. Genet. vol. 65, No. 1, July, pp. 236-245.
S11. Abecasis G.R., Cookson W.O. and Cardon L.R. (2000), 'Pedigree tests of transmission disequilibrium', Eur. J. Hum. Genet. vol. 8, No. 7, pp. 545-551.
S12. Bickeboller H. and Clerget-Darpoux F. (1995), 'Statistical properties of the allelic and genotypic transmission/disequilibrium test for multiallelic markers', Genet. Epidemiol. vol. 12, No. 6, pp. 865-870.
S13. Spielman R.S. and Ewens W.J. (1996), 'The TDT and other family-based tests for linkage disequilibrium and association', Am. J. Hum. Genet. vol. 59, No. 5, November, pp. 983-989.
S14. Purcell S., Sham P.C. and Daly M.J. (2005), 'Parental phenotypes in family-based association analysis', Am. J. Hum. Genet. vol. 76, No. 2, pp. 249-259.
S15. Morris A.P., Curnow R.N. and Whittaker J.C. (1997), 'Randomization tests of disease-marker associations', Ann. Hum. Genet. vol. 61, No. 1, January, pp. 49-60.
S16. Lazzeroni L.C. and Lange K. (1998), 'A conditional inference framework for extending the transmission/disequilibrium test', Hum. Hered. vol. 48, No. 2, March, pp. 67-81.
S17. Monks S.A. and Kaplan N.L. (2000), 'Removing the sampling restrictions from family-based tests of association for a quantitative-trait locus', Am. J. Hum. Genet. vol. 66, No. 2, February, pp. 576-592.
S18. Xiong M.M., Krushkal J. and Boerwinkle E. (1998), 'TDT statistics for mapping quantitative trait loci', Ann. Hum. Genet. vol. 62, No. 5, September, pp. 431-452.
S19. Fan R. and Jung J. (2002), 'Association studies of QTL for multi-allele markers by mixed models', Hum. Hered. vol. 54, No. 3, pp. 132-150.
S20. Allison D.B. (1997), 'Transmission-disequilibrium tests for quantitative traits', Am. J. Hum. Genet. vol. 60, No. 3, March, pp. 676-690.
S21. Rabinowitz D. (1997), 'A transmission disequilibrium test for quantitative trait loci', Hum. Hered. vol. 47, No. 6, November, pp. 342-350.
S22. Allison D.B. Neale M.C., Zannolli R., Schork N.J. et al. (1999), 'The robustness of the likelihood-ratio test in a variance-component quantitative-trait loci-mapping procedure', Am. J. Hum. Genet. vol. 65, No. 2, August, pp. 531-544.
S23. Sun F., Flanders W., Yang Q. and Zhao H. (2000), 'Transmission/disequilibrium tests for quantitative traits', Ann. Hum. Genet. vol. 64, pp. 555-565.
S24. Schaid D.J. and Rowland C.M. (2000), 'Robust transmission regression models for linkage and association', Genet. Epidemiol. vol. 19, Suppl. 1. pp. S78-S84.
S25. Waldman I.D., Robinson B.F. and Rowe D.C. (1999), 'A logistic regression based extension of the TDT for continuous and categorical traits', Ann. Hum. Genet. vol. 63, No. 4, July, pp. 329-340.
S26. Sinsheimer J.S., Blangero J. and Lange K. (2000), 'Gamete-competition models', Am. J. Hum. Gene. vol. 66, No. 3, March, pp. 1168-1172.
S27. Kistner E.O. and Weinberg C.R. (2004), 'Method for using complete and incomplete trios to identify genes related to a quantitative trait', Genet. Epidemiol. vol. 27, No. 1, July, pp. 33-42.
S28. Abecasis G.R., Cardon L.R. and Cookson W.O. (2000), 'A general test of association for quantitative traits in nuclear families', Am. J. Hum. Genet. vol. 66, pp. 279-292.
S29. Zhu X. and Elston R.C. (2001), 'Transmission/disequilibrium tests for quantitative traits', Genet. Epidemiol. vol. 20, No. 1, January, pp. 57-74.
S30. Martin E.R., Monks S.A., Warren L.L. and Kaplan N.L. (2000), 'A test for linkage and association in general pedigrees: The pedigree disequilibrium test', Am. J. Hum. Genet. vol. 67, No. 1, pp. 146-154.
S31. Goring H.H. and Terwilliger J.D. (2000), 'Linkage analysis in the presence of errors IV: Joint pseudomarker analysis of linkage and/or linkage disequilibrium on a mixture of pedigrees and singletons when the mode of inheritance cannot be accurately specified', Am. J. Hum. Genet. vol. 66, No. 14, pp. 1310-1327.
S32. Clayton D. and Jones H. (1999), 'Transmission/disequilibrium tests for extended marker haplotypes', Am. J. Hum. Genet. vol. 65, No. 4, October, pp. 1161-1169.
S33. Sham P.C. and Curtis D. (1995), 'An extended transmission/disequilbrium test (TDT) for multiallele marker loci', An. Hum. Genet. vol. 59, Nos. 53/323, p. 336.
S34. Cleves M.A., Olson J.M. and Jacobs K.B. (1997), 'Exact transmission disequilibrium tests with multiallelic markers', Genet. Epidemiol. vol. 14, No. 4, pp. 337-347.
S35. Fulker D.W., Sham P.C. and Hewitt J.K. (1999), 'Combined linkage and association sib-pair analysis for quantitative traits', Am. J. Hum. Genet. vol. 64, pp. 259-267.
S36. Fan R., Floros J. and Xiong M. (2002), 'Models and tests of linkage and association studies of quantitative trait locus for multi-allele marker loci', Hum. Hered. vol. 53, No. 3, pp. 130-145.
Total variance of the non-major gene component of the continuous liability underlying the binary trait and the proportion of that variance represented by each variance component for each model
| Simulated proportion of variance for each variance component | ||||||
|---|---|---|---|---|---|---|
| Model Name | Total variance | Polygenic | Familial | Sibling | Marital | Random |
| 0.3125 | 0.200 | - - - - - - | - - - - - - | - - - - - - | 0.800 | |
| 0.3750 | 0.167 | 0.167 | - - - - - - | - - - - - - | 0.667 | |
| 0.3750 | 0.167 | - - - - - - | 0.167 | - - - - - - | 0.667 | |
| 0.3750 | 0.167 | - - - - - - | - - - - - - | 0.167 | 0.667 | |
| 0.4375 | 0.143 | - - - - - - | 0.143 | 0.143 | 0.571 | |
F = familial effect; M = marital effect; P = polygenic effect; S = sibling effect.
Accuracy of the association parameter as ln odds of being affected given two copies of the disease allele versus one copy for a sample size of 1,000 individuals
| Nuclear | Extended | ||||
|---|---|---|---|---|---|
| Model* | RAND | ASC | RAND | ASC | |
| 2.529 | 2.479 | 2.511 | 2.561 | ||
| 0.1709 | 0.1210 | 0.1530 | 0.2030 | ||
| 2.537 | 2.509 | 2.517 | 2.524 | ||
| 0.1789 | 0.1510 | 0.1591 | 0.1661 | ||
| 2.780 | 2.655 | 2.763 | 2.722 | ||
| 0.1775 | 0.0529 | 0.1603 | 0.1196 | ||
| 2.780 | 2.648 | 2.768 | 2.724 | ||
| 0.1775 | 0.0458 | 0.1655 | 0.1212 | ||
* Model indicates the variance components that were simulated followed by those included in the analysis model (F = familial and P = polygenic); Est is the average estimate across all replicates of that model; rMSE is the square root of the mean square error; ASC represents the analysis of an ascertained sample using ascertainment correction and RAND represents the analysis of a random sample without any such correction.
Accuracy of variance components as proportions of the total variance, N = 1,000
| Nuclear | Extended | |||||
|---|---|---|---|---|---|---|
| Parameter | Model | RAND | ASC | RAND | ASC | |
| 0.079 | 0068 | 0.0775 | 0.0696 | |||
| 0.0877 | 0.0990 | 0.0894 | 0.0693 | |||
| 0.1743 | 0.0636 | 0.1291 | 0.0555 | |||
| 0.0316 | 0.0781 | 0.0141 | 0.0866 | |||
| 0.0574 | 0.0669 | 0.0549 | 0.0713 | |||
| 0.1095 | 0.1000 | 0.1122 | 0.0959 | |||
| 0.0549 | 0.0554 | 0.057 | 0.0579 | |||
| 0.0883 | 0.1118 | 0.0860 | 0.1090 | |||
| 0.0896 | 0.0604 | 0.1711 | 0.1388 | |||
| 0.0775 | 0.1068 | 0.0000 | 0.0283 | |||
| 0.0741 | 0.0655 | 0.0775 | 0.0643 | |||
| 0.0927 | 0.1015 | 0.0894 | 0.1030 | |||
| 0.063 | 0.0805 | 0.0602 | 0.0755 | |||
| 0.1039 | 0.1196 | 0.1068 | 0.1249 | |||
| 0.2169 | 0.0617 | 0.1759 | 0.0559 | |||
| 0.0742 | 0.0800 | 0.0332 | 0.0860 | |||
| 0.0962 | 0.0723 | 0.0782 | 0.0603 | |||
| 0.0707 | 0.0949 | 0.0889 | 0.1068 | |||
| 0.1133 | 0.0122 | 0.033 | 0.0139 | |||
| 0.0539 | 0.3521 | 0.1342 | 0.1530 | |||
| 0.0198 | 0.0032 | 0.048 | 0.0102 | |||
| 0.0200 | 0.0032 | 0.0480 | 0.0100 | |||
* Model indicates the variance components that were simulated followed by those included in the analysis model (F = familial, M = marital, S = sibling and P = polygenic); Est is the average estimate across all replicates of that model; rMSE is the square root of the mean square error; ASC represents the analysis of an ascertained sample using ascertainment correction and RAND represents the analysis of a random sample without any such correction.
Figure 1Power to detect association by both total and locus-specific heritability for nuclear families (nuc fam) under an additive model (No Dom) and a model with 50 per cent additive and 50 per cent dominance variance (Add = Dom).
Figure 2Number of unrelated case-control samples needed, in addition to a fixed sample of either nuclear or extended pedigrees, to achieve a power of 92 per cent under an additive model (No Dom) and 86 per cent under a model with 50 per cent additive and 50 per cent dominance variance (Add = Dom). Values were generated for fixed sample sizes of both nuclear families and extended pedigrees, as well as for allele frequencies of both 0.5 and 0.1.
Figure 3Number of unrelated case-control samples needed, in addition to a fixed, mixed sample of nuclear and extended pedigrees, to achieve a power of 86 per cent under a model with 50 per cent additive and 50 per cent dominance variance (Add = Dom), assuming an allele frequency of 0.5. Values were generated for samples that comprised 30 per cent nuclear families and 70 per cent extended pedigrees, 50 per cent and 50 per cent, and 30 per cent and 70 per cent, respectively.
Figure 4Number of unrelated case-control samples needed, in addition to a fixed, mixed sample of nuclear and extended pedigrees, to achieve a power of 86 per cent under a model with 50 per cent additive and 50 per cent dominance variance (Add = Dom), assuming an allele frequency of 0.1. Values were generated for samples that comprised 30 per cent nuclear families and 70 per cent extended pedigrees, 50 per cent and 50 per cent, and 30 per cent and 70 per cent, respectively.
Figure 5Number of unrelated case-control samples needed, in addition to a fixed, mixed sample of nuclear and extended pedigrees, to achieve a power of 92 per cent under as additive model (No Dom), assuming an allele frequency of 0.1. Values were generated for samples that comprised 30 per cent nuclear families and 70 per cent extended pedigrees, 50 per cent and 50 per cent, and 30 per cent and 70 per cent, respectively.
Accuracy of the association parameter as ln odds of being affected given two copies of the disease allele versus no copies for a sample size of 1,000 individuals
| Nuclear | Extended | ||||
|---|---|---|---|---|---|
| Model* | RAND | ASC | RAND | ASC | |
| 5.058 | 4.958 | 5.022 | 5.122 | ||
| 0.2936 | 0.3936 | 0.3295 | 0.2296 | ||
| 5.074 | 5.018 | 5.034 | 5.048 | ||
| 0.2777 | 0.3336 | 0.3176 | 0.3036 | ||
| 5.560 | 5.310 | 5.526 | 5.444 | ||
| 0.1010 | 0.5696 | 0.3536 | 0.4357 | ||
| 5.560 | 5.296 | 5.536 | 5.448 | ||
| 0.3195 | 0.5836 | 0.3437 | 0.4316 | ||
* Model indicates the variance components that were simulated followed by those included in the analysis model (F = familial and P = polygenic); Est is the average estimate across all replicates of that model; rMSE is the square root of the mean square error; ASC represents the analysis of an ascertained sample using ascertainment correction and RAND represents the analysis of a random sample without any such correction.