| Literature DB >> 21129206 |
Karen Curtin1, Roger K Wolff, Jennifer S Herrick, Ryan Abo, Martha L Slattery.
Abstract
BACKGROUND: In candidate-gene association studies of single nucleotide polymorphisms (SNPs), multilocus analyses are frequently of high dimensionality when considering haplotypes or haplotype pairs (diplotypes) and differing modes of expression. Often, while candidate genes are selected based on their biological involvement in a given pathway, little is known about the functionality of SNPs to guide association studies. Investigators face the challenge of exploring multiple SNP models to elucidate which variants, independently or in combination, might be associated with a disease of interest. A data mining module, hapConstructor (freely-available in Genie software) performs systematic construction and association testing of multilocus genotype data in a Monte Carlo framework. Our objective was to assess its utility to guide statistical analyses of haplotypes within a candidate region (or combined genotypes across candidate genes) beyond that offered by a standard logistic regression approach.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21129206 PMCID: PMC3006374 DOI: 10.1186/1471-2350-11-170
Source DB: PubMed Journal: BMC Med Genet ISSN: 1471-2350 Impact factor: 2.103
Characteristics of colon cancer and rectal cancer studies
| Total | 1556 | 1956 | 754 | 959 | ||||
| Male | 871 | 56.0 | 1047 | 53.5 | 451 | 59.8 | 541 | 56.4 |
| Female | 685 | 44.0 | 909 | 46.5 | 303 | 40.2 | 418 | 43.6 |
| Age at diagnosis or selection | ||||||||
| 30-39 | 23 | 1.5 | 40 | 2.0 | 19 | 2.5 | 21 | 2.2 |
| 40-49 | 102 | 6.6 | 128 | 6.5 | 96 | 12.7 | 101 | 10.5 |
| 50-59 | 290 | 18.6 | 326 | 16.7 | 196 | 26.0 | 243 | 25.3 |
| 60-69 | 538 | 34.6 | 673 | 34.4 | 250 | 33.2 | 329 | 34.3 |
| 70-79 | 603 | 38.8 | 789 | 40.3 | 193 | 25.6 | 265 | 27.6 |
| Center | ||||||||
| No. California (KPMCP) | 744 | 47.8 | 787 | 40.2 | 480 | 63.7 | 594 | 61.9 |
| Utah | 250 | 16.1 | 378 | 19.3 | 274 | 36.3 | 365 | 38.1 |
| Minnesota | 562 | 36.1 | 791 | 40.4 | 0 | - | 0 | - |
| White, non-Hispanic | 1429 | 91.8 | 1828 | 93.5 | 625 | 82.9 | 824 | 85.9 |
| White, Hispanic | 59 | 3.8 | 75 | 3.8 | 61 | 8.1 | 63 | 6.6 |
| Black | 68 | 4.4 | 53 | 2.7 | 29 | 3.8 | 43 | 4.5 |
| Asian | 0 | - | 0 | - | 39 | 5.2 | 29 | 3.0 |
Single SNP results from hapConstructor and from multivariable-adjusted logistic regression
| Genetic | hapConstructor* | Logistic regression** | |||||
|---|---|---|---|---|---|---|---|
| Gene | SNP | Comparison | Model | OR | OR | 95%CI | |
| rs2069860 | AT/TT vs. AA | Dominant | 0.53 | 0.02 | 0.55 | 0.32, 0.95 | |
| rs2272733 | TT vs. CC | Additive | 0.55 | 0.02 | 0.56 | 0.34, 0.93 | |
| rs4648110 | AA vs. TT | Additive | 0.63 | 0.01 | 0.65 | 0.44, 0.94 | |
| AA vs. TT/TA | Recessive | 0.64 | 0.01 | 0.66 | 0.45, 0.96 | ||
| rs13117745 | TT vs. CC | Additive | 0.61 | 0.03 | 0.61 | 0.37, 1.00 | |
| TT vs. CC/CT | Recessive | 0.62 | 0.04 | 0.64 | 0.39, 1.04 | ||
| rs230510 | TT vs. AA | Additive | 0.66 | 0.004 | 0.65 | 0.49, 0.87 | |
| TT vs. AA/AT | Dominant | 0.77 | 0.009 | 0.79 | 0.51, 0.94 | ||
| rs3821958 | GG vs. AA | Additive | 1.34 | 0.04 | 1.32 | 1.00, 1.75 | |
| rs11722146 | GA/AA vs. GG | Dominant | 1.21 | 0.047 | 1.24 | 1.03, 1.51 | |
| rs13117745 | TT vs. CC | Additive | 1.92 | 0.04 | 1.69 | 0.93, 3.07 | |
NOTES: Odds ratios (ORs) compared to reference of major allele homozygotes; 95% confidence interval (95% CI)
*adjusted for race
**adjusted for age, sex, center, and race
NFKB1 haplotypes and associations with colon and rectal cancer from hapConstructor and from multivariable-adjusted logistic regression
| Genetic | hapConstructor* | Logistic regression** | |||||
|---|---|---|---|---|---|---|---|
| SNP 1 | SNP 2 | Haplotype | Model | OR | OR | 95%CI | |
| rs230510 A > T | rs4648110 T > A | 2 copies A-A vs. 0/1 copy | Rec. | 0.62 | 0.009 | 0.65 | 0.45, 0.95 |
| rs13117745 C > T | rs4648110 T > A | 2 copies T-A vs. 0 copies | Additive | 0.62 | 0.009 | 0.65 | 0.44, 0.94 |
| rs3821958 A > G | rs230510 A > T | 2 copies A-T vs. 0 copies | Additive | 0.66 | 0.006 | 0.68 | 0.50, 0.91 |
| rs4648127 C > T | rs230510 A > T | 2 copies C-T vs. 0 copies | Additive | 0.66 | 0.006 | 0.68 | 0.50, 0.91 |
| rs11722146 G > A | rs230510 A > T | 2 copies G-T vs. 0 copies | Additive | 0.66 | 0.006 | 0.67 | 0.50, 0.90 |
| rs13117745 C > T | rs230510 A > T | 2 copies C-T vs. 0 copies | Additive | 0.66 | 0.006 | 0.67 | 0.50, 0.90 |
| rs230510 A > T | rs4648090 G > A | 2 copies T-G vs. 0 copies | Additive | 0.67 | 0.007 | 0.68 | 0.51, 0.90 |
| rs11722146 G > A | rs4648090 G > A | 2 copies A-A vs. 0 copies | Additive | 1.47 | 0.007 | 1.47 | 1.11, 1.93 |
| rs230510 A > T | rs4648110 T > A | 2 copies T-T vs. 0 copies | Additive | 0.68 | 0.009 | 0.68 | 0.51, 0.91 |
| rs3821958 A > G | rs4648110 T > A | 2 copies G-A vs. 0 copies | Additive | 1.52 | 0.005 | 1.49 | 1.12, 1.99 |
| rs11722146 G > A | rs13117745 C > T | 2 copies A-T vs. 0 copies | Additive | 1.51 | 0.005 | 1.53 | 1.16, 2.01 |
| rs11722146 G > A | rs4648127 C > T | 2 copies A-T vs. 0 copies | Additive | 1.33 | 0.004 | 1.36 | 1.12, 1.65 |
| rs3821958 A > G | rs13117745 C > T | 2 copies G-T vs. 0 copies | Additive | 1.53 | 0.003 | 1.52 | 1.13, 2.02 |
| rs11722146 G > A | rs4648110 T > A | 2 copies A-A vs. 0 copies | Additive | 1.54 | 0.0028 | 1.55 | 1.17, 2.04 |
NOTES: Odds ratio (OR); 95% confidence interval (95% CI); Recessive (Rec)
*adjusted for race
**adjusted for age, sex, center, and race; all P-trends (additive model) were significant, <0.01
Composite genotype associations with colon and rectal cancer from hapConstructor and from multivariable-adjusted logistic regression
| Genetic | hapConstructor* | Logistic regression** | ||||||
|---|---|---|---|---|---|---|---|---|
| SNP 1 | SNP 2 | SNP 3 | SNP1/SNP2/SNP3 genotype | Model | OR | OR | 95%CI | |
| (TA or AA)/AA | Dom./Rec. | 0.56 | 0.0094 | 0.54 | 0.34, 0.87 | |||
| (GT or TT)/AA | Dom./Rec. | 0.23 | 0.0058 | 0.21 | 0.06, 0.74 | |||
| (GA or AA)/AA | Dom./Rec. | 0.41 | 0.0004 | 0.40 | 0.23, 0.72 | |||
| (CA or AA)/(GA or AA)/AA | Dom./Dom./Rec. | 0.21 | 0.0004 | 0.20 | 0.06, 0.67 | |||
| (GA or AA)/(GA or AA)/AA | Dom./Dom./Rec. | 0.40 | 0.0004 | 0.40 | 0.21, 0.73 | |||
| (GA or AA)/(GA or AA)/AA | Dom./Dom./Rec. | 0.39 | 0.0002 | 0.39 | 0.21, 0.73 | |||
| (TA or AA)/(GA or AA) | Dom./Dom. | 1.30 | 0.0096 | 1.11 | 0.83, 1.50 | |||
| (GA or AA)/(GA or AA) | Dom./Dom. | 1.35 | 0.0068 | 1.38 | 1.04, 1.83 | |||
| (GA or AA)/(AG or GG) | Dom./Dom. | 1.34 | 0.0044 | 1.15 | 0.85, 1.56 | |||
NOTES: Odds ratio (OR); 95% confidence interval (95%CI); Dominant (Dom); Recessive (Rec)
*adjusted for race; reference category is all other composite genotypes
**adjusted for age, sex, center, and race; reference category is major allele homozygote (dominant) or heterozygote (recessive) for each SNP
Figure 1Illustration of the hapConstructor method and associations across .
Comparison of hapConstructor and logistic regression characteristics in multilocus modeling
| Characteristic | hapConstructor software | Standard logistic regression procedure | Stepwise logistic regression procedure |
|---|---|---|---|
| Integrated phase estimation and building of haplotypes from genotype data | Yes | No; specialized software required | No; specialized software required |
| Systematically model all 2 to n-locus haplotypes within a gene | Yes | Yes, with additional programming | Yes, with additional programming |
| Systematically model mode of inheritance (dominant, recessive, additive) | Yes | Yes, with additional programming | Yes, with additional programming |
| Systematically model all SNP combinations across genes | Yes | No | No |
| User-specified significance levels | Yes; can vary for single locus to n loci | Yes | Yes |
| Model simultaneous covariates | No | Yes | Yes |
| Multiple testing correction | Yes (FDR) | Yes (add-on procedure) | Yes (add-on procedure) |
| Runtime (excluding additional operator programming time) | Slow to adequate | Fast | Fast |