| Literature DB >> 28594416 |
James R Staley1, Edmund Jones1, Stephen Kaptoge1, Adam S Butterworth1,2, Michael J Sweeting1,2, Angela M Wood1,2, Joanna M M Howson1.
Abstract
Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.Entities:
Mesh:
Year: 2017 PMID: 28594416 PMCID: PMC5520083 DOI: 10.1038/ejhg.2017.78
Source DB: PubMed Journal: Eur J Hum Genet ISSN: 1018-4813 Impact factor: 4.246
Simulation results for cohort studies with 10,000 individuals for a SNP with RAF=0.10.
| Complete Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0039 | 0.1020 | 0.1610 | –0.0010 | 0.1052 | 0.1590 | 0.0020 |
| 1.2 | –0.0024 | 0.0985 | 0.4674 | 0.0034 | 0.1018 | 0.4636 | 0.0008 |
| 1.5 | –0.0031 | 0.0909 | 0.9856 | 0.0111 | 0.0947 | 0.9858 | –0.0002 |
| 2 | –0.0014 | 0.0834 | 1.0000 | 0.0264 | 0.0881 | 1.0000 | 0.0000 |
| Complete Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0025 | 0.0716 | 0.2664 | 0.0036 | 0.0764 | 0.2628 | 0.0036 |
| 1.2 | –0.0006 | 0.0694 | 0.7360 | 0.0114 | 0.0744 | 0.7342 | 0.0022 |
| 1.5 | –0.0004 | 0.0642 | 0.9998 | 0.0290 | 0.0699 | 0.9998 | 0.0000 |
| 2 | –0.0005 | 0.0586 | 1.0000 | 0.0576 | 0.0655 | 1.0000 | 0.0000 |
| Complete Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0008 | 0.0588 | 0.3754 | 0.0087 | 0.0648 | 0.3748 | 0.0006 |
| 1.2 | –0.0013 | 0.0570 | 0.8730 | 0.0168 | 0.0633 | 0.8694 | 0.0036 |
| 1.5 | –0.0007 | 0.0530 | 1.0000 | 0.0440 | 0.0601 | 1.0000 | 0.0000 |
| 2 | –0.0007 | 0.0487 | 1.0000 | 0.0869 | 0.0574 | 1.0000 | 0.0000 |
| Survey Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0006 | 0.1011 | 0.1678 | 0.0020 | 0.1044 | 0.1684 | -0.0006 |
| 1.2 | –0.0015 | 0.0979 | 0.4668 | 0.0026 | 0.1013 | 0.4576 | 0.0092 |
| 1.5 | –0.0043 | 0.0907 | 0.9864 | 0.0065 | 0.0945 | 0.9858 | 0.0006 |
| 2 | –0.0014 | 0.0840 | 1.0000 | 0.0193 | 0.0885 | 1.0000 | 0.0000 |
| Survey Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0035 | 0.0716 | 0.2638 | –0.0006 | 0.0763 | 0.2550 | 0.0088 |
| 1.2 | –0.0016 | 0.0694 | 0.7296 | 0.0057 | 0.0743 | 0.7068 | 0.0228 |
| 1.5 | –0.0019 | 0.0645 | 1.0000 | 0.0166 | 0.0700 | 1.0000 | 0.0000 |
| 2 | 0.0002 | 0.0594 | 1.0000 | 0.0362 | 0.0661 | 1.0000 | 0.0000 |
| Survey Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0018 | 0.0590 | 0.3672 | 0.0027 | 0.0650 | 0.3370 | 0.0302 |
| 1.2 | –0.0005 | 0.0573 | 0.8760 | 0.0093 | 0.0635 | 0.8472 | 0.0288 |
| 1.5 | –0.0011 | 0.0535 | 1.0000 | 0.0219 | 0.0605 | 1.0000 | 0.0000 |
| 2 | –0.0013 | 0.0498 | 1.0000 | 0.0439 | 0.0580 | 1.0000 | 0.0000 |
| Random Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0038 | 0.1017 | 0.1682 | –0.0022 | 0.1049 | 0.1620 | 0.0062 |
| 1.2 | –0.0025 | 0.0983 | 0.4638 | 0.0004 | 0.1016 | 0.4574 | 0.0064 |
| 1.5 | –0.0031 | 0.0913 | 0.9858 | 0.0049 | 0.0950 | 0.9836 | 0.0022 |
| 2 | –0.0037 | 0.0852 | 1.0000 | 0.0106 | 0.0896 | 1.0000 | 0.0000 |
| Random Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0018 | 0.0721 | 0.2702 | 0.0005 | 0.0768 | 0.2494 | 0.0208 |
| 1.2 | –0.0025 | 0.0699 | 0.7324 | 0.0017 | 0.0748 | 0.6898 | 0.0426 |
| 1.5 | –0.0029 | 0.0651 | 1.0000 | 0.0078 | 0.0706 | 1.0000 | 0.0000 |
| 2 | –0.0006 | 0.0604 | 1.0000 | 0.0198 | 0.0668 | 1.0000 | 0.0000 |
| Random Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0016 | 0.0589 | 0.3614 | –0.0003 | 0.0648 | 0.3240 | 0.0374 |
| 1.2 | –0.0012 | 0.0572 | 0.8786 | 0.0031 | 0.0634 | 0.8282 | 0.0504 |
| 1.5 | –0.0011 | 0.0536 | 1.0000 | 0.0091 | 0.0605 | 1.0000 | 0.0000 |
| 2 | –0.0008 | 0.0502 | 1.0000 | 0.0184 | 0.0582 | 1.0000 | 0.0000 |
Abbreviations: Diff in Power, difference in power (Cox − logistic); HR, hazard ratio; OR, odds ratio; RAF, risk allele frequency; SE, standard error of the logarithm of hazard or odds ratio. Bias refers to either the estimated log(HR) or log(OR) minus the underlying log(HR). It is important to note that HRs and ORs are different measures of comparison and as such ORs are not ’biased‘ if different from the underlying HR. This table is a subset of Supplementary Table S3.
Figure 1Difference in power between the Cox and logistic regression models for an SNP with a risk allele frequency of 10% for the cohort study design. The red, blue and green lines represent the sample sizes 5000, 10 000 and 25 000, respectively. Complete (a), Survey (b) and Random (c) are the types of follow-up and 5, 10 and 15% are the cumulative disease incidences.
Simulation results for case-cohort studies with sampling fraction of 10% from 40,000 individuals for a SNP with RAF=0.10.
| Complete Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0007 | 0.0635 | 0.3284 | 0.0021 | 0.0647 | 0.3318 | –0.0034 |
| 1.2 | –0.0002 | 0.0622 | 0.8308 | 0.0053 | 0.0634 | 0.8392 | –0.0084 |
| 1.5 | 0.0010 | 0.0596 | 1.0000 | 0.0143 | 0.0608 | 1.0000 | 0.0000 |
| 2 | 0.0007 | 0.0582 | 1.0000 | 0.0259 | 0.0588 | 1.0000 | 0.0000 |
| Complete Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | 0.0001 | 0.0524 | 0.4480 | 0.0054 | 0.0542 | 0.4580 | –0.0100 |
| 1.2 | 0.0006 | 0.0517 | 0.9430 | 0.0119 | 0.0536 | 0.9522 | –0.0092 |
| 1.5 | –0.0001 | 0.0505 | 1.0000 | 0.0270 | 0.0524 | 1.0000 | 0.0000 |
| 2 | 0.0004 | 0.0503 | 1.0000 | 0.0531 | 0.0518 | 1.0000 | 0.0000 |
| Complete Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | 0.0011 | 0.0484 | 0.5126 | 0.0098 | 0.0510 | 0.5402 | –0.0276 |
| 1.2 | –0.0008 | 0.0481 | 0.9674 | 0.0161 | 0.0507 | 0.9782 | –0.0108 |
| 1.5 | 0.0003 | 0.0475 | 1.0000 | 0.0418 | 0.0504 | 1.0000 | 0.0000 |
| 2 | 0.0022 | 0.0479 | 1.0000 | 0.0816 | 0.0507 | 1.0000 | 0.0000 |
| Survey Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | 0.0002 | 0.0659 | 0.3120 | 0.0027 | 0.0643 | 0.3378 | –0.0258 |
| 1.2 | 0.0008 | 0.0647 | 0.8050 | 0.0049 | 0.0631 | 0.8402 | –0.0352 |
| 1.5 | –0.0002 | 0.0624 | 1.0000 | 0.0094 | 0.0607 | 1.0000 | 0.0000 |
| 2 | 0.0015 | 0.0612 | 1.0000 | 0.0195 | 0.0590 | 1.0000 | 0.0000 |
| Survey Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0004 | 0.0569 | 0.3786 | 0.0026 | 0.0541 | 0.4350 | –0.0564 |
| 1.2 | 0.0009 | 0.0563 | 0.9038 | 0.0076 | 0.0536 | 0.9422 | –0.0384 |
| 1.5 | 0.0007 | 0.0553 | 1.0000 | 0.0171 | 0.0525 | 1.0000 | 0.0000 |
| 2 | 0.0024 | 0.0551 | 1.0000 | 0.0334 | 0.0519 | 1.0000 | 0.0000 |
| Survey Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | 0.0019 | 0.0542 | 0.4332 | 0.0064 | 0.0510 | 0.5130 | –0.0798 |
| 1.2 | –0.0005 | 0.0538 | 0.9294 | 0.0089 | 0.0507 | 0.9682 | –0.0388 |
| 1.5 | 0.0016 | 0.0533 | 1.0000 | 0.0230 | 0.0504 | 1.0000 | 0.0000 |
| 2 | 0.0008 | 0.0535 | 1.0000 | 0.0424 | 0.0506 | 1.0000 | 0.0000 |
| Random Follow-up, 5% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0007 | 0.0681 | 0.2864 | –0.0001 | 0.0645 | 0.3212 | –0.0348 |
| 1.2 | 0.0005 | 0.0670 | 0.7768 | 0.0030 | 0.0633 | 0.8334 | –0.0566 |
| 1.5 | –0.0005 | 0.0648 | 1.0000 | 0.0067 | 0.0609 | 1.0000 | 0.0000 |
| 2 | 0.0007 | 0.0639 | 1.0000 | 0.0143 | 0.0593 | 1.0000 | 0.0000 |
| Random Follow-up, 10% Cumulative Disease Incidence | |||||||
| 1.1 | 0.0006 | 0.0599 | 0.3536 | 0.0021 | 0.0543 | 0.4290 | –0.0754 |
| 1.2 | –0.0005 | 0.0593 | 0.8756 | 0.0037 | 0.0537 | 0.9398 | –0.0642 |
| 1.5 | 0.0004 | 0.0583 | 1.0000 | 0.0097 | 0.0526 | 1.0000 | 0.0000 |
| 2 | 0.0004 | 0.0583 | 1.0000 | 0.0192 | 0.0520 | 1.0000 | 0.0000 |
| Random Follow-up, 15% Cumulative Disease Incidence | |||||||
| 1.1 | –0.0002 | 0.0574 | 0.3740 | 0.0016 | 0.0509 | 0.4726 | –0.0986 |
| 1.2 | 0.0021 | 0.0571 | 0.9062 | 0.0055 | 0.0507 | 0.9578 | –0.0516 |
| 1.5 | 0.0004 | 0.0566 | 1.0000 | 0.0112 | 0.0503 | 1.0000 | 0.0000 |
| 2 | 0.0031 | 0.0571 | 1.0000 | 0.0199 | 0.0505 | 1.0000 | 0.0000 |
Abbreviations: Diff in Power, difference in power (Cox − logistic); HR, hazard ratio; OR, odds ratio; RAF, risk allele frequency; SE, standard error of the logarithm of hazard or odds ratio. Bias refers to either the estimated log(HR) or log(OR) minus the underlying log(HR). It is important to note that HRs and ORs are different measures of comparison and as such ORs are not 'biased' if different from the underlying HR. The Cox model was Prentice weighted and robust SEs were applied to account for the sampling process. This table is a subset of Supplementary Table S5.
Figure 2Difference in power between the Cox and logistic regression models for an SNP with a risk allele frequency of 10% for the case-cohort study design. The red, blue and green lines represent the sampling fractions of 5, 10 and 15%, respectively. Complete (a), Survey (b) and Random (c) are the types of follow-up and 5, 10 and 15% are the cumulative disease incidences.
Results of 25 SNPs previously reported to be associated with CHD in EPIC-CVD
| N (N | P | P | N (N | P | P | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| rs11206510 | chr1:g.55496039T>C | 11 812 (437) | 0.058 | 0.088 | 0.509 | 0.052 | 0.092 | 0.572 | 18 807 (7432) | 0.069 | 0.036 | 0.054 | 0.073 | 0.031 | 0.021 |
| rs4299376 | chr2:g.44072576T>G | 11 697 (436) | −0.020 | 0.073 | 0.785 | −0.011 | 0.077 | 0.885 | 18 653 (7392) | −0.058 | 0.030 | 0.053 | −0.043 | 0.026 | 0.098 |
| rs2252641 | chr2:g.145801461T>C | 11 799 (437) | −0.130 | 0.068 | 0.057 | −0.138 | 0.072 | 0.054 | 18 785 (7423) | −0.019 | 0.028 | 0.508 | −0.012 | 0.025 | 0.639 |
| rs2306374 | chr3:g.138119952T>C | 11 812 (437) | −0.116 | 0.090 | 0.199 | −0.125 | 0.095 | 0.189 | 18 806 (7431) | 0.008 | 0.038 | 0.843 | −0.008 | 0.033 | 0.821 |
| rs273909 | chr5:g.131667353A>G | 11 789 (434) | 0.012 | 0.107 | 0.909 | −0.001 | 0.112 | 0.996 | 18 757 (7402) | −0.049 | 0.043 | 0.260 | −0.065 | 0.038 | 0.083 |
| rs2706399 | chr5:g.131867702A>G | 11 812 (437) | −0.040 | 0.068 | 0.552 | −0.057 | 0.071 | 0.428 | 18 807 (7431) | 0.028 | 0.028 | 0.319 | −0.015 | 0.025 | 0.556 |
| rs9349379 | chr6:g.12903957A>G | 11 811 (437) | −0.127 | 0.069 | 0.066 | −0.133 | 0.073 | 0.069 | 18 806 (7432) | −0.093 | 0.029 | 0.001 | −0.123 | 0.025 | 9.00E-07 |
| rs2048327 | chr6:g.160863532T>C | 11 813 (437) | 0.042 | 0.071 | 0.553 | 0.042 | 0.075 | 0.578 | 18 808 (7432) | −0.083 | 0.030 | 0.005 | −0.092 | 0.026 | 4.00E-04 |
| rs2023938 | chr7:g.19036775T>C | 11 809 (437) | 0.000 | 0.112 | 1.000 | 0.001 | 0.118 | 0.992 | 18 802 (7430) | −0.018 | 0.046 | 0.691 | −0.043 | 0.040 | 0.286 |
| rs10953541 | chr7:g.107244545C>T | 11 810 (437) | 0.061 | 0.081 | 0.450 | 0.065 | 0.084 | 0.438 | 18 805 (7432) | −0.009 | 0.033 | 0.792 | 0.015 | 0.029 | 0.607 |
| rs264 | chr8:g.19813180G>A | 11 812 (437) | 0.140 | 0.098 | 0.151 | 0.141 | 0.102 | 0.168 | 18 807 (7432) | 0.079 | 0.039 | 0.040 | 0.076 | 0.035 | 0.029 |
| rs10808546 | chr8:g.126495818C>T | 11 787 (434) | 0.033 | 0.070 | 0.632 | 0.034 | 0.073 | 0.642 | 18 769 (7416) | 0.042 | 0.028 | 0.142 | 0.036 | 0.025 | 0.149 |
| rs3217992 | chr9:g.22003223C>T | 11 813 (437) | −0.034 | 0.070 | 0.633 | −0.036 | 0.074 | 0.629 | 18 808 (7432) | −0.114 | 0.029 | 8.00E-05 | −0.097 | 0.025 | 1.00E-04 |
| rs2047009 | chr10:g.44539913G>T | 11 811 (437) | 0.104 | 0.068 | 0.130 | 0.106 | 0.072 | 0.140 | 18 797 (7432) | 0.055 | 0.028 | 0.049 | 0.072 | 0.024 | 0.003 |
| rs1746048 | chr10:g.44775824C>T | 11 811 (437) | 0.101 | 0.104 | 0.329 | 0.110 | 0.108 | 0.309 | 18 805 (7432) | 0.060 | 0.042 | 0.146 | 0.062 | 0.036 | 0.082 |
| rs11203042 | chr10:g.90989109C>T | 11 813 (437) | 0.034 | 0.069 | 0.618 | 0.037 | 0.072 | 0.610 | 18 808 (7432) | 0.014 | 0.028 | 0.612 | 0.005 | 0.025 | 0.829 |
| rs1412444 | chr10:g.91002927C>T | 11 812 (437) | −0.118 | 0.071 | 0.097 | −0.128 | 0.075 | 0.086 | 18 806 (7431) | −0.001 | 0.030 | 0.962 | −0.041 | 0.026 | 0.111 |
| rs974819 | chr11:g.103660567C>T | 11 324 (424) | −0.123 | 0.073 | 0.091 | −0.132 | 0.077 | 0.086 | 18 163 (7263) | −0.082 | 0.031 | 0.007 | −0.080 | 0.027 | 0.003 |
| rs964184 | chr11:g.116648917C>G | 11 811 (437) | 0.126 | 0.103 | 0.221 | 0.128 | 0.108 | 0.236 | 18 806 (7432) | −0.047 | 0.040 | 0.241 | −0.063 | 0.034 | 0.065 |
| rs3184504 | chr12:g.111884608C>T | 11 800 (437) | 0.049 | 0.068 | 0.472 | 0.054 | 0.071 | 0.447 | 18 755 (7392) | −0.003 | 0.028 | 0.919 | −0.014 | 0.025 | 0.560 |
| rs9319428 | chr13:g.28973621G>A | 11 808 (437) | −0.158 | 0.072 | 0.027 | −0.165 | 0.075 | 0.028 | 18 800 (7429) | −0.050 | 0.030 | 0.098 | −0.057 | 0.026 | 0.030 |
| rs4380028 | chr15:g.79111093C>T | 11 812 (437) | 0.082 | 0.070 | 0.241 | 0.088 | 0.073 | 0.229 | 18 807 (7432) | 0.086 | 0.029 | 0.003 | 0.065 | 0.025 | 0.010 |
| rs17514846 | chr15:g.91416550C>A | 11 811 (437) | −0.115 | 0.068 | 0.091 | −0.112 | 0.072 | 0.119 | 18 796 (7422) | −0.037 | 0.028 | 0.187 | −0.068 | 0.025 | 0.006 |
| rs6511720 | chr19:g.11202306G>T | 11 813 (437) | −0.067 | 0.105 | 0.525 | −0.063 | 0.110 | 0.568 | 18 807 (7432) | 0.062 | 0.044 | 0.156 | 0.067 | 0.039 | 0.089 |
| rs2075650 | chr19:g.45395619A>G | 11 813 (437) | −0.093 | 0.099 | 0.347 | −0.099 | 0.104 | 0.340 | 18 808 (7432) | −0.120 | 0.041 | 0.003 | −0.133 | 0.036 | 2.00E-04 |
N, number of individuals; Build 37 chr:pos, dbSNP (hg19) chromosome:position (risk allele>other allele); β is the logarithm of the association measure (either odds ratio or hazard ratio); SE, standard errors of the logarithm of the association measure (either odds ratio or hazard ratio). In the full case-cohort, the Cox model was Prentice weighted. Alleles are aligned to the + strand.