| Literature DB >> 24498600 |
Kyoko Shibata1, Atsushi Hozawa2, Gen Tamiya3, Masao Ueki3, Tomohiro Nakamura3, Hiroto Narimatsu4, Isao Kubota5, Yoshiyuki Ueno6, Takeo Kato7, Hidetoshi Yamashita8, Akira Fukao9, Takamasa Kayama10.
Abstract
The impact of cryptic relatedness (CR) on genomic association studies is well studied and known to inflate false-positive rates as reported by several groups. In contrast, conventional epidemiological studies for environmental risks, the confounding effect of CR is still uninvestigated. In this study, we investigated the confounding effect of unadjusted CR among a rural cohort in the relationship between environmental risk factors (body mass index, smoking status, alcohol consumption) and systolic blood pressure. We applied the methods of population-based whole-genome association studies for the analysis of the genome-wide single nucleotide polymorphism data in 1622 subjects, and detected 20.2% CR in this cohort population. In the case of the sample size, approximately 1000, the ratio of CR to the population was 20.2%, the population prevalence 25%, the prevalence in the CR 26%, heritability for liability 14.3% and prevalence in the subpopulation without CR 26%, the difference of estimated regression coefficient between samples with and without CR was not significant (P-value = 0.55). On the other hand, in another case with approximately >20% heritability for liability, we showed that confounding due to CR biased the estimation of exposure effects.Entities:
Keywords: confounding effect; cryptic relatedness; systolic blood pressure
Year: 2013 PMID: 24498600 PMCID: PMC3893157 DOI: 10.1002/mgg3.4
Source DB: PubMed Journal: Mol Genet Genomic Med ISSN: 2324-9269 Impact factor: 2.183
Figure 1Plot of the relationship between total 1622 subjects with an identity by descent (IBD) probability with regard to an identity by state (IBS) distance; y-axis and x-axis describe IBD probability and IBS distance, respectively.
Figure 2Plot of the relationship between 1291 subjects with an identity by descent (IBD) probability with regard to an identity by state (IBS) distance after which is removed a cryptic relatedness of 326 subjects with an IBD probability >1/4 (i.e., monozygotic twins, dizygotic twins, full-sibs, parent-offspring, half-siblings, grandparent, grandchild, aunt/uncle, and niece/nephew); y-axis and x-axis describe IBD probability and IBS distance, respectively.
Results of the regression coefficients between systolic blood pressure and environmental risk factors in the sample with and without cryptic relatedness (CR): sample size 1039, ratio of CR to the population 20.2%, population prevalence 25%, heritability for liability 14.3%, prevalence in the subpopulation without CR 26%, prevalence in CR 26%
| Estimated | Standard error | Pr(>| | ||
|---|---|---|---|---|
| Intercept | ||||
| Sample with CR | 82.52 | 5.15 | 16.03 | <2e−16 |
| Sample without CR | 77.11 | 5.49 | 14.06 | <2e−16 |
| BMI | ||||
| Sample with CR | 1.31 | 0.14 | 9.30 | <2e−16 |
| Sample without CR | 1.36 | 0.16 | 8.34 | 3.23e−16 |
| Alcohol consumption | ||||
| Sample with CR | −0.80 | 1.51 | −0.53 | 0.60 |
| Sample without CR | −0.42 | 1.61 | −0.26 | 0.79 |
| Smoking status | ||||
| Sample with CR | −0.66 | 0.71 | −0.93 | 0.35 |
| Sample without CR | −0.02 | 1.05 | −0.02 | 0.98 |
| Age | ||||
| Sample with CR | 0.38 | 0.07 | 5.43 | 6.86e−08 |
| Sample without CR | 0.41 | 0.05 | 7.99 | 4.62e−15 |
| Gender | ||||
| Sample with CR | −3.09 | 1.04 | −2.97 | 0.003 |
| Sample without CR | −2.46 | 1.01 | −2.44 | 0.01 |
Size of sample with CR was 1039 subjects. From the sampling data with CR, equation (1) in Results were found. Adjusted R-squared = 0.14.
Size of sample without CR was 829 subjects. From the sampling data without CR, equation (2) in Results were found. Adjusted R-squared = 0.15.
Results of the regression coefficients between systolic blood pressure and environmental risk factors in the sample with and without cryptic relatedness (CR): sample size 400, ratio of CR to the population 52.5%, population prevalence 40%, heritability for liability 24.2%, prevalence in the subpopulation without CR 55%, prevalence in CR 26%
| Estimated | Standard error | Pr(>| | ||
|---|---|---|---|---|
| Intercept | ||||
| Sample with CR | 141.04 | 5.52 | 25.56 | <2e−16 |
| Sample without CR | 145.97 | 1.67 | 87.40 | <2e−16 |
| BMI | ||||
| Sample with CR | −0.12 | 0.15 | −0.76 | 0.45 |
| Sample without CR | 0.04 | 0.05 | 0.79 | 0.43 |
| Alcohol consumption | ||||
| Sample with CR | −6.11 | 1.81 | −3.37 | 0.0008 |
| Sample without CR | 1.28 | 0.56 | 2.30 | 0.022 |
| Smoking status | ||||
| Sample with CR | 5.34 | 0.71 | 7.57 | 2.69e−13 |
| Sample without CR | −0.39 | 0.25 | −1.59 | 0.11 |
| Age | ||||
| Sample with CR | −0.040 | 0.082 | −0.48 | 0.63 |
| Sample without CR | −0.13 | 0.02 | −5.47 | 1.44e−07 |
| Gender | ||||
| Sample with CR | −1.24 | 1.13 | −1.10 | 0.27 |
| Sample without CR | −0.92 | 0.32 | −2.86 | 0.004 |
Size of sample with CR was 400 subjects. From the sampling data with CR, equation (3) in Results were found. Adjusted R-squared = 0.25.
Size of sample without CR was 190 subjects. From the sampling data without CR, equation (4) in Results were found. Adjusted R-squared = 0.16.
Results of the regression coefficients between systolic blood pressure and environmental risk factors in the sample with and without cryptic relatedness (CR): sample size 400, ratio of CR to the population 52.5%, population prevalence 50%, heritability for liability 14.3%, prevalence in the subpopulation without CR 76.3%, prevalence in CR 26%
| Estimated | Standard error | Pr(>| | ||
|---|---|---|---|---|
| Intercept | ||||
| Sample with CR | 135.37 | 5.77 | 23.47 | <2e−16 |
| Sample without CR | 136.68 | 2.09 | 65.34 | <2e−16 |
| BMI | ||||
| Sample with CR | 0.07 | 0.16 | 0.46 | 0.64 |
| Sample without CR | 0.06 | 0.05 | 1.11 | 0.27 |
| Alcohol consumption | ||||
| Sample with CR | −12.34 | 2.00 | −6.16 | 1.77e−09 |
| Sample without CR | −7.75 | 0.81 | −9.54 | <2e−16 |
| Smoking status | ||||
| Sample with CR | 4.15 | 0.74 | 5.62 | 3.60e−08 |
| Sample without CR | −2.32 | 0.32 | −7.23 | 1.24e−11 |
| Age | ||||
| Sample with CR | 0.17 | 0.09 | 1.89 | 0.06 |
| Sample without CR | 0.27 | 0.03 | 7.84 | 3.56e−13 |
| Gender | ||||
| Sample with CR | −1.13 | 1.18 | −0.95 | 0.34 |
| Sample without CR | 0.11 | 0.41 | 0.27 | 0.79 |
Size of sample with CR was 400 subjects. From the sampling data with CR, equation (5) in Results were found. Adjusted R-squared = 0.24.
Size of sample without CR was 190 subjects. From the sampling data without CR, equation (6) in Results were found. Adjusted R-squared = 0.42.
Results of the regression coefficients between systolic blood pressure and environmental risk factors in the sample with and without cryptic relatedness (CR): sample size 500, ratio of CR to the population 42%, population prevalence 40%, heritability for liability 22.1%, prevalence in the subpopulation without CR 50%, prevalence in CR 26%
| Estimated | Standard error | Pr(>| | ||
|---|---|---|---|---|
| Intercept | ||||
| Sample with CR | 137.18 | 4.91 | 27.92 | <2e−16 |
| Sample without CR | 141.55 | 2.99 | 47.26 | <2e−16 |
| BMI | ||||
| Sample with CR | 0.04 | 0.14 | 0.29 | 0.77 |
| Sample without CR | 0.09 | 0.08 | 1.11 | 0.27 |
| Alcohol consumption | ||||
| Sample with CR | −9.47 | 1.59 | −5.94 | 5.47e−09 |
| Sample without CR | −5.34 | 1.00 | −5.35 | 4.26e−07 |
| Smoking status | ||||
| Sample with CR | 3.47 | 0.64 | 5.45 | 8.10e−08 |
| Sample without CR | −2.30 | 0.44 | −5.18 | 4.26e−11 |
| Age | ||||
| Sample with CR | 0.10 | 0.07 | 1.34 | 0.18 |
| Sample without CR | 0.11 | 0.05 | 2.39 | 0.018 |
| Gender | ||||
| Sample with CR | −0.95 | 1.10 | −0.94 | 0.35 |
| Sample without vCR | −1.10 | 0.60 | −1.83 | 0.07 |
Size of sample with CR was 500 subjects. From the sampling data with CR, equation (7) in Results were found. Adjusted R-squared = 0.19.
Size of sample without CR was 290 subjects. From the sampling data without CR, equation (8) in Results were found. Adjusted R-squared = 0.16.
Results of the regression coefficients between systolic blood pressure and environmental risk factors in the sample with and without cryptic relatedness (CR): sample size 500, ratio of CR to the population 42%, population prevalence 50%, heritability for liability 31.7%, prevalence in the subpopulation without CR 67.2%, prevalence in CR 26%
| Estimated | Standard error | Pr(>| | ||
|---|---|---|---|---|
| Intercept | ||||
| Sample with CR | 130.15 | 5.51 | 23.62 | <2e−16 |
| Sample without CR | 128.16 | 3.92 | 32.73 | <2e−16 |
| BMI | ||||
| Sample with CR | 0.17 | 0.15 | 1.13 | 0.26 |
| Sample without CR | 0.22 | 0.10 | 2.13 | 0.03 |
| Alcohol consumption | ||||
| Sample with CR | −16.61 | 1.67 | −9.95 | <2e−16 |
| Sample without CR | −8.11 | 1.24 | −6.53 | 2.99e−10 |
| Smoking status | ||||
| Sample with CR | 2.58 | 0.71 | 3.63 | 0.0003 |
| Sample without CR | −2.87 | 0.65 | −4.40 | 1.53e−05 |
| Age | ||||
| Sample with CR | 0.39 | 0.08 | 4.91 | 1.21e−06 |
| Sample without CR | 0.42 | 0.05 | 7.72 | 1.99e−13 |
| Gender | ||||
| Sample with CR | −1.44 | 1.09 | −1.33 | 0.19 |
| Sample without CR | −1.29 | 0.72 | −1.79 | 0.08 |
Size of sample with CR was 500 subjects. From the sampling data with CR, equation (9) in Results were found. Adjusted R-squared = 0.21.
Size of sample without CR was 290 subjects. From the sampling data without CR, equation (10) in Results were found. Adjusted R-squared = 0.24.