| Literature DB >> 18360741 |
Lee M Butcher1, Robert Plomin.
Abstract
Widely used measures of the environment, especially the family environment of children, show genetic influence in dozens of twin and adoption studies. This phenomenon is known as gene-environment correlation in which genetically driven influences of individuals affect their environments. We conducted the first genome-wide association (GWA) analysis of an environmental measure. We used a measure called CHAOS which assesses 'environmental confusion' in the home, a measure that is more strongly associated with cognitive development in childhood than any other environmental measure. CHAOS was assessed by parental report when the children were 3 years and again when the children were 4 years; a composite CHAOS measure was constructed across the 2 years. We screened 490,041 autosomal single-nucleotide polymorphisms (SNPs) in a two-stage design in which children in low chaos families (N = 469) versus high chaos families (N = 369) from 3,000 families of 4-year-old twins were screened in Stage 1 using pooled DNA. In Stage 2, following SNP quality control procedures, 41 nominated SNPs were tested for association with family chaos by individual genotyping an independent representative sample of 3,529. Despite having 99% power to detect associations that account for more than 0.5% of the variance, none of the 41 nominated SNPs met conservative criteria for replication. Similar to GWA analyses of other complex traits, it is likely that most of the heritable variation in environmental measures such as family chaos is due to many genes of very small effect size.Entities:
Mesh:
Year: 2008 PMID: 18360741 PMCID: PMC2480594 DOI: 10.1007/s10519-008-9198-z
Source DB: PubMed Journal: Behav Genet ISSN: 0001-8244 Impact factor: 2.805
Fig. 1A histogram illustrating the distribution of absolute allele frequency differences between low and high CHAOS groups derived through pooled DNA on microarrays. The y-axis indicates the number of SNPs and x-axis shows absolute allele frequency differences between low and high CHAOS groups. The figure shows that the vast majority of allele frequency differences are small and that the mean allele frequency between low and high CHAOS groups is about 0.027. The x-axis is elongated to accommodate outliers, which are a logical source of candidate SNPs to follow up. The total number of SNPs is 448,944 because SNPs represented by fewer than 6 out of 10 replicates were removed
Fig. 2A scatterplot showing the 48 top-ranked SNPs (crosses) against the background of 448,994 unselected SNPs comparing allele frequencies for the low CHAOS group (x-axis) and the high CHAOS group (y-axis). The figure also displays the density of SNPs as a function of low CHAOS versus high CHAOS allele frequency differences; density of SNP clusters increases as the heat map changes from light grey (sparse clusters) though to dark grey (dense clusters). Allele frequency differences are small with the majority of small differences occurring for SNPs with minor allele frequencies of 0.10–0.25, which reflects the representation of SNPs with these allele frequencies on the Affymetrix microarray. The correlation between low and high CHAOS allele frequencies was 0.992 indicating high reliability of the rank order of allele frequencies across the low and high CHAOS groups
Summary of Stage 1 and Stage 2 genotyping for the 48 SNPs selected by ranked composite score using pooled DNA to screen the genome
| SNP information | Stage 1 (SNP-MaP) | Stage 2 (individual genotyping) | Correct direction? | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AffyID | dbSNP RS ID | Chr | Genomic relation | Strand | Alleles | Low | High | L−H diff | Strand | QTL | QTL |
| H−W | |
| SNP_A-4295547 | rs10001415 | 4p15.1 | Intergenic | – | A/G | 0.63 | 0.872 | –0.242 | + | –0.003 | 0.436 | 2,794 | 0.000 | |
| SNP_A-1865002 | rs1030303 | 9q21.33 |
| + | A/T | 0.397 | 0.502 | –0.105 | + | 0.018 | 0.172 | 2,905 | 0.005 | No |
| SNP_A-1964916 | rs10496711 | 2q21.2 | Intergenic | + | C/G | 0.604 | 0.526 | 0.078 | + | –0.016 | 0.189 | 2,976 | 0.647 | No |
| SNP_A-1813456 | rs10783746 | 12q13.2 | mRNA evidence | + | A/G | 0.62 | 0.496 | 0.124 | + | 0.019 | 0.152 | 2,879 | 0.562 | Yes |
| SNP_A-2005014 | rs10884390 | 10q25.1 |
| + | A/G | 0.571 | 0.472 | 0.099 | + | 0.028 | 0.068 | 2,951 | 0.914 | Yes |
| SNP_A-1893780 | rs10915207 | 1p35.3 |
| – | A/G | 0.489 | 0.57 | –0.081 | + | 0.006 | 0.378 | 2,739 | 0.978 | |
| SNP_A-1887744 | rs11084785 | 19q13.11 | Intergenic | + | C/T | 0.551 | 0.654 | –0.103 | + | 0.007 | 0.364 | 2,703 | 0.929 | |
| SNP_A-2305187a | rs11263591 | 11q13.3 |
| – | C/G | 0.406 | 0.488 | –0.082 | + | – | – | – | – | |
| SNP_A-2049663 | rs11264825 | 1q23.1 |
| – | C/T | 0.465 | 0.528 | –0.063 | + | −0.007 | 0.356 | 2,587 | 0.971 | |
| SNP_A-1822015 | rs11950448 | 5p12 | Intergenic | – | C/G | 0.52 | 0.425 | 0.095 | + | −0.001 | 0.483 | 2,880 | 0.021 | |
| SNP_A-1921659 | rs1214451 | 10q23.31 | Intergenic | + | C/G | 0.495 | 0.596 | –0.101 | + | −0.027 | 0.072 | 2,920 | 0.969 | Yes |
| SNP_A-2112236 | rs1275051 | 11q24.1 | Intergenic | + | A/G | 0.564 | 0.479 | 0.085 | + | −0.007 | 0.351 | 2,959 | 0.595 | |
| SNP_A-1869107 | rs12820468 | 12q23.1 |
| – | A/G | 0.528 | 0.434 | 0.094 | + | −0.031 |
| 2,966 | 0.810 | Yes |
| SNP_A-1957323 | rs13262547 | 8q13.2 | mRNA evidence | + | G/T | 0.556 | 0.449 | 0.107 | + | 0.005 | 0.402 | 2,880 | 0.997 | |
| SNP_A-2161686 | rs1414314 | 1p32.1 | Intergenic | – | A/G | 0.45 | 0.342 | 0.108 | + | 0.038 | 0.023 | 2,712 | 0.416 | No |
| SNP_A-2072154 | rs1501630 | 3q24 | mRNA evidence | + | C/T | 0.451 | 0.531 | –0.08 | – | −0.015 | 0.207 | 2,916 | 0.998 | No |
| SNP_A-2234256 | rs1667301 | 2q37.1 | mRNA evidence | – | C/T | 0.488 | 0.642 | –0.154 | – | 0.004 | 0.406 | 2,960 | 1.000 | |
| SNP_A-2183115 | rs17203935 | 18q11.2 | mRNA evidence | – | G/T | 0.625 | 0.526 | 0.099 | + | 0.015 | 0.215 | 2,742 | 0.288 | No |
| SNP_A-1804934 | rs17221652 | 10p15.3 |
| + | C/G | 0.551 | 0.473 | 0.078 | + | 0.022 | 0.124 | 2,701 | 0.162 | Yes |
| SNP_A-1915452 | rs1861594 | 12p12.3 |
| + | A/G | 0.439 | 0.337 | 0.102 | + | 0.022 | 0.118 | 2,949 | 0.125 | Yes |
| SNP_A-2269742 | rs2043478 | 5q31.2 |
| + | A/G | 0.598 | 0.471 | 0.127 | + | –0.005 | 0.397 | 2,716 | 0.739 | |
| SNP_A-2229263 | rs289647 | 10p11.21 | Intergenic | + | A/G | 0.55 | 0.394 | 0.156 | + | 0.004 | 0.414 | 2,854 | 0.471 | |
| SNP_A-2252700 | rs297255 | 10q26.13 | mRNA evidence | + | A/G | 0.511 | 0.616 | –0.105 | – | −0.019 | 0.150 | 2,948 | 1.000 | No |
| SNP_A-1955113 | rs316191 | 5q15 |
| – | G/T | 0.408 | 0.488 | –0.08 | – | 0.001 | 0.468 | 2,941 | 0.441 | |
| SNP_A-1828397 | rs318931 | 10q26.3 | Intergenic | – | A/T | 0.613 | 0.526 | 0.087 | + | −0.014 | 0.220 | 2,897 | 0.043 | Yes |
| SNP_A-2025774 | rs333785 | 11p12 | Intergenic | + | G/T | 0.397 | 0.51 | –0.113 | + | 0.001 | 0.482 | 2,720 | 0.208 | |
| SNP_A-1800569a | rs3843872 | 3q13.13 |
| + | C/G | 0.451 | 0.562 | –0.111 | – | – | – | – | – | |
| SNP_A-2239462 | rs392670 | 3p26.3 |
| – | A/T | 0.521 | 0.431 | 0.09 | – | −0.014 | 0.227 | 2,958 | 0.556 | No |
| SNP_A-1809780 | rs4637723 | 7p12.3 | Intergenic | – | A/C | 0.497 | 0.573 | –0.076 | + | −0.026 | 0.090 | 2,734 | 0.848 | No |
| SNP_A-2161472 | rs4761289 | 12q15 | Intergenic | – | A/C | 0.293 | 0.437 | –0.144 | + | −0.032 | 0.046 | 2,792 | 0.162 | No |
| SNP_A-2086969 | rs4766691 | 12q24.13 | Intergenic | – | C/T | 0.285 | 0.47 | –0.185 | + | 0.001 | 0.478 | 2,697 | 0.072 | |
| SNP_A-2196642 | rs4787125 | 16p13.2 | Intergenic | + | A/G | 0.622 | 0.515 | 0.107 | + | 0.010 | 0.285 | 2,967 | 0.602 | Yes |
| SNP_A-1974629a | rs4839628 | 3q24 |
| – | A/G | 0.662 | 0.563 | 0.099 | + | – | – | – | – | |
| SNP_A-2106553 | rs4853578 | 2q32.3 |
| + | A/G | 0.378 | 0.56 | –0.182 | + | 0.014 | 0.228 | 2,699 | 0.986 | No |
| SNP_A-4241706 | rs538385 | 13q12.3 | Intergenic | + | C/T | 0.442 | 0.538 | –0.096 | + | 0.026 | 0.087 | 2,716 | 0.965 | No |
| SNP_A-2281229 | rs6048344 | 20p11.21 | Intergenic | + | C/G | 0.562 | 0.48 | 0.082 | + | 0.001 | 0.483 | 2,911 | 0.535 | |
| SNP_A-2092290 | rs6130222 | 20q12 |
| – | C/G | 0.444 | 0.556 | –0.112 | + | 0.011 | 0.275 | 2,950 | 0.605 | Yes |
| SNP_A-2020710 | rs619689 | 11q23.2 | Intergenic | + | C/G | 0.553 | 0.496 | 0.057 | – | 0.010 | 0.290 | 2,964 | 0.907 | No |
| SNP_A-2283323 | rs6583654 | 8q24.3 | Intergenic | + | C/T | 0.583 | 0.47 | 0.113 | + | 0.009 | 0.311 | 2,738 | 0.266 | |
| SNP_A-2133000 | rs6751663 | 2p14 | Intergenic | – | C/G | 0.377 | 0.487 | –0.11 | + | −0.006 | 0.368 | 2,967 | 0.087 | |
| SNP_A-2035401 | rs6776618 | 3q28 | Intergenic | + | C/G | 0.589 | 0.468 | 0.121 | + | −0.008 | 0.334 | 2,911 | 0.514 | |
| SNP_A-2277540 | rs7122693 | 11q23.2 |
| + | A/G | 0.465 | 0.606 | –0.141 | + | −0.005 | 0.387 | 2,734 | 0.751 | |
| SNP_A-2265151 | rs7233747 | 18q23 | Intergenic | + | C/T | 0.562 | 0.677 | –0.115 | + | 0.028 | 0.062 | 2,941 | 0.999 | No |
| SNP_A-2178721 | rs7873355 | 9p23 | Intergenic | + | C/G | 0.551 | 0.414 | 0.137 | + | −0.003 | 0.432 | 2,952 | 0.277 | |
| SNP_A-2270656 | rs7970012 | 12q24.32 | mRNA evidence | – | A/G | 0.606 | 0.49 | 0.116 | + | −0.015 | 0.206 | 2,987 | 0.002 | Yes |
| SNP_A-2042483 | rs9533962 | 13q14.11 | Intergenic | – | A/G | 0.36 | 0.472 | –0.112 | + | −0.026 | 0.078 | 2,955 | 0.940 | No |
| SNP_A-1959377 | rs9606213 | 22q11.21 |
| + | A/G | 0.485 | 0.425 | 0.06 | + | 0.005 | 0.402 | 2,918 | 0.115 | |
| SNP_A-1785409 | rs9934231 | 16p13.13 |
| – | C/G | 0.553 | 0.668 | –0.115 | + | 0.010 | 0.286 | 2,942 | 0.690 | Yes |
These SNPs were individually genotyped in an unselected sample of 3,529 individuals in order to test the QTL hypothesis. The allele whose frequency is shown in the ‘Low’ and ‘High’ columns is the allele that is alphabetically first in the ‘Alleles’ column (e.g., A rather than G in the first row). Note that the alleles measured here (using Affymetrix microarrays) may not be the same as the alleles measured using individual genotyping (using the SNPlex™ assay) because the Affymetrix and SNPlex platforms may be based on different DNA strands (e.g., for the SNP in the first row, alleles on different strands were assessed in the Affymetrix microarray and in the SNPlex—which was of course taken into account in determining the direction of the association in Stage 1 and Stage 2. The Low–High allele frequency difference (‘l-h’) shown in the 9th column was calculated by subtracting the allele frequency of the High CHAOS group from the Low CHAOS group. ‘QTL r’ (column 11) is the correlation between the composite CHAOS scores and additive genotypic values (0, 1, 2) as scored in the direction of high CHAOS as predicted by Stage 1. For example, for the SNP in row 1, because the frequency of the A allele was higher in the high CHAOS group in Stage 1, the genotypic values in Stage 2 were scored as follows: GG = 0, AG = 1, AA = 2). QTL p (column 12) is the p value of the QTL r for the unselected sample of individuals (N shown in column 13)
The last column (‘correct direction’) column indicates whether the direction of allele frequency differences is the same in Stage 2 as it was in Stage 1. We did not compare directions of effect between Stage 1 and Stage 2 for SNPs yielding correlations in the range –0.01 and 0.01 in Stage 2.
aAssay failure: no data available
Note. Value in bold for SNP rs12820468 is the only SNP yielding a nominally significant association in the unselected independent sample in Stage 2
Fig. 3Genotype-by-phenotype plot for SNP rs12820468 illustrating the effect of genotype (x-axis) on standardized CHAOS scores (y-axis). The best-fitting genetic model was additive despite the apparent effect of dominance