Literature DB >> 21694764

Educational attainment: a genome wide association study in 9538 Australians.

Nicolas W Martin1, Sarah E Medland, Karin J H Verweij, S Hong Lee, Dale R Nyholt, Pamela A Madden, Andrew C Heath, Grant W Montgomery, Margaret J Wright, Nicholas G Martin.   

Abstract

BACKGROUND: Correlations between Educational Attainment (EA) and measures of cognitive performance are as high as 0.8. This makes EA an attractive alternative phenotype for studies wishing to map genes affecting cognition due to the ease of collecting EA data compared to other cognitive phenotypes such as IQ.
METHODOLOGY: In an Australian family sample of 9538 individuals we performed a genome-wide association scan (GWAS) using the imputed genotypes of ∼2.4 million single nucleotide polymorphisms (SNP) for a 6-point scale measure of EA. Top hits were checked for replication in an independent sample of 968 individuals. A gene-based test of association was then applied to the GWAS results. Additionally we performed prediction analyses using the GWAS results from our discovery sample to assess the percentage of EA and full scale IQ variance explained by the predicted scores.
RESULTS: The best SNP fell short of having a genome-wide significant p-value (p = 9.77×10(-7)). In our independent replication sample six SNPs among the top 50 hits pruned for linkage disequilibrium (r(2)<0.8) had a p-value<0.05 but only one of these SNPs survived correction for multiple testing--rs7106258 (p = 9.7*10(-4)) located in an intergenic region of chromosome 11q14.1. The gene based test results were non-significant and our prediction analyses show that the predicted scores explained little variance in EA in our replication sample.
CONCLUSION: While we have identified a polymorphism chromosome 11q14.1 associated with EA, further replication is warranted. Overall, the absence of genome-wide significant p-values in our large discovery sample confirmed the high polygenic architecture of EA. Only the assembly of large samples or meta-analytic efforts will be able to assess the implication of common DNA polymorphisms in the etiology of EA.

Entities:  

Mesh:

Year:  2011        PMID: 21694764      PMCID: PMC3111411          DOI: 10.1371/journal.pone.0020128

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Access to education is considered to be a predictor for a wide range of later life outcomes such as employment [1], income [2], and health outcomes such as obesity [3], [4]. In addition to its relevance to economics and health, educational attainment (EA) is a measure of interest in the study of cognitive abilities/intelligence. The correlation between EA and measures of cognitive ability ranges from 0.45 up to 0.80 [5], [6], [7]. Three hypotheses of causality have been postulated to explain the underlying mechanisms of the relationship between EA and cognitive functioning, (i) intelligence/cognitive abilities are a cause of EA because intelligence is believed to be more biologically anchored than scholastic achievements [8], (ii) cognitive abilities are a product of scholastic achievement [9], [10] and (iii) basic cognitive processes such as reaction time, inspection time and memory recall partly determine both scholastic achievement and cognitive performance [11]. At present, none of these hypotheses have been discarded and it is likely that a mixture of all three plays a role in the link between EA and cognitive abilities. Additionally, a recent report suggested that the causal relationship between EA and intelligence varies according to an individual's level of intelligence [7]. Biological processes involved in scholastic achievement and cognitive performance are likely to be shared to some extent. Twin studies that take advantage of differences between genetically identical twins (monozygotic or MZ) and fraternal twins (dizygotic or DZ), have shown that EA and cognitive performance are influenced to a large extent by common genetic factors [12], [13], [14] and the heritability of EA has been estimated as high as 80% [6], [15]. Therefore, investigating the biological etiology of educational attainment could provide insights into the molecular basis of cognition because of its strong heritability and high correlation with cognitive performance. Moreover, EA has the advantage of being a measure that is much easier to collect than IQ and therefore it is more viable to assemble the large cohorts necessary to perform genome wide association scans (GWAS). Molecular genetics studies of EA have been largely limited to a candidate gene approach including the Catechol-O-methyltransferase (COMT) [16], [17], the Brain-derived neurotrophic factor (BDNF) [18] and the Dopamine receptor D4 (DRD4) [19]. The choice of these genes as candidates for EA is strongly hypothesis driven due to their characterized neurobiological functions but convincing replications are lacking in support of their hypothetical role in the etiology of EA. The advances in microarray technologies in recent years have made it possible to genotype millions of single nucleotide polymorphisms (SNPs) for a low cost (about US$500 per individual). This has led to a major shift toward a hypothesis free genome-wide association study design. A lot of GWAS studies has emerged in the literature, identifying hundreds of polymorphisms and genes associated with complex traits and diseases. In this study we present the results of a GWAS for EA, using an Australian twin family discovery sample of 9538 individuals from the general population and a further 968 individuals as an independent replication sample.

Methods

Participants

The participants for our discovery sample were drawn from two cohorts of adult twin families (cohorts 1 and 2) that have taken part in a wide range of studies of health and well-being. Individuals from Cohort 1 were born before 1964 and individuals from cohort 2 were born between 1964 and 1981. Both these cohorts have participated in previous postal questionnaire and telephone interview studies, and recruitment was extended to their relatives (parents, siblings, adult children and spouses). Our total discovery sample was composed of 9538 individuals for whom both EA and genome-wide SNP genotype data were available. Our replication sample (Cohort 3) consisted of 968 individuals who are the parents of adolescent twins participating in our melanoma risk factors and cognition studies (1996-ongoing) [20]. Information for the different studies is available in Table S1. For the prediction analysis of cognitive ability based on the EA GWAS results obtained in the discovery sample, we used one of the twin children of cohort 3. Their full scale IQ (FSIQ) was collected as part of the cognition study and genotyping data were available for 1842 adolescent twins and their siblings ranging in age from 15 to 22 years (mean = 16.28 years ±0.45 SD). Ethical approval, for the studies from which the data drawn, was obtained from the Human Research Ethics Committee of the Queensland Institute of Medical Research. Informed written consent for all measures was obtained from each participant and their parents/guardian if participants were younger than 18 years of age.

Educational attainment

Self reported educational attainment (EA) was collected as part as of questionnaires and telephone interviews. In the adult cohorts (1 and 2) three similar education scales were used to collect EA depending on the study in which an individual participated (fig. 1). Each individual score was transformed to create a new 6-level EA scale harmonised across the studies, with 1 = 7 years or less of schooling, 2 = 8 to 10 years of schooling, 3 = 8 to 10 years of schooling + apprenticeship or 11 to 12 years of schooling or 12 years of schooling + apprenticeship, 4 = teacher college or technical college, 5 = university undergraduate training and 6 = university postgraduate training (fig. 1). For a number of individuals (n = 5314) multiple reports of EA were available and the highest education level reported was selected for analysis. In cohort 3, EA was also assessed as part of a questionnaire that the parents of the adolescent twins answered while their children underwent cognitive testing. EA was recorded with one of the scales previously used in the adult cohorts (Figure 1). All individuals that were included in the GWAS analysis were at least 21 years of age (the standard age of first degree graduation). Descriptive statistics of the age and educational attainment of the participants according to their study of origin can be found in Table S1.
Figure 1

Transformation applied to the original education scales (1 to 3) to obtain a new harmonized scale across studies.

Cognitive abilities

Cognitive abilities were measured by full scale IQ (FSIQ) that was assessed using the Multi-dimensional Aptitude Battery (MAB) [21]. The MAB is a general intelligence test designed to mirror the WAIS-R [22] and it is presented in a multiple-choice format. Participants completed three verbal (information, arithmetic, vocabulary) and two performance subtests (spatial, object assembly) which were combined here to form a full-scale IQ score. Twins completed the MAB as close as possible to their 16th birthday when they came to participate in the cognition study. Further details of the IQ testing procedure have been previously published [23], [24].

Genotyping and Imputation

The genotypic data used in the current study come from a large genotyping project involving nine waves of genotyping that used three different Illumina SNP chips (Human610-Quad, HumanCNV370-Quadv3 and Human 317K) [25]. For each genotyping wave, rigorous quality control (QC) steps were applied to ensure the highest standard of the pre imputation data. Details of the QC steps have been described elsewhere [25]. Imputation of the autosomal chromosomes was performed in two stages with MACH [26] using a set of SNPs (N = 269840) common to the different Illumina chips. These data were screened for ancestry outliers. Full siblings and offspring of individuals who had been identified as ancestry outliers were excluded from the reference set used in MACH stage 1. In the first stage, the data from a set of 450 reference individuals from our set were compared to the phased haplotype data from the HapMap samples of european ancestry (CEU I+II) (release 22, build 36). These 450 reference individuals were made up of fifty unrelated individuals (with the lowest missingness) from each of the nine subsamples. Stage 1 generated recombination and error files that describe how our data relate to the HapMap data, in effect allowing us to customise the HapMap data for our population. In the second stage, data were imputed for the 17,862 individuals using the HapMap data (release 22, build 36) as the reference panel and the recombination and error files generated in stage 1 to customise the imputation. SNPs with a minor allele frequency <.01, SNPs with an Rsq imputation quality score <.3 were excluded. A panel of 2,480,163 autosomal SNPs and a panel of 13783 genotyped X-chromosome SNPs were use for association analysis.

Data analysis

Genome wide association analysis using dosage scores was performed in MERLIN offline [27] to account for family structure. The association analysis of genotyped markers on the X-chromosome were conducted in Minx (as implemented in MERLIN). Sex, year of birth (YOB), age, YOB×sex, YOB2, YOB2×sex, age2, Age×sex and Age2×Sex were used as covariates. Both YOB and age were used as covariates in this study: YOB in order to account for changes in terms of access to education that have occurred since the early 1900's and age in order to account for the fact that the older an individual is, the less likely he/she is to undertake further education. Visualisation and annotation of the GWAS results was conducted using WGAViewer [28] and the pruning for linkage desquilibrium (LD) (r2<0.8) of the top 200 SNPs was performed in SNAP [29] based on the HapMap release 22 CEU panel. Additionally, The VEGAS gene-based test [30] that can be used with related individuals was performed using the GWAS output data of our discovery sample. The test summarizes evidence for association on a per gene basis by considering the full set of SNPs within the gene (determined by SNPs lying within ±50 kb of a gene's 5′ and 3′ UTRs) and the LD between them.

Power analyses

It is expected that many genes of very small effect size contribute to the genetic variance of complex traits. We estimated the empirical power our discovery sample provides to detect genetic variants explaining 1%, 0.5% and 0.2% of the phenotypic variance by running association tests on simulated datasets in Merlin. The simulated datasets are similar to the original data in terms of marker informativeness, spacing, allele frequency, trait distribution, and missing data patterns, but they are simulated such that a selected SNP accounts for a specified proportion of the variance. The selected SNP had minor allele frequency 0.25. Association analysis was conducted on 1000 data sets generated by the simulation procedure. The empirical power is estimated as that proportion of the 1000 association analyses in which a genome-wide significant association (α = 5*10−8) was detected. Results indicated that our sample provides 100%, 80% (799 out of the 1000 simulations) and 13% (130 out of the 1000 simulations) power to detect SNPs that explain 1%, 0.5% and 0.2% of the variance in EA, respectively.

Prediction analyses

Prediction analyses were conducted in two stages. In stage 1, the effect sizes of the 2,480,163 SNPs from GWAS for EA, as well as 5 sub-panels of SNPs (based on p-value thresholds of p<0.5, p<0.4, p<0.3, p<0.2 and p<0.1), were extracted from the MERLIN [27] output of our discovery sample. Based on the effect sizes for these panels and the imputed SNP data of our adolescent twin families we generated 6 sets of prediction scores using the PLINK [31] scoring routine for cohort 3 (968 parents and only one of their twin children (n = 799) in order to have a sample of independent individuals). In stage 2, we compared the predicted scores of the parents to their EA by fitting EA as a function of the predicted scores in a linear model (EA∼predicted scores) using R (http://cran.r-project.org/) to evaluate to percentage of variance explained (R2) by the predicted score and its level of significance. Similarly, we fitted the FSIQ of one of the twin children as a function of their predicted scores in a linear model (FSIQ∼predicted scores).

Results

We examined educational attainment for over 9538 individuals born between 1900 and 1981 from 3764 families. A similar distribution of the mean educational attainment was observed between males and females (figure 2). We observed a gradual increase of the mean EA from 8–10 years of schooling to above 11 to 12 years of schooling for individuals born during the first half of the 20th century (Figure 2). The mean EA of individuals born between the late 40's up to the mid 70's stayed constant around 3.5 before increasing close to 4 (4 = teacher college or technical college) for individuals born after the early to mid 70's (figure 2).
Figure 2

Mean educational attainment (EA) by year of birth in male and female.

Using DNA previously collected for these individuals we performed a GWAS for EA. The heritability of EA was estimated 63.5% by MERLIN and is comparable to the 57% previously reported in our earlier variance components analysis of EA in cohort 1 [6]. Table 1 reports the results for the top 50 most associated SNPs after pruning for LD (r2<0.8), with the best-associated SNP rs2680324 having a p-value of 9.77×10−7 (the Q-Q plot is presented in Figure S1 and the manhattan plot in Figure S2). We then tested the association of these 50 SNPs with EA in an independent replication sample of 968 individuals. Five of these SNPs had an association p = value below 0.05 and an effect in the same direction: rs226039 (p = 0.040), rs976928 (p = 0.0036), rs4298928 (p = 0.004), rs7106258 (p = 0.00097) and rs226047 (p = 0.039). However, only rs7106258 survived correction for multiple testing (p = 0.05/50 = 0.001).
Table 1

Top 50 most associated SNPs in 9538 individuals after pruning for LD (r2<0.8).

rankSNPChr.Position in bpSNP typeClosest gene p-valueSNP in LD*
1rs26803241056970071INTRONICPCDH159.77E-072
2rs100794755179585240INTRONICRASGEF1C1.36E-063
3rs80856891821656634INTRONICTTC39C5.20E-062
4rs130923483116580097INTRONICLSAMP5.40E-0617
5rs15214173116761113INTRONICLSAMP5.55E-062
6rs110047781056974190INTRONICPCDH155.58E-069
7rs22464775172208454UPSTREAMY_RNA.4797.27E-061
8rs914925993584793INTRONICSYK8.16E-062
9rs2260391621288101INTRONICCRYM;8.98E-060
10rs9769281181479043INTERGENICRP11-179A16.19.60E-066
11rs719190916212628973PRIME UTRANKS4B1.02E-0515
12rs42989281181470414INTERGENICRP11-179A16.11.11E-050
13rs67676693116599500INTRONICLSAMP1.19E-050
14rs65811911258875698INTERGENICRP11-652N17.11.23E-050
15rs72819482132970928INTERGENICFBXW11P11.23E-052
16rs15176352224203115INTERGENICAC068035.11.32E-050
17rs80823131717976932INTERGENICC17orf391.40E-050
18rs24316645172203911WITHIN NON CODING GENERP11-779O18.31.50E-050
19rs171575097109881775INTERGENICAC003088.11.52E-056
20rs124976553116764010INTRONICLSAMP1.82E-051
21rs134022892100659389INTRONICAFF31.89E-056
22rs71062581181484019INTERGENICRP11-179A16.11.92E-0514
23rs100688465179588797INTRONICRASGEF1C2.01E-050
24rs50095242038894887INTERGENICRP1-191L6.22.17E-050
25rs2260471621301472INTRONICCRYM2.30E-053
26rs7446847154196001INTRONICDPP62.32E-050
27rs2260381621288338INTRONICCRYM2.37E-050
28rs66910531173868955DOWNSTREAMZBTB372.37E-050
29rs10116891097760476INTRONICCC2D2B2.38E-056
30rs487615382304334INTERGENICAC133633.22.38E-050
31rs76725214175302173UPSTREAMRP11-51M24.12.58E-050
32rs20292381821702106INTRONICAC090772.12.63E-052
33rs99039611710732872UPSTREAMPIRT2.75E-050
34rs11222081558623525INTERGENICLIPC2.82E-053
35rs1119470110111292456UPSTREAMBTF3P152.92E-050
36rs2297679132157009INTRONICCOL16A13.00E-051
37rs20445411141343185INTRONICLRRC4C3.36E-051
38rs179379111131111013WITHIN NON CODING GENEAP002856.73.61E-050
39rs60118992061356257INTRONICNTSR13.76E-050
40rs4729751185534476WITHIN NON CODING GENEGS1-204I12.23.83E-0512
41rs2569405114866191INTRONICFEM1C3.84E-050
42rs45147831861918306INTERGENICAC100848.13.85E-054
43rs118768201861891408INTERGENICAC100848.14.12E-050
44rs22759981166326581INTRONICCTD-3074O7.24.15E-050
45rs79361671140156059INTRONICLRRC4C4.40E-054
46rs6986402817630589INTRONICMTUS14.43E-051
47rs100573875172230003INTERGENICRP11-536N17.14.49E-050
48rs6938281625312488INTRONICLRRC16A4.54E-050
49rs22570559130311404INTRONICFAM129B4.57E-050
50rs658447510103806793INTRONICC10orf764.68E-051

*number of SNPs in LD (r2≥0.8) among the top 200 GWAS hits.

*number of SNPs in LD (r2≥0.8) among the top 200 GWAS hits. To completely utilise the GWAS results, we then performed a gene-based test that maps SNPs to their respective genes if they are located within 50 kb of a gene locus. The gene-based test produced an empirical p-value for 17549 autosomal genes. The top 50 most associated genes are reported in table 2 and the best result was obtained for FLJ41766 (p = 1.4×10−5), however, it does not reach the multiple-testing correction threshold (p = 0.05/17549 = 2.8*10−6).
Table 2

Top 50 most associated genes with the EA in 9538 individuals.

rankChr.GeneNumber of SNPsStart positionStop positionP-value
116FLJ417663521219670212374131.40E-05
216CRYM6421177342212219182.10E-05
316ANKS4B8221152516211712512.80E-05
410NPM3241035310711035331486.10E-05
510FGF8221035198761035258178.80E-05
610MGEA5311035341981035681650.000117
714LTB4R211023848000238507980.000154
810HPS6121038151361038177830.000155
914C14orf2111523838937238442140.000171
1014ADCY412223857409238737040.000175
1119ZNF1008321698682217422700.000232
1214RABGGTA7623804583238106430.000237
135DUSP11081721277061721308090.000249
1410CC2D2B8097749872977824310.000263
1514TGM18023788159238022560.000319
1610KCNIP2291035757201035936670.000538
171ZBTB37531721041151721223970.000553
181DARS2451720605801720943050.000617
1918CABYR7619972952199955620.000629
2010CCNJ8497793143978106120.000723
2117DYNLL24953515797535218100.000783
2211FLRT19963627937636432210.000844
2310C10orf76891035953451038059220.000847
247C7orf594499584465995897690.000872
2511BBS16466034694660576600.000953
2619ZNF998822730846227446240.000961
2711ACTN35666070966660873730.001017
287C7orf434199589978995942380.00102
2917CCDC434540110330401226910.001079
3011CTSF5066087510660926230.001108
3114NFATC410423907093239186500.001127
3211DPP35466004455660337060.00114
3311ZDHHC245766063310660702470.00117
3415RHCG8987815643878408030.001346
353APEH5249686438496959380.00135
367GAL3ST45299594800996043090.001364
371SERPINC1571721395641721530960.001451
387BET17993458935934716260.001466
393RHOA5149371582494245300.001497
4017SPHK15271892296718955360.001549
413MST14849696391497010990.001573
4218CD22619965681172657752120.00158
433RNF1235349701993497339660.00163
445C5orf395743074938430760980.00166
4511PELI33965990911660013840.001683
466LY6G6F11031782662317863510.00177
473BSN8749566925496839860.001818
487STAG35099613473996499460.00182
496BAT513131762714317790670.00186
506LY6G6C9731794403317974890.00188
Our Prediction analyses of EA in our replication sample using predicted scores for different thresholds of GWAS significance (from p<0.1 to the whole genome) showed a regression r2 ranging between 0.0011 to 0.0023 (non-significant: p≥0.14) between the predicted scores and EA in the independent sample of adults in all 6 analyses. Similarly, in the children of these individuals (n = 799) when we attempted to predict FSIQ as a function of the predicted scores across the different thresholds of GWAS significance the regression r2 were non-significant (p≥0.15) and varied between 9.0*10−5 to 0.0026.

Discussion

In the current study we observed that Educational attainment (EA) has increased constantly during the 20th century from a mean EA of 8–10 years of schooling to around 12 years of schooling followed by apprenticeship, teacher college or technical college. This trend was the same for males and females. However, it should be noted that only 2% of our sample was born before 1926 and 31% before 1951. Thus the data are too sparse to comment on the changes of mean EA in relation to time specific events such as World War II. The majority of our sample was born post World War II and the mean EA for these individuals confirms that pursuing a tertiary education was much more common for these generations than it was for the previous generations. The mean EA for individuals born in this period remained fairly constant and major events such as the Vietnam War (Australian conscription: 1965–1975) or free access to university education (1974–1988) do not appear to have affected the level of EA in our sample. Using SNP genotyping data for these individuals, we performed a large GWAS study for EA. The strength of our study was its large sample size that conferred 100% and 80% power to detect polymorphisms (MAF = 0.25) explaining 1% and 0.5% of the phenotypic variance respectively. However, despite a good power, no SNP had a p-value significant at the genome wide level of 5×10−8. Therefore, we tested the robustness of the top 50 most associated SNPs pruned for LD (r2<0.8) in a replication sample. Five of these 50 SNPs reached a p-value smaller than p = 0.05 and an effect in the same direction. Among these SNPs only one had a p-value surviving multiple correction, rs7106258 (p = 0.00097) located in an intergenic region of chromosome 11q14.1 (combined p-value was 3.71*10−7). This region deserves future attention as it has been suggested to be involved in other brain phenotypes. This region contains the GAB2 gene, a candidate gene for Alzheimer's disease [32] and copy number variations in this region have also been linked with mental retardation [33]. Overall our GWAS findings are comparable to the latest GWAS report for general cognitive abilities in that no SNPs survived multiple testing correction for genome wide significance [34]. The estimated heritability of EA in our sample was 63.2% and comparable to the 57% found in our behavior genetic analysis of EA [6]. However, we also know from our previous study that the heritability of EA could be as high as 82% in Australia once corrected for assortative mating [6]. So why can we not find SNPs with genome wide significant p-values for a highly heritable trait such as EA? Some elements of an answer can be drawn from what we have learned from other human complex traits such as IQ [34] and height [35] in which common variants of large effect size were not found. Common explanations for the missing phenotypic variance due to genetic variation might be (i) a large number of common variants of small effect sizes (<1%), (ii) rare variants of large effect sizes, (iii) structural variants or (iv) low power to detect epistasis or gene-environment interactions [36]. These questions are being addressed with the formation of a new consortium and meta-analytical efforts that assemble large samples to examine if a large number of common variants of small effect sizes contribute to the current lack of robust association signals. Some new elements of an answer to the above question are already available with Yang and colleagues [37] recently showing that 45% of variance for human height can be explained when all SNP effects from a panel of nearly 300 000 SNPs were considered simultaneously. On the other hand, the potential role of rare and structural variants in the etiology of EA and other complex traits may be answered by the next-generation of association studies [38]. An example of an innovative study design was recently described by Holm et al [39] and was successful to detect low frequency SNPs associated with sick sinus syndrome. Their multi step approach consisted of a classic GWAS followed by whole genome sequencing for a small number of cases and controls, before imputing the newly discovered variants into their original sample. Although the above example was applied to a case-control setting this could be adapted to quantitative traits by sequencing a small number of individuals at the extremes of the distribution. Another noticeable feature of our GWAS results is that none of the top 50 SNPs fall into exons. This is not surprising as more than 80% of associated variants detected by GWAS were found in non-coding regions [36] which further supports the importance of including these regions in gene mapping studies of complex traits [40]. Moving from a traditional single marker analysis, we performed the VEGAS gene-based test [30]. This is an attractive approach that gives a second life to GWAS data and it might ease the current frustration of the millions of dollars spent in large genotyping projects that cannot produce replicated associations. FLJ 41766 was the best hit, however, there is no evidence for this gene to suggest a role in neurobiology. A similar observation was found when looking at the genes among the top hits one at the time. Additionally, both the gene-based test and GWAS showed no evidence of association for BDNF (rank = 16132 out of 17549 genes tested by VEGAS), COMT (rank = 197), and DRD4 (rank = 17451), which have been previously associated with EA [16], [17], [18], [19]. However, these candidate gene studies were conducted on small samples that may have not produced genuine association signals. If true, these association signals previously reported should have been detected in our large sample (>9 500 individuals). One of the major goals of genetic epidemiologists is to identify genetic variants that can be used to predict complex traits and susceptibility risk to diseases. So far, this has been harder than originally thought due to the difficulty of mapping loci of small effect that reflect the highly polygenic architecture of common diseases and complex traits. One recent approach that has been used in this exercise is to generate a predicted score based on a genome wide profile of SNPs effects [41], [42]. Here, we generated an EA predicted score based on different level of p-values obtained in the GWAS of our large discovery sample to see whether or not we could predict EA in an independent sample. The EA variance explained by the six sets of predicted scores was low and non significant. When a similar prediction analysis was performed with FSIQ instead of EA, this percentage of variance was lower than with EA as expected and also non-significant. These low correlations between the predicted phenotypes scores might arise from a low accuracy of the genome wide SNP effects that were used to generate these scores. In the future, it will be most interesting to see the evolution of these results if these analyses were to be repeated once genome wide SNP effects from a larger sample or from a meta-analysis are available. In the present study we performed one of the largest genome wide association scans to examine the molecular genetics of educational attainment. Despite our large discovery sample and good genome coverage of the genome, no SNP reached genome wide evidence of association. As for many other complex traits (e.g. Human heights [43]), our results confirmed the high polygenic architecture of EA. Future large consortiums combined with sequencing efforts will hopefully bring more insights into the molecular architecture of EA and shed light on whether it is common variants of small effects or rare variants of large effect that contribute to the biological blue print of EA. Quantile-Quantile plot from the GWAS result of educational attainment in 9538 individuals. (Lambda = 1.0229). The grey shade area represents the 95% confidence intervals. (TIFF) Click here for additional data file. Manhattan plot of GWAS for educational attainment for 9538 individuals. X-axis represents the chromosomal location for each SNPs, and Y-axis the −log10 P-value for association with educational attainment. (TIF) Click here for additional data file. Descriptive statistics for the different cohorts from which the educational attainment data originated. (DOC) Click here for additional data file.
  33 in total

1.  Genetic covariance among measures of information processing speed, working memory, and IQ.

Authors:  M Luciano; M Wright; G A Smith; G M Geffen; L B Geffen; N G Martin
Journal:  Behav Genet       Date:  2001-11       Impact factor: 2.805

2.  Heritability of educational achievement in 12-year-olds and the overlap with cognitive ability.

Authors:  Meike Bartels; Marjolein J H Rietveld; G Caroline M Van Baal; Dorret I Boomsma
Journal:  Twin Res       Date:  2002-12

3.  Genetic and environmental sources of covariance between reading tests used in neuropsychological assessment and IQ subtests.

Authors:  Mark Wainwright; Margaret J Wright; Gina M Geffen; Laurie B Geffen; Michelle Luciano; Nicholas G Martin
Journal:  Behav Genet       Date:  2004-07       Impact factor: 2.805

4.  Genetic and environmental influences on academic achievement trajectories during adolescence.

Authors:  Wendy Johnson; Matt McGue; William G Iacono
Journal:  Dev Psychol       Date:  2006-05

5.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

6.  SNAP: a web-based tool for identification and annotation of proxy SNPs using HapMap.

Authors:  Andrew D Johnson; Robert E Handsaker; Sara L Pulit; Marcia M Nizzari; Christopher J O'Donnell; Paul I W de Bakker
Journal:  Bioinformatics       Date:  2008-10-30       Impact factor: 6.937

7.  Prediction of individual genetic risk to disease from genome-wide association studies.

Authors:  Naomi R Wray; Michael E Goddard; Peter M Visscher
Journal:  Genome Res       Date:  2007-09-04       Impact factor: 9.043

8.  Does education reduce the probability of being overweight?

Authors:  Dinand Webbink; Nicholas G Martin; Peter M Visscher
Journal:  J Health Econ       Date:  2009-11-26       Impact factor: 3.883

9.  Common SNPs explain a large proportion of the heritability for human height.

Authors:  Jian Yang; Beben Benyamin; Brian P McEvoy; Scott Gordon; Anjali K Henders; Dale R Nyholt; Pamela A Madden; Andrew C Heath; Nicholas G Martin; Grant W Montgomery; Michael E Goddard; Peter M Visscher
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

10.  BDNF variants, premorbid educational attainment, and disease characteristics in Alzheimer's disease: an exploratory study.

Authors:  Kristina F Zdanys; Timothy G Kleiman; Huiping Zhang; Fatih Ozbay; Martha G MacAvoy; Joel Gelernter; Christopher H van Dyck
Journal:  J Alzheimers Dis       Date:  2009       Impact factor: 4.472

View more
  8 in total

1.  Schooling and variation in the COMT gene: the devil is in the details.

Authors:  Daniel Campbell; Johanna Bick; Carolyn M Yrigollen; Maria Lee; Antony Joseph; Joseph T Chang; Elena L Grigorenko
Journal:  J Child Psychol Psychiatry       Date:  2013-08-19       Impact factor: 8.982

Review 2.  Biometric Modeling of Gene-Environment Interplay: The Intersection of Theory and Method and Applications for Social Inequality.

Authors:  Susan C South; Nayla R Hamdi; Robert F Krueger
Journal:  J Pers       Date:  2015-11-21

3.  The Promises and Pitfalls of Genoeconomics*

Authors:  Daniel J Benjamin; David Cesarini; Christopher F Chabris; Edward L Glaeser; David I Laibson; Vilmundur Guðnason; Tamara B Harris; Lenore J Launer; Shaun Purcell; Albert Vernon Smith; Magnus Johannesson; Patrik K E Magnusson; Jonathan P Beauchamp; Nicholas A Christakis; Craig S Atwood; Benjamin Hebert; Jeremy Freese; Robert M Hauser; Taissa S Hauser; Alexander Grankvist; Christina M Hultman; Paul Lichtenstein
Journal:  Annu Rev Econom       Date:  2012-06-18

4.  Educational attainment influences levels of homozygosity through migration and assortative mating.

Authors:  Abdel Abdellaoui; Jouke-Jan Hottenga; Gonneke Willemsen; Meike Bartels; Toos van Beijsterveldt; Erik A Ehli; Gareth E Davies; Andrew Brooks; Patrick F Sullivan; Brenda W J H Penninx; Eco J de Geus; Dorret I Boomsma
Journal:  PLoS One       Date:  2015-03-03       Impact factor: 3.240

5.  A Genome-Wide Association Study and Complex Network Identify Four Core Hub Genes in Bipolar Disorder.

Authors:  Zengyan Xie; Xianyan Yang; Xiaoya Deng; Mingyue Ma; Kunxian Shu
Journal:  Int J Mol Sci       Date:  2017-12-19       Impact factor: 5.923

6.  Cell type-specific epigenetic links to schizophrenia risk in the brain.

Authors:  Isabel Mendizabal; Stefano Berto; Noriyoshi Usui; Kazuya Toriumi; Paramita Chatterjee; Connor Douglas; Iksoo Huh; Hyeonsoo Jeong; Thomas Layman; Carol A Tamminga; Todd M Preuss; Genevieve Konopka; Soojin V Yi
Journal:  Genome Biol       Date:  2019-07-09       Impact factor: 13.583

7.  Independent evidence for an association between general cognitive ability and a genetic locus for educational attainment.

Authors:  Joey W Trampush; Todd Lencz; Emma Knowles; Gail Davies; Saurav Guha; Itsik Pe'er; David C Liewald; John M Starr; Srdjan Djurovic; Ingrid Melle; Kjetil Sundet; Andrea Christoforou; Ivar Reinvang; Semanti Mukherjee; Pamela DeRosse; Astri Lundervold; Vidar M Steen; Majnu John; Thomas Espeseth; Katri Räikkönen; Elisabeth Widen; Aarno Palotie; Johan G Eriksson; Ina Giegling; Bettina Konte; Masashi Ikeda; Panos Roussos; Stella Giakoumaki; Katherine E Burdick; Antony Payton; William Ollier; Mike Horan; Matthew Scult; Dwight Dickinson; Richard E Straub; Gary Donohoe; Derek Morris; Aiden Corvin; Michael Gill; Ahmad Hariri; Daniel R Weinberger; Neil Pendleton; Nakao Iwata; Ariel Darvasi; Panos Bitsios; Dan Rujescu; Jari Lahti; Stephanie Le Hellard; Matthew C Keller; Ole A Andreassen; Ian J Deary; David C Glahn; Anil K Malhotra
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2015-05-07       Impact factor: 3.568

8.  GWAS of 126,559 individuals identifies genetic variants associated with educational attainment.

Authors:  Cornelius A Rietveld; Sarah E Medland; Jaime Derringer; Jian Yang; Tõnu Esko; Nicolas W Martin; Harm-Jan Westra; Konstantin Shakhbazov; Abdel Abdellaoui; Arpana Agrawal; Eva Albrecht; Behrooz Z Alizadeh; Najaf Amin; John Barnard; Sebastian E Baumeister; Kelly S Benke; Lawrence F Bielak; Jeffrey A Boatman; Patricia A Boyle; Gail Davies; Christiaan de Leeuw; Niina Eklund; Daniel S Evans; Rudolf Ferhmann; Krista Fischer; Christian Gieger; Håkon K Gjessing; Sara Hägg; Jennifer R Harris; Caroline Hayward; Christina Holzapfel; Carla A Ibrahim-Verbaas; Erik Ingelsson; Bo Jacobsson; Peter K Joshi; Astanand Jugessur; Marika Kaakinen; Stavroula Kanoni; Juha Karjalainen; Ivana Kolcic; Kati Kristiansson; Zoltán Kutalik; Jari Lahti; Sang H Lee; Peng Lin; Penelope A Lind; Yongmei Liu; Kurt Lohman; Marisa Loitfelder; George McMahon; Pedro Marques Vidal; Osorio Meirelles; Lili Milani; Ronny Myhre; Marja-Liisa Nuotio; Christopher J Oldmeadow; Katja E Petrovic; Wouter J Peyrot; Ozren Polasek; Lydia Quaye; Eva Reinmaa; John P Rice; Thais S Rizzi; Helena Schmidt; Reinhold Schmidt; Albert V Smith; Jennifer A Smith; Toshiko Tanaka; Antonio Terracciano; Matthijs J H M van der Loos; Veronique Vitart; Henry Völzke; Jürgen Wellmann; Lei Yu; Wei Zhao; Jüri Allik; John R Attia; Stefania Bandinelli; François Bastardot; Jonathan Beauchamp; David A Bennett; Klaus Berger; Laura J Bierut; Dorret I Boomsma; Ute Bültmann; Harry Campbell; Christopher F Chabris; Lynn Cherkas; Mina K Chung; Francesco Cucca; Mariza de Andrade; Philip L De Jager; Jan-Emmanuel De Neve; Ian J Deary; George V Dedoussis; Panos Deloukas; Maria Dimitriou; Guðny Eiríksdóttir; Martin F Elderson; Johan G Eriksson; David M Evans; Jessica D Faul; Luigi Ferrucci; Melissa E Garcia; Henrik Grönberg; Vilmundur Guðnason; Per Hall; Juliette M Harris; Tamara B Harris; Nicholas D Hastie; Andrew C Heath; Dena G Hernandez; Wolfgang Hoffmann; Adriaan Hofman; Rolf Holle; Elizabeth G Holliday; Jouke-Jan Hottenga; William G Iacono; Thomas Illig; Marjo-Riitta Järvelin; Mika Kähönen; Jaakko Kaprio; Robert M Kirkpatrick; Matthew Kowgier; Antti Latvala; Lenore J Launer; Debbie A Lawlor; Terho Lehtimäki; Jingmei Li; Paul Lichtenstein; Peter Lichtner; David C Liewald; Pamela A Madden; Patrik K E Magnusson; Tomi E Mäkinen; Marco Masala; Matt McGue; Andres Metspalu; Andreas Mielck; Michael B Miller; Grant W Montgomery; Sutapa Mukherjee; Dale R Nyholt; Ben A Oostra; Lyle J Palmer; Aarno Palotie; Brenda W J H Penninx; Markus Perola; Patricia A Peyser; Martin Preisig; Katri Räikkönen; Olli T Raitakari; Anu Realo; Susan M Ring; Samuli Ripatti; Fernando Rivadeneira; Igor Rudan; Aldo Rustichini; Veikko Salomaa; Antti-Pekka Sarin; David Schlessinger; Rodney J Scott; Harold Snieder; Beate St Pourcain; John M Starr; Jae Hoon Sul; Ida Surakka; Rauli Svento; Alexander Teumer; Henning Tiemeier; Frank J A van Rooij; David R Van Wagoner; Erkki Vartiainen; Jorma Viikari; Peter Vollenweider; Judith M Vonk; Gérard Waeber; David R Weir; H-Erich Wichmann; Elisabeth Widen; Gonneke Willemsen; James F Wilson; Alan F Wright; Dalton Conley; George Davey-Smith; Lude Franke; Patrick J F Groenen; Albert Hofman; Magnus Johannesson; Sharon L R Kardia; Robert F Krueger; David Laibson; Nicholas G Martin; Michelle N Meyer; Danielle Posthuma; A Roy Thurik; Nicholas J Timpson; André G Uitterlinden; Cornelia M van Duijn; Peter M Visscher; Daniel J Benjamin; David Cesarini; Philipp D Koellinger
Journal:  Science       Date:  2013-05-30       Impact factor: 47.728

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.