| Literature DB >> 22563384 |
Joseph E Powell1, Anjali K Henders, Allan F McRae, Anthony Caracella, Sara Smith, Margaret J Wright, John B Whitfield, Emmanouil T Dermitzakis, Nicholas G Martin, Peter M Visscher, Grant W Montgomery.
Abstract
There is growing evidence that genetic risk factors for common disease are caused by hereditary changes of gene regulation acting in complex pathways. Clearly understanding the molecular genetic relationships between genetic control of gene expression and its effect on complex diseases is essential. Here we describe the Brisbane Systems Genetics Study (BSGS), a family-based study that will be used to elucidate the genetic factors affecting gene expression and the role of gene regulation in mediating endophenotypes and complex diseases.BSGS comprises of a total of 962 individuals from 314 families, for which we have high-density genotype, gene expression and phenotypic data. Families consist of combinations of both monozygotic and dizygotic twin pairs, their siblings, and, for 72 families, both parents. A significant advantage of the inclusion of parents is improved power to disentangle environmental, additive genetic and non-additive genetic effects of gene expression and measured phenotypes. Furthermore, it allows for the estimation of parent-of-origin effects, something that has not previously been systematically investigated in human genetical genomics studies. Measured phenotypes available within the BSGS include blood phenotypes and biochemical traits measured from components of the tissue sample in which transcription levels are determined, providing an ideal test case for systems genetics approaches.We report results from an expression quantitative trait loci (eQTL) analysis using 862 individuals from BSGS to test for associations between expression levels of 17,926 probes and 528,509 SNP genotypes. At a study wide significance level approximately 15,000 associations were observed between expression levels and SNP genotypes. These associations corresponded to a total of 2,081 expression quantitative trait loci (eQTL) involving 1,503 probes. The majority of identified eQTL (87%) were located within cis-regions.Entities:
Mesh:
Year: 2012 PMID: 22563384 PMCID: PMC3338511 DOI: 10.1371/journal.pone.0035430
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Summary of BSGS study design.
The structure of the study design allows us to investigate fundamental questions about the genetic basis of gene expression and their correlation with phenotypes that are known risk factors for disease.
Figure 2Samples collected in BSGS comprise of a number of different families.
Family structure h represents the 50 MZ pairs comprising the stage I study. The remaining family structures are from stage II. The numbers of each family structure are given below the pedigree diagram. By utilising expression information contained between and within twin pairs, siblings and between progeny and parents we are able to estimate genetic and non-genetic variance components using linear mixed models.
Relationship pairing between 962 individuals in BSGS.
| Relationship pairs | Code | N | Notes |
| Monozygotic twins | MZ | 128 | 68 female pairs; 60 male pairs; 50 MZ pairs form stage I, where we have expression data from both WB and LCL RNA sources |
| Dizygotic twins | DZ | 206 | 51 female pairs; 53 male pairs; 102 mixed sex pairs |
| Siblings | SIB | 343 | 81 female pairs; 82 male pairs; 180 mixed sex pairs |
| Parent – Offspring | PO | 425 | 98 father – daughter pairs; 103 father – son pairs; 113 mother – daughter pairs; 111 mother – son pairs |
| Parent – Parent | PP | 71 |
Central phenotypes in BSGS and a brief summary of previous studies identifying genetic parameters and association signals.
| Phenotypes | Summary | Reference |
| Hemoglobin concentration | Hemoglobin phenotypes have been associated with SNPs in |
|
| Red blood cell count | Association with SNPs close to |
|
| Platelet count | Platelet count has suggestive association with SNPs in |
|
| White blood cell count | White blood cell count has suggestive association with SNPs in |
|
| Monocytes | Monocyte count is associated with SNPs close to |
|
| Eosinophils | Eosinophil count has suggestive association with SNPs in |
|
| CD4+/CD8+ T-cell ratio | Collectively, these phenotypes are associated with SNPs in the MHC and the Schlafen family of genes. They are also endophenotypes for Type 1 Diabetes, HIV-1 immune control and autoimmune diseases. |
|
| Plasma Cholesterol (HDL and LDL) and Triglyceride concentrations | A known endophenotypes for cardiovascular disease. Data comprising part of BSGS have shown strong associations between Cholesterol and genes on chromosome 19 and between Triglyceride and genes on chromosome 7. |
|
| Blood pressure | An important endophenotype for hypertension. Data comprising part of BSGS have shown strong associations between blood pressure and genes on chromosomes 4,5,14 and 17. |
|
| Iron, Ferritin and Transferrin levels | Collectively, these phenotypes show association with SNPs in |
|
Figure 3Distribution of the R observed for the best eSNP from the 1,885 eQTLs.
Figure 4The distribution of cis-eSNPs distance from the Transcription Start Site (TSS).
The distances of eSNPs from the TSS were divided into 50KB bins across the cis-region.
Top cis-eQTL results.
| Probe ID | Gene | Probe Chromosome | TSS bp | Top SNP | Top SNP location | -log10 |
|
| ILMN_1715169 | HLA-DRB1 | 6 | 32654825 | rs9271170 | 6 - 32577889 | 131.5 | 73.5 |
| ILMN_1743145 | ERAP2 | 5 | 96274820 | rs10051637 | 5 - 96279490 | 131.0 | 81.6 |
| ILMN_1798177 | CHURC1 | 14 | 64471249 | rs7143432 | 14 - 65379146 | 130.4 | 83.0 |
| ILMN_2209027 | RPS26 | 12 | 54722494 | rs10876864 | 12 - 56401085 | 122.0 | 74.6 |
| ILMN_2403228 | CLEC12A | 12 | 10029119 | rs7313235 | 12 - 10132283 | 121.3 | 75.8 |
| ILMN_2352023 | RIPK5 | 1 | 203378372 | rs12139373 | 1 - 205054879 | 114.4 | 70.2 |
| ILMN_2312606 | IRF5 | 7 | 127973722 | rs6965542 | 7 - 128655918 | 112.6 | 74.8 |
| ILMN_1791511 | TMEM176A | 7 | 150133038 | rs7806458 | 7 - 150476888 | 107.5 | 66.4 |
| ILMN_2038775 | TUBB2A | 6 | 3154070 | Rs9392465 | 6 - 3162378 | 107.2 | 66.7 |
| ILMN_3298167 | ZSWIM7 | 17 | 15879944 | rs1045599 | 17 - 15879910 | 105.5 | 65.9 |
| ILMN_1661266 | HLA-DQB1 | 6 | 32736001 | rs9273349 | 6 - 32625869 | 102.0 | 61.2 |
| ILMN_2313901 | PAM | 5 | 102340879 | rs28092 | 5 - 102149795 | 99.7 | 66.2 |
The chromosome and base pair position of the probe transcription start site (TSS) are given for each probe. R is the proportion of transcript level variance explained by the SNP with the strongest association.
Figure 5Positions of cis (A) and trans (defined as greater than 2MB from the transcription start site) (B) eSNP across the genome.
The number of eSNP within 1MB bins is shown. A single eSNP represents a unique eQTL.
Number of associations and eQTL at various levels of significance threshold.
| Significance threshold | Total associations | Expected number of associations | Total significant SNPs | Probes with 1+ significant SNP | eQTL | Cis-eQTL | Trans-eQTL |
| 10e−8 | 31,032 | 951 | 20,068 | 3,969 | 5,679 | 2,673 | 3,006 |
| 10e−10 | 20,049 | 9.5 | 13,319 | 1,933 | 2,512 | 1,953 | 559 |
|
|
|
|
|
|
|
|
|
| 10e−15 | 10,519 | 0 | 7,468 | 1,126 | 1,507 | 1,256 | 251 |
| 10e−25 | 4,362 | 0 | 3,382 | 572 | 731 | 639 | 92 |
| 10e−50 | 896 | 0 | 764 | 185 | 199 | 172 | 27 |
| >10e−100 | 60 | 0 | 55 | 27 | 27 | 22 | 5 |
The study-wide significance threshold employed for our eQTL analysis is highlighted. The expected number of associations is the number of association that are expected to be observed under the null hypothesis of no associations between probe expression levels and SNPs.