| Literature DB >> 19061490 |
Rong Chen1, Alex A Morgan, Joel Dudley, Tarangini Deshpande, Li Li, Keiichi Kodama, Annie P Chiang, Atul J Butte.
Abstract
BACKGROUND: Candidate single nucleotide polymorphisms (SNPs) from genome-wide association studies (GWASs) were often selected for validation based on their functional annotation, which was inadequate and biased. We propose to use the more than 200,000 microarray studies in the Gene Expression Omnibus to systematically prioritize candidate SNPs from GWASs.Entities:
Mesh:
Year: 2008 PMID: 19061490 PMCID: PMC2646274 DOI: 10.1186/gb-2008-9-12-r170
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Use of differentially and constantly expressed genes to rediscover disease genes. The DER was calculated as the count of GEO datasets in which a gene was differentially expressed divided by the count of GEO datasets in which it was measured. For any cutoff x, differentially expressed genes were defined as genes with DER > x, whereas constantly expressed genes were defined as genes with DER
Figure 2Performance of rediscovering disease genes by DER. Genes with DER ≥ 0.55 were predicted to be disease genes, and compared with genes with disease-associated DNA variants listed in GAD and HGMD. P values were calculated using Fisher's exact test. DER, differential expression ratio; GAD, Genetic Association Database; GEO, Gene Expression Omnibus; HGMD, Human Gene Mutation Database.
DER values for T1DM and false positive genes in the top 7 WTCCC T1DM loci
| Loci | Genea | Associated?b | DER | Correct?c |
| 4q27 | No | 0.54 | True negative | |
| 4q27 | No | 0.48 | True negative | |
| 4q27 | No | 0.46 | True negative | |
| 6p21 | Yes | 0.68 | True positive | |
| 6p21 | Yes | 0.61 | True positive | |
| 6p21 | Yes | 0.59 | True positive | |
| 6p21 | Yes | 0.59 | True positive | |
| 6p21 | Yes | 0.54 | False negative | |
| 6p21 | Yes | 0.58 | True positive | |
| 6p21 | Yes | 0.59 | True positive | |
| 6p21 | No | 0.5 | True negative | |
| 6p21 | No | 0.53 | True negative | |
| 6p21 | No | 0.35 | True negative | |
| 6p21 | No | 0.48 | True negative | |
| 6p21 | No | 0.62 | False positive | |
| 12p13 | No | 0.51 | True negative | |
| 12q13 | Yes | 0.63 | True positive | |
| 12q24 | Yes | 0.52 | False negative | |
| 12q24 | Yes | 0.58 | True positive | |
| 16p13 | Yes | 0.5 | False negative | |
| 18p11 | Yes | 0.64 | True positive | |
aThe positive candidate genes from WTCCC GWAS with reported validation results. bValidated to be associated or unassociated with T2DM in the high-quality genotyping. cThe predicted result using DER ≥ 0.55. DER, differential expression ratio; GWAS, genome-wide association study; T1DM, type 1 diabetes mellitus; WTCCC, Wellcome Trust Case Control Consortium.
Figure 3Distinguishing T1DM genes from false positives in the top seven loci from GWASs using DER. Genes in the top seven loci from the WTCCC T1DM GWASs are reported with validation results. False-positive genes were shown as positive in the initial scan but found to be unassociated with T1DM in the follow-up validation studies. T1DM genes had significantly higher DER values than did false positive genes (P = 0.003). The mean DER values for T1DM and false-positive genes were 0.59 and 0.50, respectively. DER, differential expression ratio; GWAS, genome-wide association study; T1DM, type 1 diabetes mellitus; WTCCC, Wellcome Trust Case Control Consortium.
DER values for T2DM and false positive genes from GWAS
| Locus or SNP | Gene | Associated in populations | DER | Correct?a |
| 2q37.3 | Finish. [ | 0.57 | True positive | |
| 3p25 | Caucasian. [ | 0.53 | False negative | |
| 3q27.2 | Asian. [ | 0.54 | False negative | |
| 6p22.3 | Asian. [ | 0.55 | True positive | |
| 8q24.11 | Asian. [ | 0.42 | False negative | |
| 9p21 | Asian. [ | 0.59 | True positive | |
| 9p21 | Asian. [ | 0.49 | False negative | |
| 10q23 | Asian. [ | 0.58 | True positive | |
| 10q23 | Caucasian. [ | 0.61 | True positive | |
| 10q24 | Caucasian, Chinese [ | 0.54 | False negative | |
| 10q25.3 | African. [ | 0.64 | True positive | |
| 11p15.1 | Arab. [ | 0.39 | False negative | |
| 11p15.5 | Singaporean. [ | 0.6 | True positive | |
| 16q12.2 | Asian. [ | 0.55 | True positive | |
| 20q12 | Amish. [ | 0.63 | True positive | |
| rs11078674 | No | 0.53 | True negative | |
| rs2866016 | No | 0.53 | True negative | |
| rs12629276 | No | 0.54 | True negative | |
| rs8101509 | No | 0.4 | True negative | |
| rs945384 | No | 0.53 | True negative | |
| rs2050831 | No | 0.63 | False positive | |
| rs6670163 | No | 0.55 | False positive | |
| rs859101 | No | 0.5 | True negative | |
| rs11084127 | No | 0.46 | True negative | |
| rs7950175 | No | 0.48 | True negative | |
| rs13064991 | No | 0.45 | True negative | |
| rs6541240 | No | 0.51 | True negative | |
| rs2278419 | No | 0.45 | True negative | |
aThe predicted result using DER ≥ 0.55. DER, differential expression ratio; GWAS, genome-wide association study; T2DM, type 2 diabetes mellitus.
Figure 4Interpreting T1DM GWAS findings at 4q27 using fitSNPs. The region 4q27 has been identified as a risk factor area for T1DM, celiac disease, and rheumatoid arthritis. IL2, IL21, and TENR were selected based on prior knowledge for sequencing in the follow-up studies, but no association was found. KIAA1109 has a much higher fitSNPs DER value than all other genes in the region, and is flanked by two significant T1DM GWAS SNPs (-log10P >5). We predicted that this gene may explain the T1DM association in this region. The GWAS -log10P curve for KIAA1109 is missing because it was not listed in the Affymetrix 500 K SNP array used for the GWAS. DER, differential expression ratio; fitSNPs, functionally interpolating single nucleotide polymorphisms; GWAS, genome-wide association study; SNP, single nucleotide polymorphism; T1DM, type 1 diabetes mellitus.
DER value comparisons among Mendelian, complex, cancer, all disease genes and nondisease genes
| Mendelian (mean = 0.53, n = 931) | Complex (mean = 0.54, n = 70) | Cancer (mean = 0.56, n = 324) | All diseases (mean = 0.53, n = 3,178) | Nondisease (mean = 0.50, n = 16,698) | |
| Mendelian | 0.2 | <0.0001 | 0.4 | <0.0001 | |
| Complex | 0.001 | 0.3 | <0.0001 | ||
| Cancer | <0.0001 | <0.0001 | |||
| All diseases | <0.0001 | ||||
| Nondisease | |||||
*P values were calculated using t-test. DER, differential expression ratio.
Figure 5Prediction that OBSL1 is associated with systemic lupus erythematosus with nephritis through 2q34-q35. Systemic lupus erythemetosus with nephritis (SLEN2; OMIM %607966) was identified to be associated with 2q34-q35 but without identification of specific genes. OBSL1 has a much higher DER value (0.71) than those of all other genes from 2q34-q35. It was also found to be differentially expressed in juvenile idiopathic arthritis, kidney cancer, and kidney transplant rejection. Therefore, we suggest that it should be sequenced for its potential association with SLEN2.