| Literature DB >> 19043567 |
Albert O Edwards1, Brooke L Fridley, Katherine M James, Anil K Sharma, Anil S Sharma, Julie M Cunningham, Nirubol Tosakulwong.
Abstract
Genome-wide association studies (GWASs) assess correlation between traits and DNA sequence variation using large numbers of genetic variants such as single nucleotide polymorphisms (SNPs) distributed across the genome. A GWAS produces many trait-SNP associations with low p-values, but few are replicated in subsequent studies. We sought to determine if characteristics of the genomic loci associated with a trait could be used to identify initial associations with a higher chance of replication in a second cohort. Data from the age-related eye disease study (AREDS) of 100,000 SNPs on 395 subjects with and 198 without age-related macular degeneration (AMD) were employed. Loci highly associated with AMD were characterized based on the distribution of genotypes, level of significance, and clustering of adjacent SNPs also associated with AMD suggesting linkage disequilibrium or multiple effects. Forty nine loci were highly associated with AMD, including 3 loci (CFH, C2/BF, LOC387715/HTRA1) already known to contain important genetic risks for AMD. One additional locus (C3) reported during the course of this study was identified and replicated in an additional study group. Tag-SNPs and haplotypes for each locus were evaluated for association with AMD in additional cohorts to account for population differences between discovery and replication subjects, but no additional clearly significant associations were identified. Relying on a significant genotype tests using a log-additive model would have excluded 57% of the non-replicated and none of the replicated loci, while use of other SNP features and clustering might have missed true associations.Entities:
Mesh:
Year: 2008 PMID: 19043567 PMCID: PMC2583911 DOI: 10.1371/journal.pone.0003813
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Number of loci with at least one SNP significantly associated with AMD in the 100,000 SNP genome-wide scan of the AREDS cohort*.
| Group | Hardy-Weinberg equilibrium (HWE) in controls (P≥0.01) | Minor allele frequency>1% in cases or controls | One or more adjacent SNPs associated with AMD | Number of loci | Total SNPs with P<10−4 |
| Confirmed | Yes | Both | Yes | 20 | 30 |
| Confirmed-Rare | Yes | Either | Yes | 6 | 8 |
| Valid | Yes | Both | No | 14 | 14 |
| Valid-Rare | Yes | Either | No | 6 | 6 |
|
|
|
| |||
| Loci already associated with AMD | Yes | Either | Yes | 3 | 25 |
| Artifact - Possible | No | NA | No | 20 | 23 |
| Artifact - Probable | No | Either or Both | Yes | 30 | 65 |
|
|
|
|
Significant association was defined as P<10−4.
An adjacent SNP associated with AMD refers to the nearest centromeric and telomeric SNP within 50 kb or less genotyped on the Illumina 100,000 SNP genome-wide scan with a p-value≤0.01 for association with AMD. SNPs meeting this criteria are referred to as “confirmed”.
SNPs associated with AMD were arbitrarily defined as being in separate loci if they were located at least 500 kb from each other.
One of these 58 SNPs (rs7497988, now called rs3985626) could not be genotyped on the Illumina platform and was not studied. The already known and artifact categories were not genotyped, leaving 57 AREDS SNPs in 46 loci.
The five AREDS SNPs out of 57 genotyped that were most associated with AMD in the replication study with Mayo subjects.*
| Locus Description | Gene | AREDS subjects (N = 593) | Mayo subjects (N = 744) | |||||||
| Chromosome | SNP | Function | Minor allele frequency (AREDS controls) | 2×2 Allelic test p-value | Fisher Genotypic test p-value | HWE p-value (controls) | Log-Additive model Genotypic test p-value | Fisher Genotypic test p-value | HWE p-value (controls) | |
| 7 | rs2341823 | intron | 0.138 | PLXNA4B | 2.0E-05 | 4.8E-05 | 0.013 | 0.04 | 0.12 | 0.883 |
| 9 | rs7867504 | synonymous | 0.382 | SLC28A3 | 8.9E-05 | 1.7E-04 | 0.091 | 0.08 | 0.22 | 0.810 |
| 16 | rs8056814 | 5′ near gene | 0.133 | CTRB2, CTRB1, BCAR1 | 4.8E-06 | 2.0E-05 | 0.133 | 0.02 | 0.05 | 0.827 |
| 11 | rs174602 | intronic | 0.292 | FADS2, FADS3 | 3.9E-05 | 3.6E-04 | 0.369 | 0.03 | 0.1 | 0.516 |
| 19 | rs2230199 | Non-synonymous | 0.175 | C3 | 2.8E-05 | 7.0E-05 | 0.331 | 3.7E-05 | 1.10E-04 | 0.097 |
The nsSNP in C3 was considered replicated.
Genotype counts are provided in Table S3.
Attempted replication of AREDS loci using 225 Tag-SNPs and 18 nsSNPs genotyped on the Mayo subjects.
| Chromosome | AREDS locus | Function | No. SNPs studied in locus | No. SNPs p<0.01 | SNP | Fisher Genotypic test p-value | Log Additive model Genotypic test p-value | Global haplotype simulated p-value |
| 1 | LOC127602 | flanking 3′ UTR | 11 | 1 | rs1871570 | 0.009 | 0.009 | 0.079 |
| 7 | NOD1 | UTR | 9 | 1 | rs2906766 | 0.004 | 0.009 | 0.42 |
| 7 | PLXNA4B | intron | 15 | 1 | rs11773117 | 0.002 | 0.017 | 0.34 |
| 17 | METT10D | 5′ UTR | 17 | 1 | rs4790335 | 0.003 | 0.64 | 0.53 |
The SNPs (all Tag-SNPs) from the 4 loci that showed possible association with AMD in the Mayo subjects are listed in this table.*
Genotype distributions are provided in Table S3.
Global haplotype refers to the 3-SNP haplotype score across the entire gene using all SNPs.
Results of genotyping 6 SNPs from the 3 loci in Table 3 with possible association with AMD in the Mayo subjects on 1,598 (1,280 cases and 318 controls) AREDS subjects.*
| Chromosome | Locus | SNP | Function | Mayo Samples | AREDS Samples (Replication subjects) | ||
| Fisher Genotypic test p-value | Log Additive model Genotypic test p-value | Fisher Genotypic test p-value | Log Additive Model Genotypic test p-value | ||||
| 1 | LOC127602 | rs1871570 | intron | 0.009 | 0.009 | 0.61 | 0.49 |
| 1 | LOC127602 | rs12038394 | intron | 0.07 | 0.02 | 0.93 | 0.7 |
| 7 | NOD1 | rs2906766 | 5′ untranslated region | 0.004 | 0.009 | 0.38 | 0.23 |
| 7 | PLXNA4B | rs1499300 | intron | 0.02 | 0.01 | 0.16 | 0.77 |
| 7 | PLXNA4B | rs2341823 | intron | 0.12 | 0.04 | 0.005 | 0.01 |
| 7 | PLXNA4B | rs11773117 | intron | 0.002 | 0.01 | 0.34 | 0.52 |
Genotype distributions are provided in Table S4. Note that METT10D (Table 3) was not genotyped because the log-additive model did not support true association.
SNP and locus variables for the 56 successfully genotyped loci that were studied for replication of the association with AMD.*
| Replication Status | Average | ||||||||||||
| Minor Allele Frequency | HWE p-value | SNP Call Rate | dbGaP allelic p-value | General model Fisher p-value | Additive model p-value | Minor allele frequency in controls for adjacent upstream SNP | HWE p-value for adjacent upstream SNP | dbGaP alleleic p-value for adjacent upstm SNP | Minor allele frequency in controls for adjacent downstream SNP | HWE p-value for adjacent downstream SNP | dbGaP p-value for adjacent downstream SNP | Number of significant SNPs within 20kb of AREDS SNP | |
| Not-replicated | 0.163 | 0.326 | 0.998 | 3.782E-05 | 1.500E-04 | 0.158 | 0.256 | 0.542 | 0.295 | 0.263 | 0.520 | 0.243 | 0.571 |
| Replicated | 0.278 | 0.257 | 0.995 | 7.718E-06 | 2.120E-05 | 1.003E-05 | 0.302 | 0.479 | 0.339 | 0.278 | 0.288 | 0.135 | 2.500 |
The individual data points from each locus are presented in Table S1.