| Literature DB >> 22022419 |
Theru A Sivakumaran1, Robert P Igo, Jeffrey M Kidd, Andy Itsara, Laura J Kopplin, Wei Chen, Stephanie A Hagstrom, Neal S Peachey, Peter J Francis, Michael L Klein, Emily Y Chew, Vedam L Ramprasad, Wan-Ting Tay, Paul Mitchell, Mark Seielstad, Dwight E Stambolian, Albert O Edwards, Kristine E Lee, Dmitry V Leontiev, Gyungah Jun, Yang Wang, Liping Tian, Feiyou Qiu, Alice K Henning, Thomas LaFramboise, Parveen Sen, Manoharan Aarthi, Ronnie George, Rajiv Raman, Manmath Kumar Das, Lingam Vijaya, Govindasamy Kumaramanickavel, Tien Y Wong, Anand Swaroop, Goncalo R Abecasis, Ronald Klein, Barbara E K Klein, Deborah A Nickerson, Evan E Eichler, Sudha K Iyengar.
Abstract
Complement factor H shows very strong association with Age-related Macular Degeneration (AMD), and recent data suggest that multiple causal variants are associated with disease. To refine the location of the disease associated variants, we characterized in detail the structural variation at CFH and its paralogs, including two copy number polymorphisms (CNP), CNP147 and CNP148, and several rare deletions and duplications. Examination of 34 AMD-enriched extended families (N = 293) and AMD cases (White N = 4210 Indian = 134; Malay = 140) and controls (White N = 3229; Indian = 117; Malay = 2390) demonstrated that deletion CNP148 was protective against AMD, independent of SNPs at CFH. Regression analysis of seven common haplotypes showed three haplotypes, H1, H6 and H7, as conferring risk for AMD development. Being the most common haplotype H1 confers the greatest risk by increasing the odds of AMD by 2.75-fold (95% CI = [2.51, 3.01]; p = 8.31×10(-109)); Caucasian (H6) and Indian-specific (H7) recombinant haplotypes increase the odds of AMD by 1.85-fold (p = 3.52×10(-9)) and by 15.57-fold (P = 0.007), respectively. We identified a 32-kb region downstream of Y402H (rs1061170), shared by all three risk haplotypes, suggesting that this region may be critical for AMD development. Further analysis showed that two SNPs within the 32 kb block, rs1329428 and rs203687, optimally explain disease association. rs1329428 resides in 20 kb unique sequence block, but rs203687 resides in a 12 kb block that is 89% similar to a noncoding region contained in ΔCNP148. We conclude that causal variation in this region potentially encompasses both regulatory effects at single markers and copy number.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22022419 PMCID: PMC3192039 DOI: 10.1371/journal.pone.0025598
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Detection of CNPs in the reference samples.
(1a) Schematic diagram showing the location of the genes and segmental duplications in the RCA gene cluster. Distal areas that are highly similar to one another are represented using the same color. (1b) Heat map showing normalized probe intensities in 90 HapMap Yoruban (YRI) samples. Yellow indicates the high probe intensity and red indicates low intensity; clustering of low intensity (red) occurs in regions with CNP 147 and CNP 148. Black vertical lines denote the boundaries of the region that incorporates CNP147 and CNP148. (1c) and (1d) Genotypes of CNP147 and CNP178, respectively, across different reference groups. y axis is probe intensity and x axis is reference groups (see Table S3 for CNP genotype frequencies); AA: Coriell African American, CAU: Coriell Caucasian, CHI: Coriell Chinese, MEX: Coriell Mexicans, ASN: HapMap Asian, CEU: HapMap Caucasian, YRI: HapMap Yorubans.
Figure 2Validation of CNPs in the HapMap samples.
(2a) UCSC browser view (http://humanparalogy.gs.washington.edu/) showing organization and structural variation in the RCA gene cluster. Red bars indicate the sites of structural variation in eight individuals underwent fosmid pair-end sequencing. Deletions at CNP147 and CNP148 were using high-resolution tiling-path custom array-based CGH. Probes with log2 ratios below or above a threshold of 1.5 s.d. from the normalized mean log2 ratio are colored red (deletions) or green (duplication), respectively. (2b) and (2c) Alignment of sequenced fosmid inserts against the human genome assembly (build36) confirms the extent of two deletions in the CFHR cluster on chromosome 1. Clone AC213924, derived from sample NA18507, corresponds to deletion CNP147 (2b). This variant removes 86.3 kbp of sequence (chr1:194,988,828–195,075,129, build36), resulting in loss of CFHR3 and CFHR1 genes. Clone AC210432, derived from sample NA18956, corresponds to deletion CNP148 (2c). This variant removes 122.0 kbp of sequence (chr1:195,049,336–195,171,294), resulting in loss of CFHR1 and CFHR4 genes. Red boxes: RefSeq transcripts, with orientation indicated by arrows. Grey boxes: regions of segmental duplication as predicted by SegDup Masker. Common repeats were identified using Repeat Masker. Purple: SINEs, Green: LINEs, Pink: DNA elements, Light Grey: Low Complexity sequence, Black: Simple Repeats.
Figure 3Haplotypes and linkage disequilibrium (LD) in the region surrounding CNP147.
(a) Distribution of r2 and D′ values for SNPs with CNP147 in three HapMap populations. The squares represent the D′ values while the triangles indicate the r2 values. Different shades of squares and triangles indicate weak (light shade) to strong LD (dark shade). ΔCNP147 shows perfect LD with nearby SNPs in non-Africans. The arrows indicate the SNPs used as a proxy to predict ΔCNP147 in our AMD custom Illumina panel. (b) Haplotypes in the CFH region observed in clinical cohorts. Haplotypes were constructed from genotypes of 70 SNPs located in the CFH region using PHASE2.1.1. Location and alleles of key selected SNPs are given here for expediency. The risk-associated allele at each locus is shown in red and the protective alleles at the corresponding locus are in blue. The bottom panel shows the location of CNP147 and CNP148. ΔCNP147 is indicated by a thin black line in haplotypes H3 and H4. Except H3 and H4, all haplotypes carry normal constitution at CNP147. The critical region, wherein the putative AMD susceptibility variant lies, is enclosed in black box for clarity and the two best SNPs rs203687 and rs1329428 are highlighted in pink. H1 and H6 are risk bearing, and H2–H4 are protective in nature. The role of H5, which carries risk alleles at several loci in block2 in causing disease is not clear. The Indian cohort has a rare risk recombinant haplotype H7, with non-risk allele T at rs1061170 present in 4% of the chromosomes. The most common risk haplotype among the Indians was equivalent to H1 as seen in Caucasians.
Association of CFH haplotypes in the FARMS cohort.
| Haplotype Association | |||
| Haplotype | Frequency | Effect | LRT p-value |
| H1 | 0.5189 | 0.657 | 9.46×10−05 |
| H2 | 0.1065 | Base | - |
| H3 | 0.067 | −0.279 | 0.272 |
| H4 | 0.0206 | −0.293 | 0.485 |
| H5 | 0.1615 | −0.008 | 0.714 |
| H6 | 0.0653 | 0.467 | 0.082 |
Association of CFH haplotypes with AMD in different clinical cohorts.
| Haplotype Frequency | Haplotype Association | ||||||
| Cohort | Haplotype | Cases | Controls | Total | OR | 95% CI | p-value |
| Custom Illumina Panel | H1 | 0.5152 | 0.3263 | 0.4210 | 2.25 | 1.96, 2.58 | 1.35×10−30 |
| H2 | 0.1061 | 0.1916 | 0.1488 | Base | - | - | |
| H3 | 0.0611 | 0.1205 | 0.0908 | 0.70 | 0.56, 0.87 | 1.15×10−03 | |
| H4 | 0.0285 | 0.0624 | 0.0454 | 0.61 | 0.46, 0.82 | 8.49×10−04 | |
| H5 | 0.1459 | 0.1416 | 0.1438 | 1.46 | 1.21, 1.75 | 6.93×10−05 | |
| H6 | 0.0379 | 0.0322 | 0.0351 | 1.83 | 1.33, 2.54 | 2.42×10−04 | |
| MPM Panel | H1 | 0.5802 | 0.3317 | 0.4878 | 3.35 | 2.91, 3.86 | 1.73×10−62 |
| H2 | 0.0888 | 0.2000 | 0.131 | Base | - | - | |
| H3 | 0.0593 | 0.1230 | 0.0832 | 0.89 | 0.73, 1.10 | 0.275 | |
| H4 | 0.0209 | 0.0548 | 0.0336 | 0.80 | 0.59, 1.08 | 0.143 | |
| H5 | 0.1435 | 0.1570 | 0.1477 | 1.73 | 1.45, 2.06 | 6.24×10−10 | |
| H6 | 0.0359 | 0.0322 | 0.0353 | 2.19 | 1.61, 2.99 | 2.42×10−07 | |
| AREDS | H1 | 0.5855 | 0.3193 | 0.499 | 2.45 | 1.76, 3.41 | 9.70×10−08 |
| H2 | 0.0768 | 0.1566 | 0.1027 | Base | - | - | |
| H3 | 0.0536 | 0.1687 | 0.091 | 0.43 | 0.26, 0.71 | 0.001 | |
| H4 | 0.0130 | 0.0392 | 0.0215 | 0.48 | 0.19, 1.22 | 0.120 | |
| H5 | 0.1174 | 0.1386 | 0.1243 | 1.06 | 0.65, 1.719 | 0.830 | |
| H6 | 0.0333 | 0.0211 | 0.0294 | 2.27 | 0.89, 5.79 | 0.087 | |
| Indian | H1 | 0.4291 | 0.2391 | 0.3414 | 2.05 | 1.27, 3.30 | 0.003 |
| H2 | 0.0597 | 0.1696 | 0.1104 | Base | - | - | |
| H3 | 0.0746 | 0.1391 | 0.1044 | 0.63 | 0.31, 1.26 | 0.190 | |
| H4 | 0.0858 | 0.1870 | 0.1325 | 0.54 | 0.28, 1.07 | 0.078 | |
| H5 | 0.0858 | 0.1217 | 0.1024 | 0.59 | 0.29, 1.22 | 0.150 | |
| H7 | 0.0672 | 0.0043 | 0.0382 | 15.57 | 2.12, 114.30 | 0.007 | |
| SiMES | H1 | 0.0786 | 0.0768 | 0.0767 | 0.99 | 0.62, 1.57 | 0.950 |
| H2 | 0.4286 | 0.4161 | 0.4176 | Base | - | - | |
| H3 | 0.0143 | 0.0100 | 0.0102 | 1.69 | 0.58, 4.90 | 0.340 | |
| H4 | 0.0357 | 0.0393 | 0.0389 | 0.92 | 0.47, 1.80 | 0.800 | |
| H5 | 0.3893 | 0.4013 | 0.4003 | 0.99 | 0.76, 1.29 | 0.950 | |
| Meta CC | H1 | 2.75 | 2.51, 3.01 | 8.31×10−109 | |||
| H2 | Base | - | - | ||||
| H3 | 0.74 | 0.65, 0.85 | 1.83×10−05 | ||||
| H4 | 0.70 | 0.58, 0.85 | 2.81×10−04 | ||||
| H5 | 1.54 | 1.37, 1.73 | 1.73×10−13 | ||||
| H6 | 1.85 | 1.51, 2.27 | 3.52×10−09 | ||||
| Meta All | H1 | 2.00×10−90 | |||||
| H2 | Base | ||||||
| H3 | 8.07×10−05 | ||||||
| H4 | 0.003 | ||||||
| H5 | 1.44×10−10 | ||||||
| H6 | 4.52×10−09 | ||||||
*Odds ratio and p-values obtained from meta-analysis is given here and the results from the individual cohorts are seen in the Table S4.
Frequency from the pooled data (all cohorts).
Results from the pooled data.
Association using H2 haplotype as baseline, adjusted for covariates described in Table S8.
Meta analysis of all Caucasians non-familial cohorts.
Meta analysis of all Caucasian cohorts (non-family cohorts and family cohort in Table 2) and p-value indicates the Fisher p-value.
16] and Maller et al. [17] first showed that several SNPs in this region were more strongly associated with AMD than rs1061170; among the best ranked variants in these earlier reports were rs2274700 and rs1410996, respectively.
Figure 4The strength of the association signal, as indicated by −log(p value) on the Y-axis, is plotted against SNP location on chromosome 1 on the X-axis.
The (A) panel shows the p-values from individual clinical cohorts which are labeled in different colors and shapes (see Labels). The SNPs located within the 32 kb critical region show the best association signal in the 6 clinical datasets. Panel (B) shows the distribution of pair-wise linkage disequilibrium (D′) values between CFH and ZBTB41. The LD blocks are marked with black triangles; red indicates D′ values of 1 (complete LD) while white indicates D′ values of 0 (no LD whatsoever).
Association of ΔCNP147 with AMD, with and without inclusion of rs1329428 and rs203687, as covariates.
| ΔCNP147 Frequency | ΔCNP147 association with AMD | ΔCNP147 association with AMD | ||||||
| Cohorts | Cases | Controls | OR | 95% CI | p-value | OR | 95% CI | p-value |
| Custom Illumina | 0.1052 | 0.2119 | 2.21 | 1.90, 2.58 | 4.50×10−24 | 1.12 | 0.92, 1.37 | 0.24 |
| MPM Panel | 0.0948 | 0.2074 | 2.47 | 2.11, 2.88 | 2.77×10−30 | 1.04 | 0.85, 1.27 | 0.72 |
| AREDS | 0.0855 | 0.2314 | 3.23 | 2.22, 4.72 | 1.89×10−9 | 1.41 | 0.87, 2.29 | 0.16 |
| Meta-analysis | — | — | 2.34 | 2.10, 2.61 | 1.33×10−52 | 1.10 | 0.96, 1.26 | 0.15 |
| Indian | 0.2129 | 0.3675 | 2.06 | 1.35, 3.14 | 7.74×10−4 | NA | NA | NA |
| SiMES | 0.0504 | 0.0649 | 1.47 | 0.53, 4.08 | 0.46 | NA | NA | NA |
Frequencies obtained from the pooled data.
After adjusting for significant covariates as described in Table S8.
After adjusting for rs1329428 and rs203687, in addition to significant covariates as described in Table S8.
*Meta-analysis of the Custom illumina panel, MPM panel and AREDS cohort.
ΔCNP147 predicted by A-G-C haplotype at rs6677604, rs7542235 and rs16840639.
ΔCNP147 predicted by “A” allele at rs6677604.
Since none of the ΔCNP147-specific SNPs are genotyped in AREDS,the “A” allele at rs2019727 that is highly correlated (r2 = 0.9) with this deletion in the Caucasians was used as proxy.
ΔCNP147 predicted by the “G” allele at rs7542235.
NA - not adjusted for rs1329428 and rs203687 as their genotypes are not available.
CNP association in the FARMS cohort.
| Association of CNPs with AMD | Association of CNPs with AMD | |||||
| CNP | Effect | SE | LRT p-value | Effect | SE | LRT p-value |
| CNP147 | 0.689 | 0.182 | 0.0002 | 0.42 | 0.23 | 0.07 |
| CNP148 | 1.260 | 0.520 | 0.016 | 1.110 | 0.310 | 0.03 |
*Increase in every copy at of the CNP with increase in the AMD score.
Association of CNPs after adjusting for age and age2.
Association of CNPs after adjusting for age, age2, rs1329428, and rs203687.