Literature DB >> 32246154

Family-based exome sequencing identifies rare coding variants in age-related macular degeneration.

Rinki Ratnapriya1,2, İlhan E Acar3, Maartje J Geerlings3, Kari Branham4, Alan Kwong5, Nicole T M Saksens3, Marc Pauper3, Jordi Corominas3, Madeline Kwicklis1, David Zipprer1, Margaret R Starostik1, Mohammad Othman4, Beverly Yashar4, Goncalo R Abecasis5, Emily Y Chew1, Deborah A Ferrington6, Carel B Hoyng3, Anand Swaroop1, Anneke I den Hollander3.   

Abstract

Genome-wide association studies (GWAS) have identified 52 independent variants at 34 genetic loci that are associated with age-related macular degeneration (AMD), the most common cause of incurable vision loss in the elderly worldwide. However, causal genes at the majority of these loci remain unknown. In this study, we performed whole exome sequencing of 264 individuals from 63 multiplex families with AMD and analyzed the data for rare protein-altering variants in candidate target genes at AMD-associated loci. Rare coding variants were identified in the CFH, PUS7, RXFP2, PHF12 and TACC2 genes in three or more families. In addition, we detected rare coding variants in the C9, SPEF2 and BCAR1 genes, which were previously suggested as likely causative genes at respective AMD susceptibility loci. Identification of rare variants in the CFH and C9 genes in our study validated previous reports of rare variants in complement pathway genes in AMD. We then extended our exome-wide analysis and identified rare protein-altering variants in 13 genes outside the AMD-GWAS loci in three or more families. Two of these genes, SCN10A and KIR2DL4, are of interest because variants in these genes also showed association with AMD in case-control cohorts, albeit not at the level of genome-wide significance. Our study presents the first large-scale, exome-wide analysis of rare variants in AMD. Further independent replications and molecular investigation of candidate target genes, reported here, would assist in gaining novel insights into mechanisms underlying AMD pathogenesis.
© The Author(s) 2020. Published by Oxford University Press.

Entities:  

Year:  2020        PMID: 32246154      PMCID: PMC7390936          DOI: 10.1093/hmg/ddaa057

Source DB:  PubMed          Journal:  Hum Mol Genet        ISSN: 0964-6906            Impact factor:   6.150


Introduction

The genetic basis of common diseases continues to be a subject of significant interest in human genetics research in the post-genome era. Genome-wide association studies (GWAS) have been successful in uncovering hundreds of common genetic variants associated with multiple complex traits (1). Despite representing a major advance in deciphering genetic structure of complex diseases, GWAS have faced two major setbacks. First, the associated variants often reside in non-coding regions of the genome, representing a tag SNP (single nucleotide polymorphism) for the causal variant(s). Thus, the causal gene(s) and genetic variant(s) are not immediately evident for a majority of identified loci. Second, barring few exceptions, the associated variants identified by GWAS explain only a small proportion of the disease heritability (2). These observations have led to shifts in paradigms including possible involvement of rare variants in complex traits. In fact, both common and rare variants have been recognized to contribute to the genetic architecture of multifactorial diseases (3). Nevertheless, common variants have long been central to these investigations, primarily because of the identification of millions of SNPs across the genome that could be genotyped in a high-throughput manner (4,5). The advent of next-generation sequencing technologies and reduced cost of sequencing have prompted a new wave of studies for evaluating the impact of rare and low-frequency variants in complex diseases (6). Age-related macular degeneration (AMD; MIM, 603075) is a late-onset multifactorial, neurodegenerative disease, which affects the macular region of the retina, eventually resulting in the loss of central vision. AMD afflicts 8.7% of the worldwide population (7) with over 10 million individuals in the United States alone, posing a substantial healthcare burden (8). The clinical presentation of the disease is progressive and can be broadly divided into early, intermediate and late (advanced) stage (9). Early and intermediate stages are mostly asymptomatic and characterized by changes in the retinal pigment epithelium (RPE) and accumulation of extracellular aggregates, called drusen. Central vision loss characterizes the advanced stage, which can be further subdivided into two forms: geographic atrophy (GA) and choroidal neovascularization (CNV) (10). Therapeutic interventions are available for many patients with CNV, which represents a small proportion of all AMD cases (11). AMD is a complex trait determined by a combination of advanced age, genetic variants and environmental factors such as smoking and nutrition (12,13). Genetic studies in AMD have contributed significantly to biological underpinnings of its pathogenesis (14,15). The most recent GWAS of 16,144 patients and 17,832 control individuals identified 52 independently associated variants at 34 loci that explain over 50% of the genomic heritability (16). These findings have highlighted the complement system, extracellular matrix remodeling and lipid metabolism in AMD pathogenesis. Rare variants are also suggested to contribute to the heritability in AMD (16,17). Targeted sequencing of candidate genes and whole exome sequencing (WES) have identified rare coding variants in the CFH, C3, CFI, C9, CFB and COL8A1 genes at GWAS loci (18–24). Additional studies have established an unequivocal role of the complement system in AMD pathology (25–30). At this stage, the contribution of rare variants outside of the GWAS loci remains largely unexplored due to power issues, as case-control designs require sequencing of large numbers of samples to achieve statistical significance. Early investigations indicating the aggregation of AMD in families strongly indicated a strong genetic contribution (31–34), and such families can serve as a reference for identifying rare variants and causal genes. Linkage analysis in AMD families had initially led to the identification of the 1q31 and 10q26 loci (35–38), which were later identified as the top two association signals in GWAS and/or by candidate gene association at the CFH (39–43) and ARMS2-HTRA1 (44–46) loci, respectively. In addition, exome sequencing in large, multiplex families identified a rare variant in the FBN2 gene (47) in the early form of macular degeneration and confirmed the occurrence of rare variants in the CFH, CFI, C9 and C3 genes at multiple AMD loci (21,28,29,48–52). Thus, AMD families can provide new insights into AMD by identifying additional rare variants and pointing to candidate causal genes. In this study, we performed WES of 264 individuals from 63 multiplex families. To determine causality of genes in loci previously identified by GWAS, we analyzed these families for rare protein-altering variants in coding regions at AMD-associated loci. In addition, we examined whole exome data for rare coding variants outside the AMD-associated loci to identify new AMD candidate genes.

Results

Whole exome sequencing in AMD families

We assembled a cohort of 226 affected and 38 unaffected individuals from 63 multiplex families with a history of AMD (Table 1). A majority of the cohort (51.5%) had advanced stages of the disease (GA, CNV or GA + CNV) with a mean age of 78.2 years. Individuals with early and intermediate stages had a mean age of 69.1 years and constituted 34% of the affected individuals (Table 1). We also included 38 unaffected members (age > 55 years) from 32 families with a mean age of 72.1 years. Exome capture and sequencing was performed at two different centers [at the National Eye Institute (NEI) for the NEI families and at ErasmusMC Rotterdam for the Radboudumc families] (Fig. 1). As the WES data were captured on different platforms, we extracted variants from the shared 58.55 Mb exome regions. WES data were analyzed following the guidelines of GATK best practices (Fig. 1), resulting in the identification of 107,348 coding variants (53). We identified 4957 variants after additional, multi-tiered filtering based on quality, allele depth, minor allele frequency (MAF) in matched controls and predicted effect of the variants (Fig. 1). The variants segregating in a family were considered only if the variant was shared among 80% or more affected family members and/or segregated in all but one affected individual. We excluded the variants that were present in available unaffected member(s) of the respective family.
Table 1

Overview of families analyzed

Radboudumc cohortNEI cohort
Number of families4419
Total number of individuals19569
Males, females73,12230, 39
Number of individuals with advanced AMD8254
Mean age for advanced AMD76.580.8
Number of individuals with early or intermediate AMD828
Mean age for early or intermediate AMD68.574.8
Number of unaffected individuals317
Mean age of unaffected individuals70.877.5
Figure 1

An overview of stepwise filtering of variants identified in multiplex AMD families using whole exome sequencing. We retained variants that were shared among 80% or more affected family members or segregated in all but one affected individual. Variants were discarded if they were present in one or more unaffected family members.

Overview of families analyzed An overview of stepwise filtering of variants identified in multiplex AMD families using whole exome sequencing. We retained variants that were shared among 80% or more affected family members or segregated in all but one affected individual. Variants were discarded if they were present in one or more unaffected family members.

Survey of rare coding variants in genes within GWAS loci

We first analyzed the families for rare (MAF < 1%) coding variants within 34 AMD-associated loci (Table 2). The locus regions were defined by genome-wide significant variants and those within linkage disequilibrium (LD) (r2 > 0.5), with an additional 500 kb to both sides, as described in the most recent AMD-GWAS study (16). We identified 91 rare variants in 78 genes within the LD intervals of 29 known AMD loci and initially focused on loci where rare coding variants were identified in three or more families. Variants were detected in the CFH, PUS7, RXFP2 and PHF12 genes in three families and in the TACC2 gene in four families (Fig. 2A–D and F; Table 2). We also included genes that were predicted as most likely candidates using the gene priority score (GPS), as described previously (16). The scoring was based on expression of genes in AMD-relevant tissues (retina, RPE and choroid) and the presence of rare variants in AMD patients and their known biological functions. By combining our WES results with GPS score, we identified three additional likely AMD genes: SPEF2 (PRLR-SPEF2 locus), BCAR1 (CTRB2-CTRB1 locus) and C9 (C9 locus) (Table 2). Two independent non-synonymous variants (R201G and R421H) in SPEF2 were predicted to be pathogenic and identified in two families (Fig. 2G). No additional segregating rare variants were uncovered among the 16 genes within the LD region of the PRLR-SPEF2 locus. A non-synonymous and splice variant in BCAR1 is segregated in two families (Fig. 2H). In addition, a single family with four affected members revealed the segregation of a rare, likely pathogenic variant P167S in C9 (Fig. 2D).
Table 2

Rare coding variants (MAF < 1%) in genes within GWAS loci identified by WES

Locus (agenes within LD ± 500 kb)GeneFamily IDChr: Posrs IdMAF_NFEVariantsExonic functionProtein change
Genes with rare variants in three or more families
CFH (13) CFH AMD5801:196621252rs1422665510NM_000186:exon1:c.5G > CNon-synonymousR2T
CFH W11-2310_21:196646702NM_000186:exon5:c.524G > ANon-synonymousR175Q
CFH W10-0408_11:1966543110NM_000186:exon7:c.908G > ANon-synonymousR303Q
KMT2E/SRPK2 (8) PUS7 AMD9307:105148683rs1390582700.006NM_001318163:exon1:c.277 T > CNon-synonymousC93R
PUS7 AMD4797:105148683rs1390582700.006NM_001318163:exon1:c.277 T > CNon-synonymousC93R
PUS7 W08–05537:105148893NM_001318163:exon1:c.67A > GNon-synonymousS23G
B3GALTL (7) RXFP2 W11–465613:32351535rs1219183030.0072NM_001166058:exon8:c.664A > CNon-synonymousT222P
RXFP2 AMD58013:32352714rs731633170.0078NM_001166058:exon9:c.779A > GNon-synonymousN260S
RXFP2 W07–019913:32367033rs1389512900.0014NM_001166058:exon15:c.1522C > GNon-synonymousR508G
TMEM9/VTN (38) PHF12 W11-1525_117:27238135rs1483474850.0083NM_001033561:exon10:c.2210A > GNon-synonymousN737S
PHF12 AMD58017:27251125NM_001033561:exon4:c.517A > TNon-synonymousT173S
PHF12 AMD93017:27238135rs1483474850.0083NM_001033561:exon10:c.2210A > GNon-synonymousN737S
ARMS2/HRTA1 (15) TACC2 W09–183210:123844894rs1121883130.0068NM_001291876:exon4:c.2879G > ANon-synonymousR960K
TACC2 W10-0408_110:123954596NM_001291878:exon2:c.110C > TNon-synonymousT37M
TACC2 W11-1525_210:123844894rs1121883130.0068NM_001291876:exon4:c.2879G > ANon-synonymousR960K
TACC2 AMD5610:123809983rs202197379NM_001291876:exon3:c.64G > ANon-synonymousA22T
Genes with highest GPSa
C9 (6) C9 W11–40355:39331894rs348829570.0066NM_001737:exon5:c.499C > TNon-synonymousP167S
PRLR/SPEF2 (16) SPEF2 AMD9305:35646784rs800103290.0066NM_024867:exon5:c.601A > GNon-synonymousR201G
SPEF2 AMD3935:35667268rs1395808770.0094NM_024867:exon9:c.1262G > ANon-synonymousR421H
CTRB2/CTRB1 (20) BCAR1 AMD93016:75268977rs617431040.01NM_001170721:exon4:c.1190G > ANon-synonymousR397Q
BCAR1 W11–404416:75286131rs740247540.0029NM_001170720:exon2:c.12 + 2 T > CSplicing

aGPS refers to gene priority score as described in Fritsche et al. (16), which was a scoring-based method for suggesting most likely candidate at known AMD loci based on expression, presence of rare variants and known biological function that were deemed important for AMD. Minor allele frequency (MAF) was from non-Finnish Europeans from ExAC database.

Figure 2

Segregation of rare variants in genes residing within AMD-associated loci. Filled symbols represent the individuals with AMD.

We next analyzed the expression of the eight genes where rare variants were identified in the retinal transcriptome and expression quantitative trait loci (eQTL) dataset that included both control and AMD donor samples (54) (Fig. 3). Six of these genes (CFH, PUS7, PHF12, TACC2, SPEF2 and BCAR1) were expressed in the retina (counts per million, CPM > 0), while no retinal expression was detected for RXPF2 and C9 (Fig. 3A). Additionally, we evaluated gene expression changes during AMD progression by comparing the gene expression profile of the control retina with the profiles of the retina during early, intermediate and late stages of the disease. While we did not observe a significant difference [at false discovery rate (FDR) ≤ 10%] during disease progression, several candidates showed a trend of either increased or decreased expression (Fig. 3B). We note that expression of four of the candidates (PUS7, PHF12, SPEF2, BCAR1) is regulated in the retina through common eQTL variants (Fig. 3C).
Figure 3

(A) Mean expression of candidate genes within AMD-associated loci in human donor retina. (B) Heatmap showing the fold change differences observed in genes during early-, intermediate- and late-stage AMD compared to normal retina. None of the candidate reaches statistical significance of FDR ≤ 10%. (C) Violin plots showing the relationship between the variant and the gene of an observed eQTL. Fewer individuals with homozygous genotypes of PUS7 (G/G) and SPEF2 (T/T) resulted in single point in violin plots. P-values for the eVariants: (rs170690–3.97 × 10−5), (7:105733118–1.39 × 10−5), (rs117886541–1.12 × 10−5), (rs78996920–3.85 × 10−10), (rs12520223–1.75 × 10−8), (rs150308036–3.15 × 10–7), (rs8064132–2.04 × 10−5).

Rare coding variants (MAF < 1%) in genes within GWAS loci identified by WES aGPS refers to gene priority score as described in Fritsche et al. (16), which was a scoring-based method for suggesting most likely candidate at known AMD loci based on expression, presence of rare variants and known biological function that were deemed important for AMD. Minor allele frequency (MAF) was from non-Finnish Europeans from ExAC database. Segregation of rare variants in genes residing within AMD-associated loci. Filled symbols represent the individuals with AMD. (A) Mean expression of candidate genes within AMD-associated loci in human donor retina. (B) Heatmap showing the fold change differences observed in genes during early-, intermediate- and late-stage AMD compared to normal retina. None of the candidate reaches statistical significance of FDR ≤ 10%. (C) Violin plots showing the relationship between the variant and the gene of an observed eQTL. Fewer individuals with homozygous genotypes of PUS7 (G/G) and SPEF2 (T/T) resulted in single point in violin plots. P-values for the eVariants: (rs170690–3.97 × 10−5), (7:105733118–1.39 × 10−5), (rs117886541–1.12 × 10−5), (rs78996920–3.85 × 10−10), (rs12520223–1.75 × 10−8), (rs150308036–3.15 × 10–7), (rs8064132–2.04 × 10−5).

Exome-wide analysis of rare pathogenic variants

We then extended our analysis in search of rare pathogenic variants in genes outside the GWAS loci. We selected genes with rare (MAF < 1%) variants that were detected in at least three families and applied an additional predictive causality filter to focus on the 1% most deleterious variants in the human genome (CADD score ≥ 20). We also removed the variants that did not pass the quality score filter in Exome Aggregation Consortium (ExAC) database. Our analyses resulted in the identification of 13 candidate genes with 38 rare variants segregating in three or more families (Table 3). As large or polymorphic genes harbor a large number of variants, we further prioritized the list based on the following criteria: (i) whether the gene was expressed in the human retina (CPM > 0) and is regulated by an eQTL, (ii) whether common variants around the candidate gene exhibited a suggestive association (P-value < 5 × 10−4) for AMD and (iii) whether the candidate gene or a closely related gene was shown to have a function relevant to AMD pathology. Ten candidate genes demonstrated expression in the human retina with a few showing changes during AMD progression (Fig. 4A and B). Five of the genes also had an eQTL in the retina (Fig. 4C). One candidate, SCN10A, showed a modest association (3:38800182_C/T; P-value: 9.63 × 10−5) in the AMD-GWAS (Fig. 4D). We identified three different rare variants in SCN10A, one nonsense variant (Q923X) and two non-synonymous variants (P893L and P332L) in three independent AMD families (Fig. 5A). A rare frameshift variant (S267fs) in KIR2DL4 is segregated in three families (Fig. 5B). Based on their role in mitochondrial biogenesis, ESRRA and VPS13B, each with rare variants in three different families, could be presented as additional interesting candidates (Fig. 5C and D).
Table 3

Candidates with rare coding variants (MAF < 1%) segregating in three or more families identified in exome-wide analysis

GeneChr: Posrs IdNo. of familyMAF_NFEExonic functionNucleotide changeAmino acid changeCADD_phred
CTDSP2 chr12: 58220816rs76940645110Non-synonymousNM_005730:exon4:c.317 T > CI106T25.3
TTN chr2: 179395813rs5586528470.0001Non-synonymousNM_003319:exon186:c.78334G > AV26112M22.1
chr2: 179482937rs726772320.0026Non-synonymousNM_003319:exon80:c.20053G > AV6685I21.6
chr2: 179482994rs726772310.0035Non-synonymousNM_003319:exon80:c.19996C > TR6666C22.2
chr2: 179486037rs726772250.0065Non-synonymousNM_003319:exon74:c.18213G > TK6071N22.3
chr2: 179486223rs173549920.0079Non-synonymousNM_003319:exon73:c.18133G > AD6045N23.2
chr2: 179486345rs1143317730.0015Non-synonymousNM_003319:exon73:c.18011A > TE6004V22.8
chr2: 179537200rs2020144780.006Non-synonymousNM_133378:exon149:c.30961G > AV10321I21.5
chr2: 179549407rs726500310.0077Non-synonymousNM_133378:exon128:c.28892C > TP9631L20.1
chr2: 179582913rs726489810.003Non-synonymousNM_133378:exon83:c.21088G > AE7030K22.7
chr2: 179585312rs174525880.008Non-synonymousNM_133378:exon77:c.19445C > TS6482L22.5
chr2: 179588622rs2013941170.0007Non-synonymousNM_133378:exon70:c.17632G > AA5878T23.6
KIR3DL3 chr19: 55237616rs1437658604NANon-synonymousNM_153443:exon3:c.168C > AN56KNA
chr19: 55246731rs602444NANon-synonymousNM_153443:exon6:c.961C > TH321YNA
DCHS2 chr4: 155156598rs14954884840.002Non-synonymousNM_017639:exon25:c.7841C > TP2614L24.9
chr4: 155411721rs1996210860.0042Non-synonymousNM_001142552:exon1:c.787C > TR263W25.9
chr4: 155411948rs1846190330.0074Non-synonymousNM_001142552:exon1:c.560G > TR187L24.2
DNAH14 chr1: 225340410rs1757881940.0094Non-synonymousNM_001373:exon32:c.4970G > AG1657E21.8
chr1: 225490915rs1400661300.0042Non-synonymousNM_001373:exon55:c.8410C > TR2804C34
chr1: 225528175rs1840947530.0034StopgainNM_001373:exon67:c.10171G > TE3391X58
SCN10A chr3: 3876812330StopgainNM_001293307:exon15:c.2767C > TQ923X24.9
chr3: 38768212rs1384134380.0014Non-synonymousNM_001293307:exon15:c.2678C > TP893L24.7
chr3: 387986060Non-synonymousNM_001293306:exon8:c.995C > TP332L33
ESRRA chr11: 64083290rs15084835930.0098Non-synonymousNM_001282450:exon7:c.1124G > AR375Q22.4
chr11: 64083293rs2019713620.0098Non-synonymousNM_001282450:exon7:c.1127G > TR376L22.1
ADPRHL1 chr13: 114077172rs14518772930.0084Non-synonymousNM_138430:exon7:c.1030G > AA344T20.5
chr13: 114107571rs1380297630.0076Non-synonymousNM_138430:exon1:c.182 T > CM61T26.2
KIR2DL4 chr19: 55324674rs113712653NAframeshift_insertionNM_001080772:exon6:c.802dupAS267fsNA
PLB1 chr2: 28805359rs14473737230.0013Non-synonymousNM_001170585:exon24:c.1687A > GR563G22.3
chr2: 28854972rs74701215NANon-synonymousNM_001170585:exon54:c.3934C > AP1312T28.1
TAF1C chr16: 84212875rs19997656730.0037Non-synonymousNM_001243158:exon11:c.1286C > TS429L31
chr16: 84213651-NANon-synonymousNM_001243158:exon10:c.604C > AQ202K23.3
chr16: 84215010-0Non-synonymousNM_001243158:exon7:c.170G > AR57H34
UNC80 chr2: 210683829rs20047365230.0019Non-synonymousNM_032504:exon12:c.1806G > CQ602H23.1
chr2: 210707144rs789121920.0003Non-synonymousNM_032504:exon21:c.3434A > CE1145A27.9
chr2: 210860221rs1997833520.0005Non-synonymousNM_182587:exon63:c.9607G > AE3203K23.4
VPS13B chr8: 100147957rs14320529630.0005Non-synonymousNM_015243:exon11:c.1559A > GH520R24.8
chr8: 100443885rs617537220.0091Non-synonymousNM_017890:exon22:c.3203C > TT1068I24.2
Figure 4

(A) Mean expression of the candidate genes with rare variants (MAF < 1%) in three or more families that are located outside of AMD-associated loci. (B) Heatmap showing the fold change differences observed in genes during early-, intermediate- and late-stage AMD compared to normal retina. (C) Violin plots showing the relationship between the variant and the gene of an eQTL. Fewer individuals with homozygous genotypes of TTN (G/G) resulted in single point in the violin plot. P-values for the eVariants: (rs66773470–2.66 × 10−6), (rs2278043–5.92 × 10−60), (rs35615286–4 × 10−5), (rs58805123–2.50 × 10−5), (rs11889787–4.11 × 10−5), (rs1800792–1.56 × 10−5). (D) LocusZoom plot (80) showing the association signals around SCN10A in most recent GWAS analysis in AMD (16).

Figure 5

Segregation of the rare variants in SCN10A (A), KIR2DL4 (B), ESRRA (C) and VPS13B (D) in AMD families.

Given that segregation of rare variants in small families can be detected by chance, we performed simulation analysis to test the null case of no variant effect on disease by randomly assigning alleles to subjects in our study and then calculating the probability of observing segregation in three or more families. We applied the segregation criteria similar to the criteria that were used to identify variants in the AMD families and performed 100 000 gene-level simulations for two different variant sets: one with rare variants identified at the AMD loci and the other with exome-wide, rare variants after filtering by CADD score. We observed an average gene-level type I error rate of 7.7 × 10−7 for the AMD loci analysis and 1.87 × 10−7 for the CADD score filtered exome-wide analysis. The study-level type I error was 4.6 x 10−4 (AMD loci) and 2.2 × 10−4 (exome-wide). Given the pedigree structure and variants identified in our study, it is therefore unlikely to detect segregation of rare variants in three or more families by chance.

Rare variant association in case-control cohort

We further tested whether candidate genes identified in this study exhibit rare variant associations in AMD case-control studies that employed WES or whole genome sequencing (WGS) to genotype rare variants in coding regions of the genome. We analyzed the burden of rare coding variants in the eight genes within GWAS loci (Table 2) and 13 genes identified in our exome-wide analysis (Table 3) in two different cohorts (see Methods) (24,30). A meta-analysis of rare variants in 3519 AMD subjects and 3754 controls suggested a burden of rare variants in the C9 (P = 0.024) and KIR2DL4 (P = 0.035) genes using the burden and variable threshold (VT) tests (Supplementary Material, Table S1).

Discussion

GWAS and deep sequencing efforts have strongly supported a role for common and rare variants in AMD susceptibility. In addition to conferring high risk for AMD, rare variant studies can provide important insights into the phenotypic characteristics of the carrier patients (55,56). The most recent GWAS examined both common and rare variants in a large cohort but was limited to the analyses of known rare variants that were present on the chip (16) and did not allow sequencing-based discovery of rare variants. A few sequencing studies attempted to harness the power of familial cases, but their search was restricted to known AMD loci (28,49). Here, we present the first, large-scale, genome-wide survey of rare coding variants in multiplex AMD families that identified several novel candidate genes/variants contributing to AMD susceptibility. Within known AMD susceptibility regions, we identified rare variants at eight loci; two of these in CFH and C9 have previously been reported in case-control and familial studies (18,20). These findings further strengthen the role of the complement pathway in AMD and validate the presence of highly penetrant, functional rare variants as causal variants at these loci. Our study suggested the causal genes at other AMD loci. PUS7 encodes a key RNA-modifying enzyme. RXFP2 encodes a receptor for glycoprotein hormones that contains a unique low-density lipoprotein type A (LDLa) module (57); though this gene is not expressed in the adult human retina (54), its homolog in zebrafish is transcribed in the developing brain and retina (58). PHF12 encodes a member of the PHD zinc finger family of proteins involved in regulation of ribosomal biogenesis and senescence (59) and could contribute to AMD pathology through activation of senescence. Although present in only two families, the findings of rare variants in SPEF2 and BCAR1 are also noteworthy as both of these candidates were highlighted as most likely candidate at the AMD-GWAS loci based on a gene priority score (16). SPEF2 is highly expressed in the human retina and shows a slight increase in advanced stages of AMD. SPEF2 is a cilia-related protein and its absence causes male infertility and primary ciliary dyskinesia (60). This protein is shown to interact with IFT-related protein, IFT20 (61) that is required for opsin trafficking and photoreceptor outer segment development (62). BCAR1 encodes a Src family kinase substrate that is shown to be involved in early retinal development, and a mouse model of Bcar1 exhibits dramatic disruption of the ganglion cell layer (63). Recently, BCAR1 was also identified as a putative causal gene at the CTRB2/CTRB1 locus by co-localization of GWAS and single-cell eQTL data (64). Candidates with rare coding variants (MAF < 1%) segregating in three or more families identified in exome-wide analysis (A) Mean expression of the candidate genes with rare variants (MAF < 1%) in three or more families that are located outside of AMD-associated loci. (B) Heatmap showing the fold change differences observed in genes during early-, intermediate- and late-stage AMD compared to normal retina. (C) Violin plots showing the relationship between the variant and the gene of an eQTL. Fewer individuals with homozygous genotypes of TTN (G/G) resulted in single point in the violin plot. P-values for the eVariants: (rs66773470–2.66 × 10−6), (rs2278043–5.92 × 10−60), (rs35615286–4 × 10−5), (rs58805123–2.50 × 10−5), (rs11889787–4.11 × 10−5), (rs1800792–1.56 × 10−5). (D) LocusZoom plot (80) showing the association signals around SCN10A in most recent GWAS analysis in AMD (16). Segregation of the rare variants in SCN10A (A), KIR2DL4 (B), ESRRA (C) and VPS13B (D) in AMD families. Our survey of candidate genes harboring rare variants outside AMD loci was aimed at identifying potential novel candidate genes underlying AMD pathology. Such candidates could have been missed in GWAS and other association studies if the gene confers the risk to AMD solely through rare variants, or if the effect size of the common risk variant is very low, warranting larger numbers of cases and controls to detect such associations. We identified 13 candidate genes that harbor rare variants in more than three families and show, by simulation studies, that the probability of these variants segregating by chance is very low. Thus, these genes represent reasonable candidates for AMD susceptibility. We propose SCN10A and KIR2DL4 as the most interesting AMD candidates. Common variants at the SCN10A gene have been associated with AMD in a recent AMD-GWAS (16) though not at the level of genome-wide significance, and a suggestive burden of rare variants in KIR2DL4 has been detected in a meta-analysis of WGS data in a case-control cohort composed of 3519 cases and 3754 controls (65). SCN10A is a voltage-gated sodium channel expressed in starburst amacrine cells and a subset of retinal ganglion cells (66), which is likely the reason for not observing its expression in the human retina. Gain-of-function mutations in this gene cause axonal degeneration leading to painful neuropathy (67). Several retinal diseases are caused by ion channel mutations including inherited macular degeneration (68). KIR2DL4, a member of the human killer cell Ig-like receptor (KIR) family, is expressed primarily in natural killer (NK) cells and plays an important role in innate immunity (69). NK cells interact with HLA class ligand through their KIR receptors and a certain combination of HLA-C, and KIR gene variants have been associated with AMD (70). ESRRA, an estrogen-related receptor is a key regulator of energy homeostasis and mitochondrial function. ESRRA-null mice show altered regulation of enzymes involved in lipid, eicosanoid and steroid synthesis (71). Dysfunction in lipid metabolism has been linked to AMD pathogenesis (72), making ESRRA an attractive candidate for further investigation. VPS13B is a transmembrane protein with a function in vesicle-mediated transport. Mutations in this gene cause Cohen syndrome, whose clinical features include non-progressive psychomotor retardation, microcephaly and retinal dystrophy (73). Our study demonstrates that WES in extended families could prove valuable for identifying rare coding risk variants in potentially novel AMD genes. Similar results could be obtained from traditional case-control studies; however, that would require study designs involving much larger cohorts. A further incorporation of expression data from a large cohort of normal and AMD donors (54) also uncovered new insights. These candidates do not reach statistical threshold (FDR ≤ 10%) in the differential expression analysis because of the clinical and genetic heterogeneity among AMD and normal individuals and limited power of the study to detect such differences. Investigation of larger cohorts as well as other AMD-relevant tissues such as RPE and choroid will be useful as additional line of evidence in validating these candidates as AMD-causing genes. In conclusion, our family-based exome sequencing studies identified rare coding variants for novel candidate genes at eight known GWAS loci and 13 additional candidates outside GWAS regions. Further independent replications and molecular investigations of the candidate genes and variants, reported here, could provide novel mechanisms and pathways underlying AMD pathogenesis.

Materials and Methods

Study samples

The study population consisted of 264 samples from 63 multiplex AMD families is collected at two different centers. The Radboudumc cohort (Fig. 1, Table 1) comprised of 44 families (195 subjects). All patients gave written consent and the local ethics committees on research involving human subjects approved the study. Family ascertainment and disease classification for NEI cohort have been described in detail elsewhere (16,37). Briefly, samples were ascertained from the clinical practice at the Kellogg Eye Center. Fundus photographs, fluorescein angiograms and eye examinations records were obtained for all probands and family members and were updated every 1–2 years. The recruitment and research protocols were reviewed and approved by the University of Michigan institutional review board, and informed consent was obtained from all study participants. Fundus findings in each eye were classified on the basis of a standardized set of diagnostic criteria established by the International Age-Related Maculopathy Epidemiological Study (74). The Declaration of Helsinki principles was followed for all procedures. The NEI cohort (Fig. 1, Table 1) consisted of 19 families (69 subjects) that were collected at the University of Michigan. All patients signed informed consent, and the Institutional Review Boards of the University of Michigan and the National Eye Institute approved the study.

Whole exome sequencing

Genomic DNA was extracted from the peripheral blood using standard methods. Genomic DNA samples were quantified using the Promega QuantiFluor® dsDNA system (Promega, Madison WI, USA), according to the manufacturer’s instructions. The NEI families were subjected to exome sequencing using standard library preparation protocol with Agilent SureSelect Human All exon 50 Mb kit (Agilent Technologies, Santa Clara, CA), following the manufacturer’s instructions (75). Captured libraries were amplified and converted to clusters using Illumina Cluster Station, and single-end 101 bp sequencing was performed on Illumina GAIIx (Illumina, Inc., San Diego, CA). The Nimblegen SeqCap EZ Exome v2.0 44 Mb kit (Roche Nimblegen, Inc., Madison, WI) was used for performing WES in the Radboudumc families. Illumina HiSeq2000 sequencer was used to perform paired-end sequencing, using Illumina TruSeq V3 chemistry (Illumina, Inc., San Diego, CA).

Primary bioinformatics analysis

FastQC (available at http://www.bioinformatics.babraham.ac.uk/projects/fastqc/) was used to confirm quality of sequencing, after which adapter indexes were removed using Trimmomatic (76). Mapping was performed on the hg19/GRCh37 human reference genome sequence build using BWA (77). Aligned reads were processed to mark duplicates using Picard (http://picard.sourceforge.net). The Genome Analysis Toolkit (GATK) recommendations for best practices were applied for variant calling, local realignment, base quality recalibration and variant recalibration (78). The annotation of variants was performed with ANNOVAR (79). Overlapping regions between the capture kits were identified using the mergeBed tools from the bedtools package (80). Alternate allele frequencies of the variants in non-Finnish European (NFE) populations were obtained from the Exome Aggregation Consortium, which constituted data from 33,370 individuals.

Variant filtering and prioritization

We removed the variants that did not pass the variant quality score recalibration (VQSR) filter. We further retained variants based on read depth (≥ 5×), and at least 20% of the reads were attributed to the alternate allele. We applied a segregation filter where we retained variants that were shared among 80% or more affected family members or segregated in all but one affected individual. Variants were discarded if they were present in one or more unaffected family members. We generated a list of rare variants with MAF < 1% and low-frequency variants based on allele frequency data from ExAC non-Finnish Europeans (81). The AMD locus regions were defined by the genome-wide significant variants and the variants within linkage disequilibrium (LD) (r2 > 0.5), adding a further 500 kb to both sides, as described in the most recent AMD-GWAS analysis (16). Finally, the variant-level data were collapsed into gene-level data by combining all variants observed in each gene across different families. For exome-wide analysis, we applied additional filters of predictive causality (CADD ≥ 20) and retained variants that passed variant quality score recalibration in the ExAC database. Common AMD-associated variants from the most recent GWAS (16) were visualized using LocusZoom (82).

Simulation analysis of rare variant segregation

Simulation analysis was performed to assess type I errors. We included all the variants that passed quality control, MAF and functional filters. We did not include the segregation filter in order to access the probability of genes/variants, segregating in three or more families. We started with our list of filtered variants with allele counts from the AMD loci analysis (2862 variant) and the exome-wide analysis (1299 variants) as our input. For each variant, we simulated the null case by randomly distributing its minor alleles across the 264 subjects in the study, with at most one allele per subject. Then, we applied the same segregation criteria we used in our real data analysis. Briefly, if any minor allele was assigned to a control subject, the variant was discarded; otherwise, we evaluated the segregation pattern of each family and counted properly segregating families as described above. We repeated this for all variants within each gene and then identified all segregating families. We performed 100,000 simulations for each gene in two different scenarios: (1) 2862 variants in 600 genes within the known AMD Loci and (2) 1299 variants with a CADD score ≥ 20 in 1176 genes exome-wide. The study-level type I error was calculated by multiplying the gene-level type I errors with the number of genes tested.

Expression and eQTL analysis of control and AMD retina

The transcriptome data of post-mortem human donor retinas from 453 donors at different stages of AMD and controls were analyzed for candidate gene prioritization. The donor retina that were graded for normal and disease status of AMD using Minnesota Grading System (MGS), with criteria similar to the Age-Related Eye Disease Study (AREDS). We analyzed 105 MGS1 (normal), 175 MGS2 (early AMD), 112 MGS3 (intermediate AMD) and 61 MGS4 (advanced AMD) after initial RNA-seq quality control described elsewhere. The mean age of donors was 80 years (range 55–107). The methods for RNA sequencing, gene expression quantitation, differential gene expression and eQTL analysis are described in detail elsewhere (54). Briefly, RSEM expected counts from 453 samples and 18,053 genes passing the expression-level filter (≥ 1 CPM in ≥ 10% of samples) were transformed into TMM-normalized CPM and then converted into log2 CPM with an offset of 1. Batch effects were estimated and corrected using the supervised surrogate variable analysis (supervised sva) method within the bioconductor sva package in R. The differential expression between MGS 1 controls and each disease stage was assessed using the limma package in R. Fold changes between these comparisons were presented as heatmaps and violin plots using in-house scripts in R. The eQTL analysis included 17,389 genes and 8,924,684 genotyped and imputed common variants. The mapping of cis-eQTLs [as defined by SNP-gene combination within ±1 Mb of the transcriptional start site (TSS) of each gene] was performed using QTLtools to identify genetic variants (eVariants) that control expression of genes (eGenes) at FDR ≤ 0.05.

Rare variant association analysis in AMD case-control studies

We analyzed two different cohorts for this analysis. The first cohort was analyzed using whole genome sequencing data in 4787 subjects consisting of 2394 cases and 2393 controls (30,81). All the cases had advanced AMD (GA or CNV). The second cohort was analyzed using whole exome sequencing in 1125 cases and 1361 controls (24). RAREMETAL was used to perform the meta-analysis of gene-based tests for rare variants (83). Click here for additional data file.
  82 in total

Review 1.  Vision from next generation sequencing: multi-dimensional genome-wide analysis for producing gene regulatory networks underlying retinal development, aging and disease.

Authors:  Hyun-Jin Yang; Rinki Ratnapriya; Tiziana Cogliati; Jung-Woong Kim; Anand Swaroop
Journal:  Prog Retin Eye Res       Date:  2015-02-07       Impact factor: 21.198

2.  Expression pattern of zebrafish rxfp2 homologue genes during embryonic development.

Authors:  Aldo Donizetti; Marcella Fiengo; Rosanna Del Gaudio; Giovanni Iazzetti; Paolo Pariante; Sergio Minucci; Francesco Aniello
Journal:  J Exp Zool B Mol Dev Evol       Date:  2015-07-14       Impact factor: 2.656

3.  Rare genetic variants in Tunisian Jewish patients suffering from age-related macular degeneration.

Authors:  Eran Pras; Dana Kristal; Nadav Shoshany; Dina Volodarsky; Inna Vulih; Gershon Celniker; Ofer Isakov; Noam Shomron; Elon Pras
Journal:  J Med Genet       Date:  2015-05-18       Impact factor: 6.318

Review 4.  A new perspective on lipid research in age-related macular degeneration.

Authors:  Elisabeth M van Leeuwen; Eszter Emri; Benedicte M J Merle; Johanna M Colijn; Eveline Kersten; Audrey Cougnard-Gregoire; Sascha Dammeier; Magda Meester-Smoor; Frances M Pool; Eiko K de Jong; Cécile Delcourt; Eduardo Rodrigez-Bocanegra; Marc Biarnés; Philip J Luthert; Marius Ueffing; Caroline C W Klaver; Everson Nogoceke; Anneke I den Hollander; Imre Lengyel
Journal:  Prog Retin Eye Res       Date:  2018-05-04       Impact factor: 21.198

5.  LocusZoom: regional visualization of genome-wide association scan results.

Authors:  Randall J Pruim; Ryan P Welch; Serena Sanna; Tanya M Teslovich; Peter S Chines; Terry P Gliedt; Michael Boehnke; Gonçalo R Abecasis; Cristen J Willer
Journal:  Bioinformatics       Date:  2010-07-15       Impact factor: 6.937

Review 6.  Unraveling a multifactorial late-onset disease: from genetic susceptibility to disease mechanisms for age-related macular degeneration.

Authors:  Anand Swaroop; Emily Y Chew; Catherine Bowes Rickman; Gonçalo R Abecasis
Journal:  Annu Rev Genomics Hum Genet       Date:  2009       Impact factor: 8.929

Review 7.  Global prevalence of age-related macular degeneration and disease burden projection for 2020 and 2040: a systematic review and meta-analysis.

Authors:  Wan Ling Wong; Xinyi Su; Xiang Li; Chui Ming G Cheung; Ronald Klein; Ching-Yu Cheng; Tien Yin Wong
Journal:  Lancet Glob Health       Date:  2014-01-03       Impact factor: 26.763

8.  Gain-of-function Nav1.8 mutations in painful neuropathy.

Authors:  Catharina G Faber; Giuseppe Lauria; Ingemar S J Merkies; Xiaoyang Cheng; Chongyang Han; Hye-Sook Ahn; Anna-Karin Persson; Janneke G J Hoeijmakers; Monique M Gerrits; Tiziana Pierro; Raffaella Lombardi; Dimos Kapetis; Sulayman D Dib-Hajj; Stephen G Waxman
Journal:  Proc Natl Acad Sci U S A       Date:  2012-10-31       Impact factor: 11.205

9.  Rare Genetic Variants Associated With Development of Age-Related Macular Degeneration.

Authors:  Nicole T M Saksens; Maartje J Geerlings; Bjorn Bakker; Tina Schick; Mohamed R Daha; Sascha Fauser; Camiel J F Boon; Eiko K de Jong; Carel B Hoyng; Anneke I den Hollander
Journal:  JAMA Ophthalmol       Date:  2016-03       Impact factor: 7.389

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  8 in total

1.  The human pseudouridine synthase PUS7 recognizes RNA with an extended multi-domain binding surface.

Authors:  Julia Guegueniat; Levon Halabelian; Hong Zeng; Aiping Dong; Yanjun Li; Hong Wu; Cheryl H Arrowsmith; Ute Kothe
Journal:  Nucleic Acids Res       Date:  2021-11-18       Impact factor: 16.971

2.  Gene-Based Association Testing of Dichotomous Traits With Generalized Functional Linear Mixed Models Using Extended Pedigrees: Applications to Age-Related Macular Degeneration.

Authors:  Yingda Jiang; Chi-Yang Chiu; Qi Yan; Wei Chen; Michael B Gorin; Yvette P Conley; M'Hamed Lajmi Lakhal-Chaieb; Richard J Cook; Christopher I Amos; Alexander F Wilson; Joan E Bailey-Wilson; Francis J McMahon; Ana I Vazquez; Ao Yuan; Xiaogang Zhong; Momiao Xiong; Daniel E Weeks; Ruzong Fan
Journal:  J Am Stat Assoc       Date:  2020-07-28       Impact factor: 5.033

Review 3.  The complement system in age-related macular degeneration.

Authors:  Angela Armento; Marius Ueffing; Simon J Clark
Journal:  Cell Mol Life Sci       Date:  2021-03-09       Impact factor: 9.261

4.  Genomic-Metabolomic Associations Support the Role of LIPC and Glycerophospholipids in Age-Related Macular Degeneration.

Authors:  Ines Lains; Shujian Zhu; Joan W Miller; Jessica Lasky-Su; Liming Liang; Deeba Husain; Xikun Han; Wonil Chung; Qianyu Yuan; Rachel S Kelly; Joao Q Gil; Raviv Katz; Archana Nigalye; Ivana K Kim; John B Miller; Isabel M Carreira; Rufino Silva; Demetrios G Vavvas
Journal:  Ophthalmol Sci       Date:  2021-03-19

Review 5.  Targeting the Complement Cascade for Treatment of Dry Age-Related Macular Degeneration.

Authors:  Prem N Patel; Parth A Patel; Matthew R Land; Ibrahim Bakerkhatib-Taha; Harris Ahmed; Veeral Sheth
Journal:  Biomedicines       Date:  2022-08-04

Review 6.  Molecular Genetic Mechanisms in Age-Related Macular Degeneration.

Authors:  Aumer Shughoury; Duriye Damla Sevgi; Thomas A Ciulla
Journal:  Genes (Basel)       Date:  2022-07-12       Impact factor: 4.141

7.  HTRA1 rs11528744, BCRA1 rs9928736, and B3GLCT rs4381465 are associated with age-related macular degeneration in a Chinese population.

Authors:  Guo Huang; Huan Li; Shuang Lai; Jialing Xiao; Liang Wang; Huijuan Xu; Chuntao Lei; Jinglan Zhang; Man Yu; Ping Shuai; Yuping Liu; Yi Shi; Kaijie Wang; Bo Gong
Journal:  Front Genet       Date:  2022-09-29       Impact factor: 4.772

Review 8.  Complement System and Potential Therapeutics in Age-Related Macular Degeneration.

Authors:  Young Gun Park; Yong Soo Park; In-Beom Kim
Journal:  Int J Mol Sci       Date:  2021-06-25       Impact factor: 5.923

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.