Literature DB >> 25352737

Electronic medical records and genomics (eMERGE) network exploration in cataract: several new potential susceptibility loci.

Marylyn D Ritchie1, Shefali S Verma1, Molly A Hall1, Robert J Goodloe2, Richard L Berg3, Dave S Carrell4, Christopher S Carlson5, Lin Chen6, David R Crosslin7, Joshua C Denny8, Gail Jarvik9, Rongling Li10, James G Linneman11, Jyoti Pathak12, Peggy Peissig11, Luke V Rasmussen13, Andrea H Ramirez14, Xiaoming Wang15, Russell A Wilke16, Wendy A Wolf17, Eric S Torstenson2, Stephen D Turner18, Catherine A McCarty19.   

Abstract

PURPOSE: Cataract is the leading cause of blindness in the world, and in the United States accounts for approximately 60% of Medicare costs related to vision. The purpose of this study was to identify genetic markers for age-related cataract through a genome-wide association study (GWAS).
METHODS: In the electronic medical records and genomics (eMERGE) network, we ran an electronic phenotyping algorithm on individuals in each of five sites with electronic medical records linked to DNA biobanks. We performed a GWAS using 530,101 SNPs from the Illumina 660W-Quad in a total of 7,397 individuals (5,503 cases and 1,894 controls). We also performed an age-at-diagnosis case-only analysis.
RESULTS: We identified several statistically significant associations with age-related cataract (45 SNPs) as well as age at diagnosis (44 SNPs). The 45 SNPs associated with cataract at p<1×10(-5) are in several interesting genes, including ALDOB, MAP3K1, and MEF2C. All have potential biologic relationships with cataracts.
CONCLUSIONS: This is the first genome-wide association study of age-related cataract, and several regions of interest have been identified. The eMERGE network has pioneered the exploration of genomic associations in biobanks linked to electronic health records, and this study is another example of the utility of such resources. Explorations of age-related cataract including validation and replication of the association results identified herein are needed in future studies.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25352737      PMCID: PMC4168835     

Source DB:  PubMed          Journal:  Mol Vis        ISSN: 1090-0535            Impact factor:   2.367


Introduction

Cataract is the leading cause of blindness in the world [1,2], is the leading cause of vision loss in the United States [3], and accounts for approximately 60% of Medicare costs related to vision [4]. Summary prevalence estimates indicate that 17.2% of Americans aged 40 years and older have cataract in either eye and 5.1% have pseudophakia or aphakia (previous cataract surgery). In addition to the implications for healthcare delivery and healthcare costs, cataract has been shown to be associated with falls and increased mortality [5-12], possibly because of associated systemic conditions. Women have a slightly higher risk of having cataract than men [13]. With increased life expectancy, the number of cataract cases and cataract surgeries is expected to increase dramatically unless primary prevention strategies can be developed and successfully implemented. Several genetic loci have also been linked to cataract as an independent phenotypic trait. An extensive body of literature has addressed the role of genetics in childhood cataract [14], and it has been hypothesized that these same genes may be plausible candidates for age-related cataract [15]. It has been suggested that as many as 40 genes may be involved in age-related cataract [16]. Evidence for a major gene has been identified for cortical [17] and nuclear [18,19] cataract, with heritability estimates of 58% [20] and 48% [21], respectively. A whole genome STR scan conducted in families in Wisconsin revealed a major locus for age-related cortical cataract on chromosome 6p12-q12 [22], and specific candidate genes that have been studied include galactokinase (Gene_ID: 2584; OMIM: 604313) [23,24], apolipoprotein E (Gene_ID: 348; OMIM: 107741) [25], glutathione S-transferase (Gene_ID: 2944; OMIM: 138350)[26], N-acetyltransferase 2 (Gene_ID: 10; OMIM: 612182) [27,28], and estrogen metabolism genes [29]. Two recent studies found an association between the EPHA2 gene (Gene_ID: 1969; OMIM: 176946) and cataract [30,31]. Higher body mass index (BMI) has been shown in many studies to increase risk of cortical and posterior subcapsular (PSC) cataract (odds ratio [OR] = 1.5–2.5) [32-38]. A recent study found that nuclear cataract was not associated with obesity but was associated with the FTO obesity gene (Gene_ID: 79068; OMIM: 610966) in an Asian population [39]. Although familial aggregation studies have shown a potential role for gene and environment interactions in nuclear cataract [40,41], research in this area is limited. The association of glutathione S-transferase with cataract has been shown to be modified by smoking [42] and sunlight exposure [43]. No whole genome association SNP studies of age-related cataract in unrelated individuals have been reported in the medical literature. The purpose of this study was to conduct a genome-wide association study (GWAS) for age-related cataract and to prioritize top hits for further follow-up.

Methods

Phenotypic data

The National Human Genome Research Institute (NHGRI)-funded electronic medical records and genomics (eMERGE) network implemented an electronic phenotype algorithm to select cataract cases and controls [44]. Cataracts as a condition were selected by Marshfield Clinic as its primary eMERGE phenotype, and the algorithm, which uses diagnostic and procedure codes, was developed by the Marshfield Clinic Personalized Medicine Research Project (PMRP) investigators [45]. The five sites in eMERGE-I include Marshfield Clinic, Group Health Research Institute, Vanderbilt University, Mayo Clinic, and Northwestern University. This study included four of the sites: Marshfield Clinic, Group Health Research Institute, Vanderbilt University, and Mayo Clinic. Using an algorithm for a specific phenotype, each participating site extracted study samples for a specific disease or phenotype from the electronic health records (EHR). Once samples had been selected and genotyped, they were available for phenotyping with additional algorithms. Thus, the cataract algorithm was deployed across the network. The cases and the controls had to meet the following inclusion criteria: The cases were age 50 years and older at the time of diagnosis or surgery, and the controls were age 50 years or older at the time of the most recent eye exam and had had an eye exam within the previous 5 years. The controls had no diagnostic codes for cataract or evidence of cataract surgery. The cases were identified as “surgical” or “diagnosis only.” Surgical cases had undergone a cataract extraction in at least one eye. The diagnosis-only cases were required to have either cataract diagnoses on two or more dates or have one diagnosis date and natural language processing and optical character recognition (NLP/OCR) find one or more inclusion cataract terms. Cataract type was extracted from the notes using natural language processing and optical character recognition with validation through manual chart abstraction [45,46].

Genotypic data

Genome-wide genotyping has been performed on approximately 17,000 samples across the network at the Broad Institute and at the Center for Inherited Disease Research (CIDR) using the Illumina 660W-Quad or 1M-Duo Beadchips (CIDR, Baltimore, MD). For this particular study, which includes predominantly individuals of European descent, we used only the Illumina 660W-Quad platform. This platform consists of 561,490 SNPs and 95,876 intensity-only probes. Genotyping calls were made at either CIDR or Broad using BeadStudio version 3.3.7. The eMERGE Cataract dataset pre-quality control (QC) included 7,535 DNA samples and 344 HapMap controls: 3,968 Marshfield Clinic, 2,379 Group Health, 986 Mayo, and 202 Vanderbilt BioVU. Data were cleaned using the eMERGE QC pipeline developed by the eMERGE Genomics Working Group [47]. This process includes evaluation of the sample and marker call rate, gender mismatch, duplicate and HapMap concordance, batch effects, Hardy–Weinberg equilibrium, sample relatedness, and population stratification. After QC, 530,101 SNPs and 7,397 samples were used for analysis (see Table 1 for distribution by site). All genotype data and a detailed QC report for each individual site, as well as the merged eMERGE dataset, can be found on dbGaP, and the detailed eMERGE QC pipeline can be found in [47,48].
Table 1

Descriptive statistics on eMERGE cataract data set.

Study sampleSiteNumberNumber MissingTotal




7397
0
7397

Marshfield Clinic

Total


3914 (52.91%)



Cases
2557





Controls
1357



Mayo Clinic

Total


952 (12.87%)



Cases
606





Controls
346



Group Health

Total


2346 (31.72%)



Cases
2235





Controls
111



Vanderbilt

Total


185 (2.50%)



Cases
105





Controls
80



White


7109 (96.11%)


Race
Black


114 (1.54%)
0
7397
(p=0.1157)







Other


174 (2.35%)


Case-Control Cataract


Cases
5503
0
7397



Controls
1894


Cataract Age at Diagnosis (Case only)

Mean±SD
70.50±8.09
7296
101
7397


Median
71





IQR(25%,75%)
(66,76)





Range
35 - 136



Sex
Cases

Male
2401
0
5503
(p=0.7684)


Female
3102



Controls

Male
819
0
1894



Female
1075


Birthdate Year**
Cases
Mean±SD
2.3211±1.01
5462
41
5503
(p<0.0001)

Median
2





IQR(25%,75%)
(1,3)





Range
1.000 - 5.000




Controls
Mean±SD
4.07±0.936
1893
1
1894


Median
4





IQR(25%,75%)
(4,5)





Range
1.000 - 6.000



Diabetes (p<0.0001)
Cases

Yes
921
2613
5503



No
1969



Controls

Yes
306
323
1894
No1265

** Birthdate Year denotes decade of birth where 1=1910, 2=1920, 3=1930, 4=1940, 5=1950, 6=1960

** Birthdate Year denotes decade of birth where 1=1910, 2=1920, 3=1930, 4=1940, 5=1950, 6=1960

Statistical analyses

Single-locus tests of association were performed using PLINK [49] assuming an additive genetic model for all 530,101 SNPs in a total of 7,397 unrelated individuals (5,503 cases and 1,894 controls). We calculated principal components using the EIGENSTRAT program [50] and thus adjusted our analyses for the first three principal components (PCs) to avoid any spurious associations that can be caused due to population stratification. EIGENSTRAT is based on principal components analysis and is used to detect and correct for population stratification in genome-wide association studies. Thus, we present the results of the analysis adjusted by principal components 1–3 (PC1–3). We also performed an age-at-diagnosis association analysis using cases only. Age at diagnosis is defined as the age when the first cataract diagnosis was made in the electronic health record. We performed unadjusted analysis and adjusted for PC1–3 using linear regression in PLINK. In Table 2 and Table 3, we report all p values <1×10−5. All associations identified by our analyses are suggestive and must be replicated in independent datasets because the signals did not reach a Bonferroni corrected genome-wide statistical significance level.
Table 2

PC adjusted case-control association analysis results.

CHRSNPReference AlleleCase MAFORP valueGeneLeft GeneRight GeneType of Variant
16
rs8044853
T
0.335
0.7099
2.42E-06
NA
GAN
CMIP
NA
9
rs1929494
T
0.4391
1.217
2.46E-06
LOC100129210
ALDOB
C9orf125
intron
22
rs926937
A
0.045
0.8525
6.09E-06
NA
LOC100130624
MN1
NA
16
rs9927153
A
0.2391
0.8359
9.38E-06
NA
GAN
CMIP
NA
16
rs2098753
G
0.3183
0.8106
1.06E-05
NA
GAN
CMIP
NA
5
rs9292118
A
0.2659
1.193
1.17E-05
NA
LOC441073
MAP3K1
NA
1
rs16853148
A
0.059
1.28
0.000012
NA
PRDM2
RP1–21O18.1
NA
5
rs13178221
T
0.243
1.203
1.33E-05
NA
LOC441073
MAP3K1
NA
9
rs882809
T
0.3823
0.7482
1.48E-05
LOC100129210
ALDOB
LOC100129210
near-gene-5
10
rs9299674
G
0.3242
0.7436
1.53E-05
NA
LOC441550
LOC439953
NA
10
rs4301693
C
0.1521
1.184
1.84E-05
NA
LOC441550
LOC439953
NA
2
rs10197959
A
0.4305
0.8409
1.87E-05
DNER
PID1
LOC100130031
intron
16
rs1563655
A
0.3251
0.8514
2.04E-05
NA
GAN
CMIP
NA
2
rs4853633
T
0.1937
1.241
2.17E-05
NA
MSTN
MGC13057
NA
15
rs8027435
T
0.4498
1.235
2.23E-05
NA
ARRDC4
LOC728459
NA
3
rs13074058
C
0.0789
0.8423
0.000028
LOC285382
VPS8
EHHADH
intron
10
rs549676
C
0.4961
1.219
3.19E-05
NA
PITRM1
KLF6
NA
2
rs10864871
C
0.2922
0.7878
3.26E-05
NA
hCG_2045614
LOC728241
NA
6
rs9405313
A
0.1204
0.7855
3.31E-05
NA
LY86
RP11–320C15.1
NA
4
rs4695885
C
0.3323
1.222
3.96E-05
NA
LOC100128266
FBXO8
NA
4
rs2015977
A
0.4608
0.5185
4.08E-05
NA
LOC391656
LOC100131441
NA
16
rs310011
G
0.4267
0.8545
4.39E-05
NA
GAN
CMIP
NA
3
rs3732933
A
0.0718
1.181
4.42E-05
EHHADH
C3orf70
EIF2S2P2
reference
12
rs7963343
C
0.1752
1.203
4.49E-05
LOC100129881
CRADD
LOC441644
intron
20
rs6073358
T
0.0897
0.8249
4.57E-05
JPH2
TOX2
C20orf111
intron
18
rs7244678
C
0.0764
1.183
6.02E-05
IMPA2
MPPE1
LOC646044
intron
19
rs7252479
A
0.0516
0.8323
6.02E-05
ZNF578
LOC441862
ZNF808
intron
15
rs1993976
A
0.4469
0.7933
6.74E-05
NA
ARRDC4
LOC728459
NA
3
rs17008958
A
0.1439
0.7515
7.02E-05
EIF4E3
FOXP1
GPR27
intron
13
rs943386
G
0.324
1.258
7.13E-05
NA
LOC646208
LOC100130029
NA
17
rs4531770
C
0.1407
0.8437
0.000072
NA
hCG_1644301
FLJ37644
NA
5
rs3850653
A
0.2327
1.178
7.26E-05
NA
MEF2C
LOC729011
NA
1
rs10746432
A
0.4345
0.8413
7.53E-05
HHAT
LOC100129235
KCNH1
intron
5
rs160044
T
0.3105
1.232
7.62E-05
MEF2C
LOC645323
LOC729011
intron
3
rs1447899
T
0.2838
1.246
0.00008
EIF4E3
FOXP1
GPR27
intron
12
rs4831958
T
0.0711
0.8469
8.04E-05
NA
LOC100130336
LOC100131830
NA
4
rs6814129
G
0.4445
1.217
8.11E-05
NA
MRPS36P2
LOC644325
NA
9
rs12347205
A
0.3934
1.21
8.47E-05
NA
IL6RL1
OR7E31P
NA
9
rs951611
T
0.0095
0.8008
9.12E-05
NA
LOC286239
LOC401497
NA
1
rs4951508
T
0.2343
0.7486
9.58E-05
HHAT
LOC100129235
KCNH1
intron
6
rs9379053
A
0.1076
0.7166
9.65E-05
NA
LY86
RP11–320C15.1
NA
20
rs1337906
C
0.3422
1.179
9.84E-05
NA
RPL41P1
ST13P
NA
9
rs2148996
T
0.4654
0.8493
9.86E-05
NA
LOC392358
GAS1
NA
19
rs7247032
T
0.3922
0.7675
9.91E-05
NA
LOC100130084
USP29
NA
8rs4268128A0.2150.67619.97E-05NATNFRSF10BTNFRSF10CNA
Table 3

PC adjusted age-at-diagnosis association analysis results.

CHRSNPReference AlleleCase MAFBetaP valueGeneLeft GeneRight GeneType of Variant
12
rs12296937
G
0.0267
−1.08
6.39E-07
ACSS3
LIN7A
PPFIA2
intron
12
rs2574730
A
0.0371
−1.003
3.04E-06
ACSS3
LIN7A
PPFIA2
intron
12
rs769056
T
0.0369
0.6454
3.39E-06
ACSS3
LIN7A
PPFIA2
intron
12
rs11835432
T
0.1937
0.6667
7.64E-06
NA
LOC100132564
LOC644489
NA
1
rs207145
T
0.1237
−0.6722
7.9E-06
NA
LOC645506
GOT2L1
NA
12
rs2593270
A
0.2593
−1.263
1.06E-05
NA
LOC100132564
LOC644489
NA
12
rs2656824
G
0.2529
0.7466
1.19E-05
NA
LOC100132564
LOC644489
NA
15
rs4965818
G
0.3444
−0.8272
1.37E-05
SNRPA1
SELS
PCSK6
intron
12
rs337656
T
0.2225
−0.7027
1.45E-05
NA
LOC643264
CLLU1OS
NA
12
rs10778791
G
0.0354
−0.7015
2.08E-05
ACSS3
LIN7A
PPFIA2
intron
2
rs12612521
C
0.2144
−0.6601
0.000024
NA
LOC728241
LOC100131284
NA
2
rs10932058
C
0.4981
−0.7464
2.68E-05
NA
LOC100132132
LOC100132669
NA
15
rs748696
G
0.4491
−0.6481
2.98E-05
KIAA1199
FAM108C1
LOC100128570
intron
15
rs1524876
T
0.4568
0.7457
3.57E-05
MTMR10
MTMR15
TRPM1
intron
15
rs4778856
G
0.4667
0.7429
3.77E-05
KIAA1199
FAM108C1
LOC100128570
intron
9
rs2229594
T
0.1722
0.7258
3.95E-05
BAAT
LOC347275
LOC100128665
utr-3
16
rs933717
T
0.435
0.6402
0.000041
FBXO31
LOC730018
MAP1LC3B
intron
1
rs6663771
G
0.4138
0.652
4.24E-05
NA
SPATA17
RRP15
NA
15
rs1432442
G
0.0913
0.6621
0.000043
MAP2K1
ATP5J2P6
SNAPC5
intron
5
rs2468475
T
0.473
−0.6838
4.59E-05
NA
LOC100128659
LOC729862
NA
4
rs2406040
G
0.266
0.8714
4.59E-05
NA
LOC646316
LOC729578
NA
4
rs2406041
C
0.2591
−1.213
0.000051
NA
LOC646316
LOC729578
NA
2
rs13414831
G
0.2974
−2.441
5.34E-05
NA
UBR3
MYO3B
NA
20
rs864184
A
0.2301
−1.864
0.000054
PHACTR3
LOC645605
SYPC2
intron
4
rs10517073
T
0.4173
−2.027
5.98E-05
ANAPC4
ZCCHC4
LOC645433
intron
18
rs578026
C
0.3204
−2.022
0.000061
CLUL1
CETN1
C18orf56
intron
2
rs16857804
G
0.2944
−0.8456
6.24E-05
NA
UBR3
MYO3B
NA
2
rs4560089
G
0.3525
1.146
6.35E-05
NA
LOC100130842
MRPL50P1
NA
12
rs934078
A
0.1049
−1.048
6.57E-05
NA
OSTF1P
TBX3
NA
1
rs1416156
A
0.4182
0.8037
6.96E-05
NA
SPATA17
RRP15
NA
2
rs617222
A
0.2478
0.8032
7.03E-05
NA
LOC100129746
EPHA4
NA
4
rs2897305
G
0.2646
0.8959
7.06E-05
NA
LOC646316
LOC729578
NA
20
rs6070943
A
0.1765
−0.6294
7.23E-05
PHACTR3
LOC645605
SYCP2
intron
1
rs991007
T
0.1118
0.656
7.26E-05
INADL
TM2D1
L1TD1
intron
12
rs12099972
A
0.0821
1.132
7.71E-05
NA
LOC100129881
LOC441644
NA
2
rs9309489
A
0.2321
−0.6522
7.86E-05
NA
TACR1
FAM176A
NA
14
rs1742707
A
0.4406
0.6657
7.97E-05
NA
CPSF2
SLC24A4
NA
17
rs9908117
C
0.2586
−0.7235
8.27E-05
NA
LOC100128284
WSCD1
NA
9
rs7874443
C
0.3182
−0.6617
8.34E-05
NA
GOLM1
LOC100130433
NA
2
rs10195113
T
0.0657
−0.7182
8.41E-05
NA
SLC8A1
LOC729984
NA
5
rs1472606
G
0.3331
−0.69
0.000085
NA
SFXN1
HRH2
NA
18
rs7227421
G
0.0358
−1.679
9.02E-05
GNAL
LOC729602
CHMP1B
intron
10
rs4388822
T
0.0717
−0.7606
9.14E-05
NA
LOC439992
GRID1
NA
5rs2277939A0.3473−0.83829.34E-05SAP30LGALNT10HAND1intron

Results

Figure 1 shows the Manhattan plots for the single locus tests of association for cataract case control adjusted (Figure 1A) and age-at-diagnosis adjusted (Figure 1B) and Figure 2 shows the corresponding QQ plots for each GWAS analysis. Our top hits in the adjusted case-control analysis include gigaxonin (GAN; Gene_ID: 8139, OMIM: 605379; p value = 2.42×10−6), which encodes a member of the cytoskeletal Broad-Complex, Tramtrack, and Bric a brac (BTB/kelch) repeat family. The encoded protein plays a role in neurofilament architecture and is involved in mediating the ubiquitination and degradation of some proteins. Defects in this gene are a cause of giant axonal neuropathy (GAN). Other potential interesting findings include DNER (Gene_ID: 92737; OMIM: 607299; p value = 1.87×10−5), which encodes for the Delta and Notch-like epidermal growth factor-related receptor, and EHHADH (Gene_ID: 1962; OMIM: 607037; p value = 2.80×10−5) encodes for enoyl-CoA, hydratase/3-hydroxyacyl CoA dehydrogenase. Myocyte-specific enhancer factor 2C also known as MADS box transcription enhancer factor 2, polypeptide C is a protein that in humans is encoded by the MEF2C gene (Gene_ID: 4208; OMIM: 600662; p value = 7.26×10−5). MEF2C upregulates the expression of the homeodomain transcription factors DLX5 and DLX6, two transcription factors that are necessary for craniofacial development [51]. This could be another interesting link to cataracts.
Figure 1

Genome-wide association study Manhattan plots for cataract and age-at-cataract-diagnosis. A: Case-control adjusted by first three principal components and site where eMERGE data was collected. B: Age-at-diagnosis adjusted by first three principal components and site where eMERGE data were collected.

Figure 2

Quantile-quantile plot for analysis adjusted by the first three principal components and site where eMERGE data were collected.

Genome-wide association study Manhattan plots for cataract and age-at-cataract-diagnosis. A: Case-control adjusted by first three principal components and site where eMERGE data was collected. B: Age-at-diagnosis adjusted by first three principal components and site where eMERGE data were collected. Quantile-quantile plot for analysis adjusted by the first three principal components and site where eMERGE data were collected. Several SNPs in or near ALDOB (Gene_ID: 229; OMIM: 612724; p value = 2.46×10−6), which encodes for aldolase B, fructose-bisphosphate, were also associated with cataracts in our GWAS analysis. Mutations in this gene result in an autosomal recessive disorder of fructose intolerance, and cases of cataract have been reported in the first decade of life [52]. Another interesting associated gene is MAP3K1 (Gene_ID: 4214; OMIM: 600982; p value = 1.33×10−5), a functional mitogen-activated protein kinase kinase kinase 1. Molecular signatures of MAP3K1 have been shown to be important in embryonic eyelid closure in the mouse [53]. In total, 45 SNPs were statistically significant at p<10−5 or smaller. In the age-at-diagnosis analysis, our top hits include ACSS3 (Gene_ID: 79611; OMIM: 614356; p value = 6.39×10−7), which is acyl-CoA synthetase short-chain family member 3; EPHA4 (p value = 7.03×10−5), ephrin type-A receptor 4, which is a protein that in humans is encoded by the EPHA4 gene (Gene_ID: 2043; OMIM: 602188). This gene belongs to the ephrin receptor subfamily of the protein-tyrosine kinase family, along with EPHA2. EPH and EPH-related receptors have been implicated in mediating developmental events, especially in the nervous system [54].

Discussion

This study is the first genome-wide association study in age-related cataract reported in the literature. Cataract in type 2 diabetes has been investigated, and a region on chromosome 3p14.4–3p14.2 was identified in a Han Chinese population [55]. The five SNPs identified in that study do not show evidence of association in our eMERGE cataract GWAS. It is difficult to interpret these results, however, because age-related cataracts and cataracts in type 2 diabetics may be two different phenotypes, which may have disparate etiologies. In addition, our dataset does not have an overwhelming number of individuals with type 2 diabetes (see Table 1); thus, we were underpowered to explore this specific type of association. Other previously published research on gene mapping in cataracts supports a linkage region on chromosome 1 [56] and association with EPHA2 [30,31]. In our GWAS, we did not see evidence for association with EPHA2, although we did see association with EPHA4. One significant difference in this study is the phenotyping of cases and controls based on electronic health records (EHR) in population-based cohorts, rather than family-based samples. However, our study in addition to the literature supports the suggestion of cataract-susceptibility loci on chromosome 1. Replication studies and larger sample sizes are needed to validate and confirm these findings. Although the eMERGE network has demonstrated the utility of electronic phenotyping in EHR for several traits [57-61], there are inherent challenges with this approach. For ophthalmic conditions specifically, the abundance of EHR coded information is extremely limited or, in some health systems, absent. Thus, sophisticated phenotyping strategies must be established [45,46] Still, the success of the EHR and biobank approach for association studies is unprecedented. The ability to perform multiple GWAS simultaneously with no additional genotyping is an enormous benefit [58]. Once a set of patient samples has been genotyped on a genome-wide association platform, those data can be reused for multiple additional genotype-phenotype association studies. In particular, the eMERGE network has done quite a bit of this for quantitative traits and clinical laboratory variables such as cholesterol [60], red-blood cell indices [59], and white blood cell count [57]. The additional effort is expended on creating electronic phenotyping algorithms, rather than collecting samples and genotyping. Thus, this is an enormous resource for subsequent genotype-phenotype association studies. Future explorations of age-related cataract include validating and replicating the association results identified herein. Unfortunately, because of the sample size and limited power by stratifying cases and controls by the eMERGE site, we did not have the opportunity to replicate these findings within eMERGE. The goal is to identify a similar study population where these results can be explored. In addition, we are beginning to investigate the role of gene–gene and gene–environment interactions associated with cataracts [62]. Due to the complexity of the trait, we hypothesize that the genetic architecture will be similar to that of other complex traits: multigenic with a combination of genetic and environmental interactions. As demonstrated by this and other studies, the beauty of using an electronic health record is the ability to reuse genotyped samples for various phenotypes. The eMERGE network has clearly demonstrated the success of this study design, and continues to demonstrate the strengths and limitations of this approach.
  61 in total

1.  Fructose intolerance associated with congenital cataract. Report of a case.

Authors:  C Sitadevi; Y Ramaiah; Z Askari
Journal:  Indian J Pediatr       Date:  1968-10       Impact factor: 1.967

2.  Next-generation analysis of cataracts: determining knowledge driven gene-gene interactions using Biofilter, and gene-environment interactions using the PhenX Toolkit.

Authors:  Sarah A Pendergrass; Shefali S Verma; Emily R Holzinger; Carrie B Moore; John Wallace; Scott M Dudek; Wayne Huggins; Terrie Kitchner; Carol Waudby; Richard Berg; Catherine A McCarty; Marylyn D Ritchie
Journal:  Pac Symp Biocomput       Date:  2013

3.  Prevalence of cataract and pseudophakia/aphakia among adults in the United States.

Authors:  Nathan Congdon; Johannes R Vingerling; Barbara E K Klein; Sheila West; David S Friedman; John Kempen; Benita O'Colmain; Suh-Yuh Wu; Hugh R Taylor
Journal:  Arch Ophthalmol       Date:  2004-04

Review 4.  Molecular genetic basis of inherited cataract and associated phenotypes.

Authors:  M Ashwin Reddy; Peter J Francis; Vanita Berry; Shomi S Bhattacharya; Anthony T Moore
Journal:  Surv Ophthalmol       Date:  2004 May-Jun       Impact factor: 6.048

5.  Null genotype of glutathione S-transferase M1 is associated with senile cataract susceptibility in non-smoker females.

Authors:  Mostafa Saadat; Majid Farvardin-Jahromi; Hooshang Saadat
Journal:  Biochem Biophys Res Commun       Date:  2004-07-09       Impact factor: 3.575

6.  Identification of a major locus for age-related cortical cataract on chromosome 6p12-q12 in the Beaver Dam Eye Study.

Authors:  Sudha K Iyengar; Barbara E K Klein; Ronald Klein; Gyungah Jun; James H Schick; Christopher Millard; Rachel Liptak; Karlie Russo; Kristine E Lee; Robert C Elston
Journal:  Proc Natl Acad Sci U S A       Date:  2004-09-27       Impact factor: 11.205

7.  Associations of mortality with ocular disorders and an intervention of high-dose antioxidants and zinc in the Age-Related Eye Disease Study: AREDS Report No. 13.

Authors:  Traci E Clemons; Natalie Kurinij; Robert D Sperduto
Journal:  Arch Ophthalmol       Date:  2004-05

8.  Polymorphism of estrogen metabolism genes and cataract.

Authors:  S-M Lee; L-M Tseng; A-F Li; H-C Liu; T-Y Liu; C-W Chi
Journal:  Med Hypotheses       Date:  2004       Impact factor: 1.538

9.  Nuclear cataract shows significant familial aggregation in an older population after adjustment for possible shared environmental factors.

Authors:  Nathan Congdon; Karl W Broman; Hong Lai; Beatriz Munoz; Heidi Bowie; Donna Gilber; Robert Wojciechowski; Christine Alston; Sheila K West
Journal:  Invest Ophthalmol Vis Sci       Date:  2004-07       Impact factor: 4.799

Review 10.  Molecular genetics of age-related cataract.

Authors:  J Fielding Hejtmancik; Marc Kantorow
Journal:  Exp Eye Res       Date:  2004-07       Impact factor: 3.467

View more
  22 in total

Review 1.  Molecular Genetics of Cataract.

Authors:  Alan Shiels; J Fielding Hejtmancik
Journal:  Prog Mol Biol Transl Sci       Date:  2015-06-12       Impact factor: 3.622

Review 2.  Unravelling the human genome-phenome relationship using phenome-wide association studies.

Authors:  William S Bush; Matthew T Oetjens; Dana C Crawford
Journal:  Nat Rev Genet       Date:  2016-02-15       Impact factor: 53.242

3.  Exome Array Analysis of Nuclear Lens Opacity.

Authors:  Stephanie J Loomis; Alison P Klein; Kristine E Lee; Fei Chen; Samantha Bomotti; Barbara Truitt; Sudha K Iyengar; Ronald Klein; Barbara E K Klein; Priya Duggal
Journal:  Ophthalmic Epidemiol       Date:  2017-11-28       Impact factor: 1.648

4.  Genetically-guided algorithm development and sample size optimization for age-related macular degeneration cases and controls in electronic health records from the VA Million Veteran Program.

Authors:  Christopher W Halladay; Tamer Hadi; Matthew D Anger; Paul B Greenberg; Jack M Sullivan; P Eric Konicki; Neal S Peachey; Robert P Igo; Sudha K Iyengar; Wen-Chih Wu; Dana C Crawford
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

5.  Comorbidity Characterization Among eMERGE Institutions: A Pilot Evaluation with the Johns Hopkins Adjusted Clinical Groups® System.

Authors:  Casey Overby Taylor; Klaus W Lemke; Thomas M Richards; Kenneth D Roe; Ting He; Adelaide Arruda-Olson; David Carrell; Joshua C Denny; George Hripcsak; Krzysztof Kiryluk; Iftikhar Kullo; Eric B Larson; Peggy Peissig; Nephi A Walton; Wei Wei-Qi; Zi Ye; Christopher G Chute; Jonathan P Weiner
Journal:  AMIA Jt Summits Transl Sci Proc       Date:  2019-05-06

6.  PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability.

Authors:  Jacqueline C Kirby; Peter Speltz; Luke V Rasmussen; Melissa Basford; Omri Gottesman; Peggy L Peissig; Jennifer A Pacheco; Gerard Tromp; Jyotishman Pathak; David S Carrell; Stephen B Ellis; Todd Lingren; Will K Thompson; Guergana Savova; Jonathan Haines; Dan M Roden; Paul A Harris; Joshua C Denny
Journal:  J Am Med Inform Assoc       Date:  2016-03-28       Impact factor: 4.497

7.  Learning statistical models of phenotypes using noisy labeled training data.

Authors:  Vibhu Agarwal; Tanya Podchiyska; Juan M Banda; Veena Goel; Tiffany I Leung; Evan P Minty; Timothy E Sweeney; Elsie Gyang; Nigam H Shah
Journal:  J Am Med Inform Assoc       Date:  2016-05-12       Impact factor: 4.497

8.  A Rare Variant (rs933717) at FBXO31-MAP1LC3B in Chinese Is Associated With Systemic Lupus Erythematosus.

Authors:  Yuan-Yuan Qi; Xu-Jie Zhou; Swapan K Nath; Celi Sun; Yan-Na Wang; Ping Hou; Rong Mu; Chun Li; Jian-Ping Guo; Zhan-Guo Li; Geng Wang; Hu-Ji Xu; Yan-Jie Hao; Zhuo-Li Zhang; Wei-Hua Yue; Huoru Zhang; Ming-Hui Zhao; Hong Zhang
Journal:  Arthritis Rheumatol       Date:  2018-01-09       Impact factor: 10.995

9.  A case study evaluating the portability of an executable computable phenotype algorithm across multiple institutions and electronic health record environments.

Authors:  Jennifer A Pacheco; Luke V Rasmussen; Richard C Kiefer; Thomas R Campion; Peter Speltz; Robert J Carroll; Sarah C Stallings; Huan Mo; Monika Ahuja; Guoqian Jiang; Eric R LaRose; Peggy L Peissig; Ning Shang; Barbara Benoit; Vivian S Gainer; Kenneth Borthwick; Kathryn L Jackson; Ambrish Sharma; Andy Yizhou Wu; Abel N Kho; Dan M Roden; Jyotishman Pathak; Joshua C Denny; William K Thompson
Journal:  J Am Med Inform Assoc       Date:  2018-11-01       Impact factor: 4.497

Review 10.  Genes and genetics in eye diseases: a genomic medicine approach for investigating hereditary and inflammatory ocular disorders.

Authors:  Mahavir Singh; Suresh C Tyagi
Journal:  Int J Ophthalmol       Date:  2018-01-18       Impact factor: 1.779

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.