Literature DB >> 30133451

Case-control studies of gene-environment interactions. When a case might not be the case.

Iryna Lobach1, Joshua Sampson2, Alexander Alekseyenko3, Siarhei Lobach4, Li Zhang1,5,6.   

Abstract

Case-control Genome-Wide Association Studies (GWAS) provide a rich resource for studying the genetic architecture of complex diseases. A key is to elucidate how the genetic effects vary by the environment, what is traditionally defined by Gene-Environment interactions (GxE). The overlooked complication is that multiple, distinct pathophysiologic mechanisms may lead to the same clinical diagnosis and often these mechanisms have distinct genetic bases. In this paper, we first show that using the clinically diagnosed status can lead to severely biased estimates of GxE interactions in situations when the frequency of the pathologic diagnosis of interest, as compared to other diagnoses, depends on the environment. We then propose a pseudo-likelihood solution to correct the bias. Finally, we demonstrate our method in extensive simulations and in a GWAS of Alzheimer's disease.

Entities:  

Mesh:

Year:  2018        PMID: 30133451      PMCID: PMC6104951          DOI: 10.1371/journal.pone.0201140

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

We are interested in using data from a case-control Genome-Wide Association Studies (GWAS) to estimate how an “environmental variable” modifies the effect of a genetic variant on a specific, pathologically defined disease state. However, the complication is that in many GWAS, the cases are a heterogenous group, where multiple distinct pathologically defined disease states have led to a common set of symptoms and a shared clinical diagnosis. In these scenarios, a genetic variant will appear to interact with the environmental variable if the genetic variant affects the pathologically defined disease state of interest and the environmental variable is related to the proportion of cases with that disease state. The issue of heterogeneity among cases is, perhaps, most pronounced in neurologic and psychiatric disorders, where the clinically defined status is based primarily on descriptive criteria and is typically made in absence of biomarker measurements, imaging data, and biopsies. Our specific motivating study is a GWAS of late-onset Alzheimer’s disease (AD), a neurodegenerative disorder that is clinically characterized by progressive mental decline. Here, we are interested in identifying genetic variants specifically associated with a high abundance of amyloid deposits and neurofibrially tangles in the brain, which we refer to as “histopathologically defined AD.”[1] Specifically, we are interested in whether carrying the ApoE ε4 variant, which in the study is considered the “environmental variable”, modifies the effect of SNPs residing in Toll-Like Receptors (TLR) and Receptor for advanced glycation end products (RAGE) on histopathologically defined AD. Importantly, ApoE ε4 status is likely to be associated with the proportion of the GWAS cases who have histopathologically defined AD. Recent biomarker studies of AD [2] reported that 36% of ApoE ε4 non-carriers and 6% in ApoE ε4 carriers clinically diagnosed with AD do not have evidence of amyloid deposition. We provide a more detailed description of ApoE ε4, other the risk factors for AD and the heterogeneity of the disease in the Discussion section. We are interested to test an association between single nucleotide polymorphisms (SNPs) residing in Toll-Like Receptors (TLR) and the true AD diagnosis, i.e. our goal is to identify the genetic that might have lea to amyloid plaques with associated cognitive decline. TLRs play a key role in an innate immune response to invading pathogens and are also important for triggering the adaptive immune responses. Dysregulation of human toll-like receptor function has been shown in aging [3]. Specifically to the etiology of AD, TLRs act through modification of the inflammatory state of microglia/macrophages [1]. Receptor for advanced glycation end products (RAGE) has been identified as receptor for amyloid-beta peptide [4]. There is an extensive literature on how the estimates of the main genetic effect can be biased in situations when disease status is misclassified, i.e. the clinical and pathologic diagnoses do not correspond [5]. We extend the literature by investigating the impact of misdiagnosis on estimates of the Gene-Environment interaction (GxE). In case-control studies, the effects of covariates have been traditionally assessed using logistic regression analysis [6]. Recently, however, Chatterjee and Carroll [7] noticed and proved that the assumptions of Hardy-Weinberg Equilibrium and Gene x Environment independence can be leveraged in the appropriate retrospective analyses to gain statistical efficiency. We adopt the principals derived by Chatterjee and Carroll [7] and develop a pseudo-likelihood model in settings when a case defined based on the clinical diagnosis might not be the case in terms of the true diagnosis defined pathophysiologically. Our paper is organized as follows. First, in the Material and Methods section we present the setting, notation, and proposed pseudo-likelihood approach. Next, the Simulation Experiments section describes the simulation experiments conducted to compare the resulting performance of the proposed method with the performance of standard logistic regression using clinically defined disease. In the same section, we apply our method to the motivating study of AD. The Discussion section concludes the paper.

Materials and methods

We define G be the genotype, e.g. SNPs measured at multiple locations. Let X be the environmental variables that interact with G and let Z be other environmental variables. We assume that the genotype is independent of all environmental variables and the genotypes follows Hardy-Weinberg Equilibrium: G~Q(g,θ). If θ is the frequency of minor allele a when the major allele is A, then the Hardy-Weinberg Equilibrium model [8] according to the number of minor alleles is We define D = {0, 1} be observed clinical disease status defined based on a set of symptoms. Suppose that the same set of symptoms can be caused by two distinct pathophysiologic mechanisms. Let D be the true disease status defined based on the underlying pathology, where D = 1 indicates the disease of interest, while D = 1* is the nuisance disease. For ethical and/or budgetary reasons it might not be possible to measure the underlying pathology, hence D is latent. Instead, an evaluation is performed on a subset of patients or in an external reliability study. We define τ(X) = pr(D = 1|D = 1,X) to be the frequency of the true diagnosis of interest within the clinically diagnosed set that varies by the environment X. We let probabilities of the clinical and true diagnoses in the population to be and π = pr(D =d), respectively. The clinical and true diagnoses are related , which indicates that the probabilities of the clinical diagnosis are weighted sums of frequencies of the true diagnoses. If pr(D = d|D = d,X = x,G = g) = pr(D = d|D =d), then D is a surrogate of D. In this setting, ; hence if there is no relationship between (X,G) and D, neither there is one between (X,G) and D. We first consider a binary setting where the risk parameters are defined in terms of D = 1 vs. D = 1* and D = 0 combined. Then the risk model is defined in terms of coefficients B = (β0,β,β,β,β) by In the second setting that we consider the risk model is defined separately for D = 1 vs. D = 0 in terms of B = (β0,β,β,β,β) and for D = 1* vs. D = 0 in terms of by In Eq (2) B and B* might share coefficients, e.g. if . The observed data are collected using a case-control design where genetic and environmental variables are measured after the disease status is ascertained. However, the data will be analyzed as a random sample. To facilitate this analysis, we let δ = 1 be an indicator of selection into the study and consider the imaginary Bernoulli sampling with . Define and with a parameter set Ω = (κ0,β0,β,β,β,β,θ) For model (1) we define and for model (2) we define In addition we let Consider probability, Pr(D,G|X,Z,δ = 1) and define a function L(d,g,x,z;Ω) as follows. The pseudo-likelihood can be used in place of the likelihood function based on arguments provided in the Appendix. Define Ψ(d,g,x,z;Ω) to be the derivative of log{L(d,g,x,z;Ω)} with respect to Ω and where all expectations are taken with respect to the actual retrospective sampling scheme. Derivations shown in the Appendix demonstrate that under suitable regularity conditions there is a consistent sequence of solutions to with the following property Remark 1: The intercept parameter is a function of the probability of disease in the population. Hence, if the probability of clinical diagnosis in the population is known or a good bound can be specified, this information can be used while estimating parameters. This cannot be done in the usual logistic regression setting.

Results

Simulation experiments

The goal of the simulation study is to examine potential differences in the effect estimates of the genetic and environmental variables in their relationship to the 1) observed clinical diagnosis using the usual logistic regression model (uLR) and pseudo-likelihood model (pMLE) [7]; and 2) to the true disease status by using our pseudo-likelihood approach (pMLE-DX) that takes into account that only a proportion of the clinically diagnosed cases have the true disease. In pMLE-DX parameters are estimated based on Eq (4). Parameters are compared by their Bias and Root Mean Squared Error (RMSE). Simulations are performed using MatLab version R2017a. In each setting we simulate 500 datasets with n0 = n1 ∈ {1000,3000,5000,10000,50000}. We let the genotype (G) be a Bernoulli random variable with frequency 0.10 to mimic a SNP and allow its effect to follow a recessive or dominant model. We set our other parameters to be similar to the values observed in our GWAS of AD. The binary variable X = {ε4+,ε4−}, which represents the ApoE ε4 status according to presence or absence of ε4 allele that occurs in approximately 14% of the population. The proportion of the nuisance disease within the clinical diagnosis is defined as pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. The clinical diagnosis of late onset AD is defined for ages 65 and older. We simulated age (Z1) to be Bernoulli with frequency 0.50 e.g. corresponding to a median split. Sex (Z2) is Bernoulli with frequency 0.52 to reflect what we observed in the motivating data example of AD.

Setting A

We first examine a setting when the nuisance disease and controls are equivalent in that the risk parameters are defined for the disease of interest vs. the combination of controls and nuisance disease as in Eq (1). The risk coefficients are . In this setting, the frequency of the true disease status is pr(D = 1) = 46%, pr(D = 1|ε4−) = 40%, pr(D = 1|ε4+) = 82%. presents properties of the risk parameter estimates in the datasets with n0 = n1 = 3,000. Additionally, shown in are studies with n0 = n1 ∈ {1000,5000,10000,50000}. When the presence of the nuisance disease is ignored (uLR, pMLE), and are biased with elevated RMSE. For example, in a study with n0 = n1 = 3,000, the bias in is -0.31 in uLR and pMLE, while the bias is reduced to 0.005 by pMLE-DX. RMSE is 0.33 in uLR and pMLE, while it is reduced to 0.12 by pMLE-DX. Similarly, bias in is 0.56 in uLR and pMLE, while pMLE-DX reduces the bias by more than half. RMSE of is 2.5x larger when the presence of the nuisance disease is ignored. Notably, estimates of and are biased in uLR and pMLE. When sample size increased, the uLR bias in decreased, e.g. the bias is 0.08 in a study with n0 = n1 = 10,000; while the bias in persisted. Across all sample sizes, is biased by approximately -0.13, whereas considering the nuisance disease nearly eliminated the bias, e.g. to -0.01 in a study with n0 = n1 = 1000. The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. We next examine if the presence of the nuisance disease could lead us to erroneously conclude that there was a significant when β = 0. Here, we simulated datasets with β = 0. presents estimates in a study with n0 = n1 = 3000 and is based on studies with n0 = n1 ∈ {1000,5000,10000,50000}. Estimates of and β are clearly biased when the presence of the nuisance disease is ignored. For example, in a study with n0 = n1 = 3,000, pMLE-DX decreased the bias in from 0.12 in uLR and pMLE to 0.04, while RMSE remained approximately the same 0.41 vs. 0.43. Similarly, pMLE-DX reduced the bias in from -0.26 in uLR to 0.007. At the same time, the RMSE of went from 0.28 (uLR, pMLE) to 0.12 (pMLE-DX). Increasing the sample size reduced the uLR bias for e.g. the bias is 0.09 in a study with n0 = n1 = 10,000 but did not alleviate the substantial uLR bias in β. Across all sample sizes considered, the uLR estimates of β are biased by approximately -0.12, while pMLE-DX reduced the bias to e.g. 0.01 in a study with 1,000 cases and 1,000 controls. The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. We next consider the effect of underestimating pr(D = 1*|D = 1,ε4+) and pr(D = 1*|D = 1,ε4−) in the pseudo-likelihood. Here, we simulate data using the parameters specified above, but, when fitting the pseudo-likelihood (), set pr(D = 1*|D = 1,ε4−) = 0.3 and pr(D = 1*|D = 1,ε4+) = 0, i.e. underestimated by 6%. Naturally, this misspecification introduced bias in some of the estimates and hence increased RMSE. Estimates of β were generally affected more than the estimates of the other parameters. For example, in a study with 3,000 cases and 3,000 controls, bias in increased from 0.005 to -0.66 in pMLE-DX, while RMSE went from 0.12 to 0.67. In estimates of β, the bias increased from 0.22 to 0.32, while RMSE went up from 0.93 to 0.94. The bias in increased to -0.10 in a study with 3,000 cases and 3,000 controls, what has not reached the level of uLR where the bias is -0.12. Estimates of remained nearly unbiased with the same RMSE. We next consider the effect of overestimating pr(D = 1*|D = 1,ε4+) and pr(D = 1*|D = 1,ε4−) in the pseudo-likelihood (). Here, we simulate data using the parameters specified above, but, when fitting the pseudo-likelihood, set pr(D = 1*|D = 1,ε4−) = 0.42 and pr(D = 1*|D = 1,ε4+) = 0.16, i.e. overestimated by 6%. As expected, this misspecification inflated the bias in the risk estimates. For example, in a study of 3,000 cases and 3,000 controls, bias in increased from 0.005 to -0.43, while RMSE went from 0.12 to 0.44. Bias in decreased from 0.22 to 0.17, while RMSE remained the same. Estimates of β and remained nearly unbiased.

Setting B

We next examine a setting when two sets of parameters define the risk of disease, i.e. for D = 1 vs. D = 0 and D = 1* vs. D = 0 according to the risk model (2). (n0 = n1 = 3,000) and present parameter estimates in the setting when With these parameters, the frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 25.1%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 45.4%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) =20%, pr(D = 1*|ε4−) = 16.1%. When presence of the nuisance disease is ignored (uLR, pMLE), estimates of β0,β,β,β are substantially biased.For example, in a study with 3,000 cases and 3,000 controls, in the bias of uLR for is -0.22, while pMLE-DX reduced this bias to -0.006; the bias of uLR for is -0.13, while pMLE-DX reduced this bias to 0.01; the bias of uLR bias for is 0.30, while pMLE-DX reduced it to 0.005. Biases in uLR persisted for larger sample sizes. If a priori evidence is sufficient to set parameters and to 0, when in fact these coefficients are zero, then RMSE of pMLE-DX are further reduced by at least 2-fold (data not shown). The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4+) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. and present the results in a setting similar to that of but when there is no interaction between the genotype and ApoE4 status, i.e. β = 0. Ignoring the nuisance disease in the uLR resulted in bias in the estimate of β that is -0.23, which might mislead to a conclusion that there is an interactive effect between the genotype and ApoE ε4 status. The bias persisted for larger sample sizes. The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4−) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06.

Setting C

We next conducted a simulation study to better understand the underlying nature of the biases in the estimates noted when presence of the nuisance disease is ignored (uLR). For clarity, we simulated all variables to be binary. Variables G,Z1 and Z2 are Bernoulli with frequencies 0.10, 0.52 and 0.50, respectively. Risk coefficients are Then we varied values of , and β. The relationship between clinical and pathophysiological diagnosis is set to be pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. We simulated 500 datasets with 3,000 cases and 3,000 controls. presents a study where β varies as log(1),log(1.5),log(2),log(2.5),…log(8) across the x-axis and is color-coded to be 0, 0.5, 1, 1.5. We show in panels A, B, C, D, and E, the biases of , and , respectively. With increasing value of β, the biases in the main effect estimates of and β increase. For example, the bias in reaches -0.10 when β is log(5). The bias in and is even more sensitive to value of β. For example, when β = 0, the bias in is 0.8; while when β = log(8) the bias is -0.7. Similarly, when β = 0, the bias in is -0.18; while when β = log(8) the bias becomes 0.6. Bias in the estimates of increases with the increase in the true value. Bias in the other estimates is nearly not affected by values of . The bias in estimates of (, (. Simulated are datasets with 3,000 cases and 3,000 controls. Values of β are listed along the x-axis and the true values of are indicated by color. The parameters are set as follows: β0 = −1, β = log(1.5), β = log(3); the relationship between the clinical and true disease statuses is pr(D = 1*|D = 1,ε4-) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. Variables G,Z1 and Z2 are Bernoulli with frequencies 0.10, 0.50 and 0.52, respectively. presents a study where β varies as log(1),log(1.5),log(2),log(2.5),…log(8) across the x-axis and is color-coded to be 0, 0.5, 1, 1.5. We show in panels A, B, C, D, and E, the bias of , respectively. In this setting, the biases in the main effects and were approximately the same for all values of β, while the biases in the estimates of and were more sensitive to the value of β. For example, when the interaction coefficient is 0, the bias of is nearly -2, while when β = log(8) = 2.08, the bias goes up to 3. When β = 0, the bias in the estimate is nearly zero, while the bias goes to almost 6 when the true value is log(8). The bias in estimates of ( (. Simulated are datasets with 3,000 cases and 3,000 controls. Values of β are listed along the x-axis and the true values of are indicated by color. The parameters are set as follows: β0 = −1, β = log(1.5), β = log(3); the relationship between the clinical and true disease statuses is pr(D = 1*|D = 1,ε4-) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. Variables G,Z1 and Z2 are Bernoulli with frequencies 0.10, 0.50 and 0.52, respectively.

Analyses of genetic variants serving toll-like receptors and receptor for advanced glycation end products in Alzheimer’s disease

We applied the proposed analyses to a dataset collected as part of the Alzheimer’s Disease Genetics Consortium. The data has been anonymized prior to access by the authors. The data consists of 1,245 controls and 2,785 cases. The average age (SD) of Cases and controls are 72.1 (9.1) and 70.9 (8.8) years, respectively. Among cases, 1,458 (52.4%) are men; among controls, 678 (63.9%) are men. At least one ApoE ε4 allele is present in (64.5%) of cases and 365 (29.1%) of controls. Illumina Human 660K markers have been mapped onto human chromosomes using NCBI dbSNP database (https://www.ncbi.nlm.nih.gov/projects/SNP/). Chromosome location, proximal gene or genes and gene structure location (e.g. intron, exon, intergenic, UTR) has been recorded for all SNPs. From these data, we inferred 111 SNPs to reside in genes serving Toll-Like Receptors (TLR). Similarly, we inferred 3 SNPs to reside in the Receptor for advanced glycation end products (AGER). It is of interest to examine a relationship between the pathologic diagnosis and each of the 111 TLR SNPs (G), ApoE ε4 status (X), age (Z1), sex (Z2). The effect of SNPs might vary by ApoE ε4 hence we included interaction between the genotype and ApoE ε4 status. The genetic variables are modeled using a binary indicator of presence or absence of a minor allele. We estimate parameters using the standard logistic model (uLR) that uses the clinical diagnosis as a surrogate of the pathophysiologic diagnosis and the pseudo-likelihood model (pMLE-DX) where we assume that the relationship between the clinical and pathophysiologic diagnosis is as estimated in the Salloway study [2], i.e. the proportion of the nuisance disease within the clinically diagnosed set is 36% in ApoE ε4 non-carriers and 6% in ApoE ε4 carriers. The pseudo-likelihood model pMLE-DX estimates the coefficients in a model that treats the nuisance disease and controls equivalently as in Eq (1). pMLE-DX*, however, estimates two sets of the risk coefficients as in Eq (2). Data analyses are performed using MatLab version R2017a. When optimizing the pseudolikelihood function we bounded the estimates to be on the interval [-5,5]. We first examine the results when statistical significance is assessed according to p-value<0.05. We next correct for false discovery rate using Benjanimi-Hochberg method [9].

TLR

Shown in are estimates of the risk coefficients for 53 SNPs with permutation-based p-values for or that are <0.05 in either of the analyses. Of these 53 SNPs, 28 SNPs are within 500k up- or downstream of the SNPs previously reported in GWAS on Alzheimer’ disease, dementia, tauopathy, or/and vascular disease (). Analyses are performed using the usual logistic regression (uLR) that uses the clinical diagnosis as an outcome and using pseudo-likelihood method that assumes that the proportion of nuisance disease within the clinically diagnosed AD is 36% for ε4 non-carriers and is 6% for ε4 carriers. Pseudo-likelihood analyses pMLE-DX estimates parameters for D = 1 vs. D = 0 and D = 1* combined. Pseudo-likelihood analyses pMLE – DX*, however, estimate two sets of risk coefficients, i.e. βs for D = 0 vs. D = 1 and β*s D = 0 vs. D = 1*. Estimates of β*s are reported in . Estimates of β or β differ numerically between the three approaches. For 14 of these 53 SNPs, have p-values <0.05 in uLR, while in pMLE-DX and pMLE-DX* the corresponding p-values are >0.05. These associations detected by uLR might be spurious as a result of clinical-pathophysiological diagnoses relationship varying by ApoE ε4 status. One SNP, rs830832, has significant both in uLR () and . This SNP locates at the intergenic region between SORBS2 and TLR3 at Chromosome 4 and are 72k downstream of SNP rs75718659, which was reported associated with Alzheimer’s disease in a family-based GWAS [10]. Among the seven SNPs appear to have significant in pMLE- DX* but not uLR, two of the SNPs: rs4862611 ( = -2.8, p = 0.03) and rs1706143 ( = 2.9, p = 0.03), are also located at the intergenic region between SORBS2 and TLR3 at Chromosome 4 and are 80k and 20k downstream of SNP rs75718659. Nine of the SNPs appear to have significant in pMLE-DX* but not in uLR or pMLE-DX. Two SNPs, rs7676342 () and rs13113778 (), again are located in the intergenic region between SORBS2 and TLR3 at Chromosome 4 and are 80k and 100k downstream of SNP rs75718659, respectively. Three SNPs, rs955302 (), rs4837254 () and rs12342331 (), are located at the intergenic region between ASTN2 and TLR4 at Chromosome 9 and are 400k, 430k, and 492k downstream of rs1360695 associated with Schizophrenia [11]. Estimates of β, however, are generally larger in magnitude when estimated in pMLE-DX and pMLE- DX* models. Two SNPs appear to be associated with the diagnosis both in uLR and pMLE-DX. SNP rs7656500 (uLR and pMLE-DX ) locates at the intergenic region between KIAA0922 and TLR2 at Chromosome 4, and is 163k upstream and 144k downstream of rs727153 and rs1466662, respectively, which were reported associated with Alzheimer’s disease in two studies [12,13]. It is also 54k upstream of rs7654093 associated with thrombosis [14], 30k upstream of rs7659024 associated with Venous thromboembolism [15], 34k upstream of rs2066865 associated with Venous thromboembolism [16, 17], 52k upstream of rs6536024 associated with Venous thromboembolism [18], and 360k downstream of rs11099942 associated with Type 2 diabetes [19]. Among six SNPs which appear to be significantly associated with the nuisance diagnosis in absence of an interactive effect, three SNPs rs1869617 (at the intergenic region between SORBS2 and TLR3 at Chromosome 4, pMLE- ), rs3775296 (at the UTR region of TLR3 at Chromosome 4, pMLE- ), rs7668666 (at the INTRON region of TLR3 at Chromosome 4, pMLE- ) locate 110k, 176k and 179k, respectively, downstream of rs75718659 reported associated with Alzheimer’s disease [10] and another two SNPs rs16905625 (pMLE- ) and rs1890047 (pMLE- ) locate at the intergenic region between ASTN2 and TLR4 at Chromosome 9, 412k and 428k, respectively downstream of rs1360695 reported associated with Schizophrenia [11]. Estimates of β in the absence of interaction are generally larger in magnitude for the diagnosis of interest in pMLE-DX. For example, in a model with SNP rs1816702 (uLR and pMLE- ).

AGER

All of the three SNPs in the AGER gene measured in the data are associated with susceptibility to AD as inferred in uLR and also are associated with susceptibility to the nuisance disease when measured by pMLE- DX*. rs3134940 has been previously reported in association to breast cancer, type I diabetes and other phenotypes (https://www.gwascentral.org/marker/HGVM1600838/results?t=ZERO); rs1035798 and rs2070600 have been previously reported in association to rheumatoid arthritis (https://www.gwascentral.org/marker/HGVM275161/results?t=ZERO and https://www.gwascentral.org/marker/HGVM571318/results?t=ZERO).

Discussion

We investigated if disease heterogeneity among clinically diagnosed cases could introduce bias into the estimates of GxE interactions. We showed that when there is a strong association between the environmental variable and the relative risk of the disease of interest, as compared to the nuisance disease, and then there could be bias in either direction. We base our developments on the method by Chatterjee and Carroll [7] that is fully efficient in situations when the genetic and environmental variables are distributed independently in the population, a population-based genetics model is assumed for the genetic factors and the environmental variables are treated non-parametrically. Interestingly, in our analyses, the estimates of regression coefficients are qualitatively differed between the analyses that used the clinical diagnosis as a surrogate of the pathologic diagnosis and the analyses that used our newly proposed pseudo-likelihood approach that incorporates the uncertainty of the clinical diagnosis. Specifically, in TLR set for 13% of the SNPs examined, GxE was found to be significant in the relationship to the clinical diagnosis, while the pseudo-likelihood analyses inferred these GxE to be not significant. On the other hand, for 14% of the SNPs that we examined, GxE was found to be statistically significant only when we incorporated the uncertainty in the clinical-pathological diagnoses relationship. This finding is consistent with the conclusion reached by a study of phenotypic misclassification among cases [20] in situations when the misclassification is non-differential, i.e. is not a function of the environmental variables. The study concluded that presence of “non-cases” greatly decreased the estimates of risk attributed to the genetic variation. One of the major concerns in the analyses of the genetic studies has been the missing heritability, when the genetic markers identified thus far explain only a small portion of inter-person variability in familiar clustering of complex diseases [21]. The downward biases in the estimates associating GxE to the clinically diagnosed disease status might in part explain the missing heritability. On the other hand, the upward biases in these estimates might in part address the conclusion reached by [22] that only 1% of the association found are likely to be true. We examined estimates of the genetic effects, ApoE4 status, and age, sex consistent with the original publication on this dataset [23]. Epidemiologic evidence [24] suggests that the following factors play important role in AD risk: education/cognitive reserve, racial and ethnic difference, gender, smoking, drinking, head injury, diabetes, cardiovascular disease, obesity, social engagement, etc. However, not all of these factors have been consistently confirmed by subsequent studies, and considerable inconsistencies exist. For example, nicotine intake has been observed to decrease the risk of dementia due to the demonstrated ability of nicotine to stimulate neurotransmitter systems that are compromised in dementia [25]. More recent studies have suggested that nicotine intake may increase the risk of AD and also bring forward age of onset with APOE interactive effect [26]. The main conclusion reached in this paper is that using the clinically diagnosed status can lead to severely biased estimates of GxE interactions in situations when the frequency of the pathologic diagnosis of interest, as compared to other diagnoses, depends on the environment, and we aim to correct such biases by proposing pseudolikelihood method. AD dataset is mainly used for illustration, therefore, for clarity we restricted to variables to the minimum necessary instead of considering full risk prediction modes which might be able to better describe the inter-patient variability in susceptibility to AD. Although other factors are potentially important in predicting the risk of AD, this relatively simple model was able to achieve the main goals of the current manuscript. By recognizing and accounting for the potential of case heterogeneity, which biases the gene x environment interaction, our newly proposed method has the ability to remove this bias. Define E to be the set of variables in the model, i.e. age, sex. Let O define a set of key environmental variables omitted from the model. Addition of variables O would not modify the effect estimates of GxE beyond what is expected purely by chance if O does not interact with either G or E. Also, if conditional on the diagnosis of AD, GxE is independent of O, then omission of O does not change the effect estimate of GxE [27]. If, however, O interacts with GxE, then addition of these variables would change the effect estimate of GxE in the direction that is consistent with the direction of the GxE effect. Further studies that incorporate environmental variables, such as medical history, tobacco use, and infections are needed for their potential to modify the risk and the estimates of GxE in particular. Epigenetic mechanisms are well-recognized in the mediation of GxE and analysis of epigenetic changes at the genome scale can offer new insights into the relationship between brain epigenomes and AD. Further, candidate genes from epigenome-wide association studies interact with those from GWAS that can undergo epigenetic changes in their upstream gene regulatory elements [28]. However, an active conundrum is how the epigenetic mechanisms influence gene-environment interactions.

Appendix

Derivation of pseudo-likelihood (2) and covariance matrix

Derivation of the pseudo-likelihood (2) is straightforward. Next we demonstrate that the pseudo-likelihood (2) has zero mean when evaluated at the true parameters. Derivative of (2) with respect to Ω is Let p(x,z|η) be the density of the environment. Note the conditional probabilities Hence Therefore the derivative of the pseudo-likelihood has zero mean when evaluated at the true parameters. Evaluated at the true parameters the estimating function (2) takes the following form Covariance matrix is then Define The covariance matrix can then be represented in the form Σ = Σ1-Σ2−Λ. Define and , then I = I1−I2. We note that and . Hence Σ = I1−I2−Λ = I−Λ.

β4 ≠ 0.

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 controls and n1 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4) = 0.06. (DOCX) Click here for additional data file.

β = 0.

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 controls and n1 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. (DOCX) Click here for additional data file.

Frequency of the nuisance disease is underestimated.

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. The clinical-pathophysiological diagnoses relationship is misspecified to be pr(D = 1*|D = 1,ε4−) = 0.30 and pr(D = 1*|D = 1,ε4+) = 0. (DOCX) Click here for additional data file.

Frequency of the nuisance disease is overestimated.

Bias and Root Mean Squared Error (RMSE) for parameter estimates based on a study of 500 simulated datasets with n0 controls and n1 cases with clinical phenotype. Analyses are based on the usual logistic regression model that ignores nuisance disease and based on pseudolikelihood with (pMLE-DX) and without the consideration of clinical-pathological diagnoses relationship (pMLE). Frequency of ApoE ε4 alleles is 14% in the population. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1′|D = 1,ε4−) = 0.36 and pr(D = 1′|D = 1,ε4+) = 0.06. The clinical-pathological diagnoses relationship is misspecified to be pr(D = 1′|D = 1,ε4−) = 0.42 and pr(D = 1′|D = 1,ε4+) = 0.12. (DOCX) Click here for additional data file.

and .

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as t he outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 controls and n1 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4−) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. (DOCX) Click here for additional data file.

β = 0, .

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 controls and n1 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by . Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ApoE4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4−) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06. (DOCX) Click here for additional data file.

Parameter estimates in Alzheimer’s disease study.

Analyses are performed using the usual logistic regression (uLR) that uses the clinical diagnosis as an outcome and using pseudo-likelihood method that assumes that the proportion of nuisance disease within the clinically diagnosed AD is 36% for ε4 carriers and is 6% for ε4 non-carriers. Pseudo-likelihood analyses pMLE-DX estimates parameters for D = 1 vs. D = 0 and D = 1* combined. Pseudo-likelihood analyses pMLE − DX*, however, estimate two sets of risk coefficients, i.e. βs for D = 0 vs. D = 1 and β*s D = 0 vs. D = 1*. (DOCX) Click here for additional data file.

SNPs previously reported in GWAS that are within 500k up- or downstream of SNPs that we inferred in Alzheimer’s disease study.

(Table 5, SNPs whose effect estimates of β and/or β are with permutation-based p-value <0.05).
Table 5

Parameter estimates in Alzheimer’s disease study.

SNPGene/Intergenic RegionMethodβ^Ageβ^Sexβ^ε4β^Gβ^G×ε4
rs2033831KIAA0922 | TLR2uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.6, p = 0.13-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.11-0.43, p = 0.000.86, p = 0.00-2.6, p = 0.321.9, p = 0.25-0.30, p = 0.09-0.87, p = 0.334.2, p = 0.280.58, p = 0.033.7, p = 0.800.22, p = 0.28
rs7656500KIAA0922 | TLR2uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.2, p = 0.25-0.25, p = 0.00-0.42, p = 0.00-1.8, p = 0.27-0.43, p = 0.001.6, p = 0.010.12, p = 0.57-2.3, p = 0.270.79, p = 0.016.0, p = 0.025.0, p = 0.13-0.16, p = 0.430.73, p = 0.274.3, p = 0.09
rs1816702TLR2uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.13-0.24, p = 0.00-0.43, p = 0.00-2.1, p = 0.10-0.43, p = 0.001.4, p = 0.00-0.18, p = 0.502.6, p = 0.030.43, p = 0.065.0, p = 0.045.0, p = 0.380.01, p = 0.491.1, p = 0.65-0.46, p = 0.13
rs830832SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.0, p = 0.16-0.24, p = 0.00-0.42, p = 0.00-1.6, p = 0.10-0.43, p = 0.000.74, p = 0.01-0.64, p = 0.42-1.4, p = 0.07-0.31, p = 0.06-0.55, p = 0.354.4, p = 0.210.74, p = 0.011.8, p = 0.702.6, p = 0.03
rs7676342SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.3, p = 0.19-0.24, p = 0.00-0.43, p = 0.00-2.2, p = 0.15-0.43, p = 0.001.6, p = 0.000.94, p = 0.721.5, p = 0.030.35, p = 0.040.88, p = 0.754.3, p = 0.66-0.24, p = 0.21-0.02, p = 0.530.23, p = 0.69
rs4862611SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.13-0.25, p = 0.00-0.43, p = 0.00-2.9, p = 0.09-0.43, p = 0.001.4, p = 0.000.04, p = 0.561.7, p = 0.080.08, p = 0.24-0.36, p = 0.352.2, p = 0.150.03, p = 0.551.2, p = 0.32-2.8, p = 0.03
rs13113778SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.16-0.24, p = 0.00-0.43, p = 0.00-4.9, p = 0.05-0.42, p = 0.002.0, p = 0.004.1, p = 0.10-0.75, p = 0.490.13, p = 0.62-2.1, p = 0.265.0,p = 0.36-0.64, p = 0.14-3.1, p = 0.252.7, p = 0.06
rs1869617SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-2.7, p = 0.09-0.43, p = 0.000.96, p = 0.02-3.1, p = 0.322.2, p = 0.00-0.33, p = 0.15-0.53, p = 0.394.6, p = 0.010.46, p = 0.864.0, p = 0.110.18, p = 0.51
rs11938703SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.002-0.78, p = 0.15-0.25, p = 0.00-0.42, p = 0.00-1.2, p = 0.13-0.42, p = 0.000.95, p = 0.000.09, p = 0.571.7, p = 0.20-0.24, p = 0.05-0.50, p = 0.282.8, p = 0.280.58, p = 0.001.2, p = 0.67-0.57, p = 0.35
rs1519318SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.14-0.25, p = 0.00-0.43, p = 0.00-2.6, p = 0.10-0.43, p = 0.001.2, p = 0.000.86, p = 0.702.1, p = 0.08-0.01,p = 0.510.04, p = 0.593.3, p = 0.010.21, p = 0.160.07, p = 0.44-0.41, p = 0.39
rs12648771SORBS2 | TLR3uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.6, p = 0.15-0.24, p = 0.00-0.42, p = 0.00-2.0, p = 0.13-0.43, p = 0.003.0, p = 0.002.9, p = 0.150.62, p = 0.06-0.27, p = 0.26-1.6, p = 0.265.0, p = 0.24-1.6, p = 0.004-2.1, p = 0.380.02, p = 0.31
rs6894NQO1 | LOC100132364uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.12-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.09-0.42, p = 0.000.59, p = 0.070.42, p = 0.642.8, p = 0.00-0.64, p = 0.03-0.20, p = 0.463.9, p = 0.000.84, p = 0.030.49, p = 0.590.84, p = 0.16
rs3775296TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.11-0.42, p = 0.002.0, p = 0.003.4, p = 0.141.4, p = 0.02-0.02, p = 0.500.81, p = 0.274.3, p = 0.046-0.61, p = 0.06-2.5, p = 0.330.89, p = 0.12
rs7668666TLR3uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.13-0.24, p = 0.00-0.43, p = 0.00-4.9, p = 0.018-0.42, p = 0.001.4, p = 0.003.4, p = 0.141.7, p = 0.02-0.14, p = 0.251.7, p = 0.194.9, p = 0.000.01, p = 0.52-2.6, p = 0.27-0.81, p = 0.19
rs1706143TLR3 | FAM149AuLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.11-0.42, p = 0.001.3, p = 0.000.33, p = 0.370.45, p = 0.67-0.12, p = 0.20-0.39, p = 0.323.8, p = 0.260.12, p = 0.260.66, p = 0.332.9, p = 0.03
rs9299251ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, 0.00-1.3, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-2.1, p = 0.14-0.42, p = 0.001.1, p = 0.000.00, p = 0.630.48, p = 0.18-0.04, p = 0.650.19, p = 0.653.3, p = 0.450.35, p = 0.041.1, p = 0.320.47, p = 0.18
rs955302TNFRSF19uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.13-0.25, p = 0.00-0.43, p = 0.00-2.5, p = 0.10-0.43, p = 0.001.2, p = 0.001.2, p = 0.342.2, p = 0.02-0.28, p = 0.06-0.23, p = 0.403.9, p = 0.300.27, p = 0.13-0.37, p = 0.44-1.2, p = 0.16
rs17419570ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-0.91, p = 0.23-0.24, p = 0.00-0.44, p = 0.00-1.5, p = 0.21-0.42, p = 0.000.68, p = 0.06-0.93, p = 0.501.2, p = 0.44-0.94, p = 0.01-1.8, p = 0.324.5, p = 0.0080.74, p = 0.061.9, p = 0.21.6, p = 0.064
rs16905625ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.19-0.25, p = 0.00-0.43, p = 0.00-3.3, p = 0.13-0.43, p = 0.001.3, p = 0.013.3, p = 0.120.92, p = 0.110.04, p = 0.592.3, p = 0.153.2, p = 0.030.16, p = 0.69-2.5, p = 0.28-0.33,p = 0.32
rs10513307ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.15-0.24, p = 0.00-0.43, p = 0.00-5.0, p = 0.03-0.43,p = 0.001.7, p = 0.013.9, p = 0.11-1.3, p = 0.330.06, p = 0.56-0.61, p = 0.415.0, p = 0.02-0.28, p = 0.31-3.0, p = 0.322.5, p = 0.09
rs1890047ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.002-1.3, p = 0.11-0.25, p = 0.00-0.43, p = 0.00-2.2, p = 0.10-0.43, p = 0.001.1, p = 0.00-0.77, p = 0.45-0.17, p = 0.44-0.06, p = 0.34-0.08, p = 0.444.6, p = 0.0080.39,p = 0.021.9, p = 0.222.0, p = 0.064
rs4837254ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.15-0.24, p = 0.00-0.43, p = 0.00-5.0, p = 0.03-0.42, p = 0.001.2, p = 0.003.9, p = 0.112.0, p = 0.04-0.03, p = 0.43-0.61, p = 0.413.4, p = 0.650.27, p = 0.08-3.0, p = 0.32-0.90, p = 0.25
rs13285674ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.002-1.6, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-3.1, p = 0.08-0.43, p = 0.001.0, p = 0.000.31, p = 0.622.8, p = 0.00-0.47, p = 0.003-1.3, p = 0.294.5, p = 0.030.42, p = 0.100.63, p = 0.40-0.65, p = 0.24
rs1337208ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.2, p = 0.12-0.25, p = 0.00-0.43, p = 0.00-2.0, p = 0.10-0.42, p = 0.001.1, p = 0.00-0.03, p = 0.550.96, p = 0.79-0.009, p = 0.490.15, p = 0.622.9, p = 0.150.34, p = 0.051.1, p = 0.310.46, p = 0.71
rs1415378ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.14-0.24, p = 0.00-0.43, p = 0.00-2.8, p = 0.10-0.42, p = 0.001.4, p = 0.001.5, p = 0.200.50, p = 0.280.05, p = 0.320.98, p = 0.212.1, p = 0.060.05, p = 0.54-0.62, p = 0.371.4, p = 0.08
Rs504204ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.002-1.5, p = 0.16-0.24, p = 0.00-0.43, p = 0.00-2.9, p = 0.13-0.43, p = 0.000.21, p = 0.63-4.0, p = 0.27-3.9 p = 0.160.11, p = 0.54-2.5, p = 0.485.0, p = 0.111.2, p = 0.175.0, p = 0.0444.6, p = 0.06
rs12337381ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.14-0.25, p = 0.00-0.43, p = 0.00-2.5, p = 0.10-0.43, p = 0.000.84, p = 0.04-0.63, p = 0.503.0, p = 0.010.14, p = 0.66-0.60, p = 0.404.6, p = 0.080.59, p = 0.111.6, p = 0.30-1.5, p = 0.09
rs1952464ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.15-0.25, p = 0.00-0.43, p = 0.00-2.3, p = 0.10-0.43, p = 0.000.98, p = 0.00-0.08, p = 0.562.9, p = 0.002-0.07, p = 0.340.62, p = 0.334.6, p = 0.080.50, p = 0.011.1, p = 0.67-1.5, p = 0.09
rs12342331ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.13-0.25, p = 0.00-0.43, p = 0.00-2.5, p = 0.11-0.43, p = 0.000.75, p = 0.08-0.80, p = 0.501.4, p = 0.010.07, p = 0.59-0.59, p = 0.385.0, p = 0.030.68, p = 0.081.8, p = 0.29-0.35, p = 0.09
rs16905754ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.05-0.24, p = 0.00-0.43, p = 0.00-2.7, p = 0.06-0.43, p = 0.001.1, p = 0.015.0, p = 0.070.11?, p = 0.000.52, p = 0.33-0.99, p = 0.415.0, p = 0.66-1.1, p = 0.002-4.1, p = 0.210.19, p = 0.08
rs2771054ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.22-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.20-0.43, p = 0.002.9, p = 0.015.0, p = 0.04-0.19, p = 0.101.5, p = 0.014.9, p = 0.045.0, p = 0.15-1.5, p = 0.016-4.1, p = 0.191.7, p = 0.66
rs521581ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.002-1.7, p = 0.11-0.25, p = 0.00-0.43, p = 0.00-3.0, p = 0.08-0.43, p = 0.001.1, p = 0.000.35, p = 0.581.5, p = 0.10-0.02, p = 0.430.82,p = 0.333.5, p = 0.620.34, p = 0.040.61, p = 0.38-0.36, p = 0.32
rs1329063ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.002-1.5, p = 0.19-0.24, p = 0.00-0.43, p = 0.00-2.8, p = 0.15-0.43, p = 0.001.6, p = 0.000.92, p = 0.34-2.0, p = 0.170.65, p = 0.02-0.10, p = 0.545.0, p = 0.58-0.23, p = 0.33-0.01, p = 0.573.3, p = 0.09
rs495083ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.11-0.24, p = 0.00-0.42, p = 0.00-3.3, p = 0.08-0.42, p = 0.001.2, p = 0.011.5, p = 0.254.7, p = 0.00-0.19, p = 0.100.37, p = 0.284.6, p = 0.100.24, p = 0.10-0.72, p = 0.33-2.8, p = 0.04
rs476ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.13-0.24, p = 0.00-0.43, p = 0.00-3.0, p = 0.07-0.43, p = 0.003.0, p = 0.002.3, p = 0.200.85, p = 0.124.9, p = 0.031.1, p = 0.193.2, p = 0.36-1.3, p = 0.11-1.6, p = 0.290.50, p = 0.64
rs565055ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.3, p = 0.16-0.25, p = 0.002-0.43, p = 0.00-2.1, p = 0.11-0.42, p = 0.001.1, p = 0.000.81, p = 0.671.0, p = 0.21-0.01, p = 0.470.15, p = 0.650.84, p = 0.590.37, p = 0.010.14,p = 0.572.1, p = 0.33
rs2094630ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.3, p = 0.15-0.25, p = 0.00-0.43, p = 0.00-2.1, p = 0.11-0.43, p = 0.001.1, p = 0.000.76, p = 0.330.30, p = 0.05-0.2, p = 0.000.09, p = 0.571.7, p = 0.440.38, p = 0.020.20, p = 0.570.32, p = 0.35
rs10983712ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.16-0.25, p = 0.00-0.43, p = 0.00-4.8, p = 0.03-0.43, p = 0.001.4, p = 0.001.8, p = 0.271.3, p = 0.100.02, p = 0.590.09, p = 0.623.0, p = 0.36-0.04, p = 0.44-1.0, p = 0.390.009, p = 0.49
rs10983736ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.14-0.25, p = 0.00-0.43, p = 0.00-2.8, p = 0.09-0.43, p = 0.001.1, p = 0.01-0.00, p = 0.672.8, p = 0.00-0.24, p = 0.27-0.67, p = 0.345.0, p = 0.0380.26, p = 0.260.95, p = 0.650.05, p = 0.50
rs16905962ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.23-0.24, p = 0.00-0.43, p = 0.00-2.5, p = 0.15-0.43, p = 0.00-0.005, p = 0.56-0.51, p = 0.51-1.5, p = 0.06-0.14, p = 0.485.1, p = 0.035.0, p = 0.571.4,p = 0.151.4, p = 0.723.1, p = 0.49
Rs1927914ASTN2 | TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.3, p = 0.15-0.25, p = 0.00-0.43, p = 0.00-2.1, p = 0.14-0.43, p = 0.001.4, p = 0.001.2, p = 0.251.2, p = 0.120.17, p = 0.120.65, p = 0.763, p = 0.330.04, p = 0.58-0.29, p = 0.471.2, p = 0.08
rs11536879TLR4uLRpMLE-DXpMLE-DX*-0.25, p = 0.00-1.5, p = 0.07-0.24, p = 0.00-0.43, p = 0.00-3, p = 0.056-0.43, p = 0.000.63, p = 0.25-2.9, p = 0.37-0.28, p = 0.04-0.56, p = 0.30-3.6, p = 0.365, p = 0.190.78, p = 0.273.8, p = 0.250.85, p = 0.19
rs4986790TLR4uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.08-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.06-0.43, p = 0.003.5, p = 0.024.9, p = 0.010.84, p = 0.000.14, p = 0.36-0.55, p = 0.485.0, p = 0.08-3.3, p = 0.02-4.1, p = 0.20-1.3, p = 0.26
rs7045953TLR4 | LOC100129489uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.4, p = 0.10-0.24, p = 0.00-0.43, p = 0.00-2.4, p = 0.09-0.43, p = 0.000.96, p = 0.01-0.19, p = 0.542.6, p = 0.00-0.02, p = 0.461.6, p = 0.194.8, p = 0.470.47, p = 0.111.1, p = 0.670.06, p = 0.44
rs7357627TLR4 | LOC100129489uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.11-0.24, p = 0.00-0.43, p = 0.00-3.3, p = 0.10-0.41, p = 0.001.3, p = 0.001.1, p = 0.280.51, p = 0.67-0.02, p = 0.43-0.31, p = 0.382.3, p = 0.090.07, p = 0.63-0.24, p = 0.431.7, p = 0.08
rs7046020TLR4 | LOC100129489uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.7, p = 0.14-0.25, p = 0.00-0.43, p = 0.00-2.8, p = 0.11-0.42, p = 0.001.5, p = 0.001.7, p = 0.75-4.2, p = 0.000.08, p = 0.691.1, p = 0.244.5, p = 0.10-0.11, p = 0.33-0.91, 0.364.6, p = 0.01
rs1927937TLR4 | LOC100129489uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.11-0.25, p = 0.00-0.43, p = 0.00-2.7, p = 0.09-0.42, p = 0.001.3, p = 0.000.85, p = 0.591.3, p = 0.07-0.02, p = 0.450.25, p = 0.664.3, p = 0.180.08, p = 0.630.05, p = 0.50-0.51, p = 0.20
rs1927924TLR4 | LOC100129489uLRpMLE-DXpMLE-DX*-0.24, p = 0.00-1.5, p = 0.23-0.24, p = 0.00-0.43, p = 0.00-2.9, p = 0.21-0.43, p = 0.001.6, p = 0.004.9, p = 0.0010.15, p = 0.610.25, p = 0.184.3, p = 0.055.0, p = 0.25-0.20, p = 0.28-4.2, p = 0.171.6, p = 0.04
rs3860141TLR4 | LOC100129489uLRpMLE-DX pMLE-DX*-0.24, p = 0.00-1.2, p = 0.16-0.24, p = 0.00-0.42, p = 0.00-2.2, p = 0.11-0.42, p = 0.001.1, p = 0.000.01, p = 0.551.4, p = 0.17-0.06, p = 0.33-0.35, p = 0.353.6, p = 0.530.37, p = 0.021.1, p = 0.69-0.57, p = 0.46
rs1877876TLR4 | LOC100129489uLRpMLE-DX pMLE-DX*-0.24, p = 0.00-1.5, p = 0.10-0.24, p = 0.00-0.43, p = 0.00-2.3, p = 0.08-0.42, p = 0.001.7, p = 0.002.3, p = 0.30-1.2, p = 0.070.20, p = 0.111.1, p = 0.223.9, p = 0.56-0.31, p = 0.09-1.5, p = 0.322.3, p = 0.02
rs497322TLR4 | LOC100129489uLRpMLE-DX pMLE-DX*-0.25, p = 0.00-1.4, p = 0.23-0.24, p = 0.00-0.43, p = 0.00-1.9, p = 0.20-0.43, p = 0.001.7, p = 0.011.8, p = 0.260.24, p = 0.030.69, p = 0.014.5, p = 0.045.0, p = 0.05-0.28, p = 0.28-1, p = 0.421.7, p = 0.47
rs6478330TLR4 | LOC100129489uLRpMLE-DX pMLE-DX*-0.24, p = 0.00-1.5, p = 0.15-0.25, p = 0.00-0.43, p = 0.00-2.6, p = 0.13-0.43, p = 0.001.2, p = 0.03-0.008, p = 0.62-2.8, p = 0.0480.36, p = 0.210.73, p = 0.245.0, p = 0.140.16, p = 0.280.93, p = 0.335.5, p = 0.01
rs7856175TLR4 | LOC100129489uLRpMLE-DX pMLE-DX*-0.24, p = 0.00-1.4, p = 0.13-0.25, p = 0.00-0.43, p = 0.00-2.3, p = 0.11-0.43, p = 0.001.7, p = 0.001.6, p = 0.241.5, p = 0.030.07, p = 0.300.43, p = 0.282.7, p = 0.61-0.33, p = 0.03-0.97, p = 0.36-0.08, p = 0.28
rs3134940AGERuLRpMLE-DXpMLE-DX*-0.97, p = 0.00-0.74, p = 0.13-0.96, p = 0.11-0.12, p = 0.08-0.52, p = 0.14-1.2, p = 0.101.1, p = 0.001.2, p = 0.401.4, p = 0.090.55, p = 0.001.2, p = 0.181.8, p = 0.13-0.28, p = 0.05-0.67, p = 0.33-0.32, p = 0.50
rs1035798AGERuLRpMLE-DXpMLE-DX*-0.97, p = 0.00-1.42, p = 0.09-0.97, p = 0.11-0.13, p = 0.06-0.29, p = 0.18-0.60, p = 0.150.89, p = 0.000.96, p = 0.561.3, p = 0.080.43, p = 0.000.67, p = 0.211.8, p = 0.130.03, p = 0.85-0.06, p = 0.41-1.9, p = 0.08
rs2070600AGERuLRpMLE-DXpMLE-DX*-0.97, p = 0.00-2.3, p = 0.08-0.97, p = 0.11-0.12, p = 0.09-0.50, p = 0.18-0.62, p = 0.130.99, p = 0.000.97, p = 0.591.5, p = 0.050.49, p = 0.000.99, p = 0.221.8, p = 0.13-0.14, p = 0.33-0.24, p = 0.17-0.23, p = 0.51

Analyses are performed using the usual logistic regression (uLR) that uses the clinical diagnosis as an outcome and using pseudo-likelihood method that assumes that the proportion of nuisance disease within the clinically diagnosed AD is 36% for ε4 non-carriers and is 6% for ε4 carriers. Pseudo-likelihood analyses pMLE-DX estimates parameters for D = 1 vs. D = 0 and D = 1* combined. Pseudo-likelihood analyses pMLE – DX*, however, estimate two sets of risk coefficients, i.e. βs for D = 0 vs. D = 1 and β*s D = 0 vs. D = 1*. Estimates of β*s are reported in .

(DOCX) Click here for additional data file.
Table 1

Bias and RMSE in parameter estimates when β ≠ 0.

ParametersTrue valueClinical disease status is the outcomeWith consideration of clinical-pathological diagnoses relationship
Usual logisticregressionPseudo-likelihood method (pMLE)Pseudo-likelihood method(pMLE-DX)
BiasRMSEBiasRMSEBiasRMSE
n0 = 3,000 and n1 = 3,000
β0-10.460.460.980.98-0.00020.07
βG0.406-0.130.16-0.130.16-0.0080.13
βZ11.098-0.350.35-0.350.350.0030.08
βZ2-0.0830.020.060.020.06-0.0040.08
βε42.079-0.310.33-0.310.330.0050.12
βG×ε40.6930.562.40.260.910.220.93
Pr(G = 1)0.10-0.00040.0040.020.02

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06.

Table 2

Bias and RMSE in parameter estimates when β = 0.

ParametersTrue valueClinical disease status is the outcomeWith consideration of clinical-pathological relationship
Usual logisticregressionPseudo-likelihood methodPseudo-likelihood method
BiasRMSEBiasRMSEBiasRMSE
n0 = 3,000 and n1 = 3,000
β0-10.450.450.930.93-0.00040.07
βG1.099-0.120.15-0.07-0.150.0020.13
βZ11.098-0.330.34-0.330.340.0010.08
βZ2-0.0830.020.060.020.06-0.0030.08
βε42.079-0.260.28-0.260.280.0070.12
βG×ε400.120.410.130.410.040.43
Pr(G = 1)0.10-0.0000.0040.030.03

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequency of the true disease status is 46% in the population; and is 40% among the subpopulation with no ApoE ε4 alleles, and 82% in the subpopulation with at least one ApoE ε4 alleles. Frequency of nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06.

Table 3

Bias and RMSE in parameter estimates when = 0 and = 0.

ParametersTrue valueClinical disease status is the outcomeWith consideration of clinical-pathological diagnoses relationship
Usual logisticregressionPseudo-likelihood method(pMLE)Pseudo-likelihood method(pMLE-DX)
BiasRMSEBiasRMSEBiasRMSE
n0 = 3,000 and n1 = 3,000
β0-10.970.970.740.740.020.06
β0*-1.70.0080.05
βG-0.690.300.31-0.390.390.0050.10
βG*0-0.020.14
βZ10.100.0020.310.0040.050.0020.05
βZ2-0.083-0.0040.05-0.00080.05-0.0040.05
βε41.3-0.220.24-0.210.23-0.0060.10
βε4*0.5-0.0070.05
βG×ε40.10-0.130.29-0.280.360.010.25
βG×ε4*00.0010.11
Pr(G = 1)0.100.050.050.00010.004

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4+) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06.

Table 4

Bias and RMSE in parameter estimates when = 0, β = 0 and = 0.

ParametersTrue valueClinical disease is the outcomeWith consideration of clinical-pathological diagnoses relationship
Usual logisticregressionPseudo-likelihood method(pMLE)Pseudo-likelihood method(pMLE-DX)
BiasRMSEBiasRMSEBiasRMSE
n0 = 3,000 and n1 = 3,000
β0-10.970.970.750.750.030.06
β0*-1.70.010.05
βG-0.690.300.31-0.380.390.0040.09
βG*0-0.010.13
βZ10.100.0020.050.0010.090.0020.05
βZ2-0.083-0.0040.05-0.0030.05-0.0040.05
βε41.3-0.220.24-0.220.23-0.0060.10
βε4*0.5-0.0090.06
βG×ε40-0.230.28-0.230.280.010.25
βG×ε4*0-0.00080.12
Pr(G = 1)0.100.0000.004

The Bias and Root Mean Squared Error (RMSE) in parameter estimates from simulations using the usual logistic regression with clinical diagnosis as the outcome (uLR), the pseudo-likelihood approach (pMLE), and our newly proposed pseudo-likelihood approach that accounts for misdiagnosis (pMLE-DX). For these simulations, the study included n0 = 3000 controls and n1 = 3000 cases. Risk of the disease of interest is defined in a set of parameters ; while the risk of the nuisance disease is parametrized by Frequency of ApoE ε4 allele in the population is 14%. Variables Z1 and Z2 are Bernoulli with frequencies 0.50 and 0.52, respectively. Frequencies of the disease of interest and the nuisance disease are pr(D = 1) = 24.8%, pr(D = 1*) = 12.5%, pr(D = 1|ε4+) = 43%, pr(D = 1*|ε4+) = 16.1%, pr(D = 1|ε4−) = 20%, pr(D = 1*|ε4−) = 11.6%. Frequency of the nuisance disease within the clinical diagnosis varies by ApoE4 status pr(D = 1*|D = 1,ε4−) = 0.36 and pr(D = 1*|D = 1,ε4+) = 0.06.

  22 in total

Review 1.  Dysregulation of human Toll-like receptor function in aging.

Authors:  Albert C Shaw; Alexander Panda; Samit R Joshi; Feng Qian; Heather G Allore; Ruth R Montgomery
Journal:  Ageing Res Rev       Date:  2010-11-10       Impact factor: 10.895

2.  Genome-wide association analysis of age-at-onset in Alzheimer's disease.

Authors:  M I Kamboh; M M Barmada; F Y Demirci; R L Minster; M M Carrasquillo; V S Pankratz; S G Younkin; A J Saykin; R A Sweet; E Feingold; S T DeKosky; O L Lopez
Journal:  Mol Psychiatry       Date:  2011-10-18       Impact factor: 15.992

3.  MENDELIAN PROPORTIONS IN A MIXED POPULATION.

Authors:  G H Hardy
Journal:  Science       Date:  1908-07-10       Impact factor: 47.728

4.  A genome-wide association study for venous thromboembolism: the extended cohorts for heart and aging research in genomic epidemiology (CHARGE) consortium.

Authors:  Weihong Tang; Martina Teichert; Daniel I Chasman; John A Heit; Pierre-Emmanuel Morange; Guo Li; Bruno H Ch Stricker; Paul M Ridker; Aaron R Folsom; Nicholas L Smith; Nathan Pankratz; Frank W Leebeek; Guillaume Paré; Mariza de Andrade; Christophe Tzourio; Bruce M Psaty; Saonli Basu; Rikje Ruiter; Lynda Rose; Sebastian M Armasu; Thomas Lumley; Susan R Heckbert; André G Uitterlinden; Mark Lathrop; Kenneth M Rice; Mary Cushman; Albert Hofman; Jean-Charles Lambert; Nicole L Glazer; James S Pankow; Jacqueline C Witteman; Philippe Amouyel; Joshua C Bis; Edwin G Bovill; Xiaoxiao Kong; Russell P Tracy; Eric Boerwinkle; Jerome I Rotter; David-Alexandre Trégouët; Daan W Loth
Journal:  Genet Epidemiol       Date:  2013-05-05       Impact factor: 2.135

Review 5.  Advanced glycation end products and RAGE: a common thread in aging, diabetes, neurodegeneration, and inflammation.

Authors:  Ravichandran Ramasamy; Susan J Vannucci; Shirley Shi Du Yan; Kevan Herold; Shi Fang Yan; Ann Marie Schmidt
Journal:  Glycobiology       Date:  2005-03-10       Impact factor: 4.313

6.  Genome-wide association study of schizophrenia in Ashkenazi Jews.

Authors:  Fernando S Goes; John McGrath; Dimitrios Avramopoulos; Paula Wolyniec; Mehdi Pirooznia; Ingo Ruczinski; Gerald Nestadt; Eimear E Kenny; Vladimir Vacic; Inga Peters; Todd Lencz; Ariel Darvasi; Jennifer G Mulle; Stephen T Warren; Ann E Pulver
Journal:  Am J Med Genet B Neuropsychiatr Genet       Date:  2015-07-21       Impact factor: 3.568

Review 7.  A comprehensive review of genetic association studies.

Authors:  Joel N Hirschhorn; Kirk Lohmueller; Edward Byrne; Kurt Hirschhorn
Journal:  Genet Med       Date:  2002 Mar-Apr       Impact factor: 8.822

8.  Common variants at MS4A4/MS4A6E, CD2AP, CD33 and EPHA1 are associated with late-onset Alzheimer's disease.

Authors:  Adam C Naj; Gyungah Jun; Gary W Beecham; Li-San Wang; Badri Narayan Vardarajan; Jacqueline Buros; Paul J Gallins; Joseph D Buxbaum; Gail P Jarvik; Paul K Crane; Eric B Larson; Thomas D Bird; Bradley F Boeve; Neill R Graff-Radford; Philip L De Jager; Denis Evans; Julie A Schneider; Minerva M Carrasquillo; Nilufer Ertekin-Taner; Steven G Younkin; Carlos Cruchaga; John S K Kauwe; Petra Nowotny; Patricia Kramer; John Hardy; Matthew J Huentelman; Amanda J Myers; Michael M Barmada; F Yesim Demirci; Clinton T Baldwin; Robert C Green; Ekaterina Rogaeva; Peter St George-Hyslop; Steven E Arnold; Robert Barber; Thomas Beach; Eileen H Bigio; James D Bowen; Adam Boxer; James R Burke; Nigel J Cairns; Chris S Carlson; Regina M Carney; Steven L Carroll; Helena C Chui; David G Clark; Jason Corneveaux; Carl W Cotman; Jeffrey L Cummings; Charles DeCarli; Steven T DeKosky; Ramon Diaz-Arrastia; Malcolm Dick; Dennis W Dickson; William G Ellis; Kelley M Faber; Kenneth B Fallon; Martin R Farlow; Steven Ferris; Matthew P Frosch; Douglas R Galasko; Mary Ganguli; Marla Gearing; Daniel H Geschwind; Bernardino Ghetti; John R Gilbert; Sid Gilman; Bruno Giordani; Jonathan D Glass; John H Growdon; Ronald L Hamilton; Lindy E Harrell; Elizabeth Head; Lawrence S Honig; Christine M Hulette; Bradley T Hyman; Gregory A Jicha; Lee-Way Jin; Nancy Johnson; Jason Karlawish; Anna Karydas; Jeffrey A Kaye; Ronald Kim; Edward H Koo; Neil W Kowall; James J Lah; Allan I Levey; Andrew P Lieberman; Oscar L Lopez; Wendy J Mack; Daniel C Marson; Frank Martiniuk; Deborah C Mash; Eliezer Masliah; Wayne C McCormick; Susan M McCurry; Andrew N McDavid; Ann C McKee; Marsel Mesulam; Bruce L Miller; Carol A Miller; Joshua W Miller; Joseph E Parisi; Daniel P Perl; Elaine Peskind; Ronald C Petersen; Wayne W Poon; Joseph F Quinn; Ruchita A Rajbhandary; Murray Raskind; Barry Reisberg; John M Ringman; Erik D Roberson; Roger N Rosenberg; Mary Sano; Lon S Schneider; William Seeley; Michael L Shelanski; Michael A Slifer; Charles D Smith; Joshua A Sonnen; Salvatore Spina; Robert A Stern; Rudolph E Tanzi; John Q Trojanowski; Juan C Troncoso; Vivianna M Van Deerlin; Harry V Vinters; Jean Paul Vonsattel; Sandra Weintraub; Kathleen A Welsh-Bohmer; Jennifer Williamson; Randall L Woltjer; Laura B Cantwell; Beth A Dombroski; Duane Beekly; Kathryn L Lunetta; Eden R Martin; M Ilyas Kamboh; Andrew J Saykin; Eric M Reiman; David A Bennett; John C Morris; Thomas J Montine; Alison M Goate; Deborah Blacker; Debby W Tsuang; Hakon Hakonarson; Walter A Kukull; Tatiana M Foroud; Jonathan L Haines; Richard Mayeux; Margaret A Pericak-Vance; Lindsay A Farrer; Gerard D Schellenberg
Journal:  Nat Genet       Date:  2011-04-03       Impact factor: 38.330

9.  PROX1 gene CC genotype as a major determinant of early onset of type 2 diabetes in slavic study participants from Action in Diabetes and Vascular Disease: Preterax and Diamicron MR Controlled Evaluation study.

Authors:  Pavel Hamet; Mounsif Haloui; François Harvey; François-Christophe Marois-Blanchet; Marie-Pierre Sylvestre; Muhammad-Ramzan Tahir; Paul H G Simon; Beatriz Sonja Kanzki; John Raelson; Carole Long; John Chalmers; Mark Woodward; Michel Marre; Stephen Harrap; Johanne Tremblay
Journal:  J Hypertens       Date:  2017-05       Impact factor: 4.844

10.  The impact of phenotypic and genetic heterogeneity on results of genome wide association studies of complex diseases.

Authors:  Mirko Manchia; Jeffrey Cullis; Gustavo Turecki; Guy A Rouleau; Rudolf Uher; Martin Alda
Journal:  PLoS One       Date:  2013-10-11       Impact factor: 3.240

View more
  2 in total

1.  A simple approximation to bias in the genetic effect estimates when multiple disease states share a clinical diagnosis.

Authors:  Iryna Lobach; Inyoung Kim; Alexander Alekseyenko; Siarhei Lobach; Li Zhang
Journal:  Genet Epidemiol       Date:  2019-03-19       Impact factor: 2.135

2.  A Simple Approximation to Bias in Gene-Environment Interaction Estimates When a Case Might Not Be the Case.

Authors:  Iryna Lobach; Inyoung Kim; Alexander Alekseyenko; Siarhei Lobach; Li Zhang
Journal:  Front Genet       Date:  2019-10-09       Impact factor: 4.599

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.