Literature DB >> 31167832

A Powerful Method To Test Associations Between Ordinal Traits and Genotypes.

Jinjuan Wang1,2, Juan Ding3, Shouyou Huang4, Qizhai Li5, Dongdong Pan6,7.   

Abstract

The methods commonly used to test the associations between ordinal phenotypes and genotypes often treat either the ordinal phenotype or the genotype as continuous variables. To address limitations of these approaches, we propose a model where both the ordinal phenotype and the genotype are viewed as manifestations of an underlying multivariate normal random variable. The proposed method allows modeling the ordinal phenotype, the genotype and covariates jointly. We employ the generalized estimating equation technique and M-estimation theory to estimate the model parameters and deduce the corresponding asymptotic distribution. Numerical simulations and real data applications are also conducted to compare the performance of the proposed method with those of methods based on the logit and probit models. Even though there may be potential limitations in Type I error rate control for our method, the gains in power can prove its practical value in case of exactly ordinal phenotypes.
Copyright © 2019 Wang et al.

Entities:  

Keywords:  M-estimation; association study; generalized estimating equation; latent normal variate; ordinal phenotype

Mesh:

Substances:

Year:  2019        PMID: 31167832      PMCID: PMC6686925          DOI: 10.1534/g3.119.400293

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Research in the field of genetic epidemiology suggests that some genetic variants play important roles in the etiology of human diseases. On the one hand, genetic variants are defined as genotypes, which are often treated as ordinal variables. On the other hand, there are multiple data types for diseases, i.e., phenotypes, which can be continuous, binary, or ordinal (Li ; Kim ). Note that both binary and ordinal variables are categorical variables, but the latter can describe the disease state of a patient more precisely in many circumstances. For example, four levels—normal liver, light steatosis, moderate steatosis, and severe steatosis—have been utilized to describe the severity of liver steatosis (Bedogni ). With the development of high throughput biologic technology, increasingly more genotypes and data with complex traits have been generated and deposited in public databases. It is urgently required to develop new statistical testing methods to investigate the associations between these and extract useful information to understand the underlying occurrence and development mechanisms of diseases and traits. Genome-wide association studies aim to identify associations between phenotypes and genotypes. In these studies, genotypes are often treated as predictors and phenotypes as outcomes. If the phenotype of interest is continuous, then the classic linear regression model is commonly employed. When the phenotype is ordinal, the multinomial logit model (McCullagh 1980; Zhang ) or ordered probit model (Daykin and Moffatt 2002; Wang 2014) should be recommended. All these models regress phenotype values or their distribution-based transformations on genotypes, with the assumptions that genotype values are continuous (Korse ; Bedogni ) and the probability of having a disease increases linearly with the genotype value. However, the continuity assumption on genotype values and the linearity assumption between a phenotype and genotype are difficult to verify in practice. If these two assumptions are violated, the corresponding Wald testing statistics may severely decrease in power. To overcome this, some researchers treated genotypes as ordinal variables and reversed the regression process by regressing genotypes on phenotypes (O’Reilly ). When a phenotype is a continuous variable, this new method is indeed useful for removing or relaxing the continuity and linearity assumptions. However, this does not work when a phenotype is exactly ordinal, such as in the above-mentioned example of liver steatosis. Therefore, we propose a new method to deal with this problem. In this work, we treat genotypes as ordinal variables and propose a new procedure to assess the association between an ordinal phenotype and ordinal genotype after adjusting for covariates. Rather than regressing the phenotype on the genotype or regressing the genotype on the phenotype using existing methods, we jointly model the phenotype and genotype by introducing a latent variable following a multivariate normal distribution. The phenotype and genotype are regarded as manifestation values of the latent variable. The relationships between phenotypes, genotypes, and covariates of interest are elaborately described by the covariance matrix. Taking advantage of the framework of generalized estimation equations (Hanley ; Zhang ) and M-estimation theory (Huber 1981; Stefanski and Boos 2002), we construct a Wald test statistic for an equivalent transformation of the original null hypothesis, and prove that it asymptotically follows the standard normal distribution under the null hypothesis. Numerical simulations are conducted to compare the proposed method with other methods. Our simulation results show that the proposed method can suitably maintain Type I error control and may achieve considerable statistical power compared to existing methods in various scenarios. Finally, we apply the proposed method to anticyclic citrullinated protein antibody data for rheumatoid arthritis studies, to further demonstrate its performance.

Materials And Methods

Notations

Let random variables G and Y denote a genotype and ordinal phenotype, respectively, and be a -dimensional continuous covariate of interest, with , where τ denotes the transpose of a matrix or vector. Without loss of generality, we assume that there are two alleles at a genetic locus, with one being the risk allele and the other being the reference allele. The value of the random variable G represents the number of the risk allele at a locus, which means that G can take three values: 0, 1, or 2. Suppose that Y takes m ordinal values: . The null hypothesis states that after adjusting for covariates, the phenotype is not related to the genotype, i.e., the phenotype and genotype are conditionally independent given a set of covariates, which can be denoted byAssume that n subjects are enrolled in the genetic association study. Further, let , , and be the n observations of G, Y, and corresponding to the n subjects, respectively.

Equivalent statement of by introducing a latent variable

We assume that the combined vector is generated from a k-dimensional random variable following a multivariate normal distribution, with mean vector (the k-dimensional column vector with all units being zero) and covariance matrix , where and all its diagonal elements are equal to one. That is, represents the manifestation of and . We rewrite in the partitioned matrix formwhere is a 2 2 matrix. Then, follows a multivariate normal distribution with mean vector and covariance matrix . Then, G and Y can be obtained as follows:andwhere and . Based on the theory of the conditional normal distribution, we have that We define Then, the conditional covariance matrix above can be further expressed asNow, denote , . By introducing the latent variable and taking advantage of its distribution property, we can state that the original hypothesis is exactly equivalent to

Proposed statistical test

In this subsection, we construct a test statistic to test based on the generalized estimating equation technique and M-estimation theory. Recall that the joint distribution of and isHence, the conditional distribution of given isSimilarly, the conditional distribution of given isDenote . It should be noted that the marginal density function of and the joint density functions of each pair of variables among , and are as follows:where is an indicator function, i.e., is one if the event E holds and zero otherwise. The unknown parameters and can be estimated via the following procedures. First, based on the marginal density function (11) of , we have the likelihood functionBy maximizing on , we can obtain the MLE of (denoted by ), which is the sample covariance matrix of the observed data . Second, in our model both G and Y are ordinal variables, whose realizations are determined by the intervals in which the values of two standard normal random variables and may fall in, respectively. Specifically, we employ the distribution properties of G and Y to intuitively estimate and , respectively. We define , , , and for based on the observed data and of G and Y, respectively. Recall that . Then, we can estimate as , and as , by solving the following equations:where is the cumulative distribution function of the standard normal variable. The parameter can be estimated using the generalized estimating equation technique and M-estimation theory. First, letThe function vector consisting of the first-order partial derivatives of with respect to each parameter in isThen, the estimator of is the root of the following generalized estimating equationAfter estimating all the unknown parameters, the estimate of can be expressed as . To construct a Wald-type test statistic, we need to derive the asymptotical distribution of . Based on the classical M-estimation theory, asymptotically follows a multivariate normal distribution. That is,where , , and According to the delta method, is also asymptotically multivariate normal. Namely,where . Now, we can propose a new Wald test statistic for the null hypothesis as follows:

Data availability

The authors affirm that all data necessary for confirming the conclusions of the article are present within the article, figures, and tables. Supplemental material available at FigShare: https://doi.org/10.25387/g3.8226650.

Results

Simulation results

In this subsection, we present a series of simulation studies to investigate the performance of our proposed latent variable model (abbreviated as lvm), and compare it with the probit and logit models, which both regressing an ordinal phenotype on a genotype. We compared these under multiple simulation scenarios, so that different modeling assumptions would be favored. Two types of data generation mechanism were considered throughout our simulation studies: (i) generating data from a multivariate normal random variable (named the ND mechanism) and (ii) generating data under the proportional odds model (named the PO mechanism). In addition, three genetic models (co-dominant, dominant, and recessive models) were considered. The specifics of our simulation data generation scenarios are as follows. For simplicity, the dimension of the covariate is one, and the number of levels for a phenotype Y is set to five. Under the ND mechanism, the three-dimensional latent variable was generated from a multivariate normal distribution, with the genotype G being a manifestation of , the ordinal phenotype Y being a manifestation of , and the covariate being equal to . Each marginal distribution of followed a standard normal distribution. Note that the distribution of G varied according to the type of the true genetic model. Let A (major allele) and a (minor allele) denote two alleles at the single biallelic locus corresponding to the genotype G. Under the Hardy–Weinberg equilibrium (HWE) conditions, the expected genotype frequencies of G being , , and would be , and , respectively, with the minor allele frequency (MAF) p taking on values from . If the HWE assumption did not hold, we directly set three kinds of multinomial distribution (P) for (, , ) with different parameter structures , and . When a co-dominant model was assumed, the three genotypes , , and were coded as 0, 1, and 2, respectively. In a model where a dominant effect was assumed, the genotype was coded as 0 while both and were coded as 1. Accordingly, scores of 0 for both and and 1 for were employed in a recessive model. In addition, under each genetic model the probabilities of Y being , and 5 were always , and 0.05, respectively. We set the covariance matrix of asto investigate the Type I error rate, and let beto compare the statistical power under different alternatives, depending on the parameter θ, whose range was the set . Under the PO mechanism, the ordinal phenotype Y was related to the genotype G and covariate through the proportional odds model. Specifically, the distribution of G under different genetic models would remain the same as that under the ND mechanism, regardless of whether the HWE held. In this case, still followed the standard normal distribution, and Y was generated with five levels using the proportional odds modelwhere , , , . It should be noted that . When , the null hypothesis was true, and the derived simulation data were used to compare the Type I error rates for the three models of interest, i.e., the lvm, probit, and logit models. When , we explored the power of these three models under different alternatives. As previously described, we considered scenarios. For each simulation scenario, we generated 1000 datasets, each consisting of 300 subjects. P-values were calculated for each dataset using the three respective models. The nominal level of the tests was set to 0.05, and all simulations were performed using the R language (https://www.r-project.org/). The empirical Type I error rates and power estimates were calculated using the percentage of rejection in each scenario. The results are presented side-by-side in Tables 1–4. Note that the blanks (marked with —) in these four tables are a result of unavailability of the simulation data under the corresponding scenarios. The reason for this is that in these parameter setting conditions, the mean number for G amounting to 1 is three, which can easily lead to samples with all the G values being 0, such that none of the three considered models can be applied.
Table 1

Type I error rates and power estimates under the ND mechanism (HWE holds)

θMAFco-dominantdominantrecessive
probitlogitlvmprobitlogitlvmprobitlogitlvm
null0.10.0510.0460.0440.0400.0400.034
0.20.0490.0490.0400.0530.0540.0480.0490.0580.069
0.30.0480.0510.0460.0480.0540.0490.0560.0590.058
0.40.0490.0520.0410.0500.0420.0410.0460.0460.045
0.50.0460.0500.0370.0540.0480.0600.0550.0560.050
−0.20.10.5140.4810.5570.5030.4590.544
0.20.6360.6200.6680.6540.6300.6700.0750.0130.253
0.30.7130.6850.7360.6570.6540.6680.3090.2460.390
0.40.7650.7420.7780.6650.6570.6780.4450.4030.500
0.50.7570.7470.7680.6050.6070.6170.5800.5560.610
−0.10.10.1830.1620.2280.1630.1460.190
0.20.2500.2370.2670.2180.2090.2330.0310.0060.138
0.30.2890.2880.3090.2230.2170.2370.1260.0940.166
0.40.2400.2340.2510.2330.2390.2440.1580.1280.195
0.50.2840.2740.2990.1920.1940.1990.1960.1730.220
0.10.10.1350.1470.1480.1720.1690.180
0.20.1950.1970.2020.1620.1610.1690.0830.0940.128
0.30.2080.2020.2170.1840.1830.1920.0980.1030.113
0.40.2100.1960.2170.1740.1590.1900.1310.1320.138
0.50.2080.1970.2180.1410.1250.1650.1710.1660.176
0.20.10.4940.4880.5090.4840.4770.493
0.20.5900.5860.5980.5850.5730.5950.2370.2480.269
0.30.6460.6440.6620.5770.5690.5990.3500.3640.370
0.40.6750.6650.6850.5460.5330.5730.4460.4480.446
0.50.6830.6700.7040.4580.4160.4860.4950.4950.504
Table 4

Type I error rates and power estimates under the PO mechanism (HWE does not hold)

Pco-dominantdominantrecessive
probitlogitlvmprobitlogitlvmprobitlogitlvm
nullP10.0500.0540.0630.0540.0530.0590.0550.0510.057
P20.0520.0510.0530.0480.0460.0500.0580.0590.058
P30.0540.0570.0600.0560.0600.0610.0560.0530.068
betaP10.9680.9750.9720.5890.6240.5990.7550.7740.765
P20.9360.9370.9260.5950.6080.6020.6980.7060.713
P30.7370.7610.7560.6050.6320.6200.3290.2930.375
Table 1 presents the results for the ND mechanism when the HWE holds. The first five rows suggest that all three methods can control the Type I error rate at the nominal level of 0.05 under the three different genetic models. Furthermore, the remaining rows show that the lvm model enhances the statistical power over the probit and logit models. In some cases, the power gain can be as high as 0.07 to 0.1. For example, when and , the empirical powers for the probit, logit, and lvm models are 0.514, 0.481, and 0.557 under the co-dominant genetic model, and are 0.503, 0.459, and 0.544 under the dominant genetic model, respectively. When the true genetic model is recessive and and , the power estimates for the probit, logit, and lvm models are 0.126, 0.094, and 0.166, respectively. The results of the three methods under the ND mechanism when the HWE does not hold are presented in Table 2. We can observe that the proposed method achieves a greater power than the other two methods, even though the distribution of the genotype G does not satisfy the HWE conditions. Specifically, the power gain can be as high as 0.07 when and the distribution of G is .
Table 2

Type I error rates and power estimates under the ND mechanism (HWE does not hold)

θPco-dominantdominantrecessive
probitlogitlvmprobitlogitlvmprobitlogitlvm
nullP10.0600.0610.0550.0490.0390.0520.0540.0540.049
P20.0420.0500.0430.0430.0370.0600.0590.0560.050
P30.0500.0450.0530.0400.0320.0460.0590.0580.059
−0.2P10.7450.7410.7540.5560.5510.5640.6530.6270.662
P20.7750.7590.7880.5700.5810.5860.6130.5870.642
P30.6840.6670.7000.5830.5750.5890.3510.2990.428
−0.1P10.2480.2470.2660.1940.1950.1980.2430.2320.256
P20.3020.2860.3150.1970.1940.1990.2200.2050.244
P30.2380.2240.2530.1920.1920.1980.1610.1200.198
0.1P10.1720.1630.1880.1460.1300.1750.1750.1800.189
P20.1910.1840.2020.1230.1070.1480.1930.1910.195
P30.1800.1760.2010.1500.1330.1710.1330.1330.136
0.2P10.6380.6200.6690.4790.4400.5230.5640.5420.579
P20.6690.6650.6870.4650.4290.4990.5350.5190.545
P30.6150.6050.6330.4420.4110.4710.3830.3990.401
The corresponding results when the data are generated under the PO mechanism and the HWE is assumed are displayed in Table 3. It follows that all three models can control the Type I error rates under the null hypothesis. Even though the data are generated using the proportional odds model, the lvm method still performs better than the other two methods in detecting an alternative hypothesis, and can achieve a power gain of up to 0.058 in some cases, such as the setting with for the recessive genetic model. It is worth noting that the advantage of the proposed lvm method is more obvious under the recessive model.
Table 3

Type I error rates and power estimates under the PO mechanism (HWE holds)

MAFco-dominantdominantrecessive
probitlogitlvmprobitlogitlvmprobitlogitlvm
null0.10.0470.0390.0560.0510.0510.062
0.20.0570.0540.0630.0550.0500.0580.0300.0350.070
0.30.0500.0490.0540.0510.0490.0540.0420.0390.057
0.40.0440.0380.0430.0520.0420.0550.0460.0380.059
0.50.0480.0440.0550.0590.0560.063
beta0.10.5800.5630.5980.5190.5120.534
0.20.8060.8130.8220.7150.7220.7250.0870.0530.233
0.30.8970.9000.9010.7630.7760.7750.2910.2540.349
0.40.9290.9330.9360.7460.7490.7590.5150.4970.532
0.50.9290.9320.9290.6820.6940.6970.6470.6460.655
The simulation results under the PO mechanism when the HWE does not hold are presented in Table 4. We observe that the advantage of the proposed lvm method is not as significant as that when the HWE holds, but the model is superior to the probit model in all scenarios. Moreover, the logit model has a slightly greater power under the co-dominant and dominant models, while the lvm method outperforms it in the recessive model. From all of the four tables, it can be seen that the proposed method might have potential limitations in controlling Type I error rates in a few situations, while the power gains in almost all of simulation scenarios indeed indicate its efficiency for practical applications.

Application to anticyclic citrullinated protein antibody data for rheumatoid arthritis study

It is well known that rheumatoid arthritis (RA) is significantly associated with some genetic variants (Carlton ; Ruiz-Larrañaga ). The anticyclic citrullinated protein antibody (anti-CCP) can be an auxiliary diagnosis indicator for RA, and the specificity of anti-CCP lies between 87.8% and 96.4% (Coenen ). Besides, the genomic region of 6p21.33 has been reported to be associated with RA (Zhang ; Zhang and Li 2015). The aim here is to check whether the single nucleotide polymorphisms (SNPs) in the 6p21.33 region are associated with anti-CCP, taking advantage of the proposed lvm test. Note that there are a total of 45 SNPs in the region of 6p21.33 according to the Genetic Analysis Workshop 16 Data, all of which meet the quality control rule of the MAF being more than 5%, the missing rate being smaller than 15%, and the least genotype frequency being no less than five. The anti-CCP measure takes four values, 1, 2, 3, and 4, and the number of subjects who have these four values were 1195, 103, 66, and 698, respectively. The total number of subjects was 2062. Five principal components coordinated by applying the multi-dimensional scaling method (Li and Yu 2008) to the 12747 population structure information SNPs (Yu ) were used to adjust for population stratification effects. Before conducting association analysis, we run chi-square tests to check whether HWE holds for each of these 45 SNPs in controls. The P-value results are summarized in Table 5. At a 0.05 level of significance, we can state that the HWE law holds for all SNPs in controls on the basis of their Bonferroni-corrected P-values. Then we apply the probit model, the logit model, and the lvm model to these 45 SNPs to test their association with anti-CCP in sequence. The results are presented in Table 5. It shows that after Bonferroni correction, the SNPs rs2246986, rs3093998, rs2071596, and rs2844509 were found to be significant under the probit and logit models, while the SNPs rs2516398, rs2844494, rs3130637, rs3093993, rs3095227, rs2259435, and rs2844509 were identified as significant using the lvm method. Though these two groups of SNPs overlap at only one SNP rs2844509, each of other SNPs found by the probit (logit) model is physically close to one or two SNPs found by the lvm model. For example, the SNP rs2246986 of the first group is 677 kb away from rs2516398 on one side and is 1212 kb away from rs2844494 on the other side, while both of these two SNPs rs2516398 and rs2844494 are included in the second group. In addition, the SNP rs3093998 (in the first group) is 2971 kb away from rs3130637 (in the second group). The distances are so short that it is reasonable to infer that the SNPs rs2246986, rs2516398, and rs2844494 contain similar information. So do another two SNPs rs3093998 and rs3130637. In short, for detecting the association between anti-CPP and the genomic region 6p21.33, the proposed lvm method is more powerful than the methods based on logit and probit models.
Table 5

P-values of 45 SNPs in the region of 6p21.33 for Genetic Analysis Workshop 16 Data

SNP IDlocationMAFHWE testprobitlogitlvm
rs6940467315501160.32400.73540.00850.00650.1596
rs12660382315513020.10760.85310.89040.97240.4209
rs2395488315538880.18510.06390.00310.00360.1066
rs2248372315544450.35150.60270.32600.34870.8352
rs2248373315545250.31260.06960.67020.67570.6428
rs2248462315547750.33290.09050.59060.61380.3037
rs2516513315555670.20180.15260.46250.44080.3336
rs2516424315562940.20370.08680.47370.44810.4048
rs2248617315565120.35020.28450.32530.34780.9011
rs3828893315566320.35080.82630.38650.41430.9098
rs3749946315568410.06930.86140.86680.84510.6300
rs3099844315569550.06770.32220.15560.14930.0249
rs2905722315573060.11350.25840.20390.14430.0400
rs2523647315577570.13550.90040.01190.01350.0020
rs2516509315579730.23410.70440.03550.02840.0038
rs2523710315588880.20030.07420.49010.46730.3584
rs2905747315594550.17870.38430.06950.06440.0032
rs2523467315655570.19960.27300.84050.93770.3975
rs2516415315677210.30730.40620.81820.73860.8137
rs3130922315690680.31720.00390.00970.01430.4089
rs3828903315727180.32540.00150.02110.01480.0049
rs3828914315737980.31790.08190.05240.04320.0152
rs2855812315806990.24640.84260.01740.02390.3046
rs3134899315812650.23570.53370.99910.95050.5082
rs2844498315848330.42790.91480.98200.92210.4318
rs2246618315869650.28860.00380.78290.87020.4483
rs2516400315890840.31040.38590.71780.82150.3033
rs2516399315892780.08860.44150.00510.00370.0158
rs2516398315895050.31390.68240.00230.00220.0001
rs2246986315901820.08070.83150.00100.00070.0106
rs2844494315913940.32070.86150.00990.01010.0004
rs9267444315914370.33630.66470.93170.87460.3115
rs3093998315931530.33100.02330.00030.00070.0046
rs3130637315961240.23900.01930.01370.01750.0004
rs3132454315976230.36360.40280.80700.83520.3769
rs3093993315987040.23880.02080.01390.01780.0004
rs3095227315989790.23650.04770.01670.02100.0005
rs2259435316048940.18000.27650.00940.01110.0002
rs3093983316049040.19170.28580.49220.53270.1049
rs3130055316053780.27650.31650.96000.99240.4228
rs3093978316064760.19160.23370.36180.39810.0706
rs2734583316134590.11090.00400.03290.02200.0059
rs2071596316146700.18450.22630.000020.000020.0689
rs2516393316147230.19110.26130.39050.42820.0776
rs2844509316189030.20940.56010.00000.00000.0003

Discussion

In this work, we have shown that the idea of treating a genotype variable as ordinal without assuming linearity can result in a more powerful and robust test, via introducing a joint multivariate normal distribution for the group of genotypes, traits, and covariates. Meanwhile, we have also demonstrated that the proposed lvm test can provide appropriate Type I error rates. The important strength of our method is that it does not make an assumption on the type of relationship between a phenotype and genotype; nor does it treat the genotype as a continuous variable. Rather, our approach only introduces a latent multivariate normal variable to characterize the relationship between the two, which is very reasonable, and generally considerably more useful. Besides the simulations with respect to significance level of 0.05, we also conducted simulation studies with a lower significance level 0.005. The results are given in Supplementary Table S1. We found that the proposed method can reasonably control Type I error rates and achieve power gains at this lower significance level, similar to those results in Tables 1-4. It is worth mentioning that our proposed test model can also be applied to other situations where the outcome is continuous. In such a situation, we can still employ a joint multivariate normal distribution to model outcomes, genotypes, and covariates simultaneously. Even though the proposed method might have potential limitations in Type I error rate control in some situations, the power gains prove its efficiency in practical applications. In population-based genetic association studies, hundreds of thousands of subjects are often enrolled to achieve optimal power. It is inevitable that there exists a population stratification effect in such large-scale studies. Not considering the effect of population stratification could lead to many false positive findings, and therefore adjusting for its effect represents the basis for conducting a genetic association analysis (Price ; Li and Yu 2008). In this study, to characterize the influence of population stratification when investigating the relationships between ordinal traits and genotypes, we treat these effects as covariates in the proposed lvm method. The numerical results of our simulation studies and real data applications have demonstrated that the strategy is feasible and effective.
  17 in total

1.  Statistical analysis of correlated data using generalized estimating equations: an orientation.

Authors:  James A Hanley; Abdissa Negassa; Michael D deB Edwardes; Janet E Forrester
Journal:  Am J Epidemiol       Date:  2003-02-15       Impact factor: 4.897

2.  Multiple-interval mapping for ordinal traits.

Authors:  Jian Li; Shengchu Wang; Zhao-Bang Zeng
Journal:  Genetics       Date:  2006-04-03       Impact factor: 4.562

3.  Technical and diagnostic performance of 6 assays for the measurement of citrullinated protein/peptide antibodies in the diagnosis of rheumatoid arthritis.

Authors:  Dries Coenen; Patrick Verschueren; René Westhovens; Xavier Bossuyt
Journal:  Clin Chem       Date:  2007-01-26       Impact factor: 8.327

4.  Improved correction for population stratification in genome-wide association studies by identifying hidden population structures.

Authors:  Qizhai Li; Kai Yu
Journal:  Genet Epidemiol       Date:  2008-04       Impact factor: 2.135

5.  Chromogranin-A and N-terminal pro-brain natriuretic peptide: an excellent pair of biomarkers for diagnostics in patients with neuroendocrine tumor.

Authors:  Catharina M Korse; Babs G Taal; Cornelis A de Groot; Robert H Bakker; Johannes M G Bonfrer
Journal:  J Clin Oncol       Date:  2009-08-10       Impact factor: 44.544

6.  Genetic variants associated with rheumatoid arthritis patients and serotypes in European populations.

Authors:  Otsanda Ruiz-Larrañaga; Maria Uribarri; Maria C Alcaro; Sergio Escorza-Treviño; Jokin Del Amo; Mikel Iriondo; Carmen Manzano; Paola Migliorini; Veronika Lóránd; Andone Estonba
Journal:  Clin Exp Rheumatol       Date:  2016-03-03       Impact factor: 4.473

7.  Testing for association with multiple traits in generalized estimation equations, with application to neuroimaging data.

Authors:  Yiwei Zhang; Zhiyuan Xu; Xiaotong Shen; Wei Pan
Journal:  Neuroimage       Date:  2014-04-01       Impact factor: 6.556

8.  Nonparametric Risk and Nonparametric Odds in Quantitative Genetic Association Studies.

Authors:  Wei Zhang; Qizhai Li
Journal:  Sci Rep       Date:  2015-07-15       Impact factor: 4.379

9.  Case-control genome-wide association study of rheumatoid arthritis from Genetic Analysis Workshop 16 using penalized orthogonal-components regression-linear discriminant analysis.

Authors:  Min Zhang; Yanzhu Lin; Libo Wang; Vitara Pungpapong; James C Fleet; Dabao Zhang
Journal:  BMC Proc       Date:  2009-12-15

10.  Identification of multiple gene-gene interactions for ordinal phenotypes.

Authors:  Kyunga Kim; Min-Seok Kwon; Sohee Oh; Taesung Park
Journal:  BMC Med Genomics       Date:  2013-05-07       Impact factor: 3.063

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.