Literature DB >> 24104466

Soft truncation thresholding for gene set analysis of RNA-seq data: application to a vaccine study.

Brooke L Fridley1, Gregory D Jenkins, Diane E Grill, Richard B Kennedy, Gregory A Poland, Ann L Oberg.   

Abstract

Gene set analysis (GSA) has been used for analysis of microarray data to aid the interpretation and to increase statistical power. With the advent of next-generation sequencing, the use of GSA is even more relevant, as studies are often conducted on a small number of samples. We propose the use of soft truncation thresholding and the Gamma Method (GM) to determine significant gene set (GS), where a generalized linear model is used to assess per-gene significance. The approach was compared to other methods using an extensive simulation study and RNA-seq data from smallpox vaccine study. The GM was found to outperform other proposed methods. Application of the GM to the smallpox vaccine study found the GSs to be moderately associated with response, including focal adhesion (p = 0.04) and extracellular matrix receptor interaction (p = 0.05). The application of GSA to RNA-seq data will provide new insights into the genomic basis of complex traits.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 24104466      PMCID: PMC3793215          DOI: 10.1038/srep02898

Source DB:  PubMed          Journal:  Sci Rep        ISSN: 2045-2322            Impact factor:   4.379


With the advent of next-generation sequencing, many researchers are using RNA-Seq to profile mRNA expression of the entire transcriptome. The use of RNA-Seq allows researchers to determine: all transcripts (novel and known); different isoforms; allelic expression; splicing patterns; fusion genes; and differences in expression levels of each transcript1. However, due to the relatively high cost of RNA-Seq, many experiments are done with relatively few samples, thus limiting the statistical power to detect differences in gene expression under different conditions. In addition to the limited sample size, the number of features measured on each subject has increased from approximately one million to 10–15 million features (including SNVs, indels, and structural variants) for each subject (depending on the depth of coverage and region targeted for sequencing23) which increases the multiple testing penalty. As many complex disorders may be controlled by the interplay of multiple genes within the same molecular pathway or gene set (GS), gene set analysis (GSA) has been widely used for mRNA data from microarrays to aggregate the association signals for a set of genes within a GS. This incorporates biological knowledge, reduces the multiple-testing burden, and may increase the association signal, thus increasing the power to detect meaningful associations. With the advent of next-generation sequencing technologies, the use of GSA is even more relevant due to limited statistical power resulting from generally small sample sizes. Over the past ten years, many approaches have been proposed for GSA of mRNA data from microarrays45678. In addition, many of the gene set methods proposed for mRNA data have been extended for use in genome-wide genetic association studies using single nucleotide polymorphism (SNP) microarrays91011. Previous research of self-contained GSA methods (i.e., approaches that test the null hypothesis H: genes within the GS of interest are not associated with the phenotype versus the alternative hypothesis H: genes within the GS are associated with the phenotype) found the global random effect model12 and Fisher's method to be two powerful approaches for analysis of microarray based mRNA expression data13. In addition, we found the soft truncation thresholding Gamma method (GM)14, a generalization of Fisher's method for combining p-values, to be more powerful than Fisher's method and the global random effects model for GSA of SNP data15. Thus, we hypothesized that the GM would be a powerful approach for GSA of mRNA gene expression data measured by microarray or next generation sequencing technologies to determine sets of genes associated with an endpoint (e.g. differential expressed genes between two experimental conditions). In this manuscript, we present the findings from an extensive simulation study which compares performance of the GM to that of other commonly used self-contained GSA methods on gene expression data. We also applied the GM for GSA to a smallpox vaccine immunogenetic study involving RNA-Seq data to determine GSs with differentially expressed genes between high and low responders to the vaccine.

Results

Simulation studies

The results from the simulation studies found the GM to outperform other proposed approaches for self-contained GSA. All methods had adequate control of the type I error rate. Table 1 presents the summary of the power for the various methods across all simulation scenarios and Figure 1 presents a power comparison between simulation scenarios for the subset of the most powerful GSA methods. The full set of results for all methods assessed can be found in Supplemental Table 1. The GM with soft truncation threshold (STT) < 1/e consistently outperformed the other commonly used GSA approaches for gene expression data. The only method that had similar performance was the full model with fixed effects; however, this approach could only be applied to approximately two-thirds of the simulation scenarios where the number of genes in the GS was less than the number of samples.
Table 1

Summary of power across all 1440 non-null simulation scenarios for sample sizes of N = 500 and N = 100, with 1000 simulated data sets per scenario. The GM with various STT values is compared to ten previously proposed self-contained GSA methods. Table entries are sorted by descending mean power for the scenarios with sample size of 500

  N = 500N = 100
MethodSTTMin.1st Qu.MedianMeanMin.1st Qu.MedianMean
Gamma Method (GM)0.10.264110.9910.0720.99410.923
GM0.050.331110.9930.0730.99010.921
GM0.150.210110.9900.0730.99510.920
Global model with fixed effects (GMFE)* 0.223110.9850.0680.99810.906
GM0.20.171110.9880.0680.99210.916
GM0.010.449110.9930.0670.9610.903
Global model using random effects (GMRE) 0.111110.9830.0650.94510.896
Fisher's Method/Gamma Method (FM)1/e0.096110.9800.0590.94310.889
PCA using principal components that explain 80% of variation (PCA80) 0.101110.9740.060.85610.855
Stouffer's Method (SM) 0.062110.9330.0510.67410.816
FTS. GS Modified Tail Strength (MTS) 0.057110.9240.0450.57410.787
PCA using top five principal components (PCA1.5) 0.070.95810.9200.0560.6310.9930.788
Tail Strength (TS) 0.0780.98110.8670.0590.62110.797
Kolmogorov-Smirnov (KS) 0.0510.96010.8190.0470.3940.9990.738
PCA using top principal component (PCA1) 0.0560.59110.8080.0510.3140.9910.704

*198 and 396 scenarios were unable to be fit do to size of gene set for N = 500 and N = 100, respectively.

Figure 1

Power comparison between the Gamma Method with STT = 0.15 and 0.05 (GM.15, GM.05), Fisher's Method (FM), Global model with random effects (GMRE), Principal components analysis with 80% of components that explained the variability included in the model (PCA.80), Kolmogorov-Smirnov test (KS).

Comparison of the GM with various STT values, ranging from 0.20 to 0.01, is presented in Figure 2. As expected, there was more similarity between the GM with similar STT values (e.g., GM with STT = 0.20 versus GM with STT = 0.15) and less similarity between results when the STT values were further apart (e.g., GM with STT = 0.20 versus GM with STT = 0.01). The optimal value of the STT for a given GSA will depend on the underlying, unknown, disease model, but in general STT values that are not too small, e.g. between 0.01 and 1/e (≈0.36), tend to give the best power. This coincides with the rationale for GSA, in that we wish to detect GSs with multiple genes with moderate or small effects for which we have limited power to detect individually.
Figure 2

Power comparison between the Gamma Method with various STT values.

STT values ranged from 0.20 to 0.01.

Finally, based on results from the null simulations designed from the smallpox vaccine study, no relationship was found between type I error rate and: number of genes in a GS; average length of genes in a GS; sum of all lengths of genes in a GS; and the number of "large" genes in a GS (Supplemental Figure 1). We also observed that the GM's type I error rate was found to be controlled for all values of STT (and thus the corresponding shape parameter ω in the Gamma transformation).

Smallpox vaccine study

The GM with various STT values, including FM, was applied to the smallpox vaccine study to determine if any GSs in KEGG16 were associated with response to the smallpox vaccine. Table 2 presents GSs with p < 0.05 for any of the GM analyses with various STT values. To adjust for multiple testing, FDR q-values were computed17. For the analysis of response to smallpox vaccine, there was little difference between the GM results with STT < 0.20, while the largest p-values resulted from FM (i.e., GM with STT = 1/e). The top associated GSs (with pathway coverage ≥ 70%) included: Biotin metabolism (p = 0.0005, q = 0.02); non-homologous end-joining (p = 0.02, q = 0.17); focal adhesion (p = 0.04, q = 0.17); D-Glutamine and D-glutamate metabolism (p = 0.04, q = 0.17); and ECM-receptor interaction (p = 0.05, q = 0.17).
Table 2

Top GSs associated with response to Smallpox vaccine for various STT values. Results with p < 0.05 from GSA using the GM with any of the STT values are presented

 GSA P-values for various STT Value   
Gene Set0.050.100.150.201/eN Genes in KEGGN Genes in AnalysisCoverage of Pathway
Biotin metabolism0.0005*0.00050.00050.00050.00222100%
Pentose and glucuronate interconversions0.0180.0210.0250.0310.11928829%
Non-homologous end-joining0.0220.0260.0310.0390.121141286%
Focal adhesion0.0390.0530.0650.0760.10320114874%
D-Glutamine and D-glutamate metabolism0.0410.0410.0420.0410.05044100%
ECM-receptor interaction0.0460.0640.0800.0930.129845970%
Lysine biosynthesis0.0580.0550.0480.0420.0444375%

*FDR q-value was 0.02.

Within the top GSs, Table 3 presents the genes with gene-level association p < 0.15. Of particular relevance to vaccine response are the genes within the focal adhesion GS (p < 0.15 for 30 genes out of the 148 genes measured in pathway of 201 genes), as many of these genes are involved in cytokine-cytokine receptor interactions. Another interesting GS is the EMC-receptor interaction GS (p < 0.15 for 9 genes out of the 59 genes measured in the pathway of 84 genes). Genes within this GS interact with a number of immunologically important cell surface molecules including: integrins which mediate cytokine adhesion, extravasation, and homing; CD44 (a cell surface glycoprotein involved in lymphocyte homing, migration, and activation); and CD36 (a scavenger receptor expressed on multiple cell types including: monocytes, macrophage, dendritic cells). SPP1 (osteopontin), the top-most associated gene in the ECM-receptor interaction GS, up-regulates the production of IFNγ and IL-12 thereby driving Th1-type immune responses. The MAPK9 gene identified in the focal adhesion GS is a member of the MAP kinase family and is required for differentiation of T helper cells into Th1 cells. As depicted in the dendrogram in Figure 3, these two GSs contain many genes in common, including genes COL1A2 (p = 0.0278), THBS4 (p = 0.0311) and ITGB3 (p = 0.0168).
Table 3

Gene-level results (p < 0.15) for GSs with p < 0.05

Gene SetGeneP-valueGene SetGeneP-value
Biotin metabolismHLCS0.0005Focal adhesionCOL1A20.0278
Pentose & glucuronate interconversionsUGP20.1160 THBS40.0311
 DCXR0.0115 AKT30.0385
Non-homologous end-joiningPRKDC0.0164 SOS20.0430
 RAD500.0613 BRAF0.0468
 MRE11A0.0802 PDGFA0.0493
D-Glutamine & D-glutamate metabolismGLS20.0176 JUN0.0496
 GLS0.1061 FLNB0.0521
ECM-receptor interactionSPP10.0117 ZYX0.0626
 ITGB30.0168 PIK3CA0.0776
 LAMB10.0208 PPP1R12A0.0788
 COL1A20.0278 MAPK80.0801
 THBS40.0311 ERBB20.0814
 COL5A10.0947 ROCK20.0899
 COL11A20.1277 COL5A10.0947
 COL5A20.1288 GRLF10.0996
 HSPG20.1452 PIK3R10.1028
focal adhesionPRKCA0.0090 CCND30.1089
 SPP10.0117 SHC10.1170
 MAPK90.0131 ARHGAP50.1252
 IGF1R0.0132 COL11A20.1277
 ITGB30.0168 COL5A20.1288
 LAMB10.0208 BIRC30.1440
 VASP0.0219   
Figure 3

Dendrogram of top 25 GS associated with response to smallpox vaccine to visualize relationship and overlap between gene sets.

GSs containing a large set of genes in common would be clustered close together while GSs with no genes in common would not be clustered together.

Discussion

In this paper, we present the use of the GM for GSA to determine GSs in which the transcript levels for genes within the GS are associated with a phenotype. The method is able to be used in both the context of microarray data and next-generation sequence data, along with the ability to be used for both binary and quantitative traits. An extensive simulation study, involving over 1,400 simulation scenarios, was completed to compare the GM with various levels of STT to other GSA methods, including Fisher's method and the Global model with random effects of Goeman et al. (2004), that were found to be powerful approaches for self-contained GSA in past research13. From our simulation study we found the GM with STT < 0.20 to uniformly outperform the other methods, while maintaining type I error rate control. In addition to the simulation study, we applied the GM for GSA to a RNA-Seq smallpox study to identify GSs with differences in mRNA expression between high and low responders to the smallpox vaccine. The top biologically relevant GSs included focal adhesion (p = 0.04) and ECM-receptor interaction (p = 0.05). Of note, these GSs mediate communication and interactions between the leukocytes involved in immune responses. It is possible that the differential expression seen in the high and low responders reflects a differential ability of circulatory peripheral blood mononuclear cells to recognize viral infection and coordinate the resulting immune responses. These results indicate that further examination of these gene pathways in mixed cell populations and in specific cell subsets (i.e. B cells, monocytes, CD4 T cells) is warranted, as such studies may further our understanding of poxvirus immunity. In conclusion, this research shows the GM with STT < 0.20 to be a powerful method for GSA. In practice, we suggest the selection of an STT value between 0.10 and 0.20, realizing the optimal STT depends on each individual study. Care should be taken in the interpretation of results from GSA completed based on multiple STT values (i.e., multiple testing and “data snooping”). In addition, the application of the method to a smallpox vaccine study has provided new insights into the genomic basis of individual variations in immune response to the vaccine.

Methods

Vaccine study

In brief, 21 high and 23 low responders to smallpox vaccine based on extremes of antibody titer were selected from a cohort of 1076 successful smallpox vaccine recipients. Aliquots of these 44 subjects' peripheral blood mononuclear cells (PBMCs) were either left unstimulated or were stimulated with vaccinia virus for a total of 88 specimens. Specimens were allocated to flow cell and lane for sequencing ensuring that response status was balanced over these potential experimental effects, ensuring the paired specimens from a given subject were in adjacent lanes on the same flow cell. For purposes of the GSA methods comparison here, we focus on the 44 stimulated specimens to test the hypothesis of differential gene expression between high and low responders. Full details of the study cohort are provided in Kennedy et al. (2009), Haralambieva et al.(2011), Ovsyannikova et al. (2011)18192021, and of the PBMC stimulation and RNA-Seq methods in Kennedy et al (2013)22.

Simulation study

Power and type I error simulations

The simulation of gene expression data and a quantitative phenotype was completed in a similar manner as outlined in Fridley et al13. Briefly, let n represent the number of subjects and m represent the number of genes in a GS. The expression data for each subject (i = 1,…,n) was simulated from a multivariate normal distribution with mean equal to zero and covariance matrix Σ. The matrix Σ was set to either the case where there is no correlation between the genes in a GS or a structure in which all genes in the GS are correlated the same amount. While these precise correlation structures are not biologically realistic, they do facilitate the comparison of the algorithms, and the relative performance of the algorithms should extend to correlation structures observed in real data. The quantitative phenotype (Yi) for each subject was generated conditional on the gene expression data, Y ~ N(μ, σ2) with μ = βXi, where represents the vector of gene expression values for genes within the GS for subject i and β is the vector of gene level effects. The simulation scenarios varied in terms of the number and size of the effects (β), sample size, GS size, correlation between genes within a GS, and variation in the phenotype. For each of the simulation scenarios, 1000 data sets were generated to assess either the power (1440 scenarios) or the type I error rate (72 scenarios). For determining power and type I error rate, the significance level was set to 0.05.

Null simulations based on smallpox vaccine study

In addition to the power and type I error rate estimates based on the simulated data, we also investigated the possible impact of gene size and gene set size on the type I error rate in GSA of RNA-Seq data. Using the stimulated group of samples in the smallpox study, we permuted the high (H) and low (L) responder status 100 times to generate 100 “null” data sets with no association (beyond that of chance) between response and level of gene expression. In doing so, we keep the structure of the RNA-Seq data intact to preserve the correlation structure between the gene expression levels. For each null data set, we then completed GSA with the GM for various soft truncation threshold (STT) values for the 200 KEGG GSs. In completing the gene level association analyses, the same gene-specific dispersion estimate was used for all 100 data sets. We then compared the type I error rates between the various sizes of gene sets (number of genes), average gene length in the GS, sum of all the gene lengths within a GS, and number of genes in a gene set with gene size larger than the 75% of all gene lengths.

Gene set analysis method for RNA-Seq data

Gene-level assessment

Prior to the completion of the self-contained GSA with the GM, the gene-level association p-values must be determined. In contrast to microarray based mRNA data in which relative mRNA expression is measured for pre-defined probe sets via fluorescence, RNA-Seq experiments measure the gene expression levels from the total number of reads that fall into the exons of a gene. To assess the significance of each gene with the outcome (i.e., differential gene expression analysis), we used a generalized linear model that assumed a Negative Binomial distribution23. The Negative Binomial distribution is appropriate for count data where within-subject technical variation follows a Poisson distribution with subject-specific mean λ, and the between-subject biological variation of λ follows a Gamma distribution. In addition, we assumed that the nature of over-dispersion differs across genes. Thus, an Empirical Bayes-like moderated test implemented in the R package edgeR was used, in which gene specific dispersion parameters were estimated with “shrinkage” of estimates to the overall mean using a quantile-adjusted conditional maximum likelihood method, scaled by the 75%-tile242526. Genes were removed from analysis if they had low coverage (i.e., average count ≤5).

Gene set assessment

The GSA is then completed with the application of the GM to the gene-level association p-values produced from the negative binomial testing framework. The GM is based on summing p-values transformed using an inverse Gamma (ω, 1) transformation. For a particular shape parameter ω, the test statistic is defined as , where G−1 is the inverse of a Gamma (ω, 1) cumulative distribution function14. The varying of ω results in different transformations applied to the p-values, resulting in more emphasis being given to p-values below a particular threshold, referred to as the STT. The STT is controlled by the shape parameter ω, where 14. When ω is 1, the GM becomes equivalent to Fisher's method (FM) with a STT value of 1/e. Non-independence of gene-specific p-values due to correlation of expression of genes in a gene set can lead to departures of p-values from the expected Uniform(0,1) distribution under the null hypothesis. Due to this lack of independence between the gene-level p-values, we utilized Monto Carlo estimation of the test-statistic T's null distribution to compute empirical or “non-parametric” based p-values for each GS1527. In doing so, the phenotypic variable is randomly permuted, preserving the correlation structure in the RNA-seq gene expression count data. Using the permuted data, differential expression analysis for each gene was then computed using edgeR, followed by the determination of the GSA T test statistic. This process was repeated many times (e.g. 1,000 times), producing an empirical distribution of the test statistic T. The proportion of permutations in which the test statistic was smaller than the observed data test statistic was the empirical estimate of the GS p-value. To visualize the overlap of the genes within the various GSs, hierarchical clustering was completed using a distance measure between GSs defined as 1 − τ, where τ represents the average proportion of genes shared between the GSs. For the smallpox vaccine study and the simulation studies, GSA was completed using the GM with six different STTs ranging from 0.01 to 1.0 (i.e., Fisher's method). We also compared the results from the GM to methods assessed in Fridley et al (2010)13: global model with fixed effects (GMFE); global model with random effects (GMRE)12; tail strength (TS)28; Kolmogorov-Smirnov (KS); and a principal component approach with either the top k principal components needed to explain 80% (PCA80) of the variation in the gene expression values within the gene set, the top principal component (PCA1) or top five principal components (PCA1.5).

Author Contributions

B.L.F. conceived of the study, made the figures and tables, and wrote the majority of the manuscript text. G.D.J. and D.E.G. completed all statistical analyses outlined in the manuscript. R.B.K., A.L.O., D.E.G. and G.A.P. completed the sequencing study for the smallpox vaccine immunogenetics study. All authors reviewed and edited the manuscript text.
  25 in total

1.  Insights into colon cancer etiology via a regularized approach to gene set analysis of GWAS data.

Authors:  Lin S Chen; Carolyn M Hutter; John D Potter; Yan Liu; Ross L Prentice; Ulrike Peters; Li Hsu
Journal:  Am J Hum Genet       Date:  2010-06-11       Impact factor: 11.025

2.  Pathway-based approaches for analysis of genomewide association studies.

Authors:  Kai Wang; Mingyao Li; Maja Bucan
Journal:  Am J Hum Genet       Date:  2007-12       Impact factor: 11.025

3.  Common SNPs/haplotypes in IL18R1 and IL18 genes are associated with variations in humoral immunity to smallpox vaccination in Caucasians and African Americans.

Authors:  Iana H Haralambieva; Inna G Ovsyannikova; Neelam Dhiman; Richard B Kennedy; Megan O'Byrne; V Shane Pankratz; Robert M Jacobson; Gregory A Poland
Journal:  J Infect Dis       Date:  2011-08-01       Impact factor: 5.226

4.  Genome-wide gene-set analysis for identification of pathways associated with alcohol dependence.

Authors:  Joanna M Biernacka; Jennifer Geske; Gregory D Jenkins; Colin Colby; David N Rider; Victor M Karpyak; Doo-Sup Choi; Brooke L Fridley
Journal:  Int J Neuropsychopharmacol       Date:  2012-05-02       Impact factor: 5.176

5.  Human leukocyte antigen genotypes in the genetic control of adaptive immune responses to smallpox vaccine.

Authors:  Inna G Ovsyannikova; Robert A Vierkant; V Shane Pankratz; Robert M Jacobson; Gregory A Poland
Journal:  J Infect Dis       Date:  2011-06-01       Impact factor: 5.226

6.  A map of human genome variation from population-scale sequencing.

Authors:  Gonçalo R Abecasis; David Altshuler; Adam Auton; Lisa D Brooks; Richard M Durbin; Richard A Gibbs; Matt E Hurles; Gil A McVean
Journal:  Nature       Date:  2010-10-28       Impact factor: 49.962

7.  Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments.

Authors:  James H Bullard; Elizabeth Purdom; Kasper D Hansen; Sandrine Dudoit
Journal:  BMC Bioinformatics       Date:  2010-02-18       Impact factor: 3.169

8.  Use of the gamma method for self-contained gene-set analysis of SNP data.

Authors:  Joanna M Biernacka; Gregory D Jenkins; Liewei Wang; Ann M Moyer; Brooke L Fridley
Journal:  Eur J Hum Genet       Date:  2011-12-14       Impact factor: 4.246

9.  Mapping copy number variation by population-scale genome sequencing.

Authors:  Ryan E Mills; Klaudia Walter; Chip Stewart; Robert E Handsaker; Ken Chen; Can Alkan; Alexej Abyzov; Seungtai Chris Yoon; Kai Ye; R Keira Cheetham; Asif Chinwalla; Donald F Conrad; Yutao Fu; Fabian Grubert; Iman Hajirasouliha; Fereydoun Hormozdiari; Lilia M Iakoucheva; Zamin Iqbal; Shuli Kang; Jeffrey M Kidd; Miriam K Konkel; Joshua Korn; Ekta Khurana; Deniz Kural; Hugo Y K Lam; Jing Leng; Ruiqiang Li; Yingrui Li; Chang-Yun Lin; Ruibang Luo; Xinmeng Jasmine Mu; James Nemesh; Heather E Peckham; Tobias Rausch; Aylwyn Scally; Xinghua Shi; Michael P Stromberg; Adrian M Stütz; Alexander Eckehart Urban; Jerilyn A Walker; Jiantao Wu; Yujun Zhang; Zhengdong D Zhang; Mark A Batzer; Li Ding; Gabor T Marth; Gil McVean; Jonathan Sebat; Michael Snyder; Jun Wang; Kenny Ye; Evan E Eichler; Mark B Gerstein; Matthew E Hurles; Charles Lee; Steven A McCarroll; Jan O Korbel
Journal:  Nature       Date:  2011-02-03       Impact factor: 49.962

10.  Transcriptomic profiles of high and low antibody responders to smallpox vaccine.

Authors:  R B Kennedy; A L Oberg; I G Ovsyannikova; I H Haralambieva; D Grill; G A Poland
Journal:  Genes Immun       Date:  2013-04-18       Impact factor: 2.676

View more
  11 in total

Review 1.  Statistical genomics in rare cancer.

Authors:  Farnoosh Abbas-Aghababazadeh; Qianxing Mo; Brooke L Fridley
Journal:  Semin Cancer Biol       Date:  2019-08-19       Impact factor: 15.707

Review 2.  The personal touch: strategies toward personalized vaccines and predicting immune responses to them.

Authors:  Richard B Kennedy; Inna G Ovsyannikova; Nathaniel D Lambert; Iana H Haralambieva; Gregory A Poland
Journal:  Expert Rev Vaccines       Date:  2014-04-04       Impact factor: 5.217

3.  Comparative evaluation of gene set analysis approaches for RNA-Seq data.

Authors:  Yasir Rahmatallah; Frank Emmert-Streib; Galina Glazko
Journal:  BMC Bioinformatics       Date:  2014-12-05       Impact factor: 3.169

4.  Gene signatures associated with adaptive humoral immunity following seasonal influenza A/H1N1 vaccination.

Authors:  I G Ovsyannikova; H M Salk; R B Kennedy; I H Haralambieva; M T Zimmermann; D E Grill; A L Oberg; G A Poland
Journal:  Genes Immun       Date:  2016-08-18       Impact factor: 2.676

5.  Ontogeny-related pharmacogene changes in the pediatric liver transcriptome.

Authors:  Richard Meier; Chengpeng Bi; Roger Gaedigk; Daniel P Heruth; Shui Qing Ye; J Steven Leeder; Brooke L Fridley
Journal:  Pharmacogenet Genomics       Date:  2018-03       Impact factor: 2.089

6.  GSAASeqSP: a toolset for gene set association analysis of RNA-Seq data.

Authors:  Qing Xiong; Sayan Mukherjee; Terrence S Furey
Journal:  Sci Rep       Date:  2014-09-12       Impact factor: 4.379

7.  Gene set analysis approaches for RNA-seq data: performance evaluation and application guideline.

Authors:  Yasir Rahmatallah; Frank Emmert-Streib; Galina Glazko
Journal:  Brief Bioinform       Date:  2015-09-04       Impact factor: 11.622

8.  Whole Transcriptome Profiling Identifies CD93 and Other Plasma Cell Survival Factor Genes Associated with Measles-Specific Antibody Response after Vaccination.

Authors:  Iana H Haralambieva; Michael T Zimmermann; Inna G Ovsyannikova; Diane E Grill; Ann L Oberg; Richard B Kennedy; Gregory A Poland
Journal:  PLoS One       Date:  2016-08-16       Impact factor: 3.240

9.  Gene signatures related to HAI response following influenza A/H1N1 vaccine in older individuals.

Authors:  Inna G Ovsyannikova; Ann L Oberg; Richard B Kennedy; Michael T Zimmermann; Iana H Haralambieva; Krista M Goergen; Diane E Grill; Gregory A Poland
Journal:  Heliyon       Date:  2016-05-06

10.  DOT: Gene-set analysis by combining decorrelated association statistics.

Authors:  Olga A Vsevolozhskaya; Min Shi; Fengjiao Hu; Dmitri V Zaykin
Journal:  PLoS Comput Biol       Date:  2020-04-14       Impact factor: 4.475

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.