Literature DB >> 28978191

Incorporation of Biological Knowledge Into the Study of Gene-Environment Interactions.

Marylyn D Ritchie, Joe R Davis, Hugues Aschard, Alexis Battle, David Conti, Mengmeng Du, Eleazar Eskin, M Daniele Fallin, Li Hsu, Peter Kraft, Jason H Moore, Brandon L Pierce, Stephanie A Bien, Duncan C Thomas, Peng Wei, Stephen B Montgomery.

Abstract

A growing knowledge base of genetic and environmental information has greatly enabled the study of disease risk factors. However, the computational complexity and statistical burden of testing all variants by all environments has required novel study designs and hypothesis-driven approaches. We discuss how incorporating biological knowledge from model organisms, functional genomics, and integrative approaches can empower the discovery of novel gene-environment interactions and discuss specific methodological considerations with each approach. We consider specific examples where the application of these approaches has uncovered effects of gene-environment interactions relevant to drug response and immunity, and we highlight how such improvements enable a greater understanding of the pathogenesis of disease and the realization of precision medicine.

Entities: Chemical

Keywords: data integration; functional genomics; gene-environment interaction; model organisms

Mesh：

Year: 2017 PMID： 28978191 PMCID： PMC5860556 DOI： 10.1093/aje/kwx229

Source DB: PubMed Journal: Am J Epidemiol ISSN： 0002-9262 Impact factor: 5.363

In the quest for the discovery of genetic and environmental risk factors associated with common, complex disease risk, researchers have largely focused on either the genome or “the exposome,” the multitude of environmental factors affecting the individual's health. To identify genetic risk factors, genetics researchers have used a variety of study design techniques, including family-based linkage studies, candidate-gene association studies, and more recently genome-wide association studies (GWASs). Through unbiased, genome-wide scans, GWASs have identified thousands of common genetic variants associated with risk for common diseases (1), some of which highlight new biological pathways. However, the effects of associated variants are typically small and account for only a small proportion of the estimated heritability (2, 3). To identify environmental risk factors, epidemiologists have applied diverse study designs ranging from observational studies of the environment—investigating factors outside of the control of the individual (air pollution, water contaminants, etc.)—to experimental studies of modifiable risks, such as nutritional interventions. Classical studies of single environmental factors, such as radon and smoking, have greatly advanced our understanding of health (4). However, in recent years, large-scale environmental screening projects have emerged to enable simultaneous study of multiple environmental factors, or the exposome. These large-scale investigations are subject to many of the challenges of “big data” research, including high correlation among study variables, multiple testing corrections, and missing data (5). The risk of common diseases is often due to a complex interplay of the genome and the exposome; however, the degree to which genetic versus environmental factors alter risk varies greatly by disease. For example, in certain subtypes of lung cancer, carcinogens in tobacco smoke have such a strong influence on disease risk that the effects of genetic variation pale in comparison (6). Conversely, a familial subtype of breast cancer, resulting from mutations in BRCA1/BRCA2 genes, is predominantly explained by the genetic component of risk (7). Given that individual variability in both genes and environment can influence disease risk, joint analysis of the genome and the exposome, as well as their potential interactions, will provide better insights into disease etiology. However, with tens of millions of variants and thousands of measured environmental variables, the challenge is identifying how to do so effectively. Publicly funded research has provided a wealth of genetic and environmental data. The All of Us Research Program announced in 2015 aims to further support data-driven science by building a national research cohort of 1 million or more US participants with genetic and environmental data (https://www.nih.gov/research-training/allofus-research-program). As these types of resources grow, the challenges of big-data analysis are a significant consideration for genomic and exposome research. Analyzing data sets that include both types of data only exacerbate these issues further. GWASs have had remarkable success in combining analyses across diverse studies to improve power. However, for exposome studies, challenges remain in unifying measurements and defining analytical procedures. When such analyses can be achieved, several challenges emerge. These issues include computational complexity issues, a very large multiple-testing burden, big p small n (many more variables than individuals/samples), and often sparse data matrices (due to missing data). In order to deal with these issues, many approaches perform some form of data reduction or filtering prior to analysis to increase statistical power to detect interactions. One strategy for data reduction is to use genetic association data to identify genetic risk variants that are then coupled with epidemiologic/environmental data. This strategy depends on the assumption that the genes influencing risk through interactions with environmental factors will also show significant independent/marginal association with disease. Thus, a 2-step approach can be taken in which stage 1 is a genome-wide scan of single nucleotide polymorphism (SNP) associations to identify a smaller set of SNPs passing a P value threshold to carry forward. In stage 2, the prioritized variant list is tested for SNP-environment interaction with the available environmental factors (8). This makes an important assumption about the relevance of the marginal SNP effect, but this strategy can be quite powerful when that assumption is met. Note that for quantitative outcomes, other criteria have been proposed to filter SNPs at step 1, including, for example, a test of heterogeneity of variance by genotypic classes (9). Another approach to improving the power of interaction studies is to test for association with underlying quantitative traits for diseases that have a clear genetic component, referred to as endophenotypes. In some situations, such endophenotypes may be better measured and representative of the mechanistic pathway toward disease risk. For example, lipid profiles or glucose tolerance tests may be more appropriate for certain genetic association tests because they are closer to gene action than the disease outcome of heart disease or diabetes (10). Much like using marginal SNP association tests for selecting SNPs, by using endophenotypes that are closer to the molecular function of the gene, we will have more power to filter for potentially functional SNPs in subsequent gene-environment interaction (G×E) analyses. As we will discuss below, molecular endophenotypes such as gene expression may have sufficient power to filter environmental factors as well. Finally, some studies have begun using prior biological knowledge to reduce the genome down to a set of candidate genes for G×E analyses. Here, there are a number of strategies for gene selection, each based on a set of assumptions: Evidence in the literature for variants in the gene region associated with the disease of interest Evidence in the literature of the gene associated with environmental factors of interest Genes related to the pathways where environmental factors may play a role Variants that are positioned in functional regions of the genome Use of public databases of G×E relationships and information about regulatory regions of the genome to filter the candidate genes and environmental factors This is just a short list of options to reduce the genome to a smaller list of genes to test for G×Es. Such options are complemented by the use of model organism studies, which have several advantages for analyzing G×Es because environmental exposures can be carefully controlled and the genetic structure of the study can be leveraged to improve power (discussed further in McAllister et al. (11)). A variety of model organisms have been used for discovering G×Es (12) including yeast (13), Drosophila (14), and mouse (15, 16). Compared to human G×E studies, model organism studies allow the measurement of genetically similar individuals across multiple distinct environments. Considerable work has been done in this area, and it is outside the scope of this review. Thus, we focus our review on strategies that take advantage of molecular and cellular endophenotypes and multi-omics data.

USE OF -OMICS TO INFORM G×E DISCOVERY

Molecular and cellular endophenotypes provide a unique opportunity to identify genetic variants responsive to the environment at a more basic, mechanistic level. Unlike a GWAS, which may require tens of thousands of individuals to identify genetic variants associated with a phenotype, comparable studies of molecular and cellular endophenotypes using functional genomics have identified abundant associations with only dozens of individuals. This increased power to identify functional genetic loci has culminated in a wide diversity of quantitative trait studies for functional genomics data (or -omics), including epigenomes, methylomes, proteomes, and transcriptomes. Deciphering the role of these functional loci across human tissues has relied upon epigenome maps generated within large-scale projects where the data are publicly available, such as ENCODE (17) and the NIH Epigenomics Roadmap (18), in combination with expression quantitative trait locus (eQTL) studies from projects such as Multiple Tissue Human Expression Resource (MuTHER) (19) and Genotype-Tissue Expression (GTEx) (20). Similar to using a marginal association test to prioritize important SNPs, epigenomic data (chromatin immunoprecipitation assays with sequencing (ChIP-seq), DNase I hypersensitive site sequencing (DNaseI-seq), and assay for transposase-accessible chromatin using sequencing (ATAC-seq)) can be used to identify enhancers and other regulatory elements in the genome and, subsequently, to prioritize variants positioned in those regions. Therefore, a straightforward application of -omics data is to identify responsive elements or variants at a molecular level in order to select candidate variants to test in G×E analyses. In recent years, several studies have taken advantage of the increased power of molecular studies to map G×E effects. Barreiro et al. (21) mapped eQTLs in primary dendritic cells from 65 individuals before and after infection with Mycobacterium tuberculosis and identified 198 response eQTLs specific to either condition. This study demonstrated that nonnegligible numbers of G×E effects exist in the context of an infection and that mapping these variants and genes can be accomplished with limited numbers of individuals in well-controlled in vitro assays. Following this work, multiple studies have perturbed primary blood cells to elicit immune-response eQTLs (22–25). Beyond gene expression, response QTLs using chromatin accessibility assays have identified that immune-response eQTLs can be foreshadowed in the naïve state by regulatory variants influencing chromatin accessibility (26). This observation provides new opportunity to identify primed response variants that may underlie unexplained, noncoding, complex trait associations (27). Response eQTLs need not be identified through in vitro perturbations alone; increasingly, studies of the interaction of observational variables such as age, sex, or behavior in cohorts with genetic and functional genomics data have identified G×E variants (28–31). For age- and sex-specific eQTLs, Yao et al. (28) focused on known complex disease-associated variants in a cohort of 5,254 individuals where whole blood gene expression was measured. They identified 10 age-specific and 14 sex-specific eQTLs, highlighting a notable scarcity of variants with strong effects given either variable. In addition to these variables, behaviors such as smoking, medication use, and exercise have been studied for G×E effects on gene expression; Knowles et al. (30) recently surveyed these variables using a novel approach for allele-specific expression to identify G×E effects for each environment. In both this study and Zhernakova et al. (31), investigators adopted the use of “proxy environments” to model unobserved perturbations, such as an individual's cell-type composition or infection status in discovery of interaction effects, providing a means to test previously unmeasured environments. Furthermore, the impact of genetics and in utero environments—including maternal smoking, birth weight, and birth order—have been observed to have large effects on an individual's methylome at birth (32, 33).

METHODOLOGICAL AND STUDY DESIGN ISSUES WITH THE USE OF -OMICS DATA

Use of -omics data in the discovery of G×Es presents unique methodological issues. First, there is the issue of tissue specificity. The identity of the most effective tissues for interaction testing is not always clear. For instance, in pursuing genetic contributors to adverse drug reactions, we could analyze either drug-metabolizing tissues such as the liver or the tissues where adverse effects are presented. Obtaining these tissues can be quite challenging, and even if samples can be obtained, we must consider whether such effects can be more easily and less invasively discovered from accessible patient tissues such as skin or blood. Indeed, in the study of adverse drug reactions to statins, the link to the GATM gene was observed through analysis of statin-response eQTLs in lymphoblastoid cells rather than muscle or liver (34). Determining the extent of tissue specificity of genetic effects is still a major challenge. Studies such as the Genotype-Tissue Expression project have begun to comprehensively identify tissue-shared and tissue-specific effects (20). Designing G×E studies in the correct tissues will further benefit from increasingly rich epigenomic maps, such as ENCODE and the NIH Epigenomics Roadmap. By identifying the regions that are differentially accessible in response to an environmental perturbation in a few tissues, one can use these large maps to gain insight into the relative benefits of G×E testing in different tissues. The problem of statistical power and adequate sample size can significantly impede G×E studies (35). To study the effect of a cellular perturbation in vitro as in the study by Fairfax et al. (22), sample sizes of a few hundred individuals per condition or a few time points may be required. The study design should include -omics analysis (RNA sequencing, DNase-Seq, ATAC-Seq, etc.) at every condition or time point. If measurements are made on the same individuals, then statistical methods need to account for within-person correlation, an adjustment that may decrease power. This design is feasible for in vitro studies. Only recently have in vivo studies begun to track large numbers of individuals with repeated -omics measurements. These emerging studies pose additional challenges. In vivo studies need to account for the potential impact of multiple environments, increased variability in sample collection and measures of the environment, and increased challenges in causal inference. Other methodological challenges facing G×E studies arise largely from the inherent properties of -omics data. Data generated from a specific -omics technology are modeled better by certain distributions than others. Knowledge of the underlying distribution of the data governs the models chosen to detect G×E effects. For example, it has been widely shown that gene expression as measured by RNA sequencing can be adequately modeled by a negative binomial distribution (36). For gene expression measured using microarrays, one might chose a linear model with an interaction term (gene-expression variable × environment). However, for RNA-sequencing data, a generalized linear model accounting for overdispersion in the count data may be preferable. Another consideration is that -omics data encounters challenges from known and unknown confounders. Known confounders may include sex, age, or batch. In fact, known biological confounders may appear as environmental effects that we wish to test in a subset of our analysis. Many methods have been developed to remove these unwanted effects in the context of eQTL studies from simple linear regression and/or principal components analysis to more advanced techniques (probabilistic estimation of expression residuals (37), HCP (38), svaseq (39)). For in vitro studies, these methods are typically applied to each condition or time point separately. For in vivo studies, one must take care not to remove the environmental effect to be tested. Given the diversity of cellular perturbations that can be assayed and studied for G×E effects, an ongoing bottleneck is selecting specific environments to assay. Moyerbrailean et al. (40) demonstrated an effect pipeline for assessing G×E effects across 250 environments (50 perturbations in 5 cell types) where all perturbations were tested using low-pass transcriptome sequencing. Only those perturbations with significant differences were then carried forward for deeper sequencing and assessment of context-specific, allele-specific expression. Using this approach, they identified 215 genes with G×Es, of which nearly 50% were implicated in previous GWASs, thereby providing a rich resource of candidate hypotheses for further investigation. Ultimately, estimating the degree to which genetics and environment contribute to molecular and cellular phenotypes poses a principal challenge to G×E study design. Multiple gene-expression studies have used family relationships to determine the relative contributions of genetics and environment, reporting average heritability from 0.1 to 0.26 (41, 42). The majority of studies calculate heritability from total expression levels, but allele-specific expression is increasingly complementing these estimates. Buil et al. (41) used allele-specific expression measured in a twin study of approximately 400 female twin pairs with RNA- sequencing data from different tissues (fat, lymphoblastoid cell lines, skin, and blood) to measure the relative contribution of genetics and environment. In support of previous studies, they found little evidence for shared environmental effects. However, they found significant evidence of unique or individual-specific environmental effects, explaining approximately 10%–20% of expression variation in each of the tissues, on par with cis or cis-trans effects, and they further estimated that 38%–49% of variance observed in allele-specific expression was not explained by additive effects and was due to gene-gene or G×E effects. Despite this, limitations on power influenced the discovery of specific G×E associations, highlighting that either larger studies or candidate studies were required.

INTEGRATIVE AND PATHWAY-BASED STRATEGIES

Genes do not act in isolation; they function through physical, metabolic, and chemical reactions with other genes in large pathways and networks. Yet when we look for associations between genes and disease outcomes or even G×Es, we tend to treat each gene independently. If we embrace the complexity and relationships between genes in pathways, we may increase our power to detect and interpret our findings. Many researchers describe pathway approaches as being antithetical to the unbiased genome-wide perspective. Thomas (43) describes ways that these powerful pathway-based approaches can further enhance power and insights rather than replace the GWAS approach. While taking an agnostic GWAS approach to generate the genetic data might be beneficial to ensure that all regions of the genome are explored, these data can be married with hierarchical modeling strategies that exploit pathway knowledge when we perform analyses of genome-wide data (43). Exploring candidate pathways and using biological knowledge related to the environment in the gene selection process can potentially be a powerful alternative to the current paradigm. Two strategies can be applied. First, as for gene-based and pathway-based analyses of marginal SNP effects, we can perform an agnostic search for enrichment of interaction effects. Indeed, existing methods such as gene-set enrichment analysis (44) only assume genome-wide P values of the test considered (marginal effect, a priori) following a uniform distribution under the null hypothesis of no enrichment. Therefore, these methods can be directly applied to the standard 1-degree-of-freedom test of interaction performed on a genome-wide scale. Because it relies on established methodologies, some groups have already started to apply such strategies. For example, Wei et al. (45) combined gene-based testing with pathway enrichment analyses to look for G×E associations with lung-cancer susceptibility. The second potential strategy is to use pathway information to reduce the search space for interactions, as in candidate-based approaches. To facilitate selection of candidate variants and study of G×Es, the CardioGxE database has extensively curated G×E variants in the literature (46). Further, as discussed in previous sections, a common strategy in G×E screening is to assume SNPs involved in interaction effects also display a marginal effect. Similarly, one can argue that SNPs involved in interactions might be enriched in pathways related to the outcome or the exposure in question. Building on this idea, Rava et al. (47) proposed a strategy to select genes for G×E for associations with asthma (a trait for which earlier G×E studies did not find strong evidence of associations). They selected the canonical pathways that the set of candidate genes belong to and included biological knowledge related to the environment in their gene selection process. Their approach reduced the gene list to a small and focused set for G×E analysis (47). Similar approaches have been adopted for other diseases as well. Huang and Hu (48) used these same strategies to look for G×Es and associations with obesity. Tang et al. (49) performed gene-based gene × smoking interaction analysis, followed by enrichment analysis of nominally interacting genes in canonical biological pathways, and found that the axonal guidance signaling pathway interacted with smoking to modify the risk of pancreatic cancer. Of note, exactly the same pathway was identified to be enriched with recurrent somatic point mutations and copy number aberrations by an exome-sequencing study of pancreatic tumors (50). This study also suggested a negative correlation between somatic mutations in the axonal guidance pathway and smoking—nonsmokers were more likely to have somatic mutations in this pathway than smokers. For the first time, both GWAS and somatic mutation data pointed to the same biological pathway that might interact with smoking in modifying cancer risk. To integrate this type of biological knowledge into an analysis, bioinformatics tools such as Biofilter can be used. Biofilter is a knowledge integration tool developed to allow for annotation, filtering, and model building of genomic data (51, 52). The underlying database of biological knowledge within Biofilter is called the Library of Knowledge Integration (LOKI) and consists of multiple public database sources such as Kyoto Encyclopedia of Genes and Genomes (KEGG) (http://www.genome.jp/kegg/), Gene Ontology (http://www.geneontology.org/), Pfam (http://pfam.xfam.org/), and the GWAS Catalog (https://www.ebi.ac.uk/gwas/). These sources are linked in the Library of Knowledge Integration so that a user can provide a list of either SNPs or genes and get the SNPs annotated by gene, the genes annotated by group (pathway, protein family, etc.), and the lists of genes that belong to specific groups. These annotations can be used to filter gene sets into subsets prior to association analyses. For example, in cataract susceptibility, Hall et al. (53) used Biofilter to identify potential gene-gene interactions in the eMERGE network. This same process could be used for G×E analysis with a candidate gene list based on either the disease or the environmental-factor candidates; the user would provide the candidate gene list, and Biofilter would provide all of the other genes that are linked to the genes in the user's list based on the Library of Knowledge Integration. Tools like Biofilter make performing candidate-pathway, or gene-set, approaches more efficient because they integrate knowledge from multiple public data sources and enable researchers to use all of the sources simultaneously. Because of their novelty and the limited number of applications that have been developed so far, it is challenging to assess the relevance of these approaches. Obviously, their performances rely on the validity of the underlying statistical and biological assumptions, such as valid and correct biological/pathway knowledge. For example, in the later candidate-based approach, substantial gain in power might be achieved if G×E are indeed enriched in outcome/exposure pathways, but might have decreased power otherwise. Also, as in the case of marginal-effects testing of genes and pathways, the potential success of these strategies lies in their ability to cope with multiple causal genes whose effects might be heterogeneous across populations. Focusing on single-variant association signals, as in standard G×E GWAS screening, implicitly assumes the effects of each variant is large enough and homogeneous enough that it can be replicated across populations. However, when effects are small, and variability due to other uncontrolled risk factors is high, gene- and pathway-based tests, which aggregate information, might be more efficient than those using SNPs as the testing unit. Thus, it is anticipated that these pathway-based approaches could increase power by magnitudes, assuming that the knowledge used is accurate. This is a critical assumption. Direct comparisons of the statistical power of the different approaches are not possible at this time. However, as more applications of these approaches are published, we will learn more about the gain in power through the use of these pathway-based strategies.

FUTURE DIRECTIONS AND CONCLUSIONS

We have described several different biological considerations that can be used to improve power and increase flexibility in using prior knowledge and/or multiple data types in G×E analyses. The wealth of biological knowledge that has been discovered over the past two decades is astonishing. It is clearly to our benefit to include this knowledge; this is particularly important because it can help us find biologically meaningful G×Es. Also, this knowledge may be useful to help identify interactions with rare or low-frequency variants that may have been missed previously. An additional important consideration is that of replication of the G×E effects. In genetics, replication of association has become the gold standard (54). Here, to avoid an inflation of false positives, the field looks for replication of the precise result in multiple, independent data sets. This is an important strategy under an assumption where we believe that one particular SNP and one particular exposure measurement are important for the disease of interest. However, if we start to consider gene-based models or pathway models, what is the “model” that we need to replicate? If we see several genes from a particular pathway associating in a G×E with a particular trait of interest in one data set and then 2 different genes from the same pathway associating in a G×E in a second data set, is that replication? This would not be a replication under the traditional definition (the same independent variables combined in the same statistical model with the same direction of effect). We need to consider this apparent contradiction more carefully as we expand analyses to accommodate gene-based or pathway-based approaches. Our ability to integrate information from multi-omics approaches, the literature, and other public knowledge sources provides tremendous power to deal with the challenges inherent in big-data analyses. In the coming years, we expect to see more G×E analyses incorporating biological knowledge–driven strategies similar to those described here. These approaches will continue to evolve and expand as we learn more about the relationships between the genome and the exposome in the architecture of complex traits.

52 in total

1. Personal genomes: The case of the missing heritability.

Authors: Brendan Maher
Journal: Nature Date: 2008-11-06 Impact factor: 49.962

2. Sulfonylurea inadequacy: efficacy of addition of insulin over 6 years in patients with type 2 diabetes in the U.K. Prospective Diabetes Study (UKPDS 57).

Authors: Alex Wright; A C Felix Burden; Richard B Paisey; Carole A Cull; Rury R Holman
Journal: Diabetes Care Date: 2002-02 Impact factor: 19.112

Review 3. Non-small cell lung cancer: current treatment and future advances.

Authors: Cecilia Zappa; Shaker A Mousa
Journal: Transl Lung Cancer Res Date: 2016-06

4. Gene-gene and gene-environment interactions detected by transcriptome sequence analysis in twins.

Authors: Alfonso Buil; Andrew Anand Brown; Tuuli Lappalainen; Ana Viñuela; Matthew N Davies; Hou-Feng Zheng; J Brent Richards; Daniel Glass; Kerrin S Small; Richard Durbin; Timothy D Spector; Emmanouil T Dermitzakis
Journal: Nat Genet Date: 2014-12-01 Impact factor: 38.330

5. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors: Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal: Proc Natl Acad Sci U S A Date: 2005-09-30 Impact factor: 11.205

6. Environmental epidemiology: challenges and opportunities.

Authors: J Pekkanen; N Pearce
Journal: Environ Health Perspect Date: 2001-01 Impact factor: 9.031

7. CardioGxE, a catalog of gene-environment interactions for cardiometabolic traits.

Authors: Laurence D Parnell; Britt A Blokker; Hassan S Dashti; Paula-Dene Nesbeth; Brittany Elle Cooper; Yiyi Ma; Yu-Chi Lee; Ruixue Hou; Chao-Qiang Lai; Kris Richardson; José M Ordovás
Journal: BioData Min Date: 2014-10-26 Impact factor: 2.522

8. A perspective on interaction effects in genetic association studies.

Authors: Hugues Aschard
Journal: Genet Epidemiol Date: 2016-07-07 Impact factor: 2.135

9. Integrative analysis of 111 reference human epigenomes.

Authors: Anshul Kundaje; Wouter Meuleman; Jason Ernst; Misha Bilenky; Angela Yen; Alireza Heravi-Moussavi; Pouya Kheradpour; Zhizhuo Zhang; Jianrong Wang; Michael J Ziller; Viren Amin; John W Whitaker; Matthew D Schultz; Lucas D Ward; Abhishek Sarkar; Gerald Quon; Richard S Sandstrom; Matthew L Eaton; Yi-Chieh Wu; Andreas R Pfenning; Xinchen Wang; Melina Claussnitzer; Yaping Liu; Cristian Coarfa; R Alan Harris; Noam Shoresh; Charles B Epstein; Elizabeta Gjoneska; Danny Leung; Wei Xie; R David Hawkins; Ryan Lister; Chibo Hong; Philippe Gascard; Andrew J Mungall; Richard Moore; Eric Chuah; Angela Tam; Theresa K Canfield; R Scott Hansen; Rajinder Kaul; Peter J Sabo; Mukul S Bansal; Annaick Carles; Jesse R Dixon; Kai-How Farh; Soheil Feizi; Rosa Karlic; Ah-Ram Kim; Ashwinikumar Kulkarni; Daofeng Li; Rebecca Lowdon; GiNell Elliott; Tim R Mercer; Shane J Neph; Vitor Onuchic; Paz Polak; Nisha Rajagopal; Pradipta Ray; Richard C Sallari; Kyle T Siebenthall; Nicholas A Sinnott-Armstrong; Michael Stevens; Robert E Thurman; Jie Wu; Bo Zhang; Xin Zhou; Arthur E Beaudet; Laurie A Boyer; Philip L De Jager; Peggy J Farnham; Susan J Fisher; David Haussler; Steven J M Jones; Wei Li; Marco A Marra; Michael T McManus; Shamil Sunyaev; James A Thomson; Thea D Tlsty; Li-Huei Tsai; Wei Wang; Robert A Waterland; Michael Q Zhang; Lisa H Chadwick; Bradley E Bernstein; Joseph F Costello; Joseph R Ecker; Martin Hirst; Alexander Meissner; Aleksandar Milosavljevic; Bing Ren; John A Stamatoyannopoulos; Ting Wang; Manolis Kellis
Journal: Nature Date: 2015-02-19 Impact factor: 69.504

10. Genomic analyses with biofilter 2.0: knowledge driven filtering, annotation, and model development.

Authors: Sarah A Pendergrass; Alex Frase; John Wallace; Daniel Wolfe; Neerja Katiyar; Carrie Moore; Marylyn D Ritchie
Journal: BioData Min Date: 2013-12-30 Impact factor: 2.522

14 in total

Review 1. Opportunities and Challenges for Environmental Exposure Assessment in Population-Based Studies.

Authors: Chirag J Patel; Jacqueline Kerr; Duncan C Thomas; Bhramar Mukherjee; Beate Ritz; Nilanjan Chatterjee; Marta Jankowska; Juliette Madan; Margaret R Karagas; Kimberly A McAllister; Leah E Mechanic; M Daniele Fallin; Christine Ladd-Acosta; Ian A Blair; Susan L Teitelbaum; Christopher I Amos
Journal: Cancer Epidemiol Biomarkers Prev Date: 2017-07-14 Impact factor: 4.254

Review 2. Environmental neuroscience linking exposome to brain structure and function underlying cognition and behavior.

Authors: Feng Liu; Jiayuan Xu; Lining Guo; Wen Qin; Meng Liang; Gunter Schumann; Chunshui Yu
Journal: Mol Psychiatry Date: 2022-07-05 Impact factor: 15.992

3. Genome-Wide Gene-Diabetes and Gene-Obesity Interaction Scan in 8,255 Cases and 11,900 Controls from PanScan and PanC4 Consortia.

Authors: Hongwei Tang; Lai Jiang; Donghui Li; Peter Kraft; Peng Wei; Rachael Z Stolzenberg-Solomon; Alan A Arslan; Laura E Beane Freeman; Paige M Bracci; Paul Brennan; Federico Canzian; Mengmeng Du; Steven Gallinger; Graham G Giles; Phyllis J Goodman; Charles Kooperberg; Loïc Le Marchand; Rachel E Neale; Xiao-Ou Shu; Kala Visvanathan; Emily White; Wei Zheng; Demetrius Albanes; Gabriella Andreotti; Ana Babic; William R Bamlet; Sonja I Berndt; Amanda Blackford; Bas Bueno-de-Mesquita; Julie E Buring; Daniele Campa; Stephen J Chanock; Erica Childs; Eric J Duell; Charles Fuchs; J Michael Gaziano; Michael Goggins; Patricia Hartge; Manal H Hassam; Elizabeth A Holly; Robert N Hoover; Rayjean J Hung; Robert C Kurtz; I-Min Lee; Núria Malats; Roger L Milne; Kimmie Ng; Ann L Oberg; Irene Orlow; Ulrike Peters; Miquel Porta; Kari G Rabe; Nathaniel Rothman; Ghislaine Scelo; Howard D Sesso; Debra T Silverman; Ian M Thompson; Anne Tjønneland; Antonia Trichopoulou; Jean Wactawski-Wende; Nicolas Wentzensen; Lynne R Wilkens; Herbert Yu; Anne Zeleniuch-Jacquotte; Laufey T Amundadottir; Eric J Jacobs; Gloria M Petersen; Brian M Wolpin; Harvey A Risch; Nilanjan Chatterjee; Alison P Klein
Journal: Cancer Epidemiol Biomarkers Prev Date: 2020-06-16 Impact factor: 4.254

Review 4. Lessons Learned From Past Gene-Environment Interaction Successes.

Authors: Beate R Ritz; Nilanjan Chatterjee; Montserrat Garcia-Closas; W James Gauderman; Brandon L Pierce; Peter Kraft; Caroline M Tanner; Leah E Mechanic; Kimberly McAllister
Journal: Am J Epidemiol Date: 2017-10-01 Impact factor: 5.363

5. Multidimensional molecular measurements-environment interaction analysis for disease outcomes.

Authors: Yaqing Xu; Mengyun Wu; Shuangge Ma
Journal: Biometrics Date: 2021-07-02 Impact factor: 1.701

Review 6. Opportunities for Gene and Environment Research in Cancer: An Updated Review of NCI's Extramural Grant Portfolio.

Authors: Armen A Ghazarian; Naoko Ishibe Simonds; Gabriel Y Lai; Leah E Mechanic
Journal: Cancer Epidemiol Biomarkers Prev Date: 2020-12-15 Impact factor: 4.090

7. A Systematic Analysis of Interactions between Environmental Risk Factors and Genetic Variation in Susceptibility to Colorectal Cancer.

Authors: Maria Timofeeva; Evropi Theodoratou; Tian Yang; Xue Li; Susan M Farrington; Malcolm G Dunlop; Harry Campbell
Journal: Cancer Epidemiol Biomarkers Prev Date: 2020-04-01 Impact factor: 4.254

8. Current Challenges and New Opportunities for Gene-Environment Interaction Studies of Complex Diseases.

Authors: Kimberly McAllister; Leah E Mechanic; Christopher Amos; Hugues Aschard; Ian A Blair; Nilanjan Chatterjee; David Conti; W James Gauderman; Li Hsu; Carolyn M Hutter; Marta M Jankowska; Jacqueline Kerr; Peter Kraft; Stephen B Montgomery; Bhramar Mukherjee; George J Papanicolaou; Chirag J Patel; Marylyn D Ritchie; Beate R Ritz; Duncan C Thomas; Peng Wei; John S Witte
Journal: Am J Epidemiol Date: 2017-10-01 Impact factor: 5.363

Review 9. Another Round of "Clue" to Uncover the Mystery of Complex Traits.

Authors: Shefali Setia Verma; Marylyn D Ritchie
Journal: Genes (Basel) Date: 2018-01-25 Impact factor: 4.096

Review 10. The Evolving Field of Genetic Epidemiology: From Familial Aggregation to Genomic Sequencing.

Authors: Priya Duggal; Christine Ladd-Acosta; Debashree Ray; Terri H Beaty
Journal: Am J Epidemiol Date: 2019-12-31 Impact factor: 4.897