Literature DB >> 21184165

Genetical genomic analysis of complex phenotypes using the PhenoGen website.

Beth Bennett1, Laura M Saba, Cheryl K Hornbaker, Katerina J Kechris, Paula Hoffman, Boris Tabakoff.   

Abstract

Our laboratory has developed an online interactive resource called PhenoGen ( http://phenogen.ucdenver.edu ) which provides an archive of brain and other organ gene expression data from a panel of 20 common inbred mouse strains, and three recombinant inbred (RI) panels (two mouse and one rat). DNA microarray data can also be uploaded to the site where numerous analytical tools can be implemented. An important advantage to the archived data is that each array represents data from a single animal and each strain was sampled 4-7 times, providing an estimate of genetic variance (heritability) of individual transcript levels. These panels also allow genetic mapping of expression QTLs. Overlap of eQTLs with phenotypic QTLs provides a powerful approach to candidate gene identification. These methods are briefly described here and we encourage the use of our site for both scientific discovery and as a teaching tool in quantitative genetics.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21184165      PMCID: PMC3121939          DOI: 10.1007/s10519-010-9427-0

Source DB:  PubMed          Journal:  Behav Genet        ISSN: 0001-8244            Impact factor:   2.805


The advent of DNA microarray technology allowed simultaneous, unbiased measurement of levels of most mRNA transcripts. These measures can be used to address many questions in biology, since variations in the expression levels of the mRNA, i.e., amount of mRNA available for translation, have been identified as etiologic factors for behavioral, physiological, or disease phenotypes (Schadt et al. 2005). Differential expression of genes in selected strains, or treated and control tissues, can be used to identify candidate genes influencing a phenotype. However, the set of candidate genes may contain a large proportion of false positives. One method for reducing the number of false positives is to require that the area of the genome that controls the transcription of a candidate gene also controls variation in the phenotype of interest. To do this, transcript levels measured in segregating populations are treated as quantitative traits. Using genetic marker information, chromosomal regions that regulate transcript levels are mapped as expression quantitative trait loci (eQTLs). This approach, “genetical genomics”, has been successfully used to construct regulatory gene transcription networks in various organisms (Schadt et al. 2005; Chesler et al. 2005; Lu et al. 2008) and extended to identify candidate genes contributing to complex behavioral phenotypic traits (Saba et al. 2006; Tabakoff et al. 2008). The identification of common QTLs regulating both the phenotype of interest (pQTLs), and the expression of genes whose transcript levels correlate with the phenotype, provides a strong basis for candidate gene identification (Chesler et al. 2005; Hu et al. 2008; Saba et al. 2006; Tabakoff et al. 2008). The use of microarrays, though attractive, is out of reach for many researchers without access to costly facilities, supplies, and technicians required to process tissue for gene expression and interpret results. We have developed an online, interactive resource (http://phenogen.ucdenver.edu) that enables researchers to mine data from microarray expression experiments and implement the genetical genomic/phenotypic approach described above. Users of the website can access microarray data available on the website (or upload their own microarray data), then use a variety of analytical methods, some of which we describe here for candidate gene searches using their own or published data on quantitative traits in panels of animals for which our site stores genetic and genomic data. We have previously demonstrated the use of the PhenoGen website to identify candidate genes influencing sensitivity to morphine analgesia (Hoffman et al. 2010) and ethanol drinking (Saba et al. 2006; Tabakoff et al. 2008, 2009). The PhenoGen site allows access to and manipulation of whole brain gene expression data from the BXD recombinant inbred (RI) mouse panel (32 strains and 172 arrays), the HXB/BXH RI rat panel (described by Printz et al. 2003) (22 strains and 139 arrays), and 20 commonly used inbred strains of mice (90 arrays). The site also has data available on transcript expression on mRNA at the exon level for whole brain from the LXS RI mouse panel (describe by Williams et al. 2004) and for heart RNA from the HXB/BXH rat panel (mRNA data from liver, intestine and fat tissue will soon be available). Several important advantages of the PhenoGen data and associated analytical tools are: each array represents mRNA generated from tissue from a single animal; data on 4–7 biological replicates are available (i.e., one animal’s RNA per array), allowing calculation of heritability for each transcript and increased power; datasets from the RI panels and the inbred strain set have been normalized and cleaned of ambiguous and non-unique probes as well as probes containing SNPs (see below), but users can carry out their own cleaning and normalization procedures as well; users can create, normalize, and save their own datasets by selecting from available arrays or using their own uploaded data. By registering on the site, users can create private repositories for saving data and analyses; users can implement various ‘filters’ (see below) prior to statistical analysis to reduce the multiple testing burden by integrating relevant prior knowledge. The analysis on the PhenoGen website includes a probe mask that eliminates probes containing known SNPs, and probes that are not targeted to a unique location in their respective genome. This mask has been implemented on the Affymetrix Mouse and Rat Exon arrays and on the Affymetrix Mouse 430 v2 array. Users have many tools for analyzing raw data, including a choice of normalization procedures, several filtering methods for eliminating data from non-informative or noisy probesets, the ability to choose appropriate statistical methods for data analysis, as well as access to comprehensive annotation and literature search capabilities. The use of these PhenoGen tools is described in more detail by Hoffman et al. (2010), the Demo link on the website, and the User’s Manual that can be downloaded from the website. A typical workflow for a genetical genomic/phenomic approach to identifying candidate genes for a complex quantitative trait is described below. To complete these analyses the user must have phenotypic data from at least 5 of the rat or mouse strains from one of the panels on our website, for correlation, or 15 strains for pQTL mapping. Phenotypes can be uploaded or typed directly into a dialog box. Phenotypic measures can be user-generated or obtained from online or literature sources (see Hoffman et al. 2010). Candidate genes are identified through a series of filtering steps: Eliminate transcripts not expressed in brain, i.e., probesets with expression values below detection limits; Retain transcripts with evidence for strong genetic heritability of mRNA levels; Retain transcripts whose expression level is controlled within the same area of the genome (eQTL) that controls the variation in the phenotype (pQTL); Retain transcripts whose expression level significantly correlates with the quantitative phenotype (steps 3 and 4 can be performed in reverse order). The measure of heritability gives insight into the strength of genetic control of transcript expression, i.e., correlation of expression level with phenotype represents a true genetic relationship, rather than an association with environmental variables. Heritability has important implications for the relevance of a transcript/protein for drug targeting or disease characterization. As described in our previous work, the rationale for using the eQTL/pQTL overlap filter is that if a gene contributes to a complex behavior through variation in its expression levels, areas of the genome that regulate those expression levels (eQTLs) should be represented within areas of the genome that contribute to the regulation of the phenotype. Transcripts with eQTL/pQTL overlap are considered to represent likely candidate genes for the particular phenotype (Hu et al. 2008; Tabakoff et al. 2008). The determination of eQTLs for the panels on the website was described in earlier work (Hu et al. 2008; Tabakoff et al. 2008, 2009), and the eQTL locations (obtained using gene expression data from RI strains), including 95% confidence intervals for eQTLs with empirical p values < 0.1, are available on PhenoGen. Finally, for quality control purposes, outlier arrays can be easily and confidently identified by comparison to other arrays within a strain. This check has been done for the ‘public’ datasets of the RI and inbred strain panels, which have been normalized and probe-masked, and can be accessed by a single click on the website. Because of the multitude of filtering steps involved in the approach described here, the probability of missing a “true” candidate gene is increased. However, the probability of including a “false” candidate gene in the final set of candidate genes is greatly reduced. For example, we examined differential expression in three strain pairs, each of which shows large differences in ethanol consumption (B6 and D2; ILS and ISS; HAP and LAP), and used the method described above to search for candidate genes influencing this phenotype. One probeset passed all filters and showed a significant correlation with ethanol intake, Gnb1 (guanine nucleotide binding protein, β1 subunit; unpubl. data), replicating previous results implicating Gnb1 in drinking (Saba et al. 2006; Tabakoff et al. 2008). A second analysis, using published measurements of sensitivity to morphine-induced analgesia in BXD RI strains (Bergeson et al. 2001) and gene expression data from the same strains identified candidate genes known to affect recycling of opiate receptors and receptor glycosylation, providing a mechanistic explanation for the range in analgesia seen in the BXD (Hoffman et al. 2010). We have previously described some of the differences between functions available on the PhenoGen site and other sites such as GeneNetwork (Hoffman et al. 2010). Briefly, PhenoGen offers the ability for extensive quality control, user-generated datasets, and a broad choice of statistical analysis. PhenoGen is organized primarily to allow for genetical genomic/phenotypic analysis that includes the determination of bQTL/eQTL overlap as a key aspect of candidate gene identification. However, the site also provides other tools for gene expression analysis, such as promoter analyses (oPOSSUM and MEME), and co-expression/clustering analyses, including graphical tools for visualization of these results. We continue to improve the PhenoGen website by adding new genetic/genomic data, adding the latest tools for systems genetics approaches, and improving usability. By the spring of 2011, we will have added brain expression data from 64 LXS strains [derived from ILS × ISS crosses (Williams et al. 2004)], heart, liver, gut, and fat exon expression data from 27 HXB/BXH rat strains, and new genotype information for both of these panels. We also plan to expand our functionality to include analysis of high throughput RNA sequencing data and to implement popular biologic network analysis tools. In addition to the opportunities for in silico analyses using expression data on the website and phenotypic data either collected by the user or extracted from archived datasets (e.g., GeneNetwork or the Mouse Phenome Project), the PhenoGen site offers numerous options for didactic, hands-on experiences with gene arrays in the realm of quantitative genetics for students and all levels of statistical scientists.
  11 in total

1.  Quantitative trait loci influencing morphine antinociception in four mapping populations.

Authors:  S E Bergeson; M L Helms; L A O'Toole; M W Jarvis; H S Hain; J S Mogil; J K Belknap
Journal:  Mamm Genome       Date:  2001-07       Impact factor: 2.957

2.  Genetic structure of the LXS panel of recombinant inbred mouse strains: a powerful resource for complex trait analysis.

Authors:  Robert W Williams; Beth Bennett; Lu Lu; Jing Gu; John C DeFries; Phyllis J Carosone-Link; Brad A Rikke; John K Belknap; Thomas E Johnson
Journal:  Mamm Genome       Date:  2004-08       Impact factor: 2.957

3.  Complex trait analysis of gene expression uncovers polygenic and pleiotropic networks that modulate nervous system function.

Authors:  Elissa J Chesler; Lu Lu; Siming Shou; Yanhua Qu; Jing Gu; Jintao Wang; Hui Chen Hsu; John D Mountz; Nicole E Baldwin; Michael A Langston; David W Threadgill; Kenneth F Manly; Robert W Williams
Journal:  Nat Genet       Date:  2005-02-13       Impact factor: 38.330

4.  An integrative genomics approach to infer causal associations between gene expression and disease.

Authors:  Eric E Schadt; John Lamb; Xia Yang; Jun Zhu; Steve Edwards; Debraj Guhathakurta; Solveig K Sieberts; Stephanie Monks; Marc Reitman; Chunsheng Zhang; Pek Yee Lum; Amy Leonardson; Rolf Thieringer; Joseph M Metzger; Liming Yang; John Castle; Haoyuan Zhu; Shera F Kash; Thomas A Drake; Alan Sachs; Aldons J Lusis
Journal:  Nat Genet       Date:  2005-06-19       Impact factor: 38.330

5.  Using the Phenogen website for 'in silico' analysis of morphine-induced analgesia: identifying candidate genes.

Authors:  Paula L Hoffman; Beth Bennett; Laura M Saba; Sanjiv V Bhave; Phyllis J Carosone-Link; Cheryl K Hornbaker; Katerina J Kechris; Robert W Williams; Boris Tabakoff
Journal:  Addict Biol       Date:  2010-11-04       Impact factor: 4.280

Review 6.  Genetic Models in Applied Physiology. HXB/BXH rat recombinant inbred strain platform: a newly enhanced tool for cardiovascular, behavioral, and developmental genetics and genomics.

Authors:  Morton P Printz; Martin Jirout; Rebecca Jaworski; Adamu Alemayehu; Vladimir Kren
Journal:  J Appl Physiol (1985)       Date:  2003-06

7.  The genomic determinants of alcohol preference in mice.

Authors:  Boris Tabakoff; Laura Saba; Katherina Kechris; Wei Hu; Sanjiv V Bhave; Deborah A Finn; Nicholas J Grahame; Paula L Hoffman
Journal:  Mamm Genome       Date:  2008-06-19       Impact factor: 2.957

8.  Genetical genomic determinants of alcohol consumption in rats and humans.

Authors:  Boris Tabakoff; Laura Saba; Morton Printz; Pam Flodman; Colin Hodgkinson; David Goldman; George Koob; Heather N Richardson; Katerina Kechris; Richard L Bell; Norbert Hübner; Matthias Heinig; Michal Pravenec; Jonathan Mangion; Lucie Legault; Maurice Dongier; Katherine M Conigrave; John B Whitfield; John Saunders; Bridget Grant; Paula L Hoffman
Journal:  BMC Biol       Date:  2009-10-27       Impact factor: 7.431

9.  Genomic insights into acute alcohol tolerance.

Authors:  Wei Hu; Laura Saba; Katherina Kechris; Sanjiv V Bhave; Paula L Hoffman; Boris Tabakoff
Journal:  J Pharmacol Exp Ther       Date:  2008-06-11       Impact factor: 4.030

10.  Using gene expression databases for classical trait QTL candidate gene discovery in the BXD recombinant inbred genetic reference population: mouse forebrain weight.

Authors:  Lu Lu; Lai Wei; Jeremy L Peirce; Xusheng Wang; Jianhua Zhou; Ramin Homayouni; Robert W Williams; David C Airey
Journal:  BMC Genomics       Date:  2008-09-25       Impact factor: 3.969

View more
  6 in total

1.  Quantitative trait locus mapping of acute functional tolerance in the LXS recombinant inbred strains.

Authors:  Beth Bennett; Colin Larson; Phillip A Richmond; Aaron T Odell; Laura M Saba; Boris Tabakoff; Robin Dowell; Richard A Radcliffe
Journal:  Alcohol Clin Exp Res       Date:  2015-04       Impact factor: 3.455

2.  Multi-Omic Approaches to Identify Genetic Factors in Metabolic Syndrome.

Authors:  Karen C Clark; Anne E Kwitek
Journal:  Compr Physiol       Date:  2021-12-29       Impact factor: 8.915

3.  Potential translational targets revealed by linking mouse grooming behavioral phenotypes to gene expression using public databases.

Authors:  Andrew Roth; Evan J Kyzar; Jonathan Cachat; Adam Michael Stewart; Jeremy Green; Siddharth Gaikwad; Timothy P O'Leary; Boris Tabakoff; Richard E Brown; Allan V Kalueff
Journal:  Prog Neuropsychopharmacol Biol Psychiatry       Date:  2012-10-31       Impact factor: 5.067

Review 4.  Using genome-wide expression profiling to define gene networks relevant to the study of complex traits: from RNA integrity to network topology.

Authors:  M A O'Brien; B N Costin; M F Miles
Journal:  Int Rev Neurobiol       Date:  2012       Impact factor: 3.230

5.  Genomic landscape of rat strain and substrain variation.

Authors:  Roel Hermsen; Joep de Ligt; Wim Spee; Francis Blokzijl; Sebastian Schäfer; Eleonora Adami; Sander Boymans; Stephen Flink; Ruben van Boxtel; Robin H van der Weide; Tim Aitman; Norbert Hübner; Marieke Simonis; Boris Tabakoff; Victor Guryev; Edwin Cuppen
Journal:  BMC Genomics       Date:  2015-05-06       Impact factor: 3.969

6.  Hands-on workshops as an effective means of learning advanced technologies including genomics, proteomics and bioinformatics.

Authors:  Nichole Reisdorph; Robert Stearman; Katerina Kechris; Tzu Lip Phang; Richard Reisdorph; Jessica Prenni; David J Erle; Christopher Coldren; Kevin Schey; Alexey Nesvizhskii; Mark Geraci
Journal:  Genomics Proteomics Bioinformatics       Date:  2013-12-06       Impact factor: 7.691

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.