Supriyo De1, Yongqing Zhang, Catherine A Wolkow, Sige Zou, Ilya Goldberg, Kevin G Becker. 1. Gene Expression and Genomics Unit, Laboratory of Genetics, National Institute on Aging, National Institutes of Health, Biomedical Research Center, 251 Bayview Boulevard, Baltimore, MD 21224, USA. beckerk@grc.nia.nih.gov.
Abstract
BACKGROUND: The genetic and molecular basis for many intermediate and end stage phenotypes in model systems such as C. elegans and D. melanogaster has long been known to involve pleiotropic effects and complex multigenic interactions. Gene sets are groups of genes that contribute to multiple biological or molecular phenomena. They have been used in the analysis of large molecular datasets such as microarray data, Next Generation sequencing, and other genomic datasets to reveal pleiotropic and multigenic contributions to phenotypic outcomes. Many model systems lack species specific organized phenotype based gene sets to enable high throughput analysis of large molecular datasets. RESULTS AND DISCUSSION: Here, we describe two novel collections of gene sets in C. elegans and D. melanogaster that are based exclusively on genetically determined phenotypes and use a controlled phenotypic ontology. We use these collections to build genome-wide models of thousands of defined phenotypes in both model species. In addition, we demonstrate the utility of these gene sets in systems analysis and in analysis of gene expression-based molecular datasets and show how they are useful in analysis of genomic datasets connecting multigenic gene inputs to complex phenotypes. CONCLUSIONS: Phenotypic based gene sets in both C. elegans and D. melanogaster are developed, characterized, and shown to be useful in the analysis of large scale species-specific genomic datasets. These phenotypic gene set collections will contribute to the understanding of complex phenotypic outcomes in these model systems.
BACKGROUND: The genetic and molecular basis for many intermediate and end stage phenotypes in model systems such as C. elegans and D. melanogaster has long been known to involve pleiotropic effects and complex multigenic interactions. Gene sets are groups of genes that contribute to multiple biological or molecular phenomena. They have been used in the analysis of large molecular datasets such as microarray data, Next Generation sequencing, and other genomic datasets to reveal pleiotropic and multigenic contributions to phenotypic outcomes. Many model systems lack species specific organized phenotype based gene sets to enable high throughput analysis of large molecular datasets. RESULTS AND DISCUSSION: Here, we describe two novel collections of gene sets in C. elegans and D. melanogaster that are based exclusively on genetically determined phenotypes and use a controlled phenotypic ontology. We use these collections to build genome-wide models of thousands of defined phenotypes in both model species. In addition, we demonstrate the utility of these gene sets in systems analysis and in analysis of gene expression-based molecular datasets and show how they are useful in analysis of genomic datasets connecting multigenic gene inputs to complex phenotypes. CONCLUSIONS: Phenotypic based gene sets in both C. elegans and D. melanogaster are developed, characterized, and shown to be useful in the analysis of large scale species-specific genomic datasets. These phenotypic gene set collections will contribute to the understanding of complex phenotypic outcomes in these model systems.
Authors: Jason G Wood; Sara Hillenmeyer; Charles Lawrence; Chengyi Chang; Suzanne Hosier; Will Lightfoot; Eric Mukherjee; Nan Jiang; Christoph Schorl; Alexander S Brodsky; Nicola Neretti; Stephen L Helfand Journal: Aging Cell Date: 2010-10-21 Impact factor: 9.304
Authors: Gary Schindelman; Jolene S Fernandes; Carol A Bastiani; Karen Yook; Paul W Sternberg Journal: BMC Bioinformatics Date: 2011-01-24 Impact factor: 3.169
Authors: Yonqing Zhang; Supriyo De; John R Garner; Kirstin Smith; S Alex Wang; Kevin G Becker Journal: BMC Med Genomics Date: 2010-01-21 Impact factor: 3.063
Authors: Todd W Harris; Igor Antoshechkin; Tamberlyn Bieri; Darin Blasiar; Juancarlos Chan; Wen J Chen; Norie De La Cruz; Paul Davis; Margaret Duesbury; Ruihua Fang; Jolene Fernandes; Michael Han; Ranjana Kishore; Raymond Lee; Hans-Michael Müller; Cecilia Nakamura; Philip Ozersky; Andrei Petcherski; Arun Rangarajan; Anthony Rogers; Gary Schindelman; Erich M Schwarz; Mary Ann Tuli; Kimberly Van Auken; Daniel Wang; Xiaodong Wang; Gary Williams; Karen Yook; Richard Durbin; Lincoln D Stein; John Spieth; Paul W Sternberg Journal: Nucleic Acids Res Date: 2009-11-12 Impact factor: 16.971