| Literature DB >> 28667017 |
Sudhir Kumar1,2, Charlotte Konikoff1, Maxwell Sanderford1, Li Liu2, Stuart Newfeld3, Jieping Ye4, Rob J Kulathinal5,6.
Abstract
Gene expression patterns assayed across development can offer key clues about a gene's function and regulatory role. Drosophila melanogaster is ideal for such investigations as multiple individual and high-throughput efforts have captured the spatiotemporal patterns of thousands of embryonic expressed genes in the form of in situ images. FlyExpress (www.flyexpress.net), a knowledgebase based on a massive and unique digital library of standardized images and a simple search engine to find coexpressed genes, was created to facilitate the analytical and visual mining of these patterns. Here, we introduce the next generation of FlyExpress resources to facilitate the integrative analysis of sequence data and spatiotemporal patterns of expression from images. FlyExpress 7 now includes over 100,000 standardized in situ images and implements a more efficient, user-defined search algorithm to identify coexpressed genes via Genomewide Expression Maps (GEMs). Shared motifs found in the upstream 5' regions of any pair of coexpressed genes can be visualized in an interactive dotplot. Additional webtools and link-outs to assist in the downstream validation of candidate motifs are also provided. Together, FlyExpress 7 represents our largest effort yet to accelerate discovery via the development and dispersal of new webtools that allow researchers to perform data-driven analyses of coexpression (image) and genomic (sequence) data.Entities:
Keywords: dotplot; embryonic expression patterns; gene regulation; image analysis; intergenic regions
Mesh:
Substances:
Year: 2017 PMID: 28667017 PMCID: PMC5555482 DOI: 10.1534/g3.117.040345
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Integration of new discovery tools in FlyExpress 7 online platform for regulatory discovery. (A) Searching the FlyExpress digital image library reveals standardized in situ images for a gene or genomic element of interest. Along with gene names, queries can be also initiated by PubMed article ID, image ID, keywords, or anatomy ontology. (B) Each image in the digital library is annotated by data source, stage range, and anatomical orientation (e.g., lateral, dorsal, or ventral). (C) Selecting an image along with an associated level of resolution initiates the search for coexpressed genes. (D) Results are displayed as a ranked list. Genes exhibiting the greatest coexpression are displayed first. Links to other Drosophila databases are provided for each gene. (E) Upstream IGRs of the query and result genes can be explored via an interactive shared motif viewer equipped with dotplot and genome browser. (F) GEMs provide another entry point to search for coexpressed genes by allowing global expression in the developing embryo to be rapidly visualized and searched for a given stage. (G) Clicking a pixel(s) of interest on the GEM returns a list of genes and genomic elements expressed at that precise location and stage. (H) A list of multiple genes can also be manually entered or uploaded. FlyExpress 7 automatically searches the digital library and adds them to the Hitlist. The Hitlist facilitates further analysis of coexpressed genes. (I) Custom GEMs can be manually created by selecting pixels across the embryo. Genes and genomic elements matching the user's selected expression pattern will be returned as a Hitlist. (J) The Hitlist content can also be exported to a number of popular resources for further analysis, including FlyBase (St Pierre ), FlyMine (Lyne ), and STRING (Franceschini ). BDGP, Berkeley Drosophila Genome Project; GEMs, Genomewide Expression Maps; ID, identifier; IGR, intergenic region.
Novel features and improvements in FlyExpress 7
| FlyExpress 6 | FlyExpress 7 | |
|---|---|---|
| Digital library images | BDGP: 57,083 | BDGP: 57,083 |
| Fly-FISH: 42,065 | Fly-FISH: 42,065 | |
| PubMed: 5,977 | PubMed: 34,074 | |
| Coexpression searching | Pairwise | GEM-based |
| Absent | Present | |
| Shared motif viewer | Absent | Present |
| Motif visualization | Absent | Interactive dotplot viewer |
| Hitlist functionality | Export to FlyBase and FlyMine | Export to FlyBase and FlyMine |
| Downloadable gene list (CSV) | Downloadable gene list (CSV) | |
| Create custom GEMs | Create custom GEMs | |
| One-click export to STRING ( | ||
| One-click export to Tomtom ( |
In addition to expanding the digital library, a variety of biologist-friendly tools integrating sequence analysis with temporal-spatial patterns of gene expression are available for use in FlyExpress 7. BDGP, Berkeley Drosophila Genome Project; GEMs, Genomewide Expression Maps.
Figure 2Pairwise vs. genomewide expression map (GEM)-based coexpression similarities for a query gene, twi, against all other genes. Both pairwise and GEM-based searches were performed to identify genes coexpressed with twi (FBim9035159_c, 40% pixel coverage; image shown in Figure 1C). For each coexpressed gene image matched against the twi image, GEM-based coexpression similarity is plotted against the Jaccard pairwise similarity. See Figure S1 for further analyses.
Figure 3Shared motif viewer features and functionality. Biologists can visualize shared motifs from two upstream IGRs of a query gene and the selected coexpressed gene via a customizable dotplot. Shared motifs from the upstream intergenic regions of two genes, distal anntena (dan) and putzig (pzg), are displayed. Users can choose from precomputed shared motifs of a minimum k-mer length. An evolutionary filter can be applied to only include conserved sequence. Conservation is determined by averaged PhastCons scores (Siepel ), with darker pixels indicating greater conservation (see Materials and Methods for details). Selecting a shared motif and clicking “Go” will highlight all matches in the dotplot viewer in red. The length of the IGR and genomic coordinates are indicated in the gene axis label. Clicking the gene name provides links to a choice of two genome browsers, FlyBase and modENCODE. Clicking a matched span (i.e., contiguous line) in the dotplot will display a “Dotplot Match” context menu displaying start and end coordinates for the match as well as the matching sequence. IGR, intergenic region; PhastCons, PHylogenetic Analysis with Space-Time models.