| Literature DB >> 31844835 |
Renesh Bedre1, Kranthi Mandadi1,2.
Abstract
Genome-scale studies using high-throughput sequencing (HTS) technologies generate substantial lists of differentially expressed genes under different experimental conditions. These gene lists need to be further mined to narrow down biologically relevant genes and associated functions in order to guide downstream functional genetic analyses. A popular approach is to determine statistically overrepresented genes in a user-defined list through enrichment analysis tools, which rely on functional annotations of genes based on Gene Ontology (GO) terms. Here, we propose a new computational approach, GenFam, which allows annotation, classification, and enrichment of genes based on their gene family, thus simplifying identification of candidate gene families and associated genes that may be relevant to the query. GenFam and its integrated database comprises of three hundred and eighty-four unique gene families and supports gene family analyses for sixty plant genomes. Four comparative case studies with plant species belonging to different clades and families were performed using GenFam which demonstrated its robustness and comprehensiveness over preexisting functional enrichment tools. To make it readily accessible for plant biologists, GenFam is available as a web-based application where users can input gene IDs and export enrichment results in both tabular and graphical formats. Users can also customize analysis parameters by choosing from the various statistical enrichment tests and multiple testing correction methods. Additionally, the web-based application, source code, and database are freely available to use and download. Website: http://mandadilab.webfactional.com/home/. Source code and database: http://mandadilab.webfactional.com/home/dload/.Entities:
Keywords: data integration; database; gene family enrichment analysis; gene ontologies; software; statistics
Year: 2019 PMID: 31844835 PMCID: PMC6892992 DOI: 10.1002/pld3.191
Source DB: PubMed Journal: Plant Direct ISSN: 2475-4455
Figure 1GenFam workflow. The list of input gene IDs for respective plant species provided by the user is analyzed for enrichment analysis using various statistical tests. The output of the analysis can be viewed and/or downloaded as a table and/or graphical summary. The results page has multiple options to visualize or download data for both enriched and non‐enriched categories (all gene families). The detailed output data from case studies are provided in Tables S3, S4, S5, and S6
Figure 2Graphical summary of GenFam enrichment analysis of a cotton case study. Results are plotted as bar chart using the −log10(p‐value) scores. Higher the −log10(p‐value) value, greater the confidence in enrichment of the gene family