| Literature DB >> 32392296 |
John Lee1, Manthan Shah1, Sara Ballouz1, Megan Crow1, Jesse Gillis1.
Abstract
Co-expression analysis has provided insight into gene function in organisms from Arabidopsis to zebrafish. Comparison across species has the potential to enrich these results, for example by prioritizing among candidate human disease genes based on their network properties or by finding alternative model systems where their co-expression is conserved. Here, we present CoCoCoNet as a tool for identifying conserved gene modules and comparing co-expression networks. CoCoCoNet is a resource for both data and methods, providing gold standard networks and sophisticated tools for on-the-fly comparative analyses across 14 species. We show how CoCoCoNet can be used in two use cases. In the first, we demonstrate deep conservation of a nucleolus gene module across very divergent organisms, and in the second, we show how the heterogeneity of autism mechanisms in humans can be broken down by functional groups and translated to model organisms. CoCoCoNet is free to use and available to all at https://milton.cshl.edu/CoCoCoNet, with data and R scripts available at ftp://milton.cshl.edu/data.Entities:
Year: 2020 PMID: 32392296 PMCID: PMC7319556 DOI: 10.1093/nar/gkaa348
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Schematic of underlying data. Co-expression networks are aggregated for each species, ortholog maps are generated for each pair of species and data quality is assessed using a neighbor voting algorithm across all functional groups.
Figure 2.Left: Counts of experiments expressing at least half of all genes. Right: Counts of samples with a correlation with the global average >0.3.
Figure 3.Distribution of co-expression values for ortholog mapped genes to the input of highly co-expressed yeast genes for each of the 13 other species.
Figure 4.Highly co-expressed yeast (Saccharomyces cerevisiae) genes are mapped to orthologous genes in Arabidopsis (Arabidopsis thaliana), zebrafish (Danio rerio) and fruit fly (Drosophila melanogaster). Genes annotated with the nucleolus (GO:0005730) are highlighted, and the top 1% of connections are shown. Red stars denote highly connected genes as measured by their node degree.