| Literature DB >> 33076954 |
Peng-Cheng Chen1, Li Ruan2, Jie Jin2, Yu-Tian Tao2, Xiao-Bao Ding2, Hai-Bo Zhang2, Wen-Ping Guo2, Qiao-Lei Yang1, Heng Yao1, Xin Chen3,4,5.
Abstract
BACKGROUND: The nematode worm, Caenorhabditis elegans, is a saprophytic species that has been emerging as a standard model organism since the early 1960s. This species is useful in numerous fields, including developmental biology, neurobiology, and ageing. A high-quality comprehensive molecular interaction network is needed to facilitate molecular mechanism studies in C. elegans.Entities:
Keywords: Caenorhabditis elegans; Database; Functional interaction; Gene set linkage analysis; Transcriptomic analysis tool
Year: 2020 PMID: 33076954 PMCID: PMC7574172 DOI: 10.1186/s13062-020-00271-6
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Fig. 1Workflow for inferencing functional interactions between C. elegans genes. High-quality experimentally reported protein interactions were integrated from three databases and were used as positive examples. Six types of functional association evidence from 10 databases were collected to infer putative functional interactions. A total of 10 high-quality feature values were selected from 36 feature values that characterize this evidence with different mathematical representations. Random gene pairs after removing positive examples were used as negative examples. The number of negative examples was 100 times that of the positive examples
Fig. 2Assessment of the capabilities of seven interactomes to group functionally associated genes together. The precision-recall curves of gene function prediction using different interactomes are illustrated. Precision estimates the proportion of correct annotations identified by an interactome. Recall estimates the proportion of new annotations that is identified by an interactome
Fig. 3Interface of FIC and GSLA. a Two search options in FIC. b Search result page. A right click on the edge and node will show interaction details. c Interface of GSLA. d Results of a GSLA analysis job
Fig. 4Two hypothesis tests that GSLA used to identify significant functional associations between two gene sets that are biologically meaningful. Q1 tests whether the density of functional associations between two biologically meaningful gene sets is higher than random gene pairs, while Q2 tests whether the strong functional associations observed between two gene sets can only be observed from the biologically correct network, rather than any random interactomes
Fig. 5Functional interpretations produced by FIC/GSLA. Compared to GO enrichment analysis and DAVID, the annotations produced by GSLA are more comprehensive and more accurate