| Literature DB >> 30460054 |
Jung Eun Shim1, Tak Lee1, Insuk Lee1.
Abstract
Advanced high-throughput sequencing technology accumulated massive amount of genomics and transcriptomics data in the public databases. Due to the high technical accessibility, DNA and RNA sequencing have huge potential for the study of gene functions in most species including animals and crops. A proven analytic platform to convert sequencing data to gene functional information is co-functional network. Because all genes exert their functions through interactions with others, network analysis is a legitimate way to study gene functions. The workflow of network-based functional study is composed of three steps: (i) inferencing co-functional links, (ii) evaluating and integrating the links into genome-scale networks, and (iii) generating functional hypotheses from the networks. Co-functional links can be inferred from DNA sequencing data by using phylogenetic profiling, gene neighborhood, domain profiling, associalogs, and co-expression analysis from RNA sequencing data. The inferred links are then evaluated and integrated into a genome-scale network with aid from gold-standard co-functional links. Functional hypotheses can be generated from the network based on (i) network connectivity, (ii) network propagation, and (iii) subnetwork analysis. The functional analysis pipeline described here requires only sequencing data which can be readily available for most species by next-generation sequencing technology. Therefore, co-functional networks will greatly potentiate the use of the sequencing data for the study of genetics in any cellular organism.Entities:
Keywords: Sequencing data; co-functional networks; gene functions
Year: 2017 PMID: 30460054 PMCID: PMC6138336 DOI: 10.1080/19768354.2017.1284156
Source DB: PubMed Journal: Anim Cells Syst (Seoul) ISSN: 1976-8354 Impact factor: 1.815
Figure 1.From sequencing data to co-functional networks. Functional links between genes can be inferred by (A) phylogenetic profiling (PG), (B) gene neighborhood (GN), (C) domain profiling (DP), (D) associalogs (AS) using DNA sequencing data, and by (E) co-expression (CX) analysis using RNA sequencing data. All inferred links are evaluated by gold-standard co-functional links derived from pathway annotation databases. The inferred links are scored for likelihood (represented by edge thickness, in which the thicker edge indicates higher likelihood of functional association), and then integrated into a genome-scale co-functional network.
Figure 2.From co-functional networks to gene functions. Functional hypotheses can be generated by three different network approaches. (A) Methods based on network connectivity identify hub genes as essential genes, disease-associated genes by network connections to the DEGs in disease conditions, and disease-associated modules based on network connectivity within a group of candidate disease genes from genome-wide unbiased screening. (B) Functional information of known genes can be propagated to the direct neighbors or throughout the entire network by network diffusion. (C) Modules for processes and phenotypes can be identified by de novo discovery based on clustered network communities or by subnetwork enriched for seed genes, which are already known for the processes or phenotypes.