| Literature DB >> 31614842 |
Peng Zhang1, Yuval Itan2,3.
Abstract
Network biology has the capability to integrate, represent, interpret, and model complex biological systems by collectively accommodating biological omics data, biological interactions and associations, graph theory, statistical measures, and visualizations. Biological networks have recently been shown to be very useful for studies that decipher biological mechanisms and disease etiologies and for studies that predict therapeutic responses, at both the molecular and system levels. In this review, we briefly summarize the general framework of biological network studies, including data resources, network construction methods, statistical measures, network topological properties, and visualization tools. We also introduce several recent biological network applications and methods for the studies of rare diseases.Entities:
Keywords: application; bioinformatics; biological network; database; rare diseases; software
Mesh:
Year: 2019 PMID: 31614842 PMCID: PMC6827097 DOI: 10.3390/genes10100797
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1A flowchart illustrating the general framework of biological network studies. The blue arrows and text correspond to the construction of biological networks, whereas the green arrows and text correspond to the mapping of biological data onto the network.
Databases providing human protein-protein interaction data.
|
| ||||
|
|
|
|
| |
| BioGRID [ | Physical | 371,513 | 23,795 | |
| IntAct [ | Physical | 379,393 | 25,643 | |
| STRING [ | Physical, association, text-mining | 11,759,455 | 19,567 | |
| HIPPE [ | Physical | 273,900 | 17,000 | |
| HPRD [ | Physical | 41,327 | 30,047 | |
|
| ||||
|
|
|
|
|
|
| TissueNet [ | Physical | 40 | 243,706 | 17,283 |
| GIANT [ | Physical, co-expression | 144 | n.a. | n.a. |
| IID [ | Physical, predicted | 26 | 975,877 | 19,250 |
| TISPIN [ | Physical | 53 | 128,579 | 13,123 |
Typical local and global topological properties (with their computational equations and graph theory explanation) that have been used in biological studies.
| Level | Topological Property | Computational Equation | Graph Theory Explanation | Biological Implication |
|---|---|---|---|---|
| Local | Clustering coefficient [ | Measures the tendency of a node to form a group with the neighboring nodes. | Used to analyze the organizational properties of human protein networks [ | |
| Local | Closeness centrality [ | Measures how fast information can spread from a given gene to the other reachable genes. | These centralities have been used to prioritize disease candidate genes [ | |
| Local | Betweenness centrality [ | Indicates the number of times a given node serves as a linking bridge on the shortest path between any other two nodes. | ||
| Local | PageRank centrality [ | Gauges the importance of a given node by considering both the number of connections of the nodes, and the importance of the connected nodes. | PageRank centrality has been used to identify protein targets in metabolic networks [ | |
| Global | Connectivity centralization [ | Distinguishes highly connected networks or decentralized networks. | Used in studies of the structural differences between metabolic networks [ | |
| Global | Heterogeneity [ | Measures the variation of the connectivity distribution. | Reflects the tendency of a network to have hub genes [ | |
| Global | Global efficiency [ |
| Represents the information exchange efficiency across the entire network or a defined subnetwork. | Used to describe the brain neuro-connectivity [ |
The applications and methods of biological networks in studying rare diseases.
| Application/Method | Source of Network | Algorithm | Results |
|---|---|---|---|
| Congenital hyperinsulinism [ | PPI network (BioGRID) | Graph-partitioning for subnetwork identification. | Identified the nine known disease-causing genes that are functionally diverse and clustered together in a core subnetwork. |
| Systemic sclerosis [ | Gene co-expression network | Consensus clustering and differential network analysis for subnetwork identification. | Identified common pathogenic signature in four tissues of systemic sclerosis patients, and identified a distinct disease process in the lung. |
| HGC [ | Gene association network (STRING) | Shortest path distance, distance distribution, and statistical significance. | Identified 20 of the 21 known disease-causing genes of herpes simplex virus encephalitis, and further used to identify the disease-causing genes of primary immunodeficiency diseases, which were experimentally validated. |
| Vertex Similarity [ | PPI network (3 papers, HPRD, BIND, Reactome) | Pairwise similarity by an edge-weighted and neighbor-considered equation for connected nodes, or a shortest-path-based equation for disconnected nodes. | Developed the Vertex Similarity method to identify and rank orphan disease candidate genes of 172 rare diseases based on the known disease-causing genes in the protein interaction network. |
| DIGNiFI [ | PPI network (HPRD) | Pairwise similarity by measuring local direct neighbor connectivity, and global network feature by a random walk algorithm. | Developed DIGNiFI method to discover causative genes in orphan diseases of 128 rare diseases, and suggested the use of GO terms and protein domains to refine PPI networks. |