| Literature DB >> 32972400 |
Alessio Gamba1, Mario Salmona1, Laura Cantù2, Gianfranco Bazzoni3.
Abstract
BACKGROUND: Despite being caused by mutations in different genes, diseases in the same phenotypic series are clinically similar, as reported in Part I of this study. Here, in Part II, we hypothesized that the phenotypic series too might be clinically similar. Furthermore, on the assumption that gene mutations indirectly cause clinical phenotypes by directly affecting biological functions, we hypothesized that clinically similar phenotypic series might be biologically similar as well.Entities:
Keywords: Biological processes; Disease phenotypes; Gene mutations; Inherited diseases; Network analysis; Ontologies
Mesh:
Year: 2020 PMID: 32972400 PMCID: PMC7513283 DOI: 10.1186/s12920-020-00793-y
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Fig. 1Assembly of the CSN and BSN. The experimental procedure leading to the assembly of the CSN and one of the three BSN (here the BSN-BP) is depicted schematically, in the top and middle panels, respectively. The PS (black circles), labeled A, B and C, are annotated with the HPO terms (blue squares) and the GO-BP terms (red hexagons) labeled 0, 1, 2, 3 and 4. As shown in the schematic ontology trees, 4 is the common ancestor of terms 2 and 3, and 0 is the root term. Then the algorithm retrieves all the possible PS-PS pairs and the annotations that the two PS share (dotted squares and hexagons). The shared annotations can be either an identical term (e.g., A and B sharing 1) or the most informative common ancestor of two different terms (e.g., A and C sharing 4 as a common ancestor of 2 and 3); highly dissimilar PS (e.g., B and C) share the root 0. For simplicity’s sake, each pair of PS is assumed here to share only one annotation. At the end of the search, a non-weighted bipartite graph is assembled. Then, from the PS-HPO and PS-GO bipartite graphs, the CSN and the various BSN are derived, by linking all the PS that share the annotations. The thickness of the edge in the CSN and BSN is proportional to the similarity coefficients of the linked PS pair. The bottom panel shows schematically different types of correlation between HPO- and GO-based similarities. For instance, in the AB, AC and BC pairs of PS, both the clinical and the biological similarities are directly correlated in a high, intermediate or low manner, respectively. In contrast, two additional PS pairs exemplify instances of high clinical (but low biological) similarity (AX) and, conversely, high biological (but low clinical) similarity (AY)
Network analysis
| Network | ||||||
|---|---|---|---|---|---|---|
| 293 | 316 | 314 | 305 | 319 | ||
| 42,778 | 49,770 | 49,141 | 46,360 | 50,721 | ||
| 287 | 298 | 258 | 209 | 311 | ||
| 5660 | 11,223 | 2821 | 1633 | 13,282 | ||
| 39.440 | 75.320 | 21.870 | 15.630 | 85.420 | ||
| 0.635 | 0.708 | 0.653 | 0.691 | 0.654 | ||
| 2.294 | 1.950 | 2.486 | 2.829 | 1.799 | ||
| 0.138 | 0.254 | 0.085 | 0.075 | 0.276 | ||
The topological properties of the five networks. Network analysis was conducted after setting a threshold of 1.0 for all the networks. At this threshold, all the networks are still composed of a single GCC, despite the loss of all the edges with weight w ≤ 1.0
Fig. 2The CSN. The CSN, at a threshold of 2.46, contains 58 nodes linked by 63 edges and is fragmented into 13 islands and 8 clusters. Node color indicates the DO class (inset), while node size is proportional to the connectivity k of the PS. Edges represent biological similarity and edge thickness is proportional to the weight w (i.e., the maximal GO-based similarity among the three GO sub-ontologies)
Fig. 3The BSN. The BSN, at a threshold of 2.70, contains 68 nodes linked by 138 edges and is fragmented into 18 islands and 8 clusters. Node color indicates the DO class (see Fig. 2, inset) and node size is proportional to the connectivity k of the PS. Edges represent clinical similarity and edge thickness is proportional to the weight w (i.e., the HPO-based clinical similarity)
Fig. 4Comparing biological and clinical similarities. a The scatter plot displays the correlation between clinical and biological similarity (as defined in the CSN and the BSN, respectively). b Details of the subset of PS-PS discussed in the Results