| Literature DB >> 31438841 |
Joëlle Barido-Sottani1,2,3, Samuel D Chapman1,4, Evsey Kosman5, Arcady R Mushegian6,7,8.
Abstract
BACKGROUND: Gene and protein interaction data are often represented as interaction networks, where nodes stand for genes or gene products and each edge stands for a relationship between a pair of gene nodes. Commonly, that relationship within a pair is specified by high similarity between profiles (vectors) of experimentally defined interactions of each of the two genes with all other genes in the genome; only gene pairs that interact with similar sets of genes are linked by an edge in the network. The tight groups of genes/gene products that work together in a cell can be discovered by the analysis of those complex networks.Entities:
Keywords: Gene networks; Genetic interactions; SUN domain; Similarity measures; Slp1
Mesh:
Year: 2019 PMID: 31438841 PMCID: PMC6704681 DOI: 10.1186/s12859-019-3024-x
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1General outline of genetic interaction assays and schematics of the data transformations used in the process of their analysis
Statistics of similarity scores between yeast genetic interaction vectors under different similarity measures for the one-square matrix
| Braun-Blanquet | Maryland Bridge | Ochiai | Pearson | |
|---|---|---|---|---|
| Mean | 0.04 | 0.06 | 0.06 | < 0.01 |
| Variance | 0.01 | < 0.01 | < 0.01 | < 0.01 |
| Median | 0.03 | 0.06 | 0.05 | < 0.01 |
| Minimum | 0 | 0 | 0 | −0.36 |
| Maximum | 0.53 | 0.60 | 0.57 | 0.81 |
Fig. 2Cumulative similarity distributions between genetic interaction vectors under different similarity measures for the “one-square” transformation
Properties of gene interaction networks and modules derived from the networks under different similarity measures. All values are for the one-square matrix transformation method. See Methods and Discussion for detailed discussion, Figs. 3 and 4 for visual representation of the data, and supplementary online materials for generally similar results obtained under the two-square transformation
| Similarity measure | Braun-Blanquet | Maryland Bridge | Ochiai | Pearson |
|---|---|---|---|---|
| Similarity threshold applied to retain ~ 20,000 edges in the network | 0.16 | 0.18 | 0.20 | 0.15 |
| Nodes (genes) in the network / nodes (genes) in the giant connected component | 3427 / 3303 | 3610 / 3587 | 3385 / 3321 | 4038 / 3956 |
| Edges in the network / edges in the giant connected component | 20,020 / 19,943 | 20,065 / 20,052 | 20,067 / 20,032 | 20,016 / 19,967 |
| Unique genes in modules / percentage of all genes in respective giant connected component | 2072 / 62.7 | 725 / 20.2 | 1519 / 45.7 | 3072 / 77.6 |
| Number of modules / unique genes per module | 682 / 3.04 | 408 / 1.78 | 516 / 2.94 | 1446 / 2.12 |
| Biological Homogeneity Index | 0.12 | 0.27 | 0.16 | 0.33 |
| Percentage of uncharacterized genes / | 36 / 10−38 / 38 | 17 / 10−11 / 2 | 36 / 10− 22 / 30 | 26 / 10− 3 / 35 |
| Annotated modules of Type 1 / Type 2 | 36 / 409 | 64 / 119 | 48 / 309 | 279 / 568 |
Fig. 3Select statistics of clustering and module annotation. The data are taken from Table 2
Fig. 4Genes shared between clustering solutions and the number of uncharacterized genes in each clustering solution. Line thicknesses represent genes shared by each pair of solutions, with the width proportional to their number, also shown next to each line. The band color represents the p-value of the number of shared genes between each pair of clusterings. The area of each circle is proportional to the number of genes shown next to the circle, and the size of each inner circle indicates the number of uncharacterized genes, shown in parentheses