| Literature DB >> 25489174 |
Anisha Datta1, Sinjini Sikdar2, Ryan Gill3.
Abstract
Various methods to determine the connectivity scores between groups of proteins associated with lung adenocarcinoma are examined. Proteins act together to perform a wide range of functions within biological processes. Hence, identification of key proteins and their interactions within protein networks can provide invaluable information on disease mechanisms. Differential network analysis provides a means of identifying differences in the interactions among proteins between two networks. We use connectivity scores based on the method of partial least squares to quantify the strength of the interactions between each pair of proteins. These scores are then used to perform permutation-based statistical tests. This examines if there are significant differences between the network connectivity scores for individual proteins or classes of proteins. The expression data from a study on lung adenocarcinoma is used in this study. Connectivity scores are computed for a group of 109 subjects who were in the complete remission and as well as for a group of 51 subjects whose cancer had progressed. The distributions of the connectivity scores are similar for the two networks yet subtle but statistically significant differences have been identified and their impact discussed.Entities:
Keywords: association networks; expression data; lung adenocarcinoma; networks; protein-protein; protein-protein interaction
Year: 2014 PMID: 25489174 PMCID: PMC4248347 DOI: 10.6026/97320630010647
Source DB: PubMed Journal: Bioinformation ISSN: 0973-2063
Figure 1Flow diagram illustrating the test for differential connectivity for an individual protein, starting with the expression values from both groups. The values from both groups are pooled. Then the labels are randomly permuted 1000 times to form new pairs of groups for each data set. The connectivity scores are computed for each actual and permuted group. These connectivity scores are then used to calculate the test statistic for both the observed and permuted data sets. Finally, a p-value is determined by comparing the observed test statistic with the values of the test statistic based on the permuted data sets and is used to make a decision on whether there is a significant difference between the scores for the two proteins between the two groups.
Figure 2Histograms for the distribution of connectivity scores for the complete remission and progression networks. The scores were computed for each pair of proteins using expression values for 174 proteins from a group of 109 subjects with lung adenocarcinoma who went into complete remission and from a group of 51 subjects with lung adenocarcinoma whose cancer progressed. For each network, the connections involving scores greater than 0.085 are illustrated in a graph to the right of each corresponding histogram. The edges represent pairs of proteins with connectivity scores which exceed 0.085. The proteins in the graph (with labels for vertices in parentheses) are alpha-Catenin(1), ACC_pS79(2), ACC1(3), c-Met(4), Caspase-3(5), Caspase-8(6), CD20(7), E-Cadherin(8), EGFR_pY1068(9),ERCC1(a),HER2_pY1248(b),MAPK_pT202_Y204(c), MEK1_pS217_S221(d), p27_pT157(e), PARP(f), PKC-alpha(g),PKC-alpha_pS657(h),Rab25(i),Rb(j), 6_pS235_S236(k), SETD2(l), Snail(m), Src(n), Src_pY416(o), TIGAR(p), XBP1(q), YB-1(r), and YB-1_pS102(s).