| Literature DB >> 31350445 |
Hao He1, Shaolong Cao1,2, Ji-Gang Zhang1, Hui Shen1, Yu-Ping Wang1,2, Hong-Wen Deng3.
Abstract
Differential network analysis investigates how the network of connected genes changes from one condition to another and has become a prevalent tool to provide a deeper and more comprehensive understanding of the molecular etiology of complex diseases. Based on the asymptotically normal estimation of large Gaussian graphical model (GGM) in the high-dimensional setting, we developed a computationally efficient test for differential network analysis through testing the equality of two precision matrices, which summarize the conditional dependence network structures of the genes. Additionally, we applied a multiple testing procedure to infer the differential network structure with false discovery rate (FDR) control. Through extensive simulation studies with different combinations of parameters including sample size, number of vertices, level of heterogeneity and graph structure, we demonstrated that our method performed much better than the current available methods in terms of accuracy and computational time. In real data analysis on lung adenocarcinoma, we revealed a differential network with 3503 nodes and 2550 edges, which consisted of 50 clusters with an FDR threshold at 0.05. Many of the top gene pairs in the differential network have been reported relevant to human cancers. Our method represents a powerful tool of network analysis for high-dimensional biological data.Entities:
Mesh:
Year: 2019 PMID: 31350445 PMCID: PMC6659630 DOI: 10.1038/s41598-019-47362-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Comparison of estimating the group specific precision matrix and differential network.
| Sample size | Number of vertices | Level of heterogeneity | Graph structure | Our methoda | DINGOa | JGLb | Our methodb | DINGOb |
|---|---|---|---|---|---|---|---|---|
| 50 | 100 | 0.25 | Random | 0.832 | 0.718 | 0.508 | 0.643 | 0.522 |
| Hub | 0.844 | 0.807 | 0.521 | 0.632 | 0.521 | |||
| Scale-free | 0.777 | 0.644 | 0.505 | 0.640 | 0.514 | |||
| 50 | 100 | 0.75 | Random | 0.815 | 0.708 | 0.532 | 0.660 | 0.516 |
| Hub | 0.799 | 0.727 | 0.563 | 0.640 | 0.513 | |||
| Scale-free | 0.769 | 0.616 | 0.510 | 0.653 | 0.519 | |||
| 100 | 200 | 0.25 | Random | 0.828 | 0.657 | 0.505 | 0.648 | 0.515 |
| Hub | 0.868 | 0.799 | 0.527 | 0.637 | 0.513 | |||
| Scale-free | 0.734 | 0.592 | 0.502 | 0.645 | 0.505 | |||
| 100 | 200 | 0.75 | Random | 0.835 | 0.680 | 0.537 | 0.677 | 0.513 |
| Hub | 0.832 | 0.740 | 0.595 | 0.658 | 0.515 | |||
| Scale-free | 0.756 | 0.580 | 0.510 | 0.665 | 0.518 |
Note: JGL, joint graphical lasso; DINGO, Differential Network Analysis in Genomics.
aAUC values group specific precision matrix.
bAUC values for differential network.
Top 10 most significant pairs of genes in the differential network analysis from the lung cancer study.
| Gene | Gene |
| p value for |
| p value for |
| p value for |
| p value for |
| p value |
|---|---|---|---|---|---|---|---|---|---|---|---|
| GRN | TSPO | −1.220 | 1.603E-01 | 0.197 | 1.366E-01 | −58.002 | 7.369E-06 | 0.781 | 2.287E-48 | 6.224 | 4.837E-10 |
| CARHSP1 | RRAS | −0.776 | 2.804E-01 | 0.150 | 2.642E-01 | −89.694 | 2.145E-05 | 0.719 | 2.539E-27 | 6.163 | 7.148E-10 |
| COMMD5 | DLC1 | 0.399 | 4.554E-01 | −0.103 | 4.481E-01 | 76.557 | 2.567E-05 | −0.709 | 3.822E-25 | −6.130 | 8.791E-10 |
| CIP29 | ZCCHC17 | 2.551 | 1.144E-01 | −0.222 | 8.888E-02 | −78.504 | 1.450E-05 | 0.741 | 3.985E-33 | 5.941 | 2.833E-09 |
| CUTA | MRPS24 | −1.061 | 3.074E-01 | 0.142 | 2.929E-01 | −115.204 | 2.623E-05 | 0.707 | 6.723E-25 | 5.920 | 3.223E-09 |
| LOC284230 | RPL23 | −10.744 | 1.777E-03 | 0.475 | 7.822E-06 | −174.241 | 3.781E-06 | 0.822 | 4.563E-76 | 5.913 | 3.357E-09 |
| CRBN | SERINC3 | 6.459 | 1.393E-02 | −0.359 | 2.709E-03 | −22.954 | 8.653E-05 | 0.640 | 2.793E-15 | 5.872 | 4.299E-09 |
| HTRA2 | RPL7L1 | −0.724 | 6.184E-01 | 0.069 | 6.159E-01 | 201.324 | 5.377E-05 | −0.667 | 2.353E-18 | −5.814 | 6.089E-09 |
| PCTK3 | SUSD1 | −0.351 | 6.442E-01 | 0.064 | 6.422E-01 | −51.450 | 2.156E-05 | 0.719 | 2.925E-27 | 5.808 | 6.333E-09 |
| GDI2 | LOC651816 | −12.125 | 4.843E-03 | 0.420 | 2.084E-04 | 39.698 | 2.959E-04 | −0.573 | 5.365E-10 | −5.802 | 6.538E-09 |
Note: , d = 1, 2, for case and control group, respectively.
Figure 1One cluster in the differential network between lung adenocarcinoma tumors and healthy samples. The sizes of nodes are proportional to their degrees. The widths of the edges are proportional to the W statistics.