| Literature DB >> 26906975 |
Fatemeh Vafaee1,2.
Abstract
Biomarkers have gained immense scientific interest and clinical value in the practice of medicine. With unprecedented advances in high-throughput technologies, research interest in identifying novel and customized disease biomarkers for early detection, diagnosis, or drug responses is rapidly growing. Biomarkers can be identified in different levels of molecular biomarkers, networks biomarkers and dynamical network biomarkers (DNBs). The latter is a recently developed concept which relies on the idea that a cell is a complex system whose behavior is emerged from interplay of various molecules, and this network of molecules dynamically changes over time. A DNB can serve as an early-warning signal of disease progression, or as a leading network that drives the system into the disease state, and thus unravels mechanisms of disease initiation and progression. It is therefore of great importance to identify DNBs efficiently and reliably. In this work, the problem of DNB identification is defined as a multi-objective optimization problem, and a framework to identify DNBs out of time-course high-throughput data is proposed. Temporal gene expression data of a lung injury with carbonyl chloride inhalation exposure has been used as a case study, and the functional role of the discovered biomarker in the pathogenesis of lung injury has been thoroughly analyzed.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26906975 PMCID: PMC4764930 DOI: 10.1038/srep22023
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Workflow of identifying the pre-disease state and DNB, using an evolutionary multi-objective optimizer.
Figure 2Dynamical behaviors of Pareto solutions (grey lines) identified for different time-points.
Black lines represent the averaged trends. The p-values of the significance assessment of Pareto sets are provided below the corresponding graphs.
List of 16 genes included in the identified DNB.
| Gene symbol | Gene description | Gene symbol | Gene description |
|---|---|---|---|
| Aldh3a1 | aldehyde dehydrogenase 3 family member A1 | Rtp3 | receptor transporter protein 3 |
| Aplnr | apelin receptor | S100A9 | S100 calcium binding protein A9 |
| Cdk1 | cyclin-dependent kinase 1 | Shisa2 | shisa homolog 2 (Xenopus laevis) |
| Chst15 | carbohydrate (N-acetylgalactosamine 4-sulfate 6-O) sulfotransferase 15 | Tk1 | thymidine kinase 1 |
| Clec2d | C-type lectin domain family 2, member d | Tnxb | tenascin XB |
| Ercc6l | excision repair cross-complementing rodent repair deficiency complementation group 6 - like | Ttn | titin |
| Lgals1 | lectin, galactose binding, soluble 1 | Xdh | xanthine dehydrogenase |
| Melk | maternal embryonic leucine zipper kinase | Zbtb16 | zinc finger & BTB domain containing 16 |
Figure 3DNB dynamical evolution. Members of DNB are placed in the center surrounded by randomly chosen neighbors; links are colored from red to blue proportional to the correlation values.
This Figure clearly shows the emergence of the identified DNB at time 4 h.
Comparison of DNBs of acute lung injury (ALI) and the corresponding pre-disease times predicted by different methodologies (GSE2565 dataset).
| Method | DNB size | DNB genes | ||
|---|---|---|---|---|
| Method 1 (Chen | 8 h | 220 | 3.1 (0.015) | See Supplementary file 1 of the original paper |
| Method 2 (Yu | 8 h | 27 | 2.89 (0.115) | Anln, Asns, Atf3, Bag3, Ccnb1, Cdc20, Cdk1, Cenpa, Cep55, Cks1b, Dnaja1, Dnaja4, Dnajb1, Dnajb4, Gsta2, Hspa1b, Hspd1, Maff, Pbk, Prc1, Spag5, Spp1, Tuba4a, Txnrd1, Ube2c, Uhrf1, Birc5 |
| Method 3 (Zeng | 8 h | 3 | 35.82 (0.002) | Hspa8, Hspb1, Hsph1 |
| Method 4 (Li | 8 h | 25 | 0.24 (0.909) | Aldoa, Arhgef12, Bnip3, Esd, Gtf2f2, Hk2, Ldha, Papolg, Pard6b, Pcf11, Pgd, pgk1, Pkm2, Pkp3, Ppl, Prkci, Ptbp1, Rhoj, Rhou, Scel, Taldo1, Thoc4, Tkt, Tpi1, U2af1 |
| Current Method | 16 | 5.063 | See |
Information for Method 1 and Method 4 were extracted from the original papers. Method 3 has provided the java source code of the proposed algorithm. The code was run on ALI data, and the identified modules were extracted for consequent analyses. Method 2 was reproduced for ALI database. Methods are chronologically sorted.
Figure 4The hierarchy of biological processes enriched by EDNB genes.
Nodes are GO terms (i.e., biological processes) and edges shows to the relationships between the terms. Overrepresented biological processes are colored from orange to yellow as the p-value increases. White nodes are not significantly overrepresented; they are included to show the colored nodes in the context of the GO hierarchy. The size of a node is proportional to the number of the EDNB genes annotated to the corresponding GO category. Different regions of the enriched ontology were highlighted and functionally annotated based on the underlying biological processes.