| Literature DB >> 25392692 |
Nancy Lan Guo1, Ying-Wooi Wan1.
Abstract
Unraveling complex molecular interactions and networks and incorporating clinical information in modeling will present a paradigm shift in molecular medicine. Embedding biological relevance via modeling molecular networks and pathways has become increasingly important for biomarker identification in cancer susceptibility and metastasis studies. Here, we give a comprehensive overview of computational methods used for biomarker identification, and provide a performance comparison of several network models used in studies of cancer susceptibility, disease progression, and prognostication. Specifically, we evaluated implication networks, Boolean networks, Bayesian networks, and Pearson's correlation networks in constructing gene coexpression networks for identifying lung cancer diagnostic and prognostic biomarkers. The results show that implication networks, implemented in Genet package, identified sets of biomarkers that generated an accurate prediction of lung cancer risk and metastases; meanwhile, implication networks revealed more biologically relevant molecular interactions than Boolean networks, Bayesian networks, and Pearson's correlation networks when evaluated with MSigDB database.Entities:
Keywords: coexpression with signaling pathways; implication networks; lung cancer biomarkers
Year: 2014 PMID: 25392692 PMCID: PMC4218687 DOI: 10.4137/CIN.S14054
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
Figure 1Coefficient estimation for regularized linear models. Equations to estimate the coefficient vectors in (A) OLS linear regression model, (B) lasso, (C) elastic net, and (D) network-constrained regularization models.
Figure 2Precision (A) and FDR (B) of the disease-specific coexpression networks derived with Boolean implication networks and Genet. Genome-wide coexpression networks were constructed for good prognosis and poor prognosis patient groups, respectively, in the training cohort from Shedden et al.185 The disease-specific networks derived with both models were compared in terms of precision and FDR. An asterisk (*) above the bar indicates that the precision is significantly (P < 0.05) higher than the null precisions in 1,000 permutations.
Figure 3Comparison of the disease-specific coexpression networks derived with Genet and Bayesian networks. Comparisons of the disease-specific coexpression relations validated in two test cohorts in terms of precision (A) and FDR (B) for the 21 identified prognostic lung cancer gene signatures. For the Bayesian networks, the precision is zero for all 21 gene signatures in (A) and the FDR is NA in (B) because no coexpression relation was validated by both test cohorts. The asterisk (*) above the bar indicates that the precision is significantly (P < 0.05) greater than null precisions in 1,000 permutations.
Summary of commonly used methods for biomarker identification.
| METHOD | INPUT HIGH-THROUGHPUT DATA | RESPONSE VARIABLE |
|---|---|---|
| Rank-based methods | ||
| SAM | Expression | Comparison of two conditions |
| Random forests | Expression | Gaussian |
| Cox model | Expression, Mutation | Survival time and Event |
| Mutation | Comparison of two conditions | |
| Regularized models | ||
| Expression, Mutation, Read counts | Gaussian | |
| Elastic net | Expression, Mutation, Read counts | Gaussian |
| Network-constraint regularized model | Expression, Mutation, Read counts, Gene-interaction Network | Gaussian |
| Network-constraint logistic model | Expression, Mutation, Read counts, Gene-interaction Network | Binary |
| Network based methods | ||
| Graphical Gaussian Models | Expression | Not required |
| Poisson graphical models | Read counts | Not required |
| Multi-Dentrix | Mutation | Not required |
| PARADIGM | Integration of Expression, Mutation, and Biological function and interaction data | Not required |
| Bayesian belief networks | Expression | Not required |
| Implication networks | Expression, Mutation | Not required |
Notes:
Expression, mRNA/miRNA expression profiled with microarray. Mutation, mutation profiled with SNP array or next-generation sequencing. Read counts, mRNA/miRNA expression profiled with next-generation sequencing.