| Literature DB >> 27600242 |
Xu Wang1, Mustafa Alshawaqfeh2, Xuan Dang3, Bilal Wajid4, Amina Noor5, Marwa Qaraqe6, Erchin Serpedin7.
Abstract
In systems biology, the regulation of gene expressions involves a complex network of regulators. Transcription factors (TFs) represent an important component of this network: they are proteins that control which genes are turned on or off in the genome by binding to specific DNA sequences. Transcription regulatory networks (TRNs) describe gene expressions as a function of regulatory inputs specified by interactions between proteins and DNA. A complete understanding of TRNs helps to predict a variety of biological processes and to diagnose, characterize and eventually develop more efficient therapies. Recent advances in biological high-throughput technologies, such as DNA microarray data and next-generation sequence (NGS) data, have made the inference of transcription factor activities (TFAs) and TF-gene regulations possible. Network component analysis (NCA) represents an efficient computational framework for TRN inference from the information provided by microarrays, ChIP-on-chip and the prior information about TF-gene regulation. However, NCA suffers from several shortcomings. Recently, several algorithms based on the NCA framework have been proposed to overcome these shortcomings. This paper first overviews the computational principles behind NCA, and then, it surveys the state-of-the-art NCA-based algorithms proposed in the literature for TRN reconstruction.Entities:
Keywords: gene; network component analysis; transcription factor; transcriptional regulatory network
Year: 2015 PMID: 27600242 PMCID: PMC4996402 DOI: 10.3390/microarrays4040596
Source DB: PubMed Journal: Microarrays (Basel) ISSN: 2076-3905
Figure 1Examples of two transcription regulatory networks (TRNs) with six genes and four transcription factors (TFs), but different connectivity topologies.
Figure 2An example of (a) a non-identifiable pattern and (b) an identifiable pattern.
Figure 3Network component analysis (NCA) algorithm.
Figure 4NMSE for different algorithms with respect to SNR from dB–20 dB.
Figure 5Data fitting error for different algorithms with respect to SNR from dB–20 dB.
Figure 6NMSE for different algorithms with respect to SNR from dB–20 dB and outliers with probability 0.1.
Figure 7Data fitting error for different algorithms with respect to SNR from dB–20 dB and outliers with probability 0.1.
Normalized mean square error (NMSE), data fitting error (DFE) and computation time for different algorithms under 10 dB SNR. NINCA, non-iterative NCA; ROBNCA, robust NCA; PosNCA, positive NCA.
| Algorithm | ANSME | SNSME | Data Fitting Error | Computation Time | |||
|---|---|---|---|---|---|---|---|
| Noise | Noise + Outliers | Noise | Noise + Outliers | Noise | Noise + Outliers | ||
| FastNCA | 0.0571 | 0.0500 | 0.2544 | 0.2666 | 1.6973 | 4.4193 | 0.0005 |
| NINCA | 0.0037 | 0.0134 | 0.2250 | 0.2280 | 1.7361 | 4.7164 | 0.0119 |
| ROBNCA | 0.0033 | 0.0044 | 0.2218 | 0.2062 | 1.7141 | 4.5630 | 0.0080 |
| NCA | 0.0033 | 0.0060 | 0.2217 | 0.2068 | 1.7139 | 4.4809 | 6.6728 |
| PosNCA | 0.0031 | 0.0055 | 0.3896 | 0.3451 | 1.8200 | 4.7275 | 0.2648 |
Figure 8TFA reconstruction: estimation of seven TFAs of the Arabidopsis Gene Regulatory Information Server (AGRIS).
Summary of NCA-based algorithms. mNCA, motif-directed NCA; gNCA, generalized NCA; NCAr, revised NCA; gfNCA, generalized-framework NCA; nnNCA, non-negative NCA; ALS, alternate least-squares; SSP, subspace separation principle; TLS, total least-squares.
| Algorithm | Category | Estimation Technique | Contribution |
|---|---|---|---|
| NCA [ | Iterative | ALS | Proposed the NCA framework and criteria, motivated other NCA algorithms |
| mNCA [ | Iterative | ALS | Incorporated motif information to obtain the prior connectivity information |
| gNCA [ | Iterative | ALS | Incorporated the prior information about the TFA matrix |
| NCAr [ | Iterative | ALS | Revised and extended the third identification criterion |
| gfNCA [ | Iterative | ALS | Modified the criteria of NCA, such that they are only related to the prior connectivity information |
| ROBNCA [ | Non-iterative | Alternate optimization | Reduced the computational complexity and improved the robustness against outliers |
| FastNCA [ | Non-iterative | SSP, rank-1 factorization | Reduced the computational complexity |
| PosNCA [ | Non-iterative | SSP, convex optimization | Combined additional prior information to reduce the complexity |
| nnNCA [ | Non-iterative | SSP, convex optimization | Combined additional prior information and reduced the complexity |
| NINCA [ | Non-iterative | SSP, convex optimization, TLS | Combined additional prior information, reduced the complexity and improved the estimation accuracy |