| Literature DB >> 22355294 |
Robert Altwasser1, Jörg Linde, Ekaterina Buyko, Udo Hahn, Reinhard Guthke.
Abstract
Discovery of essential genes in pathogenic organisms is an important step in the development of new medication. Despite a growing number of genome data available, little is known about C. albicans, a major fungal pathogen. Most of the human population carries C. albicans as commensal, but it can cause systemic infection that may lead to the death of the host if the immune system has deteriorated. In many organisms central nodes in the interaction network (hubs) play a crucial role for information and energy transport. Knock-outs of such hubs often lead to lethal phenotypes making them interesting drug targets. To identify these central genes via topological analysis, we inferred gene regulatory networks that are sparse and scale-free. We collected information from various sources to complement the limited expression data available. We utilized a linear regression algorithm to infer genome-wide gene regulatory interaction networks. To evaluate the predictive power of our approach, we used an automated text-mining system that scanned full-text research papers for known interactions. With the help of the compendium of known interactions, we also optimize the influence of the prior knowledge and the sparseness of the model to achieve the best results. We compare the results of our approach with those of other state-of-the-art network inference methods and show that we outperform those methods. Finally we identify a number of hubs in the genome of the fungus and investigate their biological relevance.Entities:
Keywords: Candida albicans; LASSO; hubs; linear regression; network inference; prior knowledge; reverse engineering; scale-free
Year: 2012 PMID: 22355294 PMCID: PMC3280432 DOI: 10.3389/fmicb.2012.00051
Source DB: PubMed Journal: Front Microbiol ISSN: 1664-302X Impact factor: 5.640
Figure 1F-measure of the LASSO inference for the 503 . We exploited the BIND prior knowledge. The different graphs represent different values of ϵ and therefore different weighting of prior knowledge. It indicates that a higher influence of prior knowledge yields better results concerning the F-measure.
Figure 2F-measures and number of interactions for different values of . The maximum F-measure 0.0018 is reached at c = 0.2 with a network of 6866 interactions between 6,167 genes.
Results of the genome-wide network inference.
| LASSO | LASSO + FAC | LASSO + PPI | LASSO + TRANS | LASSO + BIND | LASSO + ALL | CLRNET | MRNET | ARACNE | |
|---|---|---|---|---|---|---|---|---|---|
| F-measure | 0.0014 | 0.0015 | 0.0053 | 0.0058 | 0.0067 | 0.0018 | 0.00006 | 0.00006 | 0.0009 |
| No. of interactions | 6,167 | 6,167 | 6,167 | 6,167 | 6,167 | 6,866 | 15,686,064 | 15,329,450 | 39,986 |
The first six rows show the results for LASSO and LASSO with different prior knowledge sources. The sixth row shows the LASSO inference with ALL four sources of prior knowledge. The last three rows show the results for the mutual information-based networks.
Figure 3F-measure obtained by LASSO-based genome-wide network inferences (left) with or without prior knowledge (FAC, PPI, TRANS, BIND) and with all four prior knowledge sources (ALL). The three bars on the right show the results of the mutual information-based networks.
Figure 4Distribution of degrees for the LASSO-based inference without prior knowledge. The red line represents the fitted power-law. The correlation coefficient of the logarithmical data is 0.95.
Figure 5Distribution of degrees for the ARACNE-based inference. The red line represents the fitted power-law. The correlation coefficient of the logarithmical data is 0.08.
Figure 6Venn diagram of the four different prior knowledge sources and the . Empty fields contain no common interaction. There is little overlap between the sources of prior knowledge as well as between the prior knowledge and the gold standard.
Ten genes with the highest out degree of the LASSO network inferred with all four sources of prior knowledge (ALL).
| Gene name | Out degree |
|---|---|
| FET31 | 29 |
| orf19.7450 | 28 |
| orf19.1300 | 25 |
| MAL2 | 20 |
| orf19.4678 | 19 |
| orf19.1735 | 18 |
| SGO1 | 17 |
| orf19.6715 | 17 |
| Yor353c | 15 |
| PSA2 | 15 |
Table of 16 hubs which are sensitive to antifungal treatment.
| Yor353c | Domain protein of RAM cell wall integrity signaling network; role in cell separation, |
| orf19.5975 | Putative zinc finger DNA-binding transcription factor; |
| Hmg2 | HMG-CoA reductase; enzyme of sterol pathway; inhibited by |
| ASR1 | Putative heat shock protein; transcription regulated by cAMP, osmotic stress, |
| YJR073c | Phosphatidylethanolamine |
| Cor1 | Putative ubiquinol-cytochrome- |
| Taf19 | Putative TFIID subunit; mutation confers hypersensitivity to |
| OPT8 | Possible oligopeptide transporter; induced by nitric oxide, |
| AGP2 | Amino acid permease; hyphal downregulated; regulated upon white-opaque switching; induced in core |
| FET31 | Putative iron transport multicopper oxidase precursor; |
| HIP1 | Similar to amino acid permeases; alkaline upregulated; |
| APT1 | Adenine phosphoribosyltransferase; |
| ARX1 | Putative ribosomal large subunit biogenesis protein; downregulated during core stress response; decreased expression in response to |
| Ygr090w | Putative U3 snoRNP protein; decreased expression in response to |
| NOG1 | Putative GTPase; mutation confers hypersensitivity to 5-fluorocytosine (5-FC), 5-fluorouracil (5-FU), and tubercidin (7-deazaadenosine); decreased expression in response to |
| Imp4 | Putative SSU processome component; decreased expression in response to |
The data was taken from the .
Figure 7Subnetwork GAL-genes using expression data and the full set of prior knowledge that does not contain these predicted relations. The connection of GAL10, GAL1, and GAL7 is well studied in many yeast forms. The network predictions also contain this relationship.
Figure 8Predicted hub . The labels on the edges tell which inference and prior knowledge predicted this interaction. LASSO for the LASSO inference without prior knowledge. FAC, BIND, TRANS, and PPI for the inferences with the corresponding prior knowledge source. ALL for the inference that exploited all four prior knowledge sources.
Figure 9Predicted hub . The labels on the edges tell which inference and prior knowledge predicted this interaction. LASSO for the LASSO inference without prior knowledge. FAC, BIND, TRANS, and PPI for the inferences with the corresponding prior knowledge source. ALL for the inference that exploited all four prior knowledge sources.