| Literature DB >> 18433478 |
Kin-On Cheng1, Ngai-Fong Law, Wan-Chi Siu, Alan Wee-Chung Liew.
Abstract
BACKGROUND: The DNA microarray technology allows the measurement of expression levels of thousands of genes under tens/hundreds of different conditions. In microarray data, genes with similar functions usually co-express under certain conditions only 1. Thus, biclustering which clusters genes and conditions simultaneously is preferred over the traditional clustering technique in discovering these coherent genes. Various biclustering algorithms have been developed using different bicluster formulations. Unfortunately, many useful formulations result in NP-complete problems. In this article, we investigate an efficient method for identifying a popular type of biclusters called additive model. Furthermore, parallel coordinate (PC) plots are used for bicluster visualization and analysis.Entities:
Mesh:
Year: 2008 PMID: 18433478 PMCID: PMC2396181 DOI: 10.1186/1471-2105-9-210
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Examples of different biclusters. (A) A constant bicluster. (B) A constant row bicluster. (C) A constant column bicluster. (D) An additive-related bicluster. (E) A multiplicative-related bicluster. Note that Ci denotes the i-th experimental condition.
Figure 2An example of gene expression matrix with two embedded biclusters.
Figure 3The difference matrix for the dataset shown in Figure 2.
Figure 4The two different groups formed by merging columns "C5" and "C3".
Figure 5The difference matrix for the two different groups formed by merging columns "C5" and "C3".
Figure 6The flow chart of the proposed biclustering algorithm.
Figure 7The pseudo-code of the proposed biclustering algorithm.
Figure 8The PC plot of the true additive-related bicluster.
Figure 9The PC plots of the bicluster identified using noise threshold of 1.2. The expression values of all rows in the four related columns and their difference between the last three columns and the first column are drawn using PC plots in Figures (A) and (B) respectively. In Figures (A) and (B), red color shows rows from the identified bicluster while blue color shows rows from the original dataset. Figures (C) and (D) illustrate the inconsistencies between the identified bicluster and the true bicluster in the four related columns and the column differences respectively. In Figures (C) and (D), the red color indicates rows of the true bicluster that are found by our algorithm while the blue color represents the three missed rows of the true bicluster.
Figure 10The first type of dataset with four additive-related biclusters before noise is added.
Figure 11The second type of dataset with two multiplicative-related biclusters before noise is added.
Parameter settings for biclustering algorithms and post-filtering in the experiments on artificial datasets
| Experiment | Algorithm/post-filtering | Parameter settings* |
| Artificial datasets for additive models | PA | |
| C&C | ||
| pCluster | ||
| Post-filtering | ||
| Artificial datasets for multiplicative models | PM | |
| PAL | ||
| C&C | ||
| pCluster | ||
| Post-filtering |
* The definitions of parameters ε, N, Nand Pfollow those defined for the proposed algorithm, i.e. noise threshold, minimum number of rows, minimum number of columns and maximum percentage in overlap allowed in biclusters respectively. Furthermore, M denotes the maximum number of biclusters required and δ of C&C and the pCluster algorithm is defined as in the original publications [12, 25].
Figure 12Biclustering accuracy against noise level for additive models. The biclustering accuracies of the proposed algorithm, Cheng and Church algorithm and pCluster algorithm are represented by the curves 'Proposed', 'C&C' and 'pCluster' respectively.
Statistical properties of biclustering results for the artificial datasets embedded with additive-related biclusters before post-filtering
| Property | Algorithm | Noise s.d. | |||||
| 0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | ||
| Average number of biclusters | PA | 4 | 4 | 4 | 4.4 | 5.8 | 6.6 |
| C&C | 40 | 40 | 40 | 40 | 40 | 40 | |
| pCluster | 23 | 366.6 | 378.2 | 255.6 | 124.2 | 21.40 | |
| Average number of rows | PA | 35 | 35 | 34.85 | 33.67 | 31.87 | 32.25 |
| C&C | 7.625 | 7.640 | 8.025 | 7.980 | 8.265 | 7.895 | |
| pCluster | 25.57 | 23.14 | 23.28 | 22.98 | 22.69 | 21.74 | |
| Average number of columns | PA | 8.250 | 8.250 | 8.250 | 7.480 | 7.262 | 7.228 |
| C&C | 3.725 | 4.500 | 4.915 | 4.765 | 5.070 | 5.280 | |
| pCluster | 5.217 | 5.224 | 5.115 | 4.873 | 4.468 | 4.237 | |
Figure 13Biclustering accuracy against noise level for multiplicative models. The biclustering accuracy of the proposed algorithms for multiplicative models is denoted by 'PM' while the proposed algorithms for additive models, Cheng and Church algorithm and pCluster algorithm on logarithm of expression data are labelled by 'PAL', 'C&C (log)' and 'pCluster (log)' respectively.
Statistical properties of biclustering results for the artificial datasets embedded with multiplicative-related biclusters before post-filtering
| Property | Algorithm | Noise s.d. | |||||
| 0 | 0.1 | 0.2 | 0.3 | 0.4 | 0.5 | ||
| Average number of biclusters | PM | 2 | 2 | 2.2 | 3 | 2.4 | 2.6 |
| PAL | 2 | 2 | 2.2 | 3.2 | 3 | 3.6 | |
| C&C | 20 | 20 | 20 | 20 | 20 | 20 | |
| pCluster | 1109 | 956.4 | 855.8 | 753.4 | 658.8 | 729.2 | |
| Average number of Rows | PM | 25 | 24.80 | 24.17 | 23.72 | 20.20 | 23.40 |
| PAL | 25 | 23.70 | 21 | 20.10 | 19.52 | 19.72 | |
| C&C | 5.850 | 5.560 | 6.900 | 8.820 | 8.270 | 6.940 | |
| pCluster | 19.94 | 19.93 | 19.87 | 19.75 | 19.71 | 19.66 | |
| Average number of columns | PM | 7 | 7 | 6.367 | 6.400 | 5.900 | 5.933 |
| PAL | 7 | 7 | 6.433 | 5.517 | 5.567 | 5.417 | |
| C&C | 4.300 | 3.870 | 5.020 | 5.660 | 5.570 | 5.110 | |
| pCluster | 4.346 | 4.345 | 4.297 | 4.224 | 4.178 | 4.158 | |
Average processing time for the artificial datasets
| Dataset | Artificial datasets for additive models with noise s.d. of 0.3 | Artificial datasets for multiplicative models with noise s.d. of 0.3 | ||||
| Algorithm | PA | C&C | pCluster | PM | C&C (log) | pCluster (log) |
| Average time (sec) | 3.776 | 17 | 2716 | 0.0232 | 1 | 105.2 |
Parameter settings of algorithms and post-processing investigated in experiments based on the yeast dataset
| Algorithm/post-filtering | Parameter settings* |
| PA | |
| PM | |
| C&C | |
| C&C (log) | |
| ISA | |
| OPSM | |
| xMotifs | |
| Filtering |
* The definitions of parameters ε, N, N, Pand M follow those defined in the experiments on artificial datasets. The other parameters in C&C, ISA, OPSM and xMotifs are as defined in their original publications [1, 12, 32-34].
Figure 14Percentage of additive-related biclusters enriched with GO annotations of different ontologies. (A) Biological process ontology. (B) Cellular component ontology. (C) Molecular function ontology.
Homogeneity comparison of biclusters identified in the yeast cell-cycle dataset using various algorithms
| Algorithm | MSRS/MSRSl* | ACV | ||||
| min | mean | max | min | mean | max | |
| PA | 326.0 | 412.1 | 552.7 | 0.8960 | 0.9416 | 0.9755 |
| PM | 3.694 × 10-4 | 9.573 × 10-3 | 3.809 × 10-2 | 0.7493 | 0.9219 | 1 |
| C&C | 439.6 | 554.5 | 593.3 | 0.6481 | 0.9217 | 0.9768 |
| C&C (log) | 2.784 × 10-2 | 6.262 × 10-2 | 8.451 × 10-2 | 0.3489 | 0.5740 | 0.9000 |
| ISA | 108.9 | 489.6 | 794.6 | 0.8420 | 0.9247 | 0.9588 |
| OPSM | 480.4 | 497.1 | 513.8 | 0.8866 | 0.8904 | 0.8941 |
| xMotifs | 1.910 × 10-12 | 4.820 | 12.04 | 0.9982 | 0.9992 | 1 |
* MSRS is evaluated for all the algorithms except PM and C&C (log) which use MSRSl.
Statistical comparison of biclusters identified in the yeast cell-cycle dataset using various algorithms
| Algorithm | no. of biclusters | size* | no. of genes | no. of conditions | |||||
| min | max | min | mean | max | min | mean | max | ||
| PA | 25 | 10 × 5 | 597 × 17 | 10 | 97.64 | 597 | 5 | 13.16 | 17 |
| PM | 59 | 10 × 6 | 518 × 17 | 10 | 46.09 | 518 | 5 | 9.085 | 17 |
| C&C | 31 | 10 × 5 | 1391 × 17 | 10 | 91.74 | 1391 | 5 | 11.6 | 17 |
| C&C (log) | 5 | 12 × 13 | 2270 × 17 | 12 | 486.2 | 2270 | 13 | 14.80 | 17 |
| ISA | 18 | 28 × 5 | 149 × 6 | 28 | 74.56 | 149 | 5 | 5.667 | 7 |
| OPSM | 2 | 132 × 7 | 469 × 5 | 132 | 300.5 | 469 | 5 | 6 | 7 |
| xMotifs | 13 | 11 × 5 | 115 × 5 | 11 | 40.08 | 115 | 5 | 5 | 5 |
* The size of a bicluster is determined by its number of values, i.e. product of numbers of rows and columns.
Processing time for the yeast cell-cycle expression dataset using various biclustering algorithms
| Algorithm | PA | PM | C&C | C&C (log) | ISA | OPSM | xMotifs |
| Processing time (sec) | 0.72 | 1.35 | 32 | 1217 | 1590 | 38 | 937 |
Annotations of biological process ontology for biclusters identified by the proposed algorithm for additive models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | chromatin modification | 3.84E-04 | 1.39E-01 | YBR081C, YBR198C, YDR392W, YDR448W, YGL112C, YMR236W, YNL097C |
| histone acetylation | 4.84E-04 | 1.76E-01 | YBR081C, YBR198C, YDR392W, YDR448W, YFL039C, YGL112C, YJL081C, YMR236W, YNL136W | |
| endocytosis | 7.24E-04 | 2.63E-01 | YBR109C, YCL034W, YDR388W, YDR490C, YER166W, YFL039C, YGL106W, YHR001W, YHR073W, YNL084C, YNL227C, YNL243W, YOR089C, YOR109W, YOR327C, YPL145C | |
| 2 | aerobic respiration | 2.47E-04 | 4.34E-02 | YBR026C, YDL174C, YDR231C, YHR001W, YMR030W, YMR081C, YPL132W, YPL159C |
| 3 | ribosome biogenesis and assembly | 2.75E-13 | 4.73E-11 | YBL024W, YBR142W, YDL153C, YDL167C, YDR060W, YDR120C, YDR312W, YDR365C, YEL026W, YER082C, YGL099W, YGR162W, YGR245C, YJL033W, YJR002W, YJR066W, YLL008W, YLR175W, YLR401C, YNL110C, YNL132W, YNL175C, YOL077C, YOR145C, YOR272W, YPL126W |
| 35S primary transcript processing | 2.23E-06 | 3.84E-04 | YCL031C, YDR339C, YER082C, YGR090W, YJL033W, YJR002W, YLL008W, YLR175W, YOR145C, YPR137W | |
| 4 | ergosterol biosynthetic process | 1.08E-04 | 1.33E-02 | YLR450W, YML008C, YMR202W, YMR208W |
| translational elongation | 1.87E-06 | 2.30E-04 | YAL003W, YDL081C, YDR382W, YDR385W, YLR249W, YLR340W, YOL039W | |
| regulation of translational fidelity | 7.04E-07 | 8.67E-05 | YBR048W, YDL229W, YDR025W, YGR118W, YNL209W, YPL081W | |
| ribosomal small subunit assembly and maintenance | 6.90E-05 | 8.49E-03 | YBR048W, YDR025W, YDR447C, YGR214W, YLR048W, YLR167W | |
| ribosomal large subunit assembly and maintenance | 6.83E-06 | 8.40E-04 | YBR142W, YDR312W, YLR075W, YLR340W, YLR448W, YML073C, YOL127W, YPR102C | |
| translation | 6.92E-44 | 8.51E-42 | YBL072C, YBL092W, YBR048W, YBR268W, YDL061C, YDL075W, YDL081C, YDL082W, YDL083C, YDL136W, YDL191W, YDL229W, YDR012W, YDR025W, YDR064W, YDR382W, YDR447C, YDR450W, YDR471W, YDR500C, YER074W, YER117W, YER131W, YGR118W, YGR214W, YHL001W, YHR141C, YIL069C, YJL136C, YJL190C, YJR123W, YKL056C, YKL156W, YKR057W, YKR094C, YLR048W, YLR075W, YLR167W, YLR185W, YLR325C, YLR340W, YLR388W, YLR441C, YLR448W, YML026C, YML063W, YML073C, YMR143W, YMR242C, YNL067W, YNL096C, YNL162W, YNL209W, YNL301C, YNL302C, YNL306W, YOL039W, YOL040C, YOL127W, YOR167C, YOR234C, YOR293W, YOR312C, YOR369C, YPL081W, YPL090C, YPL143W, YPL198W, YPR102C | |
| 6 | protein folding | 3.15E-04 | 1.51E-02 | YDR214W, YFL016C, YML130C, YNL007C, YOR027W |
| copper ion import | 3.98E-04 | 1.91E-02 | YLR411W, YPR124W | |
| 8 | tricarboxylic acid cycle | 8.65E-04 | 1.03E-01 | YDR178W, YIL125W, YLL041C, YNL037C |
| mitochondrial electron transport, ubiquinol to cytochrome c | 4.23E-04 | 5.04E-02 | YEL024W, YHR001W, YOR065W | |
| ubiquitin-dependent protein catabolic process | 3.41E-04 | 4.05E-02 | YBR173C, YDR394W, YJL001W, YMR119W, YOL038W | |
| 9 | cytokinesis, contractile ring contraction | 1.51E-04 | 3.17E-03 | YBR038W, YHR023W |
| 10 | cell morphogenesis checkpoint | 8.19E-04 | 7.37E-02 | YJL187C, YKL101W |
| chitin biosynthetic process | 8.19E-04 | 7.37E-02 | YER096W, YNL233W | |
| mitotic sister chromatid cohesion | 6.67E-06 | 6.01E-04 | YFL008W, YIL026C, YJL019W, YMR076C, YMR078C, YNL273W | |
| 14 | glycolysis | 4.18E-04 | 3.81E-02 | YCL040W, YDR050C, YJR009C, YKL152C |
| ribosomal small subunit assembly and maintenance | 9.27E-04 | 8.43E-02 | YDR337W, YGR214W, YLR167W, YML024W | |
| 15 | protein folding | 4.54E-04 | 1.22E-02 | YAL005C, YBR169C, YDR214W, YLR216C |
| 19 | tricarboxylic acid cycle | 7.49E-05 | 2.55E-03 | YDR148C, YLL041C, YNR001C |
| 24 | response to stress | 3.45E-04 | 3.45E-03 | YBR072W, YDR258C, YPL240C |
| 25 | SRP-dependent cotranslational protein targeting to membrane, translocation | 4.27E-04 | 8.12E-03 | YAL005C, YER103W |
| response to stress | 6.85E-05 | 1.30E-03 | YAL005C, YDR258C, YER103W, YPL240C | |
| protein folding | 5.73E-04 | 1.09E-02 | YAL005C, YDR258C, YER103W | |
| protein refolding | 2.86E-04 | 5.43E-03 | YAL005C, YPL240C |
Annotations of cellular component ontology for biclusters identified by the proposed algorithm for additive models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | SLIK (SAGA-like) complex | 4.45E-04 | 7.56E-02 | YBR081C, YBR198C, YDR392W, YDR448W, YGL112C, YMR236W |
| transcription factor TFIID complex | 4.45E-04 | 7.56E-02 | YBR198C, YER148W, YGL112C, YML114C, YMR236W, YPL129W | |
| INO80 complex | 3.75E-04 | 6.38E-02 | YDL002C, YFL039C, YJL081C, YLR052W, YPL129W | |
| 3 | nucleolus | 4.60E-14 | 4.28E-12 | YAL059W, YBR142W, YCL031C, YCL054W, YDR312W, YDR339C, YDR365C, YDR378C, YEL026W, YEL055C, YGR090W, YGR159C, YJL033W, YJR002W, YLL008W, YLL034C, YLR175W, YML074C, YNL110C, YNL132W, YNL175C, YOL077C, YOL080C, YOR145C, YOR272W |
| small nucleolar ribonucleoprotein complex | 2.46E-04 | 2.29E-02 | YDL153C, YDR378C, YEL026W, YER082C, YGR090W, YJR002W, YPL126W, YPR137W | |
| 4 | cytosolic large ribosomal subunit (sensu Eukaryota) | 1.76E-23 | 1.25E-21 | YBL092W, YDL075W, YDL081C, YDL082W, YDL136W, YDL191W, YDR012W, YDR382W, YDR471W, YDR500C, YER117W, YHL001W, YHR141C, YKR094C, YLR075W, YLR185W, YLR325C, YLR340W, YLR448W, YML073C, YMR242C, YNL067W, YNL162W, YNL301C, YOL039W, YOL127W, YOR234C, YOR312C, YPL143W, YPL198W, YPR102C |
| cytosolic small ribosomal subunit (sensu Eukaryota) | 7.45E-29 | 5.29E-27 | YBL072C, YBR048W, YDL061C, YDL083C, YDR025W, YDR064W, YDR447C, YDR450W, YER074W, YER131W, YGR118W, YGR214W, YIL069C, YJL136C, YJL190C, YJR123W, YKL156W, YKR057W, YLR048W, YLR167W, YLR388W, YLR441C, YML026C, YML063W, YMR143W, YNL096C, YNL302C, YOL040C, YOR167C, YOR293W, YOR369C, YPL081W, YPL090C | |
| ribosome | 2.87E-05 | 2.04E-03 | YAL003W, YDR385W, YEL034W, YKL056C, YLR249W, YOL139C, YPR163C | |
| 5 | chromatin remodeling complex | 5.68E-04 | 3.12E-02 | YJL176C, YOR290C, YPL016W |
| 8 | mitochondrion | 1.66E-08 | 1.11E-06 | YBL015W, YBL090W, YBR003W, YBR037C, YBR120C, YBR122C, YBR147W, YCR028C, YDL027C, YDR141C, YDR178W, YDR305C, YDR316W, YDR494W, YDR513W, YEL006W, YEL024W, YER141W, YGL229C, YGR207C, YGR243W, YHR001W, YHR147C, YIL111W, YIL125W, YJL131C, YJL171C, YKL087C, YLL041C, YLR168C, YLR395C, YML120C, YMR145C, YMR167W, YNL037C, YNL073W, YOL038W, YOL059W, YOL096C, YOR065W, YOR317W, YOR356W, YOR386W, YPL005W, YPL029W, YPL103C |
| respiratory chain complex III (sensu Eukaryota) | 4.23E-04 | 2.84E-02 | YEL024W, YHR001W, YOR065W | |
| endosome | 9.02E-05 | 6.04E-03 | YAL030W, YDL113C, YJL053W, YLR119W, YLR408C, YNR006W, YOR036W | |
| 9 | bud neck | 1.42E-06 | 2.27E-05 | YBR038W, YGR092W, YHR023W, YIL106W, YLR190W, YOL070C |
| 10 | bud neck | 7.51E-04 | 3.68E-02 | YDR507C, YGR152C, YGR238C, YIL140W, YJL187C, YKL101W, YNL233W |
| septin ring | 9.01E-05 | 4.42E-03 | YIL140W, YKL101W, YNL233W | |
| 14 | lipid particle | 5.85E-05 | 2.93E-03 | YIL124W, YJR009C, YMR110C, YNL231C, YOR317W |
| 17 | mitochondrion | 8.16E-06 | 2.86E-04 | YCL057W, YDL027C, YDL164C, YDR116C, YDR194C, YDR375C, YDR513W, YGL104C, YHR002W, YHR067W, YHR147C, YIL087C, YIL111W, YJL063C, YLL040C, YLR270W, YLR346C, YMR098C, YMR152W, YMR188C, YNL063W, YNL073W, YNL200C, YNL274C, YOL059W, YOL071W, YOR136W, YPR011C |
| 18 | bud neck contractile ring | 6.08E-05 | 1.58E-03 | YHR023W, YJR092W, YMR032W |
| pre-autophagosomal structure | 6.62E-04 | 1.72E-02 | YBL078C, YDL113C | |
| 20 | nuclear cohesin complex | 3.74E-04 | 5.24E-03 | YDL003W, YIL026C |
Annotations of molecular function ontology for biclusters identified by the proposed algorithm for additive models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | endopeptidase activity | 7.71E-05 | 1.26E-02 | YBL041W, YDR394W, YER012W, YJL001W, YOL038W, YOR362C |
| 3 | ATP-dependent RNA helicase activity | 5.10E-04 | 3.57E-02 | YBR142W, YER172C, YJL033W, YLL008W, YMR080C |
| snoRNA binding | 7.57E-04 | 5.30E-02 | YDL153C, YER082C, YGR090W, YPL126W, YPR137W | |
| 4 | structural constituent of ribosome | 7.61E-47 | 3.96E-45 | YBL072C, YBL092W, YBR048W, YBR268W, YDL061C, YDL075W, YDL081C, YDL082W, YDL083C, YDL136W, YDL191W, YDR012W, YDR025W, YDR064W, YDR382W, YDR447C, YDR450W, YDR471W, YDR500C, YER074W, YER117W, YER131W, YGR118W, YGR214W, YHL001W, YHR141C, YIL069C, YJL136C, YJL190C, YJR123W, YKL156W, YKR057W, YKR094C, YLR048W, YLR075W, YLR167W, YLR185W, YLR325C, YLR340W, YLR388W, YLR441C, YLR448W, YML026C, YML063W, YML073C, YMR143W, YMR242C, YNL067W, YNL096C, YNL162W, YNL301C, YNL302C, YNL306W, YOL039W, YOL040C, YOL127W, YOR167C, YOR234C, YOR293W, YOR312C, YOR369C, YPL081W, YPL090C, YPL143W, YPL198W, YPR102C |
| RNA-directed DNA polymerase activity | 9.72E-04 | 5.05E-02 | YAR009C, YJR027W, YML039W, YML045W, YMR045C, YMR050C | |
| DNA helicase activity | 5.11E-04 | 2.66E-02 | YDR545W, YLR467W, YNL339C, YPL283C, YPR204W | |
| ribonuclease activity | 9.72E-04 | 5.05E-02 | YAR009C, YJR027W, YML039W, YML045W, YMR045C, YMR050C | |
| RNA binding | 3.10E-05 | 1.61E-03 | YAR009C, YDL208W, YDR378C, YDR381W, YEL026W, YHL001W, YJR027W, YLR277C, YLR448W, YML039W, YML045W, YML073C, YMR045C, YMR050C, YNL175C, YOL123W, YOL127W | |
| helicase activity | 2.93E-05 | 1.53E-03 | YEL077C, YJL225C, YLL066C, YLL067C, YML133C | |
| 6 | copper uptake transporter activity | 3.98E-04 | 1.11E-02 | YLR411W, YPR124W |
| 17 | glycerol-3-phosphate dehydrogenase (NAD+) activity | 8.19E-04 | 2.95E-02 | YDL022W, YOL059W |
| 18 | spermidine transporter activity | 1.12E-04 | 2.24E-03 | YLL028W, YOR273C |
| spermine transporter activity | 6.62E-04 | 1.32E-02 | YLL028W, YOR273C | |
| 24 | unfolded protein binding | 1.05E-04 | 4.19E-04 | YBR072W, YDR258C, YPL240C |
| 25 | unfolded protein binding | 1.38E-05 | 8.29E-05 | YAL005C, YDR258C, YER103W, YPL240C |
| ATPase activity | 9.71E-04 | 5.82E-03 | YAL005C, YDR258C, YER103W |
Annotations of biological process ontology for biclusters identified by the proposed algorithm for multiplicative models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | ribosomal large subunit biogenesis and assembly | 5.83E-06 | 1.97E-03 | YAL025C, YBR267W, YDR091C, YNL110C, YNL163C, YOR272W, YPL211W |
| tRNA methylation | 1.94E-04 | 6.55E-02 | YBL024W, YBR061C, YDR165W, YNR046W, YOL093W, YOL124C | |
| processing of 20S pre-rRNA | 1.80E-05 | 6.05E-03 | YDL153C, YDL166C, YDR449C, YEL026W, YJL191W, YJR002W, YLR068W, YLR192C, YLR222C, YML093W, YMR093W, YOR056C, YPR137W | |
| transcription from RNA polymerase III promoter | 1.02E-04 | 3.45E-02 | YBR154C, YDL150W, YDR045C, YER148W, YLR223C, YNL113W, YNR003C, YOR224C | |
| 35S primary transcript processing | 6.37E-06 | 2.15E-03 | YBL004W, YCL031C, YCL059C, YDR339C, YGR090W, YJR002W, YKR060W, YLL008W, YLR051C, YLR186W, YLR430W, YNR038W, YOL021C, YOR145C, YPR112C, YPR137W | |
| transcription from RNA polymerase I promoter | 9.82E-04 | 3.31E-01 | YBL014C, YBR154C, YDR156W, YER148W, YNL113W, YOR224C, YOR341W | |
| rRNA processing | 5.33E-04 | 1.80E-01 | YBR142W, YBR257W, YCL059C, YDR365C, YDR478W, YGR159C, YLR223C, YMR049C, YMR290C, YOL144W, YOR145C, YPL211W | |
| ribosome biogenesis and assembly | 1.09E-15 | 3.66E-13 | YAL025C, YBL024W, YBL054W, YBR034C, YBR084W, YBR142W, YBR267W, YCL059C, YDL153C, YDL167C, YDR060W, YDR165W, YDR300C, YDR312W, YDR365C, YDR449C, YDR465C, YEL026W, YHL039W, YJR002W, YJR066W, YKL143W, YKL191W, YKR056W, YKR060W, YLL008W, YLR186W, YML093W, YMR093W, YMR131C, YMR290C, YNL110C, YNL113W, YNL132W, YNL175C, YNR003C, YNR038W, YNR053C, YOL077C, YOL124C, YOL144W, YOR056C, YOR145C, YOR206W, YOR272W, YPL211W, YPL212C, YPL226W | |
| mRNA export from nucleus | 3.51E-04 | 1.18E-01 | YBR034C, YDL116W, YDR432W, YER107C, YJL140W, YKL057C, YKL068W, YKR002W, YKR095W, YMR308C, YOR098C | |
| 5 | nucleotide-excision repair | 1.88E-04 | 1.97E-02 | YBR088C, YDL164C, YJL173C, YNL312W |
| DNA recombination | 1.47E-05 | 1.54E-03 | YDL164C, YJL173C, YML061C, YNL312W | |
| DNA replication, synthesis of RNA primer | 6.91E-05 | 7.25E-03 | YJL173C, YKL045W, YNL312W | |
| double-strand break repair via homologous recombination | 8.96E-04 | 9.41E-02 | YER147C, YJL173C, YNL312W | |
| 7 | arabinose catabolic process | 3.55E-04 | 1.03E-02 | YHR104W, YOR120W |
| D-xylose catabolic process | 3.55E-04 | 1.03E-02 | YHR104W, YOR120W | |
| protein refolding | 1.22E-05 | 3.55E-04 | YBR169C, YLL026W, YPL240C | |
| 15 | protein refolding | 2.41E-06 | 6.74E-05 | YAL005C, YBR169C, YPL240C |
| 21 | ribosome biogenesis and assembly | 2.26E-04 | 1.26E-02 | YAL025C, YBR238C, YBR267W, YDL031W, YDR083W, YDR184C, YIR026C, YKL078W |
| 24 | Glycolysis | 3.40E-05 | 4.15E-03 | YAL038W, YCR012W, YDR050C, YJR009C, YKL060C, YKL152C |
| translational elongation | 4.41E-09 | 5.38E-07 | YAL003W, YBR118W, YDL081C, YDL130W, YDR382W, YDR385W, YLR249W, YLR340W, YOL039W | |
| regulation of translational fidelity | 1.61E-08 | 1.97E-06 | YBR048W, YBR189W, YDL229W, YDR025W, YGR118W, YNL209W, YPL081W | |
| ribosomal small subunit assembly and maintenance | 4.58E-07 | 5.59E-05 | YBR048W, YCR031C, YDR025W, YDR447C, YGR214W, YLR048W, YLR167W, YML024W | |
| ribosomal large subunit assembly and maintenance | 1.58E-04 | 1.93E-02 | YDR418W, YLR075W, YLR340W, YLR448W, YML073C, YOL127W, YPR102C | |
| translation | 6.16E-69 | 7.51E-67 | YBL027W, YBL038W, YBL072C, YBL092W, YBR031W, YBR048W, YBR181C, YBR189W, YBR191W, YCR031C, YDL061C, YDL075W, YDL081C, YDL082W, YDL083C, YDL130W, YDL136W, YDL191W, YDL229W, YDR012W, YDR025W, YDR064W, YDR382W, YDR418W, YDR447C, YDR450W, YDR471W, YDR500C, YER074W, YER102W, YER117W, YER131W, YGR118W, YGR214W, YHR141C, YIL069C, YJL136C, YJL177W, YJL189W, YJL190C, YJR123W, YJR145C, YKL006W, YKL056C, YKL156W, YKL180W, YKR057W, YKR094C, YLL045C, YLR029C, YLR048W, YLR075W, YLR167W, YLR185W, YLR325C, YLR333C, YLR340W, YLR344W, YLR388W, YLR406C, YLR441C, YLR448W, YML024W, YML026C, YML063W, YML073C, YMR121C, YMR143W, YMR194W, YMR230W, YMR242C, YNL067W, YNL096C, YNL162W, YNL209W, YNL301C, YNL302C, YOL039W, YOL040C, YOL127W, YOR167C, YOR234C, YOR293W, YOR312C, YOR369C, YPL081W, YPL090C, YPL143W, YPL198W, YPR043W, YPR102C | |
| telomere maintenance via recombination | 5.04E-04 | 6.14E-02 | YDR545W, YER190W, YLR467W, YNL339C, YPL283C | |
| 31 | nucleotide-excision repair | 3.95E-04 | 2.05E-02 | YAR007C, YDL164C, YNL312W |
| DNA recombination | 6.57E-05 | 3.42E-03 | YAR007C, YDL164C, YNL312W | |
| DNA replication, synthesis of RNA primer | 9.46E-04 | 4.92E-02 | YAR007C, YNL312W | |
| mitotic sister chromatid cohesion | 6.18E-05 | 3.21E-03 | YDL003W, YFL008W, YIL026C, YMR078C | |
| DNA strand elongation during DNA replication | 7.82E-07 | 4.06E-05 | YAR007C, YKL108W, YLR103C, YNL312W | |
| 34 | NADH oxidation | 1.25E-05 | 9.00E-04 | YBR145W, YML120C, YMR145C, YOL059W |
| 35 | transposition, RNA-mediated | 3.41E-06 | 1.36E-04 | YCL020W, YER160C, YJR026W, YJR028W, YML040W, YOR142W |
| 36 | glycine catabolic process | 1.94E-05 | 9.31E-04 | YAL044C, YDR019C, YMR189W |
| one-carbon compound metabolic process | 4.79E-05 | 2.30E-03 | YAL044C, YDR019C, YMR189W | |
| 41 | karyogamy during conjugation with cellular fusion | 4.10E-04 | 3.86E-02 | YCL055W, YNL313C, YPL192C |
| ribosome biogenesis and assembly | 2.32E-05 | 2.18E-03 | YBR267W, YCR072C, YDL031W, YDR184C, YDR465C, YGL099W, YGR187C, YMR128W, YOL010W, YOR001W | |
| 35S primary transcript processing | 4.81E-04 | 4.52E-02 | YDL031W, YGR090W, YOL010W, YOL021C, YOR001W | |
| 50 | pseudohyphal growth | 9.72E-04 | 4.57E-02 | YBR083W, YJL164C, YKL185W, YOR127W |
| N-terminal protein myristoylation | 5.58E-04 | 2.62E-02 | YIL009W, YOR317W | |
| 54 | DNA unwinding during replication | 6.79E-04 | 4.21E-02 | YBR202W, YGL201C, YLR274W |
| DNA replication initiation | 1.69E-04 | 1.05E-02 | YBL035C, YBR202W, YGL201C, YLR274W | |
| pheromone-dependent signal transduction during conjugation with cellular fusion | 2.18E-04 | 1.35E-02 | YHR005C, YJL157C, YNL173C, YOR127W | |
| 55 | spore wall assembly (sensu Fungi) | 7.33E-05 | 3.66E-03 | YDR126W, YDR523C, YOR177C, YOR242C |
| 56 | meiotic mismatch repair | 3.03E-05 | 1.48E-03 | YDR097C, YNL082W, YOL090W |
| mismatch repair | 6.19E-04 | 3.03E-02 | YDR097C, YNL082W, YOL090W | |
| microtubule nucleation | 1.13E-04 | 5.52E-03 | YDR356W, YKL042W, YOR373W, YPL124W | |
| 57 | sulfate assimilation | 1.76E-04 | 9.50E-03 | YFR030W, YJR010W, YKR069W |
| microtubule nucleation | 1.24E-04 | 6.67E-03 | YBL063W, YMR117C, YOR373W, YPL124W | |
| 59 | DNA replication checkpoint | 5.86E-04 | 3.23E-02 | YCL061C, YMR048W |
Annotations of cellular component ontology for biclusters identified by the proposed algorithm for multiplicative models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | DNA-directed RNA polymerase I complex | 9.40E-04 | 1.50E-01 | YBR154C, YDR156W, YNL113W, YOR224C, YOR341W |
| nucleoplasm | 6.60E-04 | 1.06E-01 | YAL059W, YDL051W, YKR002W, YKR095W, YNL175C, YNR053C | |
| nucleolus | 9.69E-17 | 1.55E-14 | YAL025C, YAL059W, YBL004W, YBL026W, YBR142W, YCL031C, YCL054W, YCL059C, YDL051W, YDR299W, YDR312W, YDR339C, YDR365C, YDR378C, YEL026W, YGR090W, YGR159C, YJR002W, YKR060W, YLL008W, YLL034C, YLR051C, YLR068W, YLR186W, YLR223C, YMR049C, YMR131C, YMR233W, YMR269W, YMR290C, YNL110C, YNL132W, YNL147W, YNL175C, YNL299W, YNR038W, YNR046W, YNR053C, YOL041C, YOL077C, YOL144W, YOR145C, YOR272W, YPL211W, YPR112C | |
| nucleus | 1.60E-05 | 2.56E-03 | YAL059W, YAR015W, YBL016W, YBL024W, YBL054W, YBL093C, YBR034C, YBR066C, YBR090C, YBR112C, YBR160W, YBR173C, YCL011C, YCL031C, YCL054W, YCR036W, YCR051W, YCR059C, YCR060W, YCR090C, YDL002C, YDL006W, YDL047W, YDL051W, YDL070W, YDL076C, YDL153C, YDL166C, YDR006C, YDR091C, YDR098C, YDR143C, YDR155C, YDR162C, YDR165W, YDR260C, YDR296W, YDR305C, YDR361C, YDR365C, YDR390C, YDR432W, YDR465C, YDR477W, YEL007W, YER012W, YER042W, YER148W, YGL130W, YGR090W, YGR159C, YGR200C, YJL140W, YJR002W, YJR017C, YJR105W, YKL143W, YKR060W, YKR072C, YKR079C, YKR096W, YLL034C, YLR007W, YLR039C, YLR051C, YLR052W, YLR068W, YLR107W, YLR186W, YLR223C, YLR262C, YLR265C, YLR327C, YLR384C, YLR420W, YLR430W, YML032C, YML053C, YML080W, YML081W, YML114C, YMR009W, YMR021C, YMR049C, YMR070W, YMR074C, YMR092C, YMR176W, YMR178W, YMR226C, YMR233W, YMR235C, YMR308C, YNL004W, YNL016W, YNL110C, YNL136W, YNL164C, YNL186W, YNL199C, YNL215W, YNL299W, YNR003C, YNR046W, YNR053C, YOL093W, YOL108C, YOL143C, YOR006C, YOR056C, YOR123C, YOR145C, YOR189W, YOR206W, YOR252W, YOR272W, YOR283W, YOR304W, YPL047W, YPL086C, YPL204W, YPL212C, YPL268W, YPR069C, YPR073C | |
| small nucleolar ribonucleoprotein complex | 1.97E-05 | 3.16E-03 | YBL004W, YBL026W, YCL059C, YDL153C, YDR378C, YDR449C, YEL026W, YGR090W, YJL191W, YJR002W, YLR186W, YLR222C, YML093W, YMR093W, YNL147W, YPR137W | |
| 5 | incipient bud site | 4.75E-04 | 2.18E-02 | YGR189C, YKR090W, YLL021W, YNL233W, YNL304W |
| nucleus | 1.26E-05 | 5.80E-04 | YBL046W, YBR073W, YBR088C, YCR065W, YDL006W, YDL103C, YDL164C, YDL197C, YER003C, YER152C, YGR042W, YKL045W, YKL089W, YKL113C, YLL022C, YLR233C, YLR376C, YML021C, YML061C, YML109W, YOR074C, YOR279C, YOR342C, YPL008W, YPL127C, YPL208W, YPL256C, YPR120C, YPR135W | |
| bud neck | 4.38E-04 | 2.01E-02 | YDR507C, YGR152C, YKL101W, YKR090W, YLL021W, YNL233W, YNL304W | |
| 11 | MCM complex | 8.91E-04 | 5.52E-02 | YEL032W, YLR274W, YPR019W |
| 12 | bud neck | 1.07E-05 | 3.33E-04 | YBR038W, YBR200W, YGR092W, YHR023W, YIL106W, YLR190W, YMR001C, YPR119W |
| 14 | spindle microtubule | 6.05E-05 | 2.12E-03 | YBL063W, YBR156C, YGL061C |
| 16 | endoplasmic reticulum | 1.91E-04 | 4.21E-03 | YBR229C, YCR011C, YCR044C, YDL204W, YIL124W, YML128C, YMR134W |
| mitochondrion | 7.89E-04 | 1.74E-02 | YBR003W, YBR026C, YBR037C, YBR147W, YBR229C, YCR005C, YHL021C, YHR067W, YIL087C, YIL124W, YLR142W, YLR253W, YML128C, YNL073W | |
| 24 | cytosolic large ribosomal subunit (sensu Eukaryota) | 1.11E-45 | 7.07E-44 | YBL027W, YBL092W, YBR031W, YBR191W, YDL075W, YDL081C, YDL082W, YDL130W, YDL136W, YDL191W, YDR012W, YDR382W, YDR418W, YDR471W, YDR500C, YER117W, YHR141C, YJL177W, YJL189W, YKL006W, YKL180W, YKR094C, YLL045C, YLR029C, YLR075W, YLR185W, YLR325C, YLR340W, YLR344W, YLR406C, YLR448W, YML073C, YMR121C, YMR194W, YMR242C, YNL067W, YNL162W, YNL301C, YOL039W, YOL127W, YOR234C, YOR312C, YPL143W, YPL198W, YPR043W, YPR102C |
| cytosolic small ribosomal subunit (sensu Eukaryota) | 7.45E-43 | 4.77E-41 | YBL072C, YBR048W, YBR181C, YBR189W, YCR031C, YDL061C, YDL083C, YDR025W, YDR064W, YDR447C, YDR450W, YER074W, YER102W, YER131W, YGR118W, YGR214W, YIL069C, YJL136C, YJL190C, YJR123W, YJR145C, YKL156W, YKR057W, YLR048W, YLR167W, YLR333C, YLR388W, YLR441C, YML024W, YML026C, YML063W, YMR116C, YMR143W, YMR230W, YNL096C, YNL302C, YOL040C, YOR167C, YOR293W, YOR369C, YPL081W, YPL090C | |
| ribosome | 4.84E-06 | 3.10E-04 | YAL003W, YBR118W, YDR385W, YEL034W, YKL056C, YLR249W, YOL139C, YPR163C | |
| 28 | condensed nuclear chromosome | 2.75E-04 | 2.20E-03 | YHR157W, YPL194W |
| 31 | chromosome, telomeric region | 4.77E-04 | 1.43E-02 | YAR007C, YNL312W |
| nuclear cohesin complex | 3.79E-05 | 1.14E-03 | YDL003W, YFL008W, YIL026C | |
| DNA replication factor A complex | 4.77E-04 | 1.43E-02 | YAR007C, YNL312W | |
| replication fork | 2.99E-04 | 8.97E-03 | YDL164C, YKL108W, YLR103C | |
| 34 | mitochondrial inner membrane | 4.76E-04 | 2.14E-02 | YDL198C, YDR197W, YER058W, YER141W, YOL027C, YPR011C |
| mitochondrion | 5.60E-06 | 2.52E-04 | YDL198C, YDR194C, YDR197W, YDR301W, YDR322W, YDR505C, YER058W, YER141W, YGL187C, YHR147C, YJR048W, YKL150W, YLR168C, YML030W, YML052W, YML120C, YMR098C, YMR145C, YMR188C, YNL306W, YNR036C, YOL009C, YOL027C, YOL038W, YOL059W, YPR011C | |
| 35 | retrotransposon nucleocapsid | 3.41E-06 | 9.89E-05 | YCL020W, YER160C, YJR026W, YJR028W, YML040W, YOR142W |
| 36 | glycine cleavage complex | 1.94E-05 | 4.46E-04 | YAL044C, YDR019C, YMR189W |
| 41 | nuclear exosome (RNase complex) | 4.93E-05 | 2.02E-03 | YNL251C, YOL021C, YOR001W |
| 54 | MCM complex | 1.19E-04 | 4.65E-03 | YBR202W, YGL201C, YLR274W |
| pre-replicative complex | 9.21E-04 | 3.59E-02 | YBR202W, YGL201C, YLR274W | |
| 56 | central plaque of spindle pole body | 3.03E-05 | 8.78E-04 | YDR356W, YKL042W, YPL124W |
Annotations of molecular function ontology for biclusters identified by the proposed algorithm for multiplicative models at p-value < 0.001.
| Bicluster index | Annotation | P-value | Corrected P-value | Genes |
| 1 | DNA-directed RNA polymerase activity | 1.62E-05 | 2.62E-03 | YBR154C, YDL140C, YDL150W, YDR045C, YDR156W, YJL140W, YNL113W, YNR003C, YOR224C, YOR341W |
| snoRNA binding | 1.55E-04 | 2.51E-02 | YBL004W, YDL153C, YDR449C, YGR090W, YLR222C, YML093W, YMR093W, YPR112C, YPR137W | |
| 3 | MAP kinase kinase activity | 9.00E-04 | 4.50E-02 | YJL128C, YPL140C |
| 7 | ATPase activity, coupled | 3.55E-04 | 6.04E-03 | YLL026W, YPL240C |
| aldo-keto reductase activity | 3.55E-04 | 6.04E-03 | YHR104W, YOR120W | |
| 11 | chromatin binding | 6.21E-05 | 3.66E-03 | YEL032W, YJL081C, YLR002C, YLR274W, YPR019W |
| 14 | protein phosphatase type 2C activity | 4.51E-04 | 6.77E-03 | YCR079W, YDL006W |
| 24 | structural constituent of ribosome | 3.69E-75 | 1.88E-73 | YBL027W, YBL038W, YBL072C, YBL092W, YBR031W, YBR048W, YBR181C, YBR189W, YBR191W, YCR031C, YDL061C, YDL075W, YDL081C, YDL082W, YDL083C, YDL130W, YDL136W, YDL191W, YDR012W, YDR025W, YDR064W, YDR382W, YDR418W, YDR447C, YDR450W, YDR471W, YDR500C, YER074W, YER102W, YER117W, YER131W, YGR118W, YGR214W, YHR141C, YIL069C, YJL136C, YJL177W, YJL189W, YJL190C, YJR123W, YJR145C, YKL006W, YKL156W, YKL180W, YKR057W, YKR094C, YLL045C, YLR029C, YLR048W, YLR075W, YLR167W, YLR185W, YLR325C, YLR333C, YLR340W, YLR344W, YLR388W, YLR406C, YLR441C, YLR448W, YML024W, YML026C, YML063W, YML073C, YMR121C, YMR143W, YMR194W, YMR230W, YMR242C, YNL067W, YNL096C, YNL162W, YNL301C, YNL302C, YOL039W, YOL040C, YOL127W, YOR167C, YOR234C, YOR293W, YOR312C, YOR369C, YPL081W, YPL090C, YPL143W, YPL198W, YPR043W, YPR102C |
| translation elongation factor activity | 4.78E-04 | 2.44E-02 | YAL003W, YBR118W, YDR385W, YLR249W | |
| DNA helicase activity | 6.99E-05 | 3.57E-03 | YDR545W, YER190W, YLR467W, YNL339C, YPL283C, YPR204W | |
| RNA binding | 4.35E-04 | 2.22E-02 | YAR009C, YCR031C, YDL208W, YJR027W, YKL006W, YLR029C, YLR344W, YLR448W, YML039W, YML045W, YML073C, YMR045C, YMR050C, YMR121C, YMR194W, YOL127W | |
| helicase activity | 1.61E-08 | 8.22E-07 | YBL113C, YEL077C, YIL177C, YJL225C, YLL066C, YLL067C, YML133C | |
| 35 | RNA binding | 3.28E-05 | 1.05E-03 | YCL020W, YER160C, YJR026W, YJR028W, YML040W, YMR290C, YOR142W, YPR107C |
| 36 | glycine dehydrogenase (decarboxylating) activity | 1.94E-05 | 5.62E-04 | YAL044C, YDR019C, YMR189W |
| 50 | long-chain-fatty-acid-CoA ligase activity | 5.58E-04 | 1.06E-02 | YIL009W, YOR317W |
| glycerol-3-phosphate dehydrogenase (NAD+) activity | 1.88E-04 | 3.56E-03 | YDL022W, YOL059W | |
| citrate (Si)-synthase activity | 5.58E-04 | 1.06E-02 | YCR005C, YNR001C | |
| 52 | structural constituent of cytoskeleton | 5.63E-04 | 9.57E-03 | YDR016C, YGR113W, YHR129C, YNL126W |
| 54 | ATP-dependent DNA helicase activity | 3.25E-04 | 1.11E-02 | YBR202W, YGL201C, YLR274W |
| ATP binding | 3.25E-04 | 1.11E-02 | YBR202W, YDR097C, YNL082W | |
| 56 | ATP binding | 1.64E-04 | 3.78E-03 | YDR097C, YNL082W, YOL090W |
| structural constituent of cytoskeleton | 4.43E-05 | 1.02E-03 | YDR356W, YGR113W, YKL042W, YOR373W, YPL124W | |
| guanine/thymine mispair binding | 2.17E-04 | 5.00E-03 | YDR097C, YOL090W | |
| single base insertion or deletion binding | 2.17E-04 | 5.00E-03 | YDR097C, YOL090W | |
| four-way junction DNA binding | 2.17E-04 | 5.00E-03 | YDR097C, YOL090W | |
| 57 | structural constituent of cytoskeleton | 7.39E-04 | 1.85E-02 | YBL063W, YMR117C, YOR373W, YPL124W |
| 58 | copper ion binding | 6.45E-04 | 1.42E-02 | YBR037C, YBR295W |
| 59 | guanine/thymine mispair binding | 1.97E-04 | 4.73E-03 | YDR097C, YOL090W |
| four-way junction DNA binding | 1.97E-04 | 4.73E-03 | YDR097C, YOL090W | |
| single base insertion or deletion binding | 1.97E-04 | 4.73E-03 | YDR097C, YOL090W |
Figure 15Biclustering accuracy (solid line) and the ACV (dashed line) against noise threshold ε.