Sandra L Rodriguez-Zas, Bruce R Southey, Charles W Whitfield, Gene E Robinson.
Abstract
BACKGROUND: A semiparametric approach was used to identify groups of cDNAs and genes with distinct expression profiles across time and overcome the limitations of clustering to identify groups. The semiparametric approach allows the generalization of mixtures of distributions while making no specific parametric assumptions about the distribution of the hidden heterogeneity of the cDNAs. The semiparametric approach was applied to study gene expression in the brains of Apis mellifera ligustica honey bees raised in two colonies (A. m. mellifera and ligustica) with consistent patterns across five maturation ages.
Year: 2006 PMID: 16970825 PMCID: PMC1592090 DOI: 10.1186/1471-2164-7-233
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of Bayesian information criterion (BIC), Akaike information criterion (AIC) and mean square error (MSE) by number of groups for the mellifera and ligustica data sets.
| Data set | ||||||
| Groups | BIC | AIC | MSE | BIC | AIC | MSE |
| 5 | -3103.31 | -3049.92 | 0.586 | -3065.63 | -3012.24 | 0.586 |
| 6 | -2964.67 | -2900.60 | 0.526 | -2961.00 | -2896.94 | 0.526 |
| 7 | -2894.63 | -2819.89 | 0.493 | -2902.09 | -2827.35 | 0.493 |
| 8 | -2843.36 | -2757.94 | 0.466 | -2851.83 | -2766.41 | 0.466 |
| 9 | -2727.89 | -2633.44 | 0.432 | -2737.53 | -2641.43 | 0.443 |
| 10 | -2637.40 | -2530.63 | 0.409 | -2665.38 | -2558.61 | 0.416 |
| 11 | -2668.76 | -2551.31 | 0.403 | -2679.47 | -2562.02 | 0.406 |
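The comparison across group counts can be sketched in a few lines. This is a minimal illustration, not the authors' procedure: it assumes the convention in which larger (less negative) BIC and AIC values indicate the preferred model, which is consistent with the 10 groups reported in the figure captions. It uses only the left-hand BIC and AIC columns of the table, and the names `criteria` and `best_group_count` are hypothetical.

```python
# Left-hand BIC and AIC columns of the table, keyed by number of groups.
# Each value is a (BIC, AIC) pair copied from the table.
criteria = {
    5: (-3103.31, -3049.92),
    6: (-2964.67, -2900.60),
    7: (-2894.63, -2819.89),
    8: (-2843.36, -2757.94),
    9: (-2727.89, -2633.44),
    10: (-2637.40, -2530.63),
    11: (-2668.76, -2551.31),
}


def best_group_count(table, index):
    """Return the group count with the largest criterion value.

    index 0 selects BIC, index 1 selects AIC. "Largest is best" is an
    assumption about the sign convention, not something the table states.
    """
    return max(table, key=lambda groups: table[groups][index])


best_by_bic = best_group_count(criteria, 0)
best_by_aic = best_group_count(criteria, 1)
print(best_by_bic, best_by_aic)
```

Under that assumption both criteria peak at 10 groups and worsen at 11, while MSE keeps decreasing as groups are added, so MSE alone would not indicate a stopping point.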
Figure 1. Expected cDNA expression trajectories for each of the 10 groups identified in the ligustica (L) data set. The legend indicates the group number and the estimated number of cDNAs per group.
Figure 2. Expected cDNA expression trajectories for each of the 10 groups identified in the mellifera (M) data set. The legend indicates the group number and the estimated number of cDNAs per group.
Figure 3. Expected cDNA expression trajectories for each of the 10 clusters identified in the ligustica (L) data set using a clustering approach. The legend indicates the group number and the estimated number of cDNAs per group.
Figure 4. Expected cDNA expression trajectories for each of the 10 clusters identified in the mellifera (M) data set using a clustering approach. The legend indicates the group number and the estimated number of cDNAs per group.
Estimates, standard error (SE) and P-value of the intercept, linear, quadratic and cubic terms describing the 10 groups using a semiparametric approach in the ligustica and mellifera data sets.
| | | Intercept | | | Linear | | | Quadratic | | | Cubic | | |
| Group | Dataset | Estimate | SE | P-value | Estimate | SE | P-value | Estimate | SE | P-value | Estimate | SE | P-value |
| 1 | | -1.4922 | 0.0774 | <0.0001 | 0.0813 | 0.0403 | 0.044 | -0.0092 | 0.0058 | 0.1144 | 0.0003 | 0.0002 | 0.122 |
| | | -1.4635 | 0.0704 | <0.0001 | 0.0685 | 0.0372 | 0.0654 | -0.0078 | 0.0054 | 0.1541 | 0.0003 | 0.0002 | 0.1223 |
| 2 | | -1.0266 | 0.0624 | <0.0001 | 0.1773 | 0.0351 | <0.0001 | -0.017 | 0.0052 | 0.0011 | 0.0006 | 0.0002 | 0.0025 |
| | | -0.9844 | 0.0612 | <0.0001 | 0.2214 | 0.0335 | <0.0001 | -0.0232 | 0.005 | <0.0001 | 0.0009 | 0.0002 | <0.0001 |
| 3 | | -0.0964 | 0.0685 | 0.159 | -0.2606 | 0.0339 | <0.0001 | 0.025 | 0.0049 | <0.0001 | -0.0008 | 0.0002 | 0 |
| | | -0.0233 | 0.0599 | 0.6969 | -0.2479 | 0.0307 | <0.0001 | 0.0231 | 0.0046 | <0.0001 | -0.0007 | 0.0002 | <0.0001 |
| 4 | | 0.2749 | 0.1579 | 0.0818 | 0.9044 | 0.0956 | <0.0001 | -0.1 | 0.0144 | <0.0001 | 0.0025 | 0.0006 | <0.0001 |
| | | -1.1858 | 0.1465 | <0.0001 | 0.5999 | 0.0886 | <0.0001 | -0.023 | 0.0134 | 0.0864 | -0.0006 | 0.0005 | 0.2179 |
| 5 | | 0.7813 | 0.0536 | <0.0001 | -0.2726 | 0.0289 | <0.0001 | 0.027 | 0.0043 | <0.0001 | -0.0009 | 0.0002 | <0.0001 |
| | | 0.9021 | 0.0553 | <0.0001 | -0.2764 | 0.0284 | <0.0001 | 0.0272 | 0.0042 | <0.0001 | -0.0009 | 0.0002 | <0.0001 |
| 6 | | -0.3338 | 0.0702 | <0.0001 | 0.2797 | 0.0328 | <0.0001 | -0.0255 | 0.0049 | <0.0001 | 0.0008 | 0.0002 | <0.0001 |
| | | -0.1963 | 0.0593 | 0.0009 | 0.2989 | 0.033 | <0.0001 | -0.0337 | 0.0049 | <0.0001 | 0.0013 | 0.0002 | <0.0001 |
| 7 | | 1.5573 | 0.0526 | <0.0001 | -0.2004 | 0.0281 | <0.0001 | 0.02 | 0.0042 | <0.0001 | -0.0007 | 0.0002 | <0.0001 |
| | | 1.612 | 0.0588 | <0.0001 | -0.1945 | 0.0281 | <0.0001 | 0.0183 | 0.0042 | <0.0001 | -0.0006 | 0.0002 | 0.0001 |
| 8 | | 0.8752 | 0.0685 | <0.0001 | 0.3204 | 0.0391 | <0.0001 | -0.0336 | 0.0058 | <0.0001 | 0.0012 | 0.0002 | <0.0001 |
| | | 1.0052 | 0.0671 | <0.0001 | 0.286 | 0.0394 | <0.0001 | -0.0309 | 0.0059 | <0.0001 | 0.0011 | 0.0002 | <0.0001 |
| 9 | | 2.5404 | 0.0607 | <0.0001 | -0.1607 | 0.0341 | <0.0001 | 0.0164 | 0.0051 | 0.0013 | -0.0006 | 0.0002 | 0.002 |
| | | 2.636 | 0.0569 | <0.0001 | -0.2106 | 0.0328 | <0.0001 | 0.0204 | 0.0049 | <0.0001 | -0.0007 | 0.0002 | 0.0003 |
| 10 | | 2.2781 | 0.1012 | <0.0001 | 0.4776 | 0.0615 | <0.0001 | -0.055 | 0.0092 | <0.0001 | 0.0019 | 0.0004 | <0.0001 |
| | | 2.4787 | 0.1008 | <0.0001 | 0.2641 | 0.0606 | <0.0001 | -0.029 | 0.009 | 0.0013 | 0.0011 | 0.0004 | 0.0026 |
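Each row of the table defines a cubic polynomial in age, so an expected trajectory can be evaluated directly from the four estimates. The sketch below does this for the first row listed for group 1. The ages are hypothetical placeholders, since the table does not give the sampling ages or their units. The second helper approximates a two-sided P-value from an estimate and its SE using a normal (Wald) reference distribution. That is an assumption about how the tabulated P-values could arise, not a statement of the authors' test, and the function names are illustrative.

```python
import math


def trajectory(intercept, linear, quadratic, cubic, age):
    """Expected expression at a given age from the cubic polynomial terms."""
    return intercept + linear * age + quadratic * age**2 + cubic * age**3


def wald_p_value(estimate, se):
    """Two-sided P-value for estimate / SE under a standard normal.

    Assumption: a normal reference distribution; the source does not
    state which test produced the tabulated P-values.
    """
    z = abs(estimate / se)
    return math.erfc(z / math.sqrt(2.0))


# First row listed for group 1: intercept, linear, quadratic, cubic estimates.
group1 = (-1.4922, 0.0813, -0.0092, 0.0003)

# Hypothetical ages, used only to illustrate the evaluation.
for age in (0, 5, 10, 15):
    print(age, round(trajectory(*group1, age), 4))

# Linear term of the same row: estimate 0.0813, SE 0.0403.
p_linear = wald_p_value(0.0813, 0.0403)
print(round(p_linear, 3))
```

With the linear term of that row the approximation lands close to the tabulated 0.044. Small differences for other terms are expected because the estimates and SEs in the table are rounded.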
Number of cDNAs, corresponding genes based on the honey bee genome and genes with Gene Ontology (GO) information for each expression trajectory group assigned by the semiparametric and regression-clustering approaches in the mellifera and ligustica data sets.
| | Semiparametric | | | Clustering | | Semiparametric | | | Clustering | |
| Group | cDNA | Genes | GO | cDNA | Genes | cDNA | Genes | GO | cDNA | Genes |
| 1 | 49 | 26 | 11 | 80 | 25 | 49 | 28 | 12 | 47 | 42 |
| 2 | 57 | 33 | 22 | 102 | 40 | 57 | 31 | 22 | 39 | 24 |
| 3 | 69 | 39 | 20 | 47 | 40 | 67 | 36 | 21 | 73 | 31 |
| 4 | 8 | 7 | 0 | 54 | 8 | 7 | 7 | 0 | 80 | 7 |
| 5 | 81 | 50 | 27 | 79 | 52 | 81 | 49 | 27 | 57 | 50 |
| 6 | 61 | 31 | 12 | 59 | 34 | 62 | 32 | 12 | 83 | 28 |
| 7 | 84 | 48 | 36 | 15 | 33 | 88 | 52 | 33 | 70 | 60 |
| 8 | 41 | 24 | 16 | 39 | 22 | 44 | 26 | 14 | 64 | 26 |
| 9 | 61 | 38 | 24 | 47 | 48 | 57 | 36 | 26 | 7 | 30 |
| 10 | 18 | 8 | 5 | 7 | 2 | 17 | 7 | 6 | 9 | 6 |
Distribution of all 304 known genes and 173 genes with Gene Ontology (GO) information across semiparametric groups and regression clusters for the mellifera and ligustica data sets.
| | Semiparametric | | Clustering | Semiparametric | | Clustering |
| Group | Gene | GO | Gene | Gene | GO | Gene |
| 1 | 26 | 12 | 25 | 28 | 11 | 42 |
| 2 | 33 | 22 | 40 | 31 | 22 | 24 |
| 3 | 39 | 21 | 40 | 36 | 20 | 31 |
| 4 | 7 | 0 | 8 | 7 | 0 | 7 |
| 5 | 50 | 27 | 52 | 49 | 27 | 50 |
| 6 | 31 | 12 | 34 | 32 | 12 | 28 |
| 7 | 48 | 33 | 33 | 52 | 36 | 60 |
| 8 | 24 | 14 | 22 | 26 | 16 | 26 |
| 9 | 38 | 26 | 48 | 36 | 24 | 30 |
| 10 | 8 | 6 | 2 | 7 | 5 | 6 |
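The totals quoted in the caption (304 known genes, 173 genes with GO information) can be checked against the column sums of the table. The sketch below is a simple arithmetic check with values copied from the rows above; the column labels in `columns` are descriptive names chosen here, and the two data sets are called "first" and "second" only by their position in the table.

```python
# Rows of the table: group number followed by the six count columns, in order
# (Gene, GO, clustering Gene) for the first data set, then the second.
rows = [
    (1, 26, 12, 25, 28, 11, 42),
    (2, 33, 22, 40, 31, 22, 24),
    (3, 39, 21, 40, 36, 20, 31),
    (4, 7, 0, 8, 7, 0, 7),
    (5, 50, 27, 52, 49, 27, 50),
    (6, 31, 12, 34, 32, 12, 28),
    (7, 48, 33, 33, 52, 36, 60),
    (8, 24, 14, 22, 26, 16, 26),
    (9, 38, 26, 48, 36, 24, 30),
    (10, 8, 6, 2, 7, 5, 6),
]

columns = [
    "first: semiparametric genes",
    "first: semiparametric GO",
    "first: clustering genes",
    "second: semiparametric genes",
    "second: semiparametric GO",
    "second: clustering genes",
]

# Drop the group number, then sum each remaining column across the 10 groups.
totals = [sum(column) for column in zip(*(row[1:] for row in rows))]

for label, total in zip(columns, totals):
    print(f"{label}: {total}")
```

Every gene column sums to 304 and both GO columns sum to 173, matching the caption.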