| Literature DB >> 17925013 |
Ivan G Costa1, Stefan Roepcke, Alexander Schliep.
Abstract
BACKGROUND: The regulatory processes that govern cell proliferation and differentiation are central to developmental biology. Particularly well studied in this respect is the lymphoid system due to its importance for basic biology and for clinical applications. Gene expression measured in lymphoid cells in several distinguishable developmental stages helps in the elucidation of underlying molecular processes, which change gradually over time and lock cells in either the B cell, T cell or Natural Killer cell lineages. Large-scale analysis of these gene expression trees requires computational support for tasks ranging from visualization, querying, and finding clusters of similar genes, to answering detailed questions about the functional roles of individual genes.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17925013 PMCID: PMC2244641 DOI: 10.1186/1471-2172-8-25
Source DB: PubMed Journal: BMC Immunol ISSN: 1471-2172 Impact factor: 3.615
Figure 1Schematic view of lymphocyte cell development. Developmental stages are depicted as nodes and arrows indicate transition from one stage to another, i.e. specialization. Self-renewing hematopoietic stem cells give rise to T cells in the thymus (green), B cells in the bone marrow (blue) and natural killer cells (NK) via intermediate stages. DN stands for CD4-/CD8-double negative cells, DPL for CD4+/CD8+ double positive large cells, and DPS for CD4+/CD8+ double positive small cells. Cell surface antigens and rearrangement events are partially annotated. The expression data sets investigated in this paper are marked as follows: green ovals for TCell, blue ovals for BCell, and pink boxes for LymphoidTree. We do not investigate developmental stages and transitions depicted in grey.
Figure 2Example of a simple developmental tree and a cluster of developmental profiles. On the left, we depict a simple development tree, where arrows represent dependencies between variables. Above each tree variable, we depict a distribution related to it. On the right, we display the gene expression values (y-axis) in the distinct development stages (x-axis). Each line corresponds to the developmental profile of a given gene of a particular path of the tree in the left, as in a time-course plot. Distinct paths have distinct colors, in correspondence with the tree on the left. In this particular example, we have the path A, B and C in green and B and D in red. By superimposing the lines corresponding to paths B to C and B to D, we can contrast the differences in expression values of genes in these two alternative differentiation pathways.
Figure 3Example of a mixture of four dependence trees with the topology defined in Fig. 2. Each of the trees models distinct developmental profiles found in an example data set. Furthermore, clusters may have distinct sizes proportional to their α's. Note also that it is not necessary that clusters have distinct expression values in branching stages. For example, stages C and D have similar expression values for cluster 3 and 4. This can be interpreted as the genes being equally expressed in the two alternative lineages.
Figure 4Selected clusters from MixDTrees on Tcell. We depict the clusters 5, 8 and 18 found in TCell, expression values on the y-axis, and cell types on the x-axis. Lines corresponding to developmental profile values between stages DN2, DN3, DN4, DPL, DPS and SP4 are in green and between DPS and SP8 in red.
Figure 5Selected clusters from from MixDTrees on Bcell. We depict clusters 3, 5, 6 and 20 found in BCell, expression values on the y-axis, and cell types on the x-axis. Lines corresponding to developmental profile values between between all stages are in red.
Figure 6Selected clusters from from MixDTrees on LymphoidTree. We depict clusters 11 and 19 found in LymphoidTree, expression values on the y-axis, and cell types on the x-axis. Lines corresponding to developmental profile values between stages HSC, pro-B, pre-B and immature B cell are in read, between HSC and NK cells in blue, and between HSC and SP4 cells in green.
Figure 7Results of SIM. We display the mean sensitivity (left plots) and mean specificity (right plots) against five experimental settings: (1) w∈ [-ε, ε ] (independent data), (2) w∈ [-0.5, 0.5], (3) w∈ [-1, 1], (4) w∈ [-1.0, -0.5] ∪ [0.5, 1] and (5) w∈ [-1, -1 + ε] ∪ [1 – ε, 1]. The dependence increases with experiment number. On the top plots, k-means results are displayed in blue, SOM in green and mixture of dependence trees with MAP estimation (MixDTrees) in red. On the bottom plots, mixture of Gaussians with full covariance matrices are displayed in yellow, mixture of Gaussians with diagonal covariance matrices in purple, Mixture of dependence trees with MLE estimation in light blue (MixDTrees-MLE) and mixture of dependence trees with MAP estimation (MixDTrees-MAP) in red.
Figure 8Strategy to identify enriched microRNAs. Strategy to identify microRNAs and their target genes overrepresented in groups of co-expressed genes (indicated left) as part of a post-transcriptional regulatory mechanism. In the middle mRNAs clustered according to our mixture results are depicted and potential microRNA binding sites in their 3'UTRs are symbolized.
List of LympMIR enriched in the clusters from MixDTrees on data sets TCell and BCell
| Cluster ID | MicroRNA | Target Genes |
| TCell 3 | miR-222 | |
| TCell 5 | miR-15a1, miR-181a2, miR-2213, | |
| miR-244, miR-26a5 | ||
| TCell 10 | miR-142-3p6, miR-1507 | |
| TCell 11 | miR-1468, miR-169, miR-181b10 | |
| BCell 3 | miR-181b1, miR-181c2, miR-26a3 | |
| BCell 5 | miR-15a4, miR-15b5, miR-2216, | |
| miR-2237 | ||
| BCell 6 | miR-1558, miR-1919 | |
| BCell 19 | miR-142-3p14, miR-34215 | |
We display the cluster and data set id, the list of microRNA and list of target genes, with p-values <0.05 and at least four target genes per cluster. Genes involved in cell proliferation or DNA repair are depicted in bold. The indices indicates to which microRNA a gene is related, when there is more than one enriched microRNA in a cluster.