| Literature DB >> 20932287 |
Olivier Arnaiz1, Jean-François Goût, Mireille Bétermier, Khaled Bouhouche, Jean Cohen, Laurent Duret, Aurélie Kapusta, Eric Meyer, Linda Sperling.
Abstract
BACKGROUND: The genome of Paramecium tetraurelia, a unicellular model that belongs to the ciliate phylum, has been shaped by at least 3 successive whole genome duplications (WGD). These dramatic events, which have also been documented in plants, animals and fungi, are resolved over evolutionary time by the loss of one duplicate for the majority of genes. Thanks to a low rate of large scale genome rearrangement in Paramecium, an unprecedented large number of gene duplicates of different ages have been identified, making this organism an outstanding model to investigate the evolutionary consequences of polyploidization. The most recent WGD, with 51% of pre-duplication genes still in 2 copies, provides a snapshot of a phase of rapid gene loss that is not accessible in more ancient polyploids such as yeast.Entities:
Mesh:
Year: 2010 PMID: 20932287 PMCID: PMC3091696 DOI: 10.1186/1471-2164-11-547
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Hierarchical clusterization of genes differentially expressed during autogamy. The 2467 genes most differentially expressed during autogamy (treat model fold-change of 2, FDR < 0.05) were hierarchically clustered as described in Methods. The heatmap displays the samples as columns and the genes as rows. The color code goes from dark blue for the lowest expression to dark red for the highest expression. The clusterization of the samples corresponds to the classification based on nuclear morphology, although the DEV2 and DEV3 samples are not resolved. The 6 clusters of co-expressed genes were obtained by cutting the gene dendrogram on the left of the heatmap as indicated. On the right hand side, the average expression profile and standard deviation for each cluster are drawn as they appear in ParameciumDB [65].
Differential Expression Analysis.
| Name | Time points | Method | P-Value | Fold-change | Number of genes |
|---|---|---|---|---|---|
| Autogamy | 6 | eBayes | 0.05 | 20164 | |
| Autogamy | 6 | treat | 0.05 | 1.5 | 5558 |
| Autogamy | 6 | treat | 0.05 | 2 | 2467 |
| Reciliation | 3 | eBayes | 0.05 | 1212 | |
| Exocytosis recovery | 3 | eBayes | 0.05 | 526 | |
The limma package was used for all analyses, with either the eBayes [33] or the treat [35] statistical models and the model parameters (p-Value, Fold-change) indicated. The eBayes model does not incorporate a Fold-change. The p-value is adjusted for multiple testing [34] and is therefore equivalent to a false discovery rate (FDR).
Clusters of differentially expressed genes.
| Experiment | Cluster Name | Number of genes |
|---|---|---|
| Autogamy | Early peak | 373 |
| Early induction | 97 | |
| Intermediate induction | 583 | |
| Late induction | 36 | |
| Early repression | 252 | |
| Late repression | 1126 | |
| Reciliation | Early peak | 264 |
| Gradual induction | 695 | |
| Repression | 253 | |
| Exocytosis | Induced | 432 |
| Repressed | 94 |
The clusters were determined by hierarchical clustering of differentially expressed genes, as shown in Figure 1 for the autogamy experiment. Note that for the autogamy experiment, the 2467 genes identified using a Treat model fold-change of 2 were used for hierarchical clustering.
Developmentally regulated P. tetraurelia genes.
| Transcript Accession | Synonym | Northern profile | Biological Process | Molecular Function | Cluster Name | Reference |
|---|---|---|---|---|---|---|
| GSPATT00006994001 | EARLY PEAK | programming DNA elimination | RNA-binding | Early induction | Nowacki et al. 2005 | |
| GSPATT00016668001 | EARLY PEAK | programming DNA elimination | RNA-binding | Early induction | Nowacki et al. 2005 | |
| GSPATT00008494001 | EARLY PEAK | programming DNA elimination | ribonuclease III activity | Early peak | Lepere et al. 2008 | |
| GSPATT00027456001 | EARLY PEAK | programming DNA elimination | ribonuclease III activity | Early peak | Lepere et al. 2008 | |
| GSPATT00021895001 | EARLY PEAK | programming DNA elimination | RNA-binding | Early peak | Bouhouche et al. (d) | |
| GSPATT00001395001 | EARLY PEAK | ? | RNA-binding | Early peak | Bouhouche et al. (d) | |
| GSPATT00020796001 | EARLY PEAK | programming DNA elimination | RNA-binding | Early peak | Bouhouche et al. (d) | |
| GSPATT00000299001 | EARLY PEAK | MAC development | RNA helicase | Early peak | Nowak et al. 2010 | |
| GSPATT00009108001 | EARLY PEAK | meiosis | endonuclease | Early peak | Baudry et al. 2009 | |
| GSPATT00007001001 | EARLY PEAK (c) | DNA elimination | SUMOylation | Early peak | Matsuda et al. 2006 | |
| GSPATT00016666001 | EARLY PEAK (c) | DNA elimination | SUMOylation | Early peak | Matsuda et al. 2006 | |
| GSPATT00013187001 | EARLY PEAK (c) | DNA elimination | SUMOylation | Early peak | Matsuda et al. 2006 | |
| GSPATT00000555001 | EARLY PEAK | ? | ? | not found (a) | Nowak et al. 2010 | |
| GSPATT00000151001 | EARLY PEAK | DNA mismatch repair | DNA-binding | not found (b) | Nowak et al. 2010 | |
| GSPATT00016627001 | LATE PEAK | DNA elimination | endonuclease | Intermediate induction | Baudry et al. 2009 | |
| GSPATT00024933001 | LATE PEAK (c) | DNA elimination | ? | Intermediate induction | Matsuda et al. 2010 | |
| GSPATT00026720001 | LATE PEAK (c) | DNA elimination | ? | Intermediate induction | Matsuda et al. 2010 | |
| GSPATT00021288001 | LATE PEAK | ? | RNA-binding | Intermediate induction | Bouhouche et al. (d) | |
| GSPATT00000552001 | GRADUAL INDUCTION | ? | ? | Intermediate induction | Nowak et al. (e) | |
| GSPATT00000388001 | GRADUAL INDUCTION | ? | ? | Intermediate induction | Nowak et al. (e) | |
| GSPATT00000358001 | LATE PEAK | ? | ? | Intermediate induction | Nowak et al. (e) | |
| GSPATT00000301001 | LATE PEAK | ? | ? | Intermediate induction | Nowak et al. (e) | |
| GSPATT00000022001 | LATE PEAK | ? | ? | Intermediate induction | Nowak et al. (e) | |
| GSPATT00009468001 | LATE PEAK | ? | RNA-binding | Late induction | Bouhouche et al. (d) | |
| GSPATT00019939001 | LATE PEAK | ? | RNA-binding | Late induction | Bouhouche et al. (d) |
Developmentally regulated genes with published Northern blots are given along with the cluster in which the gene is found in the autogamy experiment, if any. The biological process and the likely molecular function are given when possible. a) This gene figures among the set of 5558 differentially expressed genes, is up-regulated early in autogamy (p-value = 0.003), but was not present in the set of 2467 most differentially expressed genes used for hierarchical clustering. b) This MSH2 homolog was not found to be differentially expressed. Examination of the data indicates that the gene is probably up-regulated at meiosis in some, but not all, autogamy experiments. c) The SUMO and DIE5 Northern blots are of conjugation, not autogamy. d) Bouhouche K, Goût J, Kapusta A, Bétermier M, Meyer E: Functional specialization of Piwi proteins in Paramecium tetraurelia from post-transcriptional gene silencing to genome remodeling, submitted; e) Nowak JK, Gromadka R, Juszczuk M, Jerka-Dziadosz M, Maliszewska K, Mucchielli M, Goût JF, Arnaiz O, Agier N, Tang T, Aggerbeck L, Cohen J, Delacroix H, Sperling L, Herbert CJ, Zagulski M, Bétermier M: A chromosome-wide study of genes essential for meiosis and nuclear reorganization in Paramecium, submitted. Published data was taken from references [66-69].
Figure 2Genes differentially expressed in more than one experiment. The Venn diagrams show the overlap between differentially expressed genes identified by each of the 3 experiments. For the autogamy experiment, the set of 2467 most differentially expressed genes was used. A) All differentially expressed genes B) Up-regulated genes.
Retention of differentially expressed genes after whole genome duplication.
| WGD1 | WGD2 | WGD3 | Genes | |
|---|---|---|---|---|
| Differentially expressed genes | ||||
| Reciliation | 67% | 28% | 7% | 1212 |
| Exocytosis recovery | 68% | 32% | 9% | 526 |
| Autogamy | 69% | 35% | 9% | 2467 |
| Genome | 61% | 32% | 8% | 39642 |
The percentage of genes differentially expressed during the biological processes studied that have retained at least one ohnolog from a whole genome duplication event is given along with the total number of genes in the category ("Genes"). The last row in the table shows across the whole genome the percentage of genes that have retained ohnologs after each of the WGD events. WGD1, recent whole genome duplication; WGD2, intermediate whole genome duplication; WGD3, old whole genome duplication. Note that a given gene can have 0 or 1 ohnolog from the recent WGD, 0 to 2 ohnologs from the intermediate WGD and 0 to 4 ohnologs from the old WGD.
Differential expression of ohnologs.
| Category | AUTOGAMY | RECILIATION | EXOCYTOSIS |
|---|---|---|---|
| Genes with WGD1 ohnolog | 1695 | 817 | 355 |
| | |||
| | |||
| | |||
| Genes with WGD2 ohnolog | 875 | 339 | 170 |
| | |||
| | |||
| | |||
| | |||
| Genes with WGD3 ohnolog | 205 | 85 | 44 |
| | |||
| | |||
| |
The ohnologs of differentially expressed genes were examined to see whether they belong to the same cluster, a different cluster, or were not identified as differentially expressed, for each of the three biological processes studied. For WGD3, genes were scored as "same cluster" or as "different cluster" if at least one ohnolog was found in the same cluster or in a different cluster, respectively. WGD1, recent whole genome duplication; WGD2, intermediate whole genome duplication; WGD3, old whole genome duplication.