| Literature DB >> 19832986 |
Peter von Dassow1, Hiroyuki Ogata, Ian Probert, Patrick Wincker, Corinne Da Silva, Stéphane Audic, Jean-Michel Claverie, Colomban de Vargas.
Abstract
BACKGROUND: Eukaryotes are classified as either haplontic, diplontic, or haplo-diplontic, depending on which ploidy levels undergo mitotic cell division in the life cycle. Emiliania huxleyi is one of the most abundant phytoplankton species in the ocean, playing an important role in global carbon fluxes, and represents haptophytes, an enigmatic group of unicellular organisms that diverged early in eukaryotic evolution. This species is haplo-diplontic. Little is known about the haploid cells, but they have been hypothesized to allow persistence of the species between the yearly blooms of diploid cells. We sequenced over 38,000 expressed sequence tags from haploid and diploid E. huxleyi normalized cDNA libraries to identify genes involved in important processes specific to each life phase (2N calcification or 1N motility), and to better understand the haploid phase of this prominent haplo-diplontic organism.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19832986 PMCID: PMC2784329 DOI: 10.1186/gb-2009-10-10-r114
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Origins of Emiliania huxleyi strains
| Strain designation | RCC1216 | RCC1217 |
|---|---|---|
| Strain synonym | TQ26-2N | TQ26-1N |
| Coccolith morphotype | R | NA |
| Origin | Tasman Sea, New Zealand Coast | Clonal isolate from RCC1216 |
| Date of isolation | October, 1998 | July, 1999 |
| Date axenic cultures prepared, and purity of ploidy type ensured | August-October 2007 | August-October 2007 |
| Date of RNA harvest | 11-12 November 2007 | 12-13 December 2007 |
NA, not applicable.
Figure 1Flow cytometry plot showing conditions of cells in cultures on day of harvesting. (a) 1N and, (b) 2N cells (red) were identified by chlorophyll autofluorescence and their forward scatter (FSC) and side scatter (SSC) were compared to 1 μm bead standards (green).
Figure 2Cell cycle changes during the day-night cycle of harvesting. Example DNA content histograms of nuclear extracts taken from 1N cultures at different times are shown. The time point at 15 h on day 1 is not shown but had a similar distribution to that at 19 h on day 1 and 15 h30 on day 2. RNA was not collected at 15 h30 on day 2, but nuclear extracts (shown here), flow cytometric profiles, and Fv/Fm confirmed cells had returned to the same state after a complete diel cycle. Extracted nuclei were stained with Sybr Green I and analyzed by flow cytometry.
EST read characteristics
| RCC1217 1N | RCC1216 2N | |
|---|---|---|
| Number of raw sequences | 19,532 | 19,559 |
| Number of ESTs after trimming, quality control | 19,198 | 19,188 |
| Length of high quality trimmed ESTs, mean ± standard deviation (minimum/maximum) | 599.51 ± 143.14 (50/897) | 563.55 ± 151.37 (55/866) |
| %GC | 64.49 | 64.68 |
Figure 3Venn diagram showing the degree of overlap existing E. huxleyi EST libraries. Included are the libraries analyzed in this study (1N RCC1217 and 2N RCC1216, combined) and the two other publicly available EST libraries (CCMP 1516 and CCMP371). ESTs were considered matching based on BLAT criteria of an alignment length of ≥ 100 nucleotides and ≥ 95% identity. The degrees of overlap increased only very modestly when the BLAT criteria were relaxed to an alignment length of ≥ 50 nucleotides.
EST clusters
| Total | 1N and 2N | 1N only | 2N only | |
|---|---|---|---|---|
| Number of mini-clusters | 16,470 | 3,226 | 6,002 | 7,242 |
| Number of mini-clusters (containing ≥ 2 EST reads) | 6,444 | 3,226 | 1,765 | 1,453 |
| Number of mini-clusters singletons (only 1 read) | 10,026 | 0 | 4,237 | 5,789 |
| Number of clusters | 13,056 | 3,519 | 4,368 | 5,169 |
| Number of clusters (≥ 2 EST reads) | 6,701 | 3,519 | 1,717 | 1,465 |
| Number of clusters singletons (only 1 read) | 6,355 | 0 | 2,651 | 3,704 |
Clusters were generated from the total pool of 1N (RCC1217) and 2N (RCC1216) ESTs. Clusters represented by EST reads in both libraries (1N and 2N) and clusters with representation in only one library (1N only or 2N only) are also shown.
Analysis of transcriptome complexity
| RCC1217 1N | RCC1216 2N | Combined libraries | |
|---|---|---|---|
| Total clusters | 7,887 | 8,688 | 13,056 |
| ML estimate of transcriptome richness | 10,039 | 11,988 | 16,211 |
| Chao1 ± SD (boundaries of 95% CI) | 12,840 ± 214 (12,438, 13,278) | 15,931 ± 289 (15,385, 16,522) | 22,169 ± 314 (21,573, 22,806) |
| Coverage (%) based on richness estimates | 61.4-78.6 | 54.5-72.5 | 58.9-80.5 |
| Shannon diversity (maximum possible) | 8.66 (8.97) | 8.76 (9.06) | 9.05 (9.48) |
The maximum likelihood (ML) estimate of transcriptome richness was calculated following Claverie [23] using the two separate rounds of EST sequencing. The Chao1 estimator of transcriptome richness and the Shannon diversity index was computed for each library separately and for the combined library using EstimateS with the classic formula for Chao1. The range of estimated coverage was calculated by dividing the number of clusters observed by the two estimates of transcriptome richness. The similarity of content of the 1N and 2N libraries was also determined: the Chao abundance-based estimator of the Jaccard similarity index (accounting for estimated proportions of unseen shared and unique transcripts) was 0.506 ± 0.009, calculated with 200 bootstrap replicates and the upper abundance limit for rare or infrequent transcript species set at 2. The maximum possible Shannon diversity index was calculated as the natural log of the number of clusters.
Figure 4Distribution of clusters and reads by KOG functional class and library. Distributions of clusters over KOG class for clusters shared between the 1N and 2N libraries and clusters unique to each library. Fisher's exact test was used to determine significant differences in the distribution of clusters by KOG class between the 1N-unique and 2N-unique sets (asterisks indicate the KOG classes exhibiting significant differences between the 1N-unique and 2N-unique sets); P < 0.002 without correction for multiple tests). The same test was applied to determine differences in the distribution of clusters by KOG class between the set of shared clusters and both 1N-unique and 2N-unique clusters (the at symbol (@) indicates KOG classes exhibiting significant differences between the 1N-unique and shared sets; P < 0.002 without correction for multiple tests).
KOG-assigned EST clusters predicted to be highly 1N-specific based on statistical comparison of libraries
| Cluster ID | Number of 1N ESTs | Homolog ID | Homolog description | BLAST | |
|---|---|---|---|---|---|
| Amino acid transport and metabolism | |||||
| GS01965 | 6 | 7.8 × 10-3 | CDO_CAEBR | Cysteine dioxygenase | 8 × 10-19 |
| GS00820 | 7 | 3.9 × 10-3 | *Q8GYS4_ARATH | Putative uncharacterized protein | 5 × 10-11 |
| Carbohydrate transport and metabolism | |||||
| GS01922 | 6 | 7.8 × 10-3 | AAPC_CENCI | Putative apospory-associated protein C | 2 × 10-25 |
| Cell cycle control, cell division, chromosome partitioning | |||||
| | 6 | 7.8 × 10-3 | †Cyclin_N | Cyclin, N-terminal domain | 1 × 10-09 |
| Chromatin structure and dynamics | |||||
| GS09138 | 13 | 6.1 × 10-5 | H4_OLILU | Histone H4 | 1 × 10-38 |
| Cytoskeleton | |||||
| GS00708 | 6 | 7.8 × 10-3 | DYI3_ANTCR | Dynein intermediate chain 3, ciliary | 6 × 10-62 |
| Function unknown | |||||
| GS00091 | 6 | 7.8 × 10-3 | EMAL4_MOUSE | Echinoderm microtubule-associated protein-like 4 | 4 × 10-36 |
| GS02362 | 7 | 3.9 × 10-3 | *A8Q1G0_MALGO | Putative uncharacterized protein | 8 × 10-16 |
| GS00939 | 6 | 7.8 × 10-3 | * B8BBW9_ORYSI | Putative uncharacterized protein | 9 × 10-08 |
| General function prediction only | |||||
| | 6 | 7.8 × 10-3 | EHMT2_MOUSE | Histone-lysine N-methyltransferase | 3 × 10-13 |
| GS08284 | 8 | 2.0 × 10-3 | EI2B_AQUAE | Putative translation initiation factor eIF-2B | 4 × 10-27 |
| GS00938 | 7 | 3.9 × 10-3 | MORN3_HUMAN | MORN repeat-containing protein 3 | 4 × 10-18 |
| GS00985 | 6 | 7.8 × 10-3 | PTHD2_MOUSE | Patched domain-containing protein 2 | 2 × 10-08 |
| Inorganic ion transport and metabolism | |||||
| GS01939 | 6 | 7.8 × 10-3 | AMT12_ARATH | Ammonium transporter 1 member 2 | 2 × 10-25 |
| GS02431 | 8 | 2.0 × 10-3 | RABL5_DANRE | Rab-like protein 5 | 3 × 10-28 |
| GS01141 | 6 | 7.8 × 10-3 | TM9S2_RAT | Transmembrane 9 superfamily member 2 | 7 × 10-84 |
| GS00197 | 6 | 7.8 × 10-3 | ARF1_SALBA | ADP-ribosylation factor 1 | 1 × 10-70 |
| Nucleotide transport and metabolism | |||||
| GS00406 | 7 | 3.9 × 10-3 | NDK7_HUMAN | Nucleoside diphosphate kinase 7 | 2 × 10-32 |
| Posttranslational modification, protein turnover, chaperones | |||||
| GS00465 | 6 | 7.8 × 10-3 | TRAP1_DICDI | TNF receptor-associated protein 1 homolog, mitochondrial precursor | 1 × 10-98 |
| GS04078 | 6 | 7.8 × 10-3 | BIRC7_HUMAN | Baculoviral IAP repeat-containing protein 7 | 2 × 10-06 |
| GS01693 | 6 | 7.8 × 10-3 | IQCAL_HUMAN | IQ and AAA domain-containing protein ENSP00000340148 | 3 × 10-41 |
| GS00324 | 8 | 2.0 × 10-3 | TTLL4_HUMAN | Tubulin polyglutamylase | 1 × 10-42 |
| GS06285 | 7 | 3.9 × 10-3 | IAP3_NPVOP | Apoptosis inhibitor 3 | 1 × 10-05 |
| GS03771 | 6 | 7.8 × 10-3 | 14335_ORYSJ | 14-3-3-like protein GF14-E | 1 × 10-34 |
| GS01424 | 6 | 7.8 × 10-3 | PCSK7_RAT | Proprotein convertase subtilisin/kexin type 7 precursor | 2 × 10-08 |
| GS01530 | 6 | 7.8 × 10-3 | YDM9_SCHPO | Uncharacterized RING finger protein C57A7.09 precursor | 3 × 10-07 |
| GS00537 | 7 | 3.9 × 10-3 | XRP2_XENLA | Protein XRP2 | 5 × 10-20 |
| Signal transduction mechanisms | |||||
| GS01456 | 8 | 2.0 × 10-3 | CML12_ARATH | Calmodulin-like protein 12 | 3 × 10-11 |
| GS03471 | 6 | 7.8 × 10-3 | DNAL1_CHLRE | Flagellar outer arm dynein light chain 1 | 1 × 10-52 |
| | 14 | 3.1 × 10-5 | KAPR2_DROME | amp-dependent protein kinase type II regulatory subunit | 2 × 10-08 |
| GS04612 | 6 | 7.8 × 10-3 | RHOM_DROME | Protein rhomboid | 3 × 10-08 |
| GS02444 | 11 | 2.4 × 10-4 | ANR11_HUMAN | Ankyrin repeat domain-containing protein 11 | 3 × 10-09 |
| GS02191 | 6 | 7.8 × 10-3 | LRC50_HUMAN | Leucine-rich repeat-containing protein 50 | 1 × 10-54 |
| | 7 | 3.9 × 10-3 | KCC1A_RAT | Calcium/calmodulin-dependent protein kinase type 1 | 1 × 10-51 |
| | 6 | 7.8 × 10-3 | TNI3K_RAT | Serine/threonine-protein kinase | 3 × 10-14 |
| GS01544 | 7 | 3.9 × 10-3 | |||
| GS03554 | 7 | 3.9 × 10-3 | †PH | Plecstrin homology domain | 3 × 10-09 |
| Transcription | |||||
| GS00117 | 8 | 2.0 × 10-3 | MYB_DROME | Myb protein | 9 × 10-06 |
| | 8 | 2.0 × 10-3 | MYB_CHICK | Myb proto-oncogene protein (C-myb) | 3 × 10-34 |
| GS01762 | 6 | 7.8 × 10-3 | MYBB_CHICK | Myb-related protein B | 5 × 10-06 |
Only clusters with zero ESTs originating from the 2N library are shown. The number of 1N EST reads in each cluster and the P-value for significance of the difference between libraries are shown. When no Swiss-Prot homolog was detected, ID and homology values for the top Uniprot homolog are given (indicated by an asterisk), or the CDD name and homology values are given (indicated by †). Clusters are arranged by KOG class. Clusters in bold were chosen for RT-PCR validation. Additional data file 4 gives a complete list of all clusters predicted to be 1N-specific by statistical comparison of libraries.
EST clusters without KOG assignment predicted to be highly 1N-specific based on statistical comparison of libraries
| Cluster ID | Number of 1N ESTs | Homolog ID | Homolog description | BLAST | |
|---|---|---|---|---|---|
| 7 | 3.9 × 10-3 | DYHC_ANTCR | Dynein beta chain, ciliary | 2 × 10-52 | |
| GS01639 | 6 | 7.8 × 10-3 | BSN1_BACAM | Extracellular ribonuclease precursor | 2 × 10-10 |
| GS02259 | 7 | 3.9 × 10-3 | GAS8_CHLRE | Growth arrest-specific protein 8 homolog (Protein PF2) | 2 × 10-82 |
| GS00095 | 6 | 7.8 × 10-3 | DYHB_CHLRE | Dynein beta chain, flagellar outer arm | 1 × 10-35 |
| GS03902 | 6 | 7.8 × 10-3 | *Q94EY1_CHLRE | Predicted protein | 8 × 10-14 |
| GS00471 | 6 | 7.8 × 10-3 | *A9BCA5_PROM4 | Putative uncharacterized protein | 2 × 10-80 |
| GS00126 | 7 | 3.9 × 10-3 | STCE_ECO57 | Metalloprotease stcE precursor | 5 × 10-31 |
| 8 | 2.0 × 10-3 | SPT17_HUMAN | Spermatogenesis-associated protein 17 | 8 × 10-11 | |
| 9 | 9.8 × 10-4 | DYH6_HUMAN | Axonemal beta dynein heavy chain 6 | 1 × 10-129 | |
| GS00276 | 11 | 2.4 × 10-4 | PLMN_MACEU | Plasminogen precursor | 2 × 10-15 |
| GS00140 | 10 | 4.9 × 10-4 | Y326_METJA | Uncharacterized protein MJ0326 | 1 × 10-64 |
| GS01207 | 8 | 2.0 × 10-3 | CF206_MOUSE | Uncharacterized protein C6orf206 homolog | 2 × 10-26 |
| GS01392 | 9 | 9.8 × 10-4 | DYH3_MOUSE | Axonemal beta dynein heavy chain 3 | 5 × 10-89 |
| GS02146 | 9 | 9.8 × 10-4 | CCD37_MOUSE | Coiled-coil domain-containing protein 37 | 3 × 10-21 |
| GS00154 | 6 | 7.8 × 10-3 | IQCG_MOUSE | IQ domain-containing protein G | 3 × 10-22 |
| GS02689 | 6 | 7.8 × 10-3 | RNF32_MOUSE | RING finger protein 32 | 2 × 10-11 |
| GS00461 | 10 | 4.9 × 10-4 | NAT_MYCSM | Arylamine N-acetyltransferase | 2 × 10-21 |
| GS03363 | 6 | 7.8E-03 | *A1UWW2_BURMS | RemN protein | 6 × 10-06 |
| GS00524 | 8 | 2.0E-03 | *Q0 MYX1_EMIHU | Putative uncharacterized protein | 3 × 10-55 |
| GS00907 | 7 | 3.9E-03 | *Q0 MYV7_EMIHU | Putative uncharacterized protein | 7 × 10-07 |
| 6 | 7.8E-03 | *Q9ZTY0_EMIHU | Putative calcium binding protein | 2 × 10-07 | |
| GS02739 | 8 | 2.0E-03 | *Q2 MCN4_HYDAT | HyTSR1 protein | 5 × 10-07 |
| GS01630 | 7 | 3.9E-03 | *A0L4Q4_MAGSM | Cadherin | 4 × 10-12 |
| GS02194 | 8 | 2.0E-03 | *C1 MZQ6_9CHLO | Predicted protein | 9 × 10-07 |
| GS00043 | 7 | 3.9E-03 | *C1NAB5_9CHLO | Predicted protein | 6 × 10-14 |
| GS02204 | 6 | 7.8E-03 | *C1EGP6_9CHLO | Predicted protein | 2 × 10-09 |
| GS02009 | 6 | 7.8E-03 | *A9UNX1_MONBE | Predicted protein | 4 × 10-24 |
| GS03800 | 8 | 2.0E-03 | *Q0JCM6_ORYSJ | Os04 g0461600 protein | 3 × 10-09 |
| GS00472 | 6 | 7.8E-03 | *Q00Y28_OSTTA | Chromosome 12 contig 1, DNA sequence | 2 × 10-13 |
| GS00972 | 6 | 7.8E-03 | *A0DFH5_PARTE | Chromosome undetermined scaffold_49, whole genome shotgun sequence | 7 × 10-13 |
| GS00363 | 8 | 2.0E-03 | *A9RPM7_PHYPA | Predicted protein | 2 × 10-07 |
| 12 | 1.2E-04 | *Q0E9S1_PLEHA | Putative beta-type carbonic anhydrase | 9 × 10-70 | |
| GS00753 | 6 | 7.8E-03 | *Q0E9R5_PLEHA | Putative uncharacterized protein | 2 × 10-30 |
| 15 | 1.5E-05 | *Q2NSA6_SODGM | Hypothetical phage protein | 5 × 10-06 | |
| GS00195 | 7 | 3.9E-03 | *C4EA11_STRRS | Putative uncharacterized protein | 4 × 10-12 |
| GS01216 | 8 | 2.0E-03 | *B4WU30_9SYNE | Putative uncharacterized protein | 1 × 10-06 |
| GS00006 | 8 | 2.0E-03 | *B8BYB9_THAPS | Predicted protein | 4 × 10-12 |
| GS00629 | 6 | 7.8E-03 | *B8LBM2_THAPS | Predicted protein | 1 × 10-32 |
| GS03100 | 8 | 2.0E-03 | *A5AXV4_VITVI | Putative uncharacterized protein | 7 × 10-07 |
| Orphan genes tested | |||||
| | 25 | 1.5 × 10-8 | |||
| | 16 | 7.7 × 10-6 |
Only clusters with zero ESTs originating from the 2N library are shown, and only the orphans confirmed by RT-PCR are included in this table. Homolog IDs are marked as in Table 5. Additional data file 4 gives a complete list of all clusters predicted to be 1N-specific by statistical comparison of libraries.
KOG-assigned EST clusters predicted to be highly 2N-specific based on statistical comparison of libraries
| Cluster ID | Number of 2N ESTs | Homolog ID | Homolog description | BLAST | |
|---|---|---|---|---|---|
| Carbohydrate transport and metabolism | |||||
| | 7 | 3.9 × 10-3 | PIP25_ARATH | Probable aquaporin PIP2-5 | 1 × 10-34 |
| GS00433 | 8 | 1.9 × 10-3 | F26_RANCA | 6PF-2-K/Fru-2,6-P2ASE liver/muscle isozymes | 3 × 10-40 |
| Cell wall/membrane/envelope biogenesis | |||||
| GS01290 | 8 | 1.9 × 10-3 | ASB3_BOVIN | Ankyrin repeat and SOCS box protein 3 (ASB-3) | 9 × 10-06 |
| Chromatin structure and dynamics | |||||
| | 6 | 7.8 × 10-3 | H4_OLILU | Histone H4 | 8 × 10-33 |
| Cytoskeleton | |||||
| GS00171 | 6 | 7.8 × 10-3 | EXS_ARATH | Leucine-rich repeat receptor protein kinase EXS precursor | 1 × 10-08 |
| Energy production and conversion | |||||
| GS00763 | 6 | 7.8 × 10-3 | QORH_ARATH | Putative chloroplastic quinone-oxidoreductase homolog | 6 × 10-25 |
| GS01632 | 7 | 3.9 × 10-3 | CYPD_BACSU | Probable bifunctional P-450/NADPH-P450 reductase 1 | 2 × 10-43 |
| General function prediction only | |||||
| GS00580 | 9 | 9.7 × 10-4 | YMO3_ERWST | Uncharacterized protein in mobD 3' region | 6 × 10-07 |
| GS02524 | 7 | 3.9 × 10-3 | †RKIP | Raf kinase inhibitor protein (RKIP), Phosphatidylethanolamine-binding protein (PEBP) | 1 × 10-06 |
| Inorganic ion transport and metabolism | |||||
| | 8 | 1.9 × 10-3 | NCKXH_DROME | Probable Na+/K+/Ca2+ exchanger CG1090 | 1 × 10-22 |
| | 7 | 3.9 × 10-3 | B3A2_RAT | Anion exchange protein 2 (AE2 anion exchanger) | 8 × 10-14 |
| Intracellular trafficking, secretion, and vesicular transport | |||||
| | 9 | 9.7 × 10-4 | STX1A_CAEEL | Syntaxin-1A homolog | 2 × 10-19 |
| Lipid transport and metabolism | |||||
| GS00955 | 7 | 3.9 × 10-3 | S5A1_MACFA | 3-oxo-5-alpha-steroid 4-dehydrogenase 1 | 3 × 10-54 |
| Posttranslational modification, protein turnover, chaperones | |||||
| GS06447 | 6 | 7.8 × 10-3 | CLPP3_ANASP | Probable ATP-dependent Clp protease proteolytic subunit 3 | 2 × 10-31 |
| GS02029 | 8 | 1.9 × 10-3 | UBCY_ARATH | Ubiquitin-conjugating enzyme E2-18 kDa | 4 × 10-20 |
| GS03925 | 8 | 1.9 × 10-3 | FKBP4_DICDI | FK506-binding protein 4 (peptidyl-prolyl cis-trans isomerase) | 1 × 10-07 |
| Replication, recombination and repair | |||||
| GS00109 | 8 | 1.9 × 10-3 | MCM2_XENTR | DNA replication licensing factor mcm2 | 1 × 10-109 |
| Secondary metabolites biosynthesis, transport and catabolism | |||||
| GS00417 | 6 | 7.8 × 10-3 | WBC11_ARATH | White-brown complex homolog protein 11 | 9 × 10-28 |
| Signal transduction mechanisms | |||||
| GS00826 | 6 | 7.8 × 10-3 | STK4_BOVIN | Serine/threonine-protein kinase 4 | 2 × 10-47 |
| GS00712 | 7 | 3.9 × 10-3 | PI4K_DICDI | Phosphatidylinositol 4-kinase | 3 × 10-43 |
| GS00083 | 7 | 3.9 × 10-3 | SHKE_DICDI | Dual specificity protein kinase shkE | 9 × 10-22 |
| GS01230 | 7 | 3.9 × 10-3 | †PP2Cc | Serine/threonine phosphatases, family 2C, catalytic domain | 2 × 10-08 |
Only clusters with zero ESTs originating from the 1N library are shown. The number of 2N EST reads in each cluster and the P-value for significance of the difference between libraries are shown. Homolog IDs are marked as in Table 5. Clusters are arranged by KOG class. Clusters in bold were chosen for RT-PCR validation. Additional data file 5 gives a complete list of all clusters predicted to be 2N-specific by statistical comparison of libraries.
EST clusters without KOG assignment predicted to be highly 2N-specific based on statistical comparison of libraries
| Cluster ID | Number of 2N ESTs | Homolog ID | Homolog description | BLAST | |
|---|---|---|---|---|---|
| GS00092 | 6 | 7 × 10-17 | *B1X317_CYAA5 | Putative uncharacterized protein | 7 × 10-17 |
| 14 | 2 × 10-06 | *Q0 MYU5_EMIHU | Putative arachidonate 15-lipoxygenase second type | 2 × 10-06 | |
| GS05210 | 7 | 1 × 10-25 | *C1AEM4_GEMAT | Putative glutamine cyclotransferase | 1 × 10-25 |
| GS01732 | 8 | 1 × 10-19 | *A7WPV6_KARMI | Putative uncharacterized protein | 1 × 10-19 |
| GS02223 | 7 | 1 × 10-31 | *C1 MGG4_9CHLO | Predicted protein | 1 × 10-31 |
| GS05779 | 6 | 2 × 10-17 | *C1E2K5_9CHLO | Predicted protein | 2 × 10-17 |
| GS06362 | 6 | 2 × 10-15 | *A9V2G5_MONBE | Predicted protein | 2 × 10-15 |
| GS00766 | 7 | 3 × 10-07 | *B7G9 M0_PHATR | Predicted protein | 3 × 10-07 |
| GS03302 | 7 | 1 × 10-32 | *B7G0S2_PHATR | Predicted protein | 1 × 10-32 |
| GS03476 | 9 | 2 × 10-09 | *B7FQM3_PHATR | Predicted protein | 2 × 10-09 |
| GS00513 | 8 | 6 × 10-08 | *Q7V952_PROMM | Putative uncharacterized protein | 6 × 10-08 |
| GS01720 | 9 | 8 × 10-06 | *B2ZYD9_9CAUD | Nucleoside-diphosphate-sugar pyrophosphorylase-like protein | 8 × 10-06 |
| GS05985 | 7 | 6 × 10-06 | *B0J8I4_RHILT | Putative uncharacterized protein | 6 × 10-06 |
| GS01421 | 6 | 7 × 10-11 | *B9S8J5_RICCO | Putative uncharacterized protein | 7 × 10-11 |
| GS05596 | 8 | 2 × 10-11 | *B8 MI73_TALSN | Putative uncharacterized protein | 2 × 10-11 |
| GS00659 | 6 | 5 × 10-22 | *A4VDD7_TETTH | Putative uncharacterized protein | 5 × 10-22 |
| 16 | 7.6 × 10-6 | ||||
| 12 | 1.2 × 10-4 | ||||
| 10 | 4.9 × 10-4 | ||||
| 10 | 4.9 × 10-4 |
Only clusters with zero ESTs originating from the 1N library are shown, and only the orphans confirmed by RT-PCR are included in this table. Homolog IDs are marked as in Table 5. Clusters in bold were chosen for RT-PCR validation (cluster GS11002 is shown in bold italics, the only cluster tested in which abundant RT-PCR product could also be detected from 1N cells). Additional data file 5 gives a complete list of all clusters predicted to be 2N-specific by statistical comparison of libraries.
Figure 5The taxonomic distribution of homology. Shown are the percentages of clusters with KEGG homologs that have the 'best hit' in each taxonomic group. Indicated are cases where the proportion of clusters best hitting to the taxonomic group differs between 1N-unique and 2N-unique (asterisks) or between 1N-unique and shared clusters (at symbol (@)), tested as above. The inset shows the proportion of all assigned clusters that are accounted for by best-hits to Chlamydomonas reinhardtii (a subset of those which are best-hits to Viridiplantae). The differences between 1N-unique and 2N-unique, and between 1N-unique and shared clusters were significant (P < 0.002).
Figure 6The proportion of orphan clusters. Non-orphan clusters that do not have hits in the KOG database are also represented (Others). (a) All clusters. (b) Shared clusters composed of reads in both 1N and 2N libraries. (c) Potentially 1N-specific clusters composed of two or more reads in the 1N library but zero in the 2N library. (d) Potentially 2N-specific clusters composed of two or more reads in the 2N library but zero in the 1N library.
Distribution of EST reads and clusters related to proteins highly specific to cilia/flagella or basal bodies
| Number of 1N clusters | Number of 2N clusters | Number of 1N ESTs | Number of 2N ESTs | |
|---|---|---|---|---|
| Dynein heavy chain alpha (ODA11) | 2 | 0 | 8 | 0 |
| Dynein heavy chain beta (ODA4) | 3 | 0 | 12 | 0 |
| Outer dynein arm intermediate chain 1 (ODA9) | 1 | 0 | 2 | 0 |
| Dynein, 70 kDa intermediate chain, flagellar outer arm (ODA6) | 2 | 0 | 7 | 0 |
| Outer dynein arm light chain 1 (DLC1) | 1 | 0 | 6 | 0 |
| Outer dynein arm light chain 2 (ODA12) | 1 | 0 | 5 | 0 |
| Outer dynein arm light chain 5, 14KD (DLC5) | 3 | 0 | 9 | 0 |
| Outer dynein arm light chain 7b (DLC7b) | 1 | 0 | 2 | 0 |
| Outer dynein arm light chain 8, 8KD (FLA14) | 2 | 0 | 3 | 0 |
| Outer dynein arm docking complex 2 (ODA-DC2) | 1 | 0 | 5 | 0 |
| Outer dynein arm docking complex 3 (ODA-DC3) | 2 | 0 | 7 | 0 |
| Inner dynein arm heavy chain 1-alpha (DHC1a) | 1 | 0 | 1 | 0 |
| Inner dynein arm heavy chain 1-beta (DHC1b/IDA2) | 3 | 0 | 6 | 0 |
| Dynein heavy chain 2 (DHC2) | 3 | 0 | 15 | 0 |
| Dynein heavy chain 8 (DHC8) | 1 | 0 | 1 | 0 |
| Dynein heavy chain 9 (DHC9) | 3/2 | 1/0 | 15/14 | 4/0 |
| Inner dynein arm I1 intermediate chain IC14 (IDA7) | 1 | 0 | 4 | 0 |
| Inner dynein arm I1 intermediate chain (IC138) | 1 | 0 | 3 | 0 |
| Inner dynein arm ligh chain p28 (IDA4) | 1 | 0 | 2 | 0 |
| Dynein light chain tctex1 (TCTEX1) | 2 | 0 | 5 | 0 |
| Dynein light chain Tctex2b | 1 | 0 | 4 | 0 |
| Radial spoke protein 1 | 1 | 0 | 1 | 0 |
| Radial spoke protein 2 (PF24) | 1/0 | 0 | 3/0 | 0/0 |
| Radial spoke protein 4 (PF1) | 1 | 0 | 3 | 0 |
| Radial spoke protein 9 | 1 | 0 | 8 | 0 |
| Radial spoke protein 10 | 2/0 | 0/0 | 6/0 | 0/0 |
| Radial spoke protein 11 | 1 | 0 | 4 | 0 |
| Radial spoke protein 14 | 1 | 0 | 1 | 0 |
| Radial spoke protein 16 | 1/0 | 0/0 | 1/0 | 0'0 |
| Radial spoke protein 23 | 1/0 | 0/0 | 8/0 | 5/0 |
| Central pair protein (PF16) | 2 | 0 | 5 | 0 |
| Central pair associated WD-repeat protein | 1 | 0 | 4 | 0 |
| Central pair protein (PF6) | 1 | 0 | 1 | 0 |
| Dynein 1b light intermediate chain (D1bLIC) | 1 | 0 | 1 | 0 |
| Intraflagellar transport protein 20 (IFT2) | 1 | 0 | 2 | 0 |
| Intraflagellar transport protein 57 (IFT57), alternative version | 1 | 0 | 4 | 0 |
| Intraflagellar transport protein 72 and 74 (IFT72/74) | 1 | 0 | 1 | 0 |
| Intraflagellar transport protein 80 (CHE2) | 2 | 0 | 6 | 0 |
| Intraflagellar transport protein 81 (IFT81) | 1 | 0 | 4 | 0 |
| Intraflagellar transport protein 121 (IFT121) | 1 | 0 | 2 | 0 |
| Intraflagellar transport protein 139 (IFT139) | 1 | 0 | 1 | 0 |
| Intraflagellar transport protein 140 (IFT140) | 2/1 | 0/0 | 2/1 | 0/0 |
| Intraflagellar transport protein 172 (IFT172) | 1 | 0 | 1 | 0 |
| Dynein regulatory complex protein (PF2) | 1 | 0 | 7 | 0 |
| Tektin | 1 | 0 | 3 | 0 |
| Conserved uncharacterized flagellar associated protein FAP189 | 2 | 0 | 9 | 0 |
| Conserved uncharacterized flagellar associated protein FAP58 | 1 | 0 | 3 | 0 |
| Flagellar protofilament ribbon protein (RIB43a) | 1 | 0 | 6 | 0 |
| Nucleoside-diphosphokinase regulatory subunit p72 (RIB72) | 1 | 0 | 1 | 0 |
| Subunit of axonemal inner dynein arn (A9ZPM1_CHLRE) | 1 | 0 | 1 | 0 |
| Flagellar associated protein (A8J1V4_CHLRE) | 1 | 0 | 4 | 0 |
| Flagellar associated protein (A8JDM7_CHLRE) | 1 | 0 | 1 | 0 |
| Flagellar associated protein (A8J0N6_CHLRE) | 1 | 0 | 4 | 0 |
| Flagellar associated protein (A8J7D6_CHLRE) | 1 | 0 | 4 | 0 |
| Flagellar associated protein (A8JB22_CHLRE) | 1 | 0 | 4 | 0 |
| Flagellar associated protein (A8HZK8_CHLRE) | 2 | 0 | 6 | 0 |
| Flagellar associated protein (A8I9E8_CHLRE) | 2 | 0 | 5 | 0 |
| Flagellar associated protein (A7S8J6_NEMVE) | 1 | 0 | 6 | 0 |
| Flagellar associated protein (A8HMZ4_CHLRE) | 1 | 0 | 1 | 0 |
| Chlamydomonas minus and plus agglutinin (AAS07042.1) | 1/0 | 0/0 | 3/0 | 0/0 |
| Flagellar/basal body protein (A8J795_CHLRE) | 1 | 0 | 2 | 0 |
| Flagellar/basal body protein (A8I6L8_CHLRE) | 1 | 0 | 1 | 0 |
| Bardet-Biedl syndrome 1 protein (A8JEA1_CHLRE) | 1 | 0 | 4 | 0 |
| Dynein heavy chain beta (ODA4) | 1 | 0 | 1 | 0 |
| ADP-ribosylation factor-like protein 6 (BBS3) (Q9HF7_HUMAN) | 1 | 0 | 3 | 0 |
| Bardet-Biedl syndrome 5 protein (BBS5_DANRE) | 1 | 0 | 2 | 0 |
| Bardet-Biedl syndrome 7 protein (BBS7_MOUSE) | 1 | 0 | 1 | 0 |
| Bardet-Biedl syndrome 9 protein (PTHB1_HUMAN) | 1 | 0 | 2 | 0 |
| 90/82 | 1/0 | 275/252 | 9/0 |
The numbers of homologous clusters containing ESTs originating from 1N and 2N libraries are shown. Also shown are numbers of component ESTs from 1N and 2N libraries. Potential 'false positive' homologs were identified (for example, clusters with stronger homology to non-flagellar proteins). In such cases, the numerator represents the number of clusters or ESTs including potential false positive clusters, and the denominator when false positive clusters are excluded. Detailed analysis of all motility-related clusters is provided in Additional data files 7 and 8.
Distribution of EST reads and clusters related to cilia/flagella components that also have non-ciliary functions
| Number of 1N clusters | Number of 2N clusters | Number of 1N ESTs | Number of 2N ESTs | |
|---|---|---|---|---|
| Alpha-1 tubulin (TUA1, TUA2) | 6 | 2 | 10 | 5 |
| Beta-1 tubulin (TUB1, TUB2) | 3 | 1 | 4 | 2 |
| Actin, inner dynein arm intermediate chain (IDA5) | 7 | 6 | 11 | 13 |
| Caltractin/centrin 20 kDa calcium-binding protein (VFL2) | 8 | 4 | 13 | 22 |
| Kinesin-like protein 1 (KLP1) | 1 | 0 | 1 | 0 |
| Phophatase 1 (PP1a) | 3 | 2 | 4 | 8 |
| Kinesin-II associated protein (KAP1) | 1 | 0 | 1 | 0 |
| Cytoplasmic dynein heavy chain 1b (DHC1b) | 2 | 0 | 7 | 0 |
| Microtubule-associated protein (EB1) | 1 | 1 | 6 | 4 |
| Glycogen synthase kinase 3 (GSK3) | 2 | 2 | 5 | 12 |
| Calmodulin (CAM) | 7 | 3 | 19 | 10 |
| Deflagellation inducible protein, 13KD (DIP13) | 0 | 1 | 0 | 1 |
| Heat shock 70 kDa protein (HSP70A) | 4 | 4 | 11 | 8 |
| Phototropin, blue light receptor (PHOT) | 5/4 | 2/2 | 17/15 | 6/6 |
| Protein phosphatase 2a (PP2A-r2) | 0 | 3 | 0 | 3 |
| Flagellar associated protein (A8JAF7_CHLRE) | 1 | 1 | 6 | 4 |
| Flagellar associated protein (A8JC09_CHLRE) | 1 | 1 | 6 | 1 |
| 52/51 | 33/33 | 121/119 | 99/99 |
Table is organized as Table 9. Additional data files 7 and 8 contain a detailed analysis.
Figure 7RT-PCR confirmation of expression of selected flagellar-related genes only in 1N cells. All reactions were run with the same RT+ cDNA samples. The RT-PCR shown at the top used the elongation factor 1α (GS000217) as a positive (loading) control showing successful cDNA amplification occurred in all samples. RT- control reactions prepared from the same RNA were run for nine of the PCRs shown here and no contaminating genomic DNA (gDNA) was ever found (see examples with RT- reactions included in Figure S6 in Additional data file 1). For clarity, RT- control reactions run simultaneously have been cut out here. Positions of molecular weight markers on each side of the gel are shown. The sample identifiers are listed for each lane at the top of the gel. 11 h, harvested at 11 h (late morning); 21 h, harvested at 21 h (early evening, time of S-phase); 02 h, harvested at 02 h (after cell division); CL, cultures (1N only) exposed to continuous light.
Figure 8RT-PCR tests of expression patterns of selected genes chosen by digital subtraction. RT- control reactions prepared from the same RNA were run for six of the PCRs shown here and no contaminating genomic DNA (gDNA) was ever found. For clarity, RT- control reactions run simultaneously have been cut out here. Positions of molecular weight markers on each side of the gel are shown. The sample identifiers are listed for each lane at the top of the gel (as for Figure 7).
E. huxleyi EST clusters related to Ca2+ and H+ transporters
| Cluster ID | Number of 1N clusters | Number of 2N clusters | Top Swiss-Prot hit | E-value | |
|---|---|---|---|---|---|
| Ca2+/H+ antiporter VCX1 and related proteins | |||||
| GS00019 | 7 | 1 | 0.020 | CAX5_ARATH | 2 × 10-66 |
| | 0 | 4 | 0.031 | CAX2_ARATH | 3 × 10-60 |
| GS00617 | 3 | 0 | 0.063 | VCX1_YEAST | 3 × 10-58 |
| GS00976 | 2 | 1 | 0.313 | VCX1_YEAST | 3 × 10-31 |
| GS06500 | 0 | 1 | 0.250 | CAX3_ORYSJ | 4 × 10-30 |
| Ca2+ transporting ATPase | |||||
| GS07761 | 1 | 0 | 0.250 | AT2A2_CHICK | 4 × 10-63 |
| GS01511 | 4 | 5 | 0.377 | ECA4_ARATH | 7 × 10-31 |
| GS05702 | 0 | 1 | 0.250 | ECA4_ARATH | 3 × 10-12 |
| K+-dependent Ca2+/Na+ exchanger NCKX1 and related proteins | |||||
| GS05506 | 0 | 2 | 0.125 | NCKX2_RAT | 5 × 10-24 |
| | 0 | 8 | 0.002 | NCKXH_DROME | 1 × 10-22 |
| GS04866 | 2 | 0 | 0.125 | NCKX_DROME | 2 × 10-22 |
| GS02609 | 1 | 0 | 0.250 | NCKX3_HUMAN | 8 × 10-20 |
| GS00834 | 4 | 3 | 0.364 | NCKXH_DROME | 2 × 10-18 |
| GS03656 | 4 | 1 | 0.110 | NCKX3_MOUSE | 6 × 10-7 |
| Vacuolar H+-ATPase V0 sector, subunit a | |||||
| GS01798 | 2 | 0 | 0.125 | VPP4_HUMAN | 2 × 10-38 |
| GS02526 | 1 | 4 | 0.109 | VPP4_HUMAN | 9 × 10-40 |
| GS12017 | 0 | 1 | 0.250 | No hit | |
| GS04358 | 1 | 0 | 0.250 | VATM_DICDI | 2 × 10-30 |
| GS08326 | 0 | 1 | 0.250 | No hit | |
| Vacuolar H+-ATPase V0 sector, subunit c" | |||||
| GS01501 | 4 | 0 | 0.031 | VATO_YEAST | 4 × 10-47 |
| Vacuolar H+-ATPase V0 sector, subunit d | |||||
| GS00290 | 7 | 5 | 0.291 | VA0D_DICDI | 1 × 10-126 |
| Vacuolar H+-ATPase V0 sector, subunit M9.7 (M9.2) | |||||
| GS11177 | 0 | 2 | 0.125 | ||
| Vacuolar H+-ATPase V0 sector, subunits c/c' | |||||
| GS03783 | 1 | 0 | 0.250 | VATL_PLECA | 7 × 10-38 |
| GS01934 | 3 | 5 | 0.254 | VATL_PLECA | 4 × 10-38 |
| Vacuolar H+-ATPase V1 sector, subunit A | |||||
| GS01727 | 2 | 5 | 0.144 | VATA_CYACA | 3 × 10-86 |
| Vacuolar H+-ATPase V1 sector, subunit B | |||||
| GS08492 | 0 | 1 | 0.250 | VATB_ARATH | 1 × 10-62 |
| Vacuolar H+-ATPase V1 sector, subunit C | |||||
| GS00316 | 6 | 4 | 0.275 | VATC1_XENTR | 6 × 10-41 |
| Vacuolar H+-ATPase V1 sector, subunit E | |||||
| GS00924 | 1 | 1 | 0.500 | VATE_MESCR | 1 × 10-21 |
| Vacuolar H+-ATPase V1 sector, subunit F | |||||
| GS09780 | 0 | 4 | 0.031 | VATF_ARATH | 3 × 10-32 |
| Vacuolar H+-ATPase V1 sector, subunit H | |||||
| GS01820 | 1 | 5 | 0.062 | VATH_MANSE | 4 × 10-36 |
Clusters are arranged by KOG hit classification. Clusters in bold were tested by RT-PCR.
Figure 9RT-PCR determination of expression patterns of selected genes potentially related to biomineralization. RT- control reactions prepared from the same RNA were run for all of the PCRs shown here and no contaminating genomic DNA (gDNA) was ever found. For clarity, these RT- control reactions run simultaneously have been cut out here. Positions of molecular weight markers on each side of the gel are shown. The sample identifiers are listed for each lane at the top of the gel (as for Figure 7).
Figure 10RT-PCR determination of expression patterns of selected histone genes. Positions of molecular weight markers on each side of the gel are shown. The sample identifiers are listed for each lane at the top of the gel (as for Figure 7).