| Literature DB >> 19036158 |
Robert S Coyne1, Mathangi Thiagarajan, Kristie M Jones, Jennifer R Wortman, Luke J Tallon, Brian J Haas, Donna M Cassidy-Hanley, Emily A Wiley, Joshua J Smith, Kathleen Collins, Suzanne R Lee, Mary T Couvillion, Yifan Liu, Jyoti Garg, Ronald E Pearlman, Eileen P Hamilton, Eduardo Orias, Jonathan A Eisen, Barbara A Methé.
Abstract
BACKGROUND: Tetrahymena thermophila, a widely studied model for cellular and molecular biology, is a binucleated single-celled organism with a germline micronucleus (MIC) and somatic macronucleus (MAC). The recent draft MAC genome assembly revealed low sequence repetitiveness, a result of the epigenetic removal of invasive DNA elements found only in the MIC genome. Such low repetitiveness makes complete closure of the MAC genome a feasible goal, which to achieve would require standard closure methods as well as removal of minor MIC contamination of the MAC genome assembly. Highly accurate preliminary annotation of Tetrahymena's coding potential was hindered by the lack of both comparative genomic sequence information from close relatives and significant amounts of cDNA evidence, thus limiting the value of the genomic information and also leaving unanswered certain questions, such as the frequency of alternative splicing.Entities:
Mesh:
Year: 2008 PMID: 19036158 PMCID: PMC2612030 DOI: 10.1186/1471-2164-9-562
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of cDNA libraries and EST data.
| Vegetative growth, rich medium | RCH | pcDNA3.1 | TTB, TTC, TTD | 10229 | 5270 | 4959 | 661 |
| Vegetative growth, rich medium, plus Cu/Cd | HVM | pBluescriptIISK+ | TTS | 3160 | 3160 | 0 | 712 |
| Vegetative growth, rich medium, plus TSA | TSA | pDNR-LIB | FTS | 2651 | 2651 | 0 | 1810 |
| Vegetative growth, minimal medium | MIN | pDNR-LIB | FMM | 1809 | 1809 | 0 | 1157 |
| Starvation | STV | pBluescriptIISK+ | TT1 | 23321 | 12820 | 10501 | 4647 |
| Conjugation | CNJ | pBluescriptIISK+, pDNR-LIB | TTE, FCO | 18837 | 17813 | 1024 | 6568 |
| 60007 | 43523 | 16484 | 9709 | ||||
Figure 1Results of MIC/MAC comparative genomic hybridization. A: Distribution of MIC scaffold ratios. Red line: proposed separation of MAC-destined (maD) DNA scaffolds (on the left) and MIC-limited (miL) scaffolds (on the right). B: Scatter plot of MIC scaffold ratios as a function of scaffold length. Pink and aqua points: maD and miL DNA, respectively, by the log2 ratio criterion in Figure 1A. Black diamonds and small black circles, respectively: miL and maD scaffolds with high sequence identity to miL transposon genes. The maD distribution is more diffuse as the length decreases to the minimum scaffold length (1,000 bp). This is attributed to the fact that the number of probes is roughly proportional to scaffold length. Given a uniform intrinsic variability in hybridization ratios for each probe, the variance of the scaffold means is expected to vary inversely with scaffold length. The secondary peak in the maD distribution (around log2 ratio = -0.45) in 1A and the multimodality of the maD distribution in 1B (most clearly seen for scaffolds > 50 kb) are caused by the partial loss of MIC chromosome segments in the cells used for the MIC DNA preps (Orias and Hamilton, unpublished observations).
Figure 2Distribution of EST gene hits. The x-axis is divided into bins by the total number of validated ESTs (from all libraries) hitting a given gene. The y-axis depicts the percent of ESTs from each of the six conditions that fall into the indicated x-axis bin. For example, the bin containing genes matched by between 2 and 10 ESTs contains 8,426 matches from the conjugation condition (TTE and FCO libraries). The total of all CNJ ESTs is 18,837 (see Table 1). The percent of total CNJ ESTs in this bin is therefore 8,426/18,837 = 44.7%. Abbreviations as in Table 1.
Reannotated T. thermophila selenoprotein homologs.
| Glutathione Peroxidase | TTHERM_00141160 | AUUUUCAAAUAUUGAAAACUAAAAUGUUAAAUGAAAGAUUAUUUUUAAAUUUGUAAAAAAGAAAUAAUUUUGAAAAAAAUAUUAUUUUAGUUAGU | 20 STV, 1 CNJ, 1 HVM, 1 MIN |
| Glutathione Peroxidase | TTHERM_00279820 | GAUAAAAGAGAUAUCAUUCAAUGAUAGCUUUAUAAUUAAAUCUUUAAUAGAAGUUAUAAGGUUUGAAGCUAAUGAGCUCUAUUAUC | 2 STV, 2 CNJ, 1 TSA |
| Thioredoxin Reductase | TTHERM_00486500 | AAUUUAUAUAUCUUAAAGAUGUAUAGUAUAAUGAUAGCAAAUCUCGAAAAUCUUAGGAUUGGGAUUAGGCUUGAAUAGUCAGAGUAAUAAGAGUAUUUAUUA | 0 |
| Thioredoxin Reductase | TTHERM_00823430 | UCUAAUAUGGAAAAUGACGAAUUUAGUCUAAACUGUAAAACAGGGAUUAAAUUCUGAA | 6 STV, 1 TSA |
| Thioredoxin Reductase | TTHERM_00047660 | AUACCUUCAACUGGUAGGAAUAUAAUGAUUAGAGAACUCCUAACCUCACUGAGGAGGGUUUUCUAUGAGGCAAGAUUAUUGAUUUUGUUGUAG | 164 RCH, 11 STV, 5 CNJ, 1 HVM, 1 TSA, 1 MIN |
| Selenophosphate Synthetase 2 | TTHERM_00522580 | AAGUAUCAUUUCUAAAAUGAUGCAAAUUAUUACCUGAAACUCUAAAAGAGAAGGAAUUUGCUGAGAAAAAAAAUGAAAAUGAUAACUU | 7 STV, 3 CNJ, 1 HVM, 1 TSA |
| Glutathione Peroxidase | TTHERM_01099010 | N.A. | 0 |
| Thioredoxin Reductase | TTHERM_00723630 | N.A. | 5 STV, 2 TSA |
| SelO | TTHERM_00852990 | N.A. | 4 STV |
The first six are putative selenoproteins. EST library condition abbreviations are as in Table 1
Figure 3Venn diagrams of the overlap in EST representation for all genes detected in (A) the four vegetative growth conditions and (B) the combined vegetative pool vs. starvation or conjugation.