| Literature DB >> 16186131 |
Richard Bishop1, Trushar Shah, Roger Pelle, David Hoyle, Terry Pearson, Lee Haines, Andrew Brass, Helen Hulme, Simon P Graham, Evans L N Taracha, Simon Kanga, Charles Lu, Brian Hass, Jennifer Wortman, Owen White, Malcolm J Gardner, Vishvanath Nene, Etienne P de Villiers.
Abstract
Massively parallel signature sequencing (MPSS) was used to analyze the transcriptome of the intracellular protozoan Theileria parva. In total 1,095,000, 20 bp sequences representing 4371 different signatures were generated from T.parva schizonts. Reproducible signatures were identified within 73% of potentially detectable predicted genes and 83% had signatures in at least one MPSS cycle. A predicted leader peptide was detected on 405 expressed genes. The quantitative range of signatures was 4-52,256 transcripts per million (t.p.m.). Rare transcripts (<50 t.p.m.) were detected from 36% of genes. Sequence signatures approximated a lognormal distribution, as in microarray. Transcripts were widely distributed throughout the genome, although only 47% of 138 telomere-associated open reading frames exhibited signatures. Antisense signatures comprised 13.8% of the total, comparable with Plasmodium. Eighty five predicted genes with antisense signatures lacked a sense signature. Antisense transcripts were independently amplified from schizont cDNA and verified by sequencing. The MPSS transcripts per million for seven genes encoding schizont antigens recognized by bovine CD8 T cells varied 1000-fold. There was concordance between transcription and protein expression for heat shock proteins that were very highly expressed according to MPSS and proteomics. The data suggests a low level of baseline transcription from the majority of protein-coding genes.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16186131 PMCID: PMC1236717 DOI: 10.1093/nar/gki818
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Distribution of sense transcripts within T.parva chromosomes 1–4. MPSS signatures >4 t.p.m. that were consistently positive over four cycles of MPSS are illustrated diagrammatically for each gene model. The bars indicate the log transcripts per million for each annotated gene as follows: green (4–99 t.p.m.), yellow (100–999 t.p.m.) and red (>1000 t.p.m.). Gene models lacking DpnII sites are indicated in black. Chromosomes 1–4 are represented in descending order on the page.
Figure 2Distribution of antisense transcripts within T.parva chromosomes 1–4. MPSS signatures >4 t.p.m. consistently positive over four cycles, derived from the antisense strand of predicted genes are illustrated diagrammatically for each gene model. The bars indicate the log transcripts per million for each annotated gene as follows; green (4–99 t.p.m.) and yellow (100–999 t.p.m.). The distribution of gene models lacking DpnII sites is indicated in black. Chromosomes 1–4 are shown in descending order.
Figure 3Distribution of signature abundances for T.parva signatures after logarithmic transformation of the data. Data are shown only for the subset of replicated and significant signatures annotated to a single locus in the T.parva genome, resulting in the truncation at the left-hand tail. Frequencies on the y-axis correspond to probability density. The solid black line shows a simple histogram of logged signature abundances whilst the solid red line shows a more reliable fixed width tri-cube kernel smoothed estimate of the probability density.
Figure 4Expression of telomere-associated genes in T.parva clustered using the TRIBE-MCL algorithm. Telomere-associated regions of each of the four T.parva chromosomes are illustrated showing the relative position of the ORFs on the forward (top) and reverse (bottom) strands with the multigene families color-coded. The left telomere is shown above the right. Genes within the same gene family identified using TRIBE-MCL are color-coded, with the copy number in the genome indicated. Arrows above each ORF indicate the MPSS expression level in two quantitative categories; 4–99, 100–999.
Figure 5Functional profiles of expressed proteins plotted according to their classification as defined by MIPS catalog (24). Only one class was assigned per protein to avoid redundancy. A complete list of the properties of the signatures, including functional categorization of predicted proteins is provided in Supplementary Table 2.
MPSS signatures of genes encoding antigens, Tp1–Tp5 and Tp7–Tp8 that are recognized by bovine CD8+ class I-restricted cytotoxic T cells
| Antigen | Gene length (bp) | Protein size (amino acids) | Molecular weight (predicted) | PI (predicted) | MPSS t.p.m. |
|---|---|---|---|---|---|
| Tp1 | 1771 | 543 | 61434.6 | 6.31 | 261 |
| Tp2 | 1019 | 174 | 19142.3 | 8.29 | 1069 |
| Tp3 | 827 | 265 | 28685.4 | 6.54 | 44 |
| Tp4 | 2595 | 579 | 63357.5 | 5.91 | 6 |
| Tp5 | 765 | 155 | 17806.7 | 4.49 | 163 |
| Tp7 | 2996 | 721 | 83667.7 | 4.77 | 8803 |
| Tp8 | 1581 | 440 | 50026.8 | 7.37 | 64 |
Tp6 did not contain a DpnII site and was therefore not detectable by MPSS.