| Literature DB >> 22004680 |
Robert S Coyne1, Linda Hannick, Dhanasekaran Shanmugam, Jessica B Hostetler, Daniel Brami, Vinita S Joardar, Justin Johnson, Diana Radune, Irtisha Singh, Jonathan H Badger, Ujjwal Kumar, Milton Saier, Yufeng Wang, Hong Cai, Jianying Gu, Michael W Mather, Akhil B Vaidya, David E Wilkes, Vidyalakshmi Rajagopalan, David J Asai, Chad G Pearson, Robert C Findly, Harry W Dickerson, Martin Wu, Cindy Martens, Yves Van de Peer, David S Roos, Donna M Cassidy-Hanley, Theodore G Clark.
Abstract
BACKGROUND: Ichthyophthirius multifiliis, commonly known as Ich, is a highly pathogenic ciliate responsible for 'white spot', a disease causing significant economic losses to the global aquaculture industry. Options for disease control are extremely limited, and Ich's obligate parasitic lifestyle makes experimental studies challenging. Unlike most well-studied protozoan parasites, Ich belongs to a phylum composed primarily of free-living members. Indeed, it is closely related to the model organism Tetrahymena thermophila. Genomic studies represent a promising strategy to reduce the impact of this disease and to understand the evolutionary transition to parasitism.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22004680 PMCID: PMC3341644 DOI: 10.1186/gb-2011-12-10-r100
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Life cycle of Ich. Infective theronts bore through the surface mucus and take up residence within the epithelium of susceptible fish. Theronts differentiate into feeding trophonts that grow and exit the host (as tomonts) within 4 to 7 days. Tomonts swim for a brief period and then adhere to an inert support where they secrete a gelatinous capsule. Tomonts divide within the capsule to form hundreds of tomites that differentiate into infective theronts within 18 to 24 hours at room temperature. Theronts that fail to infect fish die within 1 to 2 days.
Figure 2GC content of reads and scaffolds. (a) Percentage of GC content is plotted against percentage of combined quality Sanger and 454 reads of whole cell Ich DNA, showing a prominent shoulder of reads more GC-rich than expected for Ich. (b) Following assembly, mean GC content was plotted against the aggregate scaffold length within each percent GC bin, showing clean separation between scaffolds that make up the bacterial genomes and those that make up the much larger Ich genome.
Ordered list of Ich mitochondrial genes
| Start | End | Accession number | Product name | Gene symbol |
|
|---|---|---|---|---|---|
| 823 | 742 | IMG5_M206942 | tRNA-Tyr-2 | tRNA-Tyr-2 | √ |
| 1217 | 941 | IMG5_M206943 | LSU rRNA | √ | |
| tRNA-Leu | |||||
| 3687 | 1410 | IMG5_M206944 | LSU rRNA | √ | |
| 5041 | 3695 | IMG5_M206945 | Ymf 66 | ||
| 5301 | 4999 | IMG5_M206946 | Ymf57 | ||
| 6607 | 5441 | IMG5_M206947 | Ymf76 | √ | |
| 7446 | 6622 | IMG5_M206948 | Ribosomal protein S13 | √ | |
| 7903 | 7424 | IMG5_M206949 | Ribosomal protein S3 | √ | |
| 8202 | 7903 | IMG5_M206950 | Ribosomal protein S19 | √ | |
| 8996 | 8208 | IMG5_M206951 | Ribosomal protein L2 | √ | |
| 9471 | 8977 | IMG5_M206952 | Hypothetical protein | Hypotheticalc | |
| 9961 | 9455 | IMG5_M206953 | NADH dehydrogenase subunit 10 | √ | |
| 10381 | 9977 | IMG5_M206954 | Ribosomal protein S12 | √ | |
| 10914 | 10393 | IMG5_M206955 | NADH dehydrogenase subunit 2 | √ | |
| 10922 | 12214 | IMG5_M206956 | NADH dehydrogenase subunit 7 | √ | |
| 12220 | 12522 | IMG5_M206957 | Ribosomal protein S14 | √ | |
| 12522 | 13082 | IMG5_M206958 | Ymf60 | √ | |
| 14068 | 13079 | IMG5_M206959 | Ymf64 | √ | |
| 14748 | 14170 | IMG5_M206960 | Ymf75 | √ | |
| 14821 | 14750 | IMG5_M206961 | tRNA-Phe-1 | tRNA-Phe | √ |
| 15055 | 14828 | IMG5_M206962 | ATP synthase F0 subunit 9 | √ | |
| 15863 | 15060 | IMG5_M206963 | Ymf63 | √ | |
| 16966 | 15881 | IMG5_M206964 | Ymf65 | √ | |
| 17653 | 17174 | IMG5_M206965 | Ymf59 | √ | |
| 18096 | 17656 | IMG5_M206966 | Ribosomal protein L16 | √ | |
| 19722 | 18163 | IMG5_M206967 | Heme maturase | √ | |
| 20433 | 19726 | IMG5_M206968 | Ymf61 | √ | |
| tRNA-His | |||||
| 20829 | 20464 | IMG5_M206969 | NADH dehydrogenase subunit 3 | √ | |
| 21193 | 20852 | IMG5_M206970 | Ymf58 | √ | |
| 21795 | 21214 | IMG5_M206971 | NADH dehydrogenase subunit 9 | √ | |
| 25990 | 21839 | IMG5_M206972 | Ymf77 | √ | |
| 28327 | 28148 | IMG5_M206973 | NADH dehydrogenase subunit 1 | ||
| 28510 | 29802 | IMG5_M206974 | Apocytochrome b | √ | |
| 29792 | 31885 | IMG5_M206975 | NADH dehydrogenase subunit 5 | √ | |
| 31902 | 33521 | IMG5_M206976 | Cytochrome c oxidase subunit 2 | √ | |
| 33532 | 33735 | IMG5_M206977 | SSU rRNA | √ | |
| 33783 | 35202 | IMG5_M206978 | SSU rRNA | √ | |
| 35555 | 36841 | IMG5_M206980 | Ymf67 | √ | |
| 36862 | 36932 | IMG5_M206981 | tRNA-Trp-1 | tRNA-Trp | √ |
| 36961 | 38583 | IMG5_M206982 | Ymf68 | √ | |
| 38587 | 38865 | IMG5_M206983 | Hypothetical protein | Hypotheticalc | |
| 38942 | 41014 | IMG5_M206984 | Cytochrome c oxidase subunit 1 | √ | |
| 41015 | 41923 | IMG5_M206985 | NADH dehydrogenase subunit 1 | √ | |
| 41847 | 42605 | IMG5_M206986 | Ymf62 | √ | |
| 42627 | 42986 | IMG5_M206987 | Ribosomal protein L14 | √ | |
| tRNA-Glub | |||||
| 43012 | 43278 | IMG5_M206988 | Ymf70 | √ | |
| 43284 | 44807 | IMG5_M206989 | NADH dehydrogenase subunit 4 | √ | |
| 44811 | 45284 | IMG5_M206990 | Ymf73 | √ | |
| 45323 | 45393 | IMG5_M206991 | tRNA-Glu-1 | tRNA-Glub | |
| 45404 | 47681 | IMG5_M206992 | LSU rRNA | √ | |
| tRNA-Leu | |||||
| 47874 | 48150 | IMG5_M206993 | LSU rRNA | √ | |
| 48267 | 48348 | IMG5_M206994 | tRNA-Tyr-1 | tRNA-Tyr-1 | tRNA-Met |
aA checkmark indicates that the T. thermophila mtDNA contains a homolog of the same gene in the same relative position and orientation. bDivergent gene order. c'Hypothetical' indicates insufficient evidence to assign gene name.
Important genome statistics
| Ich |
| |
|---|---|---|
| Total predicted genes | 8,096 | 24,725 |
| Percentage coding | 21.0 | 47.8 |
| Total annotated sequence lengtha | 47,869,613 | 103,002,206 |
| Percentage GC | 15.9 | 22.3 |
| Longest gene | 21,958 | 47,333 |
| Shortest gene | 152 | 119 |
| Total gene length | 13,277,109 | 62,628,433 |
| Average gene lengthb | 1,639 | 2,533 |
| Average gene coding sequence | 1,243 | 1,989 |
| Gene density (per 10,000 bp) | 16.92 | 41.6 |
| Percentage genes with introns | 79.0 | 71.4 |
| Total exon length | 10,666,748 | 49,184,519 |
| Total number of exons | 29,479 | 114,215 |
| Longest exon | 11,206 | 14,389 |
| Shortest exon | 2 | 2 |
| Average exon length | 361.8 | 430.6 |
| Percentage GC | 24.1 | 27.5 |
| Total number of introns | 21,380 | 89,490 |
| Longest intron | 11,437 | 13,045 |
| Shortest intron | 16 | 18 |
| Average intron length | 152.8 | 162.7 |
| Average introns per gene | 2.6 | 3.6 |
| Percentage GC | 12.3 | 16.1 |
| Total intergenic regions | 33,985,751 | 39,886,399 |
| Shortest intergenic region | 2 | 2 |
| Longest intergenic region | 42,323 | 46,152 |
| Average intergenic length | 3,650 | 1,562 |
| Percentage GC | 13.9 | 18.1 |
aScaffolds < 2 kb in length were not annotated. bNot including 5' and 3' untranslated regions.
Figure 3Ortholog grouping. (a) Phyletic grouping of Ich genes with previously grouped orthologs from other species. Numbers in parentheses indicate the total number of ortholog groups to which the respective genes belong. (b) Shared orthology between the three ciliate genomes. The numbers depict the total number of ortholog groups in each category. The numbers within the brackets indicate ortholog groups specific for Tetrahymena only and accordingly there are 341 ciliate specific ortholog groups shared with all 3 ciliates compared here. See Additional file 6 for more details regarding shared orthology between the three ciliates.
Major kinase groups of Ich compared with other species
| Kinase group |
|
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|---|---|---|
| AGC | 78 (8) | 51 (8) | 219 (8) | 6 (4) | 11 (4) | 17 (6) | 29 (16) | 75 (13) | 63 (15) |
| Atypical | 37 (5) | 103 (6) | 270 (6) | 5 (3) | 5 (2) | 14 (6) | 18 (7) | 47 (12) | 38 (12) |
| CAMK | 68 (5) | 62 (6) | 442 (6) | 13 (4) | 23 (3) | 22 (4) | 40 (14) | 95 (17) | 74 (18) |
| CK1 | 24 (1) | 19 (1) | 125 (1) | 3 (1) | 3 (1) | 4 (1) | 83 (12) | 14 (3) | 12 (3) |
| CMGC | 68 (9) | 61 (10) | 199 (9) | 16 (8) | 21 (6) | 23 (8) | 48 (10) | 65 (9) | 63 (9) |
| Other | 330 (24) | 747 (45) | 1,449 (40) | 28 (6) | 71 (11) | 37 (20) | 67 (27) | 80 (30) | 81 (34) |
| RSK | 2 (1) | 1 (1) | 5 (1) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
| RGC | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 27 (1) | 8 (1) | 5 (1) |
| STE | 13 (4) | 19 (4) | 39 (4) | 1 (1) | 1 (1) | 14 (3) | 24 (3) | 51 (4) | 47 (4) |
| TK | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 0 (0) | 84 (18) | 107 (28) | 90 (29) |
| TKL | 13 (3) | 5 (3) | 15 (3) | 4 (1) | 7 (1) | 0 (0) | 15 (5) | 15 (7) | 45 (8) |
| Unclassified (PF00069) | 38 (1) | 0 (0) | 0 (0) | 36 (1) | 23 (1) | 0 (0) | 0 (0) | 0 (0) | 0 (0) |
| Total | 671 (61) | 1,068 (84) | 2,763 (78) | 112 (29) | 165 (30) | 131 (48) | 435 (113) | 557 (124) | 518 (133) |
| Percentage of proteome | 8.29 | 3.89 | 8.47 | 2.04 | 1.83 | 1.96 | 2.15 | 2.31 | 2.19 |
Total number of genes mapped to each kinase group; in parentheses, number of different families. An expanded list of all families under each kinase group is shown in Additional file 7. T. the, T. thermophila; P. tet, P. tetraurelia; P. fal, Plasmodium falciparum; T. gon, Toxoplasma gondii; S. cer, Saccharomyces cerevisiae; C. ele, Caenorhabditis elegans; D. rer, Danio rerio; H. sap, Homo sapiens. *Data for these two species was obtained from published reports [117,118] rather than from kinbase [116] or by orthology to the kinbase data, as for the other species.
Figure 4Immobilization antigens. (a) Unrooted neighbor-joining phylogenetic tree [123] of the 17 Ich immobilization antigen genes. The three described in the text as being related to previously identified genes are in blue. (b) Scaffolds containing arrays or tandem duplications of putative immobilization antigen genes (blue arrows) and pseudogenes (red outlined arrows). Locus accessions identify the individual genes. GenBank accession numbers for scaffolds 1 to 4 are as follows: 1, GL983567; 2, GL983846; 3, GL984331; 4, GL984394.
Comparison of Ich and Tetrahymena membrane transporters according to TC class, superfamily and substrate type
| Number of transporters in Ich (percentage of total transporters) | Number of transporters in | |
|---|---|---|
| TC class | ||
| 1. Channels | 155 (32) | 418 (38) |
| 2. Secondary carriers | 191 (40) | 386 (36) |
| 3. Primary active transporters | 131 (27) | 269 (25) |
| 9. Poorly characterized transporters | 6 (1) | 13 (1) |
| Total | 483 | 1086 |
| TC superfamily | ||
| VIC | 116 (24) | 396 (36) |
| APC | 7 (1) | 64 (6) |
| MFS | 75 (16) | 146 (13) |
| MC | 38 (8) | 43 (4) |
| CPA | 16 (3) | 20 (2) |
| ABC | 40 (8) | 159 (15) |
| P-ATPase | 65 (13) | 91 (8) |
| Substrate type | ||
| Inorganic molecules | 249 (51) | 500 (53) |
| Carbon sources | 54 (11) | 77 (8) |
| Drugs, toxins, macromolecules | 77 (16) | 155 (17) |
| Vitamins, co-factors | 15 (3) | 23 (2) |
| Nucleotide bases and derivatives | 21 (4) | 26 (3) |
| Amino acids and derivatives | 34 (7) | 49 (5) |
| Unknown | 38 (8) | 110 (12) |
MFS, major facilitator superfamily; VIC, voltage-gated ion channel; MC, mitochondrial carrier; CPA, cation:proton antiporter.
Protease complements in Ich and other model organisms
| Catalytic class | Percentage of the | ||||||
|---|---|---|---|---|---|---|---|
| Organism | Aspartic | Cysteine | Metallo | Serine | Threonine | Total | proteomea |
| 14 (5.5%) | 81 (31.9%) | 119 (46.9) | 25 (9.9%) | 15 (5.9%) | 254 | 3.1 | |
| 48 (8.3%) | 225 (38.9%) | 168 (29.1%) | 95 (16.4%) | 42 (7.3%) | 578 | 1.5 | |
| 43 (9.0%) | 211 (44.0%) | 139 (28.9%) | 73 (15.2%) | 14 (2.9%) | 480 | 1.7 | |
| 17 (10.5%) | 33 (34.7%) | 21 (22.1%) | 16 (16.9%) | 15 (15.8%) | 95 | 1.8 | |
| 19 (8.1%) | 41 (17.4%) | 81 (34.5%) | 75 (31.9%) | 19 (8.1%) | 235 | 2.4 | |
| 19 (11.1%) | 41 (24.0%) | 57 (33.3%) | 38 (22.2%) | 16 (9.4%) | 171 | 2.4 | |
| 27 (5.6%) | 125 (25.9%) | 190 (39.4%) | 115 (23.9%) | 25 (5.2%) | 482 | 2.4 | |
| 46 (6.2%) | 86 (11.5%) | 207 (27.7%) | 373 (49.9%) | 35 (4.7%) | 747 | 5.4 | |
| 320 (29.3%) | 190 (17.4%) | 252 (23.0%) | 291 (26.6%) | 41 (3.7%) | 1,094 | 4.5 | |
| 233 (27.6%) | 162 (19.2%) | 112 (13.3%) | 306 (36.2%) | 31(3.7%) | 849 | 3.1 | |
Values in parentheses are the percentage of the individual catalytic class in the protease complement. aThe percentage of the whole genome that encodes putative proteases. bThe distributions of T. thermophila and P. falciparum are based on Eisen et al. [11] and Wu et al. [60], respectively. The distributions of the other model organisms are based on the results published in Merops database.
Cytoskeletal proteins in Ich, T.thermophila and H. sapiens
| Protein type | Ich |
|
|
|---|---|---|---|
| α-Tubulin | 1 | 1 | 9 |
| α-Tubulin-like | 0 | 3 | 0 |
| β-Tubulin | 3 | 2 | 9 |
| β-Tubulin-like | 0 | 6 | 0 |
| γ-Tubulin | 1 | 1 | 2 |
| ε-Tubulin | 1 | 1 | 1 |
| δ-Tubulin | 0 | 1 | 1 |
| η-Tubulin | 1 | 1 | 0 |
| ι-Tubulin | 0 | 3 | 0 |
| Tubulin tyrosine ligase-like | 31 | 50 | 14 |
| Kinesin motor domain | 41 | 78 | 48 |
| Dynein heavy chain | 19 | 25 | 16 |
| Dynein intermediate chain | 6 | 6 | 7 |
| Dynein light intermediate chain | 1 | 2 | 3 |
| Dynein light chain | 16 | 14 | 9 |
| Myosin motor domain | 3 | 13 | 22 |
| Centrin 1 | 1 | 1 | 1 |
| Centrin 2 | 2 | 1 | 1 |
| Centrin 3 | 1 | 1 | 1 |
| Centrin 4 | 1 | 1 | 0 |
| Bld10/Cep135 | 1 | 1 | 1 |
| Centriolin | 0 | 1 | 1 |
| Cep76 | 0 | 0 | 1 |
| Cep164 | 0 | 1 | 1 |
| Dip13 | 0 | 0 | 2 |
| Poc1 | 1 | 1 | 2 |
| Poc5 | 1 | 1 | 1 |
| Sas4/CPAP | 2 | 1 | 2 |
| Sas6 | 1 | 2 | 1 |
| VFL1a/CLERC | 1 | 2 | 1 |
| WDR16 | 1 | 1 | 1 |
| MKS1 | 1 | 1 | 1 |
| MKS3 | 2 | 2 | 1 |
| MKS4/Cep290 | 0 | 0 | 1 |
| MKS5/RPGRIP1L | 1 | 1 | 1 |
| MKS6/CCD2A | 1 | 1 | 1 |
| AHI1 | 1 | 1 | 1 |
| NPHP1 | 1 | 1 | 1 |
| NPHP3 | 0 | 0 | 1 |
| NPHP4 | 1 | 1 | 1 |
| BBS1 | 1 | 1 | 2 |
| BBS2, 5, 7, 8, or 9 | 1 | 1 | 1 |
| BBS3/ARL6 | 0 | 1 | 1 |
| BBS4 | 0 | 1 | 1 |