Barbara J Davids, David S Reiner, Shanda R Birkeland, Sarah P Preheim, Michael J Cipriano, Andrew G McArthur, Frances D Gillin.
Abstract
Since the Giardia lamblia cyst wall is necessary for survival in the environment and host infection, we tested the hypothesis that it contains proteins other than the three known cyst wall proteins. Serial analysis of gene expression during growth and encystation revealed a gene, "HCNCp" (High Cysteine Non-variant Cyst protein), that was upregulated late in encystation, and that resembled the classic Giardia variable surface proteins (VSPs) that cover the trophozoite plasmalemma. HCNCp is 13.9% cysteine, with many "CxxC" tetrapeptide motifs and a transmembrane sequence near the C-terminus. However, HCNCp has multiple "CxC" motifs rarely found in VSPs, and does not localize to the trophozoite plasmalemma. Moreover, the HCNCp C-terminus differed from the canonical VSP signature. Full-length epitope-tagged HCNCp expressed under its own promoter was upregulated during encystation with highest expression in cysts, including 42 and 21 kDa C-terminal fragments. Tagged HCNCp targeted to the nuclear envelope in trophozoites, and co-localized with cyst proteins to encystation-specific secretory vesicles during encystation. HCNCp defined a novel trafficking pathway as it localized to the wall and body of cysts, while the cyst proteins were exclusively in the wall. Unlike VSPs, HCNCp is expressed in at least five giardial strains and four WB subclones expressing different VSPs. Bioinformatics identified 60 additional large high cysteine membrane proteins (HCMp) containing ≥20 CxxC/CxC motifs and lacking the VSP-specific C-terminal CRGKA. HCMp were absent or rare in other model or parasite genomes, except for Tetrahymena thermophila with 30. MEME analysis classified the 61 gHCMp genes into nine groups with similar internal motifs. Our data suggest that HCNCp is a novel invariant cyst protein belonging to a new HCMp family that is abundant in the Giardia genome.
HCNCp and the other HCMp provide a rich source for developing parasite-specific diagnostic reagents, vaccine candidates, and subjects for further research into Giardia biology.
Year: 2006 PMID: 17183673 PMCID: PMC1762436 DOI: 10.1371/journal.pone.0000044
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1. HCNCp mRNA Transcripts Are Upregulated During Encystation
SAGE revealed increased steady state abundance of HCNCp transcripts beginning at 12 hr encystation through 42 hr compared to trophozoite (“troph”) and 4 hr encystation.
SAGE data are presented as percentage of all tags sampled at a given time point.
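The normalization described in this legend (tag abundance as a percentage of all tags sampled at a time point) can be sketched as follows. This is a minimal illustration only: the function name `tag_percentages` and the tag labels are hypothetical, and the counts are made-up values, not data from the study.

```python
from collections import Counter

def tag_percentages(tags):
    """Return each tag's share of all sampled tags, as a percentage."""
    counts = Counter(tags)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {tag: 100.0 * n / total for tag, n in counts.items()}

# Hypothetical tag library for a single time point (not real SAGE data).
library = ["TAG_A"] * 3 + ["TAG_B"] * 1 + ["TAG_C"] * 16
percentages = tag_percentages(library)
print(round(percentages["TAG_A"], 2))  # 3 of 20 tags
```

Because each library is normalized by its own total, values from time points with different sampling depths can be compared directly.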
HCMp Proteins In The Giardial Genome Ordered By Transmembrane Region
| ORF | MW | % C | CxxC | CxC | Transmembrane | C-terminus |
| | 179.9 | 11 | 6 | 28 | AIVVVVLLVLAAVAGFLVWWFVI | RPRKSGVLRERAPRKPGRGLPKPQLTRNSSLTASMYSTAPLLNGSRQPSMVQL |
| | 178.5 | 11 | 6 | 28 | AIVVVVLLVLVAVAGFLIWWFVI | RPRRGGPLRERAPQKGSKSSRSKLKRQATSNASLHADVPLLSQLSGANSSIQL |
| | 177.3 | 11 | 6 | 28 | AIVVVVLLVLVAVAGFLVWWFVI | RPRRGGPLRERAPRKPPGLPKPQLTHNSSLTASMYSTAPLLSGSRQSSMVQL |
| | 176.1 | 11 | 6 | 28 | AIVVVVLLVLVAVAGFLVWWFVI | RPRRGGPLRERVPQKGSKSSSSSRSKLRRQADSSVSLHADVPLLSQASNVNSSIQL |
| | 178.1 | 11 | 6 | 28 | IAVVVVVLLVLVAVAGFLVWWFV | IRPRKSGELRERAPRKGPSFSGSQKLKKQASSKASLHATAPLLSRSSHAGSSVQL |
| | 177.9 | 11 | 6 | 28 | IAVVVVVLLVLVAVAGFLVWWFV | IRPRKSGELRERAPRKGPSFSGSQKLKKQASSKASLHATAPLLSRSSHAGSSVQL |
| | 162.6 | 11 | 4 | 26 | IIVVVVVALVILAVAIFLIWKFV | IKPRSKSRKRSNNDLNQRLITVDGEDAPVLSPRARGQTLM |
| | 86.5 | 11 | 34 | 0 | AIAAIVIVVLLVLAVAGFLVWWF | VFRRGGRPKRGAKYTSLMRGESYDYRQSLI |
| | 85.8 | 12 | 34 | 0 | AIAAIVIVVLLVLAVAGFLVWWF | VFRRGGRPKRGAKYTSLMRGESYDYRQSLI |
| | 86.2 | 12 | 34 | 0 | AIAAIVIVVLLVLAVAGFLVWWF | VFRRGGRPKRGAKYTSLMRGESYDYRQSLI |
| | 85.7 | 12 | 34 | 0 | AIAAIVIVVLLVLAVAGFLVWWF | VFRRGGRPKRGAKYTSLMRGESYDYRQSLI |
| | 71.9 | 11 | 21 | 3 | VTVAVVVSLLIIAAVALACWLVL | RRRRGSKIVSRKRTAAENIKLMGSVDEF |
| | 121.5 | 13 | 56 | 1 | AGVSVTAILLVTGLVSFLLWWFL | CKRT |
| | 120.8 | 13 | 56 | 1 | AGVSVTAVLLVTGLVSFLLWWFL | CRRSR |
| | 122.5 | 13 | 56 | 0 | GVSVAAVLLVGSVVGFLLWWFLY | RRTQPYGAKHVALSRKRIPTTASITTPLN |
| | 98.9 | 13 | 4 | 17 | GVLIGTSVGGVVLLIAIIVGVFF | CVKKARKGKAPEGRAGKKTRALLSDGDDEDEDLLVPSESESGSASAATGTDEIP |
| | 79.6 | 13 | 19 | 6 | IAGLSVGSIILLTAVALSIYFGV | RACQKKNRSQVSFSRTVPIGDEDCNP |
| | 140.4 | 14 | 69 | 3 | IGLSVTAVALLLIALGILLWKLL | SARKLSRTRCMHLPGSTTELLAGASGTGQSMIFDSINLSIT |
| | 74.4 | 12 | 31 | 0 | GIAVAVIIVVGGLVGFLCWWFIY | RGRK |
| | 72.5 | 12 | 29 | 0 | GISVAVIVVVGGLIGFLCWWFL | CRGKA |
| | 60.4 | 11 | 21 | 0 | GVSVVVIIVVGGLAGFLCWWFI | GRRKA |
| | 58.9 | 11 | 21 | 0 | SPSAIAGVSVVVIIVVGGLAGFL | C |
| | 75.9 | 11 | 28 | 1 | AVGVTIAILAIAGVIGFLVWWFV | CKKKTNKVNMPPKLSSNSSIVSSRIGLM |
| | 220.5 | 12 | 30 | 25 | GSVIAVLLVLGGVIGFCVWWFLV | RGKKGQAAQKGRRGKSYSGRKYVRTVSDPDSTSLLSTDMTNSLL |
| | 153.5 | 11 | 5 | 24 | YVPIVVGVLVLSVLAGLIGFLSW | RFCCKNKKPKHPLDIPTKRRGRRERSAIALISYANFRGRNANEAMGLAELEEGSDGSAAV |
| | 86.0 | 12 | 34 | 1 | GIAIGVVLVVGGVAAVLVWFFVF | RKKN |
| | 87.3 | 12 | 34 | 1 | GIAIGVVLVVGGVAAVLVWFFVF | RKKAGVPLLYKQSIVASTH |
| | 85.4 | 13 | 34 | 1 | GIAIGVVLVVGGVAAVLVWFFVF | RKKN |
| | 85.6 | 13 | 34 | 1 | GIAIGVVLVVGGVAAVLVWFFVF | RKKN |
| | 86.1 | 12 | 34 | 1 | GIAIGVVLVVGGVAGVLVWVFVF | RKKAGVPPLYKQSIVASTH |
| | 85.2 | 13 | 34 | 1 | GIAISVVLVVGGVAGVLVWFFVF | RKKN |
| | 169.6 | 12 | 24 | 22 | ITIAVLVVVGGSVAGVLVWFFLF | RNRKGPMKKSPRRRFHPDETSTSLLSQDYGSSML |
| | 169.4 | 14 | 77 | 8 | ITGIAIAVIAVIGCAVGVLVWFL | CCRRSKAV |
| | 85.6 | 12 | 30 | 2 | AGISVASVAIVAVIIGCLVWFLL | RRKNSGSLNREPSLFRPLSS |
| | 99.6 | 12 | 38 | 1 | GVVAGISITIVVVVAAIVGVLVW | KYVCKKKSSKRIKMVDMDVSINTSQYMSTSTV |
| | 262.0 | 13 | 108 | 1 | AITGITLGVAVLVGGAIGLALGL | TVCKRGCAGNGSLQPLTI |
| | 263.8 | 13 | 110 | 1 | AITGITLGVAVLVGGAIGLALGL | TVCKRGCAGSTGEGLRPLYA |
| | 265.7 | 13 | 110 | 2 | AITGITLGVTVLVGGTIGLTFSL | MASKHKSQASGLRPLTA |
| | 139.8 | 16 | 70 | 18 | ATAISCSVIGAAIAIATTIALIV | KCQHHRHARMVRQVDALVTEADELAH |
| | 145.5 | 12 | 5 | 25 | IGAVAGVTASTAVFFGLLFLSAA | RCHRTPAR |
| | 79.4 | 11 | 22 | 1 | GIGVGVSVVVLVLIGVLIWWLVF | QRRRGSGFGGSRDMLTSRD |
| | 82.6 | 11 | 22 | 1 | VIIGICVGAVLIMGALIGVLAWW | VVSRKKKSASFERSRSLVISKS |
| | 87.1 | 12 | 24 | 3 | IAGGTVAGVAVIGVLVGFLCWWF | LCRGKRIGASPSTTALVRPKSV |
| | 85.8 | 12 | 24 | 3 | IAGGTVAGVVVIGVLVGFLCWWF | LCRSKHIGASSSTTALVRSKSV |
| | 84.8 | 12 | 23 | 3 | TAAIAGGTVAGVIVIGSLVGFL | CW |
| | 86.0 | 11 | 24 | 3 | AGTTMAVLVVGVLVGFLCWWFIF | RGRRIDASPSTMTLISSKSM |
| | 84.8 | 12 | 24 | 3 | ASAASAVLVISALIGFLCWWFI | CRGKRRYYR |
| | 88.3 | 12 | 38 | 0 | MEISVAMVAIGALVGFLCWWFIF | RGRRIDASPSTMTLIPSKSI |
| | 72.8 | 11 | 20 | 3 | IAIAVIVIVVIVGVLVGVLCWYF | LRNKRKRSIRPRTVSKMSESMGLVGSVDDF |
| | 87.8 | 15 | 50 | 0 | SSLALFIIMLSLIILWIVVILIV | RYKEGARQHITTSLSNTETI |
| | 89.7 | 14 | 47 | 0 | LIVGVIVFIIVLILIVVTVVLLI | RLRKRMDREDAIFCEQNTTLIPDGQGLEYQDSEEP |
| | 62.5 | 11 | 19 | 2 | IAVGVAASVVGIMVVALVCWLVL | GVSSCLTCIENCAECKQTGTASFEC |
| | 66.9 | 14 | 32 | 0 | ASSVGIVVLVLLVCIGVGLFFVF | RRKPAQQLDVTSETRLMNSQKNMEPTSVDTNPECVEYETD |
| | 62.6 | 13 | 31 | 1 | WLYMWMMFVLFSGLMLLAFLS | TCFKKVLVAKEQSRPSSTCDSVSKDKRC |
| | 56.7 | 12 | 24 | 3 | SITLPLAVLLLAATVITLAIILV | KKRKTTKHGLSTNPVTLTTMVST |
| | 72.6 | 14 | 36 | 0 | VTAAIVILVLLLIGVCAAIPFIV | KVLVRKGVRRRTRAMKYDRGSADSLLPEGSADSAL |
| | 66.9 | 12 | 30 | 0 | IVGISAGVILAVGIIAGAIVMTV | TSKKHK |
| | 72.8 | 14 | 36 | 0 | VTAAIVILVLLLIGVCAAIPFIV | KVLVRKGVRRRTRAMKYDRGSADSLLPEGSADSAL |
| | 109.1 | 15 | 61 | 2 | VVLGAALAISFAGGSSIALYFII | RRLLQ |
| | 61.7 | 14 | 30 | 0 | ISSVSATVLVLLACIGIGLFFIF | RRRSPRLDAASEALSVETLQEHELSRQC |
| | 70.1 | 14 | 35 | 0 | LISTIVILGLIIVGFIVATPFLV | KLAKKRGVRVSKLRSINSGSESHNEFLLEEPDLL |
| | 71.9 | 14 | 36 | 0 | LISTIVILGLIIVGFIVATPFLV | KLAKKRGVRVSKLRSINSGSESHNEFLLEEPDLL |
The prototypical VSP (TSA 417) is included for comparison. ORF ID numbers can be used to view the data on GiardiaDB (www.mbl.edu/Giardia).
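The "% C", "CxxC" and "CxC" columns above, together with the abstract's screening criterion (at least 20 CxxC/CxC motifs and no C-terminal CRGKA), can be approximated with a short sketch. The assumptions here are ours, not the authors': "x" is taken to mean any residue, overlapping motifs are counted separately, and the function names are hypothetical. The size and membrane-anchor requirements mentioned in the abstract are not checked.

```python
import re

def cysteine_percent(seq):
    """Percentage of residues in seq that are cysteine (C)."""
    seq = seq.upper()
    return 100.0 * seq.count("C") / len(seq) if seq else 0.0

def count_motif(seq, pattern):
    """Count motif occurrences, including overlapping ones, via a lookahead."""
    return len(re.findall(f"(?={pattern})", seq.upper()))

# Assumption: "x" stands for any single residue.
CXXC = "C..C"
CXC = "C.C"

def is_hcmp_candidate(seq, min_motifs=20, vsp_tail="CRGKA"):
    """Apply only the motif-count and C-terminus parts of the HCMp criterion."""
    motifs = count_motif(seq, CXXC) + count_motif(seq, CXC)
    return motifs >= min_motifs and not seq.upper().endswith(vsp_tail)

# Toy peptide for illustration only (not a Giardia sequence).
toy = "MCAACGGCTCKK"
print(count_motif(toy, CXXC), count_motif(toy, CXC))
print(round(cysteine_percent(toy), 1))
```

The lookahead pattern matters because adjacent motifs often share a cysteine (for example `CAACGGC` contains two CxxC motifs); a plain `re.findall("C..C", seq)` would skip the second one.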
HCMp Categories And HMM Analyses
| ORF_ID | GGCY | CxxC | CxC | MEME group | TMK HMM (E value) | VSP HMM (E value) | EGF HMM (E value) | PC6B repeats |
| 113797* | 1 | 29 | 0 | VSP | 0.005 | 2.3E-306 | 0.30 | |
| | 0 | 6 | 28 | | 0.290 | 4.20E-04 | 6.80E-40 | |
| | 0 | 6 | 28 | | 0.120 | 3.10E-04 | 4.10E-38 | |
| | 0 | 6 | 28 | | 0.280 | 0.002 | 3.80E-37 | |
| | 0 | 4 | 26 | | 0.270 | 0.028 | 6.70E-37 | |
| | 0 | 6 | 28 | | 0.072 | 0.007 | 6.80E-35 | |
| | 0 | 6 | 28 | | 0.200 | 0.003 | 1.40E-32 | |
| | 0 | 5 | 24 | | 0.160 | 0.009 | 4.30E-31 | |
| | 0 | 6 | 28 | | 0.210 | 0.002 | 1.50E-30 | |
| 40376** | 5 | 77 | 8 | Group 1 | 4.30E-05 | 4.8E-23 | 0.01 | 4 |
| | 3 | 30 | 25 | | 0.004 | 9.1E-35 | 2.00E-29 | 1 |
| | 3 | 19 | 2 | | 0.170 | 1.30E-05 | 0.01 | 1 |
| | 3 | 22 | 1 | | 0.030 | 6.7E-17 | 0.16 | |
| | 2 | 21 | 3 | | 0.070 | 3.40E-06 | 0.01 | 1 |
| | 2 | 20 | 3 | | 0.059 | 9.3E-10 | 0.02 | 1 |
| | 2 | 31 | 0 | | 0.011 | 4.7E-14 | 0.07 | 1 |
| | 2 | 38 | 1 | | 7.40E-04 | 7E-16 | 0.12 | |
| | 2 | 22 | 1 | | 0.036 | 1.1E-16 | 1.10 | |
| | 1 | 24 | 22 | | 0.012 | 3.4E-27 | 2.50E-36 | 1 |
| | 1 | 69 | 3 | | 2.70E-05 | 3.9E-18 | 0.03 | 2 |
| | 1 | 38 | 0 | | 0.008 | 6.4E-48 | 0.13 | 2 |
| | 1 | 28 | 1 | | 0.025 | 5.6E-11 | 0.05 | |
| | 1 | 24 | 3 | | 0.390 | 3.9E-14 | 0.12 | |
| | 1 | 30 | 2 | | 0.005 | 8.00E-08 | 0.15 | |
| | 0 | 5 | 25 | | 0.076 | 0.008 | 5.50E-45 | |
| | 0 | 4 | 17 | | 0.140 | 7.50E-04 | 1.20E-22 | |
| | 0 | 19 | 6 | | 0.037 | 3.50E-05 | 2.80E-08 | |
| | 0 | 70 | 18 | | 2.90E-05 | 1.20E-08 | 0.03 | |
| | 0 | 30 | 0 | | 0.018 | 2.30E-05 | 0.03 | |
| | 0 | 31 | 1 | | 0.087 | 2.00E-06 | 0.04 | |
| | 0 | 50 | 0 | | 2.20E-04 | 2.50E-06 | 0.07 | |
| | 0 | 47 | 0 | | 5.70E-04 | 2.10E-06 | 0.10 | |
| | 0 | 61 | 2 | | 1.20E-04 | 2.30E-05 | 0.11 | 5 |
| | 2 | 34 | 0 | | 0.010 | 6.1E-12 | 0.19 | |
| | 1 | 34 | 0 | | 0.009 | 1.4E-11 | 0.12 | |
| | 1 | 34 | 0 | | 0.009 | 1.9E-12 | 0.24 | |
| | 1 | 34 | 0 | | 0.011 | 3E-11 | 0.29 | |
| | 3 | 24 | 3 | | 0.007 | 1.3E-13 | 0.04 | |
| | 3 | 24 | 3 | | 0.008 | 1.4E-13 | 0.05 | |
| | 3 | 24 | 3 | | 0.008 | 2.3E-15 | 0.07 | |
| | 2 | 23 | 3 | | 0.008 | 2.7E-13 | 0.04 | |
| | 2 | 24 | 3 | | 0.009 | 1.1E-14 | 0.05 | |
| | 0 | 36 | 0 | | 0.004 | 5.00E-06 | 0.01 | |
| | 0 | 35 | 0 | | 0.009 | 5.20E-06 | 0.01 | |
| | 0 | 36 | 0 | | 0.003 | 3.00E-09 | 0.02 | |
| | 0 | 36 | 0 | | 0.004 | 1.4E-09 | 0.11 | |
| | 0 | 32 | 0 | | 0.015 | 2.60E-06 | 0.03 | |
| | 0 | 30 | 0 | | 0.051 | 2.20E-06 | 0.11 | |
| | 2 | 21 | 0 | | 0.120 | 1.8E-58 | 0.29 | |
| | 2 | 21 | 0 | | 0.140 | 1.6E-73 | 0.29 | |
| | 12 | 110 | 1 | | 4.70E-05 | 1.1E-10 | 0.10 | 1 |
| | 12 | 108 | 1 | | 1.30E-04 | 6.1E-11 | 0.12 | 1 |
| | 11 | 110 | 2 | | 9.50E-05 | 2.4E-10 | 0.12 | 1 |
| | 4 | 56 | 1 | | 8.40E-05 | 4E-54 | 0.20 | 2 |
| | 4 | 56 | 1 | | 1.10E-04 | 6.3E-57 | 0.20 | 2 |
| | 4 | 56 | 0 | | 7.70E-05 | 2.1E-56 | 0.21 | 2 |
| | 3 | 34 | 1 | | 0.003 | 1.6E-70 | 0.16 | 1 |
| | 3 | 34 | 1 | | 0.004 | 5.3E-64 | 0.19 | 1 |
| | 3 | 34 | 1 | | 0.004 | 2.3E-65 | 0.83 | 1 |
| | 3 | 34 | 1 | | 0.002 | 2.3E-70 | 1.50 | 1 |
| | 2 | 34 | 1 | | 0.002 | 6.3E-69 | 0.19 | 1 |
| | 2 | 34 | 1 | | 0.002 | 3.9E-72 | 0.24 | 1 |
Figure 2. Expression of HCNCp During Differentiation
Full length epitope-tagged HCNCp protein (arrow at 170 kDa) was detected beginning at 21 hr encystation (“21”) and was at peak levels at the cyst (“C”) stage.
Proteins having an AU1 tag and relative molecular masses of 42 and 21 kDa also increased during encystation (arrows).
Anti-AU1 antibodies did not react to non-transfected “control” trophozoites (“T”) or encysting cells (“21”, “42”, or “C”).
Size markers are indicated by dashes on the left side of figure in kDa and the taglin loading control is shown at the bottom of the figure (“LC”).
Figure 3. Traffic of HCNCp And Cyst Proteins During Growth And Differentiation
Differential interference contrast (DIC) merged with DAPI images are shown to the left of each panel, HCNCp in green, anti-cyst proteins in red, and nucleic acid in blue (DAPI).
In trophozoites, HCNCp localized to nuclei and nuclear envelope/ER.
During encystation HCNCp co-localized with cyst proteins in encystation secretory vesicles (ESV) and to the cyst wall of water-resistant cysts.
In mature cysts, most of the HCNCp was within the cell body.
Scale bar is 5 µm.
Figure 4. Presence Or Absence Of The HCNCp Gene And mRNA Transcripts In Trophozoites Of 9 Giardia Strains/Subclones
(A) Genomic PCR found that the gene for HCNCp is present in all giardial isolates except for GS/M (upper panel).
Lower panel shows tubulin controls for each isolate.
(B) HCNCp mRNA transcripts were detected in all strains/clones tested except for the most divergent known Giardia strain infecting humans, GS/M (upper panel).
Tubulin controls for RT-PCR without reverse transcriptase (“-”) or plus reverse transcriptase (“+”) are shown in the lower panel.
Sizing markers are indicated by dashes to the left side of the figure in kb.
Available SAGE Data For The High Cysteine Membrane Proteins (HCMp) Outlined In Table 1
| ORF ID | 11309 | 15317 | 16318 | 25816 | 41942 | 91707 | 113213 | 113531 | 113987 | 114626 | 115066 |
| | 0.02104 | 0.10259 | 0.02367 | 0.01315 | 0.02104 | 0.0342 | 0.00526 | 0.02104 | 0.06313 | 0.02367 | 0.02104 |
| | 0.02159 | 0.11065 | 0.04318 | 0.05667 | 0.0081 | 0.01619 | 0.0054 | 0.01619 | 0.02969 | 0.0054 | 0.0054 |
| | 0.02116 | 0.16396 | 0.02909 | 0.06611 | 0.00529 | 0.00793 | 0.00793 | 0.01322 | 0.03173 | 0.02116 | 0.00529 |
| | 0.01089 | 0.11976 | 0.00544 | 0.01089 | 0.00544 | 0.01633 | 0.01089 | 0 | 0.02722 | 0.00544 | 0 |
| | 0.00511 | 0.12764 | 0.02298 | 0.01532 | 0.00511 | 0.01021 | 0.01276 | 0.02553 | 0.03319 | 0.02298 | 0.00255 |
SAGE data are presented as percentages of all tags sampled. ORF ID numbers can be used to view the data on GiardiaDB (www.mbl.edu/Giardia).
HCMp's In Model Organisms And Selected Parasite Genomes
| Organism | Genome Size | ≥10% Cysteine | Type1 TM | HCMp's | % Genome |
| | 12 | 173 | 152 | | 0.63 |
| | 104 | 285 | 45 | | 0.11 |
| | 2,910 | 28 | 9 | | 0.025 |
| | 20 | 28 | 17 | | 0.06 |
| | 180 | 18 | 6 | | 0.03 |
| | 97 | 39 | 9 | | 0.02 |
| | 125 | 6 | 4 | 0 | |
| | 80 | 7 | 3 | 0 | |
| | 120 | 3 | 1 | 0 | |
| | 30 | 3 | 1 | 0 | |
| | 23 | 0 | 0 | 0 | |
| | 12 | 0 | 0 | 0 | |
| | 10 | 0 | 0 | 0 | |
: Giardia genome ORF numbers from GiardiaDB (www.mbl.edu/Giardia):
40376,11309,25816,32607,10659,16318,15317,103454,101805,91707,7715,15008,17328,22547,9620,14017,15475,39904,16721,16842,15521,16374,91099,24880,27717,112126,113987,114891,114042,114089,112828,114930,114617,112633,137727,113213,113836,113531,114626,114991,16936,16716,21321,137715,112432,15250,41942,113512,114470,114180,113319,113416,112673,112135,115066,112584,113297,113801,137672,112305,11852
: gi|89285635,gi|89286821,gi|89288525,gi|89288637,gi|89288695,gi|89289016,gi|89289736,gi|89290140,gi|89290464,gi|89291419,gi|89291425,gi|89292518,gi|89292756,gi|89293702,gi|89295361,gi|89295485,gi|89295487,gi|89295488,gi|89295496,gi|89295499,gi|89295601,gi|89296595,gi|89299183,gi|89300019,gi|89302034, gi|89302083,gi|89304603,gi|89304693,gi|89306011,gi|89306012
: gi|21704279,gi|21704277,gi|14192943,gi|14192941,gi|5453766,gi|10092639,gi|4503667
: gi|67476461,gi|67473447,gi|67476571,gi|67467895,gi|67476995,gi|67464660
: CG33196,CG6124,CG2086,CG15011 from FlyBase (http://flybase.org/)
: gi|25143262,gi|25150348,gi|25150352,gi|25150359
% Genome defined as number of HCMp's/predicted ORFs.
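The "% Genome" definition above amounts to a single ratio. The sketch below assumes the column is expressed as a percentage (ratio times 100); the function name `percent_genome` is hypothetical, and the ORF count in the example is a made-up round number, since predicted ORF totals are not listed here. Only the HCMp count of 61 comes from the abstract.

```python
def percent_genome(n_hcmp, n_predicted_orfs):
    """HCMp genes as a percentage of all predicted ORFs in a genome."""
    if n_predicted_orfs <= 0:
        raise ValueError("n_predicted_orfs must be positive")
    return 100.0 * n_hcmp / n_predicted_orfs

# 61 HCMp genes (from the abstract) against a hypothetical 10,000 predicted ORFs.
example = percent_genome(61, 10_000)
print(round(example, 2))
```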