| Literature DB >> 21194483 |
Josep F Abril1, Francesc Cebrià, Gustavo Rodríguez-Esteban, Thomas Horn, Susanna Fraguas, Beatriz Calvo, Kerstin Bartscherer, Emili Saló.
Abstract
BACKGROUND: Freshwater planarians are an attractive model for regeneration and stem cell research and have become a promising tool in the field of regenerative medicine. With the availability of a sequenced planarian genome, the recent application of modern genetic and high-throughput tools has resulted in revitalized interest in these animals, long known for their amazing regenerative capabilities, which enable them to regrow even a new head after decapitation. However, a detailed description of the planarian transcriptome is essential for future investigation into regenerative processes using planarians as a model system.Entities:
Mesh:
Substances:
Year: 2010 PMID: 21194483 PMCID: PMC3022928 DOI: 10.1186/1471-2164-11-731
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of sequence statistics for each assembly.
| SET | Contigs | Singletons | TOTAL SEQs | GC% | LENGTHs | |||
|---|---|---|---|---|---|---|---|---|
| 52,885 | 137,213 | 190, 098 | 35.130 | 20 | 354 | 6812 | 355.78 | |
| 52,501 | 137,077 | 189,578 | --- | --- | --- | --- | --- | |
| 52,321 | 137,353 | 189,674 | 35.127 | 20 | 354 | 6812 | 355.82 | |
| 53,867 | 138,766 | 192,633 | 35.108 | 20 | 358 | 7918 | 364.81 | |
Set names are related to the corresponding homology level cutoff value (90e stands for 90% similarity including the set of NCBI Unigene ESTs). Contigs are the result of at least two sequencing reads, and singletons of only one read. GC content is the average value for all sequences. Sequence lengths are shown as minimum, median, maximum and average values in nucleotides for each set.
Figure 1Overlap-analysis of Smed454 assemblies. Comparison of the 454-sequencing reads taken into account to build each Smed454 dataset. Venn-diagram numbers in plain format correspond to singleton reads, while numbers in bold correspond to sequencing reads that were assembled into Contigs. About 4,000 raw reads where split into two or more fragments, due to quality clipping. However, only distinct raw read identifiers, after removing the fragment suffix, were used to produce this figure. sgl: total number of singletons for that assembly; ctg: total number of contigs; uni: number of NCBI Unigene sequences not assembled into a contig (90e only); sqs: total number of sequences for a given dataset.
Figure 2GC content and length distributions of different assemblies. Violin plots show the distribution of frequencies of a given variable in different datasets using a density kernel estimator [71]. White marks on the violin plots indicate the median value for a given variable, and the red points indicate the mean. The thick line marks the 25/75% inter-quartile range. GC content (left panel) distribution is quite similar in all the datasets, with higher frequencies around 35%. Nucleotide length (right panel) highlights the major differences between un-assembled (NCBI ESTs and the 454 raw reads) and assembled (NCBI UniGene, 90, 98 and 90e) sets. The last four plots (light blue) show the length distribution for the component subsets of the 90e assembly.
Figure 3Distribution of different HSP types from 90e over genome sequences. The top table shows the total number of similarity hits, while the bottom table classifies the hits into different types of HSPs: A) 454 contigs not mapping to a genomic sequence; B) genomic contigs not mapping to a 454 contig; C and J) 454 contigs with an unmapped sequence on the left and right, respectively; D) missing sequence on 454 contigs corresponding to a putative gap in the assembly; E) contiguous HSPs on 454 contigs related to a genomic intron; F) co-linear unmapped sequences on both sequence sets; G) contiguous overlapping HSPs defining a larger similarity segment; H) unaligned genomic sequences between HSPs of two different 454 contigs, which can be interpreted as putative intergenic sequences; I) HSPs on 454 contigs supporting a pair of genomic contigs, which could then be merged into a larger genomic scaffold. All columns show HSP numbers--the '#HSPs' row--except for A and B, which correspond to number of sequences.
Figure 4Analysis of intronic features and splice sites on a set of 90e contigs. A) Distribution of the number of putative introns per 90e contig. B) Length distribution of putative introns. C) Pictograms summarizing the consensus donor and acceptor splice sites for the predicted introns. n corresponds to the number of intron sequences used to compute the nucleotide position weight matrices for the pictograms. Light grey shadowed regions correspond to the commonly used signal lengths for gene-finding, while dark grey ones define the nucleotide boundaries of the introns. Numbers below pictograms are the bit-scores that describe the information content per position.
Figure 5The content of the Smed454 web site. Screenshots of the pages that facilitate access to the three sequence assemblies (90, 98 and 90e), including the page displaying alignments of raw reads. A BLAST interface, adapted from NBCI's toolkit, is also available for querying the sequences from the datasets. The web site is available at http://planarian.bio.ub.es/datasets/454/
Figure 6Distribution of BLASTXhits of 90e sequences against NCBI NRprot. Sequences from the 90e dataset were compared against the NCBI NR protein database using BLASTX. The figure shows the distribution of the number of sequences binned by the number of HSPs they had. Y-axis in log scale.
Gene Ontology annotation for 90e set sequences.
| GO:0000166 | nucleotide binding | 54,823 | --- | --- | --- | --- | --- | GO:0043229 | intracellular organelle | 60,817 | --- |
| --- | unannotated | 9,709 | --- | --- | unannotated | 62,834 | --- | --- | unannotated | 11,131 | --- |
| GO:0016787 | hydrolase activity | 5,197 | 35.669 | GO:0043170 | macromolecule metabolic process | 5,793 | 35.610 | GO:0043234 | protein complex | 2,918 | 40.788 |
| GO:0016740 | transferase activity | 2,030 | 13.933 | GO:0022607 | cellular component assembly | 2,182 | 13.413 | GO:0044424 | intracellular part | 2,314 | 32.346 |
| GO:0043167 | ion binding | 1,323 | 9.080 | GO:0006810 | transport | 1,213 | 7.456 | GO:0031982 | vesicle | 819 | 11.448 |
| GO:0003735 | structural constituent of ribosome | 874 | 5.999 | GO:0006950 | response to stress | 1,070 | 6.577 | GO:0044425 | membrane part | 469 | 6.556 |
| GO:0005488 | binding | 761 | 5.223 | GO:0050789 | regulation of biological process | 1,012 | 6.221 | GO:0016020 | membrane | 210 | 2.935 |
| GO:0016491 | oxidoreductase activity | 703 | 4.825 | GO:0006807 | nitrogen compound metabolic process | 722 | 4.438 | GO:0005622 | intracellular | 111 | 1.552 |
| GO:0022857 | transmembrane transporter activity | 678 | 4.653 | GO:0048869 | cellular developmental process | 655 | 4.026 | GO:0044446 | intracellular organelle part | 91 | 1.272 |
| GO:0030235 | nitric-oxide synthase regulator activity | 597 | 4.097 | GO:0065009 | regulation of molecular function | 622 | 3.823 | GO:0005576 | extracellular region | 63 | 0.881 |
| GO:0043176 | amine binding | 580 | 3.981 | GO:0009056 | catabolic process | 507 | 3.117 | GO:0045211 | postsynaptic membrane | 21 | 0.294 |
| GO:0005515 | protein binding | 532 | 3.651 | GO:0044419 | interspecies interaction between organisms | 280 | 1.721 | GO:0044420 | extracellular matrix part | 19 | 0.266 |
| GO:0003676 | nucleic acid binding | 401 | 2.752 | GO:0055114 | oxidation reduction | 236 | 1.451 | GO:0043233 | organelle lumen | 18 | 0.252 |
| GO:0005215 | transporter activity | 387 | 2.656 | GO:0065008 | regulation of biological quality | 206 | 1.266 | GO:0031012 | extracellular matrix | 16 | 0.224 |
| GO:0016829 | lyase activity | 71 | 0.487 | GO:0048856 | anatomical structure development | 193 | 1.186 | GO:0042597 | periplasmic space | 15 | 0.210 |
| GO:0016853 | isomerase activity | 55 | 0.377 | GO:0051649 | establishment of localization in cell | 183 | 1.125 | GO:0000267 | cell fraction | 15 | 0.210 |
| GO:0048037 | cofactor binding | 52 | 0.357 | GO:0044237 | cellular metabolic process | 182 | 1.119 | GO:0044462 | external encapsulating structure part | 11 | 0.154 |
| GO:0016874 | ligase activity | 49 | 0.336 | GO:0023060 | signal transmission | 150 | 0.922 | GO:0031975 | envelope | 8 | 0.112 |
| GO:0004871 | signal transducer activity | 45 | 0.309 | GO:0048870 | cell motility | 141 | 0.867 | GO:0005615 | extracellular space | 7 | 0.098 |
| GO:0003824 | catalytic activity | 32 | 0.220 | GO:0008152 | metabolic process | 139 | 0.854 | GO:0009986 | cell surface | 6 | 0.084 |
| GO:0060589 | nucleoside-triphosphatase regulator activity | 32 | 0.220 | GO:0023033 | signaling pathway | 107 | 0.658 | GO:0043204 | perikaryon | 5 | 0.070 |
| GO:0042277 | peptide binding | 28 | 0.192 | GO:0044238 | primary metabolic process | 83 | 0.510 | GO:0030427 | site of polarized growth | 4 | 0.056 |
| GO:0022892 | substrate-specific transporter activity | 22 | 0.151 | GO:0042221 | response to chemical stimulus | 69 | 0.424 | GO:0042995 | cell projection | 3 | 0.042 |
| GO:0019208 | phosphatase regulator activity | 12 | 0.082 | GO:0006996 | organelle organization | 47 | 0.289 | GO:0030312 | external encapsulating structure | 2 | 0.028 |
| GO:0003712 | transcription cofactor activity | 12 | 0.082 | GO:0007017 | microtubule-based process | 42 | 0.258 | GO:0031594 | neuromuscular junction | 2 | 0.028 |
| GO:0019207 | kinase regulator activity | 11 | 0.075 | GO:0044281 | small molecule metabolic process | 39 | 0.240 | GO:0045177 | apical part of cell | 2 | 0.028 |
| GO:0008289 | lipid binding | 9 | 0.062 | GO:0051301 | cell division | 37 | 0.227 | GO:0019028 | viral capsid | 1 | 0.014 |
| GO:0005201 | extracellular matrix structural constituent | 8 | 0.055 | GO:0022613 | ribonucleoprotein complex biogenesis | 35 | 0.215 | GO:0031252 | cell leading edge | 1 | 0.014 |
| GO:0050840 | extracellular matrix binding | 6 | 0.041 | GO:0019637 | organophosphate metabolic process | 34 | 0.209 | GO:0044217 | other organism part | 1 | 0.014 |
| GO:0061134 | peptidase regulator activity | 6 | 0.041 | GO:0045184 | establishment of protein localization | 34 | 0.209 | GO:0044297 | cell body | 1 | 0.014 |
| GO:0030246 | carbohydrate binding | 6 | 0.041 | GO:0009628 | response to abiotic stimulus | 23 | 0.141 | GO:0044463 | cell projection part | 1 | 0.014 |
| GO:0016248 | channel inhibitor activity | 5 | 0.034 | GO:0019748 | secondary metabolic process | 21 | 0.129 | 7,154 | |||
| GO:0003702 | RNA polymerase II transcription factor activity | 5 | 0.034 | GO:0009058 | biosynthetic process | 21 | 0.129 | ||||
| GO:0005198 | structural molecule activity | 4 | 0.027 | GO:0007155 | cell adhesion | 18 | 0.111 | ||||
| GO:0016986 | transcription initiation factor activity | 4 | 0.027 | GO:0061024 | membrane organization | 16 | 0.098 | ||||
| GO:0042165 | neurotransmitter binding | 4 | 0.027 | GO:0007275 | multicellular organismal development | 16 | 0.098 | ||||
| GO:0003682 | chromatin binding | 3 | 0.021 | GO:0016192 | vesicle-mediated transport | 12 | 0.074 | ||||
| GO:0008430 | selenium binding | 3 | 0.021 | GO:0043062 | extracellular structure organization | 10 | 0.061 | ||||
| GO:0030234 | enzyme regulator activity | 2 | 0.014 | GO:0034330 | cell junction organization | 9 | 0.055 | ||||
| GO:0030528 | transcription regulator activity | 2 | 0.014 | GO:0048609 | reproductive process in a multicellular organism | 9 | 0.055 | ||||
| GO:0009055 | electron carrier activity | 2 | 0.014 | GO:0007154 | cell communication | 8 | 0.049 | ||||
| GO:0017056 | structural constituent of nuclear pore | 2 | 0.014 | GO:0003008 | system process | 7 | 0.043 | ||||
| GO:0017080 | sodium channel regulator activity | 2 | 0.014 | GO:0016049 | cell growth | 7 | 0.043 | ||||
| GO:0001871 | pattern binding | 2 | 0.014 | GO:0016458 | gene silencing | 6 | 0.037 | ||||
| GO:0019239 | deaminase activity | 2 | 0.014 | GO:0008219 | cell death | 5 | 0.031 | ||||
| GO:0043021 | ribonucleoprotein binding | 2 | 0.014 | GO:0033036 | macromolecule localization | 4 | 0.025 | ||||
| GO:0008538 | proteasome activator activity | 2 | 0.014 | GO:0048610 | reproductive cellular process | 4 | 0.025 | ||||
| GO:0030337 | DNA polymerase processivity factor activity | 1 | 0.007 | GO:0051236 | establishment of RNA localization | 4 | 0.025 | ||||
| GO:0031406 | carboxylic acid binding | 1 | 0.007 | GO:0071684 | organism emergence from protective structure | 4 | 0.025 | ||||
| GO:0042562 | hormone binding | 1 | 0.007 | GO:0006955 | immune response | 4 | 0.025 | ||||
| GO:0046906 | tetrapyrrole binding | 1 | 0.007 | GO:0007049 | cell cycle | 4 | 0.025 | ||||
| GO:0051540 | metal cluster binding | 1 | 0.007 | GO:0009405 | pathogenesis | 4 | 0.025 | ||||
| 14,570 | GO:0009607 | response to biotic stimulus | 4 | 0.025 | |||||||
| GO:0009791 | post-embryonic development | 4 | 0.025 | ||||||||
| GO:0048646 | anatomical structure formation involved in morphogenesis | 4 | 0.025 | ||||||||
| GO:0009790 | embryonic development | 3 | 0.018 | ||||||||
| GO:0015976 | carbon utilization | 2 | 0.012 | ||||||||
| GO:0032506 | cytokinetic process | 2 | 0.012 | ||||||||
| GO:0045103 | intermediate filament-based process | 2 | 0.012 | ||||||||
| GO:0070882 | cellular cell wall organization or biogenesis | 2 | 0.012 | ||||||||
| GO:0006413 | translational initiation | 2 | 0.012 | ||||||||
| GO:0009566 | fertilization | 2 | 0.012 | ||||||||
| GO:0001906 | cell killing | 1 | 0.006 | ||||||||
| GO:0043473 | pigmentation | 1 | 0.006 | ||||||||
| GO:0009987 | cellular process | 1 | 0.006 | ||||||||
| GO:0022402 | cell cycle process | 1 | 0.006 | ||||||||
| GO:0022411 | cellular component disassembly | 1 | 0.006 | ||||||||
| GO:0022415 | viral reproductive process | 1 | 0.006 | ||||||||
| GO:0023061 | signal release | 1 | 0.006 | ||||||||
| GO:0030030 | cell projection organization | 1 | 0.006 | ||||||||
| GO:0051606 | detection of stimulus | 1 | 0.006 | ||||||||
| GO:0000746 | conjugation | 1 | 0.006 | ||||||||
| GO:0007163 | establishment or maintenance of cell polarity | 1 | 0.006 | ||||||||
| GO:0009653 | anatomical structure morphogenesis | 1 | 0.006 | ||||||||
| 16,268 | |||||||||||
The most probable GO code in the three ontology categories--molecular function, biological process and cellular component-- for each sequence in the 90e data set was selected. For simplicity, only level one and two codes are shown here in order.
Figure 7Prediction of planarian transmembrane proteins and functional annotations. A) Venn-diagram showing the overlap between predictions of transmembrane proteins generated by the Phobius, TMHMM2.0 and SOSUI programs for a set of 56,362 protein sequences translated from planarian ESTs. Only proteins predicted to contain one or more transmembrane domains by at least two programs (colored orange, 8,597 proteins, of which 4,663 are non-redundant) were considered for further analysis. B) Top ten PFAM domains and C) gene ontologies (biological process) for the 4,663 non-redundant transmembrane-proteins predicted. The figures indicate the number of proteins contained in a given annotation group.
List of neurotransmitter, peptide and hormone receptor sequence candidates.
| ID | BLASTX HIT | ACCESSION NUMBER | E-VALUE |
|---|---|---|---|
| 90_1623 | adiponectin receptor (Schistosoma mansoni) | XP_002577010.1 | 2,00E-103 |
| 90_11706 | allatostatin receptor, putative (Ixodes scapularis) | XP_002414997.1 | 2,00E-18 |
| 90_9653 | amine GPCR (Schistosoma mansoni) | XP_002576533.1 | 8,00E-25 |
| P02IKPED | atrial natriuretic peptide receptor (Aedes aegypti) | XP_001652228.1 | 7,00E-28 |
| P02HWID8 | beta adrenergic receptor (Aedes aegypti) | XP_001651714.1 | 2,00E-18 |
| 90_17484 | similar to bombesin-like peptide receptor (Ornithorhynchus anatinus) | XP_001514235.1 | 2,00E-12 |
| 90_19322 | C1A receptor, putative (Ixodes scapularis) | XP_002405845.1 | 4,00E-04 |
| 90_4815 | calcitonin receptor, isoform CRA_d (Rattus norvegicus) | AAA65964.1 | 3,00E-50 |
| 90_20672 | cardioexcitatory receptor (Lymnaea stagnalis) | AAB92258.1 | 6,00E-11 |
| 90_6224 | class b secretin-like g-protein coupled receptor GPRmth5 (Pediculus humanus) | XP_002427184.1 | 2,00E-10 |
| P02IZJB4 | similar to putative diuretic hormone receptor II (Nasonia vitripennis) | XP_001606711.1 | 4,00E-22 |
| 90_7506 | type I dopamine receptor (Panulirus interruptus) | ABB87183.1 | 8,00E-69 |
| 90_6802 | dopamine receptor type D2 (Apis mellifera) | NP_001011567.1 | 2,00E-27 |
| 90_8536 | dro/myosuppressin receptor (Schistosoma mansoni) | XP_002570000.1 | 7,00E-26 |
| 90_9052 | FMRFamide receptor (Culex quinquefasciatus) | XP_001849293.1 | 2,00E-17 |
| P02GUXTP | similar to galanin receptor type I (Danio rerio) | XP_690480.1 | 3,00E-06 |
| 90_6830 | glutamate receptor kainate (Schistosoma mansoni) | XP_002576035.1 | 3,00E-70 |
| P02GLFYW | glutamate receptor NMDA (Schistosoma mansoni) | XP_002572261.1 | 3,00E-21 |
| 90_15092 | glutamate receptor, ionotropic, AMPA 1b (Danio rerio) | NP_991293.1 | 5,00E-78 |
| 90_13524 | metabotropic glutamate receptor (Schistosoma mansoni) | XP_002572726.1 | 1,00E-12 |
| 90_18656 | gonadotropin-releasing hormone receptor type I (Capra hircus) | ABL76162.1 | 7,00E-04 |
| 90_4098 | growth hormone secretagogue receptor (Schistosoma mansoni) | XP_002569813.1 | 7,00E-36 |
| 90_976 | growth hormone-inducible transmembrane protein (Osmerus mordax) | AC008873.1 | 2,00E-51 |
| 90_6465 | putative insulin receptor (Echinococcus multilocularis) | CAD30260.1 | 6,00E-61 |
| 90_7253 | lung seven transmembrane receptor (Culex quinquefasciatus) | XP_001868443.1 | 1,00E-68 |
| 90_17047 | metabotropic GABA-B receptor subtype, putative (Ixodes scapularis) | XP_002406087.1 | 4,00E-21 |
| 90_6512 | natriuretic peptide receptor (Xenopus laevis) | NP_001083703.1 | 4,00E-158 |
| 90_12800 | muscarinic acetylcholine (GAR) receptor (Schistosoma mansoni) | XP_002575679.1 | 2,00E-39 |
| 90_1507 | Nicotinic acetylcholine receptor alpha 1 subunit (Aplysia californica) | AF467898_1 | 3,00E-44 |
| 90_223 | neuroendocrine protein 7b2 (Schistosoma mansoni) | XP_002578500.1 | 6,00E-25 |
| 90_6302 | similar to neuromedin U receptor 2 (Strongylocentrotus purpuratus) | XP_001200425.1 | 4,00E-27 |
| 90_29452 | neuropeptide FF receptor 2 isoform 3 (Homo sapiens) | NP_001138228.1 | 2,00E-09 |
| 90_6772 | neuropeptide F-like receptor (Schistosoma mansoni) | XP_002573542.1 | 1,00E-28 |
| 90_5995 | neuropeptide Y receptor Y7 (Oncorhynchus mykiss) | ABB54774.1 | 9,00E-18 |
| 90_25975 | octopamine receptor (Aplysia californica) | AAF37686.1 | 1,00E-25 |
| 90_8498 | odorant receptor (Tetraodon nigroviridis) | CAG08888.1 | 6,00E-05 |
| 90_5999 | similar to olfactory receptor 355 (Bos taurus) | XP_610381.4 | 1,00E-05 |
| 90_2541 | P2Y purinergic receptor (Meleagris gallopavo) | AAA18784.1 | 2,00E-04 |
| 90_8537 | P2X receptor subunit (Schistosoma mansoni) | XP_002580774.1 | 1,00E-72 |
| 90_19040 | pituitary adenylate cyclase activating polypeptide receptor (Oncorhynchus mykiss) | NP_001118113.1 | 1,00E-08 |
| 90_28219 | parathyroid hormone 2 receptor (Danio rerio) | AAI62580.1 | 3,00E-11 |
| 90_6836 | peptide (allatostatin)-like receptor (Schistosoma mansoni) | XP_002572656.1 | 2,00E-66 |
| 90_7984 | peptide (allatostatin/somatostatin)-like receptor (Schistosoma mansoni) | XP_002575539.1 | 2,00E-32 |
| 90_10769 | progesterone receptor membrane component 1 (Danio rerio) | NP_001007393.1 | 7,00E-04 |
| 90_5450 | progestin receptor membrane component 1 (Oryzias latines) | BAE47967.1 | 2,00E-28 |
| P02GZGVI | prolactin releasing hormone receptor (Homo sapiens) | BAG36078.1 | 2,00E-06 |
| P02I1U9K | pyrokinin-like receptor (Dermacentor variabilis) | ACC99623.1 | 2,00E-11 |
| 90_10680 | Rhodopsin-like GPCR superfamily, domain-containing protein (Schistosoma japonicum) | CAX73015.1 | 6,00E-37 |
| 90_2955 | rhodopsin-like orphan GPCR (Schistosoma mansoni) | XP_002579928.1 | 2,00E-42 |
| 90_27829 | ryanodine receptor 44F (Schistosoma japonicum) | CAX69439.1 | 8,00E-16 |
| 90_14326 | serotonin receptor-like planarian receptor 1 (Dugesia japonica) | BAA22404.1 | 3,00E-54 |
| 90_15981 | serotonin receptor 7 (Dugesia japonica) | BAI44327.1 | 2,00E-14 |
| 90_11349 | sex peptide receptor (Tribolium castaneum) | NP_001106940.1 | 5,00E-25 |
| P02HBR62 | SIFamide receptor (Apis mellifera) | NP_001106756.1 | 9,00E-10 |
| 90_19415 | parathyroid hormone-related peptide receptor precursor (Tribolium castaneum) | XP_969953.1 | 7,00E-20 |
| P02FKOY5 | parathyroid hormone receptor 2, isoform CRA_c (Mus musculus) | EDL00229.1 | 3,00E-07 |
| 90_1140 | somatostatin receptor (Culex quinquefasciatus) | XP_001859671.1 | 7,00E-43 |
| P02JZNDR | tachykinin receptor 1 (Mus musculus) | NP_033339.2 | 2,00E-06 |
| P02FL51R | thyroid hormone receptor (Schistosoma mansoni) | XP_002573733.1 | 2,00E-23 |
| P02FHDMB | thyroid stimulating hormone receptor precursor (Canis lupus familiares) | NP_001003285.1 | 5,00E-04 |
| 90_3545 | thyrotropin-releasing hormone receptor 1 (Catostomus commersonii) | AAG31763.1 | 2,00E-51 |
| 90_26294 | tyramine receptor (Bombyx mori) | BAD11157.1 | 1,00E-11 |
Complete list of homeobox-containing gene sequence candidates.
| ID | BLASTX HIT | ACCESSION NUMBER | E-VALUE |
|---|---|---|---|
| F6AJIXP02J3PG4 | arrowhead [Schistosoma mansoni] | XP_002575389 | 6,00E-09 |
| 90_9219 | barh homeobox protein [Schistosoma mansoni] | XP_002571667 | 9,00E-26 |
| F6AJIXP02J2YIH | brain-specific homeobox [Tribolium castaneum] | EFA05724 | 5,00E-05 |
| 90_23337 | cut, isoform C [Drosophila melanogaster] | NP_001138174 | 3,00E-18 |
| F6AJIXP02HF7ZO | cut, isoform C [Drosophila melanogaster] | NP_001138174 | 2,00E-10 |
| 90_8368 | Cut-like homeobox 1 [Mus musculus] | AAH14289 | 2,00E-23 |
| 90_3019 | distalless, Dlx-1 [Platynereis dumerilii] | CAJ38799 | 8,00E-07 |
| 90_14605 | DjotxB [Dugesia japonica] | BAF80446 | 4,00E-65 |
| F6AJIXP02FICZL | Eye absent protein [Dugesia japonica] | CAD89530 | 1,00E-74 |
| F6AJIXP02IV6Y0 | gsx family homeobox protein [Schistosoma mansoni] | XP_002574396 | 3,00E-12 |
| 90_24312 | H6-like-homeobox [Drosophila melanogaster] | NP_732244 | 2,00E-15 |
| 90_8293 | homeobox protein distal-less dlx [Schistosoma mansoni] | XP_002574393 | 4,00E-07 |
| F6AJIXP02JJ1QK | Homeobox protein DTH-2 [Girardia tigrina] | Q00401 | 3,00E-40 |
| 90_8753 | homeobox prox 1 [Danio rerio] | NP_956564 | 5,00E-19 |
| 90_12057 | homeodomain protein Tlx [Capitella teleta] | ACH89436 | 1,00E-23 |
| 90_8083 | Hox class homeodomain protein AbdBa Hox protein [Schmidtea mediterranea]. | ABW79872 | 1,00E-26 |
| 90_7618 | Hox class homeodomain protein DjAbd-Ba [Dugesia japonica] | BAB41079 | 2,00E-16 |
| 90_6369 | Hox class homeodomain protein DjAbd-Bb [Dugesia japonica] | BAB41078 | 1,00E-108 |
| F6AJIXP02ILMDY | Hox class homeodomain protein DjAbd-Bb [Dugesia japonica] | BAB41078 | 3,00E-33 |
| F6AJIXP02HN15J_2 | Hypothetical protein CBG18604 [Caenorhabditis briggsae] | XP_002638395 | 7,00E-05 |
| 90_28860 | ladybird homeobox corepressor 1-like protein [Mus musculus | NP_001103213 XP_001479028 | 8,00E-33 |
| 90_6629 | lim domain binding protein [Schistosoma mansoni] | XP_002576324 | 6,00E-05 |
| F6AJIXP02GEYYP | lim domain homeobox 3/4 transcription factor [Saccoglossus kowalevskii) | NP_001158395 | 4,00E-23 |
| 90_10783 | lim homeobox protein [Schistosoma mansoni] | XP_002579046 | 2,00E-13 |
| 90_11027 | lim homeobox protein [Schistosoma mansoni] | XP_002579046 | 1,00E-26 |
| 90_10828 | LIM homeobox transcription factor 1 alpha [Mus musculus] | EDL39177 | 2,00E-14 |
| 90_13775 | LIM motif-containing protein kinase 1 [Schistosoma japonicum] | CAX72746 | 2,00E-11 |
| 90_9432 | LIM-homeodomain protein AmphiLim1/5 [Branchiostoma floridae] | ABD59002 | 5,00E-05 |
| 90_8762 | LIM-homeodomain transcription factor islet [Branchiostoma floridae | AAF34717 | 2,00E-15 |
| 90_6339 | Nk1 protein [Platynereis dumerilii] | CAJ38797 | 1,00E-11 |
| F6AJIXP02G077U | paired-like homeobox 2a [Danio rerio] | NP_996953 | 5,00E-16 |
| 90_6703 | phtf [Drosophila melanogaster] | NP_610232 | 2,00E-55 |
| 90_25126 | PLOX2-Dj [Dugesia japonica] | BAA77402 | 2,00E-42 |
| 90_21567 | PLOX4-Dj [Dugesia japonica] | BAA77404 | 2,00E-21 |
| 90_23010 | PLOX5-Dj [Dugesia japonica] | BAA77405 | 6,00E-22 |
| F6AJIXP02IVOTI | PLOX5-Dj [Dugesia japonica] | BAA77405 | 1,00E-17 |
| 90_21710 | pre-B-cell leukemia transcription factor 1 2 3 4 (pbx) [Schistosoma mansoni) | XP_002572195 | 2,00E-27 |
| 90_3405 | PREDICTED: similar to UBX domain protein 4 [Hydra magnipapillata] | XP_002162754 | 2,00E-06 |
| F6AJIXP02HI24E | PREP homeodomain-like protein [Schmidtea mediterranea] | ADB54565 | 2,00E-47 |
| F6AJIXP02JSRJD | PREP homeodomain-like protein [Schmidtea mediterranea] | ADB54565 | 1,00E-32 |
| F6AJIXP02GVFDM | prospero-like protein [Schistosoma mansoni] | XP_002578694 | 1,00E-21 |
| F6AJIXP02IUJ5Q | prospero-like protein [Schistosoma mansoni] | XP_002578694 | 4,00E-25 |
| F6AJIXP02HZIDG | short stature homeobox protein 2 isoform c [Homo sapiens | NP_001157150 | 5,00E-08 |
| 90_7545 | SIX homeobox 2 [Gallus gallus] | NP_001038160 | 7,00E-36 |
| F6AJIXP02HBGHT | SJCHGC06100 protein [Schistosoma japonicum] | AAW24487 | 6,00E-11 |
| 90_3395 | UBX domain containing 8, isoform CRA_d [Mus musculus] | EDL41153 | 3,00E-14 |
| 90_1176 | UBX domain-containing protein 4 [Mus musculus] | NP_080666 | 1,00E-06 |
| 90_2625 | ubx6(yeast)-related [Schistosoma mansoni] | XP_002576054 | 2,00E-16 |
| 90_24438 | visual system homeobox protein [Tribolium castaneum] | CAX64460 | 9,00E-23 |
| F6AJIXP02G5JJX_1 | Zn finger homeodomain 2 [Tribolium castaneum] | EFA01350 | 1,00E-05 |
List of eye-related gene sequence candidates.
| ID | BLASTX HIT | E-VALUE | |
|---|---|---|---|
| 90_7233 | abl interactor 2 [Schistosoma japonicum] | CAX69750.1 | 6.00E-019 |
| 90_4001 | adaptor-related protein complex [Schistosoma mansoni] | XP_002574891.1 | 3.00E-072 |
| 90_30923 | arginine/serine-rich splicing factor [Schistosoma mansoni] | XP_002574990.1 | 2.00E-026 |
| 90_482 | ATPase protein [Schistosoma japonicum] | AAW26203.1 | 3.00E-049 |
| 90_3152 | beta-catenin-like protein 2 [Schmidtea mediterranea] | ABW79874.1 | 0 |
| 90_12909 | BMP [Schmidtea mediterranea] | ABV04322.1 | 3.00E-090 |
| 90_120 | cat eye syndrome protein [Schistosoma japonicum] | AAX27345.2 | 4.00E-035 |
| P02FKNEB | CaTaLase family member (ctl-2) [Caenorhabditis elegans] | NP_001022473.1 | 1.00E-029 |
| 90_205 | Chaperonin Containing TCP-1 family member (cct-3) [Caenorhabditis elegans] | NP_494218.2 | 1.00E-090 |
| C90_6158 | disks large homolog 1 isoform 1 [Homo sapiens] | NP_001091894.1 | 2.00E-027 |
| P02GJNCV | extradenticle 1 protein [Schistosoma japonicum] | AAW24487.1 | 3.00E-013 |
| P02JKJ4Z_2 | eye53 [Dugesia japonica] | BAD20650.1 | 6.00E-016 |
| 90_8483 | eyes absent protein [Dugesia japonica] | CAD89531.1 | 2.00E-064 |
| 90_651f | ascin protein [Schistosoma japonicum] | XP_002574990.1 | 5.00E-045 |
| 90_14368 | heat shock protein 70 [Lumbricus terrestris] | ACB77918.1 | 4.00E-038 |
| 90_9533 | Heparan sulfate 6-O-sulfotransferase 2 [Danio rerio] | AAH45453.1 | 1.00E-042 |
| 90_6564 | histone-lysine n-methyltransferase suv9 [Schistosoma mansoni] | XP_002574171.1 | 3.00E-061 |
| 90_15456 | homeodomain protein NK4 [Platynereis dumerilii] | ABQ10640.1 | 8.00E-023 |
| 90_12892 | homeotic protein six3-alpha [Mus musculus] | S74256 | 1.00E-082 |
| 90_325 | importin-7 [Culex quinquefasciatus] | XP_001843364.1 | 2.00E-147 |
| 90_4360 | intraflagellar transport 57 homolog [Xenopus (Silurana) tropicalis] | NP_001016561.1 | 1.00E-044 |
| 90_11027 | lim homeobox protein [Schistosoma mansoni] | XP_002579046.1 | 5.00E-027 |
| 90_8432 | lozenge [Schistosoma mansoni] | XP_002580418.1 | 8.00E-032 |
| 90_8924 | Male ABnormal family member (mab-21) [Caenorhabditis elegans] | NP_497940.2 | 1.00E-046 |
| P02HSHWR | mothers against decapentaplegic homolog 4 [Mus musculus] | NP_032566.2 | 5.00E-018 |
| 90_5640 | muscleblind-like protein [Schistosoma mansoni] | XP_002575346.1 | 3.00E-025 |
| P02F0EF6 | neurogenic differentiation [Platynereis dumerilii] | CAQ57533.1 | 2.00E-012 |
| P02GMLJM | nuclear transcription factor X-box binding 1 (nfx1) [Schistosoma bovis] | XP_002577564.1 | 5.00E-014 |
| 90_828 | phenylalanine hydroxylase [Caenorhabditis elegans] | AAD31643.1 | 2.00E-145 |
| 90_2925 | protein [Schistosoma japonicum] | AAW24487.1 | 4.00E-126 |
| 90_7228 | protein kinase [Schistosoma mansoni] | XP_002576342.1 | 2.00E-077 |
| 90_2256 | Protein pob [Schistosoma japonicum] | CAX75988.1 | 4.00E-089 |
| 90_4436 | Rab-protein 6 [Drosophila melanogaster] | NP_477172.1 | 8.00E-085 |
| P02GENUT_1 | retinaldehyde dehydrogenase 1 [Eleutherodactylus coqui] | ACE74542.1 | 8.00E-008 |
| 90_11988 | runt protein [Branchiostoma lanceolatum] | AAN08565.1 | 4.00E-017 |
| P02FN7BT | Septin-7 (CDC10 protein homolog) [Schistosoma japonicum] | CAX83064.1 | 3.00E-012 |
| 90_3747 | serine/threonine protein kinase [Schistosoma mansoni] | XP_002580180.1 | 9.00E-094 |
| P02FICZL | six1-2 protein [Dugesia japonica] | CAD89530.1 | 8.00E-86 |
| P02IZDJZ_1 | SRY-related HMG box B protein [Platynereis dumerilii] | CAY12631.1 | 3.00E-028 |
| P02HE4J6 | strabismus protein CBR-VANG-1 [Platynereis dumerilii] | CAJ26300.1 | 1.00E-006 |
| 90_9483 | tetratricopeptide repeat protein 10 tpr10 [Schistosoma mansoni] | XP_002573898.1 | 4.00E-048 |
| 90_16088 | tyrosine kinase [Schistosoma mansoni] | XP_002576978.1 | 2.00E-031 |
| 90_11388 | ubiquitin conjugating enzyme E2 [Schistosoma mansoni] | XP_002578016.1 | 3.00E-053 |
| 90_1263 | vacuolar ATP synthase proteolipid subunit 1 2 3 [Schistosoma japonicum] | XP_002571892.1 | 9.00E-049 |
| 90_12567 | vermilion [Drosophila ananassae] | XP_001963597.1 | 2.00E-012 |
| 90_5500 | white pigment protein [Drosophila melanogaster] | CAA26716.2 | 2.00E-020 |
| 90_13309 | YY1 transcription factor [Schistosoma japonicum] | CAX73893.1 | 5.00E-049 |
| 90_10118 | zinc finger protein 42 homolog [Homo sapiens] | NP_777560.2 | 6.00E-031 |
| 90_9460 | 14-3-3 zeta isoform [Schistosoma bovis] | AAT39382.1 | 2.00E-023 |
| P02ILIK3 | 52-kD bracketing protein [Drosophila melanogaster] | CAA44483.1 | 1.00E-016 |