| Literature DB >> 15527499 |
Francis M F Nunes1, Valeria Valente, Josane F Sousa, Marco A V Cunha, Daniel G Pinheiro, Rafaela M Maia, Daniela D Araujo, Maria C R Costa, Waleska K Martins, Alex F Carvalho, Nadia Monesi, Adriana M Nascimento, Pablo M V Peixoto, Maria F R Silva, Ricardo G P Ramos, Luis F L Reis, Emmanuel Dias-Neto, Sandro J Souza, Andrew J G Simpson, Marco A Zago, Ademilson E E Soares, Marcia M G Bitondi, Enilza M Espreafico, Foued S Espindola, Maria L Paco-Larson, Zila L P Simoes, Klaus Hartfelder, Wilson A Silva.
Abstract
BACKGROUND: The ongoing efforts to sequence the honey bee genome require additional initiatives to define its transcriptome. Towards this end, we employed the Open Reading frame ESTs (ORESTES) strategy to generate profiles for the life cycle of Apis mellifera workers.Entities:
Mesh:
Year: 2004 PMID: 15527499 PMCID: PMC533872 DOI: 10.1186/1471-2164-5-84
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Apis mellifera Open Reading frame ESTs.
| - Embryos | 1358 |
| - Larval stages | 720 |
| - Pupae | 1219 |
| - Adults | 1479 |
| - Stage mix | 245 |
| - | 1769a |
| - | 16*b |
| - Genes of other organisms (orthologs) | 1123b |
| - No matches in GenBank | 2129 |
| - Number of contigs | 488 |
| - Number of singlets | 893 |
| - Total number of clusters | 1381 |
aBLASTN against dbEST using an E-score of 10-30 as cutoff value; bBLASTX against the nr database used <10-15 as the cutoff. *This set is also represented by ESTs in the dbEST database. It is included here as additional information only and is not to be summed up with the other matches.
Figure 1Distribution of Best-BLASTX-matches for assembled Apis mellifera Open Reading frame ESTs. After assembly into contigs and singlets the sequences were submitted to a search against a non-redundant protein database (NCBI). Independent of its E-score, the best match in each BLASTX result was listed according to organism category.
Gene Ontology classification of Apis mellifera ORESTES contigs according to the Drosophila genes that they represent.
| extracellular matrix | 4 |
| extracellular space | 5 |
| intracellular | 99 |
| membrane | 29 |
| others | 8 |
| reproduction | 18 |
| cell motility | 7 |
| response to stress | 6 |
| cell communication | 25 |
| pattern specification | 10 |
| cell growth and/or maintenance | 55 |
| metabolism | 79 |
| response to external stimulus | 10 |
| morphogenesis | 24 |
| embryonic development | 9 |
| cell differentiation | 9 |
| others | 41 |
| nucleotide binding | 14 |
| nucleic acid binding | 40 |
| RNA polymerase II transcription factor activity | 7 |
| antimicrobial peptide activity | 3 |
| helicase activity | 4 |
| receptor signaling protein activity | 5 |
| structural constituent of cytoskeleton | 5 |
| microfilament motor activity | 5 |
| transcription factor activity | 6 |
| kinase activity | 14 |
| oxidoreductase activity | 22 |
| transferase activity | 23 |
| hydrolase activity | 38 |
| protein binding | 29 |
| metal ion binding | 9 |
| ion transporter activity | 8 |
| others | 70 |
GO levels were set at 3. In either of the GO categories, individual contigs may be listed in more than one category. This GO classification only includes Apis mellifera orthologs to Drosophila genes that are represented by a Flybase code.
Figure 2CAP3 assembly of Apis mellifera Open Reading frame ESTs (AmORESTES) with Apis mellifera ESTs previously deposited in dbEST (AmNCBI). A) Read depth distribution of pure AmNCBI or AmORESTES and of mixed contigs; B) EST size distribution of these contigs, C) Details of individual mixed contigs showing the extension and gap-closing characteristics. In all graphs, AmORESTES sequences are in blue, AmNCBI contigs are in red, and mixed contigs are in green.
Figure 3Similarity and representation pattern of assembled Apis mellifera ESTs (ORESTES + NCBI dbESTs) with predicted proteins of other organisms. In this comparison we included eukaryotes with completely sequenced genomes (Drosophila melanogaster, Anopheles gambiae, Caenorhabditis elegans and human), plus higher taxon groups, such as protozoans (primarily represented by Plasmodium falciparum and P. yoelii) and fungi (primarily represented by Saccharomyces cerevisiae, Schizosaccharomyces pombe and Neurospora crassa). These BLASTX comparisons were performed with an E-value cut-off level set at 10-6. Subsequently, the representation pattern of each of the Apis ESTs in each of the eukaryotic genomes was listed. Out of the total 3,408 Apis EST contigs, 1,629 could be classified as putative orthologs, and these were grouped according to the representation of these genes at the different taxonomic levels.
Annotation and Gene Ontology characteristics of 11 honey bee EST contigs sharing significant similarity with mammalian but not with other vertebrate or invertebrate sequences. In all cases, the best match was with human proteins. For these 11 out of 23 contigs we could retrieve functional information.
| 437 | 2e-67 | NA | without result | NM_019116: ubiquitin binding protein | ubiquitin-specific protease domain |
| 663 | 3e-10 | NA | without result | NM_182830: MAM domain | contactin 5; neural adhesion molecule |
| 1081 | 5e-07 | 9.7 | without result | NM_013041: RAB3A interacting protein (rabin3)-like1 | guanin nucleotide exchange factor domain |
| 1425 | 8e-27 | NA | without result | NM_182565: hypothetical protein MGC29814 | TBP-associated factor 4; TATA box binding protein |
| 1674 | 4e-82 | NA | without result | NM_172374: interleukin 4 induced 1 | none |
| 1953 | 7e-11 | NA | without result | NM_024707: gem (nuclear organelle) associated protein | spliceosomal snRNP biogenesis |
| 2807 | 8e-07 | 5e-04 | regulation of physiological process | NM_138457: forkhead box P4 | transcription factor activity |
| 2896 | 4e-05 | 0.002 | reproduction, metabolism | NM_004654: ubiquitin-specific protease 9 | ubiquitin thiolesterase activity |
| 3167 | 2e-27 | 7e-05 | cell communication | NM_033046: rhotekin | signal transduction |
| 3347 | 5e-26 | 7.5 | without result | NM_014006: PI-3-kinase-related kinase SMG-1 | involved in nonsense-mediated mRNA decay |
| 3374 | 1e-10 | 0.18 | cell communication, | NM_014035: sorting nexing 24 | intracellular signaling cascade |
aE-value of the contig alignment with human or known insect sequences, NA: did not show any alignment; bGene Ontology results on Biological Process, FatiGO level 3, cadditional information obtained from Entrez Gene – NCBI and GOA link (GOAnnotations@EBI – European Bioinformatics Institute).
Figure 4Percentage of honey bee and Drosophila ESTs representing putative species-specific genes (blue bars) in relation to ESTs that represent genes solely shared within the insect clade (pink bars), or that have higher similarity with eukaryotes other than the insect clade (yellow bars). In separate comparisons, the Apis mellifera contigs (ORESTES + NCBI dbESTs, n = 5,000), AmORESTES (n = 5,000), AmNCBI dbESTs (n = 5,000), and Drosophila melanogaster ORESTES contigs (n = 5,000) were analyzed against protein databases of an insect (Anopheles gambiae) and several non-insect species (C. elegans, protozoans, fungi and H. sapiens) with completely sequenced genomes. The cut-off E-value in these comparisons was set at 10-6.
Specific primers used to assess quality and absence of DNA contaminants of the RNA samples, and randomly selected primers used to generate cDNA profiles.
| actin F (Apis) | 5' AGCTATGAACTTCCAGATGGT 3' |
| actin R (Apis) | 5' CCACATCTGTTGGAAGGT 3' |
| 16S mitochondrial F (Apis) | 5' TTATTCACCTGTTTATCAAAACAT 3' |
| 16S mitochondrial R (Apis) | 5' 'TATAGATAGAAACCAAYCTG 3' |
| 16S mitochondrial F (Drosophila) | 5' CCGGTCTGAACTCAGATCACGT 3' |
| 16S mitochondrial R (Drosophila) | 5' CGCCTGTTTAACAAAAACAT 3' |
| p3_2 | 5' TTGGGGATCGTATGTAGTATG 3' |
| pA82_1 | 5' CACTTCAGGATCCCTTGTAAGC 3' |
| pA82_2 | 5' CCAACATTGAATTCTCTTTGAC 3' |
| pA82_4 | 5' CAATAACAATGAATTCCAGAATCTCG 3' |
| pPT7C4_B | 5' GCTTACAAGGGATCCTGAAGTGTTTCC 3' |
| pPT7C4_XS | 5' GCAGGTAAACTCTACTCGAGTTACG 3' |
| M-RON-AS | 5' CCAGGATGTTTGGGTGATGTA 3' |
| CREB-S | 5' TCATGCAACATCATCTGCTCC 3' |
| H-SPARC-S | 5' CTAACCCAAGACATGACATTC 3' |
| M-CD151-S | 5' AAAGCTCGGAGGCAGCGAACT 3' |
| H-CD151-AS | 5' CATGTGGCTGCAAGGCAAAGC 3' |
| M-SPARC-AS | 5' GCCCAATTGCAGTTGAGTGAT 3' |
| M-ETS1-AS | 5' GTCTTGATGATGGTGAGAGTC 3' |
| FUT-3-S | 5' TCATGTCCAACCCTAAGTCAC 3' |
| FUT-3-AS | 5' TCCAGCAGGCCTTGCAGAAAT 3' |
| M-CMET-S | 5' TATCTCAAACGATCGAGAGAC 3' |
| M-CMET-AS | 5' GCACATCTATTACCAGCTTTG 3' |
| H-CMET-S | 5' TTTCAAATGGCCACGGGAC 3' |
| H-CMET-AS | 5' GCACATTTATGACCATTCTCG 3' |
| H-Rhoc-AS | 5' AGAAACAACTCCAGGGGCCTG 3' |
| M-Rhoc-AS | 5' CTACCCAAAGCAGAAACCCCA 3' |
| H-Sparc-AS | 5' CCAAAACCATCCTTGACAACA 3' |
| H-RON-AS | 5' TGATGAGGTCCTTCACGGTG 3' |
| B237-2 | 5' CGGAATTCACCAGATTTGAACAGAAGAG 3' |
| B237-3 | 5' AACTGCAGTTAACCAGATTTGAACAGAAA 3' |
| GST_(PGEX)_NHE_I-S | 5' CCGCTAGCATGTCCCCTATACTAGGTTA 3' |
| HOXA_I-F | 5' CGCTCCCGCTGTTTACTCT 3' |
| P21-RasaI-F | 5' GACCGCTCCTCCAACTAACC 3' |
| P21-RasaI-R | 5' CCGGCCCACCTCTTCTACTA 3' |
| SRY8299.2 | 5' TCTCTTTATGGCAAGACTTACG 3' |
| SRY1532.1 | 5' TCCTTAGCAACCATTAATCTGG 3' |
| 92R7.2 | 5' GCCTATCTACTTCAGTGATTTCT 3' |
| TAFIEX.1R | 5' ATCCAAGGTTCTCCCAATA 3' |