Literature DB >> 11276425

Survey of transcripts in the adult Drosophila brain.

K L Posey1, L B Jones, R Cerda, M Bajaj, T Huynh, P E Hardin, S H Hardin.   

Abstract

BACKGROUND: Classic methods of identifying genes involved in neural function include the laborious process of behavioral screening of mutagenized flies and then rescreening candidate lines for pleiotropic effects due to developmental defects. To accelerate the molecular analysis of brain function in Drosophila we constructed a cDNA library exclusively from adult brains. Our goal was to begin to develop a catalog of transcripts expressed in the brain. These transcripts are expected to contain a higher proportion of clones that are involved in neuronal function.
RESULTS: The library contains approximately 6.75 million independent clones. From our initial characterization of 271 randomly chosen clones, we expect that approximately 11% of the clones in this library will identify transcribed sequences not found in expressed sequence tag databases. Furthermore, 15% of these 271 clones are not among the 13,601 predicted Drosophila genes.
CONCLUSIONS: Our analysis of this unique Drosophila brain library suggests that the number of genes may be underestimated in this organism. This work complements the Drosophila genome project by providing information that facilitates more complete annotation of the genomic sequence. This library should be a useful resource that will help in determining how basic brain functions operate at the molecular level.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11276425      PMCID: PMC30707          DOI: 10.1186/gb-2001-2-3-research0008

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


Background

Drosophila melanogaster is an important model organism. After more than 50 years of study, the anatomy of the brain is well described and many brain functions have been mapped to particular substructures [1,2,3,4,5,6,7,8]. The adult brain is composed of approximately 200,000 neurons which are organized into discrete substructures. The optic lobe (composed of the lamina, medulla, lobula and lobula plate) is primarily involved in the processing of visual information from the photoreceptors and sending that information to the central brain [2,5,9]. The antennal lobes are chiefly responsible for the processing of olfactory information [10]. The mushroom bodies are involved in olfactory learning and memory and other complex behaviors [11,12,13,14,15]. A group of approximately six neurons in the lateral protocerebrum are sufficient to drive circadian rhythms in locomoter activity [16,17]. The central complex, although poorly understood, appears to be involved in motor coordination [18,19,20]. Despite our increasing knowledge of Drosophila brain anatomy and function, relatively little information is available concerning the molecules expressed in the brain that coordinate function and manifest behavior. Classic methods of identifying genes involved in neural function include behavioral screening of mutagenized flies, then rescreening candidate lines for pleiotropic effects due to developmental defects. This process is both laborious and time consuming. To augment this genetic approach, sequencing of random cDNAs is proving effective in identifying genes expressed in a specific cell type [21]. Much information has been collected through the analysis of expressed sequence tags (ESTs) [22,23,24,25]. Using this approach, sequence information is gathered from one or both ends of a cDNA and cataloged to determine the complexity of an mRNA population. Here, we use a modified EST approach and completely sequence novel cDNAs. Others have used a similar approach by shotgun sequencing concatenated cDNA inserts [26,27]. One goal of our work was to begin to develop a catalog of transcripts expressed in the brain. These transcripts, because of the location of their expression, are expected to contain a higher proportion of clones that are involved in neuronal function. Many Drosophila head libraries have been used to isolate cDNAs that correspond to genes identified by genetic screens for their involvement in brain function. Several transcripts identified in this manner are expressed at a relatively low level (dunce [28], CREB [29], dco [30], period [31], timeless [32], dissonance [33]). The Drosophila brain makes up only a small part of head tissue (approximately 14% dry weight). By eliminating non-brain tissues, we increase the relative representation of rare neural transcripts in this unique library. We began a catalog of the genes expressed in the brain of adult Drosophila in support of more conventional methods of understanding brain function. Cataloging sequence information and publishing the data through electronic databases has enriched molecular science in general. In a matter of a few minutes, one can use information from a single sequencing reaction to identify a gene that was sequenced by another laboratory, and one maybe able to deduce the function of the isolated clone. This set of tools facilitates molecular work in virtually every branch of biological sciences. This report details construction, quality analysis and initial characterization of a unique library created from adult Drosophila brains. Surprisingly, we discovered that 11% (29 clones) of the Drosophila brain cDNA clones that were randomly chosen for analysis are not matched with any EST sequence generated in support of the Drosophila genome project (as of 10 October 2000). Further, the genes encoding 59% of these novel ESTs are not predicted by algorithms used for fly genome annotation. From our analysis of ESTs that do not correspond to one of the 13,601 annotated genes, we predict that the number of genes in the Drosophila genome may be underestimated by 10-15% (approximately 1,300 to 2,000 genes).

Results and discussion

Library quality assessment

Desiccated brain tissue from adult Drosophila melanogaster was used to construct a library using the Stratagene Hybrid-Zap system. This library was designed for protein expression and, therefore, was constructed such that full-length cDNAs containing 5' untranslated regions are not likely to be present. The number of clones in the library and the size of the clones were used to assess the quality of the library. The number of clones in the primary library was determined by titering one of the five packaging reactions. The total number of clones in the primary library is 6.75 × 106 (that is, all five packaging reactions). From the analysis of the fully sequenced clones (141 novel and matched isolog clones reported in this study), the majority of the inserts (53%) were between 400 and 800 base pairs (511 base pairs ± 197 base pairs average deviation). Characterized clones from the library range between 139 to 1,746 base pairs (bp), including only 15 As of the poly(A) tail. The insert size for this library is as expected using the Stratagene Hybrid-Zap kit, given that the size-selection column retains DNA molecules larger than 200 bp) (Stratagene technical support, personal communication). Of the 283 clones that were either completely or end-tagged sequenced, approximately 4% (12 clones) did not contain an insert (Figure 1).
Figure 1

Scheme for classifying the Drosophila brain library clones.

Scheme for classifying the Drosophila brain library clones.

Clone selection

To try to maximize the discovery of novel transcripts, we investigated whether there was a correlation between transcript abundance and the presence of the sequence in a public database. Specifically, a reverse northern blot experiment using radiolabeled head cDNA was performed to determine whether hybridization level could be used to identify frequently occurring transcripts. We reasoned that the abundance of these transcripts may increase their representation in data banks when compared to less abundant transcripts. The data from this experiment are shown in Table 1.
Table 1

Hybridization data from 85 randomly chosen clones

Hybridization levelOriginally novelSignal
 AbsentAF171761242.2
AF171771282.1
AF171773212.4
AF171774264.8
AF171776262.6
AF171777293.6
AF171778296.6
AF171781299.4
AF171762194.0
AF179229211.9
 LightAF171764428.0
AF171765461.4
AF171769483.8
AF171772305.0
AF171779460.6
AF171782335.6
AF171785360.3
AF179230444.6
 MediumAF171763508.5
AF171766617.2
AF171767615.6
AF171768541.3
AF171770509.0
AF171786681.5
AF171787924.1
AF171789984.0
AF171790978.2
AF171791862.7
AF171792583.2
AF171793731.2
AF171794695.5
 DarkAF1717621237.0
AF1717751066.0

Hybridization levelKnownSignal

 AbsentNone
 LightAF171706 Drosophila GS2 for glutamine synthase315.3
AF171707 Drosophila ubiquitin protein gene388.6
AF171709 Drosophila cytochrome c oxidase365.3
AF171711 Drosophila calmodulin gene340.7
AF171715 Drosophila CCATT box-transmembrane domain392.1
 MediumAF083504 Drosophila Pls dso3465 (d149) dso 8544 (D187)645.3
AF171701 Drosophila frequenin gene503.4
AF171702 Drosophila nicotinic acetylcholine receptor526.0
AF171704 Drosophila ADP/ATP Translocase537.1
AF171705 Drosophila tyrosine kinase gene544.5
AF171703 Drosophila ferritin subunit 1 (Fer1) mRNA552.0
AF171708 Drosophila mRNA for rab-related protein 4543.2
AF171710 Drosophila cytochrome c oxidase subunit538.0
AF083505 Drosophila 2-g8 from P1 DS02782 (D71)870.4
AF171712 Drosophila gene encoding S-adenosylmethionine decarboxylase918.0
AF171713 Drosophila TRIP-1 homolog (Dm TRIP) mRNA576.6
AF171714 Drosophila twinstar (tsr) gene664.1
AF171716 Drosophila burdock retrotransposon gag protein631.6
AF171717 Drosophila GS1 mRNA for glutamine synthase557.8
AF171718 Drosophila geranylgeranyl transferase694.9
 DarkNone

Hybridization levelRibosomal/mitochondrialSignal

 AbsentAF083279 Ribosomal protein rat 60s L35A235.1
AF083516 Drosophila ribosomal protein S17 gene299.2
AF083518 Drosophila ribosomal protein L31218.4
 LightAF083272 Yeast ribosomal protein L46489.6
Clone 17 Mitochondrial 16S ribosomal mRNA480.4
Clone 26 Mitochondrial 16S ribosomal mRNA343.2
AF083276 M. musculus ribosomal protein L21 mRNA361.4
AF083277 M. musculus ribosomal protein L21 mRNA388.1
AF083278 Ribosomal protein human 60s L24326.8
AF083515 Drosophila ribosomal protein 15a (40s subunit)376.5
Clone 61 Mitochondrial 16S ribosomal mRNA354.1
Clone 72 Mitochondrial 16S ribosomal mRNA470.5
AF083520 Drosophila ribosomal protein L18a352.6
AF083521 Drosophila ribosomal protein S14 A and B genes396.5
 MediumAF083513 Drosophila mRNA ribosomal protein508.8
AF083514 Drosophila ribosomal protein L19 gene579.8
AF083281 Ribosomal protein R. norvegicus S23798.1
AF083519 Drosophila 60S ribosomal protein L43 mRNA837.9
Clone 80 Mitochondrial 16S ribosomal mRNA601.1
Clone 90 Mitochondrial 16S ribosomal mRNA798.6
AF083522 Drosophila 5.8S and 2S ribosomal rRNA819.8
Clone 94 Mitochondrial 16S ribosomal mRNA820.0
 DarkAF083275 Drosophila ribosomal protein S18 mRNA1166.0
AF083517 Drosophila ribosomal protein L22 mRNA2423.0

Hybridization levelIsologsKnown

 AbsentAF083295 C. pothophila cytochrome oxidase I & II224.9
AF083300 Human clathrin coat-associated protein 50233.7
 LightAF083301 R. norvegicus trg mRNA carrier protein precursor474.5
 MediumAF083296 Human protein synthesis factor (eIF-1A)512.2
AF083298 Silkworm mRNA for DNA SC factor500.1
AF083297 Bovine ATP synthase G chain, mitochondrial, H+ transporting555.9
AF083299 H. sapiens Arp 2/3 complex 20 kD subunit, actin related protein570.4
AF083302 H. sapiens mRNA for testican519.7
DarkNone

The photo stimulating units (psl) within a circle of standard area was used to determine the relative hybridization level to radiolabeled head cDNA for each clone. Hybridization categories are as follows. Absent, 0-300 psl (corresponding to background); light, 301-499 psl; medium, 500-899 psl; dark, 900-2500 psl. Clones are categorized as follows. Novel, sequence information for clones that were novel at the initial phase of this project; known, Drosophila sequence information previously submitted to sequence databanks; ribosomal protein/ribosomal RNA/mitochondrial, sequences corresponding to ribosomal proteins, ribosomal RNAs or mitochondrial transcripts; isologs, transcripts that have a high degree of similarity to sequences reported in the databanks.

Hybridization data from 85 randomly chosen clones The photo stimulating units (psl) within a circle of standard area was used to determine the relative hybridization level to radiolabeled head cDNA for each clone. Hybridization categories are as follows. Absent, 0-300 psl (corresponding to background); light, 301-499 psl; medium, 500-899 psl; dark, 900-2500 psl. Clones are categorized as follows. Novel, sequence information for clones that were novel at the initial phase of this project; known, Drosophila sequence information previously submitted to sequence databanks; ribosomal protein/ribosomal RNA/mitochondrial, sequences corresponding to ribosomal proteins, ribosomal RNAs or mitochondrial transcripts; isologs, transcripts that have a high degree of similarity to sequences reported in the databanks. The level of hybridization to the probe varied considerably within a category. In particular, novel transcripts did not uniformly have low levels of hybridization, which suggested that hybridization level would not greatly aid in identifying novel clones. Therefore, subsequent clones for this study were randomly chosen for sequence analysis. It is possible that abundant transcripts may not be as well represented in the database as a result of directed cloning of rarer molecules, or that cDNA abundance in this library may not accurately reflect relative transcript abundance in the fly brain.

Sequence data

We obtained sequence data for 271 independently isolated cDNAs representing transcripts expressed in the Drosophila brain (Figure 1, Table 2). Of these, 141 clones originally classified as either novel (114 clones) or matched isologs (27 clones) were completely sequenced. Only end-tag sequence data was collected for clones classified as matched isolog ribosomal protein sequences (16 clones), known Drosophila sequences (71 clones) and known Drosophila ribosomal protein sequences (23 clones). All insert sequences or ESTs can be obtained by searching GenBank with the appropriate accession numbers listed in Table 2. Data for 20 mitochondrial 16S clones are not reported here because mitochondrial expression is not the focus of this study.
Table 2

GenBank accession numbers of all sequenced clones

Clone categoryGenBank accession numberTotal
Novel sequences only matched to genomic dataAF171764, AF171789, AF171794, AF171800, AF171805, AF171808, AF171813, AF171815, AF171819, AF171821, AF171828, AF171838, AF171850, AF171854, AF171858, AF171859, AF171865.17
Matched with an EST, but NOT a predicted geneAF171766, AF171772, AF171779, AF171780, AF171781, AF171782, AF171785, AF171787, AF171796, AF171797, AF171804, AF171811, AF171818, AF171826, AF171829, AF171837, AF171839, AF171840, AF171845, AF171848, AF171857, AF171860, AF171862, AF171863, AF171869.25
Matched with a predicted gene, but NOT an ESTAF171867, AF171768, AF171778, AF171762, AF171790, AF171799, AF171771, AF171803, AF171812, AF171832, AF171861, AF171868.12
Matched with an EST and a predicted geneAF171761, AF171762, AF171763, AF171765, AF171767, AF171769, AF171770, AF171773, AF171774, AF171775, AF171776, AF171777, AF171784, AF171786, AF171788, AF171791, AF171792, AF171793, AF171795, AF171798, AF171801, AF171802, AF171806, AF171807, AF171809, AF171810, AF171814, AF171816, AF171817, AF171820, AF171822, AF171823, AF171824, AF171825, AF171827, AF171830, AF171831, AF171833, AF171834, AF171835, AF171836, AF171841, AF171842, AF171843, AF171844, AF171846, AF171847, AF171849, AF171851, AF171852, AF171853, AF171855, AF171856, AF171864, AF171866, AF171870, AF171871, AF171872, AF179229, AF179230.60
Matched isologAF083295-AF08321.27
Matched isolog ribosomal protein sequencesAF083272, AF083275-AF083279, AF083281, AF083286- AF083294.16
Known Drosophila ribosomal protein sequencesAF083513-AF083522, AF083524-AF083526, AF083528, AF083530, AF083531, AF083537, AF083538, AF083544-AF083548.23
Known Drosophila sequencesAF083504-AF083512, AF171701-AF171743, AF171745-AF171760, AF171784, AF171807, AF171872.71

Sequences classified as 'novel' represent new EST sequence data from D.melanogaster (as of 10 October, 2000)and are not homologous to any EST or cDNA sequence in GenBank. These 17 novel sequences are not predicted to be transcribed, and are described in more detail in Table 5b. Sequences classified as 'matched with an EST' correspond to known EST sequence data, but do not have a corresponding predicted gene. Sequences classified as 'matched with a predicted gene' correspond to those that are predicted genes, but do not have corresponding EST data. Sequences classified as 'matched with an EST and a predicted gene' have both EST and predicted gene matches. Sequences classified as 'matched isologs' are sequences that are homologous to genes found in other organisms. Sequences classified as 'matched isolog ribosomal protein' are sequences that are homologous to ribosomal protein genes found in other organisms. Sequences classified as 'known Drosophila ribosomal protein sequences' are matched with sequences previously reported to GenBank. Sequences classified as 'known Drosophila' are sequences that have been previously reported to databases (AF083507 and AF083508 correspond to clone 159; AF083509 and AF083510 correspond to clone 226).

GenBank accession numbers of all sequenced clones Sequences classified as 'novel' represent new EST sequence data from D.melanogaster (as of 10 October, 2000)and are not homologous to any EST or cDNA sequence in GenBank. These 17 novel sequences are not predicted to be transcribed, and are described in more detail in Table 5b. Sequences classified as 'matched with an EST' correspond to known EST sequence data, but do not have a corresponding predicted gene. Sequences classified as 'matched with a predicted gene' correspond to those that are predicted genes, but do not have corresponding EST data. Sequences classified as 'matched with an EST and a predicted gene' have both EST and predicted gene matches. Sequences classified as 'matched isologs' are sequences that are homologous to genes found in other organisms. Sequences classified as 'matched isolog ribosomal protein' are sequences that are homologous to ribosomal protein genes found in other organisms. Sequences classified as 'known Drosophila ribosomal protein sequences' are matched with sequences previously reported to GenBank. Sequences classified as 'known Drosophila' are sequences that have been previously reported to databases (AF083507 and AF083508 correspond to clone 159; AF083509 and AF083510 correspond to clone 226).
Table 5b

Additional information

Accession numberEST hit?Predicted gene?AAATAA/Poly(A) spacingSplicing detected?Expression detected inInsert sizeComments
AF171819NONO37Body145
AF171858NONO18Body186
AF171764NONO280Both450
AF171805NONO25Both259
AF171789NONO134Brain857
AF171794NONO45Brain610
AF171800NONONot presentBrain190
AF171808NONO23Brain269
AF171815NONO12Brain363
AF171854NONO52Spliced, consensusBrain195Extensive poly(A) tail (146 nucleotides)
AF171813NONONot presentNeither216Extensive poly(A) tail (>42 nucleotides)
AF171821NONONot presentSpliced, consensusNeither359
AF171828NONONot presentNeither189
AF171838NONO18Neither430
AF171850NONO13Neither1104
AF171859NONO18Neither1786
AF171865NONONot presentNeither452

Additional information concerning clones that lack EST data and that are not predicted to be transcribed (Novel cDNA). The GenBank accession number for each of the 17 clones is indicated. The distance between a putative poly(A) addition sequence (AAATAAA), when present, and the poly(A) sequence is shown. 'Splicing detected?' indicates the two cDNA clones showing evidence for consensus splicing. 'Expression detected' refers to each clone's hybridization results when probed with radiolabeled cDNA from brain or body (see Table 6). 'Insert size', size of the cDNA insert; 'comments', additional comments for the identified clone. None of the 17 cDNA is predicted to encode a protein larger than 100 amino acids.

We generated sequence data for 27 Drosophila genes that had been previously sequenced from other organisms. Isologs that were identified in the brain library but which had not been identified or sequenced in Drosophila melanogaster are listed in Table 3 (with the exception of ribosomal protein genes). An isolog is defined as a sequence that has a high degree of similarity to genes identified in other organisms, but the functional relationship between these genes has not been demonstrated [34]. As expected, we recovered many previously identified Drosophila genes (Table 4). We did not continue with full insert sequencing of these Drosophila sequences, but the EST data for these clones was submitted to GenBank.
Table 3

Isologs identified in Drosophila brain study

GeneOrganismAccession NumberScoreProbability
Cytochrome oxidase I and IIC. pothophilaAF0832952016.2e-8
Protein synthesis factor eIF-1AH. sapiensAF0832962371.1e-23
ATP synthase G chainB. taurusAF0832972822.6e-30
DNA supercoiling factorSilkwormAF0832982491e-65
Arp2/3 complex 20 kD subunitH. sapiensAF0832996813.7e-85
Clathrin coat-associated protein 50H. sapiensAF0833006631.5e-85
Trg geneR. norvegicusAF0833013007.1e-39
Testican geneH. sapiensAF0833028068.8e-57
Calmodulin-like processed pseudogene (similar to D. melanogaster DMTnc 73F troponin but not identical)H. sapiensAF0833031231.0e-27
Peripheral type benzadiazipine receptorH. sapiensAF0833042205.3e-38
Retinal protein 4H. sapiensAF0833054594.2e-73
Ubiquitin-like S30 ribosomal fusion proteinH. sapiensAF0833061511.9e-15
Insulinoma rig-analog DNA-binding proteinH. sapiensAF0833074981.3e-31
Neuronal calcium binding proteinC. elegansAF0833086298.2e-78
Mitochondrial ubiquinone-binding proteinH. sapiensAF0833091152.0e-25
Oxidoreductase geneH. sapiensAF0833101723.4e-23
Iron-sulfur proteinR. rieskeAF0833118125.1e-57
Copper chaperone for superoxide dismutaseH. sapiensAF0833123592.0e-42
Putative fatty-acid binding proteinA. gambiaeAF0833135301.0e-53
SMT3 proteinH. sapiensAF0833146498.3e-44
Core P2 precursor ubiquinol cytochrome c reductase complexB. taurusAF0833151725.5e-15
Tyrosyl-tRNA synthetaseH. sapiensAF0833165007.8e-39
Transferrin geneFlesh flyAF0833175111.0e-61
Leucyl-tRNA synthetaseA. thalianaAF0833181793.1e-72
Protein translation factor SUI1A. gambiaeAF0833193811.8e-47
Uracil phosphoribosyl transferaseS. cerevisiaeAF0833204348.7e-51
Metallopanstimulin geneH. sapiensAF0833213223.3e-39

'Gene' indicates the homologous gene name, 'organism' indicates the organism which has the greatest similarity to the Drosophila clone, and the accession number for the Drosophila isolog is listed. The 'score' and 'probability' for each match using BLASTN [41] are reported.

Table 4

Brain cDNA clones matched with previously reported Drosophila genes

GeneAccession numberLocationReference
Frequenin geneAF171701CNS and PNS of adults and embryos[45]
Ferritin subunit 1 geneAF171703Fat body and gut of larvae, present in all stages and increased with iron supplementation[46]
Tyrosine kinase geneAF171705Not specified-
Glutamine synthase geneAF171706Mitochondrial
AF171717
Rab-related protein 4 geneAF171708Endoplasmic reticulum and Golgi (rats expression highest in brain)[47,48]
S-adenosylmethionine decarboxylase geneAF171712Polyamine synthesis, presumably in every cell, highest in 24 and 48 h larvae[49]
Twinstar geneAF171714Male germ line and larvae throughout development[50,51]
CCATT box transmembrane domain geneAF171715Not specified-
Geranylgeranyl transferase geneAF171718Not specified-
ADP-robosylation factor class II geneAF171722Uniformly distributed ubiquitous in all adult body segments[52,53]
Virus-like particleAF171727Long gland and ovipositor in adults[54]
Rot geneAF171731Not specified-
DNA-binding protein erect wingAF171732Throughout embryonic development and enriched in adult head[55]
Membrane-associated protein geneAF171736Uniform in embryonic development[56]
BBC-1 geneAF171739Not specified-
Vimar geneAF171743Midgut and hindgut, visceral mesoderm, CNS, and PNS in embryos[57]
Metallothionein geneAF171741Alimentary canal and lower in other tissues of larvae[58]
Nicotinic acetylcholine receptor geneAF171702Brain and CNS predominantly in late embryos and adult head[59,60,61]
Burdock retrotransposon gag protein geneAF171716Not specified-
Transposable element to copia mgd3 retroposonAF171724Varies with Drosophila populations[62,63]
AF171725
Heat-shock gene hsp 27AF171728CNS, sperm, and oocytes, present in all stages, highest in white prepupae[64,65]
Alpha 1,2 mannosidase geneAF171730Embryonic PNS, adult eye, and wing[66]
GTP cyclohydrolase IAF171733Embryo nuclear, adult eye and head[67,68]
Teashirt geneAF171735Epidermis and mesoderm during development[69]
AF171759
49 kD phosphoproteinAF171737Photoreceptors[70]
AF171738
Alcohol dehydrogenase related geneAF171740Not specified-
Vacuolar-ATPase geneAF171742Uniform expression in all stages[71]
Micropia-Dm11 3' flanking DNAAF171746Not specified-
AF171748
RM62 RNA helicaseAF171749Not specified-
ADP/ATP translocaseAF171704Not specified-
Ubiquitin protein geneAF171707Tissue-general, all life stages[72]
Calmodulin geneAF171711CNS and mushroom bodies of adults[73]
AF171781
TRIP-1 homolog geneAF171713Not specified-
AF171726
Bnb gene for developmentAF171729Mesectoderm and presumptive epidermis, after dorsal closure periphery of nervous system including glia that may establish longitudinal neuropile scaffolding, embryonic CNS[74]
AF171751
B(2)gcn geneAF171754Not specified-
Diacylglycerol kinase geneAF171720Eye-specific in adult nervous system, muscles, compound eye,[75,76]
AF171721brain cortex, fibrillar muscle, and tubular muscle
AF171755
AF171756
Gene from heat-shock locus 93DAF171760Constitutive monitoring the 'health' of translation machinery, presumably in every cell[77,78]
Cytochrome c oxidase geneAF171709Mitochondrial
AF171710
AF171719
AF171723
AF171734
AF171752
BM40 geneAF171872Not specified-
Histone H3.3 geneAF171745Gonads and somatic tissue, uniform distribution in polytene chromosomes[79,80]
Acetylcholine receptor-related proteinAF171747CNS[81]
Hu-li tai shao geneAF171750Ovarian ring canal[82]
Laminin receptor geneAF171753Neural tissue[83]
eIF-2 alpha-subunitAF171758Expressed throughout embryos, and CNS in later stages[84,85]
Gerceraldehyde-3-phosphate dehydrogenase-2 geneAF171757Evenly distributed, expressed in all stages[86]
CNS-specific Noe geneAF171772CNS[87]
AF171796
AF171818
AF171848
AF171780
Medea-B geneAF171807Not specified-
Phospholipase C norpA geneAF171840Retina and body of adults[88]
Recq helicase 5 geneAF171784DNA repair, recombination, and replication[89]

Each previously identified gene is listed with its accession number from the Drosophila brain study. Location information is as reported by the indicated reference.

Isologs identified in Drosophila brain study 'Gene' indicates the homologous gene name, 'organism' indicates the organism which has the greatest similarity to the Drosophila clone, and the accession number for the Drosophila isolog is listed. The 'score' and 'probability' for each match using BLASTN [41] are reported. Brain cDNA clones matched with previously reported Drosophila genes Each previously identified gene is listed with its accession number from the Drosophila brain study. Location information is as reported by the indicated reference. Approximately 42% of the sequence data generated in this study were originally novel according to sequence analysis searches conducted at the beginning of this project. Since then, much EST data has been added to GenBank and the Drosophila genome sequence has been released. Thus, in October 2000 the 114 previously novel brain cDNA were again compared with fly sequence data. The percentage of transcripts that do not have corresponding ESTs is reduced to 11% (Table 2; of 29 clones, 17 have no EST matches and are not predicted genes following genome annotation, and 12 have no EST matches but are matched with a predicted gene). Although each of these 29 clones lacks an EST match, each clone is identified within the Drosophila genome sequence recently reported by Adams et al. [35]. It is possible that some of these clones represent the 3' ends of ESTs for which only 5' sequence data is available. Considering that data for approximately 80,000 ESTs (24,193 ESTs from adult heads alone) are reported [36] and that our analysis examined only 271 randomly chosen brain library clones, 11% is a surprisingly large number. This indicates that this library is a valuable resource for generating sequence data that will facilitate genome annotation, specifically identifying regions transcribed in the adult fly brain. From our analysis it is clear that EST data are essential for accurate and thorough genome annotation. In particular, using current genome annotation algorithms, 42 of the 271 brain clones do not correspond to predicted genes (Table 2). Of these 42 clones, however, 25 have EST matches with the Berkeley Drosophila Genome Project (BDGP) data (Tables 2, 5). Comparisons of the remaining 17 cDNA sequences with the Drosophila genome sequence show evidence of RNA processing (exon/intron borders and consensus splicing sequences) for two clones, and presence of a poly(A) addition sequence (AAUAAA) 12 to 30 bp upstream of an extensive poly(A) region at the 3' end of the insert sequence for seven clones (Table 5b). Ten of the 17 clones were detected in reverse northern experiments using either brain or body radiolabeled cDNA (Table 6). The distribution of detection by brain cDNA, body cDNA, both or neither (not detectable above background) for the 17 clones in this category is similar to the distributions observed in the other categories (Tables 5a, 6), and strikingly similar to the detection frequency observed for the 'matched with an EST and a predicted gene' category. Although these data suggest that these sequences are transcribed, additional experiments are necessary to confirm whether this is true for each clone. None of the clones in this category is predicted to encode a protein larger than 100 amino acids. It is possible that these sequences may correspond to genomic DNA. Alternatively, these novel RNA molecules may perform some unknown cellular function that requires a conserved structure rather than a conserved sequence.
Table 5a

Correlation between EST match, gene prediction and hybridization analysis

Detected inAccession numberInsert sizePercentage of category
Novel cDNA
BodyAF17181914512%
 BodyAF171858186
BothAF17176445012%
 BothAF171805259
BrainAF17178985735%
 BrainAF171794610
 BrainAF171800190
 BrainAF171808269
 BrainAF171815363
 BrainAF171854195
NeitherAF17181321641%
 NeitherAF171821359
 NeitherAF171828189
 NeitherAF171838430
 NeitherAF1718501104
 NeitherAF1718591786
 NeitherAF171865452
Matched with an EST, but NOT a predicted gene
BodyAF1718574324%
BothAF17177248032%
 BothAF1717791101
 BothAF171787449
 BothAF171804553
 BothAF171818180
 BothAF1718391282
 BothAF171848345
 BothAF171860151
BrainAF17176679836%
 BrainAF171780292
 BrainAF171781400
 BrainAF171782506
 BrainAF171785727
 BrainAF171796313
 BrainAF171797400
 BrainAF171811386
 BrainAF1718631072
NeitherAF17182668328%
 NeitherAF171829380
 NeitherAF171837503
 NeitherAF171840623
 NeitherAF171845376
 NeitherAF171862800
 NeitherAF171869641
Matched with a predicted gene, but NOT an EST
BodyAF1718672878.3%
BothAF171768174633.3%
 BothAF171778389
 BothAF171790396
 BothAF171799819
BrainAF171771106033.3%
 BrainAF171783373
 BrainAF171803688
 BrainAF171812338
NeitherAF17183233825%
 NeitherAF171861398
 NeitherAF171868689
Matched with an EST and a predicted gene
BodyAF1718465783%
 BodyAF171856600
BothAF17176323630%
 BothAF171770919
 BothAF171773315
 BothAF171774745
 BothAF171776278
 BothAF171777165
 BothAF171786223
 BothAF171791451
 BothAF171792619
 BothAF171793234
 BothAF171795200
 BothAF171798733
 BothAF171809577
 BothAF171810966
 BothAF171817250
 BothAF171830567
 BothAF1792291396
 BothAF179230168
BrainAF171761108127%
 BrainAF171762652
 BrainAF171765281
 BrainAF171767228
 BrainAF1717691051
 BrainAF171775523
 BrainAF1717881421
 BrainAF171801600
 BrainAF171802623
 BrainAF171806727
 BrainAF171807380
 BrainAF171814237
 BrainAF171816324
 BrainAF171822376
 BrainAF171824785
 BrainAF171833868
NeitherAF17178435140%
 NeitherAF171820427
 NeitherAF171823234
 NeitherAF171825675
 NeitherAF171827253
 NeitherAF171831222
 NeitherAF171834240
 NeitherAF171835916
 NeitherAF171836406
 NeitherAF171841436
 NeitherAF171842331
 NeitherAF17184399
 NeitherAF171844764
 NeitherAF171847301
 NeitherAF171849193
 NeitherAF171851364
 NeitherAF171852364
 NeitherAF171853473
 NeitherAF171855991
 NeitherAF171864541
 NeitherAF171866412
 NeitherAF171870399
 NeitherAF171871488
 NeitherAF171872546

Hybridization results for the indicated cDNA categorized as 'novel cDNA'; 'matched with an EST, but not a predicted gene'; 'matched with a predicted gene, but not an EST'; or 'matched with an EST and a predicted gene' (see Table 2). Purified plasmids containing the indicated insert sequence were hybridized with either labeled brain or body cDNA (see Table 6). The percentage of clones exhibiting the indicated hybridization pattern within each category is indicated.

Table 6

Body versus brain expression of originally novel clones

Clone categoryGenBank accession numberpsl/mm2-bkgTotal

BrainBody
Brain onlyAF1717611.52ND34
AF1717650.68ND
AF1717660.60ND
AF1717671.62ND
AF1717690.15ND
AF1717710.79ND
AF1717751.20ND
AF1717804.18ND
AF1717811.10ND
AF1717821.03ND
AF1717830.94ND
AF1717851.52ND
AF1717881.44ND
AF1717890.77ND
AF1717940.61ND
AF1717963.74ND
AF1717970.87ND
AF1718001.01ND
AF1718010.98ND
AF1718020.46ND
AF1718030.86ND
AF1718061.15ND
AF1718070.70ND
AF1718080.92ND
AF1718110.38ND
AF1718120.60ND
AF1718140.90ND
AF1718150.91ND
AF1718160.24ND
AF1718225.43ND
AF1718241.14ND
AF1718330.34ND
AF1718540.20ND
AF1718633.73ND
Body and brainAF1717623.940.0133
AF1717632.818.92
AF1717641.130.06
AF1717681.760.39
AF1717701.250.70
AF17177214.10.81
AF1717731.030.43
AF1717740.770.04
AF1717761.090.43
AF1717771.201.65
AF1717781.452.48
AF1717790.670.55
AF1792292.460.09
AF1792301.383.88
AF1717860.770.03
AF1717871.141.02
AF1717901.170.80
AF1717911.076.13
AF1717921.251.56
AF1717931.191.47
AF1717953.320.68
AF1717980.920.31
AF1717991.300.05
AF1718041.050.47
AF1718051.110.13
AF1718092.220.49
AF1718101.340.12
AF1718170.630.01
AF17181818.30.94
AF1718300.310.01
AF1718399.932.40
AF17184820.70.13
AF17186023.40.67
Body onlyAF171819ND12.96
AF171846ND0.08
AF171856ND0.06
AF171857ND0.19
AF171858ND0.35
AF171867ND0.21
Neither body nor brainAF171784NDND41
AF171813NDND
AF171820NDND
AF171821NDND
AF171823NDND
AF171825NDND
AF171826NDND
AF171827NDND
AF171828NDND
AF171829NDND
AF171831NDND
AF171832NDND
AF171834NDND
AF171835NDND
AF171836NDND
AF171837NDND
AF171838NDND
AF171840NDND
AF171841NDND
AF171842NDND
AF171843NDND
AF171844NDND
AF171845NDND
AF171847NDND
AF171849NDND
AF171850NDND
AF171851NDND
AF171852NDND
AF171853NDND
AF171855NDND
AF171859NDND
AF171861NDND
AF171862NDND
AF171864NDND
AF171865NDND
AF171866NDND
AF171868NDND
AF171869NDND
AF171870NDND
AF171871NDND
AF171872NDND

Purified plasmid DNA was spotted onto a nylon filter and hybridized with either radiolabeled brain or body cDNA. Clones are classified on the basis of whether they were detected with brain, body, both (brain and body) or neither (not above background) radiolabeled cDNA. The clones tested were originally novel (no EST or previous sequence information), but some clones have changed classification as a result of both the large number of ESTs submitted thorough the Drosophila genome effort (BDGP) and the Drosophila genome annotation efforts. Additional information for these clones is listed in Tables 2, 5. ND, not determined. Bkg, background.

Correlation between EST match, gene prediction and hybridization analysis Hybridization results for the indicated cDNA categorized as 'novel cDNA'; 'matched with an EST, but not a predicted gene'; 'matched with a predicted gene, but not an EST'; or 'matched with an EST and a predicted gene' (see Table 2). Purified plasmids containing the indicated insert sequence were hybridized with either labeled brain or body cDNA (see Table 6). The percentage of clones exhibiting the indicated hybridization pattern within each category is indicated. Additional information Additional information concerning clones that lack EST data and that are not predicted to be transcribed (Novel cDNA). The GenBank accession number for each of the 17 clones is indicated. The distance between a putative poly(A) addition sequence (AAATAAA), when present, and the poly(A) sequence is shown. 'Splicing detected?' indicates the two cDNA clones showing evidence for consensus splicing. 'Expression detected' refers to each clone's hybridization results when probed with radiolabeled cDNA from brain or body (see Table 6). 'Insert size', size of the cDNA insert; 'comments', additional comments for the identified clone. None of the 17 cDNA is predicted to encode a protein larger than 100 amino acids. Body versus brain expression of originally novel clones Purified plasmid DNA was spotted onto a nylon filter and hybridized with either radiolabeled brain or body cDNA. Clones are classified on the basis of whether they were detected with brain, body, both (brain and body) or neither (not above background) radiolabeled cDNA. The clones tested were originally novel (no EST or previous sequence information), but some clones have changed classification as a result of both the large number of ESTs submitted thorough the Drosophila genome effort (BDGP) and the Drosophila genome annotation efforts. Additional information for these clones is listed in Tables 2, 5. ND, not determined. Bkg, background. The Drosophila genome is predicted to contain 13,601 genes [35]. Ifour observations are representative and can be extended to the number of genes in the fly genome, then our analysis suggests that the total number of genes may be underestimated by approximately 15% (42 of the 271 randomly chosen cDNAs do not correspond to a predicted gene). Thus, approximately 2,000 genes may await discovery.

Transcript distribution analysis

A second hybridization study was conducted to determine whether clones originally identified as novel were detectable in the brain and/or body of adult Drosophila. This data may offer clues as to which transcripts are involved in basic neuronal function, as opposed to a function that may be specific to the brain. The Drosophila central nervous system (CNS) includes thoracic and abdominal ganglia and, therefore, neural transcripts are often expressed throughout the body. Thus, it was possible that few transcripts would be brain-specific. To determine how the (originally) novel clones were distributed in the animal, plasmid templates from 114 novel clones were spotted on filters and hybridized with radiolabeled cDNA from either brains or bodies (minus heads). The results of this study are listed in Table 6. In this experiment cDNA probe is limiting and, therefore, many transcripts that are in low abundance may not be detectable. In fact, 36% of the clones were not detected in either brain or body. These clones may correspond to less abundant transcripts. Ideally, hybridization probe would be in excess in these experiments to determine which clones are brain specific, but Drosophila brain cDNA is limiting. Approximately 30% of the clones were detectable only in the brain and are candidates for genes involved in brain function. Clones that were detected in both tissues made up about one third of the novel transcripts (29%). About 5% of the clones were detected only in the body. As the library is made from brain tissue, we did not expect to recover many transcripts that would only be detectable in the body, as compared to the brain. We used published localization data from previously identified transcripts to evaluate the data we collected for the novel clones (Table 4). Approximately 22% (7 of 32) of the known Drosophila genes listed are neural-specific, and approximately 30% of novel transcripts were detected only in the brain. Approximately 29% of the novel transcripts were detectable in both brain and body tissues. Known Drosophila genes that were localized in body and brain tissues accounted for 56% (18 of 32) of genes for which localization data was available (Table 4). Clones detected in both tissues may indicate that the gene product is needed in all cells. Genes from the nervous system would be expected to be expressed in both tissues, so transcripts detected in both cannot be ruled out of this category; but these transcripts are not brain specific. Approximately 5% of the novel clones were detectable only in the body, as compared to 22% (7 of 32) of the known Drosophila clones detected only in body tissues. These transcripts are apparently expressed at a higher level in the body and at relatively low levels in the brain. It should be noted that localization data were not specified for 18 of the 49 (37%) known transcripts listed in Table 4. This analysis suggests that this brain cDNA library is a rich source for generating cDNA sequence information and for identifying novel, brain-specific cDNAs.

Conclusions

The initial analysis of an adult Drosophila brain library is presented here. Somewhat surprisingly, we observe no clear connection between the abundance of a transcript and its appearance in a sequence data bank. However, molecular screens that are directed towards isolating rare transcripts may skew the transcript-related data in sequence banks towards less abundant molecules. As shown in Figure 1 and Table 2, we have identified and sequenced 29 novel clones that do not match with other known expressed sequences (but do match with fly genomic sequence information), 85 clones that are matched with EST data, 71 clones that were previously reported Drosophila sequences, 39 clones that contain ribosomal protein sequences, 27 clones that are matched with genes previously reported for other organisms (isologs, Table 3) and 20 clones corresponding to mitochondrial sequences. Why did we recover such a high percentage of novel sequences? Libraries made from brain tissue are proposed to have a higher complexity of transcripts than libraries made from other tissues [21]. Therefore, EST screens of brain libraries should yield larger numbers of independent transcripts, as a result of the increased transcript complexity within brain tissues. Another possible explanation for the surprisingly large number of novel cDNAs identified in our analysis is that our library is not normalized. It has been proposed that hybrids form between poly(dA) and poly(dT) sequences during the hybridization/subtraction reaction and that these sequences are subsequently lost [36]. An ultimate goal of this project is to create a database of all the transcripts expressed in the Drosophila brain and to correlate this information with their patterns of expression in the brain. This type of a database would be a valuable resource and could be used in comparative studies with other organisms. Comparisons of transcripts from organisms with relatively simple brains (Drosophila) to organisms with more complex neural function (humans) may offer insights into basic brain function and aid in the identification of transcripts involved in higher-order brain functions. The 35 clones that appear enriched in the brain may identify proteins or RNAs that are involved in a brain-specific function. Transcripts identified in this library can be directly tested for protein-protein interaction using the yeast two-hybrid capability of the library, making it a good resource for many areas of study. Our analysis of this unique brain library demonstrates that many transcribed regions of the Drosophila genome remain undiscovered, and that approximately 2,000 more genes may be identified. Genome annotation efforts emphasize identifying protein-coding regions [37]. Thus, it is possible that some of the ESTs lacking a corresponding predicted gene were missed during genome annotation because an open reading frame (or one of sufficient size) was not predicted. Complete genomic sequences are excellent resources, and extensive annotation of a genome makes the sequence information even more powerful. Current software is not sufficient to identify all transcribed regions within the genome. As of the year 2000, EST data for 24,193 clones from adult Drosophila head libraries is reported and estimated to represent over 40% of all Drosophila genes [38]. Our results confirm that not all transcribed regions of the genome are identified and that EST analyses are essential for accurate and complete genome annotation.

Materials and methods

Tissue preparation

To produce the animals for the brain dissections, adult Drosophila were entrained using 12 h light and 12 h darkness in temperature-controlled incubators at 25°C. Entrained adult D. melanogaster (Canton S) flies were collected 3 h after the lights were turned off, frozen on dry ice, shaken to detach heads from bodies, and separated through a screen to isolate the heads. Frozen heads were incubated in prechilled -20°C, 100% acetone (EM Science) at -20°C overnight to replace the water in the tissue with acetone [39]. Prechilling the acetone prevents the heads from thawing when added to the acetone. Heads were dried at room temperature, and brains were removed using fine dissecting tweezers.

RNA preparation

RNA was isolated according to the Micro RNA Isolation protocol from Stratagene, with the exception of homogenization. Dried tissue was homogenized in denaturing solution with β-mercaptoethanol for 1 min, incubated on ice for 15 min, and then homogenized for an additional 5 min. The addition of a rehydration step increased the yield of RNA from dried tissue to approximately the same level as fresh tissue (16-21.9 μg total RNA extracted from 100 fresh heads, 15-29.3 μg total RNA extracted from 100 acetone-dried heads with rehydration step, compared to 3-6.6 μg total RNA extracted from 100 acetone-dried heads with no rehydration step). Poly(A) RNA was isolated using the Poly(A) Quick mRNA Isolation Kit from Stratagene according to the manufacturer's instructions. Approximately 5 μg poly(A) RNA was extracted from approximately 15,000 brains. Weighing acetone-dried brains allowed us to estimate how many were used to construct the library (50 acetone-dried brains weigh 0.25 mg and 50 acetone-dried heads weigh 1.8 mg).

Library construction

The library was constructed using a Stratagene HybriZAP™ Library kit. First-strand cDNA synthesis was primed from the 3' end of the poly(A) RNA using a poly(T) primer that also contained an XhoI restriction site and a GAGA sequence (5'-GAGAGAGAGAGAGAGAGAGAACTAGTCTCGAGTTTTTTITTTTTTTTTTT-3'). 5-methyl dCTP was used during first-strand cDNA synthesis to protect internal XhoI sites. Second-strand synthesis was primed by the partially digested RNA that resulted from RNase H treatment of the first-strand synthesis reaction. Pfu DNA polymerase was used to blunt the cDNA and EcoRI adapters were ligated to the blunt ends. The cDNA was digested with EcoRI and XhoI, size separated (retaining molecules approximately 200 bp or larger), ligated into HybriZAP™ vector arms, and packaged into phage heads for amplification.

Determining the number of primary clones

The cDNA library was titered to determine how many independent clones were recovered. At the 10-1 dilution there were 270 plaques per plate, giving a total of 6.75 × 106 clones in the primary library. This number was calculated as follows: (number of plaques 270) × (dilution factor 10) × (total packaging volume 500 μl) / (total number of mg packaged 8.75 × 10-5) × (number of μl packaged 1) = 1.542 × 1010 plaque-forming units (PFU) per mg or 1.35 × 106 PFU per packaging reaction. There are five packaging reactions for the entire library for a total of 6.75 × 106 clones in the primary library.

Sequencing-template preparation

PCR template

Individual phage plaques were incubated in 400 μl SM buffer overnight and used as amplification template. Amplification reactions were performed in a total volume of 40 μl and contained 2 μl eluted phage, 40 ng of each primer (FADI 5'-CACTACAATGGATGATG-3' and RADI 5'-CTTGCGGGGTTTTTCAG-3'), 0.001% Tween 20 (Sigma), 2.5 U Taq DNA polymerase (Promega), 1x Taq polymerase buffer (Promega), 1.56 mM MgCl2 (Promega), and 0.25 mM of each dNTP (USB). PCR was performed on a Perkin-Elmer 9600 GeneAmp PCR system, as specified in the HybriZAP™ Two-Hybrid cDNA Gigapack Cloning Kit instruction manual (Stratagene). After amplification, reactions were incubated at 37°C for 15 min with 0.5 Uμl-1 exonuclease I (USB) and 0.5 Uμl-1 shrimp alkaline phosphatase (Amersham Life Sciences). Enzymes were inactivated by heat treatment at 85°C for 15 min. Resulting samples were electrophoretically separated on a 1% Agarose (Kodak) gel and compared to a quantitative marker, BioMarker-EXT (BioVentures), to estimate the DNA concentration of each sample. This DNA was directly used in subsequent sequencing reactions.

Plasmid template

Individual phage plaques were incubated in 400 μl SM buffer overnight and then used for excision. Library phage were incubated with ExAssist Helper Phage™ (Stratagene) and XL1-Blue Escherichia coli cells, and grown overnight in Luria Broth. E. coli cells were killed by heat treatment (70°C, 20 min). XLOR E. coli cells were inoculated with the released phagemids and this mixture was plated on 50μg ml-1 ampicillin (Sigma) selection medium. Resulting colonies were cultured for subsequent plasmid DNA preparation (Perfect Prep Plasmid DNA kit, 5'-3' Inc.).

Sequencing

Initial sequence information was obtained using standard sequencing methods (described below) and a vector primer directed toward the 5' end of the insert (FADI 5'-CACTACAATGGATGATG). These ESTs were evaluated using the BLAST search program [40,41] linked to the nonredundant GenBank database. Novel cDNAs and isologs were completely sequenced. If a cDNA had been previously identified, sequence determination was not continued. The second standard sequencing reaction was primed from poly(A) tail using 1.6 pmol of a poly(T) primer anchored (PLYT 5'-TTTTTTTTTTTTTTTV-3' (V=A, C, or G)). cDNA sequences were completed using an octamer-primer walking strategy [42,43]. Automated sequencing reactions were performed using ABI PRISM Dye Terminator (or dRhodomine for PLYT reactions) Cycle Sequencing Ready Reaction Kits with AmpliTaq DNA polymerase, FS, according to the manufacturer's directions or as described for octamer-primed sequencing reactions [42,43]. The FADI primer was annealed at 48°C and the PLYT primer was annealed at 20°C. Sequencing reactions were ethanol precipitated, pellets were resuspended in 3.5 μl loading buffer, 1.5 μl was loaded onto a sequencing gel, and the data was collected by an ABI PRISM 377 DNA sequencer. Data collected from the ABI PRISM 377 DNA sequencer was manually edited using Sequencher 3.0 (GeneCodes).

Hybridization analyses

Eighty-five individual phage plaques were incubated in 400 μl SM buffer overnight and 2 μl phage eluant was used to produce a grid of plaques on a lawn of E. coli cells. Filter lifts were taken from the grid of plaques and hybridized at 65°C overnight with labeled Drosophila head cDNA at 1 × 106 cpm per ml hybridization buffer (50% formamide, 5x SSC, 0.1% Ficoll w/v, 0.1% PVP w/v, 0.1% BSA w/v, 0.1% SDS w/v, 0.2 μg ml-1 salmon sperm DNA and 1 mM EDTA). Filters were then washed sequentially at 42°C in 5x SSC for 1 h, at 65°C in 1x SSC for 1 h, and at 65°C in 0.1x SSC for 1 h. Filter lifts were exposed to phosphorimaging plates for 24 h (Fuji Medical Systems) and psl (photo stimulating units) of a standard area were determined using a Fuji Bas1000 Imager.

Hybridization of all novel clones

Plasmid DNA (10 ng) from each novel clone was denatured at 95°C for 5 min and spotted onto a nylon filter. Filters were hybridized at 65°C overnight with either labeled Drosophila body (minus head) or labeled Drosophila brain cDNA at 1 × 106 cpm ml-1 in Church and Gilbert buffer (7% SDS, 1 mM EDTA, 500 mM Na2HPO4, and pH to 7.2 with H3PO4 [44]). Filters were washed twice for 1 h (each) at 65°C in Church and Gilbert buffer. Filters were then exposed to phosphorimaging plates for 24 h, and psl of a standard area was determined using a Fuji Bas1000 Imager. 10 ng of each plasmid on the filter contains approximately 1.2 × 1012 copies and, therefore, probe is expected to be limiting in these experiments.

Categorizing clones

Clones classified as 'novel' had no obvious match to nucleic acid/protein sequence information in GenBank (a score less than 100). Novel clones may contain blocks of less than 100 bases that are matched with other sequences, but these small regions have no known functional correlation and the similarity between the two sequences was very low. It is possible that some of our novel clones could be part of an EST from the BDGP that has not been fully sequenced. Sequences categorized as matched to EST data have a high degree of similarity (a score over 100) to reported sequence information, but the previously collected sequence data was not associated with a known function. Sequences categorized as 'known Drosophila' were a perfect match (with perhaps the exception of a few bases, fewer than 10) with sequence information from Drosophila. 'Matched isologs' are sequences that have a high degree of similarity (score of over 100 at the protein level) to a gene found in another organism, but a functional homology between these genes has not been determined. Ribosomal protein sequences were categorized as 'known' or 'isologs' using the above criteria. Ribosomal protein and mitochondrial sequence clones were categorized separately because these types of transcripts frequently occurred in our library.
  81 in total

1.  Generation and analysis of 280,000 human expressed sequence tags.

Authors:  L D Hillier; G Lennon; M Becker; M F Bonaldo; B Chiapelli; S Chissoe; N Dietrich; T DuBuque; A Favello; W Gish; M Hawkins; M Hultman; T Kucaba; M Lacy; M Le; N Le; E Mardis; B Moore; M Morris; J Parsons; C Prange; L Rifkin; T Rohlfing; K Schellenberg; M Bento Soares; F Tan; J Thierry-Meg; E Trevaskis; K Underwood; P Wohldman; R Waterston; R Wilson; M Marra
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

2.  Structure and developmental expression of the D alpha 2 gene encoding a novel nicotinic acetylcholine receptor protein of Drosophila melanogaster.

Authors:  P Jonas; A Baumann; B Merz; E D Gundelfinger
Journal:  FEBS Lett       Date:  1990-08-20       Impact factor: 4.124

3.  Induction of a dominant negative CREB transgene specifically blocks long-term memory in Drosophila.

Authors:  J C Yin; J S Wallach; M Del Vecchio; E L Wilder; H Zhou; W G Quinn; T Tully
Journal:  Cell       Date:  1994-10-07       Impact factor: 41.582

4.  Dynamic expression pattern of Ca(2+)/calmodulin-dependent protein kinase II gene in the central nervous system of Drosophila throughout development.

Authors:  M Rachidi; C Lopes; Y Takamatsu; S Ohsako; J C Benichou; J M Delabar
Journal:  Biochem Biophys Res Commun       Date:  1999-07-14       Impact factor: 3.575

5.  Iron availability dramatically alters the distribution of ferritin subunit messages in Drosophila melanogaster.

Authors:  T Georgieva; B C Dunkov; N Harizanova; K Ralchev; J H Law
Journal:  Proc Natl Acad Sci U S A       Date:  1999-03-16       Impact factor: 11.205

6.  Rapid cDNA sequencing (expressed sequence tags) from a directionally cloned human infant brain cDNA library.

Authors:  M D Adams; M B Soares; A R Kerlavage; C Fields; J C Venter
Journal:  Nat Genet       Date:  1993-08       Impact factor: 38.330

7.  Molecular cloning of a Drosophila diacylglycerol kinase gene that is expressed in the nervous system and muscle.

Authors:  I Masai; T Hosoya; S Kojima; Y Hotta
Journal:  Proc Natl Acad Sci U S A       Date:  1992-07-01       Impact factor: 11.205

8.  Immunocytochemical and learning studies of a Drosophila melanogaster neurological mutant, no-bridgeKS49 as an approach to the possible role of the central complex.

Authors:  A Bouhouche; G Vaysse; M Corbière
Journal:  J Neurogenet       Date:  1993-12       Impact factor: 1.250

9.  Expression of a Drosophila melanogaster acetylcholine receptor-related gene in the central nervous system.

Authors:  S C Wadsworth; L S Rosenthal; K L Kammermeyer; M B Potter; D J Nelson
Journal:  Mol Cell Biol       Date:  1988-02       Impact factor: 4.272

10.  Positional cloning and sequence analysis of the Drosophila clock gene, timeless.

Authors:  M P Myers; K Wager-Smith; C S Wesley; M W Young; A Sehgal
Journal:  Science       Date:  1995-11-03       Impact factor: 47.728

View more
  4 in total

1.  Synaptic and genomic responses to JNK and AP-1 signaling in Drosophila neurons.

Authors:  Paul D Etter; Radhakrishnan Narayanan; Zaneta Navratilova; Chirag Patel; Dirk Bohmann; Heinrich Jasper; Mani Ramaswami
Journal:  BMC Neurosci       Date:  2005-06-02       Impact factor: 3.288

2.  A Drosophila systems model of pentylenetetrazole induced locomotor plasticity responsive to antiepileptic drugs.

Authors:  Farhan Mohammad; Priyanka Singh; Abhay Sharma
Journal:  BMC Syst Biol       Date:  2009-01-21

3.  An integrated gene annotation and transcriptional profiling approach towards the full gene content of the Drosophila genome.

Authors:  M Hild; B Beckmann; S A Haas; B Koch; V Solovyev; C Busold; K Fellenberg; M Boutros; M Vingron; F Sauer; J D Hoheisel; R Paro
Journal:  Genome Biol       Date:  2003-12-22       Impact factor: 13.583

4.  Drosophila Exhibit Divergent Sex-Based Responses in Transcription and Motor Function After Traumatic Brain Injury.

Authors:  Ekta J Shah; Katherine Gurdziel; Douglas M Ruden
Journal:  Front Neurol       Date:  2020-06-19       Impact factor: 4.086

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.