| Literature DB >> 17650329 |
Rafaela M Maia1, Valeria Valente, Marco A V Cunha, Josane F Sousa, Daniela D Araujo, Wilson A Silva, Marco A Zago, Emmanuel Dias-Neto, Sandro J Souza, Andrew J G Simpson, Nadia Monesi, Ricardo G P Ramos, Enilza M Espreafico, Maria L Paçó-Larson.
Abstract
BACKGROUND: The sequencing of the D.melanogaster genome revealed an unexpected small number of genes (~ 14,000) indicating that mechanisms acting on generation of transcript diversity must have played a major role in the evolution of complex metazoans. Among the most extensively used mechanisms that accounts for this diversity is alternative splicing. It is estimated that over 40% of Drosophila protein-coding genes contain one or more alternative exons. A recent transcription map of the Drosophila embryogenesis indicates that 30% of the transcribed regions are unannotated, and that 1/3 of this is estimated as missed or alternative exons of previously characterized protein-coding genes. Therefore, the identification of the variety of expressed transcripts depends on experimental data for its final validation and is continuously being performed using different approaches. We applied the Open Reading Frame Expressed Sequence Tags (ORESTES) methodology, which is capable of generating cDNA data from the central portion of rare transcripts, in order to investigate the presence of hitherto unnanotated regions of Drosophila transcriptome.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17650329 PMCID: PMC1949825 DOI: 10.1186/1471-2164-8-249
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Analysis flowchart of 176 ORESTES unannotated in . Analysis against the genome assembly 4.3 and expressed sequences were manually performed using BLASTn at the FlyBase [5] and BLAT Search [16].
Figure 2Developmental profile of transcripts containing exons unannotated in the current version (4.3) of the . Autoradiograms of Northern blots containing poly A+ RNA extracted from embryos (E), third instar larvae (L3), and adults (A), which were hybridized to the probes indicated at the bottom of each blot. The estimated size (kb) of each transcript is indicated on the left.
Genomic mapping of the validated ORESTES and sizes of the respective transcripts
| 1 | 7D3–5 | Intron | 10.0 | 4.0 | ||
| 2 | 88D1 | Intergenic | 8.4 | CG33967 | 4.9 | |
| 3 | 25A3 | Intergenic | 2.1; 4.5 | CG11928 | 0.3 | |
| 4 | 32A2 | Intergenic | 4.5 | CG7329 | 1.6 | |
| 5 | 88A12 | Exon/Intron | 1.5 | CG33329 | 1.2 | |
| 6 | 66A22-B3 | Exon/Intron | 1.8 | nmo (CG7892) | Isoforms 2.2–3.1 | |
| 7 | 87E8 | Intron | 0.6 | CG9813 | Isoforms 2.1–2.8 | |
| 8 | 91C6-D1 | Intron | 0.9 | CG7720 | Isoforms 2.4–2.5 | |
| 9 | 9F2–4 | Intron | 2.0 | Imp (CG1691) | Isoforms 2.8–3.9 | |
| 10 | 33A1–2 | Intergenic | 11.5 | CG18265 | 6.4 | |
| 11 | 98F10 | Intergenic | 10.2 | CG11874 | 3.2 | |
| 12 | 74E4–5 | Intergenic | 11.0 | TORC | 3.0 | |
| 13 | 25D6-E1 | Intergenic | 3.5 | nompC (CG11020-RB) | 5.1 | |
| 14 | 74D4 | Intron | 9.8 | CycT (CG6292) | Isoforms 3.2–4.3 | |
| 15 | 33A1–2 | Intron | 1.8; 5.0; 9.8 | crol (CG14938) | Isoforms 6.2–7.1 | |
| 16 | 48F6 | Exon/Intergenic | 2.2; 3.1; 4.0 | CG13165 | 1.5 | |
| 17 | 90D1-E1 | Intergenic | 8.5 | cpo (CG31243) | Isoforms 2.8–6.2 |
α sizes of the transcripts detected by ORESTES; βsizes of the predicted transcripts encoded by genes located closest to the ORESTES mapping site, • do not match any EST/cDNA sequence available up to October 2006 in the NCBI database
Figure 3Molecular characterization of a new serine endopeptidase gene. A) Graphic representation of the genomic alignment of the new transcript (SP212) identified by ORE-5. The small arrows localize the primers used for cloning the SP212 cDNA. The asterisk indicates the primer that was based on the ORE-5 sequence. CG-33329 was electronically annotated. B) Sequence of the SP212 cDNA and the amino-acid sequence encoded in the largest open reading frame (GenBank accession number: EF108315). Grey boxes mark the motifs containing the SP catalytic residues, which are underlined.
Figure 4SP212 was probably originated through gene duplication during evolution of SP family. The phylogram is based on multiple alignment of the amino acid sequences of the catalytic domain of SP212 and other 3 SPs and 1 SPH of Drosophila. These sequences are from proteins that presented the highest similarity with SP212 catalytic domain sequence (e-value < -20) in searches against the NCBI databank, using Blastx. The alignment was performed using Clustalw (20). SP55, SPH144 and SP60, as well SP212, localize in the same region (88A12-B1) of chromosome 2R, whereas SP186 is located in chromosome X.
Figure 5Induction of the SP212 mRNA in adult flies after infection with different microorganisms. Northern blot analysis of total RNA extracted from adult flies at different times (3, 24 and 48 hours) after challenging by pricking with a needle dipped into 109 cells/ml cultures of either Gram+ (S.aureus) or Gram- (E.coli) bacteria or fungi (A.fumigattus). Note that asceptic pricking by itself triggers the induction of the SP212, albeit at a lower level.