Literature DB >> 10908584

Pre-messenger RNA processing factors in the Drosophila genome.

S M Mount1, H K Salz.   

Abstract

Entities:  

Mesh:

Substances:

Year:  2000        PMID: 10908584      PMCID: PMC2180228          DOI: 10.1083/jcb.150.2.f37

Source DB:  PubMed          Journal:  J Cell Biol        ISSN: 0021-9525            Impact factor:   10.539


× No keyword cloud information.
In eukaryotes, messenger RNAs are generated by a process that includes coordinated splicing and 3′ end formation. Factors essential for the splicing of mRNA precursors (pre-mRNA) in eukaryotes have been identified primarily through the study of nuclear extracts derived from mammalian cells and Saccharomyces cerevisiae genetics. Here, we identify homologues of most known pre-mRNA processing factors in the recently completed sequence of the Drosophila genome. The set of proteins required for RNA processing shows remarkably little variation among eukaryotic species, and individual proteins are highly conserved. In general, proteins involved in the mechanics of RNA processing are even more conserved than proteins involved in the interpretation of RNA processing signals. The genome does not appear to contain a gene for the U11 RNA, or for a protein unique to the U11 snRNP, which raises the possibility that the U12-dependent spliceosome functions without U11 in Drosophila.

Introduction

Most RNA processing factors have been identified in either nuclear splicing extracts derived from mammalian cells or in Saccharomyces cerevisiae (Burge et al. 1999; Kambach et al. 1999; Minvielle-Sebastia and Keller 1999). However, Drosophila is extensively used for genetic investigations of complex and regulated splicing. In this review, we survey the recently complete Drosophila sequence (Adams et al. 2000) for sequences related to factors identified in these other systems. In many cases, functional data for the Drosophila protein are not available, and our assignments are based on the best match among genomes. We have not included genes that have been identified in Drosophila for which there is evidence of a role in splicing (see, for example, the list presented in Burnette et al. 1999). This analysis yields a list of 27 genes that encode small nuclear RNAs (snRNAs; see Table ) and a list of 99 genes that encode proteins involved in RNA processing (see Table ). Our survey confirms that the components of the RNA processing machinery are highly conserved. Very few factors identified in other species are absent from the Drosophila genome. In general, the Drosophila proteins are more closely related to their vertebrate counterparts than to the Saccharomyces cerevisiae proteins.
Table 1

Genes for Drosophila snRNAs Involved in Splicing

Drosophila geneAccessionCytological position
U1a-21DAE00358821D
U1a-82EbX5354282E
U1a-82EcAE00360482E
U1a-95CaAE00374595C
U1a-95CbAE00374595C
U1c-95CcAE00374595C
U2-14BAE00350114B
U2-34ABaAE00363934A-34B
U2-34ABbAE00363934A-34B
U2-34ABcAE00363934A-34B
U2-38ABaAE00366438A5-38B4
U2-38ABbAE00366438A5-38B4
U4-38ABAE00366438A5-38B4
U4-39BAE00366939B1-39B3
U4-25FAE00361025F
U5-14BAE00350114B
U5-23DAE00358123D
U5-34AAE00363934A-34B
U5-35DAE00364835D4
U5-38ABaAE00366438A5-38B4
U5-38ABbAE00366438A5-38B4
U5-63BCAE00347763BC
U6-96AaAE00374896A
U6-96AbAE00374896A
U6-96AcAE00374896A
U4atac-82EAE00360382E
U6atac-29BAE00362129B
U12AE00352673B
Table 2

Genes for Drosophila Proteins Involved in RNA Processing

Drosophila geneAccession No.ProteinCytologyHuman homologues,E valueYeast (S.c.), E value
snRNP proteins
Core proteins
CG5352AAF52947SmB31ENP_003082, 2E-38SMB1, NP_010946, 2E-10
CG10753AAF49893SmD169DNP_008869, E-36SMD1, NP_011588, 5E-11
CG1249AAF54135SmD283E5NP_004588, 9E-46SMD2, NP_013377, 5E-25
SmD3 (CG8427)AAF58566SmD348F3NP_004166, E-37SMD3, NP_013248, E-20
CG18591AAF52576SmE28D7NP_003085, 8E-33SME1, NP_014802, E-17
DebB (CG16792)AAF58559SmF48F3NP_003086, 6E-37SMX3, NP_015508, 3E-16
CG9742AAF48634SmG14F6NP_003087, 7E-27SMX2, NP_011170, 7E-16
CG4279AAF46688LSM157C3CAB45865, E-43LSM1, NP_01241, 7E-19
CG10418AAF49929LSM269BCAAD56226, 3E-46LSM2, NP_009527, 8E-28
CG5926AAF56219LSM395D7CAB45866, 6E-29LSM3, NP_013543, E-05
CG17764AAF52939LSM431BNP_036453, 3E-70LSM4, NP_011037, E-10
CG6610AAF50703LSM565A6NP_036454, 5E-35LSM5, NP_011073, E-13
CG9344AAF46645LSM657B11NP_009011, 8E-33LSM6, NP_010666, 7E-09
CG13277AAF53562LSM736A8AAD56231, 8E-38LSM7, NP_014252, 3E-16
CG2021AAF47567LSM862ABAAD56232, E-34LSM8, NP_012556, E-05
U1 snRNP
snRNP70K (CG8749)AAF52471U1-70k27C9NP_003080, 2E-62Snp1, NP_012203, 7E-31
CG5454AAF55616U1C91E2NP_003084, 8E-28Yhc1, NP_013401, 4E-06
snf (CG4528)*P43332U1A4F2NP_004587, 7E-32Mud1, NP_009677, 6E-04
CG7564 AAF49330Luc7-like74C1BAA91737, 4E-84Luc7, NP_010196, 5E-24
CG3198 AAF46183Related to Luc7,6C8BAA90542, 3E-06Luc7, NP_010196, 3E-06
CG1646 AAF56816Prp39/Prp42-like98F4BAA92024, 2E-30; BAA91318, 9E-23Prp39, NP_013667, 2E-13; Prp42, NP_010521, 1E-06
CG3542AAF51161Prp40-like23CFBP11, AAD39463, 2E-98Prp40, NP_012913, 3E-14
Rox8, CG5422AAF56225Similar to Nam8 See text95DNP_003243, E-82Pub1p, NP_014382, E-28 Nam8, NP_011954, E-19
U2 snRNP
snf (CG4528)*P43332U2B′′4F2NP_003083, 7E-68Yib9, NP_012274, 3E-05
CG1406AAF59207U2A′43E8NP_003081, 7E-54Lea1, NP_015111, 5E-05
noi (CG2925)CAA11045SF3a60/SAP6183B4NP_006793, E-180Prp9, NP_010254, 3E-31
CG10754AAF49876SF3a66/SAP6269E2NP_038679, E-100Prp11, NP_010241, 1E-13
CG16941AAF55372SF3a120/SAP11489E3-4NP_005868, E-155Prp21, NP_012332, 2E-11
Spx (CG3780)AAF46136SF3b53/SAP495E1NP_005841, E-109Hsh49, NP_014964, 1E-28
CG3605AAF51159SF3b150/SAP14523CNP_006833, E-178Cus1, NP_013967, 5E-38
CG13900AAF47416SF3b120/SAP13061D4-E1CAB56791, E = “0”Rse1, NP_013663, 8E-78
CG2807AAF51478SF3b160/SAP15521C6NP_036565, E = “0”Ymr288w, NP_014015, E = “0”
CG4291AAF51446FBP21-like21D2FBP21, AAC34810, 7E-28Prp40, NP_012913, 0.078
U5 snRNP
CG8877AAF58573220 kD48F1NP_006436, E = “0”Prp8, NP_012035, E = “0”
CG5205AAF55204200 kD88F8CAA94089, E = “0”Ygr271w, NP_011787, E = “0”; Brr2, NP_011099, E = “0”
CG3436AAF5153640 kD21B2NP_004805, E-120unknown
CG4849AAF56769116 kD98B8NP_004238, E = “0”Snu114, NP_012748, E-156
CG6841AAF49211Prp6-like75E1BAA37140, E = “0”Prp6, NP_009611, 6E-74
CG10333AAF53680100 kD37A4NP_004809, E = “0”Prp28, NP_010529, 2E-92
CG3058AAF5101715 kD25A1NP_006692, 5E-78Dib1, NP_015407, 3E-48
TrisnRNP
CG7757AAF49097SAP9080hNP_004689, E-144Prp3, NP_010761, 1E-23
CG6322AAF49331SAP6074C1NP_004688, E-138Prp4, NP_015504, 9E-56
Hoi-Polloi (CG3949)AAF2020915.5 kD30C7NP_004999, 4E-39Snu13, NP_010888, 9E-32
CG17266AAF5737520 kD cyclophilin42C5NP_006338, 4E-80unknown
Prespliceosome proteins
U2af38 (CG3582)Q94535U2AF small subunit21B7NP_006749, 5E-88unknown
CG3294AAF50982Related to U2AF small subunit25B4-B5BAA08533, 1E-20 U2AF1-RS2unknown
U2af50 (CG9998)Q24562U2AF large subunit14C1NP_009210, E-146Mud2, NP_012849, 0.22
CG5836CAB64937SF1/BBP90B1CAA03883, E-106Bbp1, Msl5, NP_013217, 4E-62
CG12357AAF55496CBP2090E3-4CBP20, NP_031388, 4E-63Mud13, NP_015147, 6E-37
Cbp80 (CG7035)AAF45970CBP804C9-10CBP80, NP_002477, E= “0”GCR3, NP_013844, 3E-23
CG7907AAF55893CBP80-like36CCBP80, NP_002477, 1E-96GCR3, NP_013844, 1E-10
Hel25E (CG7269)AAF52261UAP5625F1NP_004631, E = “0”Sub2, NP_010199, E-141
CG6227AAF48446Prp5-like13C5NP_055644, E = “0”Prp5, NP_009796, 4E-94
Catalytically active spliceosome proteins
CG6876AAF49655Prp31-like71B2CAB43677, E-141Prp31, NP_011605, 3E-22
CG6905AAF47383pombe, cdc5-related61CAAB61210, E = “0”Cef1, NP_013940, 4E-49
CG10689AAF53766Prp2-like37CNP_003578, E = “0”Prp2, NP_014408, E-172
CG8241AAF58294Prp22-like50EHRH1, NP_004932, E = “0”Prp22, NP_010929, E = “0”
CG1405AAF48355Prp16-like12D1NP_054722, E = “0”Prp16, NP_013012, E-160
CG6015AAF55949Prp17-like93F14AAC39730, E = “0”Prp17, NP_010652, 1E-76
CG1420AAF56845Slu7-like98F13NP_006416, E-128Slu7, NP_010373, 1E-12
Prp18 (CG6011)AAF55627Prp18-like91E4NP_003666, 4E-93Prp18, NP_011520, 4E-05
SR proteins
SC35 (CG5442)AAF53192SC3533DESC35, NP_035488, 5E-53Npl3, NP_010720, 3E-12
ASF/SF2 (CG6987)AAF55300ASF/SF289CASF/SF2, NP_008855, 2E-80Npl3, NP_010720, 7E-15
B52 (CG10851)AAF54968B5287FSRp55, CAB43960, 2E-96; SRp75, NP_005617, 3E-96; Srp40/HRS, Q13243, E-78Npl3, NP_010720, 3E-14
9G8 (CG10203)AAF524549G827C9G8, NP_006267, 4E-42Npl3, NP_010720, 7E-05
RBP1 (CG17136)AAF54555RBP186C12SRP20, NP_003008, 2E-38Nsr1, NP_011675, E-07
RBP1-like (CG1987)AAF48264RBP1-like11F3SRP20, NP_003008, 4E-31Nsr1, NP_011675, 4E-14
SRp54 (CG4602)AAF52825SRp5430DESRp54, NP_004759, 2E-41unknown
SR protein kinases
CG8174AAF58140SRPKD152A1SRPK1, NP_003128, 9E-76; SRPK2, NP_003129, 3E-77Sky1, NP_0013943, 5E-44
CG9085AAF51819SRPKD279E4SRPK1, NP_003128, 2E-63Sky1, NP_0013943, 3E-49
SRPK2, NP_003129, 3E-66
CG8565AAF48523SRPKD313F3-4SRPK1, NP_003128, 8E-50; SRPK2, NP_003129, 3E-50Sky1, NP_0013943, 2E-70
Doa (CG1658)AAF56832LAMMER/CLK kinase98F6CLK2, NP_003984, E-151Kns1, NP_013081, 5E-61
Miscellaneous proteins
Crn (CG3193)AAF45760crn/CLF12F1NP_057736, E = “0”Clf1, NP_013218, E-123
Puf60 (CG12085)AAF47501PUF60 (poly-U binding)62AAAF05605, 1E-81unknown
nonA-1 (CG10328)AAF55347PSF89C7NP_005057, 5E-62unknown
PTB (CG2094)AAF57208PTB/hnRNP I100FNP_002810, E-150unknown
CG11107AAF59269Prp43-like43B2NP_001349, E = “0”Prp43, NP_011395, E = “0”
CG11274AAF49848Srm160-like69F2NP_005830, 2E-43unknown
CG7971AAF47543Srm300-like62B4AAF21439, 2E-37unknown
CG16725AAF49446Similar to SMN73A5NP_035550, 1E-05unknown
CG17454AAF45352SPF30 SMN-related80C-82BNP_005862, 3E-37unknown
CG10419AAF49187SMN-interacting (SIP1)75F3NP_003607, 2E-18Brr1, NP_015382, E = 0.05
CG7942AAF50486Debranching enzyme66B9-10AAD53327, E-113Dbr1/Prp26, NP_012773, 2E-62
CG11266AAF52478CC1.3; splicing factor domains only27D3NP_004893, E-123unknown
BcDNA:GH01073 (UKL; CG8108)AAF56342Splicing factor domains only96B1AC023603, E-39unknown
RSF1 (CG5655)AAF52890Repressive splicing factor31C5unknownunknown
Cleavage and polyadenylation
General
CG9854AAF57528PolyA polymerase56D11P51003, E-173Pap1, P29468, E-102
PAbp (CG5119)AAF57747PolyA binding protein55B9-10P29341, E-176 (mouse)Pab1, AAA34838, E-106
Pabp2 (CG2163)AAF59127PolyA binding protein II44BNP_004634, 2E-54Sgn1, NP_012266, 2E-18
CG4612AAF47219Possible polyA binding protein60DCAA15498, 8E-46Pab1, AAA34838, 2E-24
CPSF
CG10110AAF58240CPSF-160 kD51A7Q10570, E = “0”Cft1, NP_010587, 4E-37
BcDNA:LD14168 (CG1957)AAF56844CPSF-100 kD98F13Q10568, E = “0”, bovineYsh1, NP_013379, E-32
CG7698AAF55578CPSF-73 kD91B7AAB70268, E = “0”Ysh1, NP_013379, E-146
CG1972AAF56931CPSF-73–kD variant99C2AAF00224, 7E-93Ysh1, NP_013379, E-73
CG5222AAF49538Related to CPSF-100 and -7372D5BAA91867, E = “0”; AAB67601, E-113Ysh1, NP_013379, 6E-14
Clp (CG3642)AAF51453CPSF-30 kD21D2AAD00321, 8E-89Yth1, NP_015432, 1E-34
CstF
Su(f) (CG17170)AAF45314CstF-77 kD20ENP_001317, E = “0”Rna14, NP_013777, 2E-47
CstF-64 (CG7697)AAF55577CstF-64 kD91B7NP_001316, E-71Rna15, NP_011471, 2E-18
CstF-50 (CG2261)AAF57183CstF-50 kD100E1-2NP_001315, E-129unknown
CG2097AAF51962Symplekin83C1NP_004810, E-173Pta1, NP_009356, 5E-04
Cleavage factors I & II
CG3689AAF5027825-kD subunit67A6NP_008937, E-102unknown
CG7185AAF5044568-kD subunit66C5NP_008938, 3E-34unknown
CG10228AAF46699Factor IA51D2-4KIAA0824, 7E-41Pcf11, NP_010514, 3E-9

(Table continues)

(Table continues)

Methods

Protein sequences of known yeast and human splicing factors were used to query the annotated set of predicted Drosophila proteins using BLASTP, and the nucleotide sequence of the genome using tblastn, on the NCBI server (Altschul et al. 1997; http://www.ncbi.nlm.nih.gov/). All identified Drosophila genes were used to query the nonredundant database to establish the optimal yeast and human matches. Alignments were generated using the blast two sequences option (Tatusova et al., 1999) or LALIGN (Huang and Miller 1991). Cytological positions were taken from GadFly (http://hedgehog.lbl.gov:8000) or flybase (http://flybase.bio.indiana.edu/), or deduced from the positions of flanking genes. To identify snRNA genes, the Drosophila genome was queried using modified blastn parameters (parameter set A: -r 10 -q -11 -W 8 -G 100 -E 50; B: -r 10 -q -11 -W 7 -G 5 -E 20; C: -r 10 -q -11 -W 7 -G 15 -E 4; D: -r 7 -q -14 -W 7 -G 7 -E 3; and E: -r 4 -q -5 -W 8 -G 10 -E 2). A curated database containing these results will be available at http://www.wam.umd.edu/~smount/DmRNA factors/table.html.

Results and Discussion

Major and minor snRNP components

U snRNAs.

Two types of spliceosomes have been previously described (Burge et al. 1999). The more common U2-type spliceosome is responsible for splicing the majority of introns, and the U12-type spliceosome is responsible for splicing a minor class of rare introns (perhaps 0.1% in both humans and flies). The Drosophila genome contains multiple copies of the 5 U snRNAs found in the major class spliceosomes. We found five genes for U1, six genes for U2, three genes for U4, seven genes for U5, and three genes for U6 (Table ). With the exception of U4-25F, and the U5 genes (which were previously known only by in situ hybridization), these genes had been described previously (Alonso et al. 1984; Das et al. 1987; Saba et al. 1986; Saluz et al. 1988; Lo and Mount 1990). The variant U4-25F has only 69% identity with the major form of fly U4 (Saba et al. 1986), and 68% with human U4. Although the possibility that these new snRNA genes are pseudogenes cannot be ruled out, they appear likely to be functional because of their highly conserved promoters. In the case of U4-25F, some of the variation includes compensatory changes that allow formation of conserved stem loop structures. There are four clusters of snRNA genes, including one at 38AB with two U2, one U4, and two U5 genes within 6 kb. The Drosophila genome also contains introns that resemble the minor class (or U12) introns first identified in mammals (Adams et al. 2000). These are recognized by the U12-type spliceosome including U11, U12, U4atac, and U6atac snRNAs in place of U1, U2, U4, and U6 (Hall and Padgett 1994; Tarn and Steitz 1996). Identification of snRNAs for the U12-type spliceosome in the genome involved modification of the standard parameters for BLASTN (see Methods). It was possible to find one gene for U12 snRNA, one gene for U6atac snRNA, and one gene for U4atac snRNA. These are almost certainly authentic genes, as critical sequences are conserved. In addition, the highly conserved snRNA promoter is present in each case, including a 9/10 or perfect match to the PSE consensus TAATTCCCAA, which is ∼52 nucleotides upstream of the start (Jensen et al. 1998; Lo and Mount 1990). In contrast, no gene for the U11 snRNA was found. Consistent with the absence of a U11 snRNA, we also failed to find the U11 35-kD–specific protein (accession No. NP_008951; Will et al. 1999). In fact, the U11 snRNP, which functions in 5′ splice site recognition, may not be required for splicing. The highly conserved minor class 5′ splice sites could be recognized by an unknown protein that acts during the early steps of splicing, by the U6atac snRNA alone, or by both. This mechanism would be analogous to a situation seen in vitro, where certain vertebrate introns can be processed in the absence of U1 snRNP if the 5′ splice sites can be recognized by U6 snRNA (Crispino et al. 1994; Tarn and Steitz 1994).

snRNP Proteins.

Each snRNP contains a set of Sm core proteins shared with the other snRNPs and a set of proteins that are specific to that snRNP. All 15 known proteins of the Sm family were identified and are highly conserved. These include seven Sm proteins that bind to the U1, U2, U4, and U5 snRNPs (B, D1, D2, D3, E, F, and G); the seven related LSm proteins found in the U6 snRNP (LSM2-LSM8); and the CaSm/LSM1p protein (Bouveret et al. 2000; Tharun et al. 2000). These and subsequent matches are shown in Table . The table reports, for each Drosophila gene, the GenBank accession number and expectation value (the expected number of matches this good or better; Altschul et al. 1997) for the best human and yeast (Saccharomyces cerevisiae) match. In addition to the Sm proteins, each snRNP also contains a set of snRNP-specific proteins. As expected, orthologues of the proteins that are contained in both the vertebrate and Saccharomyces cerevisiae snRNPs are easily identified in Drosophila, based on their extensive sequence homology, except that a single Drosophila protein, encoded by the sans-fille (snf) gene, corresponds to both the U1 snRNP-U1A and U2 snRNP-U2B′′ proteins (Polycarpou-Schwarz et al. 1996; Stitzinger et al. 1999), and no additional homologues were found. Interestingly, the Saccharomyces cerevisiae U1 snRNP is more complex than the vertebrate U1 snRNP, with seven additional protein components that are not found in the purified vertebrate U1 snRNP (Gottschalk et al. 1998; Rigaut et al. 1999). Only two of these proteins, Luc7 and Prp40, have easily identifiable Drosophila orthologues. The Drosophila ortholog of Luc7, CG7564, is 33% identical to the entire Luc7 protein (Fortes et al. 1999), and a second Drosophila Luc7-related protein is 21% identical to the yeast protein (Fortes et al. 1999). We identified a single Prp40-like gene in the Drosophila database. CG3542 shares 23% identity with the entire yeast Prp40 protein and 41% sequence identity over its entire length of 757 amino acids with the human protein, FBP11. FBP11 was initially identified because it also contains a tyrosine-rich WW domain and like Prp40, it interacts with the splicing factor SF1 (Bedford et al. 1997). These observations suggest that the function of these proteins in forming bridges between 5′ splice sites and the branchpoint may be conserved (Abovich and Rosbash 1997). It is likely that these Drosophila proteins, like their human homologues, would not be found in purified U1 snRNPs, but, nevertheless, do share a function with their yeast counterparts. A second human Prp40-like protein, FBP21, has been described in the literature (Bedford et al. 1998). FBP21 is more closely related to the Drosophila CG4291, with 28% identity over the entire length of the 338–amino acid protein. Because similarity between FBP11/CG3542 and FBP21/CG4291 is limited to the WW repeats, FBP21/CG4291 is unlikely to be related to Prp40. Consistent with this idea, human FBP21 has been found to stably associate with the U2 snRNPs and, therefore, may function at a later stage of spliceosome assembly than does Prp40 (Bedford et al. 1998). Searches with the yeast U1 snRNP proteins Prp39 and Prp42 have identified only a single homologous sequence. Prp39 and Prp42 belong to a family of TPR repeat proteins (McLean and Rymond 1998) and share 25% sequence identity with each other over a ∼270–amino acid region that includes several copies of the TPR repeat motif. We identified a single Drosophila protein, encoded by the CG1646 gene, that shares 25% sequence identity with Prp39 and Prp42 over the same ∼270 amino acids, and is the best match between the Saccharomyces cerevisiae and Drosophila genomes. The Drosophila crooked neck protein (Crn) is another TPR repeat protein, it's yeast homologue has been shown to act later in spliceosome assembly (Chung et al. 1999). Surprisingly, there are three Saccharomyces cerevisiae U1 snRNP proteins that have no clear counterparts in the Drosophila database. No Drosophila proteins, whose best match in the S. cerevisiae genome is Snu71, Snu56 or Nam8, were found. Recent work on Snu56 and Nam8 suggests that these proteins contact the pre-mRNA directly and may anchor the U1 snRNP onto the substrate (Puig et al. 1999; Zhang and Rosbash 1999), a function that could be dispensable in metazoans because it could be provided by the SR proteins. Alternatively, a similar function may be provided by proteins, such as Drosophila rox8 in the case Nam8, that do not appear to be orthologues (Drosophila rox8 matches three other yeast proteins better than Nam8). Proteins in the U2 snRNP, U5 snRNP and U4/U6.U5 tri-snRNP are generally very conserved, and no significant differences between Drosophila and other species were revealed by our analysis (Table ).

Proteins Required for Splice Site Selection

SR proteins are splicing factors that contain either one or two characteristic RNA-binding domains and an RS domain. These proteins are among the earliest acting proteins in spliceosome assembly (Zahler et al. 1992; Graveley et al. 1999; Tacke and Manley 1999). There are 11 well characterized mammalian SR proteins: 9G8, SRp20, ASF/SF2, SC35, SRp30c, p54, SRp40, SRp55, SRp75 (for review see Mount 1997), NSSR1, and NSSR2 (Komatsu et al. 1999). Individual SR proteins differ with respect to the sequence specificity of their RNA-binding domains, and with respect to their ability to recognize and activate different exonic splicing enhancer sequences. We have identified seven SR protein genes in the Drosophila genome. These include the previously described B52, RBP1, SRp54 (Kennedy et al. 1998) and X16/9G8 (Vorbruggen et al. 2000) genes, as well as Drosophila orthologues of ASF/SF2 and SC35. In addition, we have identified a novel gene, CG1987, that is 95% identical to RBP1. Phosphorylation of SR proteins is thought to play an important role in controlling spliceosome assembly (Stojdl and Bell 1999; Yeakley et al. 1999). Both SRPK and LAMMER (or CLK) kinases phosphorylate SR proteins. We have identified three kinases of the SRPK type (CG8174, CG9085, and CG8565) and only one LAMMER kinase, the previously described Doa kinase (Du et al. 1998). A variety of proteins bind to pre-mRNA (also known as hnRNA), and many of these proteins, defined as hnRNP proteins, have been shown to influence splicing, typically by inhibition of splicing events near their binding sites (Chen et al. 1999). A number of Drosophila hnRNP proteins have been described (e.g., Matunis et al. 1992), and some nuclear RNA-binding proteins without clear homologues in mammalian species have unambiguous roles in the regulation of splicing (e.g., SXL). However, because it is impossible to determine from sequence alone whether a given RNA-binding protein is likely to function in splicing, or even to reside in the nucleus, we have not undertaken an analysis of these proteins. These proteins are discussed in the accompanying article by Lasko 2000.

Genome Contents: Parallels and Differences

The results of our search for RNA processing factors known from studies in mammalian extracts and Saccharomyces cerevisiae genetics indicate that very few RNA processing factors are absent from the Drosophila genome. Indeed, our survey reveals remarkably little variation in this list among yeast, flies, and mammals. As expected, Drosophila proteins are more closely related to their vertebrate counterparts than to the Saccharomyces cerevisiae proteins. The extensive conservation of the components of the spliceosome between vertebrates and Drosophila supports the suggestion that the primary mode of regulating splicing takes place at the level of spliceosome assembly (Lopez 1998; Staley and Guthrie 1998). Some of these factors, such as SR proteins, which regulate assembly of the spliceosome on many different RNAs, are well conserved and are easily identifiable. Missing from these tables are the factors that regulate the splicing of specific RNAs. These factors are less likely to be well conserved and, indeed, some may prove to be organism specific (e.g., SXL and TRA). Even the variation we observe among proteins and RNAs with clearly established roles in splicing, per se, is weighted towards the early events in splice site selection. For example, the Drosophila genome is missing a set of U1 snRNP proteins that are found in the yeast U1 snRNP but not in the vertebrate U1 snRNP, and the genome does not appear to contain a gene for the U11 RNA, or for the single known protein unique to the U11 snRNP, suggesting that U12 functions without U11 in Drosophila. Here again, variation is observed in components that function in splice site selection.
  46 in total

1.  Identification of eight proteins that cross-link to pre-mRNA in the yeast commitment complex.

Authors:  D Zhang; M Rosbash
Journal:  Genes Dev       Date:  1999-03-01       Impact factor: 11.361

2.  Interaction of the U1 snRNP with nonconserved intronic sequences affects 5' splice site selection.

Authors:  O Puig; A Gottschalk; P Fabrizio; B Séraphin
Journal:  Genes Dev       Date:  1999-03-01       Impact factor: 11.361

3.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences.

Authors:  T A Tatusova; T L Madden
Journal:  FEMS Microbiol Lett       Date:  1999-05-15       Impact factor: 2.742

Review 4.  Structure and assembly of the spliceosomal small nuclear ribonucleoprotein particles.

Authors:  C Kambach; S Walke; K Nagai
Journal:  Curr Opin Struct Biol       Date:  1999-04       Impact factor: 6.809

5.  Trans-acting factors required for inclusion of regulated exons in the Ultrabithorax mRNAs of Drosophila melanogaster.

Authors:  J M Burnette; A R Hatton; A J Lopez
Journal:  Genetics       Date:  1999-04       Impact factor: 4.562

Review 6.  Alternative splicing of pre-mRNA: developmental consequences and mechanisms of regulation.

Authors:  A J Lopez
Journal:  Annu Rev Genet       Date:  1998       Impact factor: 16.830

7.  Binding of hnRNP H to an exonic splicing silencer is involved in the regulation of alternative splicing of the rat beta-tropomyosin gene.

Authors:  C D Chen; R Kobayashi; D M Helfman
Journal:  Genes Dev       Date:  1999-03-01       Impact factor: 11.361

8.  A role for SRp54 during intron bridging of small introns with pyrimidine tracts upstream of the branch point.

Authors:  C F Kennedy; A Krämer; S M Berget
Journal:  Mol Cell Biol       Date:  1998-09       Impact factor: 4.272

9.  Protein phosphorylation plays an essential role in the regulation of alternative splicing and sex determination in Drosophila.

Authors:  C Du; M E McGuffin; B Dauwalder; L Rabinow; W Mattox
Journal:  Mol Cell       Date:  1998-12       Impact factor: 17.970

10.  Phosphorylation regulates in vivo interaction and molecular targeting of serine/arginine-rich pre-mRNA splicing factors.

Authors:  J M Yeakley; H Tronchère; J Olesen; J A Dyck; H Y Wang; X D Fu
Journal:  J Cell Biol       Date:  1999-05-03       Impact factor: 10.539

View more
  53 in total

1.  The KH-type RNA-binding protein PSI is required for Drosophila viability, male fertility, and cellular mRNA processing.

Authors:  Emmanuel Labourier; Marco Blanchette; Jennie W Feiger; Melissa D Adams; Donald C Rio
Journal:  Genes Dev       Date:  2002-01-01       Impact factor: 11.361

2.  The Drosophila U2 snRNP protein U2A' has an essential function that is SNF/U2B" independent.

Authors:  A A Nagengast; H K Salz
Journal:  Nucleic Acids Res       Date:  2001-09-15       Impact factor: 16.971

3.  Domains of human U4atac snRNA required for U12-dependent splicing in vivo.

Authors:  Girish C Shukla; Andrea J Cole; Rosemary C Dietrich; Richard A Padgett
Journal:  Nucleic Acids Res       Date:  2002-11-01       Impact factor: 16.971

4.  snRNA 3' end formation requires heterodimeric association of integrator subunits.

Authors:  Todd R Albrecht; Eric J Wagner
Journal:  Mol Cell Biol       Date:  2012-01-17       Impact factor: 4.272

5.  Proteomic and functional analysis of the mitotic Drosophila centrosome.

Authors:  Hannah Müller; David Schmidt; Sandra Steinbrink; Ekaterina Mirgorodskaya; Verena Lehmann; Karin Habermann; Felix Dreher; Niklas Gustavsson; Thomas Kessler; Hans Lehrach; Ralf Herwig; Johan Gobom; Aspasia Ploubidou; Michael Boutros; Bodo M H Lange
Journal:  EMBO J       Date:  2010-09-03       Impact factor: 11.598

6.  Systematic identification of factors involved in post-transcriptional processes in wheat grain.

Authors:  Sergiy Lopato; Ljudmilla Borisjuk; Andrew S Milligan; Neil Shirley; Natalia Bazanova; Kate Parsley; Peter Langridge
Journal:  Plant Mol Biol       Date:  2006-08-29       Impact factor: 4.076

7.  Genes required for Drosophila nervous system development identified by RNA interference.

Authors:  Andrej I Ivanov; Alessandra C Rovescalli; Paola Pozzi; Siuk Yoo; Brian Mozer; Hsi-Ping Li; Shu-Hua Yu; Haruhiro Higashida; Vicky Guo; Michael Spencer; Marshall Nirenberg
Journal:  Proc Natl Acad Sci U S A       Date:  2004-11-08       Impact factor: 11.205

8.  Identification of alternative splicing regulators by RNA interference in Drosophila.

Authors:  Jung W Park; Katherine Parisky; Alicia M Celotto; Robert A Reenan; Brenton R Graveley
Journal:  Proc Natl Acad Sci U S A       Date:  2004-10-18       Impact factor: 11.205

9.  Use of RNA interference to dissect the roles of trans-acting factors in alternative pre-mRNA splicing.

Authors:  Jung W Park; Brenton R Graveley
Journal:  Methods       Date:  2005-12       Impact factor: 3.608

10.  The Drosophila U1-70K protein is required for viability, but its arginine-rich domain is dispensable.

Authors:  Helen K Salz; Ricardo S Y Mancebo; Alexis A Nagengast; Olga Speck; Mitchell Psotka; Stephen M Mount
Journal:  Genetics       Date:  2004-12       Impact factor: 4.562

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.