| Literature DB >> 25866769 |
Guang Yang1, Wei Li2, Wenzhen Liao2, Xin Zhang2, Yi Zou2, Jianfeng Dai2, Yueqin Li2, Chunxia Jing3, Tianhong Zhou2.
Abstract
The UL49 ORF of human cytomegalovirus (HCMV) is essential for viral replication; conserved among all herpes viruses; however, the function is unclear. Once the UL49 ORF was precisely deleted from the start to stop codon, the mutant did not yield infectious progeny. In this study, we find out many alternatively processed ESTs in UL49 locus in HCMV-infected cells, in which there are two novel transcription termination sites in UL49 locus. Most of these ESTs are rare transcripts that contain directed repeat sequences in the intron splicing regions. There is a typical GU-AG intron splicing site in UL49Y transcripts. The 1847 bp UL49Y cDNA spans an ORF from 335 to 1618 and encodes a putative protein of 427 amino acids with a predicted molecular mass of 47.1 kDa. All the new EST sequences and UL49Y cDNA sequence have been deposited in the GenBank database (GenBank Accession nos. GW314860-GW314900 and GU376796). This study provides us with very important clues for revealing the importance of the UL49 locus alternative splicing.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25866769 PMCID: PMC4383306 DOI: 10.1155/2015/280276
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
List of primers used in this paper.
| Primer | Nucleotide sequence (5′-3′) | Corresponding to the nucleotide position of HCMV Towne genome |
|---|---|---|
| Primer 1 | CTGCCAGCAAAACTTTCCGCT | 70442–70422 |
| Primer 2 | TCGGGACCGTCAAGAAAAGAGCG | 70403–70381 |
| Primer 3 | TGGTGCTGGCTCTCCTGCTGGTGC | 71653–71630 |
| Primer 4 | GCATGTAGCCGACCTGCTGAAAGGC | 68717–68741 |
| Primer 5 | ATGGCCAATCGCCGTCTCCGACAC | 71608–71585 |
| Primer 6 | AGACCGCGACTTCCTCCGCATCCA | 68783–68806 |
| Primer 7 | GTTACAAAACAACGTATCACTTTTACGG | 69612–69639 |
| Primer 8 | CTTGGGCTGTCAGCGCCGGGTG | 69667–69688 |
| 3′RACE Outer primer | TACCGTCGTTCCACTAGTGATTT | 3′-Full RACE Core Sets kit supply |
| 3′RACE Inner primer | CGCGGATCCTCCACTAGTGATTTCACTATAGG | 3′-Full RACE Core Sets kit supply |
Figure 1Organization and transcription summary of HCMV Towne UL49 locus. (a) Nucleotide positions correspond to the genome sequence under GenBank Accession no. AY315197.2. (b) ORF map summary of the UL48, UL48A, UL49, and UL50. (c) cDNAs and the location of polyadenylation signals and polyadenylation site are identified in group B. (d) cDNAs and the location of polyadenylation signals polyadenylation site are identified in group A. (e) Primers (Table 1) corresponding to the nucleotide position of HCMV Towne genome used in this paper.
Sequences of novel HCMV UL49 alternatively spliced RNA fragments by 3′RACE.
| Name of novel transcript | HCMV sequence (nt)a | Nested PCR product length (bp) | Intron length (bp) | Splice donor ( | Splice acceptor (intron/ | GenBank accession number |
|---|---|---|---|---|---|---|
| UL49RA1 | 70403–70116; | 619 | 1156 |
| GTCCTG | GW314860 |
| UL49RA2 | 70403–70271; | 177 | 1598 |
| AGCC | GW314861 |
| UL49RA3 | 70403–70138; | 1456 | 189 |
| GCG | GW314862 |
| UL49RA4 | 70403–70121; | 624 | 1151 |
| TGTGG | GW314863 |
| UL49RB1 | 70403–70146; | 650 | 144 |
| GCGCGC | GW314864 |
| UL49RB2 | 70403–70163; | 724 | 70 |
| G | GW314865 |
| UL49RB3 | 70403–70207; | 365 | 429 |
| GTTGGC | GW314866 |
| UL49RB4 | 70403–70090; | 611 | 183 |
| C | GW314867 |
| UL49RB5 | 70403–70133; | 519 | 275 |
| AA | GW314868 |
| UL49RB6 | 70403–70138; | 605 | 189 |
| GCG | GW314869 |
These RNA fragments were acquired by nested PCR with Primer 1 and 3′RACE Outer primer for outer PCR and Primer 2 and 3′RACE Inner prime for inner PCR. aNumbering refers to HCMV genomic sequences (GenBank Accession no. AY315197.2). bUnderlines indicate direct repeat sequences.
Novel sequences of HCMV UL49 alternatively spliced RNA fragments (group A).
| Name of novel transcript | HCMV sequence (nt)a | Nested PCR product length (bp) | Intron length (bp) | Splice donor ( | Splice acceptor (intron/ | GenBank accession number |
|
| ||||||
| UL49A1 | 71608–71124; | 706 | 2120 |
| TG | GW314870 |
| UL49A2 | 71608–71530; | 1377 | 1449 |
| CTGTG | GW314871 |
| UL49A3 | 71608–71321; | 614 | 2212 |
| GGAGCGG | GW314872 |
| UL49A4 | 71608–71490; | 1338 | 1488 |
| GCGC | GW314873 |
| UL49A5 | 71608–71288; | 502 | 2324 |
| TGTGG | GW314874 |
| UL49A6 | 71608–71457; | 381 | 2355 |
| CGCCAGAC | GW314875 |
| UL49X | 71608–71530; | 485 | 2341 |
| GAAC | GW314876 |
| UL49A8 | 71608–71416; | 519 | 2307 |
| GGA | GW314877 |
| UL49A9 | 71608–71565; | 419 | 2407 |
| CAGAA | GW314878 |
| UL49A10 | 71608–71562; | 715 | 2111 |
| GC | GW314879 |
| UL49A11 | 71608–71321; | 495 | 2331 |
| GGGCGG | GW314880 |
| UL49A12 | 71608–71515; | 454 | 2372 |
| GACTGG | GW314881 |
| UL49A13 | 71608–71519; | 265 | 2561 |
| CCTG | GW314882 |
| UL49A14 | 71608–71478; | 580 | 2246 |
| G | GW314883 |
| UL49A15 | 71608–71561; | 1961 | 865 |
| CGCG | GW314884 |
| UL49A16 | 71608–71276; | 646 | 95C
|
| ACGTCTACAG/ | GW314885 |
| UL49A17 | 71608–71276; | 760 | 95C
|
| ACGTCTACAG/ | GW314886 |
| UL49A18 | 71608–71276; | 909 | 95C
|
| ACGTCTACAG/ | GW314887 |
| UL49A19 | 71608–71276; | 792 | 95C
|
| ACGTCTACAG/ | GW314888 |
These RNA fragments were amplified by nested PCR with Primer 3 and Primer 4 for outer PCR and Primer 5 and Primer 6 for inner PCR. aNumbering refers to HCMV genomic sequences (GenBank Accession no. AY315197.2). bUnderlines indicate direct repeat sequences. CIntrons conform to the GU-AG rule.
Sequences of novel HCMV UL49 alternatively spliced RNA fragments (group B).
| Name of novel transcript | HCMV sequence (nt)a | Nested PCR product length (bp) | Intron length (bp) | Splice donor ( | Splice acceptor (intron/ | GenBank accession number |
|---|---|---|---|---|---|---|
| UL49B1 | 71608–71179; | 861 | 1081 |
| CGCACGG | GW314889 |
| UL49B2 | 71608–71548; | 1086 | 856 |
| GAGG | GW314890 |
| UL49Y | 71608–71276; | 1847 | 95C |
| ACGTCTACAG/ | GU376796 |
| UL49B4 | 71608–71282; | 758 | 1184 |
| CGCACGGGC | GW314891 |
| UL49B5 | 71608–71527; | 437 | 1505 |
|
| GW314892 |
| UL49B6 | 71608–71284; | 610 | 1332 |
| CAGGC | GW314893 |
| UL49B7 | 71608–71576; | 533 | 1409 |
| CGGCA | GW314894 |
| UL49B8 | 71608–71501; | 410 | 1532 |
| GCGGCCA | GW314895 |
| UL49B9 | 71608–71543; | 407 | 1535 |
| GGGG | GW314896 |
| UL49B10 | 71608–71533; | 335 | 1607 |
| TTC | GW314897 |
| UL49B11 | 71608–71341; | 733 | 1209 |
| GG | GW314898 |
| UL49B12 | 71608–71276; | 908 | 95C
|
| ACGTCTACAG/ | GW314899 |
| UL49B13 | 71608–69667 | 1942 | — | — | — | GW314900 |
aNumbering refers to HCMV genomic sequences (GenBank Accession no. AY315197.2).
bUnderlines indicate direct repeat sequences. CIntrons conform to the GT-AG rule.
—: There is no intron in the transcript of UL49B13.
Figure 4The summarize of all the alternative splicing transcripts founded in UL49 Locus. Primer 1–Primer 8 are the primers used in the research. The GW314850-GW314900 is the GenBank Accession number and the situations are the same in Tables 2, 3, and 4. The GU376796 and the pound symbol are the UL49Y. The numbers in x-axis mean the HCMV genome situation (AY315197.2).
Figure 2Agarose gel electrophoresis of differential transcripts of alternatively spliced UL49 cDNAs. Lane M: DNA Marker; Lane 1: cells uninfected by virus; Lanes 2 to 7: RT-PCR from HCMV-infected HDFn poly(A) RNA at 2, 6, 12, 48, 72, and 96 h postinfection, respectively; Lane 8: poly(A) RNA harvested at 72 h postinfection in the presence of inhibitor of DNA replication phosphonoformic acid (100 μM); and Lane 9: poly(A) RNA harvested at 24 h postinfection in the presence of inhibitor of protein synthesis cyclohexamide (100 μg/mL) 1 h prior to infection. UL49X and UL49Y were indicated by arrows.
Figure 3Nucleotide and deduced amino acid sequence of the UL49Y gene. Numbers on the left refer to the first nucleotide in each corresponding line. An upstream inframe stop codon (TGA) at 5′ to the start codon is shaded in gray. An asterisk indicates the stop codon. The DNA sequence has been deposited in the GenBank database (GenBank Accession no. GU376796).