| Literature DB >> 16877819 |
Kert Mätlik1, Kaja Redik, Mart Speek.
Abstract
Transcription of transposable elements interspersed in the genome is controlled by complex interactions between their regulatory elements and host factors. However, the same regulatory elements may be occasionally used for the transcription of host genes. One such example is the human L1 retrotransposon, which contains an antisense promoter (ASP) driving transcription into adjacent genes yielding chimeric transcripts. We have characterized 49 chimeric mRNAs corresponding to sense and antisense strands of human genes. Here we show that L1 ASP is capable of functioning as an alternative promoter, giving rise to a chimeric transcript whose coding region is identical to the ORF of mRNA of the following genes: KIAA1797, CLCN5, and SLCO1A2. Furthermore, in these cases the activity of L1 ASP is tissue-specific and may expand the expression pattern of the respective gene. The activity of L1 ASP is tissue-specific also in cases where L1 ASP produces antisense RNAs complementary to COL11A1 and BOLL mRNAs. Simultaneous assessment of the activity of L1 ASPs in multiple loci revealed the presence of L1 ASP-derived transcripts in all human tissues examined. We also demonstrate that L1 ASP can act as a promoter in vivo and predict that it has a heterogeneous transcription initiation site. Our data suggest that L1 ASP-driven transcription may increase the transcriptional flexibility of several human genes.Entities:
Year: 2006 PMID: 16877819 PMCID: PMC1559930 DOI: 10.1155/JBB/2006/71753
Source DB: PubMed Journal: J Biomed Biotechnol ISSN: 1110-7243
Primers used for the detection of mRNAs, chimeric mRNAs, and L1 splice forms.
| mRNA | Forward primer | Reverse primer | Annealing temperature |
| AL711955 | CTTGTGGCAGAAGGGAGAAG | GCAGCAGAGAGGACTTTGG | 65°C |
| (L1) | TCTCAGACTGCTGTGCTA | GCAGCAGAGAGGACTTTGG | 60°C |
| GGAGAAAACAGGGCCACATA | CATGCTCAGAGTTCCAGCAA | 60°C | |
| GACCCTTTTGTCTCCCTTCC | CATGCTCAGAGTTCCAGCAA | 60°C | |
| L1- | CTGCTGTGCTAGCAATCAGC | CATGCTCAGAGTTCCAGCAA | 60°C |
| AAAGCGTTCCAGGTATTTTTG | GCTCTTCAGGGTGTTCCAAG | 55°C | |
| L1- | CTGCTGTGCTAGCAATCAGC | GCTCTTCAGGGTGTTCCAAG | 60°C |
| ACGGTCCAAAGGGAAACTCT | CCTTGTAGATTGCAGGCAGAC | 60°C | |
| L1- | CTGCTGTGCTAGCAATCAGC | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | CTAAGCAAGCCTGGGCAATG | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | TTCCCGGCTGCTTTGTTTAC | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | GGCTCCACCCAGTTCGAGCT | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | AGGCAGGCCTCCTTGAGCTCTG | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | AGGTGGAGCCTACAGAGGCAG | CCTTGTAGATTGCAGGCAGAC | 60°C |
| L1- | TGCAGAGGTTACTGCTGTCT | CCTTGTAGATTGCAGGCAGAC | 60°C |
| GGATTTCAAGGCAAGACCG | TTTGCACCTTCTTTTCCTGC | 55°C | |
| L1- | CTGCTGTGCTAGCAATCAGC | TAGGGTGATCCAGGTCCTCA | 60°C |
| CGCAAACATCAAACCAGATG | TACTGTGTGGTGGCCTGGTA | 60°C | |
| L1- | CTGCTGTGCTAGCAATCAGC | GCCTTCAAATGCAGGACTGT | 60°C |
| L1 II sp v | CTCCCCCAGCCTCGCTGC | GGTTCATCTCACTGGCTC | 60°C |
| L1 IV sp v | CTGCTGTGCTAGCAATCAGC | GGTTCATCTCACTGGAAA | 55°C |
1sp v stand for splice variant
Figure 1Distribution of chimeric mRNAs derived from the L1 ASP as an alternative promoter. The presence of native mRNAs derived from a gene predicted by (a) AL711955 and KIAA1797, (b) CLCN5 [25], (c) SLCO1A2, (d) MET proto-oncogene [26], and their corresponding chimeric transcripts is shown at the upper and lower RT-PCR panels. cDNAs were derived from the following human tissues: 1, thymus; 2, prostate; 3, spleen, 4, small intestine; 5, colon; 6, ovary; 7, testis; 8, peripheral blood leukocytes; 9, placenta; 10, skeletal muscle; 11, brain; 12, kidney; 13, heart; 14, lung; 15, pancreas, and 16, liver. GenBank accession numbers for each mRNA and chimeric L1 mRNA are shown. Product sizes are shown on the left of each panel and below the forward primer on the scheme. L1 (PA2 or Ta subfamily) is shown by a large box with the 5′ UTR region indicated in red and its orientation is marked by an arrow. Exons are marked by open boxes (not in scale). Splicing schemes are shown by lines. The location of translation initiation, codon is marked by ATG. Primers used in PCR are shown by arrowheads below the exons. (b) Exons transcribed from the CLCN5 upstream promoter [uP] are designated with −1a to −4 a. (c) A 315 bp RT-PCR product corresponds to L1-SLCO1A2 transcript derived from the upstream L1 ASP (L1 ASP2), but not from the L1 ASP (L1 ASP1) predicted by the EST (BX955947). (d) A minor L1-MET splice variant is shown by a broken line. P stands for promoter and dP stands for downstream promoter.
Figure 2Distribution of antisense RNAs derived from L1 ASP. The presence of mRNAs derived from (a) COL11A1 [27], (b) BOLL [28], and their antisense RNAs is shown at the upper and lower RT-PCR panels, respectively. The exons of antisense RNAs L1-COL11A1 and L1-BOLL complementary to exon 40 of COL11A1 and exon 6 of BOLL are shown as grey boxes. (a) COL11A1 exons 38–47. (b) Two L1-BOLL splice variants and a nonspecific product, marked by an asterisk, are presented. A 211 bp product derived from the L1 ASP1 is identical to EST BE866323 (splicing scheme III). A novel 242 bp product generated from the L1 ASP2 corresponds to the splicing scheme V. For the remaining description details, see Figure 1 legend.
Widespread L1 ASP-driven transcription of human genes revealed from ESTs/mRNAs.
| EST | Source | Similarity to L1 5′UTR opposite strand | Similarity to known mRNA | Location in the genome | Orientation |
| Type I splicing (1 EST) | |||||
| BU943355++ (4 ex) | Pool of 40 cell line polyA+ | 4−59 ≡ 592−647 (96%) 60−289 ≡ 762−990 (96%) L1PA3 AC007780 | Arylsulfatase G, NM_014960 331−649 ≡ 1342−1660 (99%) | NT_010641 (chr 17) 10/11 | Sense |
| Type II splicing (2 ESTs) | |||||
| CD642260 (4 ex) | Embryonic stem cell line WA01/H1 | 12−117 ≡ 542−647 (97%) 118−230 ≡ 878−990 (96%) L1PA2 AC022762 | Olfactory receptor, family 56, subfamily B, member 4, NM_001005181 373−728 ≡ 802−443 (98%) | NT_009237 (chr 11) 3′/1 | Antisense |
| NM_017794 (46 ex) | RA-induced NT2 neuronal precursor cells | 4−150 ≡ 501−647 (93%) 151−262 ≡ 878−990 (93%) L1P AL354879 | Hypothetical protein KIAA1797, AL711955* 331−834 ≡ 60−563 (99%) | NT_008413 (chr 9) 5′/45 | Sense |
| Type III splicing (22 ESTs) | |||||
| BM910612 (6 ex) | Brain, astrocytoma grade IV cell line | 1−134 ≡ 514−647 (98%) L1Ta (Hs) AC011597 | Fibronectin type III domain containing 6 (cytokine receptor) NM_144717 268−915 ≡ 336−982 (98%) | NT_086641 (chr 3) 1/7 | Sense |
| BF676152 (3 ex) | Prostate | 4−126 ≡ 520−647 (91%) L1PA2 AC097061 | Hypothetical protein BC014608, NM_138796 127−713 ≡ 422−1005 (91%) | NT_021877 (chr 1) 5/11 | Sense |
| AU123136++ (7 ex) | Uninduced NT2 cell line | 1−125 ≡ 523−647 (96%) L1PA2 AC079005 | Breast carcinoma amplified sequence 3, NM_017679 126−623 ≡ 710−1208 (99%) | NT_010783 (chr 17) 9/24 | Sense |
| AA226814+ (3 ex) | Ntera-2 neuroepithelial cells | 1−111 ≡ 538−649 (93%) L1PA2 AC018470 | Secernin 3 (dipeptidase), NM_024583 112−347 ≡ 843−1075 (96%) | NT_005403 (chr 2) 5/8 | Sense |
| BU959632 (5 ex) | Pool of 40 cell line polyA+ | 4−45 | Cardiomyopathy associated 5, NM_153610 46−559 ≡ 3235−3748 (97%) | NT_006713 (chr 5) 8/12 | Sense |
| BF208095+ (6 ex) | Bladder carcinoma cell line | 2−57 ≡ 592−647 (94%) L1PA2 AC002080 | Hepatocyte growth factor receptor (MET proto-oncogene), NM_000245 132−456 ≡ 1387−1714 (99%) 462−663 ≡ 1805−2013 (92%) | NT_007927 (chr 7) 2/21 | Sense |
| AA220950+ (3 ex) | Ntera-2 neuroepithelial cells | 1−39 ≡ 609−647 (89%) L1PA3 AC022261 | Dynein, cytoplasmic, intermediate polypeptide 1, NM_004411 40−247 ≡ 613−818 (95%) | NT_007910 (chr 7) 5/17 | Sense |
| BM557937 (7 ex) | Brain astrocytoma grade IV cell line | 1−110 ≡ 538−647 (93%) L1PA3 AC022748 | Cholinergic receptor, nicotinic, beta poly-peptide 4, NM_000750 410−713 ≡ 168−471 (99%) | NT_024654 (chr 15) 5′/6 | Sense |
| BG335812 (> 6 ex) | Placenta choriocarcinoma cell line | 2−105 ≡ 544−647 (93%) L1PA2 AC009949 | Nuclear antigen Sp100, NM_003113 106−522 ≡ 139−556 (90%) | NT_005403 (chr 2) 2/25 | Sense |
| BE865812+(4 ex) | Bladder carcinoma cell line | 1−43 ≡ 605−647 (97%) L1Ta (Hs) AL049838 | Chromosome 14 open reading frame 37, NM_001001872 44−343 ≡ 933−1228 (96%) | NT_025892 (chr 14) 5/7 | Sense |
| BE866323+(4 ex) | Bladder carcinoma cell line | 1−92 | Bol, boule-like ( | NT_005246 (chr 2) 3′/11 | Antisense |
| BP352155 (5 ex) | Well-differentiated squamous cell carcinoma cell line TE13 | 1−113 ≡ 535−647 (96%) L1PA2 AC004519 | Hypothetical protein FLJ31340, BX346336* 114−490 ≡ 500−876 (98%) | NT_086723 (chr 7) 1/ >5 | Sense |
| BP351387 (5 ex) | Well-differentiated squamous cell carcinoma cell line TE13 | 1−67 = 581−647 L1Ta (Hs) AL663118 | Chloride channel 5, NM_000084 213−583 = 243−613 | NT_086939 (chr X) 5′/12 | Sense |
| BP351082 (>4 ex) | Well-differentiated squamous cell carcinoma cell line TE13 | 1−71 ≡ 576−647 (95%) L1PA3 AC114734 | Hypothetical protein MGC16169 (protein kinase) NM_033115 72−593 ≡ 1913−2433 (99%) | NT_086651 (chr 4) 17/24 | Sense |
| BP369881 (6 ex) | Testis | 1−65 ≡ 581−647 (92%) L1PA3 AL136525 | WD repeat and FYVE domain containing 2, NM_052950 66 −570 ≡ 460−963 (99%) | NT_086801 (chr 13) 3/12 | Sense |
| AA226765 (3 ex) | Brain Ntera-2 neuroepithelial cells | 1−67 ≡ 581 −647 (92%) L1PA3 AC025170 | Hypothetical protein FLJ35779, NM_152408 68−356 ≡ 480−767 (97%) | NT_086677 (chr 5) 4/11 | Sense |
| CF593264 (> 5 ex) | Placenta | 29−95 ≡ 581−647 (95%) L1PA3 AL050323 | Phospholipase C, beta 1, NM_182734 174−769 ≡ 103−692 (98%) | NT_011387 (chr 20) 5′/33 | Sense |
| BP873102 (5 ex) | Embryonal kidney cell line=“293” | 1−67 ≡ 581−647 (95%) L1PA2 AL022400 | RAB GTPase activating protein 1-like, NM_014857 68−583 ≡ 731−1244 (95%) | NT_086598 (chr 1) 4/21 | Sense |
| CD110319 (2 ex) | Placenta “preeclamptic placenta” | 25−92 ≡ 580−647 (97%) L1PA2 AC004452 | FLJ16237 protein, NM_001004320 93−568 ≡ 428−900 (97%) | NT_086703 (chr 7) 2 /13 | Sense |
| BX476029 (5 ex) | Pooled from different tissues | 2−77 ≡ 572−647 (93%) L1PA3 AL121946 | Polycystic kidney and hepatic disease 1, NM_138694 78−567 ≡ 7273−7762 (99%) | NT_007592 (chr 6) 43/67 | Sense |
| CB960713 (4 ex) | Placenta | 30−107 ≡ 570−647 (96%) L1PA3 AC005922 | ATP-binding cassette, subfamily A, NM_172386 108−208 = 3283−3183 | NT_010641 (chr 17) 25/38 | Antisense |
| CD644604 (3 ex) | Embryonic stem cells, cell line=“WA01” | 14−115 ≡ 547−647 (94%) L1PA3 AC022029 | Catenin (cadherin-associated protein), alpha 3, NM_013266 116−736 ≡ 755−1375 (98%) | NT_086771 (chr 10) 5/19 | Sense |
| Type IV splicing (1 EST) | |||||
| CF594290 (9 ex) | Placenta | 29−230 ≡ 531−732 (94%) 231−340 ≡ 878−988 (95%) L1PA2 AC022306 | Hypothetical protein FLJ32800, NM_152647 354−451 = 1305−1402 452 −780 ≡ 1642−1964 (97%) | NT_010194 (chr 15) 5/16 | Sense |
| Type V splicing (19 ESTs) | |||||
| BE787024++ (3 ex) | Lung large cell carcinoma cell line | 17−215 ≡ 533−732 (98%) L1Ta (Hs) AC079750 | Activin A receptor, type IC, NM_145259 216−752 ≡ 548−1086 (95%) | NT_005403 (chr 2) 2/9 | Sense |
| BE568884+ (4 ex) | Bladder carcinoma cell line | 1−178 ≡ 554−732 (97%) | CD96 antigen, NM_005816 179−627 ≡ 659−1113 (97%) | NT_086640 (chr 3) 2/15 | Sense |
| BE617461++ (6 ex) | Colon adenocarcinoma cell line | 8−185 ≡ 553−732 (98%) L1PA2 AC092916 | RAB3A interacting protein, NM_175625 186−738 ≡ 998−1556 (98%) | NT_086796 (chr 12) 3/10 | Sense |
| BE568818+ (3 ex) | Bladder carcinoma cell line | 1−163 ≡ 570−732 (93%) L1PA2 AC010585 | Secretory carrier membrane protein 1, NM_052822 164−516 ≡ 717−1063 (97%) | NT_006713 (chr 5) 6/8 | Sense |
| BU858570 (2 ex) | Pool of 40 cell line polyA+ RNAs | 4−166 ≡ 571−732 (93%) L1PA2 AL691464 | Guanylate binding protein 1, NM_002053 167−402 ≡ 259−494 (95%) | NT_004686 (chr 1) 2/11 | Sense |
| BF028725 (3 ex) | Bladder carcinoma cell line | 2−123 ≡ 612−732 (91%) L1PA2 AC004800 | Hypothetical protein FLJ36166, NM_182634 124−264 ≡ 3282−3424 (95%) | NT_086704 (chr 7) 2/21 | Sense |
| AA224229+ (4 ex) | 6 week, differentiated, post-mitotic hNT, neurons | 1−94 ≡ 640−732 (98%) L1Ta (Hs) AL365308 | Chromosome 6 open reading frame 170, NM_152730 95−430 ≡ 2622−2957 (99%) | NT_086697 (chr 6) 22/30 | Sense |
| BG542212++ (> 3 ex) | Lung | 2−187 ≡ 547−732 (97%) L1Ta (Hs) AC096569 | Zinc finger protein 638, NM_014497 188−638 ≡ 3576−4013 (92%) | NT_022184 (chr 2) 18/28 | Sense |
| AV693621 (2 ex) | Hepatocellular carcinoma | 1−172 ≡ 559−732 (93%) L1PA2 AL627203 | Collagen, type XI, alpha 1, variant A, NM_001854 187−279 = 3433−3341 | NT_004623 (chr 1) 46/67 | Antisense |
| BE735854+ (6 ex) | Pancreas adenocarcinoma cell line | 1−95 ≡ 638−732 (93%) L1PA2 AC092903 | Similar to beta-1, 4-mannosyltransferase, CD708577* 95−387 ≡ 174−466 (99%) | NT_005588 (chr 3) 1/ >5 | Sense |
| R64632 (4 ex) | Soares placenta Nb2HP | 1−52 = 681−732 L1PA2 AL713859 | Hypothetical protein FLJ10986, NM_018291 53−406 ≡ 1319−1671 98% | NT_029223 (chr 1) 11/14 | Sense |
| BP352672 (4 ex) | Well-differentiated squamous cell carcinoma cell line TE13 | 1−126 ≡ 608−732(94%) L1PA2 AL354711 | Chromosome 9 open reading frame 39, NM_017738 127−603 = 152−631 | NT_008413 (chr 9) 2/23 | Sense |
| BP358215 (7 ex) | Mammary gland tumor cell line T47D | 1−147 ≡ 586−732 (92%) L1PA2 AL391749 | Regulator of G-protein signalling 6, NM_004296 148−581 ≡ 188−621 (99%) | NT_026437 (chr 14) 5′/17 | Sense |
| H72033 (4 ex) | Soares breast 2NbHBst | 1−107 ≡ 626−732 (97%) L1PA2 AC079005 | Breast carcinoma amplified sequence 3, NM_017679 108 −370 ≡ 710−967 (95%) | NT_010783 (chr 17) 9/24 | Sense |
| CA488981 (3 ex) | Cell_line=ZR-75-1, MCF7, SK-BR-3, MDA-MB-231, hTERT-HME1, LNCaP | 1−159 ≡ 574−732 (91%) L1PA2 AC034215 | Monogenic, audiogenic seizure susceptibility 1 homolog, NM_032119 160−736 ≡ 17956−18532 (99%) | NT_086677 (chr 5) 83/98 | Sense |
| BX955947 (3 ex) | Pooled from different tissues | 1−116 ≡ 617−732 (89%) L1PA2 AC006559 | Solute carrier organic anion transporter family, member 1A2, NM_021094 240−342 = 186−288 | NT_009714 (chr 12) 5′/14 | Sense |
| BX477512++ (3 ex) | Pooled from different tissues | 2−129 ≡ 605−732 (93%) L1PA2 AC024061 | Hypothetical protein FLJ38736, NM_182758 130−551 = 3191−3216 | NT_086827 (chr 15) 18/20 | Sense |
| CN412489++ (2 ex) | Embryonic stem cells, embryoid bodies from H1, 7 and H9 cell lines | 1−151 ≡ 582−732 (98%) L1PA2 AL133299 | FLJ46156 protein, NM_198499 152− 348 = 1087−1283 | NT_086806 (chr 14) 8/37 | Sense |
| CN408255 (4 ex) | Embryonic stem cells, DMSO-treated H9 cell line | 1−180 ≡ 553−732 (95%) L1PA2 AP00942 | Baculoviral IAP repeat-containing 2, NM_001166 181−514 = 2766−3099 | NT_033899 (chr 11) 6/9 | Sense |
| Type VI splicing (4 ESTs) | |||||
| CD643062 (8 ex) | Embryonic stem cell line WA01/H1 | 10−220 ≡ 780−990 (97%) L1PA2 AC018741 | Hypothetical LOC388927, XM_371478 237−744 ≡ 1−509 (99%) | NT_015926 (chr 2) ND | Sense |
| BU176833 (6 ex) | Eye retinoblastoma cell line | 1−227 ≡ 763−989 (96%) L1PA3 AC105054 | Rho GTPase activating protein 25, NM_014882 536−878 ≡ 419−757 (97%) | NT_022184 (chr 2) 5′/10 | Sense |
| BE568192 (3 ex) | Bladder carcinoma cell line | 1−60 ≡ 931−990 (98%) L1PA2 AP005264 | Similar to hypothetical protein LOC375127, XM_496265 95−367 ≡ 213−490 (95%) | NT_010859 (chr 18) 3/5 | Sense |
| BP245205 (3 ex) | Embryonal kidney cell line 293 | 6−135 ≡ 861−990 (95%) L1PA2 AC099512 | Monogenic, audiogenic seizure susceptibility 1 homolog, NM_032119 138−574 ≡ 17953−18384 (98%) | NT_086677 (chr 5) 91/98 | Sense |
1 EST/mRNA GenBank accession number and number of exons (ex) determined by SPIDEY [1]. ESTs are grouped according to 6 different splicing schemes [2]. Sixteen identical or similar ESTs described earlier by Nigumann et al [2] and Wheelan et al [44] are shown by + and ++, respectively.
2 Source of the EST as annotated in EST division of GenBank.
3 EST similarity (≡) or identity (=) to a representative L1 genomic clone #11A [3]. Subfamily of L1 [4] and GenBank accession number were determined by genome browser [5]. For some ESTs the 5′ nucleotides (< 28 nt) were derived either from vector/adaptor or represented as low quality sequence.
4 Similarity/identity to known mRNA as determined by BLASTN [6] and BLAST2 sequences [7] programs. mRNA description is based on the RefSeq database [8]. If the mRNA has not been described, an EST (marked by an asterisk) is shown. This EST contains a putative first exon transcribed from the non-L1 (native) promoter.
5 Genomic contig (accession no), chromosome (chr), and position of the L1 ASP in the intron, upstream (5′) or downstream (3′)/total number of exons, as determined with MegaBLAST and SPIDEY programs. ND stands for not determined.
6 Orientation with respect to the gene's transcription.
Examples of the L1 ASP functioning as an alternative promoter or driving antisense transcription of human genes.
| EST | Source | Similarity to L1 5′UTR opposite strand | Similarity to known mRNA | Location in the genome | Orientation |
| Type II splicing | |||||
| CD642260 (4 ex) | Embryonic stem cell line WA01/H1 | 12−117 ≡ 542−647 (97%) 118−230 ≡ 878−990 (96%) L1PA2 AC022762 | Olfactory receptor, family 56, subfamily B, member 4, NM_001005181 373−728 ≡ 802−443 (98%) | NT_009237 (chr 11) 3′/1 | Antisense |
| NM_017794 (46 ex) | RA-induced NT2 neuronal precursor cells | 4−150 ≡ 501−647 (93%) 151−262 ≡ 878−990 (93%) L1P AL354879 | Hypothetical protein KIAA1797, AL711955* 331−834 ≡ 60−563 (99%) | NT_008413 (chr 9) 5′/45 | Sense |
| Type III splicing | |||||
| BE866323+ (4 ex) | Bladder carcinoma cell line | 1−92 | Bol, boule-like ( | NT_005246 (chr 2) 3′/11 | Antisense |
| BP351387 (5 ex) | Well-differentiated squamous cell carcinoma cell line TE13 | 1−67 = 581−647 L1Ta (Hs) AL663118 | Chloride channel 5, NM_000084 213−583 = 243−613 | NT_086939 (chr X) 5′/12 | Sense |
| CB960713 (4 ex) | Placenta | 30−107 ≡ 570−647 (96%) L1PA3 AC005922 | ATP-binding cassette, subfamily A, NM_172386 108−208 = 3283−3183 | NT_010641 (chr 17) 25/38 | Antisense |
| Type V splicing | |||||
| AV693621 (2 ex) | Hepatocellular carcinoma | 1−172 ≡ 559−732 (93%) L1PA2 AL627203 | Collagen, type XI, alpha 1, variant A, NM_001854 187−279 = 3433−3341 | NT_004623 (chr 1) 46/67 | Antisense |
| BP358215 (7ex) | Mammary gland tumor cell line T47D | 1−147 ≡ 586−732 (92%) L1PA2 AL391749 | Regulator of G-protein signalling 6, NM_004296 148−581 ≡ 188−621 (99%) | NT_026437 (chr 14) 5′/17 | Sense |
| BX955947 (3 ex) | Pooled from different tissues | 1−116 ≡ 617−732 (89%) L1PA2 AC006559 | Solute carrier organic anion transporter family, member 1A2, NM_021094 240−342 = 186−288 | NT_009714 (chr 12) 5′/14 | Sense |
1 EST/mRNA GenBank accession number and number of exons (ex) determined by SPIDEY [1]. ESTs are grouped according splicing schemes [2]. EST described earlier by Nigumann et al [2] is marked by +.
2 Source of the EST as annotated in EST division of GenBank.
3 EST similarity (≡) or identity (=) to a representative L1 genomic clone #11A [3]. Subfamily of L1 [4] and GenBank accession number were determined by genome browser [5]. For some ESTs, the 5′ nucleotides (< 28 nt) were either derived from vector/adaptor or represented as low quality sequence.
4 Similarity/identity to known mRNA as determined by BLASTN [6] and BLAST2 sequences [7] programs. mRNA description is based on the RefSeq database [8]. If the mRNA has not been described, an EST (marked by an asterisk) is shown. This EST contains a putative first exon transcribed from the non-L1 (native) promoter.
5 Genomic contig (accession no), chromosome (chr), and position of the L1 ASP in the intron, upstream (5′), or downstream (3′)/total number of exons, as determined with MegaBLAST and SPIDEY programs. ND stands for not determined.
6 Orientation with respect to the gene's transcription.
Figure 3Distribution of L1 splice variants II and IV. The presence of splice variants was estimated by RT-PCR in 16 normal human tissues (numbered as in Figure 1 legend) using a reverse primer designed to hybridize to the junction of exons 1 and 2. The schematically represented splice variants II and IV use a common splicing acceptor site at position +116 and splicing donor sites located at positions +262 and +347, respectively [2]. SP stands for L1 sense promoter; sp v stands for splice variant.
Figure 4TSS mapped to the L1 ASP region. (a) The position of TSS present in the DBTSS is shown highlighted on the consensus sequence of L1Hs [4] between positions 347 and 601. TSS with single and multiple entries present in the database are represented by yellow and blue highlight, respectively. The letters above the sequence mark the 3′ end of the oligonucleotide primers used in RT-PCR (see Table 1). (b) Southern blot RT-PCR analysis of the L1-MET transcripts. The lanes are marked according to the primers used in the PCR. Multiple bands on each lane represented the different splice variants of the L1-MET transcript, as confirmed by sequence analysis.