| Literature DB >> 24707484 |
Dmitrii E Polev1, Iuliia K Karnaukhova2, Larisa L Krukovskaya2, Andrei P Kozlov2.
Abstract
Human gene LOC100505644 uncharacterized LOC100505644 [Homo sapiens] (Entrez Gene ID 100505644) is abundantly expressed in tumors but weakly expressed in few normal tissues. Till now the function of this gene remains unknown. Here we identified the chromosomal borders of the transcribed region and the major splice form of the LOC100505644-specific transcript. We characterised the major regulatory motifs of the gene and its splice sites. Analysis of the secondary structure of the major transcript variant revealed a hairpin-like structure characteristic for precursor microRNAs. Comparative genomic analysis of the locus showed that it originated in primates de novo. Taken together, our data indicate that human gene LOC100505644 encodes some non-protein coding RNA, likely a microRNA. It was assigned a gene symbol ELFN1-AS1 (ELFN1 antisense RNA 1 (non-protein coding)). This gene combines features of evolutionary novelty and predominant expression in tumors.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24707484 PMCID: PMC3953637 DOI: 10.1155/2014/398097
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Gene-specific amplification primers.
| No. | Primer name | Primer sequence |
|---|---|---|
| 1 | 5gsp1 | GCTGAGAGTGAATTCGGGGTGCAG |
| 2 | 5gsp1n | GCTGCTGCGTCTCGGAGTGAATG |
| 3 | 5gsp2 | GACTGGGAGTGAGAAGGAGC |
| 4 | 5gsp2n | GGAGTGAGAAGGAGCGGAG |
| 5 | 5gsp3 | GGTCTTTACTCCCATTCAACGGAAGAG |
| 6 | 5gsp3n | CATTCAACGGAAGAGGAAGCAGAGC |
| 7 | 3gsp1 | CCTCCTGTCATTCACTCCGAGACGC |
| 8 | 3gsp1n | CATTCACTCCGAGACGCAGCAGC |
| 9 | Upstream primer (UP) | GTGGCGCCTCAGCCACAATC |
| 10 | Nested upstream primer (NUP) | GCCTCAGCCACAATCGTAAT |
| 11 | Distal downstream primer (DDP) | GTGAGAAACCACAAGCTCCCTG |
| 12 | Nested distal downstream primer (NDDP) | GGTCTTTACTCCCATTCAA |
| 13 | Proximal downstream primer (PDP) | CAGGTTCTTCAGCCAGGAAG |
| 14 | Nested proximal downstream primer (NPDP) | GGAGTGAGAAGGAGCGGAG |
Figure 1Expression of the ELFN1-AS1 gene in human normal tissues and tumors. (a) Upper pane: PCR with ELFN1-AS1-specific primers. (a) Lower pane: GAPDH control. (a) Lanes: M: DNA size marker; 1: normal brain; 2: normal heart; 3: normal kidney; 4: normal liver; 5: normal lung; 6: normal pancreas; 7: normal placenta; 8: normal skeletal muscle; 9: brain, malignant meningioma, moderately differentiated; 10: lung, nonsmall cell carcinoma; 11: kidney, transitional cell carcinoma, papillary moderately differentiated; 12: kidney, renal cell carcinoma; 13: liver, hepatocellular carcinoma, well differentiated; 14: liver, adenocarcinoma, moderately differentiated; 15: gallbladder, adenocarcinoma; 16: esophagus, squamous cell carcinoma, ulcer, well differentiated; 17: stomach, adenocarcinoma, ulcer, well differentiated; 18: small intestine, mesenchymoma, moderately differentiated; NTC: no template control; PC: positive control. (b) Upper pane: PCR with ELFN1-AS1-specific primers. (b) Lower pane: GAPDH control. (b) Lanes: M: DNA size marker; 1: normal colon; 2: normal ovary; 3: normal peripheral blood leukocytes; 4: normal prostate; 5: normal small intestine; 6: normal spleen; 7: normal testis; 8: normal thymus; 9: colon, adenocarcinoma, moderately differentiated; 10: rectum, adenocarcinoma, moderately/poorly differentiated; 11: ovary, adenocarcinoma, ulcer, moderately differentiated; 12: fallopian tube, medullary carcinoma, poorly differentiated; 13: uterus, adenocarcinoma; 14: ureter, moderately differentiated, transitional cell carcinoma; 15: bladder, transitional cell carcinoma, papillary; 16: testis, germinoma; 17: parotid, squamous cell carcinoma, mixed; NTC: no template control; PC: positive control.
Figure 2Identification of the primary structure of the Hs.633957-specific RNA. (a) Scheme of the transcript variants for the locus Hs.633957. Splice sites and polyadenylation sites are marked and their positions are given according to the leftmost TSS. Arrows indicate position of the primers which were used for identification of the major splice form. BX119057-like variant is filled in dark grey. (b) Results of cDNA PCR amplification with primers bordering the 39 to 3642 gene region. (c) Results of cDNA PCR amplification with primers bordering the 39–3289 gene region. cDNA samples: 1: gallbladder, adenocarcinoma; 2: rectum, well differentiated adenocarcinoma; 3: ureter, papillary transitional cell carcinoma; 4: T-cell Hodgkin's lymphoma; 5: kidney; 6: liver; NTC: no template control.
Figure 3Transcribed locus Hs.633957 overview. A sketch of the chromosome 7 region containing transcribed locus Hs.633957 is shown according to the human genome assembly NCBI35/hg18. (A) Transcript variants are given to show the location of the region. (B) Repeating elements by RepeatMasker version 3.2.7 track indicates the location of repeating elements. (C) GC-box marks the GC-box motif identified in current study. HS sites mark position of the DNase I hypersensitivity sites. Location of the sites was estimated as averages of the left and right peak boundaries coordinates among the overlapping peaks from independent experiments for which data is available from UCSC Genome Browser (24 for HS1 and 17 for HS2). (D) Enhanced H3K27Ac, H3K4Me1, and Promoter H3K4Me3 tracks indicate association of the specific histone marks with the genome in certain cell lines. Overlaid data for several cell lines is shown. (E) ENCODE transcription factor ChIP-seq track provides a summary for association of various TFs with chromatin in various cell lines. Cell lines are indicated by letter code (H: HeLa-S3, K: K-562, h: HUVEC, g: GM12891, L: HepG2, G: GM12878). (F) Association of Pol 2 with chromatin in various cell lines is shown according to ENCODE ChIP-seq data. (G) ENCODE data on transcription levels in various cell lines are obtained by RNA-seq. Overlaid data is shown.
Figure 4Transcribed locus Hs.633957 may code for a microRNA. (a) A hairpin-like fragment of the secondary structure was predicted for the BX119057 sequence by Mfold. Nucleotide positions are given according to the genome sequence starting from the leftmost TSS. (b) The potential interaction of the predicted mature miRNA with its target site in DPYS mRNA. (c) Fragment of the DPYS mRNA secondary structure. The target site for the miRNA is in lower case. The miRNA seed site-interacting region is shaded. Nucleotide positions are given according to the mRNA (acc. No. NM_001385.2).
Figure 5Emergence pattern of functional regions around ELFN1-AS1. The changes of the functional regions in the context of the phylogeny are shown. E-boxes are shown as grey boxes numbered 1–8. Lighter colour with “∗” sign indicates an alternative E-box sequence at the point. GABP and GC represent the GABP binding site and the GC-box. SD represents the splice donor site, SA1-SA3: the acceptor splice sites. Black crosses indicate the absence of a splice signal. The “∗” indicates presence of an alternative splice signal. Exons and introns are not to scale. Data for the nonprimate vertebrates are not shown.