Weiyun Wu1, Wenkai Tan1, Shicai Ye1, Yu Zhou1, Juanhua Quan1. 1. Department of Gastroenterology, Affiliated Hospital of Guangdong Medical University, Zhanjiang, Guangdong 524001, P.R. China.
Abstract
The pathogenesis of colorectal cancer (CRC) is poorly understood. MicroRNA (miR)-32 upregulation in CRC tissues was previously reported, where it increased the proliferation, migration and invasion, and reduced apoptosis of CRC cells by inhibiting the expression of phosphatase and tensin homolog (PTEN). However, the mechanism underlying miR-32 upregulation remains unknown. miR-32 is an intronic miRNA located within intron 14 of the transmembrane protein 245 gene (TMEM245). The present study aimed to elucidate the biological pathways underlying miR-32 regulation in CRC. A truncated promoter containing the 5'-flanking region of TMEM245/miR-32 gene was constructed. The promoter region was analyzed by dual luciferase reporter assay in CRC cells. DNA pull-down assay and mass spectrometry (MS) were used to identify proteins binding to the core promoter. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and transcription factor (TF) analyses were used to identify the binding proteins. The -320 to -1 bp fragment of the 5'-flanking region exhibited the highest luciferase activity. The regions spanning -606 to -320 bp exhibited a significant decrease in luciferase activity, compared with the -320 to -1 bp fragment. DNA pull-down assay and MS revealed 403 potential miR-32 promoter binding proteins. GO and KEGG pathway analysis indicated that these proteins were involved in numerous physiological and biochemical processes, including 'structural molecule activity', 'RNA binding', 'small molecule metabolic process' and 'biogenesis'. Furthermore, TF analysis revealed 10 potential interacting TFs, including SMAD family member 1 (SMAD1), signal transducer and activator of transcription 1 (STAT1) and forkhead box K1 (Foxk1). These results suggested that the core promoter region may be located within-320 to -1 bp of the 5'-flanking region of TMEM245/miR-32 gene, while the region from -606 to -320 bp may harbor repressive regulatory elements. The TFs SMAD1, STAT1 and Foxk1 may be involved in the transcriptional regulation of miR-32.
The pathogenesis of colorectal cancer (CRC) is poorly understood. MicroRNA (miR)-32 upregulation in CRC tissues was previously reported, where it increased the proliferation, migration and invasion, and reduced apoptosis of CRC cells by inhibiting the expression of phosphatase and tensin homolog (PTEN). However, the mechanism underlying miR-32 upregulation remains unknown. miR-32 is an intronic miRNA located within intron 14 of the transmembrane protein 245 gene (TMEM245). The present study aimed to elucidate the biological pathways underlying miR-32 regulation in CRC. A truncated promoter containing the 5'-flanking region of TMEM245/miR-32 gene was constructed. The promoter region was analyzed by dual luciferase reporter assay in CRC cells. DNA pull-down assay and mass spectrometry (MS) were used to identify proteins binding to the core promoter. Gene Ontology (GO), Kyoto Encyclopedia of Genes and Genomes (KEGG) and transcription factor (TF) analyses were used to identify the binding proteins. The -320 to -1 bp fragment of the 5'-flanking region exhibited the highest luciferase activity. The regions spanning -606 to -320 bp exhibited a significant decrease in luciferase activity, compared with the -320 to -1 bp fragment. DNA pull-down assay and MS revealed 403 potential miR-32 promoter binding proteins. GO and KEGG pathway analysis indicated that these proteins were involved in numerous physiological and biochemical processes, including 'structural molecule activity', 'RNA binding', 'small molecule metabolic process' and 'biogenesis'. Furthermore, TF analysis revealed 10 potential interacting TFs, including SMAD family member 1 (SMAD1), signal transducer and activator of transcription 1 (STAT1) and forkhead box K1 (Foxk1). These results suggested that the core promoter region may be located within-320 to -1 bp of the 5'-flanking region of TMEM245/miR-32 gene, while the region from -606 to -320 bp may harbor repressive regulatory elements. The TFs SMAD1, STAT1 and Foxk1 may be involved in the transcriptional regulation of miR-32.
Colorectal cancer (CRC) is a type of malignant tumor derived from the colonic epithelial mucosa. Carcinogenesis of CRC involves abnormal expression of genes associated with proliferation, apoptosis, metastasis and angiogenesis (1). To date, the molecular mechanisms underlying CRC oncogenesis are not fully understood. The pathogenesis and development of CRC is a multi-factor, multi-step process, in which gene mutations and abnormal expression may serve important roles (2). The involvement of DNA epigenetic modifications (3), non-coding RNAs including microRNAs (miRNAs/miRs) (4), long non-coding RNA (lncRNA) (5), circular RNA (cirRNA) (6) and chromatin remodeling (7) in the development of CRC are receiving increasing attention.miRNAs are a class of non-coding, single-stranded RNAs with a length of 18–25 nucleotides. They affect gene expression by binding to specific sites at the 3′-untranslated region of target mRNAs. miRNAs are involved in tumor development and are potential biomarkers in cancer diagnostics and treatment, including CRC (8,9). Accumulating evidence has demonstrated that aberrantly expressed miRNAs acted as oncogenes or tumor suppressor genes in CRC (4,10,11).miR-32 is an intronic miRNA located within intron 14 of transmembrane protein 245 gene (TMEM245). Our previous studies (12,13) revealed that miR-32 was upregulated in CRC tissues and that high miR-32 levels were significantly associated with lymph node and distant metastasis. Additionally, patients with high miR-32 expression had a poor overall survival. Furthermore, overexpression of miR-32 led to increased proliferation, migration and invasion, and reduced apoptosis of CRC cells via inhibition of phosphatase and tensin homolog (PTEN). However, the mechanism underlying the upregulation of miR-32 remains unknown. The aim of the current study was to elucidate the mechanisms involved in the upregulation of miR-32 in CRC by analyzing the promoter of the miR-32 gene and investigating the proteins that bind to the promoter. The results obtained may assist in the further investigation of transcriptional regulatory mechanisms of miR-32 expression.
Materials and methods
DNA cloning and construction of truncated promoter plasmids
To analyze the promoter region responsible for expression of the miR-32 gene, serially truncated fragments of five different lengths of the 5′-flanking region of host gene TMEM245 (ENST00000374586.7 from University of California Santa Cruz Genome Browser) were amplified by polymerase chain reaction (PCR) using different pairs of primers (Table I). PCR was performed with 2×HIFITaq PCR StarMix (Genstar, Beijing, China) and the conditions were as follows: Initial denaturation at 94°C for 5 min; 35 cycles of 94°C for 30 sec, 60°C for 30 sec and 72°C for 2 min; and final extension at 72°C for 10 min. These primers also introduced a KpnI site at the 5′end and an XhoI site at the 3′end of the amplified fragments. The PCR fragments were purified, digested with KpnI and XhoI and cloned into a pGL3-basic vector (Promega Corporation, Madison, WI, USA). Five successive truncated constructs from the 5′-flanking region termed pGL3-1987 (−1987 to −1 bp), pGL3-1648 (−1648 to −1 bp), pGL3-1088 (−1088 to −1 bp), pGL3-606 (−606 to −1 bp), pGL3-320 (−320 to −1 bp) were generated. All inserts were verified by DNA sequencing (Beijing Genomics Institute, Beijing, China).
Table I.
Primer sequences.
Sequence 5′ → 3′
Primer
Forward
Reverse
Position (bp)
pGL3-1987
AAAGGTACCCAGCCTGTTCA ACATGGTGAA
AAACTCGAGGTAATGGGAGTCGGGC TAGAAAC
−1987 to −1
pGL3-1648
AAAGGTACCCTCCCACC GGGAGACTGC
AAACTCGAGGTAATGGGAGTCGG GCTAGAAAC
−1648 to −1
pGL3-1088
AAAGGTACCCTTGCAAGG TTTGAAGCAATCA
AAACTCGAGGTAATGGGAGTCGG GCTAGAAAC
1088 to −1
pGL3-606
AAAGGTACCCTTGCCTGT GCCACTTGG
AAACTCGAGGTAATGGGAGTCGG GCTAGAAAC
606 to −1
pGL3-320
AAAGGTACCTTCTAGTATG CAGCTTGGGTTTTAATATC
AAACTCGAGGTAATGGGAGTCG GGCTAGAAAC
- 320 to −1
Cell transfection and dual luciferase assay
The CRC cell line HCT-116 was obtained from the Cell Bank of the Type Culture Collection of Chinese Academy of Sciences (Shanghai, China). HCT-116 cells were plated in six-well plates and cultured in RPMI-1640 medium (GE Healthcare Life Sciences, Logan, UT, USA) supplemented with 10% fetal bovine serum (Gibco; Thermo Fisher Scientific, Inc., Waltham, MA, USA) and 1% penicillin/streptomycin (GE Healthcare Life Sciences) at 37°C with 5% CO2. For each transfection, cells were incubated with 0.5 µg of each promoter reporter plasmid respectively, and 0.5 µg of pRL-TK (Promega Corporation), which was used as an internal control. The pGL3-basic vector was used as the negative control. Transfection was performed using Lipofectamine® 2000 (Thermo Fisher Scientific, Inc.). Cells were harvested for subsequent experimentation 48 h after transfection using the dual-luciferase reporter assay system (Promega Corporation), following the manufacturer's protocol. Firefly luciferase activity was normalized to Renilla luciferase activity and the ratio of Firefly/Renilla luciferase in each group reflected the promoter activity. Each experiment was performed in triplicate.
DNA pull-down assay
The DNA pull-down was performed with a DNA pull-down test kit (catalog no. KT401), according to the manufacturer's protocol (Gzscbio, Guangzhou, China). Briefly, the sequences of −320 to −1 bp 5′-flanking region were amplified by PCR and tagged with biotin. The biotin-labeled promoter was bound with streptavidin magnetic beads (Dynabeads™ M-280 Streptavidin; Thermo Fisher Scientific, Inc.) at 4°C for 4 h. The non-biotinylated promoter was used as the negative control. All protein extracted from HCT-116 cells in the input group was used as the positive control. The bound promoter was incubated with 1 mg protein extracted from HCT-116 cells with gentle agitation at 4°C overnight. The bound beads-promoter protein complexes were washed with wash buffer, which was included in the kit, and separated by SDS-PAGE. Gel bands were visualized by silver staining.
Mass spectrometry (MS) and bioinformatics analysis
The proteins were digested by incubating with 0.02 µg/µl trypsin at 37°C overnight. The resulting peptides were extracted, purified and processed using a Q-Exactive mass spectrometer (Thermo Fisher Scientific, Inc.). Proteins were identified by comparing the MS data with the Uniprot human protein sequence database (https://www.uniprot.org/taxonomy/9606) using the Mascot search engine (http://www.matrixscience.com/; V2.3.02). Finally, the protein identification results were verified by analyzing the protein quality, matches of the secondary spectrum of the protein, number of peptides matching the protein, protein abundance and protein description between samples. The proteins exhibiting differential binding to the biotin-labeled promoter and negative control, identified by MS, were analyzed using Gene Ontology (GO) functional enrichment (http://www.geneontology.org/), Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analyses (https://www.kegg.jp/) and the AnimalTFDB database (version 2; bioinfo.life.hust.edu.cn/AnimalTFDB) for possible transcription factors (TFs). Proteins were considered significantly enriched in GO terms and KEGG pathways when P<0.05. The MS and bioinformatics analyses were performed by Sagene Biotech Co., Ltd. (Guangzhou, China).
Statistical analysis
The experimental data were analyzed using one-way analysis of variance and were presented as the mean ± standard deviation from three independent experiments using SPSS software (version 19.0; IBM Corp., Armonk, NY, USA). P<0.05 was considered to indicate a statistically significant difference.
Results
Functional analysis of the promoter of the miR-32 gene
To investigate the mechanisms involved in the expression of miR-32, the 5′-flanking region of the host gene TMEM245 was dissected into a series of deletion fragments termed pGL3-1987 (−1987 to −1 bp), pGL3-1648 (−1648 to −1 bp), pGL3-1088 (−1088 to −1 bp), pGL3-606 (−606 to −1 bp) and pGL3-320 (−320 to −1 bp) (Fig. 1A). A dual luciferase reporter assay was performed to further detect the transcriptional activity of the fragments. Compared with the pGL3-basic vector, the luciferase activity in pGL3-320, pGL3-606, pGL3-1088, pGL3-1648 and pGL3-1987 was significantly increased in HCT-116 cells (P<0.05; Fig. 1B). The fragment −320 to −1 bp exhibited the most increased activity, indicating the presence of potential positive regulatory elements, which enhance miR-32 transcription in this region. However, a decrease in transcriptional activity in the pGL3-606 group compared with the pGL3-320 group (P<0.05) was observed (Fig. 1B), suggesting the presence of repressive regulatory elements in the region between position −606 and −320 bp.
Figure 1.
The luciferase activity of the truncated promoter of miR-32. (A) Schematics of each truncated promoter plasmids. (B) Dual luciferase reporter assays of miR-32 gene promoter constructs. Various recombinant vectors, including pGL3-1987, pGL3-1648, pGL3-1088, pGL3-606, pGL3-320 and pRL-TK, were co-transfected into HCT-116 cells. pRL-TK and pGL3-basic were used as internal and negative controls, respectively. Relative luciferase activity was determined by the ratio of Fluc/Rluc activity. Data presented as the mean ± standard deviation of three independent experiments. *P<0.05, compared with the pGL3-basic group. #P<0.05, compared with pGL3-320. Fluc, firefly luciferase; Rluc, Renilla luciferase; miR, microRNA.
Identification of promoter-binding proteins
The DNA pull-down assay was used to identify putative interacting proteins binding to the −320 to −1 bp fragment using streptavidin magnetic beads coated with the biotin-labeled promoter. The proteins were subsequently analyzed by SDS-PAGE and silver staining (Fig. 2). Identification of the differentially binding proteins in the two groups by MS revealed that the binding factors of miR-32 promoter included 403 proteins. These proteins were further analyzed by bioinformatics tools.
Figure 2.
SDS-PAGE analysis of promoter-binding proteins obtained using the DNA pull-down assay. The gel was stained with silver staining. M, PageRuler pre-stained protein ladder; 0, input group, total protein extracted from HCT-116 cells; 1, biotin-labeled promoter-binding protein complexes obtained by pull-down assay; 2, non-biotinylated promoter-binding protein complexes obtained by pull-down assay (negative control).
GO, KEGG pathway and transcription factor analysis
The 403 binding proteins were analyzed using GO enrichment analysis. The GO analysis classified the proteins into the following three functional categories: Biological process, cellular component and molecular function (Fig. 3). Under cellular component, the top ten GO terms were ‘cytoplasmic part’, ‘intracellular’, ‘cytoplasm’, ‘intracellular part’, ‘cytosol’, ‘mitochondrion’, ‘organelle’, ‘cell’, ‘cell part’ and ‘intracellular organelle’ (Fig. 3A). Under molecular function, the top ten GO terms were ‘catalytic activity’, ‘pyrophosphatase activity’, ‘hydrolase activity’, ‘acting on acid anhydrides’, ‘hydrolase activity’, ‘acting on acid anhydrides’, ‘in phosphorus-containing anhydrides’, ‘nucleoside-triphosphatase activity’, ‘binding’, ‘GTPase activity’, ‘structural molecule activity’, ‘RNA binding’ and ‘oxidoreductase activity’ (Fig. 3B). Under biological process, the top ten GO terms were ‘small molecule metabolic process’, ‘generation of precursor metabolites and energy’, ‘nucleobase-containing compound metabolic process’, ‘cellular aromatic compound metabolic process’, ‘heterocycle metabolic process’, ‘organic cyclic compound metabolic process’, ‘cellular component’, ‘organization or biogenesis’, ‘single-organism metabolic process’, ‘aromatic compound catabolic process’ and ‘nucleobase-containing compound catabolic process’ (Fig. 3C).
Figure 3.
Gene Ontology analysis of the binding proteins. (A) Cellular component. (B) Molecular function. (C) Biological process. GTP, guanosine triphosphate.
KEGG pathway analysis revealed that the 403 binding proteins identified were involved in various cellular processes, including ‘tight junction’, ‘oxidative phosphorylation’, ‘mRNA surveillance’, ‘actin cytoskeleton regulation’, ‘protein export’, and diseases, including ‘Huntington's disease’, ‘Alzheimer's disease’, ‘Parkinson's disease’, ‘non-alcoholic fatty liver disease’ (Fig. 4).
Figure 4.
Kyoto Encyclopedia of Genes and Genomes pathway analysis of the binding proteins. TCA, tricarboxylic acid.
Possible transcription factors (TFs) involved were predicted using the AnimalTFDB database. The analysis resulted in the identification of 10 potential interacting TFs of the miR-32 promoter (Table II).
Table II.
TFs potentially interacting with the microRNA-32 promoter as detected by mass spectrometry.
Gene ID
TF symbol
Domain
TF name
ENSG00000170365
SMAD1
MH1
SMAD family member 1
ENSG00000061455
PRDM6
ZBTB
PR domain containing 6
ENSG00000164916
FOXK1
Fork
Forkhead box K1
ENSG00000165684
SNAPC4
MYB
Small nuclear RNA activating complex, polypeptide 4
ENSG00000182359
KBTBD3
ZBTB
Kelch repeat and BTB (POZ) domain containing 3
ENSG00000153048
CARHSP1
CSD
Calcium regulated heat stable protein 1
ENSG00000167377
ZNF23
ZBTB
Zinc finger protein 23
ENSG00000121297
TSHZ3
ZBTB
Teashirt zinc finger homeobox 3
ENSG00000115415
STAT1
STAT
Signal transducer and activator of transcription 1
ENSG00000136535
TBR1
T-box
T-box, brain, 1
TF, transcription factor.
Discussion
In recent years, a number of studies reported the potential role of miRNAs in different types of cancer (14–16). The characterization of dysregulated miRNAs in CRC may help to improve the understanding of carcinogenesis and develop treatments for the disease. Previous studies have demonstrated that the overexpression of miR-32 led to increased proliferation, migration, and invasion and reduced apoptosis of CRC cells via inhibition of the anti-oncogene PTEN (12,13). However, the regulation of miR-32 expression in CRC remains unknown. The aim of the current study was to investigate the regulation of miR-32 expression.The expression of miRNAs is regulated by regulatory systems, including the promotors of their host genes, epigenetic regulation and TFs (17–21). Dysregulation of miRNA expression in different types of cancer may be due to an abnormal combination of TFs acting on the promoter regions or due to epigenetic changes, including aberrant DNA methylation and histone modification (22–24). Various stimuli in the external environment or signals at different stages of development may cause different TFs to bind to transcriptional regulatory elements, activating or inhibiting the transcription of miRNAs (25). Therefore, the identification of proteins interacting with the promoter of miRNAs as well as analysis of their function is important for investigating the transcriptional regulation of miRNAs. Zhu et al (21) demonstrated that TF Kruppel like factor 4 negatively regulated miR-106a expression by binding to the promoter of miR-106a. A study by Kumar et al (26) revealed that the TF myocyte enhancer factor-2 and hypermethylation and histone modifications may have contributed to the downregulation of the miR-379/miR-656 cluster in oligodendrogliomas, either acting independently or in synergy, in oligodendroglioma. Nuclear factor-κB bound to the promoter of miR-1275 and inhibited its transcription, in response to tumor necrosis factor α (TNF-α) stimulation (27). Another study reported that transforming growth factor β1 (TGFβ1) promoted the binding of mothers against decapentaplegic homolog (SMAD)4 to the miR-155 promoter at a site located 454 bp from the transcription start site, suggesting that miR-155 may be a transcriptional target of the TGFβ1/SMAD4 pathway (28).miRNAs are transcribed by RNA polymerase II to generate the original transcript of miRNAs, called primary miRNAs (pri-miRNAs) (29–31). Drosha, an enzyme in the polymerase III family, processes the pri-miRNAs into a hairpin-like precursor miRNA (pre-miRNA) (29–31). The pre-miRNA is exported into the cytoplasm by exportin 5 and then cleaved by Dicer into 18–25 nucleotide double-stranded miRNAs, which are then unwound to generate mature miRNAs (29–31). Half of the known mammalian miRNA sequences are located in the introns of protein-coding host genes, referred to as intronic miRNAs (32). Such intron-derived miRNAs are commonly expressed coordinately and processed with their host gene transcripts (33). miR-32 is an intronic miRNA encoded by TMEM245, as described in the University of California, Santa Cruz Genome Browser (genome.ucsc.edu). Several intronic miRNAs are transcribed together with the host gene (18,34,35). Human papillomavirus type 16 E6 may regulate miR-23b, an intronic miRNA, indirectly through the methylation of its host gene TMEM245 (36). Lerner et al (37) demonstrated that deleted in lymphocytic leukemia 2 (DLEU2) acts as a host gene of miR-15a/miR-16-1, and the binding of the Myc to two alternative DLEU2 promoters represses both the host gene transcription and levels of mature miR-15a/miR-16-1. It is reported that the transcript levels of TMEM245 and miR-32 are positively correlated (38). Functional analysis of the promoter of miR-32 is required to understand the molecular mechanisms governing miR-32 gene expression. In the present study, the truncation analysis and luciferase reporter assays demonstrated that the cloned promoter fragment was capable of driving expression of the luciferase gene in transfected HCT-116 cells. The core promoter of miR-32 may be located within the −320 to −1 bp region which exhibited the highest luciferase activity. The regions spanning −606 to −320 bp potentially harbor negative regulatory elements as a significant decrease in promoter activity was observed. These data indicate that the miR-32 overexpression is due to the complex interactions between different regulatory elements and promoter.A DNA pull-down assay in combination with MS was performed to identify the proteins that bind to the miR-32 gene promoter. In addition, bioinformatics analyses were performed to characterize the binding proteins. The binding proteins were involved in a variety of key biological processes, including ‘structural molecule activity’, ‘RNA binding’, ‘small molecule metabolic process’ and ‘biogenesis’. This suggested that these proteins may potentially serve a role in carcinogenesis. The KEGG pathway analysis revealed that the 403 binding proteins identified were involved in neuronal diseases. Yan et al (39) demonstrated that miR-32 promotes neuroinflammation and neuropathic pain development through regulation of dual-specificity phosphatase 5, and knockdown of miR-32 suppressed mechanical allodynia and heat hyperalgesia and decreased inflammatory cytokine [interleukin (IL)-1β, TNF-α and IL-6] protein expression in rats following spinal nerve ligation. Another study revealed that two single nucleotide polymorphisms in the host gene TMEM245 were involved in genetic loci strongly associated with schizophrenia (40). Since miR-32 and its host gene TMEM245 may be involved in the pathogenesis of nervous system-associated diseases, proteins binding to the miR-32 promoter may also be involved in the signaling pathways of nervous system diseases. The 403 binding proteins identified in the current study are also involved in other cellular processes, including ‘tight junction’, ‘oxidative phosphorylation’, ‘mRNA surveillance’ and ‘actin cytoskeleton regulation’. These pathways are also involved in the development of tumors (41–44), including pancreatic, ovarian and gastric cancer, which may be associated with pathogenesis of colorectal cancer.TFs are a group of proteins that can regulate RNA transcription by binding to the promoter of the corresponding DNA sequence (45). TF analysis revealed 10 potential interacting TFs, including SMAD1, signal transducer and activator of transcription 1 (STAT1) and forkhead box K1 (Foxk1) among the binding proteins. Yang et al (46) revealed that SMAD1 promotes migration of CRC cells by inducing Snail and ajubaLIM protein expression simultaneously. The level of SMAD1 was significantly increased in CRC tissues, and was confirmed as significant predictor for overall survival (47). High STAT1 activity was significantly associated with longer patient overall survival in CRC (48). Wu et al (49) demonstrated that higher expression of Foxk1 could indicate a poor prognosis in patients with CRC since Foxk1 induces epithelial-mesenchymal transition (EMT) and promotes CRC cell invasion in vitro and in vivo; knockdown of Foxk1 inhibited TGF-β1-induced EMT. These transcription factors potentially serve a role in the pathogenesis of CRC, and further investigation is required to identify, verify and validate their involvement as binding proteins in miR-32 expression.The current study demonstrated that the core promoter region of the humanmiR-32 gene is located in the region spanning −320 to −1 bp, and binding proteins, especially TFs, may be involved in the transcriptional regulation of this gene. These results provide insight into the mechanism of miR-32 gene regulation. Due to the limitations of bioinformatics, which are only based on bioinformatics predictions, it's not certain whether these proteins can affect the expression of miR-32. Further verifications, including gain-of-function and loss-of-function studies, and a chromatin immunoprecipitation assay, are required to clarify the function of possible binding proteins.