| Literature DB >> 32550544 |
Yao Lu1, Qilong Li1, Kexin Zheng2, Chenghao Fu1, Chunying Jiang1, Dayu Zhou2, Chao Xia1, Shiliang Ma1.
Abstract
In metazoan genome, the mechanism of gene expression regulation between transcriptional regulatory elements and their target gene is spatiotemporal. Active promoters possess many specific chromosomal features, such as hypersensitive to DNaseI and enrichment of specific histone modifications. In this article, we proposed a novel method which possesses a high efficiency to find promoters in vitro. A promoter-trap library was constructed with totally 706 random mouse genomic DNA fragment clones, and 260 promoter-active fragments of the library were screened by transient transfection into 4T1 cells. To demonstrate the accuracy of this promoter finding method, 13 fragments with promoter activities were randomly selected for published DNase-seq and ChIP-seq data analysis, downstream transcripts prediction and expression confirmation. qRT-PCR results showed that six predicted transcription units were successfully amplified in different mouse tissues/cells or in reconstituted mouse mammary tumors. Our results indicate that this promoter finding method can successfully detect the promoter-active fragments and their downstream transcripts.Entities:
Keywords: ATAC-seq, Assay for transposase-accessible chromatin using sequencing; Bioinformatics; CAGE, cap analysis of gene expression; CMV, Cytomegalovirus; Cancer-specific promoter; ChIP-seq, Chromatin immunoprecipitation followed by massively parallel DNA sequencing; Ct, threshold; DHS, DNaseI hypersensitive sites; DNase-seq, DNase I hypersensitive sites sequencing; EF1a1, eukaryotic translation elongation factor 1 alpha 1; FBS, fetal bovine serum; GRO-seq, global run-on sequencing; Gene expression regulation; Gene finding; H3K4me3, histone H3 lysine 4 trimethylation; Itpr2, inositol 1, 4, 5-triphosphate receptor 2; LSINCT5, long stress-induced non-coding transcript 5; MCS, multiple cloning site; MPRA, Massively parallel reporter assays; Mouse breast cancer; PBS, phosphate buffered solution; Promoter trap; RNA-seq, RNA sequencing; SD, standard deviation; STARR-seq, Self-transcribing active regulatory region sequencing; TFs, transcription factors; TSS, transcription start sites; dNTPs, deoxy-ribonucleoside triphosphate; eRNAs, enhancer RNAs; mSEAP, mouse synthetic secreted embryonic alkaline phosphatase; pNPP, p-nitropheny-phosate; qRT-PCR, quantitative RT-PCR; tpk1, thiamine pyrophosphokinase
Year: 2019 PMID: 32550544 PMCID: PMC7286058 DOI: 10.1016/j.gene.2019.100008
Source DB: PubMed Journal: Gene X ISSN: 2590-1583
Fig. 1Flow chart of gene finding with promoter-trap technology. The main components of the promoter-trap vector (Pro-trap) include a bacteria selection marker Zeo, a eukaryotic reporter gene mSEAP and a multiple cloning site (MCS) directly upstream of the reporter gene. Both the Pro-trap and the 4T1 cells genomic DNA were restriction enzyme digested with BamHI/HindIII, and the purified enzyme digested big fragment from the pro-trap and the random fragments from the mouse 4T1 genome were ligated and transformed into E. coli GT115 cells. After transformation, each single clone was picked up and amplified. Recombinant plasmids were gel purified and quantified before reporter activity assay with a 96-well plate. Positive clones were determined after subtraction of reporter activities from the negative control. Furthermore, downstream transcription units of the 13 selected fragments were bioinformatically predicted and amplified by qRT-PCR in different tissues/cells.
Fig. 2Diagram of the construction of promoter trap library. (A) Pro-DNA-trap was a reporter vector driven by random 4T1 cells genomic DNA fragments. It was used for insertion of the random 4T1 cells genomic DNA fragments obtained by HindIII and BamHI double restriction enzyme digestion. Agarose gel map demonstrated the varieties of the inserted random DNA fragments. Lane 1–9: Recombinant vector containing trapped fragments with different base pairs. P: Pro-trap vector. M: Marker. (B) The Pro-trap was a promoter-less vector used as the negative control, and the agarose gel map showed the Pro-trap vector. Lane 1: Double digested Pro-trap. M: Marker. (C) The Pro-CMV-trap was a vector with the insertion of a CMV promoter into the BamHI/HindIII sites as the positive control, and the agarose gel map showed the fragment size of the CMV promoter. Lane 1–4: the fragments of the CMV promoter. M: Marker. (D) The pRL-CMV was a vector with a CMV promoter for normalizing transfection efficiency. Lane 1: the pRL-CMV. M: Marker.
Fig. 3Screening of inserted fragments with reporter gene activity. (A) After transfection of 4T1 cells with either negative control (pro-trap) or positive control (CMV-trap), the optimal transfection time point was shown as 36 h. The time course of the mSEAP activities was indicated. The figure was plotted by the mean value of the three replicates. stands for the negative control, stands for the cell culture supernatants collected at 6 h, stands for the cell culture supernatants collected at 12 h, stands for the cell culture supernatants collected at 24 h, stands for the cell culture supernatants collected at 36 h, stands for the cell culture supernatants collected at 48 h. (B) The heat map of SEAP activities of positive clones that possess promoter activities higher than positive control corresponding to its 96-well plate. The figure was displayed by the mean value of the three replicates. (C) The relative distribution of 706 clones with different activities. Negative clones represent the promoter activities lower than that of the negative control; positive clones represent the promoter activities between that of the positive and negative control; highly positive clones represent the promoter activities higher than that of the positive control. Each clone was compared with negative and positive control on its corresponding 96-well plate.
S2 FigThe heat map of mSEAP activities of 706 clones.
Fig. 4Genomic localization of the 13 selected fragments with promoter activities. (A) The heat map of SEAP activities of the 13 randomly selected clones. The SEAP activities of each clone were subtracted from the negative control in the corresponding 96-well plate. The figure was displayed by the mean value of the three replicates and sorted according the location of the selected clones on mouse genome. The red line on the right represented the trapped fragments (including P-4T1-24, P-4T1-273, P-4T1-328, P-4T1-447, P-4T1-516, P-4T1-622, P-4T1-699) located in the nongene regions; the orange line on the right represented the trapped fragments (including P-4T1-28, P-4T1-146, P-4T1-329, P-4T1-490, P-4T1-544) located in the intron regions; the blue line on the right represented the trapped fragments (P-4T1-629) located in the exon region. (B) The 13 trapped fragments were mapped on mouse genome (mm9). The red line pointed the location of each trapped fragment on mouse genome. (C) Proportion of the location of the 13 trapped fragments. In total 13 trapped fragments, 7.6% was located in the gene exon region, 53.8% was located in the nongene regions and 38.4% was located in the gene intron regions. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
Fig. 5DNase-seq and H3K4me3 ChIP-seq analysis detected trapped promoter activities in normal cell lines and tissues. (A) The heat map of the DNase I hypersensitivity of the 13 trapped fragments. All data were converted to log2 scale. The maximum red circle represented the value of 5 and the minimum blue circle represented the value of 0. The white block represented the trapped fragment displayed as heterochromatin in its corresponding cell lines and tissues. (B) The relative distribution of chromatin structure of the 13 trapped fragments in different cell lines and tissues. Red region represented the location of its trapped fragment on mouse genome. (C) The heat map of the H3K4me3 modification in the 13 trapped fragments. All data were converted to log2 scale. Red represented the value of 6, blue represented the value of 0 and gray represented the unmodified region. (For interpretation of the references to colour in this figure legend, the reader is referred to the web version of this article.)
S6 FigThe signal patterns of DNase-seq of 13 trapped fragments in different cell lines and tissues.
S7 FigThe signal patterns of H3K4me3 of 13 trapped fragments in different cell lines.
Fig. 6Tissue specific expression of predicted genes. (A) The representative image of tumors from tumor-bearing mice at 42 days after transplantation of 4T1 cells. Tumor volume was estimated according to the formula Volume (mm3) = L × W2 × 0.4. (B–G) qRT-PCR was carried out with primer pairs listed in S1 Table. The level of each gene was relative to that of EF1a1. Experiments were performed on three biological replicates for samples from each tissue. Bars represented the means ± standard deviation (SD) of 3 independent replicates. *P value <0.05, **P value <0.01. stands for the gene expression level in 4T1 cell cognate reconstituted mouse tumors, stands for the gene expression level in mouse tumor cells 4T1, stands for the gene expression level in mouse tumor cells 4T07, stands for the gene expression level in BALB/c mouse livers, lungs, spleens after transplanted tumors, stands for the gene expression level in tumor free BALB/c mouse.