Literature DB >> 31423302

Unveiling the gene regulatory landscape in diseases through the identification of DNase I-hypersensitive sites.

Ying Chen1, Ailing Chen1.   

Abstract

DNase I-hypersensitive sites (DHSs) serve key roles in the regulation of gene transcription as markers of cis-regulatory elements (CREs). Recent advances in next-generation sequencing have enabled the genome-wide location and annotation of DHSs in a variety of cells. Numerous studies have confirmed that DHSs are involved in several processes in cell fate decision and development. DHSs have also been indicated in cancer and inherited diseases as driver distal regulatory elements. Here, the definition of DHSs is reviewed, in addition to high-throughput methods of DHS identification. Furthermore, the function of DHSs in gene expression is probed. The roles of DHSs in disease occurrence are also reviewed and discussed. Concomitant advances in the identification of essential roles of DHSs will assist in disclosing the underlying molecular mechanisms, supplementing gene transcription and enlarging the molecular basis of DHS-related bioprocesses, phenotypes, distinct traits and diseases.

Entities:  

Keywords:  DNase I-hypersensitive sites; DNase-sequencing; cis-regulatory elements; low-input DNase-sequencing; single-cell DNase-sequencing

Year:  2019        PMID: 31423302      PMCID: PMC6684942          DOI: 10.3892/br.2019.1233

Source DB:  PubMed          Journal:  Biomed Rep        ISSN: 2049-9434


1. Introduction

Definition of DNase I-hypersensitive sites (DHSs)

DNase I is an endonuclease with little DNA-sequence specificity (1). In the early 1960s, DNase I was used to probe how the nucleosome was organized (2). Weintraub and Groudine (3) found that active chromatin was prioritized for decomposition by this enzyme. These specific regions are termed DHSs, which are distinct markers of active chromatin that co-position with various cis-regulatory elements (CREs), including the expression regulation sequences of enhancers and promoters, negative regulation sequence of insulators, silencers, and certain locus control regions. DHSs usually disperse around transcriptionally active genes and are accessible to regulatory proteins. Therefore, by contrast, certain regions within DHSs are resistant to degradation as they are protected by these gene regulatory proteins, including transcription factors.

Generation and erasure of DHSs

The underlying molecular mechanisms of gene regulation remain to be fully elucidated. The first step of gene regulation is that the cells adopt an ‘open’ structure in response to external stimulation at a group of specific sequences located in certain chromatin regions. These exposed sequences are bound by site-specific transcriptional regulatory elements, leading to chromatin structure remodeling marked by significant accessibility to the nucleases (4). DHSs in the open chromatin serve important roles in chromatin rearrangement (5,6), and are stimulated by the different state of histone acetylation (7,8) and the binding capacity of the chromatin remodeling multiprotein complex (9). DHSs act as binding anchors of activator proteins and mediating cofactors, and interact with the preinitiation complex at promoters (2,10-14), regulating gene expression. DHSs can be eliminated by site-specific factors. It was reported that DHS at a CCCTC-binding factor (CTCF)-dependent silencer can be eliminated due to the eviction of CTCF and remodeling of a nucleosome caused by inducible non-coding RNA transcription in the chicken lysozyme (15). DHSs are dynamic and fine-tuned by different classes of remodeling enzymes. For example, TFE3 combines with ACF to stimulate the occurrence of an active DHS site in the IgH intronic enhancer, whereas PU.1 has been demonstrated to recruit Mi2β and subsequently erase this DHS (16).

Characteristic features of DHSs

In mammalian cells, >3% of the genome is found to be DNase I-hypersensitive (17). To date, 290,000,000 DHSs have been recognized. Each tissue/cell type is represented by multiple distinguished DHS profiling derived from different individuals. DHSs are considered to be the one of the most useful discriminative features between cell types (18) and have several distinctive characters. First, DHSs are typically characterized by high sensitivity to DNase I, particularly when the related gene is actively transcribed. Regions with high transcriptional activity are reported to be even more sensitive than those with no transcriptional activity. That is, there exists two states of DHSs, open and closed, in which the accessibility features of chromatin is increased or decreased, respectively, associated with gene expression (19). Second, DHSs are short sequences of ~200 base pairs with low methylation, and the majority are no more than several hundred base pairs long. These low-methylation regions are co-positioned at or close to the transcription starting sites (20,21), which affect gene transcription according to the degree of methylation. Third, DHSs are representative markers of regulatory DNA and overlap with multiple CREs, including promoters, enhancers and active transcription sites (22). In addition, DHSs have underpinned the identification of other CREs, including insulators, silencers and locus control regions (10). Fourth, although each tissue/cell exhibits distinguished DHS signatures, there exists specific core regions in DHSs that can be identified by sequence-specific DNA-binding proteins. The core regions are conserved in different cell types across species and are enriched with binding sites of HMG14 and HMG17 proteins (23).

2. High-throughput sequencing methods for DHS identification

The techniques used for DHS identification do not vary substantially, all of which are novel techniques based on high-throughput sequencing. The differences are compared in Table I.
Table I

Comparison of different techniques to identify DHSs.

TechniqueCell requirementData interpretationNovel DHS identificationCustomized DHSsComplex processDNase I digestionResolution
DNase-seqLarge number of cellsDifficultYesNoYesYesHigh
scDNase-seqOne or ~100 cellsDifficultNoNoYesYesLow
liDNase-seq<30 cellsDifficultYesNoReducedYesHigh
ImmunoSEQLarge number of cellsEasyCustomizedYesSimplerNo 
   DHS region    

Accurate DHSs, DNase I-hypersensitive sites; DNase-seq, DNase-sequencing; scDNase-seq, single-cell DNase-seq; liDNase-seq, low-input DNase-seq.

DNase-sequencing (DNase-seq)

Decades ago, Southern blot hybridization was the major method used to identify DHSs by characterizing digested DNAs following the titration of DNase I (24). However, the low-throughput nature of this strategy profoundly restricted its further application. Improvements and the wide application of the massively parallel sequencing technique has allowed high-resolution genome-scale mapping of various DHSs, which lays a foundation for assembling comprehensive catalogs of regulatory sequences (25,26). The first method used for identifying thousands of DHSs simultaneously was introduced by Crawford et al (27). Firstly, the nuclei are cleaved by DNase I, and the two ends are then digested blunt using T4 DNA polymerase. The genome is then cleaved by adding BamHI and BglII. The digested blunt or sticky fragments are ligated into the pBluescript SK(+) plasmid and sequenced. This method enriches the sequence within the genome that relates to active chromatin and identifies DHSs on a genome-wide scale. Two years later, these high-throughput strategies were renewed by attaching a biotinylated linker to the DNase-digested ends (Fig. 1). The linker tags are used to extract short joint DNA sequences, which can be identified by DNase-based high-throughput next-generation sequencing (DNase-seq) (28) or DNase-based microarrays (DNase-chip) (26). Similar strategies providing accurate mapping of DHSs further assist in revealing a large category of CREs in all types of mammalian tissues and cells (29,30).
Figure 1.

Experimental procedures of the high-throughput DHSs identification protocol. Intact nuclei are digested with DNase I and then made blunt, followed by ligation of a biotinylated linker and sonication for shearing. The products are incubated on a streptavidin column for pooling of the DNase I-cleaved ends. The extracted short adjacent DNA fragments are either hybridized to tiled arrays (DNase-chip) or subjected to library construction and next generation sequencing (DNase-sequencing). DHSs, DNase I-hypersensitive sites.

Morin et al (31) simulated the whole exon sequencing paradigm, and developed a customized capture panel for known DHSs (‘immune sequences’), specific for DHS detection in immune cells and genetic variation in immune-related diseases.

Single-cell DNase-seq (scDNase-seq)

Despite the robustness of DNase-seq technology, millions of cells are required, which limits its application in rare cases with limited cells, such as in certain cells from patients. In addition, traditional DNase-seq suffers from low sensitivity as a result of DNA loss during the multiple purification steps. Therefore, scDNase-seq, also known as Pico-seq, was developed to minimize DNA loss, and has been applied in the analysis of chromatin accessibility using single cells (32,33). To prevent loss of the small quantity of DNase I-hypersensitive DNA released by DNase I digestion of single cells, a large amount of circular plasmid DNA is added as carrier DNA in the subsequent steps of library preparation (Fig. 2). The previous application of scDNase-seq to tumor cells, NIH3T3 cells and pools of normal cells has shown that DHS patterns at the single-cell level are highly reproducible among individual cells (33). This method enables the generation of a genome-wide DHS map for rare samples, which is more valuable for clinical application.
Figure 2.

Schematic of single-cell DNase sequencing. Intact nuclei are digested by DNase I and followed by library construction including end-repair, ligation of the adaptors, PCR amplification with circular carrier DNA and high-throughput sequencing. DHSs, DNase I-hypersensitive sites.

Low-input DNase-seq (liDNase-seq)

Although scDNase-seq can identify DHSs from a small quantity of starting material, the annotation of sequencing results requires a pre-known DHS database, which is a challenge in the identification of de novo DHSs sites. Lu et al (34) introduced liDNase-seq by modifying the scDNase-seq method to achieve de novo genome-wide DHS identification using no more than 30 cells. The major technical improvements, including reducing the complexity of the purification process prior to the adaptor ligation reaction and the first amplification reaction, and modifying the size selection step by using SPRI affinity beads in place of gel purification. The process for generating DHS maps is similar to that of the ENCODE project (www.encodeproject.org and http://genome.ucsc.edu/ENCODE) and allows the identification of de novo DHSs at much higher resolution than in previous methods.

ImmunoSEQ technique

Morin et al (31) developed an ImmunoSEQ technique for the detection of known DHSs. Using this technique, whole-exome sequencing or customized DHS region sequencing can be efficiently performed. The analysis focuses on the variation of non-coding regions of immune-related diseases.

3. Functions of DHSs

DHSs are involved in gene expression regulation

DHSs are essential features of all defined types of active CREs and are often co-positioned with them (35-37). DHSs are directly involved in chromatin modeling and structural reestablishment, the recognition of regulatory proteins and regulating the initiation of transcription. DHSs are associated with nearby gene expression changes through the binding of certain regulatory proteins to their specific sequence at promoters or other CREs regions, and are thus involved in cell fate decisions, individual variation and development (22). Frank et al (19) detected thousands of CREs at which the accessibility of chromatin increased or decreased. These changes coincided with the transcription level of adjacent genes, which is important in the regulation of global gene expression, and most likely infers activation or deactivation of enhancer elements. Huang and Liew (38) identified DHSs in the 4-kb upstream locus of the cardiac myosin heavy chain-α (MHC-α) gene in the hamster and revealed a conserved GATA-motif site that interacts specifically with GATA-binding factors at different stages of cardiomyocyte development, which provided evidence for the role of GATA factors in the gene expression of cardiac MHC-α. Open DHSs often mark increases in local transcription levels, which supports the observation that open DHSs are enhancers. Similarly, closed DHSs may represent reduced enhancer activity (19). This influence is more apparent when genes are associated with two or more directional matched DHS changes. The findings identified in genome-wide association studies (GWASs) show that genetic variations frequently lie in non-coding regions of the genome that contain CREs, which suggests that gene expression change underlies the development of several complex traits.

DHSs exhibit distinguished profiles and contribute to define CREs

Cells in a specific stage and stature possess a fixed set of CREs that are accessible to trans-acting factors, and thus underlie a complex controlling network of chromatin (35,39). Each cell type has a specific set of regulatory sequences and the cumulative span of those sequence consists of >80% of the non-coding region of the genome. Studies on DHSs help to disclose delicate gene regulation mechanisms and enable extensive annotation of the genome. The genome-wide mapping of DHSs provides a novel platform for the promising investigation of a specific molecular biological problems affected by the regulation of a given gene or a group of genes (17). In addition, DHSs form a complicated, spatially- and temporally-specific network. Certain DHSs identified in one cell type by DNase-seq may not occur in the other cell types. Pan et al (40) reported that 12 DHSs in chromatin related to the Msx2 gene varied in different cell types in the chicken, when they examined anterior and posterior limb mesenchymal cells, calvarial osteoblasts and fibroblasts in embryos. Most of the DHSs were not detected as active in any of the four typed of cells, and only the DHS in the basal promoter region was present in all four tissues. One DHS was active and unique in the cells with Msx2 transcripts, and a secondary DHS was unique in non-expressing cells. The anterior and posterior limb mesenchyme cells had a distinct group of DHSs, which were more complex than those detected in calvarial osteoblast cells, which suggested that a complicated DHS pattern may be involved in the different regulation models of the Msx2 gene in these two tissues, and is involved in cell fate decision by interaction with cell-specific transcription factors to guide the transcription program of cell fate decision and development. DHSs of a certain gene may also change in response to different transcription activity. Grünweller et al (41) examined the 5'-end of the vigilin gene in chickens using the DNase-seq technique and reporter gene analysis method. They identified two candidate DHSs. One DHS was active and unique under high transcriptional activity of the vigilin gene promoter in the chicken cells, which was termed DHS1, and a secondary DHS was only found under low transcriptional activity, which was termed DHS2. The activity of the promoter of the vigilin gene was enhanced over 10 times by upstream sequences of the transcription start site (TSS). Identifying DHSs and comparing their features differs among various cell types or within a similar cell type, but culture in different circumstances is essential for revealing gene expression patterns under different conditions. This can effectively complement current understanding and may have potential clinical applications for disease treatment. The exploitation of variable and plastic patterns of active DHSs offers potential for the identification of certain cell or tissue states, which may have potential to be applied to clinical diagnoses and predictions or the evaluation of therapeutic effects. Studies investigating DHSs facilitate the identification of novel CREs, as DHSs are more promising indicators for the identification of chromatin accessibility, which have been widely used to map functional regulation elements. DHSs overlie CREs with parallel degrees of nuclease sensitivity and cover the main sequence of regulatory factor (42). DHSs usually contain CREs related to transcriptional activation on the reporter locus, such as enhancers, but can also contain transcription inhibition, such as silencers (17). A DHS map reveals the state and pattern of the presence of CREs, in addition to the variable and plastic states of chromatin in various cell types (25,29,33,43). Liu et al (44) identified 17,472 specific DHSs and transcription factor binding sites in two cell lines, the hESC H1 cell line and trophoblast (TB)-treated cell line, and constructed a transcription factor network for placental development. The specific DHSs in the TB-treated cells were found in the ‘blood vessel’ and ‘trophectoderm’, including members of the transcription factor motif family: Leucine zipper, helix-loop-helix, GATA and ETS. The model of a TB system induced by bone morphogenetic protein 4 (BMP4) was demonstrated to be important in investigating the mechanism of trophoblast development and revealed novel candidate genes involved in the regulation of human placental development. These findings indicate that DHSs enable the precise delineation of genomic CREs. Further investigations on DHSs are expected to reveal more novel regulatory elements. Identifying sequence variations of the DHSs of phylogenetic trees instead of the coding region of genes may assist in disclosing the changes and evolution of certain phenotypes. Dong et al (45) analyzed and evaluated the accelerated evolution of orthologous sequences at DHSs from the human genome and primate genomes using systematic biology methods, and constructed a comparison map between the DHSs and ancient repeat elements (AREs). Their analysis identified the local AREs of all DHSs and demonstrated that they were neutrally evolving. Therefore, it is noteworthy that ~0.44% of DHSs in the human genome are undergoing accelerated evolution (termed ace-DHSs). Further analysis of ace-DHSs is warranted for investigating the evolution of human-specific phenotypes. These DHS analyses are important in basic studies and may be of potential value in translational medicine and personalized medicine.

4. Available data of DHSs

The ENCODE project (www.encodeproject.org and http://genome.ucsc.edu/ENCODE) aims to evolve comprehensive schemes to list all human DHSs in order to map and catalog genome-wide CREs. DHSs mark transcriptionally active sites of chromatin, which may be the origin of cell selectivity. The ENCODE research institutes have performed genome-wide mapping of DHSs in >100 human cell and tissue types, and almost 3,000,000 DHSs have been identified, including 71 normal differentiated primary cells, 16 immortalized primary cells, 30 malignancy-derived cell lines and eight multipotent and pluripotent progenitor cells. The 20-50-bp reads from the DNase-seq experiments enabled unique mapping to 86.9% of the genomic sequence, allowing the interrogation of a large fraction of transposon sequences. The DHS profiles of 125 different human cell types were obtained, and of these, 34% were specific to individual cell types and only a minority were detected in all cell types (3,692). The open state of DHSs varied >100 times, but the constitution pattern was consistent in distinct cell types. It was demonstrated that ~5% of DHSs were detected in the TSS region, while the remaining 95% represented distal DHSs dispersed uniformly in intronic and intergenic regions. These data provide additional information for disclosing the mechanism of transcription.

5. DHSs and diseases

DHSs are associated with multiple diseases and have been suggested to serve distinct roles in the etiology of cancer, immune-related diseases, inflammatory bowel disease, Alzheimer's disease, bone marrow density problems, coronary artery disease, autism, and certain common diseases and complex traits (Fig. 3) (46). Recent evidence demonstrates the potential value of cell-specific and disease-related DHSs in personalized medicine. Evidence showing high overlap between human diseases and CREs has been well-documented, which confirm that ‘critical’ cell types may function as causal factors for certain diseases or help to maintain certain phenotypic traits (46).
Figure 3.

Process of identification of disease-related DHSs. DHSs, DNase I-hypersensitive sites; CRE, cis-regulatory element; TSS, transcription start site.

The accessibility or inaccessibility of the state of DHSs is reported to be associated with diseases. An increasing number of novel DHSs have been found to be associated with diseases. Specific cell and tissue types have been identified as being associated with different diseases. For example, specific immune cell types are involved in immune-related diseases (inflammatory bowel disease), and specific tissue types are involved in diseases affecting specific organs (coronary artery disease), with other associations including adrenal glands in coronary artery disease, immune systems in Alzheimer's disease and kidneys with bone marrow density (46). Thousands of tumor-specific DHSs located at promoter and enhancer regions have been detected, which have been shown to be involved in the occurrence and development of cancer. Function-related mutations of the DHS region are closely correlated with transcription initiation activity and thus result in the occurrence of certain diseases (Fig. 4). There are >100 studies on DHSs that assessed various cell or tissue types by ENCODE Alliance (124 different cell types) and NIH roadmap epigenomics group (342 different adult fetal tissue samples), which demonstrate that an overlap exists between mutations at non-coding DNA regulatory sequences and diseases and traits.
Figure 4.

Pathological mechanism of variation at DHS sites of disease-related genes. (A) DHSs act as binding anchors of activator proteins and cofactors. A multiprotein complex is assembled and bound to the ‘open’ sequences of DHSs, including promoter or enhancer regions, and after chromatin rearrangement this leads to gene transcription. (B) When pathological mutations or variations occur at crucial regions of DHSs, it influences the identification and interaction of regulatory binding proteins and the transcription function is interrupted or altered, which may subsequently cause phenotypes. DHSs, DNase I-hypersensitive sites.

GWASs have identified numerous single nucleotide polymorphisms (SNPs) at DHSs, which are associated with various types of quantitative traits and complex disorders. Local mutation density is variable throughout the genome (47). A study on 1,161 human cancer genomes revealed that the density of point mutations at the center of the DHS in the gene promoter region of somatic cells was increased (48). Numerous tissue types, including brain, pancreatic and liver tissue, have also demonstrated the enrichment of SNPs associated with DHSs in major depressive disorder (49). A 14-kb Down syndrome cell adhesion molecule deletion sequence, containing 12 CNS DHSs, was found in an autistic family, with regulatory potential affecting the biology of the central nervous system (50). De novo mutations, rich in DHSs and proximal genes, have been significantly predicted to result in the loss of transcription binding factors. For example, deletion of lysine-specific demethylase 5B binding was found at the promoter of the candidate autism risk gene, EFR3A (51). A mutation at the SNP (chr18:52417839 G>C) site was reported to be correlated with follicular thyroid cancer, which had an influence on the binding of tumor suppressor protein p53 and subsequently resulted in the decreased gene expression of thioredoxin-like 1 (TXNL1) (33). The highest prevalence of mutations was found in the hypothetical driving factor DHS chr5:1325957-1328153, located in an intron of the Cleft lip and palate transmembrane 1-like (CLPTM1L) gene and 30 kb upstream of telomerase reverse transcriptase (TERT), results in the overexpression of six adjacent genes and four of these genes [TERT, CLPTM1L, thyroid hormone receptor interactor 13 (TRIP13), lysophosphatidylcholine acyltransferase 1 (LPCAT1)] are known to be associated with cancer (52-54). Recently, a statistical method has been developed to identify distal regulatory elements with hypothetical driving mutations in breast cancer, to identify DHSs in non-coding genomic sequences associated with significant mutations in breast cancer and abnormal expression of adjacent genes, which may be important in the development of cancer (55). The mutation of chr5:1325957-1328153 at the DHS region in breast cancer was reported to be associated with the overexpression of oncogene tripartite motif containing 27 (TRIM27). In addition, abnormal activity was found with the mutation of chr6:28948439-28951450 at the DHS region. Using data from The Cancer Genome Atlas (TCGA) and breast cancer International Alliance (metabonomics) molecular taxonomy, Guo et al (56) found two hypothetical functional variants, rs62331150 and rs73838678, located at the DHS site and transcription factor binding region. Among them, rs62331150 was associated with the expression of tet methylcytosine dioxygenase 2 (TET2) in normal breast tissue and tumor tissue. Two new SNP (rs12309362 and rs9970827) were found to be significantly associated with reducing the risk of hepatocellular carcinoma (HCC) by measuring the mutations at the peak of DHSs in 1,538 patients with HCC and 1,465 normal controls (57). A study on endometriosis, by re-sequencing 1.29 mb of the 9p21 region, revealed that the mutation of rs17761446 at the DHS was associated with endometriosis, the protective G allele at this site had a strong interaction with the ANRIL promoter. Further chromatin immunoprecipitation analysis confirmed that the protective G allele also had a preferential binding capacity with transcription factor 7-like 2, EP300 and may be involved in the development of endometriosis (58). Studies on prostate cancer and breast cancer cells have revealed that different DHS patterns of androgen receptor (AR) and estrogen receptor 1 (ESR1) were of high predictive value for hormone receptor binding and may be involved in the development of these types of cancer. The quantitative measurement of DHS changes can predict the binding sites of perturbation-inducible transcription factors (59). Following activation of the transcription of AR in LNCaP cells, 244 upregulated and 486 downregulated accessible DHS regions were detected to be the candidate sites for further investigation, which may be associated with prostate cancer. CTCF and the ELK1-ETS transcription factor are potential upstream regulator elements, which are rich in open promoter regions of downregulated genes. The inhibitor of DNA-binding 1 HLH protein (ID1) is the only transcription factor that is significantly upregulated, exhibiting basal sequence enrichment in the promoter region of the upregulated gene. Therefore, CTCF, ELK1 and ID1 may be potential targets for the treatment of prostate cancer (60). Increasing evidence shows that changes in the expression of BMP4 are involved in the pathogenesis of cancer, which is associated with cancer metastasis and progression, including rectal, hepatocellular and ovarian cancer (61). In order to determine the characteristics of BMP4 transcription mediators in breast cancer, RNA-Seq and DNase-seq were analyzed in T-47D and MDA-MB-231 breast cancer cells treated with BMP4. It was confirmed that MBD2, core-binding factor-β and hypoxia-inducible factor 1α were downstream regulators of the BMP4 signal, which enhanced cell migration and decreased cell growth (62). In tumor therapy, particularly in acute myeloid leukemia (AML), intratumoral heterogeneity caused by clonal evolution has been found, which may have an influence on the effect of treatment. In order to solve this problem, the chromatin accessibility of subclones of AML was compared directly using unsupervised clustering analysis. Marked differences in the chromatin landscape and transcriptional regulation among the subclones were detected and confirmed. The data indicated that the common DHSs of individual AML subclones dominated in the clustering analysis over the subclone-specific DHSs, most likely due to the impact of shared founder mutations at the DHSs in each AML subclone. Clone-specific DHSs, runt-related transcription factor and ETS motifs are expressed in abundance in the two clones of DHSs, although GATA motifs are particularly abundant in FLT3-WT clones (63). It may be a potential strategy to use DHSs analysis to improve the treatment effect, particularly for those types of cancer with features of intraturmoral heterogeneity. Genetic variation at DHSs has been reported to be correlated with carcinogenesis (64). By analyzing 1,161 human cancer samples from 14 types of cancer, DHS profiles and SNP distributions were mapped to link to promoter activity, some of which were involved in differential nucleotide excision repair (NER) and resulted in carcinogenesis (48). Consistent with this finding, genome-wide maps of NER regions show that the repair ability of nucleotide excision was decreased with mutation at the DHS of gene promoter regions. Jin et al (33) reported that thousands of tumor-specific DHSs were identified on cells dissected from follicular thyroid carcinoma samples fixed on formalin-fixed paraffin-embedded slides. Numerous DHSs have been reported to be correlated with the development of thyroid cancer (33). A de novo mutation (chr18:52417839 G>C) at a DHS located downstream of the TXNL1 gene is associated with the formation of thyroid carcinoma. It was reported that rs62331150 located at promoter region and rs73838678 located at the enhancer region of the gene, increased the risk of breast cancer. These two SNP sites were found to be in linkage disequilibrium with rs9790517 of the adjacent TET2 gene (55). It was also found that, in samples with mutation at the rs12309362 and rs9970827 sites, the risk of being affected with HCC decreased significantly (57). Ten DHSs were identified as being mutated with abnormal expression of target genes in breast cancer (55). Mutation at the DHS chr5:1325957-1328153, was present at a high frequency in the cancer cells, resulting in the high level of transcription of certain genes close to it, including TERT, CLPTM1L, TRIP13 and LPCAT1, which has been confirmed to be associated with cancer (52,53,65). Mutations at DHSs chr5:1325957-1328153 were found to result in the high expression of TRIM27, and certain mutations in this region caused abnormal accessibility of DHS chr6:28948439-28951450, which are associated with cancer. The pattern of DHSs can be stimulated by hormones through regulation of the binding capacity of AR and ESR1 in prostate cancer cells and breast cancer cells. Following binding with AR or ESR1, the DNase I-hypersensitivity of certain sequences was found to be altered, and the regional nucleosome occupancy changed for AR binding but not for ESR1b binding, which indicated different interaction modes in AR and ESR1 regulation (59). In Gene Ontology analysis, genes associated with tumor-specific DHSs are abundant in biological processes, including the regulation of GTPase activity and response to hypoxia, and cancer-related pathways. Understanding the accessibility dynamics of chromatin in the process of disease occurrence and development can provide insights into how cell fate is regulated, and how transcriptional systems are organized and regulated in different tissues and how they are destroyed in disease states. In addition, the application of DHSs in biomedical research can expand the field of cell-selective gene regulation analysis, enabling the identification of long-range regulatory patterns of the system and previously undescribed phenomena, such as DHS activation patterns and mutation rates in abnormal and immortal cells.

6. Discussion

Although DHSs occupy a small portion of human genome, ~2% of the genome, a relatively large proportion of CREs may be involved in the establishment of well-organized expression networks in each cell type and thus contribute to the etiology of a certain disease. The comprehensive delineation of distribution, constituents and biological activities of DHSs help to map and classify functional CREs. The identification of CREs is critical for elucidating the mechanism of gene expression regulation underlying biological events, and the development and progression of certain diseases. To date, the DNase-seq technique remains one of the most efficient techniques for disclosing the known and unknown regulatory elements of diverse target cells. Cooper et al (32) released a detailed protocol in Nature that may facilitate the spread of DNase-seq analysis. However, the number of researchers able to utilize the high-throughput DHS capture technique well is limited. In addition, careful manipulation is required due to high background noise. Difficulties in performing experiments are not the greatest challenge for its wider application; bioinformatics analysis is difficult for the majority of laboratories. An insufficient number of bioinformatics technicians, particularly those with the required programming skills and familiarity in this field, is the main concern. The exploitation of software or online tools is required to simplify bioinformatics analysis and enable easier understanding by researchers. Although evidence has identified an association between DHSs and diseases, using DHSs as a marker for prediction, prevention and pre-clinical diagnosis remains a challenge due to the complexity of regulation of the DHS profile. The uncertainty of genome-wide prediction of CREs and specific DHS related to gene function may increase the challenge of its application in clinical practice. Future work should focus on the investigation of more delicate methods to locate DHSs precisely, help disclose the mechanisms underlying gene expression differences, determine how to modify chromatin accessibility, reveal how changes in transcription factor binding are driven by genetic variations, and guide how to integrate DHSs in clinical practice.
  65 in total

Review 1.  Locus control regions: coming of age at a decade plus.

Authors:  Q Li; S Harju; K R Peterson
Journal:  Trends Genet       Date:  1999-10       Impact factor: 11.639

2.  Identifying gene regulatory elements by genome-wide recovery of DNase hypersensitive sites.

Authors:  Gregory E Crawford; Ingeborg E Holt; James C Mullikin; Denise Tai; Robert Blakesley; Gerard Bouffard; Alice Young; Catherine Masiello; Eric D Green; Tyra G Wolfsberg; Francis S Collins
Journal:  Proc Natl Acad Sci U S A       Date:  2004-01-19       Impact factor: 11.205

3.  Genome-wide mapping of DNase hypersensitive sites using massively parallel signature sequencing (MPSS).

Authors:  Gregory E Crawford; Ingeborg E Holt; James Whittle; Bryn D Webb; Denise Tai; Sean Davis; Elliott H Margulies; YiDong Chen; John A Bernat; David Ginsburg; Daixing Zhou; Shujun Luo; Thomas J Vasicek; Mark J Daly; Tyra G Wolfsberg; Francis S Collins
Journal:  Genome Res       Date:  2005-12-12       Impact factor: 9.043

4.  DNase-chip: a high-resolution method to identify DNase I hypersensitive sites using tiled microarrays.

Authors:  Gregory E Crawford; Sean Davis; Peter C Scacheri; Gabriel Renaud; Mohamad J Halawi; Michael R Erdos; Roland Green; Paul S Meltzer; Tyra G Wolfsberg; Francis S Collins
Journal:  Nat Methods       Date:  2006-07       Impact factor: 28.547

Review 5.  Insulators: exploiting transcriptional and epigenetic mechanisms.

Authors:  Miklos Gaszner; Gary Felsenfeld
Journal:  Nat Rev Genet       Date:  2006-08-15       Impact factor: 53.242

6.  Mapping and characterization of DNase I hypersensitive sites in Arabidopsis chromatin.

Authors:  Yuichi Kodama; Shingo Nagaya; Atsuhiko Shinmyo; Ko Kato
Journal:  Plant Cell Physiol       Date:  2007-02-05       Impact factor: 4.927

7.  High-resolution mapping and characterization of open chromatin across the genome.

Authors:  Alan P Boyle; Sean Davis; Hennady P Shulha; Paul Meltzer; Elliott H Margulies; Zhiping Weng; Terrence S Furey; Gregory E Crawford
Journal:  Cell       Date:  2008-01-25       Impact factor: 41.582

8.  Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genome.

Authors:  Nathaniel D Heintzman; Rhona K Stuart; Gary Hon; Yutao Fu; Christina W Ching; R David Hawkins; Leah O Barrera; Sara Van Calcar; Chunxu Qu; Keith A Ching; Wei Wang; Zhiping Weng; Roland D Green; Gregory E Crawford; Bing Ren
Journal:  Nat Genet       Date:  2007-02-04       Impact factor: 38.330

9.  DNA hypomethylation and germ cell-specific expression of testis-specific H2B histone gene.

Authors:  Y C Choi; C B Chae
Journal:  J Biol Chem       Date:  1991-10-25       Impact factor: 5.157

10.  The LPS-induced transcriptional upregulation of the chicken lysozyme locus involves CTCF eviction and noncoding RNA transcription.

Authors:  Pascal Lefevre; James Witham; Claire E Lacroix; Peter N Cockerill; Constanze Bonifer
Journal:  Mol Cell       Date:  2008-10-10       Impact factor: 17.970

View more
  3 in total

1.  Gene expression pattern of trophoblast-specific transcription factors in trophectoderm by analysis of single-cell RNA-seq data of human blastocyst.

Authors:  Yajun Liu; Yi Zhang; Shiwen Li; Jinquan Cui
Journal:  Funct Integr Genomics       Date:  2021-02-05       Impact factor: 3.410

2.  iDHS-FFLG: Identifying DNase I Hypersensitive Sites by Feature Fusion and Local-Global Feature Extraction Network.

Authors:  Lei-Shan Wang; Zhan-Li Sun
Journal:  Interdiscip Sci       Date:  2022-09-27       Impact factor: 3.492

3.  An RNA Polymerase III General Transcription Factor Engages in Cell Type-Specific Chromatin Looping.

Authors:  Lara de Llobet Cucalon; Chiara Di Vona; Marco Morselli; Marco Vezzoli; Barbara Montanini; Martin Teichmann; Susana de la Luna; Roberto Ferrari
Journal:  Int J Mol Sci       Date:  2022-02-18       Impact factor: 5.923

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.