Literature DB >> 34532451

High-throughput chromosome conformation capture-based analysis of higher-order chromatin structure in nasopharyngeal carcinoma.

Yuanyuan Yang1, Mingfa Chen2, Lingjun Cheng1, Canping Su1, Xiyi Liao3, Hongzhang He1, Mingming You1, Gang Rui4, Guolin Hong1.   

Abstract

BACKGROUND: Firstly, we aimed to compare the differences of higher-order chromatin structure between nasopharyngeal carcinoma (NPC) and normal nasopharyngeal tissues. The second objective was to analyze the specific chromatin interaction site of NPC and the NPC-related genes regulated by this interaction site.
METHODS: We included 6 NPC patients and 6 healthy controls to obtain the sequencing results of highest-throughput chromosome conformation capture (Hi-C) technique, followed by further analysis of the specific chromatin interaction sites in NPC.
RESULTS: We found an abnormal ultra-long distance interaction site on the chromosome 7p in the CNE210 sample, which was caused by a fusion gene SEPT7P2-PSPH. Additionally, a significant interaction site between chromosome 8q and 3p was revealed in the samples CNE25, CNE29, and CNE211, which was the interaction between 1.5 kb downstream of ASAP1 and 0.8 kb upstream of CTNNB1 gene. Further quantitative polymerase chain reaction (qPCR) revealed that ASAP1 and CTNNB1 genes were more highly expressed in CNE25, CNE29, and CNE211 than in the Np group, preliminarily indicating that this interaction site was likely related to the high expression of ASAP1 and CTNNB1 in NPC.
CONCLUSIONS: Through Hi-C analysis, we analyzed the specific chromatin interaction sites associated with NPC, and found the chromosomal translocation and chromatin interaction sites associated with NPC based on statistical analysis. This study has certain guiding significance for in-depth study of the mechanism of NPC occurrence and development. 2021 Annals of Translational Medicine. All rights reserved.

Entities:  

Keywords:  Hi-C analysis; Hi-C library preparation; chromatin interaction; nasopharyngeal carcinoma (NPC)

Year:  2021        PMID: 34532451      PMCID: PMC8422082          DOI: 10.21037/atm-21-3273

Source DB:  PubMed          Journal:  Ann Transl Med        ISSN: 2305-5839


Introduction

Nasopharyngeal carcinoma (NPC), as a common malignant tumor of the head and neck (1) and it has significant geographical and ethnic differences in its incidence. In 2020, there were 133,354 newly diagnosed NPC patients worldwide (2). At present, the etiology and pathogenesis of NPC remain unclear, and the treatment of NPC is still based on radiotherapy and chemotherapy. Generally, NPC patients are found to progress to late stage, leading to poor prognosis and treatment outcomes (3-5). Therefore, there is an urgent need to investigate and elucidate the pathogenesis of NPC. According to previous clinical studies, mutations in some genetic information are closely related with NPC development (6). Chromosomes are carriers of genetic information, which occupy independent nuclear regions in a highly organized way rather than random structures (7). Chromosome conformation reflects and also probably regulates basic biological processes (8). Highest-throughput chromosome conformation capture (Hi-C) takes the whole nucleus as the research object. This technology utilizes high-throughput sequencing combined with bioinformatic methods to study genome-wide chromosomal interactions, in order to obtain the genome sequence and chromatin three-dimensional structure information at the chromosome level, which are crucial for the normal function of cells (9-11). A genomic contact matrix produced by Hi-C analysis, at a specific resolution given by sequencing depth, measures the strength of interaction between 2 genomic loci (12). The combination of Hi-C analysis and chromatin co-immunoprecipitation sequencing (ChIP-Seq) with transcriptome analysis can elaborate the mechanisms involved in trait formation in organisms from gene regulatory networks and epigenetic networks. Human genome chromatin is highly folded within a narrow nucleus and there are some interactions and regulatory relationships within and between chromatin (13). However, studies on specific chromatin interaction sites in NPC are still limited. In this study, by utilizing Hi-C technology, an important method to study the spatial structure of chromatin, we aimed to analyze and compare the differences in the higher-order chromatin structure between NPC and adjacent tissues, and explore the specific chromatin interaction sites of NPC to further investigate the NPC related genes regulated by the interaction sites. We present the following article in accordance with the MDAR reporting checklist (available at https://dx.doi.org/10.21037/atm-21-3273).

Methods

General information

Biopsy tissue samples were obtained from 6 healthy controls (Np group) and 6 NPC patients (CNE group) admitted to the First Affiliated Hospital of Xiamen University between February 2017 and December 2018. All participants provided written informed consent, and the study was conducted with the approval of the Ethics Committee of The First Affiliated Hospital of Xiamen. All procedures performed in this study involving human participants were in accordance with the Declaration of Helsinki (as revised in 2013).

Study methods

Sample sequencing

Fresh specimens were fixed and cross-linked with 37% formaldehyde according to known methods, followed by cell lysis. Then, the quality of the samples was examined, and those deemed to be good quality were utilized for Hi-C fragment preparation. Chromatin digestion was performed using restriction endonuclease (HindIII/MboI) and the digestion effect was examined. Next, on completion of biotin labeling, blunt-end ligation, and DNA purification, Hi-C samples were prepared and subsequently taken for DNA quality testing. The samples that passed the DNA quality test were utilized for standard library preparation. On completion of removal of biotin, sonication, end repair, and “A” tailing, the fragments containing biotin were captured and bound to ligated adapters, and eventually Hi-C libraries were obtained after polymerase chain reaction (PCR) amplification and a final quality control. The next sequencing step was completed utilizing a HiSeq 3000 sequencing system (Illumina, San Diego, CA, USA). The specimens were further analyzed for the overall understanding of intrachromosomal (cis-acting) and interchromosomal (trans-acting) and NPC specific chromatin interaction sites. Sample information is shown in .
Table 1

Sample information

Sample nameLatin name of speciesSample typeEndonuclease
Np6914Homo sapiensNpMboI
Np6915Homo sapiensNpMboI
CNE29Homo sapiensCNEMboI
CNE210Homo sapiensCNEMboI
CNE211Homo sapiensCNEMboI
Np691Homo sapiensNpMboI
Np692Homo sapiensNpMboI
Np693Homo sapiensNpMboI
CNE24Homo sapiensCNEMboI
CNE25Homo sapiensCNEMboI
CNE28Homo sapiensCNEMboI
Np697Homo sapiensNpMboI

Quality analysis of sequencing data

The sequencing error rate was related to base quality and was also affected by multiple factors such as the sequencer itself, sequencing reagents, and samples. The base quality could be measured by the Phred score (or Q score) which was calculated with the following formula: The Phred score estimated the probability of incorrect base call with “e” as an error rate. The corresponding relationship is shown in .
Table 2

Corresponding relationship of base identification and Phred score

Phred scoreErrorAccuracyQ-score
101/1090%Q10
201/10099%Q20
301/1,00099.90%Q30
401/10,00099.99%Q40

Base distribution analysis of sequencing data

The base distribution of reads was obtained with the base position of the filtered reads sequence as the abscissa, and the proportion of adenosine thymine cytosine guanine (ATCG) bases and N (unidentified bases) at each position as the ordinate.

Analysis of cis-trans interaction proportion

Cis refers to paired reads on the same chromosome, and trans refers to paired reads on different chromosomes. In the genome-wide contact matrix, this pattern is characterized by significant intra-chromosomal interactions and fewer inter-chromosomal interactions along the diagonal. Part of the reason for this pattern is that chromosomes are physically separated in the nucleus and occupy independent nuclear regions. This pattern can simply evaluate the quality of data. The existence of noise in the matrix, such as random noise, may cause cis and trans interactions to be similar. A high-noise experiment will also lead to similar proportions of cis and trans interactions. Therefore, this pattern is usually quantified through a statistical ratio of cis-trans to total interaction (14). On completion of the calculation of genomic distance S (distance between nucleotide sequences in base pairs) and chromosome interaction frequencies (IFs), their logarithms were taken for mapping.

Establishment of contact matrix

The Hi-C contact matrix refers to a list of DNA-DNA contacts generated using Hi-C data, which represents the actual interaction probability of various sites in the genome (15). Genome-wide contact matrix can be constructed by efficient statistics of all contacts of the whole genome, which can be visualized by the software HiC-Pro as a heat map, that is, the Hi-C map (16). The resolution of Hi-C map is defined as: when a contact matrix is constructed with the minimum unit length, 80% of the unit length contains a minimum of over 1,000 contacts, and this minimum unit length is the resolution (17). Higher resolution reflects smaller scope of the genome. Lieberman-Aiden’s study on Hi-C found that the mammalian genome can be divided into 2 chromosomal structural regions called A/B compartments; much of the interaction between loci occurs in the same chromosomal structural region (18). Further, studies have revealed closer relations of A compartment with open chromatin, and of B compartment with closed chromatin. The A/B compartments are cell-specific and can converse in different tissues and cells, this conversation partly relates to gene expression regulation (19). Therefore, analysis of A/B compartments can determine intra- and inter-chromosomal interactions.

Calculation of topologically associated domains (TAD)-fusion score

In a mammalian Hi-C map with about 40 kb resolution, a distinct triangular shape is presented, and this structure is named TAD (20) with significant boundaries among domains. It has been shown that CTCF and structural proteins such as cohesin significantly affect the formation of TAD boundaries (21). The CTCF is a repressive transcription factor that directionally binds specific DNA sequence sites (22), and it mediates insulator looping with off-target genes while disrupting enhancer looping, thereby inhibiting transcription of genes (23,24). Servant introduced the TAD-fusion score to quantitatively describe that genomic fragment deletion is more likely to cause TAD-fusion, subsequently cause abnormal enhancer-promoter interactions, and ultimately diseases (16). This score is a measure of the degree of compactness of TAD. The more compact the TAD, the stronger the interaction inside the TAD and the weaker the interaction among the TAD (25). The TAD-fusion score = internal contacts/external contacts.

Software

The software Bowtie2 (Johns Hopkins University, Baltimore, MD, USA) and HiC-Pro (26) were used to generate the reference sequences and process the HiC data.

Results

Sequencing quality distribution map

The sequencing quality distribution map was obtained with the base position of the filtered sequence as the abscissa and the average quality value of each position as the ordinate, in order to roughly reflect the stability of sequencing quality during sequencing (). The map showed that the overall sequencing quality of the 12 specimens was qualified, and base call accuracy was above 99.9%.
Figure 1

Sequencing quality distribution map.

Sequencing quality distribution map.

Base distribution map

As shown in , there was an equal proportion of A and T bases and an equal proportion of G and C bases at each position of the Reads during sequencing. This result indicated no sequence bias, and the base distribution of the samples was in line with the expectation.
Figure 2

Base distribution map.

Base distribution map.

Cis-trans interaction proportion

Analysis of the proportion of cis-trans interaction showed that the cis-acting sites were greater than 80% in all 12 specimens (), indicating that the cis-acting was greater than the trans-acting in both CNE and Np groups.
Figure 3

Cis-trans interaction proportion.

Cis-trans interaction proportion.

Cis-trans IFs

The IFs obtained from Hi-C-based experiments generally decay with increasing distance in a given range, and the slopes of IFs could be described by the interaction decay exponents (IDEs) (18). The result showed () that cis-trans IFs decreased significantly with increasing sequence space.
Figure 4

Cis-trans interaction frequencies. Abscissa: interchromatin distance at different sites; ordinate: interaction strength.

Cis-trans interaction frequencies. Abscissa: interchromatin distance at different sites; ordinate: interaction strength.

Chromosome interactions

Intrachromosomal interactions

From yellow to red, intrachromosomal interaction intensity was gradually increased (). The result showed an abnormal ultra-long distance interaction site on the chromosome 7p in the CNE210 sample, which was caused by a fusion gene, SEPT7P2-PSPH, that has been previously reported (27).
Figure 5

Heatmap of intrachromosomal interactions.

Heatmap of intrachromosomal interactions.

Interchromosomal interactions

Although chromosomes occupy independent nuclear regions, there are interchromosomal interactions due to the small space of the nucleus. It has been found that in human and murine cells, long chromosomes tend to be closer to long chromosomes, and short ones to short ones (28). After normalizing the interaction between chromosomes, we obtained the contact matrix represented as a heatmap. The higher value of the interchromosomal interaction, the closer the 2 chromosomes in space. As shown in , a significant interaction site between chromosome 8q and 3p was found in the samples CNE25, CNE29, and CNE211, which was the interaction between 1.5 kb downstream of ASAP1 and 0.8 kb upstream of CTNNB1 gene. The quantitative polymerase chain reaction (qPCR) revealed that ASAP1 and CTNNB1 genes were expressed higher in CNE25, CNE29, and CNE211 than in the Np group.
Figure 6

Heatmap of interchromosomal interactions after normalization.

Heatmap of interchromosomal interactions after normalization.

Genome-wide interactions

A genome-wide contact matrix was constructed on the sequences obtained from Hi-C sequencing, and then a Hi-C map was obtained after normalization (). The color of each point represents the log value of interaction intensity of the corresponding genomic loci. From yellow to red, interaction intensity was gradually enhanced. From the obvious red diagonal in the , we could see that the interaction of homologous chromosomes was stronger than that of non-homologous chromosomes, and the intrachromosomal interaction was also stronger than the interchromosomal interactions.
Figure 7

Heatmap of genome-wide interactions.

Heatmap of genome-wide interactions.

Distribution of TAD-fusion score

We randomly selected a region of the chromosomes of 12 samples to construct an interactive heatmap and subsequently to calculate the TAD-fusion score. The comparison of the scores of 12 samples was used to determine whether the compactness of the TAD changes under different conditions. As shown in , in comparison with the Np group, higher fusion score was identified in the CNE group, indicating a significant TAD sliding in NPC.
Figure 8

TAD score distribution of each sample. TAD, topologically associated domains.

TAD score distribution of each sample. TAD, topologically associated domains.

Discussion

Currently, NPC treatment is still based on traditional chemotherapy, and there is a lack of specific targeted agents for NPC. In recent years, the development of next-generation sequencing technology has greatly promoted the study of genes, transcriptome, and epigenome in human cancers. In this study, on completion of Hi-C library preparations, sequencing, and establishment of a contact matrix, a genome-wide interaction heatmap was obtained through use of the latest Hi-C technology. Our results were further analyzed based on existing studies, so as to explore specific chromosome interaction sites in NPC and provide a reliable experimental basis for clinical studies on NPC pathogenesis and targeted therapy. A study by Zhang et al. (29), with paraffin-embedded samples from 127 NPC patients, found that HOXC6 expression was significantly correlated with T stage, N stage, and clinical stage of NPC. Lu et al. (30) reported that EVI1expression was increased in NPC tissues and cell lines, subsequently promoting the epithelial-mesenchymal transition of NPC cells and consequently promoting the differentiation of cancer cells. Wang et al. (31), through both in vitro and in vivo experiments, demonstrated the upregulation of TMPRSS3 expression in human NPC tissues and cell lines, and inhibition of proliferation and tumorigenicity of NPC cells by knockdown of TMPRSS3 expression. Sun et al. (32) showed the promotion of NPC cell proliferation by HOXA10. Zhang et al. (33) linked the promotion of NPC invasion and metastasis with activation of the NFAT1/ITGA6 signaling pathway by NFAT1. Li et al. (34) pointed out that measuring SLIT2 promoter methylation in blood samples was a potential diagnostic method for NPC, and the diagnostic accuracy of this method could be improved by combining multiple samples. Our results showed that the cis-acting was greater than the trans-acting in both CNE and Np groups; cis-trans IFs decreased significantly with increasing sequence space; and there were some specific chromosome interaction sites in the NPC samples. A significant interaction site between chromosome 8q and 3p was found in the samples CNE25, CNE29, and CNE211, which was the interaction between 1.5 kb downstream of ASAP1 and 0.8 kb upstream of CTNNB1 gene. Further qPCR revealed that ASAP1 and CTNNB1 genes were more highly expressed in CNE25, CNE29, and CNE211 than in the Np group, preliminarily indicating that this interaction site was probably related to the high expression of ASAP1 and CTNNB1 in NPC. Through Hi-C analysis, we analyzed the specific chromatin interaction sites associated with NPC, and located the chromosomal translocation and chromatin interaction sites associated with NPC based on statistical analysis. This study has certain guiding significance for in-depth study of NPC pathogenesis, but these outcomes require confirmation by further studies. In summary, Hi-C technology can realize the screening and verification of high-throughput sequencing in large sample sizes in a short time. Therefore, it is more conducive to studies on early diagnosis and pathogenesis of cancers, and contributes to targeted drug therapy and individualized treatment for cancer patients. The article’s supplementary files as
  31 in total

Review 1.  Nasopharyngeal carcinoma.

Authors:  Yu-Pei Chen; Anthony T C Chan; Quynh-Thu Le; Pierre Blanchard; Ying Sun; Jun Ma
Journal:  Lancet       Date:  2019-06-06       Impact factor: 79.321

Review 2.  The 3D Genome as Moderator of Chromosomal Communication.

Authors:  Job Dekker; Leonid Mirny
Journal:  Cell       Date:  2016-03-10       Impact factor: 41.582

3.  Comprehensive mapping of long-range interactions reveals folding principles of the human genome.

Authors:  Erez Lieberman-Aiden; Nynke L van Berkum; Louise Williams; Maxim Imakaev; Tobias Ragoczy; Agnes Telling; Ido Amit; Bryan R Lajoie; Peter J Sabo; Michael O Dorschner; Richard Sandstrom; Bradley Bernstein; M A Bender; Mark Groudine; Andreas Gnirke; John Stamatoyannopoulos; Leonid A Mirny; Eric S Lander; Job Dekker
Journal:  Science       Date:  2009-10-09       Impact factor: 47.728

4.  The role of chemotherapy in the treatment of stage II nasopharyngeal carcinoma: Retrospective analysis of the national cancer database.

Authors:  Zaheer Ahmed; Lara Kujtan; Kevin Kennedy; Valerie Wood; David Schomas; Janakiraman Subramanian
Journal:  Cancer Med       Date:  2019-02-21       Impact factor: 4.452

5.  Characteristic And Novel Therapeutic Strategies Of Nasopharyngeal Carcinoma With Synchronous Metastasis.

Authors:  Wenjun Liao; Maolang Tian; Nianyong Chen
Journal:  Cancer Manag Res       Date:  2019-09-16       Impact factor: 3.989

6.  EVI1 promotes epithelial-to-mesenchymal transition, cancer stem cell features and chemo-/radioresistance in nasopharyngeal carcinoma.

Authors:  Yaoyong Lu; Yingying Liang; Xin Zheng; Xubin Deng; Wendong Huang; Gong Zhang
Journal:  J Exp Clin Cancer Res       Date:  2019-02-15

7.  Pinostilbene Hydrate Inhibits the Migration and Invasion of Human Nasopharyngeal Carcinoma Cells by Downregulating MMP-2 Expression and Suppressing Epithelial-Mesenchymal Transition Through the Mitogen-Activated Protein Kinase Signaling Pathways.

Authors:  Pao-Yu Tseng; Yen-Tze Liu; Chia-Chieh Lin; Yi-Ching Chuang; Yu-Sheng Lo; Yi-Ting Hsi; Ming-Ju Hsieh; Mu-Kuan Chen
Journal:  Front Oncol       Date:  2019-12-03       Impact factor: 6.244

8.  Independence of chromatin conformation and gene regulation during Drosophila dorsoventral patterning.

Authors:  Elizabeth Ing-Simmons; Roshan Vaid; Xin Yang Bing; Michael Levine; Mattias Mannervik; Juan M Vaquerizas
Journal:  Nat Genet       Date:  2021-04-01       Impact factor: 38.330

Review 9.  The macro and micro of chromosome conformation capture.

Authors:  Viraat Y Goel; Anders S Hansen
Journal:  Wiley Interdiscip Rev Dev Biol       Date:  2020-09-28       Impact factor: 5.814

10.  Control of cell identity genes occurs in insulated neighborhoods in mammalian chromosomes.

Authors:  Jill M Dowen; Zi Peng Fan; Denes Hnisz; Gang Ren; Brian J Abraham; Lyndon N Zhang; Abraham S Weintraub; Jurian Schujiers; Tong Ihn Lee; Keji Zhao; Richard A Young
Journal:  Cell       Date:  2014-10-09       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.