Literature DB >> 32869528

Genome-wide DNA methylation analysis by MethylRad and the transcriptome profiles reveal the potential cancer-related lncRNAs in colon cancer.

Guixi Zheng1, Yuzhi Zhang2, Hongchun Wang1, E Ding1, Ailin Qu1, Peng Su3, Yongmei Yang1, Mingjin Zou1, Yi Zhang1.   

Abstract

Colon cancer (CC) is characterized by global aberrant DNA methylation that may affect gene expression and genomic stability. A series of studies have demonstrated that DNA methylation could regulate the expressions of not only protein-coding genes but also ncRNAs. However, the regulatory role of lncRNA genes methylaton in CC remains largely unknown. In the present study, we systemically characterize the profile of DNA methylation, especially the aberrant methylation of lncRNAs genes using MethylRAD technology. A total of 132 999 CCGG/8487 CCWGG sites were identified as differentially methylated sites (DMSs), which were mainly located on the introns and intergenic elements. Moreover, 1,359 CCGG/1,052 CCWGG differentially methylated genes (DMGs) were screened. Our results demonstrated that aberrant methylation of lncRNA genes occurred most frequently, accounting for 37.5% and 44.3% in CCGG and CCWGG DMGs respectively. In addition, 963 lncRNA DMGs were co-analyzed with 1328 differentially expressed lncRNAs which were identified from TCGA database. We found that 15 lncRNAs might be CC-related lncRNAs. ZNF667-AS1 and MAFA-AS1 were down-regulated in CC, which might be silenced by hypermethylation. Besides, 13 lncRNAs were hypomethylated and up-regulated in CC. Moreover, our results validated the expression and methylation level of CC-related lncRNAs by RT-qPCR and pyrosequencing assay. In conclusion, we performed a genome-wide DNA methylation analysis by MethylRAD to acquire both CCGG and CCWGG DMSs and DMGs in CC. The results screened lncRNA DMSs as potential biomarkers and identified 15 lncRNAs as CC-related lncRNAs. This study provided novel therapy targets and valuable insights into molecular mechanism in tumorigenesis and development of CC.
© 2020 The Authors. Cancer Medicine published by John Wiley & Sons Ltd.

Entities:  

Keywords:  Bioinformatic analysis; Colon cancer; MethylRAD; lncRNA gene methylation

Mesh:

Substances:

Year:  2020        PMID: 32869528      PMCID: PMC7571838          DOI: 10.1002/cam4.3412

Source DB:  PubMed          Journal:  Cancer Med        ISSN: 2045-7634            Impact factor:   4.452


INTRODUCTION

Colorectal cancer (CRC) is the third most common malignant cancer worldwide, and it is also the second leading cause of cancer‐related deaths. The World Health Organization (WHO) predicts a substantial increase of newly diagnosed CRC cases and an increase of 80% in CRC‐related deaths by 2030. Colon cancer (CC) and rectal cancer (RC) are usually recognized as a single tumor entity called CRC in all fields of research and clinical practice. However, there are obvious differences in molecular mechanism, pathology, surgical topography, and multimodal treatment between CC and RC. In the present study, we focused on CC which is resulted from the accumulation of a variety of genetic and epigenetic alterations in colon epithelial cells. Several studies have demonstrated that epigenetic alterations occur earlier and more frequently than genetic alterations in CC. , Approximately 75% CC arises from long‐term accumulation of epigenetic alterations. In the past decades, a large number of epigenetic aberrations have been identified in CC patients, such as DNA methylation, histone modification, and expressions of microRNAs and long non‐coding RNAs (lncRNAs). Global DNA hypomethylation has been considered as a common epigenetic feature in CC patients. Some DNA methylations, such as SEPT9, MLH1, APC, LINE‐1, and so on, have been identified as markers for early detection, prognosis and/or treatment response of CC. , It has also been reported that silencing of multiple tumor suppressor genes by DNA hypermethylation plays a key role in the process of initiation, development, and metastasis of CC. Previous studies on regulation of DNA methylation mainly focus on protein‐coding genes. Protein‐coding genes are less than 2% of the whole genome, and more than 80% genes are transcribed into noncoding RNAs (ncRNAs). A series of studies have demonstrated that DNA methylation not only regulates the expressions of protein‐coding genes but also affects ncRNAs (including miRNAs and lncRNAs). , ncRNAs have attracted immense research attention, and quite a few miRNAs and lncRNAs are dysregulated and involved in the initiation and progression of CC. However, the regulatory role of methylaton of lncRNA genes in CC remains largely unknown. In the present study, we aimed to identify cancer‐related lncRNAs in CC by co‐analysis of methylation and transcriptome profiles. Firstly, we conducted the genome‐wide DNA methylation profiling of five paired CC tumor tissues and adjacent normal tissues using MethylRAD technology, followed by high‐throughput sequencing. We screened the differentially methylated sites (DMSs) and differentially methylated genes (DMGs) of both CCGG and CCWGG. Moreover, the RNA sequencing data of CC patients were downloaded from The Cancer Genome Atlas (TCGA) database, and the differentially expressed lncRNAs were analyzed. We found that the dysregulated lncRNAs might be caused by aberrant methylation. Finally, we identified that ZNF667AS1 and MAFA‐AS1 were hypermethylated and down‐regulated, while the other 13 lncRNAs were hypomethylated and up‐regulated in CC. These findings suggested that those above‐mentioned 15 lncRNAs were CC‐related lncRNAs. Furthermore, we validated the expression and methylation levels of these 15 lncRNAs by RT‐qPCR and pyrosequencing.

MATERIAL AND METHODS

Subjects

Five CC patients for MethylRad assay and 20 CC patients for validation assay were recruited from the Department of General Surgery, Qilu Hospital of Shandong University, between December 2013 and December 2016. The diagnosis of CC was confirmed by histopathology or histobiopsy. CC tissues and paired adjacent normal mucosal tissues were collected since CC is a malignant epithelial tumor and originate from glandular epithelium of the colonic mucosa. All tissues were immediately frozen in liquid nitrogen and stored at −80°C prior to DNA extraction. Written informed consent was obtained from every participant for use of tissue samples.

MethylRAD library preparation and sequencing

The MethylRAD libraries were constructed using a previously described protocol. Briefly, 200 ng DNA was digested with enzyme FspEI (NEB), and the adapters were ligated to the DNA fragments. Subsequently, ligation products were amplified by PCR, and the resultant amplicons were purified using QIAquick PCR Purification Kit (Qiagen) and then pooled for single‐end sequencing using the Illumina HiseqXTen platform. Raw reads were first subjected to quality filtering and adaptor trimming. Base quality values were calculated using a Phred quality score (Qsanger=−10log10p). The high‐quality (HQ) reads were subsequently aligned against the genome sequence of Crassostrea gigas. FspEI sites (CmC/mCDS) extracted from the genome of Crassostrea gigas were built as reference sites, and HQ reads were mapped against these reference sites using the SOAP program. For relative quantification of MethylRAD data, the number of reads supporting methylcytosines was calculated and normalized as RPM (reads per million).

Raw data re‐filtering and quality control

The sequences containing adapters and low‐quality reads were excluded from raw reads, and clean reads were generated. Pear software was used to re‐filter paired clean reads according to the two criteria as follows: (a) low‐quality reads containing more than 15% of bases, of which the quality scores were lower than 30, were excluded, and (b) sequences containing more than 8% N base were excluded. Then the enzyme reads of each sample were generated by deleting the labels without the digestion site of FspEI enzyme.

Identification of DMSs

The labels containing CCGG or CCWGG sites were extracted as reference sequences. Then the enzyme reads were mapped to the established reference sequences using SOAP software. Moreover, the enzyme reads with the unique alignment position on the reference gene were considered as mapping reads. The methylation levels of CCGG and CCWGG sites were reflected by the sequencing depth of the methylation label. The relative quantification value of methylation level of sites in each sample was calculated by site coverage reads/HQ read number of the library*106. The DMSs between tumor and control groups were analyzed by EdgeR using R software. The methylated sites with P < .05 and log2FC > 1 between the two groups were considered as DMSs.

Identification of DMGs

The methylation level of each gene was represented by the sum of methylation level of all the methylated sites in this gene. The differential methylation level of gene was compared between tumor and adjacent normal tissue groups. The methylated genes with P < .05 and log2FC > 1 were considered as DMGs.

The annotation of lncRNAs

The data annotation of human lncRNAs was retrieved from GENCODE v25. The gene name, sequence and description were retrieved from HGNC using R script. The lncRNA DMSs and DMGs were selected after annotation.

Identification of differentially expressed lncRNAs in TCGA database

The RNA sequencing data in CC was downloaded from TCGA database using R script. The differential expression level of each gene was analyzed by DESeq using R script. The lncRNAs with P < .05 and log2FC > 1 were considered as differentially expressed lncRNAs.

Identification of CC‐related lncRNAs

The lncRNA DMGs in methylation file and dysregulated lncRNAs in RNA sequencing file were co‐analyzed by Venny 2.1. The lncRNAs, which were hypermethylated and down‐regulated, or were hypomethylated and up‐regulated in CC, were screened to perform further functional analysis.

The functional enrichment analysis of cancer‐related lncRNAs

LncRNA2Target v2.0 and StarBase v3.0 were used to predict the functions of cancer‐related lncRNAs. , LncRNA2Target v2.0 provides a comprehensive resource of lncRNA‐target relationships, which were validated by immunoprecipitation assay, RNA pull‐down assay and luciferase reporter assay. StarBase v3.0 is an open‐source platform for studying the miRNA‐ncRNA, miRNA‐mRNA, ncRNA‐RNA, RNA‐RNA, RBP‐ncRNA, and RBP‐mRNA interactions from CLIP‐Seq, degradome‐seq, and RNA‐RNA interactome data. It identifies more than 1.5 million RNA‐RNA interactions from multidimensional sequencing data. The co‐expressed protein‐coding genes were screened for Gene Ontology (GO) enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis. The GO categories and enriched pathways were reported as significant only if the P value < .05.

The prediction of lncRNA‐miRNA interactions

DIANA‐LncBase experimental v.2 was employed to predict miRNAs that interacted with cancer‐related lncRNAs. DIANA‐Tools provides experimentally verified miRNA targets on protein‐coding RNAs and ncRNAs (TarBase v7.0 and LncBase).

Quantitative real‐time qPCR

Total RNA was isolated from 20 paired tumor and adjacent nontumor tissues of CC patients using Trizol reagent (Thermo Fisher Scientific) according to the manufacturer's instructions. The cDNA was synthesized using the Prime Script™ RT Reagent Kit (Takara, Dalian, China). The reaction volume was 20 µL consisting of 1 µg of template RNA, 4 µL of 5 × Prime Script buffer mix, 1 µL of oligo dT primer, and 1 µL of Prime Script RT enzyme mix and ddH2O. The mixture was first incubated at 37°C for 30 minutes, followed by 85°C for 5 seconds. Then, qPCR was performed with SYBR Premix ExTaq™ (Takara) on the CFX96TM Real‐Time System (Bio‐Rad Laboratories Inc, Hercules, CA, USA). The thermocycling profile of reaction was 95°C for 5min, followed by 40 cycles of 95°C for 5s, 60°C for 20s, and 72°C 20s. The relative expression of lncRNAs was calculated using 2‐ΔΔCt method. The primer sequences of lncRNAs are shown in Table S1.

Pyrosequencing assay

Genomic DNA was extracted from 20 paired tumor and adjacent nontumor tissues of CC patients using the QIAamp DNA Mini Kit (Qiagen, Hilden, Germany). Then DNA was bisulfite‐converted using the EpiTect Bisulfite Kit (Qiagen, Hilden, Germany) and PCR was performed using PyroMark PCR Kit (Qiagen, Hilden, Germany) according to the manufacturer's instructions. The methylation levels of 15 lncRNAs gene were measured by pyrosequencing assay which was performed on the PyroMark Gold Q96 ID (Qiagen, Hilden, Germany). The methylation sites were selected based on our high sequencing results.

Statistical analysis

The expression of lncRNAs was presented as mean ± SEM of three replicate measurements in RT‐qPCR assay. Two‐tailed t‐test was used to determine statistical difference between paired tumor and adjacent nontumor tissue. The statistical difference of methylation level of lncRNA genes was assessed with t‐test between tumor group and adjacent non‐tumor group. A two‐sided P < .05 was considered as statistical significance. Data were analyzed using GraphPad Prism version 5.0 (GraphPad Software).

RESULTS

MethylRAD sequencing data re‐filtering and quality control

In the present study, we used a recently discovered DNA methylation profiling method called MethylRAD. MethylRAD uses one of the Mrr‐like enzymes FspEI to perform reduced methylome sequencing and collect 32‐bp methylated DNA fragments from the whole genome for high‐throughput sequencing. This new method can discriminate between CG and non‐CG methylations and analyze both CCGG and CCWGG methylation sites. After the raw data were re‐filtered according to the criteria mentioned in section of Materials and methods and the paired‐end sequencing reads were mapped, the clean reads, enzyme reads, and mapping reads were generated in each sample as shown in Table 1.
Table 1

Sequencing clean reads and re‐filtered reads in each sample

SampleClean ReadsEnzyme ReadsMapping ReadsRatio
107C194 689 14751 921 26739 585 62976.24%
107T194 689 14758 815 93544 228 64175.20%
111C194 689 14759 216 15644 285 08174.79%
111T194 689 14752 735 85236 441 99669.11%
120C148 088 44735 816 75427 341 98676.34%
120T148 088 44738 880 00629 361 13175.52%
124C148 088 44738 840 01429 184 64975.14%
124T148 088 44736 728 61227 133 92773.88%
153C118 096 06432 884 14125 144 38076.47%
153T118 096 06437 680 04928 305 94075.12%

Adjacent tissue group:107C, 111C, 120C, 124C and 153C; Colon cancer tissue group:107T, 111T, 120T, 124T and 153T; Ratio:Mapping reads/Enzyme reads.

Sequencing clean reads and re‐filtered reads in each sample Adjacent tissue group:107C, 111C, 120C, 124C and 153C; Colon cancer tissue group:107T, 111T, 120T, 124T and 153T; Ratio:Mapping reads/Enzyme reads.

Distribution of DNA methylation sites on different functional elements and in different gene regions

The labels containing CCGG or CCWGG sites were extracted from the whole genome sequence as reference sequences. The enzyme reads of each sample were mapped to the established reference sequences, and only the sites with sequencing depth higher than three were considered as reliable methylation sites. Figure 1A and Figure 1B illustrate that the distribution of methylation sites on different gene functional elements in each sample was analyzed by bedtools software. The results indicated that both CCGG and CCWGG methylation sites were mainly distributed on introns and intergenic elements.
Figure 1

Distribution of differential methylation sites. (A) CCGG methylation sites on different gene function elements, (B) CCWGG methylation sites on different gene function elements, (C) CCGG methylation sites on different chromosomes, (D) CCWGG methylation sites on different chromosomes, (E) CCGG methylation sites on different functional components, (F) CCWGG methylation sites on different functional components

Distribution of differential methylation sites. (A) CCGG methylation sites on different gene function elements, (B) CCWGG methylation sites on different gene function elements, (C) CCGG methylation sites on different chromosomes, (D) CCWGG methylation sites on different chromosomes, (E) CCGG methylation sites on different functional components, (F) CCWGG methylation sites on different functional components

Relative quantification of methylation levels of sites

We detected a total of 773 956 CCGG and 59 968 CCWGG methylation sites. As described in the section of Materials and methods, the methylation level of CCGG and CCWGG sites was reflected by the sequencing depth of methylation labels. The PRM was used to compare methylation level of each site between tumor and adjacent normal tissue groups. Finally, a total of 132 999 CCGG/8487 CCWGG sites were identified as DMSs. Usually, DMSs are unevenly distributed on chromosomes. Figure 1C and Figure 1D exhibit that the position information of DMSs on chromosomes was labeled. According to the position of methylation sites on the genome, the distribution of DMSs on different functional elements was plotted using pie chart as shown in Figure 1E and Figure 1F. The results showed that both CCGG and CCWGG DMSs were mainly located on the introns and intergenic elements.

Identification of lncRNA DMSs

According to the annotation data of lncRNAs retrieved from GENCODE, there were 42 175 CCGG/2794 CCWGG DMSs which belonged to lncRNA category. Finally, we obtained 44 877 lncRNA DMSs after combining CCGG and CCWGG files with the exclusion of 92 repeated DMSs. The cluster analysis was performed to further explore the differential methylation level of lncRNA DMSs. The top 10 000 lncRNA DMSs were selected to plot cluster heatmap. The heatmap plot showed a different methylation pattern between tumor tissues (T) and adjacent nontumor tissues (C). There were more DMSs with hypermethylation in tumor tissues. LncRNA DMSs could separate tumor and adjacent normal tissue groups very well as shown in Figure 2A. All the five paired of tumor tissues and adjacent nontumor tissues could be separated correctly.
Figure 2

Hierarchical cluster analysis and classification of DMGs. (A) Heat‐map of top 10 000 lncRNA differentially methylated sites between tumor tissue and adjacent tissue group. (B) Classification of DMGs and the rate distribution of 6 different biotypes genes

Hierarchical cluster analysis and classification of DMGs. (A) Heat‐map of top 10 000 lncRNA differentially methylated sites between tumor tissue and adjacent tissue group. (B) Classification of DMGs and the rate distribution of 6 different biotypes genes

Identification of lncRNA DMGs

The methylation level of each gene was represented by the sum of methylation levels of all the sites in this gene. According to the criteria described in the section of Materials and methods, we identified 1,359 CCGG and 1,052 CCWGG DMGs. These DMGs were further reclassified into the following biotypes: protein‐coding genes, lncRNA genes, miRNA genes, pseudogenes, misc_RNA genes, and others. Figure 2B reveals that there were 510 lncRNA genes (37.5%) and 421 pseudogenes (31%) in CCGG DMGs, and 466 lncRNA genes (44.3%) and 449 protein‐coding genes (42.7%) in CCWGG DMGs. The results demonstrated that lncRNA genes occurred frequently in both CCGG and CCWGG DMGs. Finally, we identified 963 lncRNA DMGs (Table S2) for further co‐analysis with TCGA RNA sequencing data. The RNA sequencing data of 480 CC patients and 41 controls were downloaded from TCGA database using R script. A total of 3693 differentially expressed genes (DEGs) were obtained by DEseq. According to Ensemble Genes 95 and Human genes GRCh38.p12 database, there were 1,328 DEGs in lncRNA category (Table S3). The lncRNA DMGs and dysregulated lncRNAs were co‐analyzed by Venny 2.1. There were 963 lncRNA DMGs, including 387 hypermethylated and 576 hypomethylated ones, and 1328 dysregulated lncRNAs, including 1,311 up‐regulated and 17 down‐regulated lncRNAs. Finally, we identified that ZNF667AS1 and MAFA‐AS1, which was down‐regulated, might be silenced by hypermethylation. Moreover, the other 13 lncRNAs were hypomethylated and up‐regulated in CC (Figure 3, Table 2). Our result demonstrated that these 15 lncRNAs might be CC‐related lncRNAs.
Figure 3

Venny plot of lncRNA differentially methylated gens (DMGs) in MethylRad sequencing data and differentially expressed lncRNAs in TCGA database

Table 2

The list of colon cancer‐related lncRNAs

Gene IDEnsembl_gene_IDGene NameMethylation levelexpression level
log2FC P valuelog2FC P value

100

128 252

ENSG00000166770ZNF667‐AS13.1810576333.12E‐09‐1.3072.01356
104 326 051ENSG00000254338MAFA‐AS11.034955058.008881073‐1.150617756.023423169
100 505 658ENSG00000246422AC008781.2‐2.053666981.00001282.254499463.001472013
728 655ENSG00000251164HULC‐1.376854024.0008635432.5129892155.59E‐05
105 370 855ENSG00000259756AC100839.1‐1.361977359.0095114362.784632131.000976912
101 926 928ENSG00000258551CRAT37‐1.349790211.0012184791.212392239.011940946
101 929 344ENSG00000231817LINC01198‐1.325550476.005428641.932731982.001108353
101 928 104ENSG00000267659LINC01482‐1.301990851.0012053461.384972804.007679533
101 927 659ENSG00000250582SMAD1‐AS2‐1.255865686.0089471492.900569484.006680646
101 927 761ENSG00000223442TH2LCRR‐1.201145929.0023811.086872425.000338205
104 472 714ENSG00000226758DISC1‐IT1‐1.170968744.0027261151.944068824.036600097
100 873 982ENSG00000223882ABCC5‐AS1‐1.15089644.0034460082.079893143.004943804
645 030ENSG00000237853NFIA‐AS1‐1.028709419.0021931823.888287973.0000251
101 927 539ENSG00000263499AC007431.1‐1.005971442.0119933993.879714114.00000419
101 927 948ENSG00000229494AC012494.1‐1.002646894.0000002062.146082342.003494868
Venny plot of lncRNA differentially methylated gens (DMGs) in MethylRad sequencing data and differentially expressed lncRNAs in TCGA database The list of colon cancer‐related lncRNAs 100 128 252 To explore the potential functions of these cancer‐related lncRNAs, we attempted to identify the protein‐coding genes that were co‐expressed with these lncRNAs using LncRN2Target v2.0 and StarBase. Among the input set of 15 cancer‐related lncRNAs, 28 and nine co‐expressed protein‐coding genes were identified to be associated with HULC and ZNF667AS1, respectively (Table 3).
Table 3

Co‐expressed protein‐coding genes of lncRNA ZNF667‐AS1 and HULC

LncRNACo‐expressed protein‐coding genesDatabase
ZNF667‐AS1CALCOCO2, FUNDC2, HMGCS1, CAPRIN1, PAFAH1B2, ZNF814, AC010326.2, EIF3C, PPP1CCStarbase
HULCANKRD11, CDCA4, CDK8, CEP83, CWF19L1, GGT1, IWS1, LTBR, MKNK2, MORC2, NDUFC2, NNT, NOP53, NUCKS1, RPRD1A, SDC4, SH3D19, SNAPC2, TBCB, TM9SF3, VPS4A, ZNF195, CDKN2C, UBE2I, E2F1, EZH2, PTGS2, HMGA2

LncRNA2tARGET v2.0

Starbase

Co‐expressed protein‐coding genes of lncRNA ZNF667AS1 and HULC LncRNA2tARGET v2.0 Starbase

Prediction of lncRNA‐miRNA interaction

According to DIANA‐LncBase experimental v.2 results, we found that six miRNAs and seven miRNAs could bind to lncRNA ZNF667AS1 and HULC respectively. These lncRN‐miRNA interactions were verified by immunoprecipation, reporter gene assay or qPCR analysis (Table 4).
Table 4

The miRNAs that were experimentally proved to bind lncRNA ZNF667‐AS1 and HULC

LncRNAsmiRNAsPrScoreMethods
ZNF667‐AS1hsa‐miR‐574‐5p 34 0.404IP
hsa‐miR‐143‐3p 34 0.658
hsa‐miR‐330‐3p 35
hsa‐miR‐33a‐3p 34
hsa‐miR‐484 34
hsa‐miR‐1‐3p 34 0.586
HULChsa‐miR‐1236‐3p 36 0.468RP
hsa‐miR‐134‐5p 36 RP
hsa‐miR‐372‐3p 36 RP,qP
hsa‐miR‐433‐3p 36 0.425RP
hsa‐miR‐557 36 RP
hsa‐miR‐613 36 RP,qP
hsa‐miR‐622 36 RP

The data of IP, RP, qPCR was obtained from previous studies as cited references.

Abbreviations: IP: positive by immunoprecipitation; RP: positive by reporter gene assay; qP: positive by qPCR.

The miRNAs that were experimentally proved to bind lncRNA ZNF667AS1 and HULC The data of IP, RP, qPCR was obtained from previous studies as cited references. Abbreviations: IP: positive by immunoprecipitation; RP: positive by reporter gene assay; qP: positive by qPCR.

Validation of the expression and methylation level of CC‐related lncRNAs

In order to validate our MethylRad sequencing results and TCGA data, the expression and methylation levels of 15 CC‐related lncRNAs were detected by RT‐qPCR and pyrosequencing. Our results indicated that lncRNA ZNF667AS1 and MAFA‐AS1 were down‐regulated and hypermethylated in CC tumor tissues (Figure 4A and B). Meanwhile, the other 13 lncRNAs (AC008781.2, HULC, AC100839.1, CRAT37, LINC01198, LINC01482, SMAD1‐AS2, TH2LCRR, DISC1IT1, ABCC5AS1, NFIA‐AS1, AC007431.1, AC012494.1) were up‐regulated and hypomethylated in CC tumor group which confirmed our previous MethylRad sequencing results (Figure 4A and B).
Figure 4

The expressions of CC‐related lncRNAs by RT‐qPCR (A) and aberrant methylation levels of the genes by pyrosequencing (B) in 20 cases of CC tumor and non‐tumor tissues. RT‐qPCR data are presented as mean ± SEM of three replicate measurements. Statistical significance of the differences is assessed by two‐tailed t‐test between tumor group and adjacent nontumor group

The expressions of CC‐related lncRNAs by RT‐qPCR (A) and aberrant methylation levels of the genes by pyrosequencing (B) in 20 cases of CC tumor and non‐tumor tissues. RT‐qPCR data are presented as mean ± SEM of three replicate measurements. Statistical significance of the differences is assessed by two‐tailed t‐test between tumor group and adjacent nontumor group

DISCUSSION

Aberrant DNA methylation has been shown to play an important role in the pathogenesis of CC. Previous studies have identified that methylation changes are associated with a limited set of genes in CC. , , , However, the profiles of DNA methylation, especially the menthylation of lncRNA genes in CC, still remain largely unexplored. With rapid advances in sequencing technology, sequencing‐based methods have been promising in identifying DNA methylation sites and genes on a genome‐wide scale. In the present study, we used a simple and scalable DNA methylation profiling method called MethylRad, which can collect 32‐bp methylated DNA fragments from the whole genome for further high‐throughput sequencing. MethylRAD can discriminate between CG and non‐CG methylations, and analyze both CCGG and CCWGG methylation sites, whereas most of other methods can either only recognize CG methylation or recognize both CG and non‐CG methylations but cannot distinguish them. , We detected a total of 773 956 CCGG and 59 968 CCWGG sites by generating a genome‐wide DNA methylation profile using five pairs of CC tumor tissues and adjacent normal tissues. We also analyzed the distribution of DNA methylation sites on different functional elements and in different gene regions. Our results indicated that CCGG and CCWGG methylated sites were mainly distributed on introns and intergenic elements. Moreover, a total of 132 999 CCGG and 8487 CCWGG sites were identified as DMSs, which was consistent with the previous studies that there is an increasing number of hypermethylated CGIs in CC. According to the annotation data of lncRNAs retrieved from GENCODE, there were 44 877 lncRNA DMSs after combining CCGG and CCWGG files. The top 10 000 lncRNA DMSs were selected to plot cluster heatmap of tumor and adjacent normal tissue groups. The results demonstrated that the lncRNA DMSs could separate tumor and adjacent normal tissue groups and might provide potential tumor markers of CC. The methylation levels of these lncRNA DMSs need to be verified by pyrosequencing in the future research. In order to explore the candidate differentially methylated lncRNA genes that might contribute to tumorigenesis and development of CC, we identified 1359 CCGG and 1052 CCWGG DMGs. These DMGs mainly belonged to lncRNA genes and pseudogenes in CCGG DMGs, while they were mainly lncRNA genes and protein‐coding genes in CCWGG DMGs. Our results were consistent with previous studies that DNA methylation not only regulates the expressions of protein‐coding genes but also affect ncRNAs. Li et al have also reported that aberrant methylation changes occur more frequently in the promoters of ncRNA genes compared with protein‐coding genes in breast cancer. To better understand the potential effect of methylation on lncRNA expression, the RNA sequencing data of CC patients were downloaded from TCGA database and analyzed. We identified 1328 DEGs in lncRNA category. The lncRNA DMGs and dysregulated lncRNAs were co‐analyzed to identify cancer‐related lncRNAs. We identified two lncRNAs, which were down‐regulated and might be silenced by hypermethylation. In addition, the other 13 lncRNAs were hypomethylated and up‐regulated. The expression and methylation results of these 15 CC‐related lncRNAs were also validated by RT‐qPCR and pyrosequencing assay. Our results might provide new insights into tumorigenesis and development of CC. Among the 15 cancer‐related lncRNAs, ZNF667AS1, HULC, LINC01198, and NFIA‐AS1 have been reported in previous studies. We also identified protein‐coding genes and microRNAs that were interacted with ZNF667AS1 and HULC. As far as we know, there are no studies about the interaction role of the other 13 lncRNAs. It has been demonstrated that lncRNA HULC is markedly up‐regulated in CC patients and CC cell lines. HULC accelerates the growth of CC cells by targeting miR‐613. These findings suggest that HULC inhibits the tumor growth through miR‐613‐dependent RTKN modulation. HULC may be an oncogenic lncRNA, which promotes the progression of CC and can be considered as an effective therapeutic target in human CC. Our study also found that HULC was hypomenthylated and up‐regulated in CC, which was consistent with previous studies. Vrba et al have reported that ZNF667AS1 expression is silenced by aberrant DNA methylation in several cancers, providing strong evidence for a suppressor role of ZNF667AS1. Several studies have also demonstrated that ZNF667AS1 is down‐regulated and may serve as a tumor suppressor gene in breast and cervical cancer. , In our present study, we validated that ZNF667AS1 was hypermethylated and down‐regulated in CC patients. However, the role of ZNF667AS1 in CC still needs to be clarified in further investigation. Several studies have demonstrated that LINC01198 was up‐regulated in glioblastmoma and associated with clinical outcomes. , , LINC01198 could activate PI3/AKT signaling to facilitate gliomagenesis through regulating PIK3CA and PTEN. Sun et al have found that LINC01198 was one of the high‐risk factors for bladder cancer tumors. Shao et al have indicated that NFIA‐AS1 was down‐regulated and might be a promising biomarker for ER positive breast cancer. It still needs further study about the other CC‐related lncRNAs. Collectively, we performed a genome‐wide methylation analysis by MethylRad and transcriptome profiles from TCGA database in CC. We identified DMSs on lncRNA genes, differentially methylated lncRNA genes and also dysregulated lncRNAs. Finally, we found that ZNF667AS1 and MAFA‐AS1 were hypermethylated and down‐regulated, while the other 13 lncRNAs were hypomethylated and up‐regulated in CC. In contrast to the extensive evidence of protein‐coding genes, only few lncRNAs have been clearly elucidated till now. In the present study, we proposed 15 lncRNAs as potential CC‐related lncRNAs, and they might participate in the tumorigenesis and progression of cancer. Taken together, our study provided useful diagnostic markers and new insights into molecular mechanisms of CC.

CONFLICTS OF INTEREST

The authors declared that they have no conflicts of interest.

AUTHOR’S CONTRIBUTION

GZ carried out most of experiments and data analysis, interpreted the entire results and drafted the manuscript. YZ and ED carried out part of statistic analysis. PS confirmed the diagnosis of colon cancer. HW, YY and AQ carried out TCGA data analysis. All authors read and approved the final manuscript. YZ conceived and designed the study. Table S1 Click here for additional data file. Table S2 Click here for additional data file. Table S3 Click here for additional data file.
  37 in total

1.  The transcriptome profiles and methylation status revealed the potential cancer-related lncRNAs in patients with cervical cancer.

Authors:  Li-Yun Yuan; Xiaomin Qin; Lin Li; Jinting Zhou; Min Zhou; Xianxian Li; Ying Xu; Xiao-Jun Wang; Hui Xing
Journal:  J Cell Physiol       Date:  2018-10-26       Impact factor: 6.384

2.  CREB up-regulates long non-coding RNA, HULC expression through interaction with microRNA-372 in liver cancer.

Authors:  Jiayi Wang; Xiangfan Liu; Huacheng Wu; Peihua Ni; Zhidong Gu; Yongxia Qiao; Ning Chen; Fenyong Sun; Qishi Fan
Journal:  Nucleic Acids Res       Date:  2010-04-27       Impact factor: 16.971

Review 3.  Promoter hypermethylation of tumour suppressor genes as potential biomarkers in colorectal cancer.

Authors:  Jennifer Mun-Kar Ng; Jun Yu
Journal:  Int J Mol Sci       Date:  2015-01-22       Impact factor: 5.923

4.  Epigenetic Silencing of MORT Is an Early Event in Cancer and Is Associated with Luminal, Receptor Positive Breast Tumor Subtypes.

Authors:  Lukas Vrba; Bernard Walter Futscher
Journal:  J Breast Cancer       Date:  2017-06-26       Impact factor: 3.588

5.  LncRNA profile study reveals four-lncRNA signature associated with the prognosis of patients with anaplastic gliomas.

Authors:  Wen Wang; Fan Yang; Lu Zhang; Jing Chen; Zheng Zhao; Haoyuan Wang; Fan Wu; Tingyu Liang; Xiaoyan Yan; Jiye Li; Qing Lan; Jiangfei Wang; Jizong Zhao
Journal:  Oncotarget       Date:  2016-11-22

6.  Genetic and epigenetic events generate multiple pathways in colorectal cancer progression.

Authors:  Massimo Pancione; Andrea Remo; Vittorio Colantuoni
Journal:  Patholog Res Int       Date:  2012-07-24

7.  starBase v2.0: decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data.

Authors:  Jun-Hao Li; Shun Liu; Hui Zhou; Liang-Hu Qu; Jian-Hua Yang
Journal:  Nucleic Acids Res       Date:  2013-12-01       Impact factor: 16.971

8.  MethylRAD: a simple and scalable method for genome-wide DNA methylation profiling using methylation-dependent restriction enzymes.

Authors:  Shi Wang; Jia Lv; Lingling Zhang; Jinzhuang Dou; Yan Sun; Xue Li; Xiaoteng Fu; Huaiqian Dou; Junxia Mao; Xiaoli Hu; Zhenmin Bao
Journal:  Open Biol       Date:  2015-11       Impact factor: 6.411

Review 9.  Are Colon and Rectal Cancer Two Different Tumor Entities? A Proposal to Abandon the Term Colorectal Cancer.

Authors:  Stephan Paschke; Sakhavat Jafarov; Ludger Staib; Ernst-Dietrich Kreuser; Catharina Maulbecker-Armstrong; Marc Roitman; Torbjörn Holm; Curtis C Harris; Karl-Heinrich Link; Marko Kornmann
Journal:  Int J Mol Sci       Date:  2018-08-30       Impact factor: 5.923

10.  LncRNA2Target v2.0: a comprehensive database for target genes of lncRNAs in human and mouse.

Authors:  Liang Cheng; Pingping Wang; Rui Tian; Song Wang; Qinghua Guo; Meng Luo; Wenyang Zhou; Guiyou Liu; Huijie Jiang; Qinghua Jiang
Journal:  Nucleic Acids Res       Date:  2019-01-08       Impact factor: 16.971

View more
  2 in total

1.  LncRNA RP11-295G20.2 regulates hepatocellular carcinoma cell growth and autophagy by targeting PTEN to lysosomal degradation.

Authors:  Linhui Liang; Lin Huan; Jiajia Wang; Yangjun Wu; Shenglin Huang; Xianghuo He
Journal:  Cell Discov       Date:  2021-12-14       Impact factor: 10.849

2.  Genome-wide DNA methylation analysis by MethylRad and the transcriptome profiles reveal the potential cancer-related lncRNAs in colon cancer.

Authors:  Guixi Zheng; Yuzhi Zhang; Hongchun Wang; E Ding; Ailin Qu; Peng Su; Yongmei Yang; Mingjin Zou; Yi Zhang
Journal:  Cancer Med       Date:  2020-09-01       Impact factor: 4.452

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.