| Literature DB >> 24490137 |
Carol Ying-Ying Szeto1, Chi Ho Lin2, Siu Chung Choi2, Timothy T C Yip3, Roger Kai-Cheong Ngan3, George Sai-Wah Tsao4, Maria Li Lung1.
Abstract
Nasopharyngeal carcinoma (NPC) is a prevalent malignancy in Southeast Asia among the Chinese population. Aberrant regulation of transcripts has been implicated in many types of cancers including NPC. Herein, we characterized mRNA and miRNA transcriptomes by RNA sequencing (RNASeq) of NPC model systems. Matched total mRNA and small RNA of undifferentiated Epstein-Barr virus (EBV)-positive NPC xenograft X666 and its derived cell line C666, well-differentiated NPC cell line HK1, and the immortalized nasopharyngeal epithelial cell line NP460 were sequenced by Solexa technology. We found 2812 genes and 149 miRNAs (human and EBV) to be differentially expressed in NP460, HK1, C666 and X666 with RNASeq; 533 miRNA-mRNA target pairs were inversely regulated in the three NPC cell lines compared to NP460. Integrated mRNA/miRNA expression profiling and pathway analysis show extracellular matrix organization, Beta-1 integrin cell surface interactions, and the PI3K/AKT, EGFR, ErbB, and Wnt pathways were potentially deregulated in NPC. Real-time quantitative PCR was performed on selected mRNA/miRNAs in order to validate their expression. Transcript sequence variants such as short insertions and deletions (INDEL), single nucleotide variant (SNV), and isomiRs were characterized in the NPC model systems. A novel TP53 transcript variant was identified in NP460, HK1, and C666. Detection of three previously reported novel EBV-encoded BART miRNAs and their isomiRs were also observed. Meta-analysis of a model system to a clinical system aids the choice of different cell lines in NPC studies. This comprehensive characterization of mRNA and miRNA transcriptomes in NPC cell lines and the xenograft provides insights on miRNA regulation of mRNA and valuable resources on transcript variation and regulation in NPC, which are potentially useful for mechanistic and preclinical studies.Entities:
Keywords: AIP, aryl hydrocarbon receptor interacting protein; BAX, BCL2-asscoiated X protein; CIITA, class II, major histocompatibility complex, transactivator; DKK1, Dickkopf-Like protein 1; EBV, Epstein–Barr virus; ECM, extracellular matrix; EGFR, epidermal growth factor receptor; EGR1, early growth response 1; FBLN2, fibulin 2; GADD45, growth arrest and DNA-damage-inducible; GNG11, guanine nucleotide binding protein (G protein), Gamma 11; GO, gene ontology; GSTP1, glutathione S-transferase pi 1; IL18, interleukin 18; INDEL, insertion and deletion; LMP1, Epstein–Barr virus latent membrane protein 1; LTBP2, latent transforming growth factor beta binding protein 2; MDM2, MDM2 oncogene, E3 ubiquitin protein ligase; MET, met proto-oncogene; MMP19, matrix metallopeptidase 19; NGS, next-generation sequencing; NPC, nasopharyngeal carcinoma; Nasopharyngeal carcinoma; Nasopharyngeal cell lines/xenograft (NP460, HK1, C666, X666); PI3K, phosphoinositide 3-kinase; PTEN, phosphatase and tensin homolog; RNA sequencing; RNASeq, RNA sequencing; SNP, single nucleotide polymorphism; TNFRSF9, tumour necrosis factor receptor superfamily, member 9; TP53; Transcriptome analysis; UTR, untranslated region; miRNA, microRNA
Year: 2014 PMID: 24490137 PMCID: PMC3907684 DOI: 10.1016/j.fob.2014.01.004
Source DB: PubMed Journal: FEBS Open Bio ISSN: 2211-5463 Impact factor: 2.693
Fig. 1An overview of the samples used in this study and data analysis workflow. (A) Cell lines sequenced in this study. (B) Bioinformatics analysis workflow.
Fig. 2Differentially expressed genes/miRNAs in HK1 and C666 compared to NP460 and relevant biological pathways/functions in transcriptome data. (A) Venn diagram showing the number of protein coding gene/microRNA differentially expressed in HK1 and C666 compared to NP460. (B) List of Biological pathways and Gene Oncology Biological Functions from Reactome FI analysis. Terms begin with circle: enriched in mRNA only; terms begin with diamond: enriched in mRNA and miRNA; terms begin with rectangle: enriched in miRNA only; (P): term from Reactome pathway enrichment; (G): term from Gene Oncology Biological Process.
Fig. 3microRNA regulated biological network enriched in EGFR, ErbB and Wnt signaling pathways. (A) Network diagram showing Reactome biological network. The intensity of the node color indicates the degree of up-(red) or down (green)-regulation with respect to NP460. Round node: protein coding gene; diamond node: human encoded miRNA; triangle node: EBV encoded miRNA; orange circle: QPCR-validated genes from three subsequent passages. (B) NGS expression and QPCR expression on QPCR-validated genes. (i) EGFR: epidermal growth factor receptor; (ii) EGR1: early growth response protein 1; (iii) GNG11: guanine nucleotide binding protein (G Protein), Gamma 11; (iv) DKK1: Dickkopf-Like protein 1; (v) MET: Met Proto-Oncogene; (vi) hsa-miR-141-5p; (vii) has-miR-200c-3p; (viii) EBV-mir-BART2-5p; (ix) EBV-mir-BART14-3p; (x) EBV-mir-BART-17-5p. Error bar on QPCR plot: SD from three subsequent passages. *p < 0.05; **p < 0.01. p Value was estimated by DeSeq package in transcriptome data and by one-way ANOVA followed by Tukey post hoc analysis in QPCR, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 4Biological network enriched in cytokine and interferon signaling. (A) Network diagram showing Reactome biological network relevant to the genes differentially expressed and regulated by miRNA in C666/X666. The intensity of the node color indicates the degree of up-(red) or down (green)-regulation with respect to NP460. Round node: protein coding gene; diamond node: human encoded miRNA; triangle node: EBV encoded miRNA; orange circle: QPCR-validated genes from three subsequent passages. (B) NGS expression and QPCR expression on QPCR-validated genes. (i) CIITA: MHC class II transactivator type III; ii) IL18: interleukin 18; iii) TNFRSF9: tumor necrosis factor receptor superfamily, member 9. Error bar on QPCR plot: SD from three subsequent passages. *p < 0.05; **p < 0.01. p Value was estimated by DeSeq package in transcriptome data and by one-way ANOVA followed by Tukey post hoc analysis in QPCR, respectively. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article.)
Fig. 5Sequence variants identified from transcriptome data. (A) An overview of sequence variants identified from transcriptome NGS. (B) Single nucleotide variants in TP53 and GSTP1 detected in NPC and NP cell lines. Height of sequence logos represents the frequency of nucleotides detected at each position.
Top three isomirs of EBV-BHRF miRNA from mature miRbase annotated sequence.
| EBV-miRNA | Match type | Read sequence | C666 read | X666 read | C666 rank | X666 rank |
|---|---|---|---|---|---|---|
| mir-BHRF1-1-5p | **Mature 5′ | TAACCTGATCAGCCCCGGAGTT | 3 | 8 | 1st | 1st |
| Mature 5′ sub | AACCTGATCAGCCCCGGAGTT | 2 | N/A | 2nd | N/A | |
| mir-BHRF1-2-3p | Mature 3′ sub | TATCTTTTGCGGCAGAAATTG | N/A | 2 | N/A | 1st |
| **Mature 3′ | TATCTTTTGCGGCAGAAATTGA | N/A | 1 | N/A | 2nd | |
| mir-BHRF1-2-5p | Mature 5′ sub | AAATTCTGTTGCAGCAGATAG | N/A | 1 | N/A | 1st |
Mature 3′/5′: **Mature miRbase annotated sequence; Mature 3′/5′ sub: observed tag is shorter than reference sequence; Mature 3′/5′ super: observed tag is longer than the annotated mature sequence; Mature 3′/5′ sub/super variant: observed tag with mismatches to annotated sequence.
Number of reads detected in NGS. N/A: no read detected in this sample. Total read counts of EBV miRNA of C666: 1761771; X666: 469388.
Rank in N/A: ranking of the read in that sample is out of top3. d No mature sequence nor isomirs of mir-BART15-5p found in C666-1 and X666 NGS read.
Top three isomirs of novel EBV-BART miRNA from Chen et al. [26].
| EBV-miRNA | Match type | Read sequence | C666 read | X666 read | C666 rank | X666 rank |
|---|---|---|---|---|---|---|
| mir-BART16-3p | ##Mature 3′ | ATCACCACCCTCTATCCATAT | 41 | 5 | 1st | N/A |
| Mature 3′ sub/super | AGATCACCACCCTCTATCCATA | 40 | 11 | 2nd | 1st | |
| Mature 3′ sub/super | AGATCACCACCCTCTATCCAT | 39 | 7 | 3rd | 3rd | |
| Mature 3′ super | AGATCACCACCCTCTATCCATAT | 9 | 8 | N/A | 2nd | |
| Mature 3′ super | GATCACCACCCTCTATCCATAT | 11 | 8 | N/A | 2nd | |
| mir-BART22-5p | ##Mature 5′ | TGCTAGACCCTGGAGTTGAACC | 121 | 23 | 1st | 1st |
| Mature 5′ sub | TGCTAGACCCTGGAGTTGAAC | 55 | 13 | 2nd | 2nd | |
| Mature 5′ sub | TGCTAGACCCTGGAGTTG | 3 | N/A | 3rd | N/A | |
| Mature 5′ super variant | TGCTAGACCCTGGAGTTGAACCt | 3 | 1 | 3rd | N/A | |
| Mature 5′ sub | TGCTAGACCCTGGAGTT | N/A | 3 | N/A | 3rd | |
| Mature 5′ variant | TGCTAGACCCTGGAGTTGAACa | 1 | 3 | N/A | 3rd | |
| mir-BART12-5p | Mature 5′ super | ACCCGCCCATCACCACCGGACA | 50 | 2 | 1st | 1st |
| Mature 5′ super variant | gACCCGCCCATCACCACCGGA | 10 | N/A | 2nd | N/A | |
| ##Mature 5′ | ACCCGCCCATCACCACCGGAC | 7 | 1 | 3rd | 2nd | |
| Mature 5′ sub | ACCCGCCCATCACCACCGGA | 7 | N/A | 3rd | N/A | |
| Mature 5′ super variant | ACCCGCCCATCACCACCGGACt | 5 | 1 | N/A | 2nd |
Mature 3′/5′: ## Mature sequence from Chen et al. [26]; Mature 3′/5′ sub: observed tag is shorter than reference sequence; Mature 3′/5′ super: observed tag is longer than the annotated mature sequence; Mature 3′/5′ sub/super variant: observed tag with mismatches to annonated sequence.
Number of reads detected in NGS. N/A: no read detected in this sample. Total read counts of EBV miRNA of C666: 1761771; X666: 469388.
Rank in N/A: ranking of the read in that sample is out of top3. dNo mature sequence nor isomirs of mir-BART15–5p found in C666–1 and X666 NGS read.
Fig. 6Meta-analysis of NGS data with clinical data from public database. (A) Meta-analysis on GSE12452, GSE13597 and NGS data; (B)meta-analysis on GSE34573 and NGS data; Dotted line: threshold for cluster analysis; circled number: cluster number according to threshold. Both dendrograms in (A) and (B) were generated by MetaDE using complete linkage method. (C) Common SNPs detected in NGS and GPL3718/ GPL3720 SNP array. ND: SNP not detected.
Common SNPs detected in NGS and GPL3718/ GPL3720 SNP array.
| Transcriptome NGS result | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| VarFreq: ratio B/A | C666 | NPC biopsies (15) | Control blood (5) | |||||||||
| Gene | Function | dbSNP135 | NP460 VarFreq | HK1 VarFreq | C666 VarFreq | AA | AB | BB | AA | AB | BB | |
| CCDC99 | Exonic | rs3797713 | 1 | 0.6034 | 1 | AA | 7 | 6 | 0 | 2 | 3 | 0 |
| CEP55 | Exonic | rs2293277 | 0.4123 | 1 | 0.4487 | AB | 7 | 7 | 1 | 2 | 3 | 0 |
| PIGC | Exonic | rs1063412 | 1 | 0.5946 | 1 | AB | 0 | 13 | 1 | 0 | 2 | 1 |
| GNL3 | Exonic | rs2289247 | 0.4375 | 1 | 1 | AB | 11 | 4 | 0 | 2 | 2 | 0 |
| PPA2 | Exonic | rs13787 | 0.4058 | 1 | 0.3667 | AB | 6 | 4 | 1 | 0 | 1 | 3 |
| PDLIM5 | UTR3 | rs13141591 | 1 | 1 | ND | BB | 0 | 0 | 17 | 0 | 0 | 5 |
| SQSTM1 | UTR3 | rs1065154 | ND | ND | 0.6486 | AB | 3 | 7 | 3 | 3 | 0 | 1 |
| DUSP7 | UTR3 | rs3821841 | 0.4731 | 1 | ND | AB | 4 | 7 | 0 | 5 | 0 | 0 |
| DLX4 | UTR3 | rs1058564 | ND | ND | 0.6538 | AB | 10 | 4 | 0 | 3 | 2 | 0 |
| RAB31 | UTR3 | rs557706 | 1 | 0.4528 | ND | ND | 2 | 8 | 1 | 1 | 0 | 4 |
| CERS2 | UTR3 | rs267739 | 1 | 1 | 1 | AB | 13 | 1 | 0 | 5 | 0 | 0 |
| CANT1 | UTR5 | rs3744171 | ND | ND | 0.5 | AB | 6 | 5 | 1 | 4 | 1 | 0 |
ND: SNP not detected.