| Literature DB >> 34238982 |
Huiling He1,2, Sandya Liyanarachchi1,2, Wei Li1,2, Daniel F Comiskey1,2, Pearlly Yan3,2, Ralf Bundschuh3,4,5, Altan M Turkoglu4, Pamela Brock3,2, Matthew D Ringel6,7, Albert de la Chapelle1,2.
Abstract
Papillary thyroid carcinoma (PTC) is the most common type of thyroid cancer. The molecular characteristics of histologically normal appearing tissue adjacent to the tumor (NAT) from PTC patients are not well characterized. The aim of this study was to characterize the global gene expression profile of NAT and compare it with those of normal and tumor thyroid tissues. We performed total RNA sequencing with fresh frozen thyroid tissues from a cohort of three categories of samples including NAT, normal thyroid (N), and PTC tumor (T). Transcriptome analysis shows that NAT presents a unique gene expression profile, which was not associated with sex or the presence of lymphocytic thyroiditis. Among the differentially expressed genes (DEGs) of NAT vs N, 256 coding genes and 5 noncoding genes have been reported as cancer genes involved in cell proliferation, apoptosis, and/or tumorigenesis. Bioinformatics analysis with Ingenuity Pathway Analysis software revealed that "Cancer, Organismal Injury and Abnormalities, Cellular Response to Therapeutics, and Cellular Movement" were major dysregulated pathways in the NAT tissues. This study provides improved insight into the complexity of gene expression changes in the thyroid glands of patients with PTC.Entities:
Mesh:
Substances:
Year: 2021 PMID: 34238982 PMCID: PMC8266864 DOI: 10.1038/s41598-021-93526-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Gene expression patterns and correlation with clinical information. (A) Sample clusters. Euclidean distance and average linkage clustering method was used. Gender and the presence of lymphocytic thyroiditis (LT) in each sample are marked with color bars. (B) Plot of principal component analysis. (C) t-SNE plot. (D) Principal component analysis with confidence ellipses according to tissue type. A total of 22,411 genes after filtering were included in the analysis. All the plots were created with log transformed normalized gene expression estimates.
Top 40 differentially expressed coding genes between NAT and Na.
| Gene IDb | Gene name | P valuec | Fold change (NAT/N) | DEGs of NAT/LT− vs Nd |
|---|---|---|---|---|
| ENSG00000213977.7 | TAX1BP3 | 2.43E−12 | 3.67 | Yese |
| ENSG00000147586.9 | MRPS28 | 4.97E−11 | 2.39 | Yes |
| ENSG00000257727.5 | CNPY2 | 1.34E−08 | 2.21 | Yes |
| ENSG00000148180.18 | GSN | 2.1E−08 | 3.82 | Yes |
| ENSG00000174903.15 | RAB1B | 4.09E−08 | −1.59 | Yes |
| ENSG00000203791.14 | METTL10 | 4.15E−08 | 1.93 | Yes |
| ENSG00000170296.9 | GABARAP | 8.02E−08 | 4.17 | Yes |
| ENSG00000143368.9 | SF3B4 | 7.59E−07 | −1.56 | Yes |
| ENSG00000136371.10 | MTHFS | 2.29E−06 | 2.42 | Yes |
| ENSG00000171295.12 | ZNF440 | 2.73E−06 | 1.53 | Noe |
| ENSG00000088038.17 | CNOT3 | 4.31E−06 | −2.16 | Yes |
| ENSG00000132471.11 | WBP2 | 6.74E−06 | −1.62 | Yes |
| ENSG00000277203.1 | F8A1 | 8.99E−06 | −1.75 | Yes |
| ENSG00000183889.12 | AC138969.4 | 1.44E−05 | −1.84 | Yes |
| ENSG00000198171.12 | DDRGK1 | 1.87E−05 | −1.54 | Yes |
| ENSG00000128739.21 | SNRPN | 2.59E−05 | 2.29 | Yes |
| ENSG00000104969.9 | SGTA | 2.76E−05 | −1.51 | Yes |
| ENSG00000263290.5 | SCAMP3 | 3.07E−05 | −1.58 | Yes |
| ENSG00000164039.14 | BDH2 | 3.16E−05 | 1.71 | Yes |
| ENSG00000147955.16 | SIGMAR1 | 3.26E−05 | −1.56 | Yes |
| ENSG00000175274.18 | TP53I11 | 5.42E−05 | −1.74 | Yes |
| ENSG00000270011.6 | ZNF559-ZNF177 | 5.71E−05 | 2.80 | Yes |
| ENSG00000205544.3 | TMEM256 | 6.32E−05 | 3.71 | Yes |
| ENSG00000111775.2 | COX6A1 | 6.36E−05 | 2.28 | Yes |
| ENSG00000100350.14 | FOXRED2 | 6.66E−05 | −1.87 | Yes |
| ENSG00000279576.1 | AP000769.1 | 8.82E−05 | 16.16 | Yes |
| ENSG00000181264.8 | TMEM136 | 9.61E−05 | 1.54 | Yes |
| ENSG00000115239.21 | ASB3 | 9.75E−05 | 1.69 | Yes |
| ENSG00000100348.9 | TXN2 | 0.000123 | −1.57 | Yes |
| ENSG00000167182.13 | SP2 | 0.000132 | −1.82 | Yes |
| ENSG00000277462.1 | ZNF670 | 0.000133 | 2.05 | Yes |
| ENSG00000167644.11 | C19orf33 | 0.000162 | 19.70 | Yes |
| ENSG00000134590.13 | FAM127A | 0.000174 | −1.62 | Yes |
| ENSG00000239697.10 | TNFSF12 | 0.000177 | −1.83 | Yes |
| ENSG00000164898.12 | C7orf55 | 0.000191 | 3.25 | Yes |
| ENSG00000124614.13 | RPS10 | 0.000237 | 13.41 | Yes |
| ENSG00000188257.10 | PLA2G2A | 0.000277 | −14.64 | Yes |
| ENSG00000196757.7 | ZNF700 | 0.000284 | 1.59 | Yes |
| ENSG00000116649.9 | SRM | 0.000299 | −1.83 | Yes |
| ENSG00000189171.14 | S100A13 | 0.000328 | 1.93 | Yes |
aNAT, normal appearing tumor adjacent tissue; N, normal thyroid control.
bThe human genome GRC38 was used for gene mapping and annotation.
cBenjamini and Hochberg multiple testing corrected p-value. Model was adjusted for age, gender and LT status.
dNAT/LT−, normal appearing thyroid tissue without co-existance of lymphocytic thyroiditis.
eYes, the gene is overlapping with the DEGs of NAT/LT− vs N; No, no overlapping.
Top 40 differentially expressed non-coding genes between NAT and N.
| Gene ID | Gene name | Gene biotype | P value | Fold change (NAT/N) | DEGs of NAT/LT− vs N |
|---|---|---|---|---|---|
| ENSG00000242299.1 | RP11-234A1.1 | processed_pseudogene | 3.671E−24 | 8.28 | Yes |
| ENSG00000254911.3 | SCARNA9 | antisense | 1.74E−18 | 7.48 | Yes |
| ENSG00000233328.3 | PFN1P1 | processed_pseudogene | 1.19E−15 | −7.49 | Yes |
| ENSG00000136149.6 | RPL13AP25 | processed_pseudogene | 2.877E−14 | 7.24 | Yes |
| ENSG00000272779.1 | LL22NC03-80A10.6 | transcribed_unprocessed_pseudogene | 1.681E−13 | 4.04 | Yes |
| ENSG00000226525.5 | RPS7P10 | processed_pseudogene | 4.401E−13 | 6.42 | Yes |
| ENSG00000278771.1 | Metazoa_SRP | misc_RNA | 4.444E−10 | 3.68 | Yes |
| ENSG00000253954.3 | HMGN1P38 | processed_pseudogene | 7.143E−10 | −3.10 | Yes |
| ENSG00000259918.1 | NDUFA5P11 | processed_pseudogene | 4.037E−09 | −3.79 | Yes |
| ENSG00000243199.1 | RP11-408P14.1 | processed_pseudogene | 4.037E−09 | 5.23 | Yes |
| ENSG00000233913.7 | CTC-575D19.1 | processed_pseudogene | 1.224E−08 | 6.18 | Yes |
| ENSG00000251733.1 | SCARNA8 | scaRNA | 1.336E−08 | −6.45 | Yes |
| ENSG00000178464.6 | CTD-2192J16.15 | processed_pseudogene | 1.507E−08 | 4.98 | Yes |
| ENSG00000256745.1 | RP11-680H20.1 | processed_pseudogene | 2.153E−08 | 3.88 | Yes |
| ENSG00000235776.2 | AC000089.3 | processed_pseudogene | 2.215E−08 | 20.89 | Yes |
| ENSG00000259706.1 | HSP90B2P | processed_pseudogene | 2.254E−08 | 2.53 | Yes |
| ENSG00000230629.2 | RPS23P8 | processed_pseudogene | 3.119E−08 | 3.54 | Yes |
| ENSG00000224631.4 | RP11-51O6.1 | transcribed_processed_pseudogene | 4.399E−08 | 7.42 | Yes |
| ENSG00000231767.3 | RP11-92K2.2 | processed_pseudogene | 5.442E−08 | 7.41 | Yes |
| ENSG00000272101.2 | AC243587.1 | processed_pseudogene | 1.158E−07 | 3.74 | Yes |
| ENSG00000236534.1 | H3F3BP1 | processed_pseudogene | 2.069E−07 | 4.02 | Yes |
| ENSG00000236698.1 | EIF1AXP1 | processed_pseudogene | 2.363E−07 | 2.61 | Yes |
| ENSG00000274574.1 | AC006359.1 | snRNA | 2.363E−07 | −3.56 | Yes |
| ENSG00000212607.1 | SNORA3B | snoRNA | 2.957E−07 | −4.84 | Yes |
| ENSG00000282670.1 | AC254944.3 | lncRNA | 4.208E−07 | 4.36 | Yes |
| ENSG00000234797.5 | RPS3AP6 | processed_pseudogene | 4.208E−07 | 5.31 | Yes |
| ENSG00000239470.3 | RP11-16F15.2 | processed_pseudogene | 4.208E−07 | 6.50 | Yes |
| ENSG00000235174.1 | RPL39P3 | processed_pseudogene | 4.568E−07 | 3.07 | Yes |
| ENSG00000214389.2 | RPS3AP26 | processed_pseudogene | 7.386E−07 | 4.28 | Yes |
| ENSG00000174977.8 | AC026271.5 | processed_pseudogene | 7.838E−07 | 2.76 | Yes |
| ENSG00000243829.1 | CTB-33G10.1 | processed_pseudogene | 1.064E−06 | 61.76 | Yes |
| ENSG00000256393.1 | RPL41P5 | processed_pseudogene | 1.876E−06 | 4.31 | Yes |
| ENSG00000253683.1 | CTB-79E8.3 | processed_pseudogene | 2.023E−06 | 3.24 | Yes |
| ENSG00000178660.6 | ARMC10P1 | processed_pseudogene | 2.181E−06 | 3.17 | Yes |
| ENSG00000266992.1 | DHX40P1 | unprocessed_pseudogene | 2.205E−06 | 3.79 | Yes |
| ENSG00000274026.1 | FAM27E3 | transcribed_processed_pseudogene | 2.293E−06 | −3.17 | Yes |
| ENSG00000265727.2 | RN7SL648P | misc_RNA | 4.46E−06 | 1.97 | Yes |
| ENSG00000198618.5 | PPIAP22 | processed_pseudogene | 4.543E−06 | 4.31 | Yes |
| ENSG00000220749.4 | RPL21P28 | processed_pseudogene | 4.871E−06 | 3.83 | Yes |
| ENSG00000283390.1 | RP11-134F2.7 | processed_pseudogene | 4.871E−06 | 42.91 | Yes |
The abbreviations, gene annotation, and statistics are the same as described in Table 1.
Figure 2Venn diagram of differentially expressed genes (DEGs) among samples. Venn diagram to illustrate the overlapped DEGs between NAT vs N, NAT/LT− vs N, and NAT/LT+ vs NAT/LT−.
Figure 3Venn diagram of differentially expressed genes (DEGs) among samples. Venn diagram to illustrate the overlapped DEGs between NAT vs N, NAT/LT− vs N, and T vs NAT.
Figure 4Line plots showing gene expression patterns. Fold changes from N to NAT to T are plotted. (A) Up-up, 38 genes. (B) Up-stable, 354 genes. (C) Up-down, 37 genes. (D) Down-down, 31 genes. (E) Down-stable, 148 genes. (F) Down-up, 38 genes. Up, p-value < 0.05 and log2(fold change) > = 0.584; Down, p-value < 0.05 and log2(fold change) < −0.584; Stable, not belonging to “Up” or “Down” groups; some stable genes may have relatively high/low fold changes but the p-values are not significant.
Figure 5Dendrogram and clusters generated using 646 differentially expressed genes of NAT vs N. The heat map was generated based on 58 samples (46 NAT and 12 N). The annotation bars (above heat map) show sample clusters (SC1-SC4) and gender and the LT status of samples. The bar on the left side demonstrates the range for scaled and centered log2 expression. Red means higher expression and green means lower expression. Four major gene clusters are marked.