| Literature DB >> 32872377 |
Allison R Kolbe1, Matthew L Bendall2, Alexander T Pearson3, Doru Paul4, Douglas F Nixon2, Marcos Pérez-Losada1,5,6, Keith A Crandall1,5.
Abstract
Human endogenous retroviruses (HERVs) have been implicated in a variety of human diseases including cancers. However, technical challenges in analyzing HERV sequence data have limited locus-specific characterization of HERV expression. Here, we use the software Telescope (developed to identify expressed transposable elements from metatranscriptomic data) on 43 paired tumor and adjacent normal tissue samples from The Cancer Genome Atlas Program to produce the first locus-specific retrotranscriptome of head and neck cancer. Telescope identified over 3000 expressed HERVs in tumor and adjacent normal tissue, and 1078 HERVs were differentially expressed between the two tissue types. The majority of differentially expressed HERVs were expressed at a higher level in tumor tissue. Differentially expressed HERVs were enriched in members of the HERVH family. Hierarchical clustering based on HERV expression in tumor-adjacent normal tissue resulted in two distinct clusters with significantly different survival probability. Together, these results highlight the importance of future work on the role of HERVs across a range of cancers.Entities:
Keywords: RNA-seq; TCGA; cancer; endogenous retrovirus; transposable element
Mesh:
Year: 2020 PMID: 32872377 PMCID: PMC7552064 DOI: 10.3390/v12090956
Source DB: PubMed Journal: Viruses ISSN: 1999-4915 Impact factor: 5.818
Figure 1Human endogenous retrovirus (HERV) expression in head and neck squamous cell carcinoma (HNSCC) tumor and paired normal samples. (A) Principal component analysis, with shape and color indicating tumor vs normal tissue. (B) Volcano plot of differential expression between tumor and normal tissue. Positive log-fold change indicates higher expression in tumor tissue. The dashed horizontal line indicates adjusted p-value threshold of 0.05. The dashed vertical line indicates a log2-fold change of +/−1.5.
Figure 2Expression for the top 100 differentially expressed HERVs. Heatmap colors represent the difference between each sample’s expression level and the mean expression level of each HERV (sample expression − mean expression). Row colors indicate HERV family membership. HERV families with five or fewer members present in the top 100 differentially expressed HERVs are presented as “other”. Clustering of the top 100 differentially expressed HERVs was performed on variance-stabilized expression data using the method ward.D2 in R [40]. An ordered list of the HERV loci shown in this figure is available in Supplementary Table S5. Heatmap was generated with pheatmap [47], with the viridis [48] and RColorBrewer color palette [49].
Figure 3Circos plot showing differentially expressed host genes (inner ring) and HERVs (outer ring) across chromosomal locations. Values are plotted on a −log (adjusted p-value) scale. Color indicates degree of significance, with the darkest colors corresponding to lowest significant p-value (0.05), with the brightest red corresponding to an adjusted p-value of p < 0.00000000001. Plot was generated using circlize in R [52].
Figure 4Survival analysis of HNSCC subtypes resulting from hierarchical clustering of HERV expression in tumor-adjacent normal tissue (Supplementary Figure S4). Vertical lines indicate censored data. Curves were fit using the Kaplan–Meier model and statistical significance was calculated using the log-rank test.