| Literature DB >> 35088088 |
Georgia Katsoula1,2, Julia Steinberg2,3, Margo Tuerlings4, Rodrigo Coutinho de Almeida4, Lorraine Southam2, Diane Swift5, Ingrid Meulenbelt4, J Mark Wilkinson5, Eleftheria Zeggini2,6.
Abstract
Osteoarthritis is a prevalent joint disease and a major cause of disability worldwide with no curative therapy. Development of disease-modifying therapies requires a better understanding of the molecular mechanisms underpinning disease. A hallmark of osteoarthritis is cartilage degradation. To define molecular events characterizing osteoarthritis at the whole transcriptome level, we performed deep RNA sequencing in paired samples of low- and high-osteoarthritis grade knee cartilage derived from 124 patients undergoing total joint replacement. We detected differential expression between low- and high-osteoarthritis grade articular cartilage for 365 genes and identified a 38-gene signature in osteoarthritis cartilage by replicating our findings in an independent dataset. We also found differential expression for 25 novel long non-coding RNA genes (lncRNAs) and identified potential lncRNA interactions with RNA-binding proteins in osteoarthritis. We assessed alterations in the relative usage of individual gene transcripts and identified differential transcript usage for 82 genes, including ABI3BP, coding for an extracellular matrix protein, AKT1S1, a negative regulator of the mTOR pathway and TPRM4, coding for a transient receptor potential channel. We further assessed genome-wide differential splicing, for the first time in osteoarthritis, and detected differential splicing for 209 genes, which were enriched for extracellular matrix, proteoglycans and integrin surface interactions terms. In the largest study of its kind in osteoarthritis, we find that isoform and splicing changes, in addition to extensive differences in both coding and non-coding sequence expression, are associated with disease and demonstrate a novel layer of genomic complexity to osteoarthritis pathogenesis.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35088088 PMCID: PMC9239745 DOI: 10.1093/hmg/ddac017
Source DB: PubMed Journal: Hum Mol Genet ISSN: 0964-6906 Impact factor: 5.121
Figure 1Study overview.
Figure 2Differentially expressed (DE) genes between paired low- and high-osteoarthritis grade cartilage. (A) Differential expression of all genes. (B) Differential expression of lncRNA genes. Gene names shown in white boxes in A and B highlight newly implicated differentially expressed genes. (C) Biotype annotations of the DE genes. (D, E) Enlarged C. gene biotype bar plots showing less abundant biotypes. (F) Hierarchical clustering on top 100 DE genes (logCPM: log-counts-per-million). Gene names highlighted in bold show newly implicated DE genes.
Figure 3Gene set enrichments among DE genes. (A) Gene sets significantly associated with DE genes at 5% FDR. Only the ten gene sets with the most significant enrichment in each category are shown. Log2FC: log-fold-change. Adj. P val: FDR. (B) Hierarchical clustering of Gene Ontology (GO) terms based on gene semantic similarity using the Jaccard coefficient. (C) Gene set enrichments among cell type makers. (D) Genes associated with different inflammation-related cell types discussed in the text. Genes are colored according to their log2FC.
Figure 4Differentially expressed lncRNAs between low- and high-osteoarthritis grade cartilage. (A) Heatmap of Spearman correlations between differentially expressed RBP genes and differentially expressed lncRNAs. (B) Network of differentially expressed lncRNAs targeted by three differentially expressed RBP genes that have been previously associated in osteoarthritis cartilage. Novel lncRNAs identified in our analysis are highlighted with a thicker border width. All network nodes are colored according to the log2FC: log-fold-change. (C) Expression (covariate-adjusted logCPM: log counts per million) of long noncoding RNA (lncRNA) TENM3-AS1 and gene TENM3 in low- and high-osteoarthritis grade cartilage. Both TENM3-AS1 and TENM3 were significantly upregulated in high-osteoarthritis grade cartilage. Violin plots show the expression distribution across samples. Boxplots center at the median and whiskers extend to 1.5 times the interquartile range.
Figure 5Differential transcript usage between low- and high-osteoarthritis grade cartilage. (A) Distribution of differentially used transcripts between low- and high-osteoarthritis grade cartilage among ENSEMBL transcript biotypes. (B–F) Violin plots show the distribution of usage of differentially used transcripts for ABI3BP, TRPM4, AKT1S1, VDAC2 and GADD45A. Boxplots within the violin plots have their center at the median and whiskers extend to 1.5 times the interquartile range. Heatmaps show the normalized expression of the respective transcripts between low and high-osteoarthritis grade cartilage accounting for technical variation. logCPM: log-counts-per-million.
Figure 6Differential splicing between low- and high-osteoarthritis grade cartilage for COL11A1. The figure illustrates decreased skipping of exon 8 of COL11A1 in high-osteoarthritis grade cartilage. The exons affected by the skipping event fall within the N-terminal variable region of collagen. The violin plot shows the distribution of intron usage (covariate-adjusted PSI) of the differentially excised intron cluster containing exon 8 (E8) for low- and high-osteoarthritis grade cartilage.
Figure 7Comparison of gene expression differences of the DE genes identified in the discovery and replication datasets. The plot shows gene-level log-fold-changes of DE genes identified in the discovery compared to the replication dataset. Individual genes are shown as single points, and the color corresponds to whether the gene is identified as DE in discovery and replication dataset (red), in the discovery dataset only (black).