Literature DB >> 25888722

Transcriptome analysis of ageing in uninjured human Achilles tendon.

Mandy Jayne Peffers1, Yongxiang Fang2, Kathleen Cheung3, Tim Koh Jia Wei4, Peter David Clegg5, Helen Lucy Birch6.   

Abstract

INTRODUCTION: The risk of tendon injury and disease increases significantly with increasing age. The aim of the study was to characterise transcriptional changes in human Achilles tendon during the ageing process in order to identify molecular signatures that might contribute to age-related degeneration.
METHODS: RNA for gene expression analysis using RNA-Seq and quantitative real-time polymerase chain reaction analysis was isolated from young and old macroscopically normal human Achilles tendon. RNA sequence libraries were prepared following ribosomal RNA depletion, and sequencing was undertaken by using the Illumina HiSeq 2000 platform. Expression levels among genes were compared by using fragments per kilobase of exon per million fragments mapped. Differentially expressed genes were defined by using Benjamini-Hochberg false discovery rate approach (P<0.05, expression ratios 1.4 log2 fold change). Alternative splicing of exon variants were also examined by using Cufflinks. The functional significance of genes that showed differential expression between young and old tendon was determined by using ingenuity pathway analysis.
RESULTS: In total, the expression of 325 transcribed elements, including protein-coding transcripts and non-coding transcripts (small non-coding RNAs, pseudogenes, long non-coding RNAs and a single microRNA), was significantly different in old compared with young tendon (±1.4 log2 fold change, P<0.05). Of these, 191 were at higher levels in older tendon and 134 were at lower levels in older tendon. The top networks for genes differentially expressed with tendon age were from cellular function, cellular growth, and cellular cycling pathways. Notable differential transcriptome changes were also observed in alternative splicing patterns. Several of the top gene ontology terms identified in downregulated isoforms in old tendon related to collagen and post-translational modification of collagen.
CONCLUSIONS: This study demonstrates dynamic alterations in RNA with age at numerous genomic levels, indicating changes in the regulation of transcriptional networks. The results suggest that ageing is not primarily associated with loss of ability to synthesise matrix proteins and matrix-degrading enzymes. In addition, we have identified non-coding RNA genes and differentially expressed transcript isoforms of known matrix components with ageing which require further investigation.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25888722      PMCID: PMC4355574          DOI: 10.1186/s13075-015-0544-2

Source DB:  PubMed          Journal:  Arthritis Res Ther        ISSN: 1478-6354            Impact factor:   5.156


Introduction

The increasing number of people reaching old age provides huge challenges to society, as whereas life span increases, life quality faced by many individuals in old age is poor [1]. Whereas muscle, bone, and joint age-related disease is well recognised, the fibrous connecting tendon tissue has received little attention, despite representing a very common site of pain and dysfunction. Epidemiological studies have revealed a clear link between age and increasing incidence of tendon injury [2,3], suggesting that the mechanical integrity of tendon declines during the ageing process. Although it is generally accepted that a degenerative process precedes gross tendon injury, the aetiology of this process remains elusive and the definition of degeneration is poorly defined. Histological examination of painful Achilles tendon [4], dysfunctional posterior tibialis [5], and supraspinatus tendon collected from cadavers [6] has revealed pathological changes, including signs of collagen fibre disruption, increased staining for glycosaminoglycan, hypercellularity, and cell shape change to a more chondroid appearance. Similar changes have been observed in macroscopically abnormal equine flexor tendon [7], another common site of age-associated tendon injury. Histological abnormalities are more often observed in older individuals [6], although the relationship with ageing and the apparent change in cell function is not clear. Ageing is generally associated with a decline in protein synthesis [8] and a loss of cell functionality [9]. It has been suggested that early degenerative changes in tendon result from an accumulation of micro-damage within the extracellular matrix (ECM) due to an imbalance between anabolic and catabolic pathways [10]. Recent work on equine flexor tendon identified an accumulation of partially degraded collagen within the ECM of old tendons, and it was hypothesised that an inability to remove partially degraded collagen may account for reduced mechanical competency [11]. Another study found that flexor tendon explants from older horses were more susceptible to fatigue damage following cyclical loading in vitro than explants from young horses and that this was a cell-mediated process involving the matrix metallo-proteinases (MMPs) [12]. Cell ageing has been associated with a decreased ability to modulate inflammation resulting in a chronic low-level inflammation termed ‘inflamm-aging’ [13]. Recent work by Dakin and colleagues [14] measured prostaglandin E2 in injured equine flexor tendons and found that levels increased with increasing horse age but that levels of formyl peptide receptor 2/ALX, a receptor responsible for suppressing the inflammatory response, were significantly reduced. These findings intimate that aged individuals exhibit a reduced capacity to resolve inflammation and that ageing may contribute to deregulated tendon repair through these pathways. Quantitative analysis of gene expression changes with age may help the understanding of ageing mechanisms and their interactions with age-related diseases such as tendinopathies [15]. Although microarray technology has been employed to investigate gene expression changes following tendon injury [16], in tendinopathic tissue [17], in response to cyclical strain [18] or a single loading event [19], and effect of loading on tendon healing [20], no comprehensive analysis of alterations in gene expression with age has been undertaken in tendon. RNA-Seq can capture the whole transcriptome, including coding RNAs, isoforms produced by alternative splicing, long non-coding RNAs (lncRNAs) (the importance of which is becoming apparent in disease [21] and ageing [22-24]), and short non-coding RNAs. We have previously used RNA-Seq successfully on equine cartilage tissue and identified an over-representation of genes with reduced expression relating to ECM, degradative proteases, matrix synthetic enzymes, cytokines, and growth factors in ageing cartilage [24]. In this study, we used RNA-Seq to comprehensively identify the human Achilles tendon transcriptome for the first time and then examine changes that occur with ageing. We hypothesised that ageing results in reduced expression of ECM-related proteins and matrix-degrading enzymes. In addition, we sought to identify previously unrecognised slice variants and non-coding RNAs associated with tendon ageing in a ‘bottom-up’ inductive approach.

Methods

Sample collection and preparation

All human Achilles tendons used in this study—RNA-Seq and quantitative real-time polymerase chain reaction (qRT-PCR)—were harvested from limbs amputated during surgical procedures to treat sarcomas at the Royal National Orthopaedic Hospital, Stanmore. Tissue collection was carried out through the Stanmore Musculoskeletal BioBank, which has ethical approval from the Cambridgeshire 1 Research Ethics Committee (REC reference 09/H0304/78) to collect tissue for research into musculoskeletal conditions. All patients gave consent for their tissue to be used for musculoskeleton-related research. Local research-and-development approval for this project was given by the UCL/UCLH/RF Joint Research Office (reference number 11/0464). For RNA-Seq, tendons were collected from donors who were 69.4 ± 7.3 years old (old group, n = 5, 3 female, 2 male) and donors who were 19 ± 5.8 years old (young group, n = 4, 4 male). Tendon tissue was collected within 24 hours of limb removal, except for one sample in which tissue was collected within 48 hours. The Achilles tendon was dissected free from the limb. Only tendons with a normal macroscopic appearance were used for this study. A section of tissue approximately 1 cm in length was taken from the mid region of the tendon between the musculotendinous junction and the insertion site. Outer tissue (paratenon) was removed and the remaining tendon tissue placed into RNAlater (Ambion, Warrington, UK) in accordance with the instructions of the manufacturer.

RNA extraction

Tendon was pulverising into a powder with a dismembranator (Mikro-S; Sartorius, Melsungen, Germany) following freezing in liquid nitrogen. Immediately, 20 volumes of Tri Reagent (Ambion) was added to the powdered tendon tissue and the RNA extracted and purified as described by Peffers et al. [25] (2013). RNA was quantified by using a Nanodrop ND-100 spectrophotometer (Labtech, Uckfield, East Sussex, UK) and assessed for purity by UV absorbance measurements at 260 and 280 nm.

RNA-Seq analysis: cDNA library preparation and sequencing

Total RNA was analysed by the Centre for Genomic Research, University of Liverpool, for RNA-Seq library preparation and sequencing by using the Illumina HiSeq 2000 platform. Total RNA integrity was confirmed by using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). Ribosomal RNA (rRNA) was depleted from 9 total RNA samples by using the Ribo-Zero™ rRNA Removal Kit (Human/Mouse/Rat; Epicentre, Madison, WI, USA) in accordance with the instructions of the manufacturer. cDNA libraries were prepared with the ScriptSeq v2 RNA-Seq library preparation kit (Epicentre) by using 50 ng rRNA depleted RNA as starting material in accordance with manufacturer protocols as previously described [24]. The final pooled library was diluted to 8 pmol before hybridisation. The dilute library (120 μL) was hybridised on one lane of the HiSeq 2000 at 2 × 100-base pair (bp) paired-end sequencing with v3 chemistry.

Data processing

The sequence libraries for each sample were processed by using CASAVA version 1.8.2 to produce 100-bp paired-end sequence data in fastq format. The fastq files were processed by using Cutadapt version 1.2.1 [26] with option ‘–O 3’ to trim adapter from any read if it matched the adapter sequence for 3 bp or more at the 3′ end. In addition, a quality trimming was performed by using Sickle version 1.200. The trimmed R1-R2 read pairs, for each sample, were aligned to reference sequence [27] by using TopHat2 version 2.0.10 [28] with default settings, except for the option –g 1. Read counts were obtained from the mapping results by using HTSeq-count and genome annotation [29]. The differential gene expression analysis was performed on R platform by using the edgeR package [30] and focused on the contrast of old and young donors. The count data were normalised across libraries by using trimmed mean M (TMM) values of the default methods edgeR. The tagwise dispersions were estimated and then used for logFC (log2 fold change) estimating and testing. Differentially expressed genes (DEGs) were extracted by applying the threshold false discovery rate (FDR) of less than 0.05 to adjusted P values, which were generated by using Benjamini and Hochberg approach [31]. In addition, FPKM (fragments per kilobase of exon per million fragments mapped) values were converted from count values for comparing expression levels among genes. All sequence data produced in this study have been submitted to National Center for Biotechnology Information Gene Expression Omnibus (NCBI GEO) under Array Express accession number E-MTAB-2449.

Analysis of splice variants

Trimmed paired reads were aligned to a reference human transcriptome (Ensembl iGenomes build GRCh37) by using Bowtie2 [32]. The alignments (BAM files) were converted into sorted SAM files by using SAMtools [33]. Parameters for TopHat were estimated by using a Picard tool (CollectInsertSizeMetrics.jar) [34]. Reads were aligned to the reference genome (Ensembl build GRCh37) by using TopHat [35], specifying mate inner distance (mean inner distance between mate pairs) and standard deviation for each sample. Mapped reads were then assembled into complete transcripts by using the splice junction mapping tool Cufflinks [36] with option –G, which uses the Ensembl reference gene track to improve mapping. Cuffmerge was used to merge the assembled transcripts into a consensus gene track from the all of the mapped samples. Cuffdiff was used to identify DEGs and differentially expressed transcripts between young and old tendon. Genes and transcripts were identified as being significantly differentially expressed with q values of less than 0.05, calculated by the Benjamini and Hochberg FDR correction [31]. Downstream analysis and visualisation of results, including quality control of the samples, was undertaken by using the cummeRbund package in R. Graphs were generated by using cummeRbund and the ggplot2 package [37].

Functional analysis

To systematically determine networks, functional analyses, and canonical pathways that the DEGs might involve, we performed the pathway/network enrichment analysis using the ingenuity pathway analysis (IPA) tool from the Ingenuity Systems [38] by using a list of DEGs with values-adjusted P value of less than 0.05 and ±1.4 log2 fold regulation. Gene symbols were used as identifiers and the Ingenuity Knowledge Base gene was used as a reference for a pathway analysis. For network generation, a data set containing gene identifiers and corresponding expression values was uploaded. Default settings were used to identify molecules whose expression was significantly differentially regulated. These molecules were overlaid onto a global molecular network contained in the Ingenuity Knowledge Base. Networks of ‘network-eligible molecules’ were then algorithmically generated based on their connectivity. The functional analysis identified the biological functions and diseases that were most significant to the data set. Right‐tailed Fisher’s exact test was used to calculate P values. Canonical pathways analysis identified the pathways from the IPA library that were most significant to the data set. For isoform analysis, the Database for Annotation, Visualization and Integrated Discovery (DAVID) (DAVID bioinformatics resources 6.7) was used [39]. The web-based functional annotation tool enabled functional clustering of gene. The functional clustering tool was used for functional enrichment for DEG isoforms with values-adjusted P value of less than 0.05 and ±1.4 log2 fold regulation.

Real-time polymerase chain reaction

Samples of RNA from both the same pools used for the RNA-Seq analysis and an additional independent cohort harvested in the same manner (n = 4 young; 16.7 ± 2.8 years old and n = 4 old; 73.2 ± 6.5 years old) were used for qRT-PCR. To validate results from differentially expressed isoforms, the independent cohort was used. Moloney murine leukaemia virus (M-MLV) reverse transcriptase and random hexamer oligonucleotides (both from Promega, Southampton, UK) were used to synthesize cDNA from 1 μg RNA in a 25 μL reaction. PCR was performed on 1 μL 10× diluted cDNA by employing a final concentration of 300 nM of each primer in 20 μL reaction volumes on an ABI 7700 Sequence Detector using PrimerDesign 2X PrecisionTM SYBR Green Mastermix (Primer Design, Southampton, UK). qRT-PCR was undertaken by using gene-specific primers (for protein-coding genes these were exon-spanning). Primers used had been validated in previous publications [40,41] and supplied by Eurogentec (Seraing, Belgium) or were designed and validated commercially (Primer Design). Steady-state transcript abundance of potential endogenous control genes was measured in the RNA-Seq data. Assays for four genes—glucose-6-phosphate isomerise (GPI), beta-actin (ACTB), ribosomal protein 13 (RSP13), and ribosomal protein 16 (RPS16)—were selected as potential reference genes as their expression was unaltered. Stability of this panel of genes was assessed by applying a gene stability algorithm [42]. RSP16 was selected as the most stable endogenous control gene. Relative expression levels were normalised to RPS16 and calculated by using the 2−∆Ct method [43]. Primers pairs used in this study are listed (Table 1). qRT-PCR analysis data was log10-transformed to ensure normal distribution and then analysed by using Student’s t test.
Table 1

Gene primer sequences used in RNA-Seq validation

Name Gene type Ensemble gene ID/transcript ID Primer sequence
RPS16 Protein-codingENSG00000105193F: GAAATCCTACCGATAAGCCCA
R: TTTCTTGAAACTTTAAAATCCCTCAA
MMP3 Protein-codingENSG00000149968F: ATTCCATGGAGCCAGGCTTTC
R: CATTTGGGTCAAACTCCAACTGTG
COL1A1 a Protein-codingENSG00000108821F: GGAGGAGAGTCAGGAAGG
R: GCAACACAGTTACACAAGG
ACAN b Protein-codingENSG00000108821F: TCGAGGACAGCGAGGCC
R: TCGAGGGTGTAGCGTGTAGAGA
POU3F4 Protein-codingENSG00000196767F: GTTCGCTCGCTCTCTCGTA
R: GAAGGGAAGGGAAGGGGAAA
MYF5 Protein-codingENSG00000111049F: TGAACTAATTTTCTGGTCTATATGAC
R: TGTACATGTTGTCTTGGTTTGGG
IGF1 Protein-codingENSG00000017427F: CACTATGGACAGATGTAAAAGAAACTA
R: ACACACTGGGGACAAGAAATAAA
MMP16 Protein-codingENSG00000156103F: ACCCGTGTAACCCTTTGAGA
R: AACCTGAACTTCTTGAACTTGTG
TGFB3 Protein-codingENSG00000119699F: GTAAAGAAAGTGTGGGTTTGGTTA
R: AACATCTCAACTTACCATCCCTTT
EGF Protein-codingENSG00000138798F: ACAGGAGGCTTCGGAGTT
R: AATCAGGCAATTTACTTACAATCTT
COL3A1-001 Protein-codingENST00000304636F: CAGGTCCCAGCGGTTCT
R: CCTTTTGGTCCAGACACTCC
COL3A1-201 Protein-codingENST00000317840F: GGTAGCCCTGGTGAGAGAG
R: TGCCAGGAGGTCCAAAGAG
FGF10-001 Protein-codingENST00000264664F: TGCCGTCAAAGCCATTAACA
R: CATTTTCCTCTATCCTCTCCTTCA
CRTAC1-001 Protein-codingENST00000370597F:ATCTTCTTCAACAACATTGCCTAC
R: GGGTCTCCGTGCTCTCTAC
IGF1-001 Protein-codingENST00000337514F: CAGCAGTCTTCCAACCCAAT
R: AAGAGATGCGAGGAGGACAT
XIST LncENSG00000229807F: TCCCAGAGAATGCCTAATACTTT
R: GTAGAAGAGATACGGAGTAGGAAT
LINC00957 LncENSG00000235314F: GAGAGTAAGCAGACCTGGGT
R: ACCTTGTCCGAGTTCCATCT
RP11.308 N19.1 LncENSG00000234323F: GCCTCTTTCATCACTGCCGA
R: TAGCAGCAGTTGGGGTGTTT

a[40] and b[41] denote primer pairs published previously. F, forward; Lnc, long non-coding; R, reverse.

Gene primer sequences used in RNA-Seq validation a[40] and b[41] denote primer pairs published previously. F, forward; Lnc, long non-coding; R, reverse.

Statistical analysis

The analyses were undertaken by using edgeR [30], S-Plus (version 7.0; Tibco Software Inc., Palo Alto, CA, USA), SPSS (version 20; IBM, Portsmouth, Hampshire, UK), and Excel (2007; Microsoft, Redmond, WA, USA) software.

Results

Overview of RNA-Seq data

An average of 32.1 million pairs of 100-bp paired-end reads per sample were generated that aligned to the reference sequence of the human genome. Using pooled R1 and R2 files for all samples in Trimmed data gave 95.1% of called bases with of Phred score of more than 30 [44]. (See Table 2 for summary of mapping results.) Of the 63,152 human genes, between 40.5% and 47.4% had at least one read aligned; 20,322 of the genes had no reads aligned from any of the nine samples. This is similar to the output of other RNA-Seq sequencing studies [24,45].
Table 2

Summary of sequence alignment to the human genome

Sample Reads to align a Reads aligned to genome Percentage b Reads properly paired Percentage c Reads with mate unmapped Percentage d Percentage of ‘expressed’ genes e
Young 161,516,10655,995,22391.0351,821,06892.554,174,1557.4546.65
Young 274,068,43067,565,30491.2262,477,22692.475,088,0787.5345.98
Young 365,954,16860,771,42392.1456,659,04693.234,112,3776.7747.19
Young 459,947,54053,381,91789.0548,909,71091.624,472,2078.3840.58
Young 589,707,09280,798,45990.0774,516,84692.236,281,6137.7745.01
Old 162,712,52257,345,26191.4453,413,21293.143,932,0496.8646.42
Old 246,590,70441,788,78089.6938,331,15691.733,457,6248.2740.46
Old 361,364,40255,643,82990.6851,415,61092.404,228,2197.6046.36
Old 455,361,08249,725,12289.8245,615,23891.734,109,8848.2742.44
Maximum89,707,09280,798,45992.1474,516,84693.236,281,6138.3847.4
Mean65,591,73459,482,20590.6554,953,87692.364,528,3307.6444.92
Minimum46,590,70441,788,78089.0538,331,15691.623,457,6246.7740.46

The table shows the number and percentage of reads mapped to the human reference sequences. aSum of R1 and R2 reads used in the alignment. bPercentage of reads used in the alignment that align to the reference genome. cPercentage of reads used in the alignment that align to the reference genome in the correct relative orientation to their mate. dPercentage of reads used in the alignment that align to the reference genome but whose mate does not align. ePercentage of 63,152 annotated human genes with at least one read aligned.

Summary of sequence alignment to the human genome The table shows the number and percentage of reads mapped to the human reference sequences. aSum of R1 and R2 reads used in the alignment. bPercentage of reads used in the alignment that align to the reference genome. cPercentage of reads used in the alignment that align to the reference genome in the correct relative orientation to their mate. dPercentage of reads used in the alignment that align to the reference genome but whose mate does not align. ePercentage of 63,152 annotated human genes with at least one read aligned. These reads were used to estimate transcript expression of all nine samples using FPKM in order to identify the most abundant genes in tendon. Table 3 demonstrates the 25 most highly expressed genes in young and old tendon (the entire data set is in Additional file 1).
Table 3

Top 25 genes

Gene name Name Log fold change FDR-adjusted Mean FPKM
RN7SK 7SK small nuclear0.11.022,323.9
RN7SL2 RNA, 7SL, cytoplasmic 2−0.50.712,212.2
RN7SL4P RNA, 7SL, cytoplasmic, pseudogene 1−0.60.65,813.8
RN7SL1 RNA, 7SL, cytoplasmic 2−1.10.53,846.0
ANGPTL7 Angiopoietin-like 70.40.83,529.0
RNA28S5 Ribosomal RNA 18S 6−0.30.93,374.0
RN7SKP255 7SK small nuclear pseudogene 2550.11.02,898.5
RN7SKP203 7SK small nuclear pseudogene 203−0.11.02,836.7
S100A6 S100 calcium binding protein A60.10.91,950.3
MALAT1 Metastasis associated lung adenocarcinoma transcript 10.01.01,801.5
DCN Decorin0.01.01,791.5
RN7SL5P RNA, 7SL, cytoplasmic, pseudogene 2−1.30.41,774.0
THBS4 Thrombospondin 4−0.80.61,368.2
MT-CO1 Mitochondrial cytochrome c oxidase III−0.60.61,230.9
TMSB10 Thymosin beta 10−0.30.71,220.1
ASPN Asporin−1.60.51,145.3
MMP3 Matrix metallopeptidase 32.20.21,126.7
RNA18S5 Ribosomal RNA 28S 5−0.20.91,125.5
CLU Clusterin0.30.91,106.6
MT-ND3 Mitochondrial NADH dehydrogenase 3−0.50.61,036.1
LUM Lumican0.01.0978.9
VIM Vimentin0.20.9977.1
CILP Cartilage intermediate layer protein0.30.9920.8
EEF1A1 Eukaryotic translation elongation factor 1 alpha-like 70.01.0891.4
IGFBP6 Insulin-like growth factor-binding protein 60.70.6859.0

The table demonstrates the 25 most highly expressed genes in young and old tendon in terms of transcript expression as determined by using fragments per kilobase of exon per million fragments mapped (FPKM). FDR, false discovery rate.

Top 25 genes The table demonstrates the 25 most highly expressed genes in young and old tendon in terms of transcript expression as determined by using fragments per kilobase of exon per million fragments mapped (FPKM). FDR, false discovery rate.

Identification of differentially expressed genes and isoforms

A principal component analysis (PCA) plot of log2 gene expression data indicated that the effect of age on gene expression was distinct as data were clustered in two groups (Figure 1A). Within the young group, two samples clustered together and two were independent of each other indicating more variability between young donors. Alterations in gene expression between young and old tendon demonstrated significant age-related changes. In total, the expression of 325 transcribed elements, including protein-coding transcripts and non-coding transcripts, small non-coding RNAs (snoRNAs), pseudogenes, lncRNAs, and a single microRNA, was significantly different in old compared with young tendon (±1.4 log2 fold change, FDR-adjusted P value of less than 0.05) (Figure 1B). Of these, 191 were at higher levels in the older tendon and 134 were at lower levels in the older tendon. The top 10 genes most DEG (increased and decreased) during tendon ageing are given in Table 4. The entire list of significantly DEG transcripts is presented in Additional file 2. NCBI GEO under accession number E-MTAB-2449 contains a complete list of all genes mapped. Of the 191 transcripts expressed at a higher level in old donors, 148 were known protein-coding genes. The remaining 43 genes contained 34 lncRNAs, one snoRNA, and eight pseudogenes (Table 5). Within the group where gene expression was lower in old compared with young tendon, 112 were known protein-coding genes. The remaining 22 genes contained 16 lncRNAs, one snoRNA, four pseudogenes, and a single microRNA (miRNA) (Table 6). Thus, 325 genes were input into IPA for downstream analysis, and 273 of these were mapped.
Figure 1

Principal component analysis (PCA) and volcano plot of differentially abundant transcripts. (A) PCA plot of log2 gene expression data showed the greatest variability in RNA-Seq data was due to the age of the donor. The second and third components are also shown as changes in the first and second components are due to differing sample sizes between young and old groups. (B) Volcano plot represents log2 FC × log2 CPM plot (counts per million mapped reads) CPM plot. Low expression genes (log2 CPM of less than −5) are coloured orange. Significant differentially expressed genes (DEGs) are coloured in red. A set of DEGs between young and old tendon was identified. With the common dispersion in edgeR [30], 325 DEGs were identified with a P value of less than 0.05 (red).

Table 4

Top 10 genes with the highest and lowest log fold change when comparing young and old tendon

Condition increased differential expression Gene symbol Gene name Location Function Log 2 fold change
Old CYP24A1 Cytochrome P450, family 24, subfamily A,1CytoplasmEnzyme9.3
FOLH1B Folate hydrolase 1BCytoplasmEnzyme8.8
POU3F4 POU class 3 homeobox 4NucleusTranscription regulator8.3
FOLH1 Folate hydrolase 1Plasma membranePeptidase7.7
BRS3 Bombesin-like receptor 3Plasma membraneG-protein coupled receptor7.6
XIST X (inactive)-specific transcript (non-protein coding)VariableLong non-coding RNA6.9
EGF Epidermal growth factorExtracellular spaceGrowth factor6.5
BRINP3 Bone morphogenetic protein/retinoic acid inducible neural-specific 3Secreted/mitochondriaOsteoblast differentiation factor6.5
NXPH2 Neurexophilin 2Extracellular spaceSignalling molecules6.5
MYF5 Myogenic factor 5NucleusTranscription regulator6.4
Young CYP1A1 Cytochrome P450, family 1, subfamily A, polypeptide 1CytoplasmEnzyme−7.6
CLEC3A C-type lectin domain family 3, member ASecretedCell adhesion−6.1
DIRAS1 DIRAS family, GTP-binding RAS-like 1Plasma membraneEnzyme−5.9
SYT13 Synaptotagmin XIIIPlasma membraneTransporter−5.6
GPR143 G protein-coupled receptor 143Plasma membraneG-protein coupled receptor−5.6
SLC7A14 Solute carrier family 7 member 14Plasma membraneTransporter−5.4
HIST 1H3A HISTONE 1H3ANucleusNucleosome−5.4
PYCR1 Pyrroline-5-carboxylate reductase 1CytoplasmEnzyme−5.3
UTS2R Urotensin 2 receptorPlasma membraneG-protein coupled receptor−5.2
HBG1 Hemoglobin, gamma ACytoplasmFe carrier−4.9

Log2 fold change and q value (adjusted P value) were determined in edgeR. A logarithm to the base 2 of 9 is approximately a linear fold change of 3.2. Shown are the 10 genes with highest and lowest expression in old compared with young tendon samples.

Table 5

Non-coding RNAs significantly overexpressed in old tendon

Gene accession Category Type of transcript Log 2 fold change Description/Aliases Function
RNVU1-6SnoRNASmall nuclear4.9Variant U1 small nuclear 6Splicesomal
RP11-578O24.2PseudogenePseudogene6.2Unknown
AP003041.1PseudogenePseudogene5.3Unknown
MKRN7PPseudogenePseudogene4.8Makorin ring finger protein 7Unknown
RPS4XP22PseudogenePseudogene4.1Ribosomal protein S4X pseudogene 22Unknown
RP11-346 M5.1PseudogenePseudogene2.8Unknown
RN7SKP234PseudogenePseudogene2.4RNA,7SK small nuclear pseudogene 243Unknown
CTD-2114 J12.1PseudogenePseudogene2.1Unknown
AL021068.1PseudogenePseudogene1.9Unknown
CTD-2083E4.4LncRNANovel processed2.3Unknown
RP3-326 L13.3LncRNANovel lncRNA7.1Unknown
RP11-377D9.3LncRNANovel lncRNA6.3Inc-KIAA1467-2Unknown
RP11-71E19.2LncRNANovel lncRNA5.4Inc-KLF15-2Unknown
AC073636.1LncRNANovel lncRNA5.1Inc-NFE2L2-3Unknown
RP11-279 F6.1LncRNANovel lncRNA4.7Inc-RPLP1-1Unknown
AC004510.3LncRNANovel lncRNA4.1Unknown
RP11-500B12.1LncRNANovel lncRNA4.1Inc-TLR4-1Unknown
AC007405.6LncRNANovel lncRNA3.8Inc-AC007405.7.1-1Unknown
RP11-399D6.2LncRNANovel lncRNA3.5Inc-DMRTA1-2Unknown
RP11-966I7.1LncRNANovel lncRNA3.5Inc-PRKD1-8Unknown
RP11-815 M8.1LncRNANovel lncRNA3.1Inc-DUSP10-3Unknown
AC003090.1LncRNANovel lncRNA2.4Inc-NPVF5Unknown
RP11-79H23.3LncRNANovel lncRNA2.2Inc-FAM164A-1Unknown
RP11-4 F5.2LncRNANovel lncRNA2.0Inc-MCTP2-1Unknown
RP5-1024G6.8LncRNANovel lncRNA1.9Unknown
NKX2-1-AS1LncRNANovel antisense7.4NKX2-1 antisense RNA 1Unknown
RP3-326 L13.2LncRNANovel antisense6.5Unknown
RP11-464O2.2LncRNANovel antisense5.4Inc-LHPP-1Unknown
AC091633.3LncRNANovel antisense5.4Inc-MUC20-2Unknown
RP11-711G10.1LncRNANovel antisense5.3Unknown
SATB2-AS1LncRNANovel antisense4.2SATB2 antisense RNA 1Unknown
FEZF1-AS1LncRNANovel antisense4.1FEZF1 antisense RNA 1Unknown
SLC26A4-AS1LncRNANovel antisense3.7SLC26A4 antisenseUnknown
RP11-160A10.2LncRNANovel antisense3.5Inc-CLVS2Unknown
RP4-803 J11.2LncRNANovel antisense3.4Inc-RAB4A-1Unknown
RP11-402 J6.1LncRNANovel antisense2.5Inc-ALPK1-1Unknown
RP11-710C12.1LncRNANovel antisense2.0Unknown
XISTLncRNAKnown lncRNA7.0X inactive specific transcriptX chromosomal inactivation
LINC00261LncRNAKnown lncRNA5.9LINC RNA 261 Inc FOXA2-2Cancer
TSIXLncRNAKnown lncRNA3.9TSIX transcript, XIST antisense RNAAntisense during X chromosomal inactivation
LINC00461LncRNAKnown lncRNA2.5INC MEF2C-2Unknown
DLX6-AS1LncRNAKnown antisense2.7Embryonic ventral forebrain-1Gene expression; hippocampus

Genes here have at least a ±1.4 log2 fold change and false discovery rate-adjusted P value of less than 0.05 as determined by edgeR. LncRNA, long-coding RNA; SnoRNA, small non-coding RNA.

Table 6

Non-coding RNAs with significantly reduced expression in old tendon

Gene accession Category Type of transcript Log 2 fold change Description/Aliases Function
LINC00957LncRNANovel lncRNA−1.73Long intergenic non-coding RNA 957Unknown
ZNF667-AS1LncRNANovel lncRNA−1.82ZNF667 antisense RNA 1 (head to head)Unknown
RP11-308 N19.1LncRNANovel lncRNA−2.43Unknown
AF131215.9LncRNANovel sense intronic−2.78Unknown
CTB-113P19.1LncRNANovel antisense−2.93Inc-G3BP1-2Unknown
AF131215.2LncRNANovel sense intronic−3.24Unknown
MAFG-AS1LncRNAKnown antisense−3.45MAFG antisense RNA 1 (head to head)Unknown
AC020571.3LncRNANovel antisense−3.50Inc-CCDC150-1Unknown
AC012613.2LncRNANovel antisense−3.86Inc-IL17B-3Unknown
RP11-923I11.1LncRNANovel antisense−3.91Inc-SCN8A-1Unknown
RP11-270 M14.5LncRNANovel lncRNA−4.01Inc-TTLL5-1Unknown
CTD-2540B15.9LncRNANovel lncRNA−4.50Novel lincRNAUnknown
CTD-3049 M7.1LncRNANovel lncRNA−4.53Inc-RGMA7Unknown
RP5-1198O20.4LncRNANovel lncRNA−5.25Inc-KLF17-1Unknown
RP11-300E4.2LncRNANovel antisense−6.24Inc-JPH1_5Unknown
RP11-445 L6.3LncRNANovel lncRNA−6.31Inc-TNC-2Unknown
MIR1245AMiRNAKnown miRNA−2.15Unknown
MXRA5P1PseudogenePseudogene−3.52MXRA5P1Unknown
RNY3P2PseudogenePseudogene−5.30RNA Ro-associated Y3 pseudogene 2Unknown
RP11-494 K3.2PseudogenePseudogene−6.36Unknown
CTC-260E6.10PseudogenePseudogene−7.23Unknown
Y_RNASnoRNANovel misc RNA−5.53DNA replication and cell proliferation

Terms are derived from Ensemble [46] and Vega [47]. ‘Antisense’ overlaps the genomic span of a protein-coding locus on the opposite strand. ‘Known’ indicates identical to known cDNA or proteins from the same species and has an entry in a model database. ‘Novel’ indicates identical or homologous to cDNAs from the same species or proteins from all species. ‘Processed transcript’ does not contain open reading frame and cannot be placed in any other category. ‘Pseudogene’ indicates homology to protein but from a disrupted coding sequence and an active homologous gene can be found at another locus. ‘Sense intronic’ has a long non-coding transcript in introns of a coding gene that does not overlap any exons. LncRNA, long non-coding RNA (which can be further classified as LINCRNA, which is a long interergenic non-coding RNA locus of more than 200 base pairs); miRNA, microRNA; SnoRNA, small non-coding RNA.

Principal component analysis (PCA) and volcano plot of differentially abundant transcripts. (A) PCA plot of log2 gene expression data showed the greatest variability in RNA-Seq data was due to the age of the donor. The second and third components are also shown as changes in the first and second components are due to differing sample sizes between young and old groups. (B) Volcano plot represents log2 FC × log2 CPM plot (counts per million mapped reads) CPM plot. Low expression genes (log2 CPM of less than −5) are coloured orange. Significant differentially expressed genes (DEGs) are coloured in red. A set of DEGs between young and old tendon was identified. With the common dispersion in edgeR [30], 325 DEGs were identified with a P value of less than 0.05 (red). Top 10 genes with the highest and lowest log fold change when comparing young and old tendon Log2 fold change and q value (adjusted P value) were determined in edgeR. A logarithm to the base 2 of 9 is approximately a linear fold change of 3.2. Shown are the 10 genes with highest and lowest expression in old compared with young tendon samples. Non-coding RNAs significantly overexpressed in old tendon Genes here have at least a ±1.4 log2 fold change and false discovery rate-adjusted P value of less than 0.05 as determined by edgeR. LncRNA, long-coding RNA; SnoRNA, small non-coding RNA. Non-coding RNAs with significantly reduced expression in old tendon Terms are derived from Ensemble [46] and Vega [47]. ‘Antisense’ overlaps the genomic span of a protein-coding locus on the opposite strand. ‘Known’ indicates identical to known cDNA or proteins from the same species and has an entry in a model database. ‘Novel’ indicates identical or homologous to cDNAs from the same species or proteins from all species. ‘Processed transcript’ does not contain open reading frame and cannot be placed in any other category. ‘Pseudogene’ indicates homology to protein but from a disrupted coding sequence and an active homologous gene can be found at another locus. ‘Sense intronic’ has a long non-coding transcript in introns of a coding gene that does not overlap any exons. LncRNA, long non-coding RNA (which can be further classified as LINCRNA, which is a long interergenic non-coding RNA locus of more than 200 base pairs); miRNA, microRNA; SnoRNA, small non-coding RNA. The analysis identified a number of transcript isoforms expressed in tendon, some of which were differentially expressed between young and old groups of tendon (Figure 2). In total, 183,660 isoforms were detected in young and 191,673 isoforms were detected in old tendon. Among these, 21,193 isoforms were detected only in young and 29,206 isoforms only in old. Sixty-three known isoforms were upregulated in old tendon, with 80 downregulated with an FDR-adjusted P value of less than 0.05 and ±1.4 log2 fold regulation. The top 10 up- and down-regulated isoforms are presented in Table 7. The entire list of significantly DEG isoforms is presented in Additional file 3.
Figure 2

Volcano plot showing significant differentially expressed (DE) isoforms between young and old tendon. Following assembly with Cufflinks of alternative splicing transcripts [36], significant DE transcripts were identified with q values (P value adjusted to false discovery rate) of less than 0.05. Red spots represent significant DE isoforms, and black spots are for non-DE isoforms.

Table 7

List of top 10 up-/down-regulated isoforms in ageing tendon

Comparison Gene Locus Log 2 (fold change) q value
Higher in oldPM20D26:89855768-898752840.540.04
ILF3-AS119:10762537-107645200.620.01
PERP6:138411922-1384286480.650.03
ODC12:10580093-105886300.670.03
PRKAA21:57110994-571810080.680.03
PNRC16:89790469-897948790.690.02
SYT111:155829299-1558549900.700.02
F13A16:6144317-63212460.710.01
CITED26:139693392-1396957570.720.01
LDB110:103867316-1038802100.730.01
Lower in oldHLA-DRB56:32485119-32498064−4.720.01
MYH117:10286448-10527201−4.520.01
CRABP21:156669397-156675608−4.440.03
CPXM120:2774714-2781283−4.290.01
COL3A12:189839045-189877472−4.160.01
COL3A12:189839045-189877472−3.870.01
ADAM1210:127700949-128077024−3.840.01
CAPN6X:110488330-110513751−3.720.03
COL1A117:48260649-48278993−3.460.01
FAP2:163018279-163101661−3.370.01

The differentially expressed genes (DEGs) in young versus old tendon were determined by Cuffdiff. The fold change is the ratio of fragments per kilobase of exon per million fragments mapped (FPKM) of those genes in young to FPKM of those genes in old tendon. The significant DEGs (q values of less than 0.05) calculated with a Benjamini-Hochberg correction were ranked on their log2 fold change, and 10 genes with the highest or lowest fold changes are shown in the table.

Volcano plot showing significant differentially expressed (DE) isoforms between young and old tendon. Following assembly with Cufflinks of alternative splicing transcripts [36], significant DE transcripts were identified with q values (P value adjusted to false discovery rate) of less than 0.05. Red spots represent significant DE isoforms, and black spots are for non-DE isoforms. List of top 10 up-/down-regulated isoforms in ageing tendon The differentially expressed genes (DEGs) in young versus old tendon were determined by Cuffdiff. The fold change is the ratio of fragments per kilobase of exon per million fragments mapped (FPKM) of those genes in young to FPKM of those genes in old tendon. The significant DEGs (q values of less than 0.05) calculated with a Benjamini-Hochberg correction were ranked on their log2 fold change, and 10 genes with the highest or lowest fold changes are shown in the table.

Age-related changes in transcription factors

There was an increase in the expression of 12 transcription factors in tendon derived from older donors compared with young: ALX homeobox (ALX1), insulin gene enhancer protein 1 (ISL1), lim homeobox 9 (LHX9), myocardin (MYOCD), POU domain, class 3, transcription factor 4 (POU3F4), POU3F3, paired box transcription factor 3 (PAX3), PAX6, PAX9, thyroid transcription factor 1 (NKX2-1), NKX6-1, and dachshund family transcription factor 2 (DACH2). In comparison, there was an increase in only three in younger donors: basic helix-loop-helix family, member e40 (BHLHE40), early growth response 2 (EGR2), and zinc finger of the cerebellum 3 (ZIC3).

Differentially expressed genes and network analysis

DEGs (325) and differentially expressed transcript isoforms (143) associated with ageing were analysed together in IPA with the following criteria: P value of less than 0.05 and 1.4 log2 fold change. Network-eligible molecules were overlaid onto molecular networks based on information from the ingenuity pathway knowledge database and networks generated based on connectivity. (See Additional file 4 for all identified networks and their respective molecules.) The top four scoring networks for genes differentially expressed with tendon age were from cellular function and maintenance, cellular growth and proliferation, cellular cycling, and cellular development (Figure 3). Significant IPA canonical pathways are demonstrated in Table 8, and the associated molecules of the top canonical pathways identified are in Additional file 5. These include hepatic fibrosis, oestrogen biosynthesis, and transcriptional regulatory networks in embryonic stem cells. Interestingly, skeletal and muscular disorders were identified as one of the top diseases associated with the gene set (Additional file 6).
Figure 3

Top-scoring networks derived from the 325 genes differentially expressed in ageing. (A) Ingenuity pathway analysis (IPA) identified from cellular function and maintenance as the principal associated network functions with scores of 43. (B) The second top-scoring network was a cellular growth and proliferation, with scores of 32. (C) IPA identified cell cycle and skeletal and muscular system development function with a score of 32. (D) The fourth top-scoring network was cellular development, with a score of 28. Figures are graphical representations between molecules identified in our data in their respective networks. Green nodes indicate upregulated gene expression in older tendon; red nodes indicate downregulated gene expression in older tendon. Intensity of colour is related to higher fold change. The key to the main features in the networks is shown.

Table 8

A number of ingenuity pathway analysis canonical pathways were significantly affected in ageing tendon

Name of canonical pathway P value Ratio
Hepatic fibrosis/Hepatic stellate cell activation5.69 × 10−4 8/142 (0.056)
Oestrogen biosynthesis1.61 × 10−3 4/38 (0.105)
Transcriptional regulatory network in embryonic stem cells1.95 × 10−3 4/40 (0.1)
Glioblastoma signalling3.66 × 10−3 7/159 (0.044)
Bupropion degradation4.89 × 10−3 3/26 (0.115)

The significance of the association between the data set and the canonical pathway was measured by using a ratio of the number of molecules from the data set that mapped to the pathway divided by the total number of molecules that map to the canonical pathway. Fisher’s exact test was used to calculate P values.

Top-scoring networks derived from the 325 genes differentially expressed in ageing. (A) Ingenuity pathway analysis (IPA) identified from cellular function and maintenance as the principal associated network functions with scores of 43. (B) The second top-scoring network was a cellular growth and proliferation, with scores of 32. (C) IPA identified cell cycle and skeletal and muscular system development function with a score of 32. (D) The fourth top-scoring network was cellular development, with a score of 28. Figures are graphical representations between molecules identified in our data in their respective networks. Green nodes indicate upregulated gene expression in older tendon; red nodes indicate downregulated gene expression in older tendon. Intensity of colour is related to higher fold change. The key to the main features in the networks is shown. A number of ingenuity pathway analysis canonical pathways were significantly affected in ageing tendon The significance of the association between the data set and the canonical pathway was measured by using a ratio of the number of molecules from the data set that mapped to the pathway divided by the total number of molecules that map to the canonical pathway. Fisher’s exact test was used to calculate P values.

Functional annotation of up- and down-regulated isoforms

There was a reduction in the DEG isoforms of 32 genes (representing 15% of the data set) relating to the ECM, degradative proteases, cytokines, and growth factors in tendon derived from older donors compared with young donors. In comparison, there was an increase in only two ECM genes (representing 1.3% of the data set) in older donors (data not shown). DAVID identified significant gene ontology (GO) terms in the upregulated and downregulated set of transcript isoforms (Table 9) with only two terms ‘secreted’ and ‘signal’ overlapping between the two groups. Interestingly, other terms are strikingly different between the upregulated and downregulated isoform data sets. Several of the top GO terms identified in downregulated isoforms in old tendon relate to collagen and post-translational modification of collagen (for example, hydroxylation, hydroxylysine, hydroxyproline, and triple helix).
Table 9

Significant gene ontology terms annotated according to DAVID’s SP-PIR-Keywords

Term Count Percentage P value
GO terms identified in downregulated isoforms in old tendon
Signal8845.16.30 × 10−18
Secreted5829.77.04 × 10−15
Extracellular matrix2412.33.25 × 10−14
Glycoprotein8543.62.72 × 10−9
Collagen126.21.62 × 10−7
Trimer73.68.71 × 10−6
Hydroxylation94.62.22 × 10−5
Triple helix73.62.00 × 10−5
Hydroxylysine73.62.00 × 10−5
Disulfide bond5729.21.85 × 10−5
Cell adhesion189.23.57 × 10−5
Hydroxyproline73.64.38 × 10−5
Ehlers-Danlos syndrome52.67.34 × 10−5
Angiogenesis63.10.011506
Metalloprotease84.10.016954
Pyroglutamic acid52.60.021871
GO terms identified in upregulated isoforms in old tendon
Secreted3024.21.48 × 10−4
Plasma75.60.003767
Signal3931.50.006761
Acute phase43.20.052997

Gene ontology (GO) terms were revealed in upregulated and downregulated differentially expressed gene isoforms of tendon ageing. P value represents Benjamini-Hochberg corrected P value. DAVID, Database for Annotation, Visualization and Integrated Discovery.

Significant gene ontology terms annotated according to DAVID’s SP-PIR-Keywords Gene ontology (GO) terms were revealed in upregulated and downregulated differentially expressed gene isoforms of tendon ageing. P value represents Benjamini-Hochberg corrected P value. DAVID, Database for Annotation, Visualization and Integrated Discovery.

Confirmation of DEG by using qRT-PCR measurements of selected genes

To validate the RNA-Seq technology, selected gene expression differences noted in the RNA-Seq analysis were re-measured by using reverse transcription and qRT-PCR. This was performed on the original RNA from all donors used to perform the RNA-Seq experiment (Table 10) and an independent cohort (Additional file 7A). All genes were found to have comparable results with RNA-Seq data; for instance, genes identified as having an increase in expression in older samples in the RNA-Seq experiment also gave increased expression relative to RPS16 following qRT-PCR. Statistical significance was tested by using Student’s t test. Two genes whose expressions were not significantly altered in RNA-Seq results—aggrecan (ACAN) and MMP3—were also unaltered when assessed with qRT-PCR. Gene expression analysis using qRT-PCR of an independent cohort found similar results. Validation of differential isoform expression by using qRT-PCR was in general concordance with RNA-Seq (Additional file 7B). In all cases, the level of expression varied between the two platforms.
Table 10

Real-time polymerase chain reaction analysis of 14 selected genes reveals good correlation with RNA-Seq results

Gene name RNA-Seq results RT-PCR results
Age P value
Differential expression Significant log 2 fold change q value Young Old
EGF Higher in old6.502.06 × 10−8 0.003 ± 0.0020.01 ± 0.0070.04
POU3F4 8.207.79 × 10−16 0.021 ± 0.0150.245 ± 0.1470.02
MYF5 6.300.0030.061 ± 0.0570.25 ± 0.070.01
TGFB3 Lower in old−1.600.03510.717 ± 6.0220.897 ± 0.340.05
MMP16 −2.300.0220.107 ± 0.0870.014 ± 0.020.07
COL3A1 −3.700.00813.696 ± 17.5351.05 ± 0.5940.27
MYH1 −4.600.0030.288 ± 0.1550.14 ± 0.1010.23
IGF1 −2.100.0042.094 ± 1.2340.483 ± 0.2780.04
COL1A1 −3.300.00947.968 ± 32.6768.515 ± 9.1540.42
MMP3 No changeNot significant1.00051.181 ± 362.1199.323 ± 66.100.15
ACAN 1.0001.439 ± 1.430.848 ± 0.990.66
XIST Higher in old6.900.0000.827 ± 0.7127.41 ± 3.4650.03
LINC00957 Lower in old−1.700.0361.044 ± 0.5840.363 ± 0.1740.08
RP11.308 N19.1 −2.4300.0400.02 ± 0.0650.005 ± 0.0010.03

Values for real-time polymerase chain reaction (RT-PCR) are the mean ± standard deviation of relative expression levels normalised to expression of RSP16 (to two decimal places). Statistical significance was tested by using Student’s t test. RT-PCR results are expressed as 2^-DCT. ACAN, aggrecan; COL1A1, collagen type 1 alpha 1; COL3A1, collahen type 3 alpha 1; EGF, epidermal growth factor; IGF1, insulin growth factor 1; LINC00957 long intergenic non-protein coding RNA 957; MMP3, matrix metalloproteinase; MMP16, matrix metalloproteinase 16; MYF5, myogenic factor 5; MYH1, myosin heavy chain 1; POU3F4, POU class 3 homeobox 4; RP11.308 N19.1, Inc-ZNF462-2; TGFB3, transforming growth factor β.

Real-time polymerase chain reaction analysis of 14 selected genes reveals good correlation with RNA-Seq results Values for real-time polymerase chain reaction (RT-PCR) are the mean ± standard deviation of relative expression levels normalised to expression of RSP16 (to two decimal places). Statistical significance was tested by using Student’s t test. RT-PCR results are expressed as 2^-DCT. ACAN, aggrecan; COL1A1, collagen type 1 alpha 1; COL3A1, collahen type 3 alpha 1; EGF, epidermal growth factor; IGF1, insulin growth factor 1; LINC00957 long intergenic non-protein coding RNA 957; MMP3, matrix metalloproteinase; MMP16, matrix metalloproteinase 16; MYF5, myogenic factor 5; MYH1, myosin heavy chain 1; POU3F4, POU class 3 homeobox 4; RP11.308 N19.1, Inc-ZNF462-2; TGFB3, transforming growth factor β.

Discussion

Ageing is recognised as a significant risk factor for tendon injury; however, knowledge of changes to the transcriptome of tendon cells has previously been limited to that gained from quantitative PCR [5,48,49] and microarray studies on tendinopathic human [50,51] and rat tissue [17,52]. In this study, we report for the first time the use of the RNA-Seq technique to undertake deep transcriptome profiling of young and old macroscopically normal human Achilles tendon. Importantly, validation studies using qRT-PCR demonstrated high correlation between methodologies and demonstrated reproducibility using a different donor set. One of the many advantages of RNA-Seq over microarrays is that it enables de novo analysis of transcripts, including novel transcripts. In this study, we were able to identify and quantify protein-coding transcripts, alternatively spliced isoforms, lncRNAs, pseudogenes, and small regulatory RNAs, including small nucleolar RNAs (snoRNA) and an miRNA. The age of the donor accounted for most of the variability in the data, although PCA identified more variability between young donors. We did not have access to detailed medical history and lifestyle factors for the patients in this study, so we are unable to determine whether other factors explain the variability more precisely. Tendon is characterised by a large amount of ECM interspersed around a relatively sparse population of cells. The main component of the matrix is the fibril-forming type I collagen, which composes about 70% of the dry weight of the matrix [53]. Minor collagen types include other fibril-forming collagens, type III and V; fibril-associated collagens, type XII and XIV; and type VI collagen. As expected, these were the main collagen genes we identified in the transcriptome of the Achilles tendon tissue, albeit at relatively low levels. The non-collagenous component of tendon is rich in small leucine-rich proteoglycans (SLRPs), including decorin, biglycan, fibromodulin, and lumican, and the glycoproteins COMP, lubricin, tenomodulin, and tenascin C [54]. Interestingly, the results of this study show that decorin was by far the most highly expressed ECM gene across the samples in comparison with relatively low levels of collagen transcripts. Lumican was the next most highly expressed ECM protein followed by fibromodulin and COMP. These results are in line with our recent proteomics study in which decorin was the second most abundant ECM protein in a guanidine soluble extract of equine flexor tendon [55]. Degradation of the ECM is accomplished by a family of MMPs along with other proteases, and we identified expression of collagenases, stromelysins, gelatinases, and aggrecanases, although in general the levels of expression were low. An exception to this was MMP3, a stromelysin responsible for proteoglycan degradation, which was one of the most abundant transcripts, again supporting the finding of a higher turnover of non-collagenous proteins. Ageing results in changes to the tendon ECM composition, although these are poorly defined at present and the impact on tendon mechanical properties is not clear as some studies report increased stiffness with ageing [56,57] whereas others report a decrease [58,59]. A recent study using equine flexor tendon found that, although the mechanical properties of the gross structure and the component fascicles did not change with age, the inter-fascicular matrix became stiffer. Given this finding, we expected to find differential expression of ECM transcripts, particularly those enriched in the inter-fascicular matrix. The differential gene expression analysis showed no regulation of proteins likely to be enriched in the inter-fascicular matrix or inter-fibrillar proteins [54]. The alpha 1 chain of type I collagen and alpha 1 chain of type III collagen were identified as having reduced expression in the old age group, although this lost statistical significance when measured by qRT-PCR on a larger sample set. For the most part, these data do not support our original hypothesis that tendon ageing results in reduced expression of genes relating to ECM, degradative proteases, cytokines, and growth factors, unlike changes evident in ageing cartilage [24]. Tendon disease, which has a clear association with ageing, has been the focus of several gene expression studies. Generally, findings in these studies are in keeping with the hypothesis of increased matrix turnover, with an imbalance favoring catabolism. For example, various studies have demonstrated increased expression of collagen 1 alpha 1 (COL1A1) [5,48,49] and proteins more typical of cartilage COL2A1, aggrecan, and SOX9 [5,52]. Tendinopathic samples show an upregulation of various MMPs, including MMP23 [5,51], a disintegrin and metalloproteinase 12 (ADAM12) [5,50,51], and downregulation of MMP3. The results of our study are in stark contrast to this with very low expression levels for COL2A1, aggrecan, SOX9, most MMPs (except MMP3), and a significant downregulation of ADAM12 in the old group. Therefore, the results suggest that degeneration is not an inevitable consequence of ageing and that ageing and disease-associated degeneration are distinct processes. In this study, we identified DEG gene sets with ageing related to a dysregulation of cellular function and maintenance, cellular growth and proliferation, cellular cycling, and cellular development. Therefore, these changes suggest that the cellular component of tendon may lose the ability to respond appropriately to mechanical and chemical signals. Other studies have linked cellular senescence, a state of irreversible growth arrest, in a small subset of cells (progenitor cells) in tendon with tendon ageing [60]. A senescence phenotype has been described, although no marker of senescence identified thus far is entirely specific to the senescent state [9]. Most senescent cells express p16(INK)4a, which is not commonly expressed by quiescent or terminally differentiated cells [9]. In this study, p16(INK)4a was expressed at higher levels in the old group, although transcript levels overall were low, which may indicate that a small subpopulation of cells is responsible for the difference. Senescent cells have been shown to contribute to an inflammatory profile, and the term ‘inflamm-aging’ has been coined [13]. Studies have shown upregulation of inflammatory mediators such as cytochrome oxidase 2 (COX2), interleukin 6 (IL-6), and prostaglandin E2 (PGE2) and downregulation of the lipoxin A4 (LXA4) receptor FPR2 (formyl peptide receptor 2)/ALX in human or equine tendinopathic tissue [14,49]. Inflammatory pathways, however, were not recognised in our GO mapping of DEGs between young and old groups in this study. An interesting finding in this study was the differential expression of isoforms and those with reduced expression in the older tendons mapping to ECM, degradative proteases, cytokines, and growth factors. AS is a significant regulatory mechanism in gene expression as it enables versatility at the post-transcriptional level accounting for proteome complexity and may affect up to 92% of human genes [61]. Differences between isoforms of the same protein extend from a complete loss of function, acquiring a new function to subtle modulations, the latter observed in the majority of cases [62]. Few AS events have been reported in tendon to date. Those that have include versican, in which AS may contribute to changes in ECM structure and function in tendinopathies [48]; lubricin, which is location-dependent [63]; and insulin-like growth factor 1, which is mechanical stress-dependent [64]. The isoforms showing the greatest difference between young and old tendon groups in our study (for example, COL1A1, COL3A1, and ADAM12) are recognised as some of the most important proteins for tendon function and the relevance of these isoforms requires further investigation. The results of our study have yielded new information relating to tendon cell phenotype and to the ageing process, identifying transcripts that are not generally recognised as being important in tendon. For example, the gene most highly expressed, disregarding ribosomal proteins, was angiopoetin-like 7 (ANGPTL7). This protein has previously been identified as highly expressed in microarray analysis of human tendinopathic tissue from various tendons [50]. Angiopoetins are involved in angiogenesis [65], inflammation, and glucose [66] and lipid [67] metabolism. In the cornea, ANGPTL7 may function as negative regulator of angiogenesis, contributing to the avascular properties of the tissue [68], whereas in the human ocular trabecular meshwork cells, it has a role in the organization of the ECM [69]. Thus, we suggest that ANGPTL7 may have a role in maintaining the relatively avascular nature of tendon tissue and in the organisation of the ECM. This represents an area for further investigation. One of the limitations of this study is that the samples were taken from patients with malignant disease. We consider that it is very unlikely that this has influenced the results as samples are taken only when the tumour is at a site distant to the tendon and the tendon is macroscopically normal; however, we cannot rule out the possibility that some of the genes showing high expression, such as metastasis associated lung adenocarcinoma transcript 1 (MALAT1), ANGPITL7, and S100A6, are related to the disease state. Another point of interest was the expression of genes associated with muscle cells. For example, we observed a reduced DEG of myosin heavy chain 1 and an increase DEG of myogenic factor 5 (MYF5) and MYOCD. Our previous studies in ageing cartilage also identified DEG of muscle-related genes: myosin heavy chain 2, myosin 3A, and myosin 1B, which were all reduced in cartilage ageing [24]. Samples of Achilles tendon were taken at a region far removed from muscle insertion, and the identification of muscle genes in cartilage and tendon is unlikely to be due to inadvertent inclusion of muscle tissue. Interestingly, there was an increase in the expression of a large set of transcription factors in old compared with young tendon. In Caenorhabditis elegans [70] and a number of tissues, including heart [71] and brain [72], transcription factors have been implicated in ageing. Interestingly, deacetylates Nk2 homeobox 1 (NKX2-1), a transcription factor showing upregulation in old tendon in our study, is involved in neuronal activation in dorsomedial and lateral hypothalamic nuclei, a function thought to contribute to a more ‘youthful’ physiology during ageing [73]. Conversely, an isoform of scleraxis (SCX), a critical transcription factor in tendon development [74], was reduced in old tendon. In addition, the reduced expression in old tendon of EGR2 and AS EGR1, both of which are required for tendon differentiation [75], may affect tendon repair [76]. We identified eight pseudogenes showing upregulation in the old group of tendons. Pseudogenes have similar sequences to their counterpart coding genes, but owing to mutation/deletion or insertion of nucleotides they cannot be transcribed. It is hypothesised that pseudogenes act as post-transcriptional regulators of the corresponding parental gene [74]. In other studies, pseudogenes have been identified as increasing with age, such as pseudogene cyclin D2 in the ovary [76], and recent work has indicated that they may have a role in inflammation [77]. This provides an exciting new frontier to explore in ageing research, and further work is required to determine whether any of the pseudogenes identified in this study have functional significance. Our study is the first to profile lncRNAs in tendon. LncRNAs are a large and functionally heterogeneous class of RNAs with a length of more than 200 nucleotides. They have been shown to regulate mRNA transcription, splicing, stability, translation, and epigenetic modification, providing a complex spectrum of gene regulatory functions [78], and a number of studies have identified roles for lncRNAs in ageing [22,79]. In this study, lncRNAs were shown to be DEGs in ageing tendon, and 34 showed upregulation in old tendon. In musculoskeletal disease, relatively little work interrogates the role lncRNAs in tissue physiology and disease except for a few studies in cartilage/OA [80-82] and muscle (reviewed [83]) and an osteosarcoma study [84]. The lncRNA transcriptome signatures in ageing tendon provide an interesting set of genes for further studies to determine their role in tendon ageing and disease.

Conclusions

Our study is the first to interrogate tendon by using RNA-Seq. We demonstrate dynamic alterations in RNA with age, at numerous genomic levels, which indicate changes in the regulation of transcriptional networks. Further extensive follow-up analysis of modulator genes, splice variants, and non-coding RNAs found in this study may be useful in understanding tendon ageing.
  72 in total

1.  Microarray analysis of the tendinopathic rat supraspinatus tendon: glutamate signaling and its potential role in tendon degeneration.

Authors:  T J Molloy; M W Kemp; Y Wang; G A C Murrell
Journal:  J Appl Physiol (1985)       Date:  2006-08-03

2.  Age-related changes to the molecular and cellular components of equine flexor tendons.

Authors:  H L Birch; J V Bailey; A J Bailey; A E Goodship
Journal:  Equine Vet J       Date:  1999-09       Impact factor: 2.888

3.  BC200 RNA in normal human neocortex, non-Alzheimer dementia (NAD), and senile dementia of the Alzheimer type (AD).

Authors:  W J Lukiw; P Handley; L Wong; D R Crapper McLachlan
Journal:  Neurochem Res       Date:  1992-06       Impact factor: 3.996

4.  Gene expression profiling of the aging mouse cardiac myocytes.

Authors:  Natalya Bodyak; Peter M Kang; Makoto Hiromura; Indra Sulijoadikusumo; Nobuo Horikoshi; Konstantin Khrapko; Anny Usheva
Journal:  Nucleic Acids Res       Date:  2002-09-01       Impact factor: 16.971

Review 5.  The role of the non-collagenous matrix in tendon function.

Authors:  Chavaunne T Thorpe; Helen L Birch; Peter D Clegg; Hazel R C Screen
Journal:  Int J Exp Pathol       Date:  2013-05-30       Impact factor: 1.925

6.  Hypothalamic Sirt1 in aging.

Authors:  Akiko Satoh; Shin-ichiro Imai
Journal:  Aging (Albany NY)       Date:  2014-01       Impact factor: 5.682

7.  Long noncoding RNAs, emerging players in muscle differentiation and disease.

Authors:  Maria Victoria Neguembor; Mathivanan Jothi; Davide Gabellini
Journal:  Skelet Muscle       Date:  2014-03-31       Impact factor: 4.912

8.  edgeR: a Bioconductor package for differential expression analysis of digital gene expression data.

Authors:  Mark D Robinson; Davis J McCarthy; Gordon K Smyth
Journal:  Bioinformatics       Date:  2009-11-11       Impact factor: 6.937

Review 9.  Transcriptional (dys)regulation and aging in Caenorhabditis elegans.

Authors:  Zachary Pincus; Frank J Slack
Journal:  Genome Biol       Date:  2008-09-16       Impact factor: 13.583

10.  Accurate normalization of real-time quantitative RT-PCR data by geometric averaging of multiple internal control genes.

Authors:  Jo Vandesompele; Katleen De Preter; Filip Pattyn; Bruce Poppe; Nadine Van Roy; Anne De Paepe; Frank Speleman
Journal:  Genome Biol       Date:  2002-06-18       Impact factor: 13.583

View more
  31 in total

Review 1.  Transcriptional Signatures of Aging.

Authors:  R Stegeman; V M Weake
Journal:  J Mol Biol       Date:  2017-07-03       Impact factor: 5.469

2.  Aging leads to inferior Achilles tendon mechanics and altered ankle function in rodents.

Authors:  A M Pardes; Z M Beach; H Raja; A B Rodriguez; B R Freedman; L J Soslowsky
Journal:  J Biomech       Date:  2017-06-20       Impact factor: 2.712

Review 3.  Whole transcriptome analysis with sequencing: methods, challenges and potential solutions.

Authors:  Zhihua Jiang; Xiang Zhou; Rui Li; Jennifer J Michal; Shuwen Zhang; Michael V Dodson; Zhiwu Zhang; Richard M Harland
Journal:  Cell Mol Life Sci       Date:  2015-05-28       Impact factor: 9.261

4.  Analysis of Masticatory Muscle Tendon-aponeurosis Hyperplasia by Using Next-generation Sequencing.

Authors:  Megumi Yumoto; Yosuke Mizuno; Yuta Isozaki; K O Ito; Tetsuya Yoda; Tsuyoshi Sato
Journal:  In Vivo       Date:  2022 Mar-Apr       Impact factor: 2.155

Review 5.  Tendon Extracellular Matrix Assembly, Maintenance and Dysregulation Throughout Life.

Authors:  Seyed Mohammad Siadat; Danae E Zamboulis; Chavaunne T Thorpe; Jeffrey W Ruberti; Brianne K Connizzo
Journal:  Adv Exp Med Biol       Date:  2021       Impact factor: 2.622

Review 6.  The role of MicroRNAs in tendon injury, repair, and related tissue engineering.

Authors:  Qian Liu; Yaxi Zhu; Weihong Zhu; Ge Zhang; Yunzhi Peter Yang; Chunfeng Zhao
Journal:  Biomaterials       Date:  2021-08-26       Impact factor: 15.304

Review 7.  Research perspectives-Pipelines to human tendon transcriptomics.

Authors:  Lorenzo Ramos-Mucci; Paula Sarmiento; Dianne Little; Sarah Snelling
Journal:  J Orthop Res       Date:  2022-03-16       Impact factor: 3.102

8.  Decoding the Regulatory Landscape of Ageing in Musculoskeletal Engineered Tissues Using Genome-Wide DNA Methylation and RNASeq.

Authors:  Mandy Jayne Peffers; Katarzyna Goljanek-Whysall; John Collins; Yongxiang Fang; Michael Rushton; John Loughlin; Carole Proctor; Peter David Clegg
Journal:  PLoS One       Date:  2016-08-17       Impact factor: 3.240

9.  Anatomical heterogeneity of tendon: Fascicular and interfascicular tendon compartments have distinct proteomic composition.

Authors:  Chavaunne T Thorpe; Mandy J Peffers; Deborah Simpson; Elizabeth Halliwell; Hazel R C Screen; Peter D Clegg
Journal:  Sci Rep       Date:  2016-02-04       Impact factor: 4.379

10.  Minimally invasive, endoscopic Achilles tendon reconstruction using semitendinosus and gracilis tendons with Endobutton stabilization.

Authors:  Tomasz Piontek; Paweł Bąkowski; Kinga Ciemniewska-Gorzela; Monika Grygorowicz
Journal:  BMC Musculoskelet Disord       Date:  2016-06-03       Impact factor: 2.362

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.