Literature DB >> 24603890

Long non-coding RNA expression profiles in hereditary haemorrhagic telangiectasia.

Pernille M Tørring1, Martin Jakob Larsen2, Anette D Kjeldsen3, Lilian Bomme Ousager2, Qihua Tan4, Klaus Brusgaard2.   

Abstract

Hereditary Haemorrhagic Telangiectasia (HHT) is an autosomal dominantly inherited vascular disease characterized by the presence of mucocutaneous telangiectasia and arteriovenous malformations in visceral organs. HHT is predominantly caused by mutations in ENG and ACVRL1, which both belong to the TGF-β signalling pathway. The exact mechanism of how haploinsufficiency of ENG and ACVRL1 leads to HHT manifestations remains to be identified. As long non-coding RNAs (lncRNAs) are increasingly recognized as key regulators of gene expression and constitute a sizable fraction of the human transcriptome, we wanted to assess whether lncRNAs play a role in the molecular pathogenesis of HHT manifestations. By microarray technology, we profiled lncRNA transcripts from HHT nasal telangiectasial and non-telangiectasial tissue using a paired design. The microarray probes were annotated using the GENCODE v.16 dataset, identifying 4,810 probes mapping to 2,811 lncRNAs. Comparing HHT telangiectasial tissue with HHT non-telangiectasial tissue, we identified 42 lncRNAs that are differentially expressed (q<0.001). Using GREAT, a tool that assumes cis-regulation, we showed that differently expressed lncRNAs are enriched for genomic loci involved in key pathways concerning HHT. Our study identified lncRNAs that are aberrantly expressed in HHT telangiectasia and indicates that lncRNAs may contribute to regulate protein-coding loci in HHT. These results suggest that the lncRNA component of the transcriptome deserves more attention in HHT. A deeper understanding of lncRNAs and their role in telangiectasia formation possesses potential for discovering therapeutic targets in HHT.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 24603890      PMCID: PMC3946172          DOI: 10.1371/journal.pone.0090272

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Hereditary Haemorrhagic Telangiectasia (HHT) is an autosomal dominantly inherited vascular disease characterized by the presence of mucocutaneous telangiectasia and arteriovenous malformations (AVMs) in visceral organs, primarily the lungs, liver and brain. The most common clinical manifestation is spontaneous and recurrent epistaxis [1], [2], originating from nasal telangiectasias, which can be difficult to prevent and can lead to severe anaemia. Moreover, pulmonary arteriovenous malformations (PAVMs) are seen in approximately one third of patients and are potentially lethal, due to haemorrhage or shunting of blood through these abnormal blood vessels, which can cause paradoxical embolisms and cerebral abscesses [2], [3]. The clinical manifestations of HHT are extremely variable, even between family members, and age-dependent penetrance is present. The reported prevalence in Denmark is around 1 in 6,500 [4]. Around 85% of clinically diagnosed HHT patients [5]–[7] carry a mutation in either ENG (HHT1) or ACVRL1 (HHT2). Patients with mutations in ENG or ACVRL1 are clinically similar, as all reported manifestations are known to occur in both. However, later onset and lower penetrance, less cerebral and pulmonary AVMs, but more liver involvement and a risk of developing pulmonary arterial hypertension, are observed in patients with ACVRL1 mutations. Currently there is no cure for the disease; only symptomatic treatment is possible. HHT manifestations are thought to result from an imbalance in the process of angiogenesis. Angiogenesis is the development of new capillary blood vessels from pre-existing vessels and is controlled by different cytokines such as vascular endothelial growth factor (VEGF) and transforming growth factor beta 1 (TGFβ1). ENG and ACVRL1 encode receptor proteins: Endoglin and ALK1 (Activin A receptor type II-like 1) (SKR3) respectively, which are components of the TGF-β signalling pathway. The TGF-β signalling pathway plays a complex and important role in development and homeostasis of many organs, including the vascular system. In the latter, it is involved in cell proliferation, migration, extracellular matrix formation, vascular smooth muscle cell differentiation and vascular tone. As Endoglin and ALK1 proteins are predominantly expressed in endothelial cells, these are primary cellular targets of the disease. Thus, HHT manifestations are caused by a disturbance in the TGF-β signalling pathway [8]–[10]. The exact mechanism of how haploinsufficiency of ENG and ACVRL1 leads to HHT manifestations remains to be identified. Our knowledge of the non protein-coding part of the human transcriptome is expanding rapidly, and recently attention has shifted towards the most numerous but still poorly understood group—the long non-coding RNAs (lncRNAs). Long non-coding RNAs (lncRNAs) are defined as eukaryote RNAs longer than 200 nucleotides in length, without protein coding capacity. This arbitrary limit of 200 nucleotides distinguishes lncRNAs from small regulatory RNAs, such as microRNA and short interfering RNA (siRNA). Studies indicate that lncRNAs are generated through pathways similar to that of protein-coding genes, with similar histone-modification profiles, splicing signals, and exon/intron lengths [11]. LncRNAs have been shown to be involved in major mechanisms of transcriptional and post-transcriptional regulation, such as targeting transcription factors, initiating chromatin remodelling, directing methylation complexes and blocking nearby transcription [12]. LncRNAs appear to control expression of protein-coding genes through both cis- and trans-acting pathways [11]. Moreover, lncRNAs tend to be expressed at lower levels than protein-coding genes and display more tissue-specific expression patterns [11]. Of the many identified lncRNAs, only few have been characterized functionally [13]. A complete catalogue of lncRNAs is not yet available; nonetheless, in 2012 the GENCODE consortium published the most complete set of human lncRNAs to date [11], comprising 9277 manually annotated genes producing 14,880 transcripts. GENCODE annotate lncRNAs according to their localization regarding the nearest known protein transcripts: exonic, intronic, overlapping or intergenic [11]. To identify potential HHT therapeutic targets, further knowledge on how a disturbance of the TGF-β signalling pathway leads to HHT manifestations is needed. As lncRNAs are increasingly recognized as key regulators of gene expression we assessed the lncRNA expression in HHT tissue. Thus, the purpose of this study was to investigate the possible involvement of lncRNAs in the molecular pathogenesis of the telangiectasia formation in HHT patients. This was done by expression profiling of lncRNAs in 40 telangiectasial and 40 non-telangiectasial samples from HHT patients using microarray technology.

Materials and Methods

Ethics statement

This study was approved by the ethics committee of Southern Denmark (S-20090131), and the patients provided verbal and written informed consent to participate in this study in accordance with the Declaration of Helsinki.

HHT patients

Nasal mucosal biopsy specimens of telangiectasial and non-telangiectasial tissue in pairs from HHT1 (n = 19) and HHT2 (n = 21) patients were used in this study. The nasal mucosa was anaesthetized with gauze impregnated with Lidocain phenylephrinehydrocloride. No infiltration anaesthesia was used. The samples were collected with a Weil Nasal Forceps by an experienced rhinologist and contained either macroscopicly visible telangiectasia or natural mucosa. All patients showed signs of HHT according to the Curaçao Criteria [14] and carried the familial pathogenic mutation in either ENG or ACVRL1. The 80 paired samples represented 14 different HHT1 families and 14 different HHT2 families. The familial mutations were of different types and distributed throughout the two genes (Table 1) [15]. The patients were aged 28–56 years (median age 40) and included 21 women and 19 men. Patients were diagnosed and treated in the national HHT Centre of Denmark in Odense University Hospital.
Table 1

Genotypes and phenotypes of the HHT patients who participated in this study.

GeneFamily No.Nucleotid changeAmino acid changeAgeSexPhenotype
ENG 9c.360C>Ap.Tyr120*49ME,T,P,G,F
ENG 9c.360C>Ap.Tyr120*53FE,T,F
ENG 13c.360C>Ap.Tyr120*54FE,T,P,G,F
ENG 13c.360C>Ap.Tyr120*30FE,T,P,F
ENG 15c.361-2A>Gp.?47FE,T,G,F
ENG 15c.361-2A>Gp.?38ME,T,P,F
ENG 15c.361-2A>Gp.?40FE,T,F
ENG 24c.360C>Ap.Tyr120*40FE,T,F
ENG 24c.360C>Ap.Tyr120*46FE,T,P,G,F
ENG 37c.360C>Ap.Tyr120*51ME,T,P,F
ENG 38c.360C>Ap.Tyr120*37FE,T,F
ENG 49c.821C>Tp.Thr274Ile47FE,T,F
ENG 61c.817-3T>Gp.?49FE,T,F
ENG 65c.1166_1168delTCTp.Phe389del35FT,P,F
ENG 87c.808C>Tp.Gln270*38MT,P,F
ENG 92c.219+1G>Tp.?44ME,T,P
ENG 93c.1550_1551delTGp.Val517Glufs*1037ME,T,F
ENG 94p.1582_1583delp.Pro528Alafs*3830ME,T,F
ENG 96c.277C>Tp.Arg93*46ME,T,P,F
ACVRL1 8c.1120C>Tp.Arg374Trp55ME,T,F
ACVRL1 8c.1120C>Tp.Arg374Trp28ME,T,F
ACVRL1 18c.1468C>Tp.Gln490*43ME,T,F
ACVRL1 18c.1468C>Tp.Gln490*44FE,T,P,F
ACVRL1 18c.1468C>Tp.Gln490*41ME,T,F
ACVRL1 20c.430C>Tp.Arg144*47ME,T
ACVRL1 42c.1135G>Ap.Glu379Lys32FE,T,F
ACVRL1 43c.626-3C>Gp.?38FE,T,F
ACVRL1 43c.626-3C>Gp.?36FT,F
ACVRL1 46c.1013T>Ap.Val338Asp39ME,T,F
ACVRL1 47c.1120C>Tp.Arg374Trp40FE,T,F
ACVRL1 56c.1-?_1048+?delp.0?44FE,T,F
ACVRL1 57c.143G>Ap.Gly48Glu34ME,T,F
ACVRL1 57c.143G>Ap.Gly48Glu32FT,F
ACVRL1 67c.266G>Ap.Cys89Tyr24FE,T,F
ACVRL1 67c.266G>Ap.Cys89Tyr31ME,T,F
ACVRL1 69c.1232G>Ap.Arg411Gln39ME,T,F
ACVRL1 71c.143G>Ap.Gly48Glu44FE,T,F
ACVRL1 82c.139_140insCGp.Arg47Profs*848ME,T,F
ACVRL1 88c.155delp.Thr52Lysfs*256FE,T,G,F
ACVRL1 88c.155delp.Thr52Lysfs*250ME,T,F

Abbreviations: AVM, arteriovenous malformation; E, epistaxis; T, telangiectasia; P, pulmonary AVM; C, cerebral AVM; G, gastrointestinal telangiectasia/gastrointestinal bleeding; H, Hepatic AVM; F, family history. M, male; F, female.

Abbreviations: AVM, arteriovenous malformation; E, epistaxis; T, telangiectasia; P, pulmonary AVM; C, cerebral AVM; G, gastrointestinal telangiectasia/gastrointestinal bleeding; H, Hepatic AVM; F, family history. M, male; F, female.

RNA isolation and quality

Samples were collected in RNAlater Solution (Life Technologies) and stored at −20°C. Total RNA was isolated using the RNeasy Micro Kit according to the RNeasy® Micro Handbook (Qiagen). The quantity of RNA was assessed with the NanoDrop spectrophotometer ND-8000 (NanoDrop Technologies), and RNA quality was assessed by the Agilent 2100 Bioanalyzer (Agilent Technologies).

Microarray labelling, hybridization, and scanning

Sample labelling and array hybridization were performed according to the Two-Color Microarray-Based Gene Expression Analysis—Low Input Quick Amp Labeling—protocol (Agilent Technologies) using the SurePrint G3 Human Gene Expression 8×60 microarray format (Agilent Technologies). Samples were labelled with Cy5 and Universal Human Reference RNA (Stratagene) was labelled with Cy3 and used as a common reference on all arrays.

Data pre-processing and annotation of lncRNAs

Agilent Feature Extraction software v. 10.7.3.1 (Agilent technologies) was used to analyse acquired array images. Data were then within-array normalized by Loess normalization method and between-array normalized by Quantile normalization. The normalized values were used to calculate log2 transformed Cy5/Cy3 ratios. Replicate probes were collapsed calculating the median. Missing expression values were imputed by k-nearest neighbours averaging (k = 10). Date pre-processing was performed using the R (R Core team 2012) package limma [16]. Microarray data were deposited to the Gene Expression Omnibus (GSE53515). All the 42,164 probes of the Agilent SurePrint G3 array were re-annotated using GENCODE v.16 gene annotation database (www.gencodegenes.org) [11]. The genomic coordinates of the probes in the Agilent array were matched to the genomic coordinates of the lncRNAs from the GENCODE v.16, to identify the probes covering lncRNAs. LncRNAs were included if 55 base pairs overlapped with the 60-mer array probes.

Data analysis

Data analyses were performed using the Qlucore Omics Explorer 2.3 software (Qlucore). Differentially expressed lncRNAs comparing telangiectasial and non-telangiectasial tissue were ranked according to statistical significance determined by two-group comparison (paired t-test). This was done for the groups HHT1 and HHT2 seperately and subsequently for the total group of HHT. Multiple testing was adjusted for by the Benjamini-Hochberg method. Differentially expressed lncRNAs were chosen for further evaluation (q<0.15). Principal component analysis (PCA) and hierarchical clustering were performed in Qlucore Omics Explorer 2.3 to examine whether telangiectasial and non-telangiectasial samples could be separated.

Genomic Regions Enrichment of Annotation

To assess the potential cis-regulatory effect of the differentially expressed lncRNAs in HHT, we used the Genomic Regions Enrichment of Annotations Tool (GREAT) [17]. This program has genomic coordinates as inputs and outputs nearby genes and their ontologies. It analyses the functional significance of cis-regulatory regions identified by localized measurements of DNA binding events across an entire genome.

Correlation analysis

To identify potential trans-regulated transcripts, we assessed the correlation between the expression levels of the 10 highest ranking statistically significantly differentially expressed lncRNAs (HHT total) and every transcript present on the Agilent array by calculating Pearson's correlation coefficient. The transcripts were ranked according to the corrected p-values for the correlation coefficients; this was performed for each lncRNA of interest. Correction for multiple testing was performed using the Bonferoni correction. Manhattan plots were drawn using the Integrative Genome Viewer (IGV) version 2.3.18 [18], based on each of these correlation-sets for individual statistically significantly differentially expressed lncRNAs.

Results

Annotation of lncRNAs

All the 42,164 probes of the Agilent SurePrint G3 array were re-annotated using GENCODE v.16 gene annotation database (www.gencodegenes.org)[11]. We identified 5,069 probes mapping to 2,939 lncRNAs. These 5,069 probes included both intronic and exonic lncRNA probes. Of these 5,069 probes, 404 mapped to overlapping lncRNAs and mRNAs – of these, 259 probes were excluded as they mapped to overlapping mRNA exons and lncRNA introns – only probes mapping to overlapping mRNA exons and lncRNA exons were included. Further analysis was performed using the 4,810 remaining lncRNA probes, mapping to 2,811 lncRNAs, covering almost 31% of the GENCODE lncRNA dataset (Figure S1).

Identification of differentially expressed lncRNAs in HHT telangiectasia

We detected 617 statistically significantly differentially expressed lncRNAs (q<0.05, paired t-rest) when comparing telangiectasial and non-telangiectasial samples in the total HHT group (80 paired samples), of which 42 were highly statistically significant (q<0.001). Of the 42 differentially expressed lncRNAs (listed in Table 2), 16 were upregulated and 26 downregulated. None of these 42 lncRNAs are characterized functionally.
Table 2

Top 42 long non-coding RNAs (q<0.001) differentially expressed in HHT telangiectasia.

Ensembl gene IDgencode.v16_GenomicCoordinatesq-value (FDR)Fold changeGene biotypeHGNC symbol
ENSG00000249772.1chr5:80409204-80410671_R1.83E-060.86antisense-
ENSG00000230544.1chr13:114586640-114588308_F1.50E-050.88lincRNALINC00453
ENSG00000215231.3chr5:5034472-5070117_F1.50E-050.88lincRNA-
ENSG00000237548.1chr9:124646915-124725998_R1.50E-050.86processed_transcriptTTLL11-IT1
ENSG00000263753.1chr18:5232875-5246507_F0.000110.90lincRNALINC00667
ENSG00000231133.1chr20:61727150-61733631_R0.000140.90processed_transcriptHAR1B
ENSG00000256218.1chr12:5475214-5476940_R0.000150.90lincRNA-
ENSG00000241269.1, ENSG00000188365.3chr7:5459458-5462753_F0.000220.85antisense-
ENSG00000226496.1chr21:42513427-42520060_R0.000231.14antisenseLINC00323
ENSG00000248176.1chr4:29119930-29204392_F0.000390.89lincRNA-
ENSG00000259484.1chr15:57151866-57210697_R0.000390.86processed_transcript-
ENSG00000206129.3chr18:53670844-53858493_R0.000421.11lincRNA-
ENSG00000235285.1chr13:44720606-44732358_R0.000420.90sense_intronicSMIM2-IT1
ENSG00000147753.5chrY:6317509-6325947_F0.000421.13lincRNATTTY7
ENSG00000240453.1chr1:745489-753092_R0.000420.84processed_transcript-
ENSG00000259150.1chr15:26360960-26378184_F0.000421.09lincRNALINC00929
ENSG00000255471.1* chr11:86603256-86636079_R0.000450.81antisense-
ENSG00000237036.3chr10:31596646-31608810_R0.000561.11antisense-
ENSG00000233154.1chr1:116966346-117021464_R0.000650.91lincRNA-
ENSG00000197251.3chr6:33553883-33561115_R0.000681.10antisenseLINC00336
ENSG00000248176.1chr4:29119930-29204392_F0.000730.92lincRNA-
ENSG00000254154.3chr1:177897923-178007142_R0.000730.88processed_transcript-
ENSG00000215374.4chr8:7159133-7212876_R0.000730.90processed_transcript-
ENSG00000196096.3chr2:214141276-214148929_R0.000730.92processed_transcript-
ENSG00000259758.1chr8:141530255-141539600_R0.000731.13lincRNACASC7
ENSG00000264772.1, ENSG00000265500.1chr17:7466604-7485342_F0.000741.26processed_transcript-
ENSG00000229563.1chrX:45364633-45489447_F0.000761.10processed_transcript-
ENSG00000203325.3chr1:32517892-32539075_R0.000760.88antisense-
ENSG00000232956.3* chr7:45022622-45026560_R0.000761.20lincRNA-
ENSG00000232021.2* chr4:109088681-109177992_F0.000760.90processed_transcriptLEF1-AS1
ENSG00000250195.1chr4:139741108-139933800_R0.000761.21antisense-
ENSG00000250608.1chr3:131043936-131100319_R0.000761.11processed_transcript-
ENSG00000266952.1chr18:61880317-61927290_R0.000780.91lincRNA-
ENSG00000259334.1chr14:24391456-24403777_R0.000791.11lincRNALINC00596
ENSG00000249364.1chr5:66675206-67101066_F0.000790.91processed_transcript-
ENSG00000231185.2* chr5:141704858-142051566_F0.000860.93processed_transcript-
ENSG00000232046.1chr2:66801162-66957289_F0.000860.91processed_transcript-
ENSG00000230133.1chr20:24180403-24205224_F0.000860.90lincRNA-
ENSG00000215808.2chr1:238643684-238649323_R0.000901.11processed_transcript-
ENSG00000135253.9chr7:128502505-128550773_R0.000941.08processed_transcriptKCP
ENSG00000233251.3chr2:56400669-56412905_R0.000941.09processed_transcript-
ENSG00000245910.3chr8:67833919-67838633_R0.00100.85processed_transcriptSNHG6

*Are part of the lncRNAs identified by GREAT analysis.

*Are part of the lncRNAs identified by GREAT analysis. Analysis of the HHT1 samples resulted in 70 differentially expressed lncRNAs (q<0.05, paired t-test) when comparing telangiectasial and non-telangiectasial samples, and in HHT2 analysis resulted in 114 differentially expressed lncRNAs (q<0.05, paired t-test). The number of common lncRNAs (q<0.05) in the three groups is rather small as shown in Figure 1.
Figure 1

Venn diagram.

The number of statistically significantly differentially expressed long non-coding RNAs in the three groups (q<0.05). The numbers in the overlapped areas indicate the lncRNAs that are differentially expressed in the multiple groups.

Venn diagram.

The number of statistically significantly differentially expressed long non-coding RNAs in the three groups (q<0.05). The numbers in the overlapped areas indicate the lncRNAs that are differentially expressed in the multiple groups. The complete results of the paired t-tests in HHT and the subgroups HHT1 and HHT2 are provided in Table S1, S2, and S3.

Principal component analysis (PCA) and hierarchical clustering

PCA applied to the differentially expressed lncRNAs (q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples, regarding HHT1 (Figure 2a), HHT2 (Figure 2b) and HHT (Figure 2c). No statistically significant difference in expression values between HHT1 telangiectasial tissue and HHT2 telangiectasial tissue was observed. Even so, the data suggest that there is a minor difference as the comparison and grouping of the subgroups introduce more variation in the PCA plots (Figures 2a and 2b compared with Figure 2c).
Figure 2

Principal component analysis (PCA) of the three groups.

a. PCA applied to the differentially expressed long non-coding RNAs (lncRNAs)(paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples, regarding HHT1. b. PCA applied to the differentially expressed lncRNAs (paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples, regarding HHT2. c. PCA applied to the differentially expressed lncRNAs (paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples in the total group of HHT. There is no clustering into subgroups.

Principal component analysis (PCA) of the three groups.

a. PCA applied to the differentially expressed long non-coding RNAs (lncRNAs)(paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples, regarding HHT1. b. PCA applied to the differentially expressed lncRNAs (paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples, regarding HHT2. c. PCA applied to the differentially expressed lncRNAs (paired t-test, q<0.15) revealed a clear separation of the telangiectasial and non-telangiectasial samples in the total group of HHT. There is no clustering into subgroups. Hierarchical clustering: amongst the differentially expressed lncRNAs (q<0.001) in the HHT group both up- and down-regulation was observed (Figure 3). Hierarchical clustering for HHT1 and HHT2 is shown in Figure S6 and S7.
Figure 3

Hierarchical clustering HHT.

Hierarchical clustering for the 80 HHT paired samples comparing the expression in telangiectasial (yellow) and non-telangiectasial (pink) tissue, using the top variant lncRNAs (n = 42)(q<0.001). In the heat map rows correspond to long non-coding RNAs (lncRNAs) and columns to samples. Red indicates elevated expression, green indicates reduced expression. In the 42 differentially expressed lncRNAs; 16 are up-regulated and 26 are down-regulated.

Hierarchical clustering HHT.

Hierarchical clustering for the 80 HHT paired samples comparing the expression in telangiectasial (yellow) and non-telangiectasial (pink) tissue, using the top variant lncRNAs (n = 42)(q<0.001). In the heat map rows correspond to long non-coding RNAs (lncRNAs) and columns to samples. Red indicates elevated expression, green indicates reduced expression. In the 42 differentially expressed lncRNAs; 16 are up-regulated and 26 are down-regulated.

HHT1

The differentially expressed lncRNAs with q<0.15 (n = 617) were loaded into GREAT. Of these, 31% map within 50 kilobases (kb) of an annotated gene, and 89% map within 500 kb of a gene (Figure S2). Gene ontology [19] (www.geneontology.org) analysis, performed in GREAT, of the 617 neighbouring genomic regions (826 genes) is seen in Table 3. The ontologies of those loci concerned six gene ontology (GO) terms (binomial false discovery rate (FDR) q-value<0.001), of which three were considered key terms as they are central to HHT pathogenesis: blood vessel morphogenesis (GO:0001568, 3747 gene products), blood vessel development (GO:0001568, 3747 gene products) and vasculogenesis (Synonym: Vascular morphogenesis, GO:0001570, 557 gene products). Table 4 (and Tables S4–S6 in File S1) lists the genes involved in the three GO terms and their relative positions. In the GO terms tree (Figure 4) the close connection and overlap of the three mentioned GO terms can be seen.
Table 3

Gene ontology (GO) analysis of 617 genomic regions (826 genes) located nearby differentially expressed long non-coding RNAs in the HHT1 group.

OntologyTerm nameTerm IDBinom FDR Q-valueBinom fold enrichmentHyper FDR Q-valueHyper fold enrichment
GO Biological Processblood vessel morphogenesisGO:00485149.21e-72.294.59602e-52.72
GO Biological Processblood vessel developmentGO:00015686.81e-62.075.89303e-52.52
GO Biological Processimmune system developmentGO:00025207.30e-52.065.60561e-32.06
GO Biological Processhemopoietic or lymphoid organ developmentGO:00485341.74e-42.041.14257e-22.04
GO Biological ProcesshemopoiesisGO:00300974.03e-42.061.65733e-22.06
GO Biological ProcessvasculogenesisGO:00015706.00e-43.435.61511e-44.90
Table 4

Overlap between gene-genomic regions association tables for the different GO terms.

GO Blood vessel morphogenesis (HHT1)GO Blood vessel development (HHT1)GO Vasculogenesis (HHT1)GO Vasculogenesis (HHT2)GO Vasculogenesis (HHT)GO Blood vessel morphogenesis (HHT)Gene located on ChromosomeFoldchange (ENSG)
AMOT AMOT AMOT 21.09
ANGPT2 81.13
APOB APOB APOB 141.06–1.07
BMP4 BMP4 BMP4 140.92–0.93
CAV1 CAV1 CAV1 CAV1 CAV1 CAV1 7 1.01–1.18
CCM2 CCM2 CCM2 CCM2 CCM2 CCM2 7 1.16–1.25
CDH5 160.89
CITED2 CITED2 CITED2 60.92–0.93
CXCL12 CXCL12 CXCL12 100.89–0.92
CXCR4 CXCR4 21.10
CYP1B1 CYP1B1 CYP1B1 20.89–0.90
FGF1 FGF1 FGF1 50.09–1.16
FN1 FN1 FN1 20.88–1.22
FOXF1 FOXF1 FOXF1 FOXF1 FOXF1 FOXF1 16 1.06–1.09
FOXO1 131.11
FZD4 FZD4 FZD4 FZD4 FZD4 FZD4 11 0.65–1.22
GREM1 GREM1 151.09
HDAC7 HDAC7 HDAC7 HDAC7 HDAC7 120.91–0.92
HES1 HES1 HES1 30.91–1.14
ITGB1 101.09
ITGA5 ITGA5 121.19–1.20
JAG1 JAG1 201.27
JAM3 JAM3 110.95
LEF1 LEF1 LEF1 40.87–0.90
MEF2C 51.09–1.11
MEIS1 MEIS1 MEIS1 20.57–1.14
NOX5 NOX5 150.93
NR2F2 150.92
NRP1 101.09
NRXN1 NRXN1 NRXN1 20.94–0.95
PRKX PRKX X1.08
PROK2 PROK2 PROK2 31.08–1.20
PRSS23 PRSS23 PRSS23 PRSS23 PRSS23 PRSS23 11 0.65–0.93
RASA1 RASA1 RASA1 RASA1 RASA1 RASA1 5 0.92–1.16
SHH SHH SHH 70.88–1.09
SOX4 SOX4 SOX4 60.92–1.12
STAB2 STAB2 STAB2 120.92–0.94
T T T T 61.16–1.28
TGFB2 TGFB2 TGFB2 11.08–1.13
THBS1 THBS1 THBS1 150.93–1.17
TIPARP TIPARP TIPARP TIPARP TIPARP TIPARP 3 0.84–1.08
VEGFA VEGFA VEGFA 61.07–1.24
WARS WARS 140.92
WNT2 70.93
WNT7A WNT7A WNT7A WNT7A WNT7A 30.91–1.10
WNT7B 220.90
ZC3H12A ZC3H12A 10.92
ZFPM2 ZFPM2 ZFPM2 ZFPM2 ZFPM2 ZFPM2 81.13–1.23
ZMIZ1 ZMIZ1 ZMIZ1 ZMIZ1 ZMIZ1 ZMIZ1 10 0.01–1.11

q-values can be seen of tables S4–S9. Genes that are present for all six GO–terms are highlighted in bold.

Figure 4

Gene Ontology terms tree.

Graphical view of the gene ontology (GO) terms tree showing part of the GO terms in the ontology biological process and their connections. Our three GO terms; blood vessel development, blood vessel development, and vasculogenesis (in red boxes), are the more narrow and specific terms in the tree and overlap by a number of genes.

Gene Ontology terms tree.

Graphical view of the gene ontology (GO) terms tree showing part of the GO terms in the ontology biological process and their connections. Our three GO terms; blood vessel development, blood vessel development, and vasculogenesis (in red boxes), are the more narrow and specific terms in the tree and overlap by a number of genes. q-values can be seen of tables S4–S9. Genes that are present for all six GO–terms are highlighted in bold.

HHT2

The differentially expressed lncRNAs with q<0.15 (n = 640) were loaded into GREAT. Of these, 31% map within 50 kilobases (kb) of an annotated gene, and 89% map within 500 kb of a gene (Figure S3). Gene ontology analysis of the 640 neighbouring genomic regions (887 genes) is seen in Table 5. The ontologies of those loci concerned five GO terms (binomial FDR q-value<0.001), of which especially vasculogenesis was interesting. Table 4 (and Table S7 in File S1) lists the genes involved in this GO term and their relative positions.
Table 5

Gene ontology (GO) analysis of 640 genomic regions (887 genes) located nearby differentially expressed long non-coding RNAs in the HHT2 group.

OntologyTerm nameTerm IDBinom FDR Q-valueBinom fold enrichmentHyper FDR Q-valueHyper fold enrichment
GO Biological ProcessvasculogenesisGO:00015701.90e-53.806.20e-54.91
GO Biological Processpositive regulation of neuron differentiationGO:00456664.95e-53.134.51e-44.41
GO Biological Processregulation of mesenchymal cell proliferationGO:00104643.90e-43.283.45e-35.00
GO Biological Processregulation of DNA bindingGO:00511017.35e-43.242.21e-55.00
GO Biological Processregulation of bindingGO:00510988.32e-42.431.13e-53.38

HHT

The differentially expressed lncRNAs with q<0.05 (n = 617) were loaded into GREAT. A more stringent cut-off of q<0.05 was applied here, as the sample size was larger. Of these lncRNAs, 31% map within 50 kilobases (kb) of an annotated gene, and 90% map within 500 kb of a gene (Figure S4). Gene ontology analysis of the 617 neighbouring genomic regions (837 genes) is seen in Table 6. Applying a cut-off of q<0.001 resulted in three GO terms: vasculogenesis, lactation and blood vessel morphogenesis. Table 4 (and Tables S8 and S9 in File S1) lists the genes involved in the GO terms ‘vasculogenesis’ and ‘blood vessel morphogenesis’ and their relative positions.
Table 6

Gene ontology (GO) analysis of 617 genomic regions (837 genes) located nearby differentially expressed long non-coding RNAs in the total HHT group.

OntologyTerm nameTerm IDBinom FDR Q-valueBinom fold enrichmentHyper FDR Q-valueHyper fold enrichment
GO biological processvasculogenesisGO:00015701.71e-53.944.92e-55.21
GO biological processlactationGO:00075952.06e-56.519.57e-34.85
GO biological processblood vessel morphogenesisGO:00485141.92e-42.014.06e-42.40
By GREAT analysis, 63 lncRNAs were identified, which is equal to 10.2% of the differentially expressed lncRNAs in the HHT group (n = 617, q<0.05). Concordingly, 89.8% of statistically significantly differentially expressed lncRNAs in the HHT group are not part of the three selected statistically significant GO terms in GREAT. To identify potential trans-regulated transcripts, we assessed the correlation between the expression levels of the 10 highest ranking differentially expressed lncRNAs in the Hereditary Haemorrhagic Telangiectasia (HHT) total group and all transcripts present on the Agilent array, by calculating Person's correlation coefficient. The Manhattan plots drawn on this basis showed that several statistically significant correlations were present for each lncRNA in question (40-3725 correlations, p<0.05 (Bonferoni-corrected)(data not shown)), and that the correlating transcripts were mapping to many different chromosomes (Figure 5 shows the top three ranking lncRNAs). More than half the correlated transcripts were lncRNAs (data not shown). In the cases of multiple lncRNAs neighbouring a gene you see a clear correlation of foldchange/direction; meaning that if one lncRNA neighbouring a gene is upregulated, in most cases (37/49) so are the other(s) (Table 4).
Figure 5

Manhattan plots.

Manhattan plots showing significance of correlation between the three top statistically significantly differentially expressed long non-coding RNAs' (lncRNAs) expression and expression of all other genes at the microarray. The negative logarithm (-log10) p-values of the Pearson correlation were plotted across chromosomes. The Bonferoni-corrected significance level is indicated by the dashed line (p<0.05). a. ENSG00000249772.1 (Chromosome 5) has 144 statistically significantly correlated transcripts, of which 62.5% are other lncRNAs. b. ENSG00000230544.1 (Chromosome 13) has 178 statistically significantly correlated transcripts, of which 61% are other lncRNAs. c. ENSG00000215231.3 (Chromosome 5) has 158 statistically significantly correlated transcripts, of which 63% are other lncRNAs. For comparison, only 11% of the probes across the microarray map to lncRNAs.

Manhattan plots.

Manhattan plots showing significance of correlation between the three top statistically significantly differentially expressed long non-coding RNAs' (lncRNAs) expression and expression of all other genes at the microarray. The negative logarithm (-log10) p-values of the Pearson correlation were plotted across chromosomes. The Bonferoni-corrected significance level is indicated by the dashed line (p<0.05). a. ENSG00000249772.1 (Chromosome 5) has 144 statistically significantly correlated transcripts, of which 62.5% are other lncRNAs. b. ENSG00000230544.1 (Chromosome 13) has 178 statistically significantly correlated transcripts, of which 61% are other lncRNAs. c. ENSG00000215231.3 (Chromosome 5) has 158 statistically significantly correlated transcripts, of which 63% are other lncRNAs. For comparison, only 11% of the probes across the microarray map to lncRNAs.

Discussion

To our knowledge, this is the first study to assess the regulatory effects of long noncoding RNAs (lncRNAs) in HHT affected tissue. Using microarray technology, we identified lncRNAs that are statistically significantly differentially expressed in HHT telangiectasial tissue compared with HHT non-telangiectasial nasal tissue. Using GREAT, a tool which assumes cis-regulation, we also showed that those lncRNAs are enriched for genomic loci involved in key pathways concerning HHT. The comparison of expression profiles between HHT telangiectasial and HHT non-telangiectasial nasal tissue clearly showed the differential expression of a substantial number of lncRNAs. Principal component analysis (PCA) and hierarchical clustering were able to discriminate between HHT telangiectasial and HHT non-telangiectasial nasal tissue, demonstrating an HHT telangiectasial profile. However, gene expression profiling did not allow us to discriminate between HHT1 and HHT2, possibly because of the size of the study and lack of statistical power. Nonetheless, HHT1 and HHT2 had somewhat different groups of differentially expressed lncRNAs with a smaller number of common lncRNAs, which may be due to the minor phenotypical and pathway differences of HHT1 and HHT2. The lncRNA list used in this study was retrieved from the GENCODE v.16 dataset, which uses a combination of manual annotation, computational analysis and targeted experimental validation and is the largest catalogue of human lncRNAs to date. It was used in order to retrieve optimal lncRNA annotation. To minimize the number of probes mapping only or mostly to mRNAs, we filtered out probes mapping to overlapping mRNA exons and lncRNA introns. LncRNAs appear to control expression of protein-coding genes through both cis- and trans-acting pathways. As cis-regulation is reported to be more pronounced for lncRNAs [11], we chose to use GREAT for further analysis of our data. GREAT is a tool that can be used to relate the lncRNA transcriptome to biology using well-known mRNA function and gene ontology (GO) terms. Using GREAT, three interesting and statistically significant GO terms were the result: blood vessel morphogenesis, blood vessel development, and vasculogenesis. All three were located inside the top 6 resulting GO terms. Blood vessel morphogenesis is defined as: ‘The process in which the anatomical structures of blood vessels are generated and organized. The blood vessel is the vasculature carrying blood’. Blood vessel development is defined as: ‘The process whose specific outcome is the progression of a blood vessel over time, from its formation to the mature structure’; and vasculogenesis: ‘The differentiation of endothelial cells from progenitor cells during blood vessel development, and the de novo formation of blood vessels and tubes’. These GO term results are interesting as HHT manifestations are thought to result from imbalanced angiogenesis. The results may point to a causal effect of lncRNAs regulating mRNAs central to vasculogenesis, angiogenesis, and perhaps even HHT telangiectasia formation. In most of the cases of multiple lncRNAs neighbouring a gene, a clear correlation of direction was observed. However, a correlation of direction was not present in all cases, which could indicate that a more fine-tuned regulation is being played out. Each GO term involved both up- and downregulated lncRNAs, which may suggest local cis-regulation at multiple sites. The three GO terms contain repeatedly mentioned genes. Accordingly, that indicates a central group of genes that seems to be cis-regulated by differentially expressed lncRNAs. This central group of genes primarily involves: CAV1, CCM2, FOXF1, FZD4, PRSS23, RASA1, SMO, TIPARP, ZFPM2 and ZMIZ1. Some of these are genes known to cause vascular malformation disorders, such as HHT: CCM2 causes cerebral cavernous malformation type 2 [20] and RASA1 causes capillary malformation-arteriovenous malformation syndrome (CM-AVM) [21]. Another approach in the data analysis was the correlation study. The correlations between the expression levels of the 10 highest ranking differentially expressed lncRNAs in the HHT total group and every transcript present in the Agilent microarray were assessed by calculating Pearson's correlation coefficient. Multiple statistically significant correlations, mapping to many different chromosomes, were present for each lncRNA in question. Hence, the correlation analyses may indicate that multiple trans-regulative mechanisms are present. More than half the correlated transcripts are lncRNAs and, as these tend to be more tissue-specific than mRNAs, it could be that they are all co-regulated. Nevertheless, these results suggest only a certain degree of whole-genome correlation, and this may not be associated to the HHT phenotype at all. In 2007, two gene expression microarray studies were published describing HHT samples. Thomas et al. [22] studied cultured human umbilical vein endothelial cells from seven newborns carrying the familial HHT mutation, using human umbilical vein endothelial cells from three healthy newborns as controls. Fernandez-Lopez et al. [23] studied blood outgrowth endothelial cells from three HHT patients using blood outgrowth endothelial cells from healthy donors as controls. Neither of the two papers described non-coding RNA, and their raw data have not been made available for re-annotation and data analysis in order to compare with our results. The strength of our study is that affected HHT tissue was analysed in a paired design with samples from a relatively large number of patients. This resulted in a large number of differently expressed lncRNAs, even though we applied a strict cut-off. GREAT is a useful tool to relate lncRNAs to known biology, but it is limited to cis-regulation. Of the 617 aberrantly expressed lncRNAs in the HHT group, only 63 were identified in relevant GO terms by GREAT, indicating that regulative mechanisms other than cis-regulation may be important. Future studies are needed to validate the results of our study and to contribute knowledge about specific lncRNA functionality and both cis- and trans-regulatory mechanisms. In summary, our study identified lncRNAs that are aberrantly expressed in HHT telangiectasia and indicates that lncRNAs may contribute to regulate protein-coding loci in HHT, suggesting that the lncRNA component of the transcriptome deserves more attention in HHT research. Thus, a deeper understanding of lncRNAs and their role in telangiectasia formation possesses potential for discovering therapeutic targets and for identifying new biomarkers. Comparison of Agilent probe genomic coordinates and Gencode v.16 long non-coding RNAs genomic coordinates. (TIFF) Click here for additional data file. Relative orientation and distance (kb) to the nearest transcriptional start site (TSS) of all differentially expressed long non-coding RNAs (q<0.15) in the HHT1 group. (TIFF) Click here for additional data file. Relative orientation and distance (kb) to the nearest transcriptional start site (TSS) of all differentially expressed long non-coding RNAs (q<0.15) in the HHT2 group. (TIFF) Click here for additional data file. Relative orientation and distance (kb) to the nearest transcriptional start site (TSS) of all differentially expressed long non-coding RNAs (q<0.15) in the HHT total group. (TIFF) Click here for additional data file. Venn diagram: Overlap of long non-coding RNAs from the subgroups that are loaded into GREAT. (TIFF) Click here for additional data file. Hierarchical clustering HHT1. (TIFF) Click here for additional data file. Hierarchical clustering HHT2. (TIFF) Click here for additional data file. Dataset - paired t-test HHT1. (XLS) Click here for additional data file. Dataset - paired t-test HHT2. (XLS) Click here for additional data file. Dataset - paired t-test HHT. (XLS) Click here for additional data file. Contains the Tables S4–S9. Table S4. Gene-genomic regions association table for GO term ‘blood vessel morphogenesis’ (HHT1); Table S5. Gene-genomic regions association table for the GO term ‘blood vessel development’ (HHT1); Table S6. Gene-genomic regions association table for GO term ‘vasculogenesis’ (HHT1); Table S7. Gene-genomic regions association table for GO term ‘vasculogenesis’ (HHT2); Table S8. Gene-genomic regions association table for GO term ‘vasculogenesis’ (HHT); and Table S9. Gene-genomic regions association table for GO term ‘blood vessel morphogenesis’ (HHT). (DOCX) Click here for additional data file.
  23 in total

1.  A comparison of background correction methods for two-colour microarrays.

Authors:  Matthew E Ritchie; Jeremy Silver; Alicia Oshlack; Melissa Holmes; Dileepa Diyagama; Andrew Holloway; Gordon K Smyth
Journal:  Bioinformatics       Date:  2007-08-25       Impact factor: 6.937

2.  GREAT improves functional interpretation of cis-regulatory regions.

Authors:  Cory Y McLean; Dave Bristor; Michael Hiller; Shoa L Clarke; Bruce T Schaar; Craig B Lowe; Aaron M Wenger; Gill Bejerano
Journal:  Nat Biotechnol       Date:  2010-05-02       Impact factor: 54.908

3.  Hereditary haemorrhagic telangiectasia: a population-based study of prevalence and mortality in Danish patients.

Authors:  A D Kjeldsen; P Vase; A Green
Journal:  J Intern Med       Date:  1999-01       Impact factor: 8.989

4.  High frequency of ENG and ALK1/ACVRL1 mutations in German HHT patients.

Authors:  Claudia Schulte; Urban Geisthoff; Andreas Lux; Susan Kupka; Hans-Peter Zenner; Nikolaus Blin; Markus Pfister
Journal:  Hum Mutat       Date:  2005-06       Impact factor: 4.878

5.  Balancing the activation state of the endothelium via two distinct TGF-beta type I receptors.

Authors:  Marie-José Goumans; Gudrun Valdimarsdottir; Susumu Itoh; Alexander Rosendahl; Paschalis Sideras; Peter ten Dijke
Journal:  EMBO J       Date:  2002-04-02       Impact factor: 11.598

6.  Mutations in endoglin and in activin receptor-like kinase 1 among Danish patients with hereditary haemorrhagic telangiectasia.

Authors:  K Brusgaard; A D Kjeldsen; L Poulsen; H Moss; P Vase; K Rasmussen; T A Kruse; M Hørder
Journal:  Clin Genet       Date:  2004-12       Impact factor: 4.438

7.  Altered endothelial gene expression associated with hereditary haemorrhagic telangiectasia.

Authors:  B Thomas; M Eyries; K Montagne; S Martin; M Agrapart; R Simerman-François; M Letarte; F Soubrier
Journal:  Eur J Clin Invest       Date:  2007-07       Impact factor: 4.686

8.  Endoglin promotes endothelial cell proliferation and TGF-beta/ALK1 signal transduction.

Authors:  Franck Lebrin; Marie-José Goumans; Leon Jonker; Rita L C Carvalho; Gudrun Valdimarsdottir; Midory Thorikay; Christine Mummery; Helen M Arthur; Peter ten Dijke
Journal:  EMBO J       Date:  2004-09-23       Impact factor: 11.598

9.  Capillary malformation-arteriovenous malformation, a new clinical and genetic disorder caused by RASA1 mutations.

Authors:  Iiro Eerola; Laurence M Boon; John B Mulliken; Patricia E Burrows; Anne Dompmartin; Shoji Watanabe; Romain Vanwijck; Miikka Vikkula
Journal:  Am J Hum Genet       Date:  2003-11-24       Impact factor: 11.025

10.  Gene expression fingerprinting for human hereditary hemorrhagic telangiectasia.

Authors:  Africa Fernandez-L; Africa Fernandez-Lopez; Eva M Garrido-Martin; Francisco Sanz-Rodriguez; Miguel Pericacho; Alicia Rodriguez-Barbero; Nelida Eleno; Jose M Lopez-Novoa; Anette Düwell; Miguel A Vega; Carmelo Bernabeu; Luisa M Botella
Journal:  Hum Mol Genet       Date:  2007-04-09       Impact factor: 6.150

View more
  9 in total

1.  Microarray expression profiling of dysregulated long non-coding RNAs in triple-negative breast cancer.

Authors:  Chen Chen; Zhilu Li; Yuan Yang; Tingxiu Xiang; Weihong Song; Shengchun Liu
Journal:  Cancer Biol Ther       Date:  2015       Impact factor: 4.742

2.  Protein complex scaffolding predicted as a prevalent function of long non-coding RNAs.

Authors:  Diogo M Ribeiro; Andreas Zanzoni; Andrea Cipriano; Riccardo Delli Ponti; Lionel Spinelli; Monica Ballarino; Irene Bozzoni; Gian Gaetano Tartaglia; Christine Brun
Journal:  Nucleic Acids Res       Date:  2018-01-25       Impact factor: 16.971

Review 3.  The role of long non-coding RNAs in the pathogenesis of hereditary diseases.

Authors:  Peter Sparber; Alexandra Filatova; Mira Khantemirova; Mikhail Skoblov
Journal:  BMC Med Genomics       Date:  2019-03-13       Impact factor: 3.063

4.  Long non-coding RNA expression profiles predict metastasis in lymph node-negative breast cancer independently of traditional prognostic markers.

Authors:  Kristina P Sørensen; Mads Thomassen; Qihua Tan; Martin Bak; Søren Cold; Mark Burton; Martin J Larsen; Torben A Kruse
Journal:  Breast Cancer Res       Date:  2015-04-11       Impact factor: 6.466

5.  Research on potential biomarkers in hereditary hemorrhagic telangiectasia.

Authors:  Luisa-María Botella; Virginia Albiñana; Luisa Ojeda-Fernandez; Lucia Recio-Poveda; Carmelo Bernabéu
Journal:  Front Genet       Date:  2015-03-31       Impact factor: 4.599

6.  Global Prioritizing Disease Candidate lncRNAs via a Multi-level Composite Network.

Authors:  Qianlan Yao; Leilei Wu; Jia Li; Li Guang Yang; Yidi Sun; Zhen Li; Sheng He; Fangyoumin Feng; Hong Li; Yixue Li
Journal:  Sci Rep       Date:  2017-01-04       Impact factor: 4.379

7.  Comprehensive analysis of lncRNA expression profiles reveals a novel lncRNA signature to discriminate nonequivalent outcomes in patients with ovarian cancer.

Authors:  Meng Zhou; Yanying Sun; Yifan Sun; Wanying Xu; Zhaoyue Zhang; Hengqiang Zhao; Zhaohua Zhong; Jie Sun
Journal:  Oncotarget       Date:  2016-05-31

8.  Transcriptome-wide Profiling of Cerebral Cavernous Malformations Patients Reveal Important Long noncoding RNA molecular signatures.

Authors:  Santhilal Subhash; Norman Kalmbach; Florian Wegner; Susanne Petri; Torsten Glomb; Oliver Dittrich-Breiholz; Caiquan Huang; Kiran Kumar Bali; Wolfram S Kunz; Amir Samii; Helmut Bertalanffy; Chandrasekhar Kanduri; Souvik Kar
Journal:  Sci Rep       Date:  2019-12-03       Impact factor: 4.379

Review 9.  Non-Coding RNAs and Hereditary Hemorrhagic Telangiectasia.

Authors:  Anthony Cannavicci; Qiuwang Zhang; Michael J B Kutryk
Journal:  J Clin Med       Date:  2020-10-17       Impact factor: 4.241

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.