| Literature DB >> 21247874 |
Qi Liao1, Changning Liu, Xiongying Yuan, Shuli Kang, Ruoyu Miao, Hui Xiao, Guoguang Zhao, Haitao Luo, Dechao Bu, Haitao Zhao, Geir Skogerbø, Zhongdao Wu, Yi Zhao.
Abstract
Although accumulating evidence has provided insight into the various functions of long-non-coding RNAs (lncRNAs), the exact functions of the majority of such transcripts are still unknown. Here, we report the first computational annotation of lncRNA functions based on public microarray expression profiles. A coding-non-coding gene co-expression (CNC) network was constructed from re-annotated Affymetrix Mouse Genome Array data. Probable functions for altogether 340 lncRNAs were predicted based on topological or other network characteristics, such as module sharing, association with network hubs and combinations of co-expression and genomic adjacency. The functions annotated to the lncRNAs mainly involve organ or tissue development (e.g. neuron, eye and muscle development), cellular transport (e.g. neuronal transport and sodium ion, acid or lipid transport) or metabolic processes (e.g. involving macromolecules, phosphocreatine and tyrosine).Entities:
Mesh:
Substances:
Year: 2011 PMID: 21247874 PMCID: PMC3089475 DOI: 10.1093/nar/gkq1348
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Re-annotation of Affymetrix Mouse Genome 430 2.0 Array probes. (A) Computational pipeline for re-annotating the probes of the Mouse 430 2.0 array. (B) The relative distribution of the 496 468 original probes of the Affymetrix Mouse Genome 430 2.0 Array.
Figure 2.Specificity and accuracy of the r-Mouse4302cdf file. (A) Mean expressional correlation of probes (GSE9954) targeting the same coding gene before and after re-annotation (P < 2.20e-16 by Kolmogorov–Smirnov Test). (B) Coefficient of variance on the expressional correlation of probes (GSE9954) targeting the same coding gene between before and after re-annotation (P < 2.2e-16, Kolmogorov–Smirnov Test). (C) Expressional correlations of lncRNAs. Dark grey line: distribution of Spearman correlation coefficients for the expression of identical lncRNAs in corresponding tissues in the Riken cDNA array data set and in the re-annotated Mouse 430 2.0 (GSE9954) array data. Light grey lines: distribution of Spearman correlation coefficients for the expression of randomly selected lncRNA pairs, repeated 1000 times. (Mean Spearman correlation coefficient was 0.26, mean P-value of the KS test was 4.39e-08). (D) Expression profiles of lncRNAs TK27265 and TK100617 in the re-annotated Mouse 430 2.0 array data and in the Riken cDNA array data.
Figure 3.The coding–non-coding gene co-expression network. (A) The relationship between the number of data sets in which gene pairs were co-expressed and the similarity of the annotated functions of the connected gene pairs. The figure shows the probability that gene pairs co-expressed in a number of data sets have X or less common GO BP terms. (B) Visualization of the CNC network. Green nodes represent non-coding genes while pink nodes represent coding genes. Several of the largest modules are shown. (C) GO enrichment analysis result of 2858 coding genes co-expressed with at least one lncRNA gene. (D) GO enrichment analysis of 243 coding genes enriched for co-expressed lncRNA genes. (E) GO enrichment analysis result of the 1249 coding genes associated with 189 lncRNAs enriched for co-expressed coding genes.
Figure 4.Genomic contexts of four non-coding genes. (A) Intronic lncRNA TK226771. (B) Upstream lncRNA TK118632. (C) Downstream lncRNA TK79018.
Figure 5.Hub-based functional prediction. (A) The relationship among PV, GN and precision in the CNC network. (B) The relationship among the PV, GN and specificity in the CNC network. Random networks are shown for comparison.
Figure 6.The largest modules of the co-expression network. The color depth signifies the number of data sets in which the gene pairs were co-expressed.
Figure 7.The ‘synaptic transmission’ module. (A) GO enrichment analysis of coding genes within the module. (B) Expression patterns of coding and non-coding genes within the module (GSE9954 was used). (C) Network visualization of the module. Green circles indicate lncRNAs, yellow circles represent coding genes involved in the neuron active ligand–receptor interaction pathway, while blue circles represent other coding genes. Co-expressed and co-located gene pairs are marked by red rectangles.