| Literature DB >> 24847104 |
Zhisong He1, Hindrike Bammann2, Dingding Han3, Gangcai Xie1, Philipp Khaitovich2.
Abstract
The current annotation of the human genome includes more than 12,000 long intergenic noncoding RNAs (lincRNA). While a handful of lincRNA have been shown to play important regulatory roles, the functionality of most remains unclear. Here, we examined the expression conservation and putative functionality of lincRNA in human and macaque prefrontal cortex (PFC) development and maturation. We analyzed transcriptome sequence (RNA-seq) data from 38 human and 40 macaque individuals covering the entire postnatal development interval. Using the human data set, we detected the expression of 5835 lincRNA annotated in GENCODE and further identified 1888 novel lincRNA. Most of these lincRNA show low DNA sequence conservation, as well as low expression levels. Remarkably, developmental expression patterns of these lincRNA were as conserved between humans and macaques as those of protein-coding genes. Transfection of development-associated lincRNA into human SH-SY5Y cells affected gene expression, indicating their regulatory potential. In brain, expression of these putative target genes correlated with the expression of the corresponding lincRNA during human and macaque PFC development. These results support the potential functionality of lincRNA in primate PFC development.Entities:
Keywords: development; human; lincRNA; macaque; prefrontal cortex
Mesh:
Substances:
Year: 2014 PMID: 24847104 PMCID: PMC4074677 DOI: 10.1261/rna.043075.113
Source DB: PubMed Journal: RNA ISSN: 1355-8382 Impact factor: 4.942
FIGURE 1.Expression and conservation of novel and annotated human lincRNAs. (A) Distributions of the average expression levels of novel (blue) and annotated (red) lincRNAs, as well as annotated protein-coding genes (gray), in human PFC samples measured in RPKM (reads per kilobase of exon model per million mapped reads). (B) Sequence conservation of lincRNA genes. Shown are median phastCon scores of novel lincRNAs expressed (blue) and not expressed (green) in the human PFC, annotated lincRNAs expressed (red) and not expressed (orange) in the human PFC, as well as median phastCon scores of simple repeats (gray) used as a nonconserved sequence reference. PhastCon scores were calculated based on 11 primate genomes. “Expressed lincRNAs” are defined as lincRNA detected in at least half of the samples with maximal expression > 1 RPKM. The box plots show variation of the median conservation estimates calculated by bootstrapping over phastCon score values 1000 times.
FIGURE 2.Developmental expression patterns of lincRNAs and protein-coding genes in human and macaque PFC. (A) Expression patterns of lincRNA (red: human, light red: macaque) and protein-coding genes (blue: human, light blue: macaque) defined as age-related in human PFC development. Expression values were z-transformed before plotting. Numbers in brackets above each panel indicate numbers of lincRNAs or protein-coding genes in the cluster. On all plots, macaque ages are linearly transformed to the human age scale. (B) Distribution of Pearson correlation coefficients (PCC) based on the comparison of developmental expression profiles between humans and macaques for annotated lincRNAs (orange), novel lincRNAs (green), and protein-coding genes (gray). Confidence intervals of the PCC distribution for protein-coding genes are shown by the dashed gray lines and were calculated by subsampling protein-coding genes according to the expression distribution of lincRNA genes 1000 times.
FIGURE 3.The effect of lincRNA overexpression on putative target genes in SY5Y cells. (A) Expression level distributions of protein-coding genes (green) and lincRNAs (blue) in the SH-SY5Y human neuroblastoma cell line. The red arrows mark the average expression levels of the three transfected lincRNAs after overexpression. (B) Number of genes showing differential expression (DE) between mock and lincRNA transfection in the SH-SY5Y cell line plotted for each of the three development-associated lincRNAs. Genes affected by more than one lincRNA transfection are shown in light blue, with independently affected genes in dark blue. The shaded bars show the number of independent protein-coding genes. The solid gray line shows the number of differentially expressed genes expected by chance. The two gray dotted lines show the 90% confidence interval for the chance distribution. (C) Regulatory effects of lincR-0003 in brain. The red box plots show the medians of the absolute values of the Pearson correlation coefficients based on the developmental expression profile of a lincRNA and its independent protein-coding target genes, identified in SY5Y cells. The variance of the median estimates was calculated by bootstrapping target genes 1000 times. The gray boxes show the distribution of the median values of the absolute Pearson correlation coefficients based on the developmental expression profiles of lincRNAs and the same number of nontarget protein-coding genes sampled 1000 times. (D) Expression profile of lincR-0003 in human (blue) and macaque (green) PFC development. Macaque ages are linearly transformed to the human scale. Expression values were z-transformed. Shaded areas indicate standard deviation of expression level estimates. (E) Conservation of lincR-0003 target profiles between human and macaque PFC development. The red box plots show the medians of the Pearson correlation coefficients based on developmental expression profiles of independent lincR-0003 protein-coding target genes in human and macaque PFC. The variance estimate and the background distribution (gray box plot) are as in panel C.