| Literature DB >> 30069000 |
Jose P Silva1, Derek van Booven2.
Abstract
Long non-coding RNAs (lncRNAs) regulate expression of protein-coding genes in cis through chromatin modifications including DNA methylation. Here we interrogated whether lncRNA genes may regulate transcription and methylation of their flanking or overlapping protein-coding genes in livers of mice exposed to a 12-week cholesterol-rich Western-style high fat diet (HFD) relative to a standard diet (STD). Deconvolution analysis of cell type-specific marker gene expression suggested similar hepatic cell type composition in HFD and STD livers. RNA-seq and validation by nCounter technology revealed differential expression of 14 lncRNA genes and 395 protein-coding genes enriched for functions in steroid/cholesterol synthesis, fatty acid metabolism, lipid localization, and circadian rhythm. While lncRNA and protein-coding genes were co-expressed in 53 lncRNA/protein-coding gene pairs, both were differentially expressed only in 4 lncRNA/protein-coding gene pairs, none of which included protein-coding genes in overrepresented pathways. Furthermore, 5-methylcytosine DNA immunoprecipitation sequencing and targeted bisulfite sequencing revealed no differential DNA methylation of genes in overrepresented pathways. These results suggest lncRNA/protein-coding gene interactions in cis play a minor role mediating hepatic expression of lipid metabolism/localization and circadian clock genes in response to chronic HFD feeding.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30069000 PMCID: PMC6070528 DOI: 10.1038/s41598-018-29993-4
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
RNA-seq alignment statistics.
| Sample | Input read pairs | Uniquely mapped read pairs | Uniquely mapped read pairs (%) |
|---|---|---|---|
| HFD A | 64811482 | 47999627 | 74.06% |
| HFD B | 75925925 | 58699825 | 77.31% |
| HFD C | 66170800 | 52564490 | 79.44% |
| STD A | 62155721 | 47486909 | 76.40% |
| STD B | 75900043 | 57932416 | 76.33% |
| STD C | 55098436 | 41984291 | 76.20% |
| Average | 66677068 | 51111260 | 76.62% |
| Average HFD (n = 3 pools) | 68969402 | 53087981 | 76.94% |
| Average STD (n = 3 pools) | 64384733 | 49134539 | 76.31% |
Three pools of 2 livers per diet (HFD A-C; STD A-C) were paired-end sequenced on the Illumina platform. Input read pairs passed Illumina’s internal quality control filters. Uniquely mapped read pairs are read pairs aligning to a unique site of the mouse genome with less than 2 mismatches.
Figure 1Composition of the mouse hepatic transcriptome by total RNA (a) ncRNA (b) short ncRNA (c) lncRNA (d) and sense RNA (complementary to antisense RNA) (e). Only RNAs with average expression levels >1 CPM were considered for analysis.
Figure 2Differential gene expression analyses by edgeR and validation of expression differences by nCounter® technology. (a) Two-way clustering of differentially expressed genes by edgeR separates HFD- from STD-fed mouse liver pools. The color code reflects the expression value of a gene given as the log2 of counts per million (log2CPM). Only genes with average expression levels across all samples >1 CPM were considered for analysis. (b) Correlation analysis of fold-changes determined by edgeR and nCounter® technology for 95 protein-coding genes reveals tight concordance (Pearson’s correlation coefficient r = 0.97). (c) Biological processes modulated by HFD-induced protein-coding gene expression.
Estimation of hepatic cell type proportions by deconvolution analysis.
| Algorithm | Cell type | STD Mean (%) | STD SEM (%) | HFD Mean (%) | HFD SEM (%) | |
|---|---|---|---|---|---|---|
| DSA | Hepatocytes | 49.7 | 0.3 | 50.3 | 0.3 | 0.23 |
| DSA | Kupffer cells | 25.2 | 0.2 | 24.8 | 0.2 | 0.23 |
| DSA | Sinusoidal endothelial cells | 25.2 | 0.2 | 24.8 | 0.2 | 0.23 |
| meanProfile | Hepatocytes | 44.4 | 0.3 | 44.2 | 0.3 | 0.51 |
| meanProfile | Kupffer cells | 28.3 | 0.1 | 28.4 | 0.2 | 0.44 |
| meanProfile | Sinusoidal endothelial cells | 27.2 | 0.1 | 27.2 | 0.0 | 0.88 |
The Digital Sorting Algorithm (DSA) and meanProfile algorithm in the CellMix1.6 package estimated frequencies of hepatocyte, Kupffer cells and sinusoidal endothelial cells by deconvolving cell type specific marker gene expression. The p-values are from two-sided t tests comparing the estimated frequencies of a given cell type in STD livers versus HFD livers.
Pathway analysis.
| Biological Process | GO-ID | FDR DAVID | FDR BiNGO/Cytoscape3.2 | DEG DAVID | DEG BiNGO/Cytoscape3.2 |
|---|---|---|---|---|---|
| Steroid biosynthetic process | 6694 | 1.6E-13 | 6.2E-21 | 20 | 21 |
| Cholesterol metabolic process | 8203 | 1.0E-10 | 1.42E-22 | 17 | 22 |
| Fatty acid metabolic process | 6631 | 6.8E-04 | 3.1E-08 | 16 | 17 |
| Lipid localization | 10876 | 5.6E-03 | 1.3E-05 | 12 | 11 |
| Circadian rhythm | 7623 | 6.1E-02 | 8.01E-04 | 6 | 6 |
DAVID and BiNGO/Cytoscape3.2 determined statistical overrepresentation of biological processes using the Benjamini-Hochberg method. Only DEG with average expression levels across all samples >1 CPM were considered for enrichment analysis. Biological processes are denoted by their official gene ontology names. GO-ID refers to the official Gene Ontology Identity Number. FDR DAVID and FDR BiNGO/Cytoscape3.2 refer to the False Discovery Rate-adjusted p-value calculated by DAVID and BiNGO/Cytoscape3.2, respectively. DEG DAVID and DEG BiNGO/Cytoscape3.2 refer to the number of DEG assigned by DAVID and BiNGO/Cytoscape3.2, respectively, to a biological process.
Diet-induced differential expression of lncRNA genes.
| Gene symbol | edgeR FC (H/S) | edgeR FDR | nCounter FC (H/S) | nCounter | nCounter targeted transcript isoforms |
|---|---|---|---|---|---|
| A330076H08Rik | 0.02 | 1.9E-03 | 0.74 | 3.29E-01 | ENSMUST00000181804 |
| 0.45 | 1.17E-02 | ENSMUST00000181187 | |||
| A530040E14Rik | 0.06 | 6.0E-02 | 0.39 | 1.95E-02 | ENSMUST00000161666 |
| ENSMUST00000162453 | |||||
| 0.29 | 3.47E-03 | ENSMUST00000162353 | |||
| ENSMUST00000093501 | |||||
| ENSMUST00000161150 | |||||
| AY512915 | 0.01 | 2.2E-03 | 0.39 | 1.81E-02 | ENSMUST00000089303 |
| 0.67 | 1.78E-01 | ENSMUST00000159751 | |||
| 0.54 | 3.85E-02 | ENSMUST00000162315 | |||
| 0.55 | 9.59E-02 | all transcript isoforms | |||
| Dlx6os1 | 0.01 | 1.8E-04 | 0.39 | 2.42E-02 | ENSMUST00000159827 |
| Gm12002 | 0.003 | 3.8E-03 | 0.38 | 4.16E-03 | ENSMUST00000153794 |
| 0.72 | 7.83E-03 | ENSMUST00000174550 | |||
| Gm13582 | 0.02 | 2.1E-04 | 0.50 | 5.06E-02 | all transcript isoforms |
| Gm16025 | 0.06 | 3.95E-02 | 0.56 | 1.67E-02 | ENSMUST00000161282 |
| 0.81 | 2.11E-02 | all transcript isoforms | |||
| Gm16028 | 0.06 | 1.5E-02 | 0.20 | 7.95E-03 | all transcript isoforms |
| Gm20631 | 0.06 | 9.1E-03 | 0.40 | 7.51E-03 | ENSMUST00000177239 |
| 0.39 | 2.00E-02 | ENSMUST00000177268 | |||
| 0.48 | 1.34E-02 | all transcript isoforms | |||
| Gm21846 | 0.04 | 1.7E-02 | 0.52 | 6.53E-03 | All transcript isoforms |
| Wt1os | 0.003 | 3.1E-05 | 0.67 | 1.96E-01 | ENSMUST00000172701 |
| ENSMUST00000174870 | |||||
| 0.42 | 1.97E-02 | ENSMUST00000135153 | |||
| Gm26870 | 0.19 | 1.5E-08 | n.d. | n.d. | n.d. |
| Gm11399 | 0.07 | 6.4E-03 | n.d. | n.d. | n.d. |
| D230017M19Rik | 0.1 | 2.7E-03 | n.d. | n.d. | n.d. |
Differential expression of lncRNA genes was validated by nCounter® technology in individual livers (n = 6/diet). nCounter® probes targeted all or specific transcript isoforms of a lncRNA gene. Specifically targeted transcript isoforms are denoted by their Ensembl transcript IDs. nCounter® fold-change values were not determined (n.d.) for Gm26870, Gm11399, D230017M19Rik due to lack of specific nCounter® probes. FC (H/S) refers to the fold-change (FC) in expression in HFD livers (H) relative to STD livers (S). nCounter® p-values were calculated by two-sided t test. edgeR FDR refers to the False Discovery Rate-adjusted p-value calculated by edgeR.
5meDIP-seq alignment statistics.
| Sample | Number of reads | Number of alignments | Number of unique alignments | Number of unique alignments without duplicate reads |
|---|---|---|---|---|
| HFD A | 33550731 | 31488970 | 15502775 | 10051886 |
| HFD B | 33370167 | 31294850 | 16434763 | 10894779 |
| HFD C | 36995355 | 34708215 | 19888168 | 14948873 |
| STD A | 37931887 | 35475014 | 19716455 | 13654052 |
| STD B | 39045552 | 36407892 | 20167366 | 14222042 |
| STD C | 36576656 | 34298870 | 19934085 | 14208816 |
| Average | 36245058 | 33945635 | 18607269 | 12996741 |
| Average HFD (n = 3 pools) | 34638751 | 32497345 | 17275235 | 11965179 |
| Average STD (n = 3 pools) | 37851365 | 35393925 | 19939302 | 14028303 |
5-methylcytosine DNA immunoprecipitates (n = 3 pools of 2 livers/diet) were single-end sequenced. The number of alignments is the number of reads that aligned with no more than 2 mismatches to the mouse genome. The number of unique alignments refers to the number of reads that aligned to a unique site of the mouse genome with no more than 2 mismatches. The number of unique alignments without duplicate reads is the number of reads that aligned to a unique site of the mouse genome with less than 2 mismatches and were not exact duplicates produced by PCR amplification of the sequencing libraries. Of these, exactly 10 million reads were used for methylation analysis.
Figure 3Heatmap of DNA methylation in STD and HFD livers determined by 5meDIP-seq. (a) K-means clustering of 154664 methylated regions (y-axis) by peak signal (fragment counts) from −2000 bp to +2000 bp relative to midpoint (x-axis). (b) K-means clustering of 16026 CpG islands (y-axis) annotated in the UCSC genome browser by peak signal (fragment counts) from −2000 bp to +2000 bp relative to midpoint (x-axis). The color code in (a and b) reflects the number of fragments at the specified position. All heatmaps were generated using seqplots and a default cluster size of 5.