| Literature DB >> 29977047 |
M Marti-Marimon1, N Vialaneix2, V Voillet1, M Yerle-Bouissou1, Y Lahbib-Mansais1, L Liaubet3.
Abstract
The integration of genetic information in the cellular and nuclear environments is crucial for deciphering the way in which the genome functions under different physiological conditions. Experimental techniques of 3D nuclear mapping, a high-flow approach such as transcriptomic data analyses, and statistical methods for the development of co-expressed gene networks, can be combined to develop an integrated approach for depicting the regulation of gene expression. Our work focused more specifically on the mechanisms involved in the transcriptional regulation of genes expressed in muscle during late foetal development in pig. The data generated by a transcriptomic analysis carried out on muscle of foetuses from two extreme genetic lines for birth mortality are used to construct networks of differentially expressed and co-regulated genes. We developed an innovative co-expression networking approach coupling, by means of an iterative process, a new statistical method for graph inference with data of gene spatial co-localization (3D DNA FISH) to construct a robust network grouping co-expressed genes. This enabled us to highlight relevant biological processes related to foetal muscle maturity and to discover unexpected gene associations between IGF2, MYH3 and DLK1/MEG3 in the nuclear space, genes that are up-regulated at this stage of muscle development.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29977047 PMCID: PMC6033925 DOI: 10.1038/s41598-018-28173-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Experimental design. Published data are represented in green squares (microarray data and 3D DNA FISH data), statistical methods are represented in blue (GGM: Gaussian Graphical Models) and new information about spatial localization used for network inference is represented in red.
Figure 2Analysis of gene associations. Pink nodes represent target genes, red edges represent the known associations observed by 3D DNA FISH and the dotted orange edge represents the observed as not associated after 3D FISH validations. Because networks are very dense and contain many genes, a sub-network restricted to the target genes and their direct neighbors is extracted from each network, and presented in this figure. (a) Network 0 is inferred without a priori information, and restricted to the nodes corresponding to IGF2, DLK1 and MEG3 (in yellow). To infer Networks 1, 2 and 3, new a priori information of spatial localization is introduced for the following pairs of genes: (b) IGF2-DLK1, IGF2-MEG3 and DLK1-MEG3 for Network 1; (c) IGF2-MEST, (DLK1/MEG3)-MEST, (DLK1/MEG3)-DCN, RPL32-IGF2, IGF2-DCN for Network 2; (d) IGF2-MYH3, DLK1-MYH3, MEG3-MYH3 and MEST-MYH3 for Network 3.
Association percentages of tested gene pairs.
| Gene associations | Number of nuclei analysed | Percentage of nuclei with signals | |||
|---|---|---|---|---|---|
| Distant (d > 1 µm) | Close (0, 5 < d ≤ 1 µm) | Co-localized (d < 0.5 µm) | Associated (d ≤ 1 µm) | ||
| MEST* - IGF2* | 100 | 66 | 32 | 2 |
|
| MEST* - (DLK1-MEG3)* | 90 | 66 | 28 | 6 |
|
| DCN - (DLK1-MEG3)* | 73 | 85 | 15 | 0 |
|
| RPL32 - IGF2* | 80 | 80 | 16 | 4 |
|
| DCN - IGF2* | 98 | 90 | 7 | 3 |
|
| IGF2* - MYH3 | 58 | 48 | 43 | 9 |
|
| (DLK1-MEG3)* - MYH3 | 69 | 55 | 38 | 7 |
|
| MEST* - MYH3 | 103 | 74 | 23 | 3 |
|
| ZAR1 - IGF2* | 61 | 92 | 8 | 0 |
|
| ZAR1 - PRLR | 63 | 92 | 8 | 0 |
|
Associated signals (close + co-localized) are considered as those separated by a 3D distance (d) ≤ 1 µm, and are divided into two different classes: “close” signals (0.5 < d ≤ 1 µm), and “co localized” signals (d ≤ 0.5 µm). *Genes imprinted in pig.
Figure 3Analysis of gene associations by DNA FISH. Extended focus of 3D image sections from confocal microscopy and overlay of the 3 channels (blue, red and green) were obtained with Volocity v6.0 software (Perkin Elmer). The four signals in the nuclei correspond to the two alleles of each gene. Nuclei are counterstained with DAPI (blue). In all experiments, the percentage of association between genes was higher than 10% except for (e). Scale = 1.7 µm.
Normalized mutual information (NMI) between pairs of clusterings.
| Network 0 | Network 1 | Network 2 | Network 3 | |
|---|---|---|---|---|
| Network 0 | 1 | 0.3893 | 0.3381 | 0.3244 |
| Network 1 | 0.3893 | 1 | 0.4007 | 0.3923 |
| Network 2 | 0.3381 | 0.4007 | 1 | 0.4152 |
| Network 3 | 0.3244 | 0.3923 | 0.4152 | 1 |
NMI measure the similarity between two clusterings. The value is comprised between 0 and 1 and is equal to 1 when the two clusterings are identical.
Comparison of GOBP in clusters 1 and 2 between Network 0 and Network 3.
| GO ID | GOBP Terms | Network 0 - Cluster 1 | Network 3 - Cluster 1 | ||
|---|---|---|---|---|---|
| Genes | FDR | Genes | FDR | ||
| 43062 | Extracellular structure |
| 5,76E-05 |
| |
| 71417 | Cellular response to organonitrogen compound |
|
| 1,16E-02 | |
| 45995 | Regulation of embryonic development |
| 2,24E-03 |
| 1,16E-02 |
| 71559 | Reponse to transforming growth factor beta |
|
|
| 1,24E-01 |
| 44236 | Multicellular organism metabolic process |
| 2,35E-03 |
| 3,05E-03 |
| 43588 | Skin development |
|
|
| 1,44E-01 |
| 1101 | Reponse to acid chemical |
| 1,17E-02 |
| 2,27E-02 |
| 1501 | Skeletal system development |
| 1,43E-02 | 3,05E-03 | |
|
|
| ||||
| 72350 | Tricarboxylic acid metabolic process |
| 3,02E-06 |
| 2,11E-05 |
| 51186 | Cofactor metabolic process |
|
|
| 1,34E-03 |
| 72524 | Pyridine-containig compound metabolic process |
|
|
| 1,11E-02 |
| 6631 | Fatty acid metabolic process |
|
|
| 1,17E-03 |
| 6091 | Generation of precursor metabolites and energy |
| 1,09E-04 |
|
|
| 6090 | Pyruvate metabolic process |
| 5,42E-03 |
| 2,32E-02 |
| 6790 | Sulfur compound metabolic process |
|
| 4,79E-01 | |
| 42180 | Cellular ketone metabolic process |
| 1,46E-02 |
| 8,05E-02 |
| 45454 | Cell redox homeostasis |
| 1,46E-02 |
| 4,91E-02 |
| 44282 | Small molecule catabolic process |
| 1,88E-02 |
| 4,51E-02 |
| 98656 | Anion transmembrane transport |
|
|
| 3,77E-01 |
| 6081 | Cellular aldehyde metabolic process |
| 2,59E-02 |
| 8,73E-02 |
| 43648 | Dicarboxylic acid metabolic process |
| 3,13E-02 |
| 2,13E-02 |
| 16042 | Lipid catabolic process |
| 3,65E-02 |
| 6,59E-02 |
| 10257 | NADH dehydrogenase complex assembly |
|
| ||
| 97031 | Mitochondrial respiratory chain complex I biogenesis |
|
| ||
GO terms enriched in one of the clusters as well as all GO terms associated to one of the three target genes at least (even if not significantly enriched). In bold, the smallest FDR value for a given GOBP term when the difference between the FDR of the two clusters is higher than one order of magnitude. Genes tested by 3D DNA FISH are in underline bold.
Figure 4Reconstructed network of genes in cluster 1 of Network 3, based on Ingenuity Pathways Knowledge Base. Nodes are displayed using various shapes that represent the functional class of the gene product. The reconstructed network was generated through the use of Ingenuity Pathway Analysis (IPA) (Ingenuity Systems; QIAGEN, Inc., Valencia, CA, USA).