| Literature DB >> 23049964 |
Dafne Pérez-Montarelo1, Nicholas J Hudson, Ana I Fernández, Yuliaxis Ramayo-Caldas, Brian P Dalrymple, Antonio Reverter.
Abstract
The processes that drive tissue identity and differentiation remain unclear for most tissue types. So are the gene networks and transcription factors (TF) responsible for the differential structure and function of each particular tissue, and this is particularly true for non model species with incomplete genomic resources. To better understand the regulation of genes responsible for tissue identity in pigs, we have inferred regulatory networks from a meta-analysis of 20 gene expression studies spanning 480 Porcine Affymetrix chips for 134 experimental conditions on 27 distinct tissues. We developed a mixed-model normalization approach with a covariance structure that accommodated the disparity in the origin of the individual studies, and obtained the normalized expression of 12,320 genes across the 27 tissues. Using this resource, we constructed a network, based on the co-expression patterns of 1,072 TF and 1,232 tissue specific genes. The resulting network is consistent with the known biology of tissue development. Within the network, genes clustered by tissue and tissues clustered by site of embryonic origin. These clusters were significantly enriched for genes annotated in key relevant biological processes and confirm gene functions and interactions from the literature. We implemented a Regulatory Impact Factor (RIF) metric to identify the key regulators in skeletal muscle and tissues from the central nervous systems. The normalization of the meta-analysis, the inference of the gene co-expression network and the RIF metric, operated synergistically towards a successful search for tissue-specific regulators. Novel among these findings are evidence suggesting a novel key role of ERCC3 as a muscle regulator. Together, our results recapitulate the known biology behind tissue specificity and provide new valuable insights in a less studied but valuable model species.Entities:
Mesh:
Substances:
Year: 2012 PMID: 23049964 PMCID: PMC3458843 DOI: 10.1371/journal.pone.0046159
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Description of the datasets used in this study.
| Reference | GEO Acc. | Chips | Tissue(s) | Brief description |
|
| GSE26701 | 12 | SM | 4 postmortem times (20 min, 2 h, 6 h, 24 h) with 3 rep. |
|
| GSE22487 | 12 | LD | 4 developmental times (0 d, 7 d, 14 d, 21 d) with 3 rep. |
|
| GSE21383 | 12 | OVA | 6 high prolificacy replicates +6 low prolificacy rep. |
|
| GSE19975 | 6 | LD, SOL | 2 tissues with 3 rep. |
|
| GSE22165 | 30 | BRAIN | 10 conditions (3 treatments |
|
| GSE18641 | 12 | UTE | 6 pregnant rep. +6 non-pregnant rep. |
|
| GSE14643 | 13 | HEART | 6 untreated rep. +7 treated rep. |
|
| GSE15256 | 54 | ILE | 3 conditions |
|
| GSE11853 | 12 | PLA | 2 breeds |
|
| GSE11787 | 6 | SPL | 3 infected rep.s +3 uninfected rep. |
|
| GSE9333 | 8 | BFT | 2 breeds with 4 rep. |
|
| GSE11193 | 12 | LD | 6 high drip loss rep. +6 low drip loss rep. |
|
| GSE7314 | 15 | MLN | 3 uninfected rep.+(3 infected rep. |
|
| GSE7313 | 15 | MLN | 3 uninfected rep. +(3 infected rep. |
|
| GSE10898 | 64 | OLF, HYP, PIN, ADE, NEU, ACO AME, THY, DIA, BIC, BFT, AFT, STO, LIV, ILE, BLO | 2 breeds |
|
| GSE13528 | 48 | LIV, BFT | 2 conditions |
|
| GSE18359 | 40 | LIV, BFT | 2 conditions |
|
| GSE21096 | 20 | HEART | 4 treatments with 5 rep. |
|
| GSE23596 | 9 | SPL | 3 treatments with 3 rep. |
|
| GSE14739 | 80 | HYP, ADE, THY, OVA, TES, BFT | 4 breeds |
| TOTAL | 20 | 480 | 27 |
Rep.: replicates.
Tissue codes are as follows: SM: Semi-membranosus muscle; LD: Longissimus dorsi muscle; OVA: Ovaries; SOL: Soleus muscle; BRAIN: Brain; UTE: Uterus; HEART: Heart; ILE: Ileum; PLA: Placenta; SPL: Spleen; BFT: Back fat tissue; MLN: Mesenteric lymph nodes; OLF: Olfactory bulb; HYP: Hypothalamus; PIN: Pineal gland; ADE: Adenohypophysis; NEU: Neurhypophysis; ACO: Adrenal cortex; AME: Adrenal medulla; LIV: Liver, THY: Thyroid gland; DIA: Diaphragm; BIC: Biceps femoris muscle; AFT: Abdominal fat tissue; STO: Stomach; BLO: Blood; TES: Testes.
Figure 1Clustering of tissues.
Hierarchical cluster analysis of the 27 tissues based on the expression of 12,320 porcine genes.
Figure 2Tissue specificity value (TSV).
(A) Distribution of the percentage of genes having its maximum expression in each tissue (left y axis) and the mean TSV of all the selected genes per tissue (right y axis). Standard errors are indicated as bars above and below the mean TSV. (B) Empirical density distribution of the TSV for tissue-specific genes (red bars), transcription factor genes (green bars) and remaining genes (black bars).
Enrichment of tissue-specific genes for transcription factors (TF), imprinted genes (IMP) and disease-associated genes (DIS).
| All genes (N = 12,320) | Tissue-Specific (N = 1,232) |
| |||
| N | % | N | % | ||
| TF | 1,072 | 8.70 | 112 | 9.09 | 3.67E-02 |
| IMP | 134 | 1.09 | 23 | 1.87 | 3.53E-03 |
| DIS | 8,807 | 71.48 | 969 | 78.65 | 3.74E-10 |
These percentages do not sum to one because not all of the 12,320 genes (or the subset of 1,232 tissue-specific genes) belong to one of the tree categories under scrutiny: TF, IMP and DIS.
Figure 3Tissue specific regulatory network of the porcine transcriptome.
Legend of colours assigned to each of the 27 tissues in the network (left). The co-expression network (right). Node size was mapped to average transcript abundance, node colour was mapped to the tissue in which each particular gene is specific and node shape was mapped to the different gene types: TS (squares), TF (triangles) and TSTF (circles).
Figure 4Embryonic origin of tissues.
Tissue specific regulatory network of the porcine transcriptome, showing the embryonic origin of each tissue. In this instance, node colour was mapped to the embryonic origin of each tissue: blue for the ectoderm-derived tissues, green for the mesoderm ones and yellow for the tissues formed from the endoderm.
Figure 5TSTF Network.
The colour codes are as per Figure 3 and node size was mapped to average transcript abundance.
Figure 6Regulatory impact factors (RIF).
Scatter plot of the relationship between RIF1 and RIF2 in the two contrasts explored: (A) Muscle vs. Other Tissues; and (B) CNS vs. Other Tissues. Notice how in each contrast, the transcription factors of biological relevance are concentrated on the right half and upper-right quarter of the scatter.
Enrichment of tissue specificity in the regulatory impact factor (RIF) analysis.
| Overall | |RIF1|+RIF2>2 | |||||
| Muscle vs others | CNS vs Others | |||||
| N | % | N | % | N | % | |
| TF | 960 | 89.5 | 151 | 88.3 | 151 | 88.3 |
| TF CNS | 25 | 2.3 | 3 | 1.7 | 8 | 4.7 |
| TF muscle | 27 | 2.5 | 9 | 5.3 | 5 | 3.0 |
| TFTS | 60 | 5.6 | 8 | 4.7 | 13 | 7.6 |
| Total | 1,072 | 171 | 177 | |||
Normalized mean expression (NME, base-2 logarithm scale) and differential expression (DE) for the 10 most DE genes along with their co-expression correlation with ERCC3 and MYOCD in the skeletal muscle and other tissues.
| Gene | NME | DE | Corr. with | Corr. with | ||
| Muscle | Others | Muscle | Others | |||
|
| 6.277 | 4.311 | −0.467 | −0.596 | 0.449 | 0.591 |
|
| 4.447 | 4.187 | −0.579 | 0.402 | 0.638 | −0.422 |
|
| 4.891 | 4.131 | 0.462 | 0.017 | −0.462 | 0.087 |
|
| 5.637 | 4.069 | −0.118 | 0.163 | 0.237 | −0.133 |
|
| 5.086 | 4.061 | −0.471 | −0.374 | 0.585 | 0.129 |
|
| 7.126 | 3.994 | −0.155 | 0.128 | 0.146 | 0.018 |
|
| 5.396 | 3.989 | 0.823 | 0.162 | −0.944 | 0.134 |
|
| 7.060 | 3.985 | −0.463 | −0.409 | 0.445 | 0.099 |
|
| 7.103 | 3.914 | −0.394 | −0.611 | 0.555 | 0.651 |
|
| 5.930 | 3.856 | 0.599 | 0.164 | −0.747 | −0.328 |