| Literature DB >> 19794913 |
Nicholas J Hudson1, Antonio Reverter, YongHong Wang, Paul L Greenwood, Brian P Dalrymple.
Abstract
BACKGROUND: Despite modern technologies and novel computational approaches, decoding causal transcriptional regulation remains challenging. This is particularly true for less well studied organisms and when only gene expression data is available. In muscle a small number of well characterised transcription factors are proposed to regulate development. Therefore, muscle appears to be a tractable system for proposing new computational approaches. METHODOLOGY/PRINCIPALEntities:
Mesh:
Substances:
Year: 2009 PMID: 19794913 PMCID: PMC2749936 DOI: 10.1371/journal.pone.0007249
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The profiles of MYOD1 and MYOG across the 6 transcriptional landscapes.
Their significant correlation in each of the 6 instances explains their inclusion in the Always Correlated landscape.
Figure 2The frequency distributions of all correlation coefficients in each of the six transcriptional landscapes (black) plus those deemed significant by PCIT (red).
Figure 3The Always Correlated transcriptional landscape.
Networks were visualised using the organic algorithm of Cytoscape [29]. A) Node size was mapped to average transcript abundance, edge colour was mapped to the sign of the correlation in the “Overall” landscape and node colour was mapped to Gene Ontology process. Node shape was mapped as follows: TFs (triangles), signalling molecules (squares) and chromatin remodelers (diamonds). All other genes (i.e. non-regulators) were mapped as ovals. B) Node size was mapped to number of connections. C) The transcription landscape built from connections with correlation coefficients >0.99.
Network connectivity.
| O | P | W | P | P | D | |
|
| 22.92 | 48.53% | 46.60% | 27.08% | 7.09% | 3.49% |
|
| 37.21% | 16.55 | 39.99% | 30.42% | 7.27% | 2.99% |
|
| 37.95% | 29.89% | 18.74 | 27.98% | 7.50% | 2.88% |
|
| 20.09% | 22.92% | 19.37% | 13.20 | 4.20% | 2.56% |
|
| 1.78% | 2.10% | 1.99% | 0.98% | 5.87 | 3.06% |
|
| 1.29% | 1.14% | 1.09% | 0.90% | 0.95% | 3.03 |
Clustering coefficient (diagonal) for each network.
Percent overlap computed from the ratio of common links divided by the total number of unique links for positive (above diagonal) and negative (below diagonal) links across each pair-wise network comparison.
Composition of modules containing muscle subunits in the Always Correlated network.
| Slow twitch fibres | Fast twitch fibres | Embryonic fibres | Other structural protein genes | Other muscle protein genes | |
| Slow twitch module | MYL2, TNNT1, MYBPC1 | TPM2, LDB3 | MB, CA3, SH3BGR | ||
| Fast twitch module | MYH1, TNNT3, MYOM2, TPM1, ACTN3, MYBPC2, TMOD4 |
| NEB (tv), MYPN, SSPN | RYR1, ALDOA, ATP2A1, ENO3, CKM, PFKM, FBP2, PGAM2, DHRS7C, JPH2 | |
| In another module/cluster | [MYL6B, (HSPB3)] [TNNC1 (TRDN)] [TPM3 (HDAC3)] | [MYL4 (TMSB10, TMEM204, TUBB2B, | [MYBPH, MYBPHL[ [TTN (tv), NEB (tv)] [LMOD2, TTN (tv)] [SMPX (PDE4DIP)] [MYOT (GHITM)] [MYBPC3 (SFRS7)] [KBTBD5 (RPS6KA3, HSPB8)] [TNNT2 (TCF7L2, CTNNB1, NAV3, PSRC1, SH3PXD2A)] [TCAP, PDLIM3, (HSPB7)] [TRIM63, (SLC7A8)] | ||
| Not in the Always Correlated network | MYH7, MYL3, MYOZ2, MYOM3, TNNI1, TMOD1 | MYL1, MYH2, MYH4, TNNC2, MYOZ1, TNNI2, MYLPF | MYH3, MYL7 | Many other genes | Many other genes |
Fibre type assignments are from [31], except for TMOD1 and TMOD4 [64] and MYL6B [65].
italics negatively correlated with the majority of the members of the module.
tv – transcript variant.
[] module or cluster.
() non-structural protein.
Figure 4The expression profiles of mammalian muscle over development.
Representatives from each of the main functional modules are shown: Immune, nuclear and mitochondrially-encoded mitochondrial genes (A); Extra-cellular matrix, fat and glycolysis gene transcription (B); Vasculature, fast and slow twitch muscle (C); and Cell cycle, ribosome and neuron gene transcription (D).
Figure 5The connectivity of all genes in the Always Correlated transcriptional landscape versus the transcriptional regulators.
The transcriptional regulators correlated with the functional modules.
| Module gene ontology | Transcription Factors common to both approaches | Transcriptional Regulators present in the Always connected landscape modules and not in Top 10 | Top 10 correlated Transcription Factors in the Overall landscape using the Module-to-Regulator analysis |
| Cell cycle | FOXM1 | No TF/NDC80 | E2F1, CBX2, TCF19, MYBL2, FOXM1, CDCA4 |
| Glycolysis/Fast Twitch | none | EP400, ZBTB7A, MAFB, CBX8, SIX1 | BGLAP, TBX15, PAX2, HLF |
| Mitochondria (nuclear-encoded) | none | ESRRA | CEBPB |
| Mitochondria (mito-encoded) | none | EBF3, PAWR/EFNA2, SMURF1, ADRBK2/no chromatin | LBX1, ZNF358, ATF4, THRB, NFIX, BHLHB3, BGLAP, CREB3L4, GPS2, ZBTB7B |
| Extracellular matrix | PHF19 | PCBD1, PBX3/CARHSP1 DCLK1, ANGPTL1 | TIAM1 |
| Immune system | IRF1 | none | IRF1, RNF14, TEAD1, LRRFIP1, EPAS1, NR1D2, RORC, PCGF5, PHF12, HOXD8 |
| Microvasculature | SOX17 | none | SOX17, HES2, FOXF1, TAL1, SOX18, LHX6, TCF7, LMO2, NRIP2, ZHX1 |
| Ribosomal proteins | none | none | SCMH1, CITED1, RBM4, ILF2, PRDM16, PRMT1, DPF3, NCOA5, CDCA7L, TRIM28 |
| Fat | none | none | TAF6L, DRAP1, ZNF219, ZNF496, CITED1, PIAS1 |
| Neural | TLX3 | ZNF621/ACCN2, AKAP7/no chromatin | TLX3, IRX6 |
Order = TFs followed by Signalling molecules then Chromatin remodellers with “/” separating the 3 groups.
Order = descending strength of absolute average correlation coefficient. References providing experimental evidence supporting our computational output are provided.
Figure 6The expression profiles of the neuron module genes across the Overall landscape (i.e. the 10 Piedmontese and 10 Wagyu development time points, plus the starvation-realimentation experiment).
The expression profile of the neurogenesis TF TLX3 is also shown, which did not make the module by PCIT but was ranked top by the downstream (nerve) module-to-regulator analysis.
Figure 7Range in expression level of genes versus frequency.
Distribution of genes in postnatal Piedmontese and Wagyu samples in red and in all Piedmontese and Wagyu samples in blue. Including pre-natal as well as post-natal muscle stages increases the exploration of parametric expression space. An increase in the frequency of genes experiencing moderate-high changes in expression level reduces the formation of spurious edges in the computed co-expression networks.