Literature DB >> 25171496

Synergistic transcriptional and post-transcriptional regulation of ESC characteristics by core pluripotency transcription factors in protein-protein interaction networks.

Leijie Li1, Liangcai Zhang2, Guiyou Liu3, Rennan Feng4, Yongshuai Jiang5, Lei Yang5, Shihua Zhang6, Mingzhi Liao1, Jinlian Hua7.   

Abstract

The molecular mechanism that maintains the pluripotency of embryonic stem cells (ESCs) is not well understood but may be reflected in complex biological networks. However, there have been few studies on the effects of transcriptional and post-transcriptional regulation during the development of ESCs from the perspective of computational systems biology. In this study, we analyzed the topological properties of the "core" pluripotency transcription factors (TFs) OCT4, SOX2 and NANOG in protein-protein interaction networks (PPINs). Further, we identified synergistic interactions between these TFs and microRNAs (miRNAs) in PPINs during ESC development. Results show that there were significant differences in centrality characters between TF-targets and non-TF-targets in PPINs. We also found that there was consistent regulation of multiple "core" pluripotency TFs. Based on the analysis of shortest path length, we found that the module properties were not only within the targets regulated by common or multiple "core" pluripotency TFs but also between the groups of targets regulated by different TFs. Finally, we identified synergistic regulation of these TFs and miRNAs. In summary, the synergistic effects of "core" pluripotency TFs and miRNAs were analyzed using computational methods in both human and mouse PPINs.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25171496      PMCID: PMC4149371          DOI: 10.1371/journal.pone.0105180

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

The capacity to differentiate into different cell types, a property known as pluripotency, is a defining property of embryonic stem cells (ESCs). ESCs are derived from the inner cell mass of the mammalian blastocyst [1], [2]. Pluripotency may be conferred on somatic cells following their fusion with ESCs [3]. During this process, the transcription factor (TF) NANOG is specifically expressed, and this may facilitate fusion-induced pluripotency [4]. Moreover, human and mouse fibroblasts can be reprogrammed into ES-like cells which are called induced pluripotent stem cells (iPS) by forced expression of other TFs (OCT4, SOX2, Klf4, and c-Myc) [5]–[7]. The quality of iPS is enhanced upon selection of cells that express endogenous OCT4 or NANOG [8], [9]. Recently, Deng et al. reprogramed somatic cells into pluripotent cells using a combination of seven small-molecule compounds and called them CiPS [10]. Epigenetic modifications (DNA methylation, histone modification, miRNAs and other methods of epigenetic regulation) have also been found to play important roles in the maintenance of ‘stemness’ [11]–[13]. These results indicate that in addition to the genetic factors affecting the maintenance of pluripotency, complex epigenetic factors are also involved in the transformation of ESCs. In order to understand the mechanism by which pluripotency is established and maintained in ESCs, further effort will be required to research all aspects of the properties of molecules and their complex interactions in the biological networks which are involved in transcriptional and post-transcriptional regulation. According to previous studies, a small set of TFs, including OCT4, SOX2 and NANOG comprise the “core” pluripotency factors in ESCs [14]. OCT4 has long been considered to play essential roles in maintaining pluripotency in vivo and in vitro [15]. In fact, the concentration of OCT4 is crucial for pluripotency: reduced expression evokes trophoectoderm development, whereas enhanced expression leads to primitive endoderm differentiation [16]. As a transcriptional partner of OCT4, SOX2 assembles on regulatory elements of target genes together with OCT4 to collaborate in transcriptional control, without directly interacting with OCT4 protein [17]. The function of NANOG is to promote the self-renewal of ESCs and alleviate the requirement for Leukemia Inhibitory Factor (LIF) [18], [19]. Among OCT4 targets, about half are associated with SOX2. Furthermore, more than 90% of the target genes shared by OCT4 and SOX2 are also associated with NANOG [20]. Based on the above results, some researchers have constructed biological networks that involve these TFs, and analyzed their properties during the development of ESCs [21]–[24]. In a word, as key factors in the maintenance of the pluripotency and self-renewal of ESCs, OCT4, SOX2 and NANOG coordinate the regulation of downstream genes. Besides the traditional genetic impacts on the maintenance of ESC pluripotency, epigenetic regulation is also involved in the process of ESC development. In particular, more and more studies have found that miRNAs play important roles during the development of ESCs [25]–[27]. miRNAs are endogenous single strand non-coding RNAs which can inhibit target mRNA expression in a post-transcriptional manner [28]. It is characteristic of miRNAs that they regulate target genes in a minor manner and show temporal and spatial specificity. They may form a complex interaction network with other biological molecules in vivo. However it is not clear how the target genes of “core” pluripotency TFs regulate ESC development synergistically with miRNAs. In this study, we identified protein-protein interaction networks (PPINs) and analyzed the topological properties of the target genes of OCT4, SOX2 and NANOG in human and mouse ESCs. Further, we explored the effects of miRNAs on the post-transcriptional regulation of the target genes of these three “core” pluripotency TFs. We found that the centrality of “core” pluripotency transcription factor target genes is higher than that of randomly selected genes in PPINs. Furthermore, when genes are regulated by more “core” pluripotency TFs, they show more properties of centrality. The target genes regulated by both transcriptional and post-transcriptional methods also have higher centrality properties in PPINs. These results indicate that there are both the complex interactions between different “core” pluripotency TFs during ESC development within transcriptional levels and the interactions occur across both transcriptional and post-transcriptional levels in biological networks.

Materials and Methods

Dataset of transcription factor targets

In order to obtain comprehensive target datasets of “core” pluripotency TFs in ESCs, we manually collected related articles in PubMed. Finally, 10 articles and their corresponding datasets were extracted and used in this study (Table 1). The human database contained 3,949 entries, including 623 targets of OCT4, 1,436 targets of SOX2 and 1,886 targets of NANOG (Figure 1 left). The mouse database contained 25,222 entries, including 12,637 targets of OCT4, 5,971 targets of SOX2 and 6,614 targets of NANOG (Figure 1 right). The detailed targets of these three TFs are shown in Table S1 and Table S2.
Table 1

Targets of “core” pluripotency TFs in ESCs.

SpeciesYearsPubMed IDOCT4-targetsSOX2-targetsNANOG-targets
human200516153702 [20] 62312791687
human200818443585 [46] 0734988
mouse200516518401 [47] 77801027
mouse200818555785 [48] 436929412612
mouse200818358816 [21] 08191284
mouse200818692474 [25] 432033803114
mouse200818347094 [49] 215101936
mouse201020362542 [50] 101012
mouse201121477851 [51] 751800
mouse201323582322 [52] 441716992492
Figure 1

Venn diagram showing the targets of “core” pluripotency TFs.

(A) Human targets of TFs. (B) Mouse targets of TFs.

Venn diagram showing the targets of “core” pluripotency TFs.

(A) Human targets of TFs. (B) Mouse targets of TFs.

Datasets of Protein-Protein interactions

In order to avoid any bias due to the data source, protein-protein interaction data were also downloaded from two different databases: the Biological General Repository for Interaction Datasets (BioGRID) version 3.2.110 (http://theBioGRID.org/) and the Human Protein Reference Database (HPRD) (http://www.hprd.org/) [29], [30]. We then removed duplicated edges and selfloops using Cytoscape and analyzed topological properties using the NetworkAnalyzer tools in Cytoscape [31]–[33]. The datasets from BioGRID contained 9,698 nodes with 52,284 edges and 4,281 nodes with 7,415 edges (excluding pure high throughput experimental data) in human and mouse respectively. The HPRD dataset contained 9,453 nodes with 36,867 edges (excluding pure high throughput experimental data).

Dataset of miRNA targets

Targets of miRNAs were downloaded from two different databases, which cover both human and mouse species. The first one is the miRecords dataset (http://mirecords.biolead.org/), which includes 284 miRNAs, 1,101 targets with 2,087 edges in the human and 145 miRNAs, 266 targets with 442 edges in the mouse [34]. The second database is TarBase (http://diana.cslab.ece.ntua.gr/DianaToolsNew/index.php), which includes 111 miRNAs, 862 targets with 1,093 edges in the human and 44 miRNAs, 75 targets with 104 edges in the mouse [35]. The targets of miRNAs listed in miRecords consist of experimentally verified targets and predicted targets which are an integration of predicted miRNA targets produced by 11 of the following miRNA target prediction tools: DIANA-microT, MicroInspector, miRanda, MirTarget2, miTarget, NBmiRTar, PicTar, PITA, RNA22, RNAhybrid, and TargetScan/TargertScanS. Since TarBase only includes experimentally verified miRNA targets, their scale of targets is less than those in miRecords. In order to obtain robust results, all the datasets used in this work were experimentally verified.

Analysis of protein interaction network topological properties

Analyzing the topological properties of PPINs not only reveals the complex molecular interaction pathways, but also provides reference points for the detection of important transcriptional factors and downstream targets involved in maintaining the pluripotency of ESCs. Many topological properties of PPINs were used in this analysis. In this study, the Average Shortest Path Length (ASPL), Betweenness (BC), Closeness, Clustering Coefficient (CC), Degree, Eccentricity, Neighborhood Connectivity (NC), Radiality, Stress and Topological Coefficient (TC) were used to analyze the targets of OCT4, SOX2, NANOG and miRNAs (TarBases and miRecords) in the PPINs of the BioGRID and HPRD databases (Table 2). All the analyses were processed with Cytoscape and its NetworkAnalyzer tools.
Table 2

Definitions of the topological properties.

PropertyFunctionDescription
Average Shortest Path Length (ASPL)ASPLThe average number of steps for all shortest paths.
Betweenness Centrality (BC) k,j denotes shortest paths between node pairs K and j, kij denotes that pass through the node i.
Closeness Centrality (CC) L(n,m) is the length of the shortest path between two nodes n and m. The Closeness centrality of each node is a number between 0 and 1.
Clustering Coefficient (CC) di is the number of neighbors of i, and ei is the number of connected pairs between all neighbors of i.
Degreedi The number of links to node i.
EccentricityEThe maximum node eccentricity (E) can be desicribed as the network diameter, that is the largest distance between two nodes.
Neighborhood Connectivity (NC)dIt is defined as the average connectivity of all neighbors of the node.
RadialityR = D-ASPL+1This attribute is a node centrality index computed by the diameter (D) of a node n's the connected component plus 1 and subtracting the average shortest path length (ASPL).
Stressss of a node n is the number of shortest paths passing through n.
Topological Coefficient (TC) J(i, j) is the number of neighbors shared between the nodes i and j, plus one if there is a direct link between i and j. avg(J(i,j)) is the average value of J(i, j). di is degree of node i.
All the above topological properties can be used to measure the centrality of nodes in biological networks. General speaking, the higher the centrality of one node, the more important roles it plays in biological networks. For detailed description, we took some properties as examples to illustrate their meanings. ASPL is defined as the average shortest path length between a node and all the nodes in biological networks. Closeness centrality is defined as the reciprocal of the average shortest path length of one node which can be used as a measure of how fast information spreads from a given node to all other reachable nodes in biological networks. In undirected biological networks (such as PPINs), CC of a node is defined as the proportion of the observed connections between the neighbors of this node against the maximum number of possible connections among them. CC is used to indicate the close extent of the local neighborhood of one node. Degree is one simplest and most used topological index, which is defined as the number of nodes directly connected to a given node. TC is a relative measurement of the tendency of one node in biological networks to have shared interactive partners with other nodes. For more in-depth interpretation of these concepts, one can get the exact definitions of these topological properties from Table 2.

Results

The targets of “core” pluripotency TFs

A total of 2,512 human pluripotency “core” TF targets were identified in the collected articles. Among these targets, 42.9% were shared by at least two TFs, corresponding to 1,017 targets. The number of OCT4, SOX2 and NANOG targets was 623, 1,435 and 1,885 respectively, while the proportion shared by other TFs was 77.5%, 69.5% and 54.5% respectively (Figure 1A). Similar results were found in the mouse species. The total number of TF targets was 15,714, including 6,393 targets that were shared by two or three other TFs, accounting for 40.7% of the total number. Among the TF targets, 12,636 were OCT4 targets, 5,970 were SOX2 targets and 6,613 wee NANOG targets. Of these, 45.0%, 86.8% and 76.2%, respectively, were shared by other TFs (Figure 1B). As the results show, the target numbers and proportions that shared by different TFs were differences between human and mouse. The reason may be contributed from the research depth on human and mouse. As our manuscript show (Table 1), compared with the up to 8 literatures in mouse, there were only 2 literatures that contain at least two of the three “core” TF targets in human. As one extremely example, there was only 1 literature that contain OCT4 targets in human. In order to overcome the dataset bias and enrich the information in human, more works should be done for the genome wide target detection of these three “core” TFs in human.

Mapping targets into PPINs

We mapped the targets of the three TFs and miRNAs into the PPINs and obtained sub-networks consisting only of “core” pluripotency TF targets and their direct neighbors (Figure S1 to S9). From both BioGRID and HPRD results, it was evident that the proportions of TF-miRNA targets (targets which are regulated by both these three “core” TFs and miRNAs) and TF-non-miRNA targets (targets which are regulated only by these three “core” TFs but not regulated by miRNAs) were smaller compared with the proportion of non-TF-non-miRNA genes (genes which are neither regulated by these three “core” TFs nor regulated by miRNAs) in both human and mouse (Figure 2, 3 and 4).
Figure 2

The distribution of targets of “core” pluripotency TFs in human BioGRID.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

Figure 3

The distribution of targets of “core” pluripotency TFs in HPRD.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

Figure 4

The distribution of targets of “core” pluripotency TFs in mouse BioGRID.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

The distribution of targets of “core” pluripotency TFs in human BioGRID.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

The distribution of targets of “core” pluripotency TFs in HPRD.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

The distribution of targets of “core” pluripotency TFs in mouse BioGRID.

(A) miRNA targets obtained from miRecords. (B) Targets of miRNAs from TarBase.

Analysis of topological properties

The topological properties were analyzed with Cytoscape version 3.0.2, especially the NetworkAnalyzer tools. To begin with, the most connected components of protein-protein interaction networks were extracted from BioGRID, including 9,552 nodes with 52,202 edges in the human and 3,831 nodes with 7,123 edges in the mouse. The same analysis identified 9,205 nodes and 36,748 edges in the most connected component of the HPRD database. Finally, topological properties were analyzed with NetworkAnalyzer and filtered using strict statistical parameters (P values<0.05 with t test, processed with R version 3.0.2).

Comparison of centrality properties between TF-targets and non-TF-targets

Following, we compared the topological properties between TF-targets (genes that are regulated by the “core” TFs, including OCT4, SOX2 and NANOG) and non-TF-targets (genes that are not regulated by any of the “core” TFs, including OCT4, SOX2 and NANOG). First we analyzed human PPINs. The results showed that the ASPL of SOX2-targets and NANOG-targets was shorter compared with non-targets, in both the BioGRID and HPRD datasets. We also found that radiality of SOX2-targets and NANOG-targets was greater than non-targets in both BioGRID and HPRD resources. Furthermore, the degree of the NANOG-targets was also significantly different compared with non-NANOG-targets in BioGRID and HPRD, indicating that many proteins are connected with NANOG-targets. Without consistent significant results, the SOX2-targets were only found to have higher degree values compared with non-SOX2-targets in HPRD, while similar results were not found in BioGRID. With regard to OCT4-targets, BC differed significantly between OCT4-targets and non-OCT4-targets in both BioGRID and HPRD databases, indicating that the shortest paths going through OCT4 targets were more than a random choice. This indicates that OCT4 targets may be internal module proteins and are more likely to locate in the hub position in networks. In summary, a certain degree of higher centrality in PPINs was found in human “core” pluripotency targets compared with non-TF-targets (Table 3).
Table 3

Human PPIN topological properties of TF-targets vs NON-TF-targets.

PropertyDatasetOCT4 vs NON-OCT4SOX2 vs NON-SOX2NANOG vs NON-NANOG
AverageAveragePAverageAveragePAverageAverageP
ASPLBioGRID3.5741713.607040.25613.5287783.6119390.00012283.5214643.6143022.332E-6
HPRD4.0930964.1468390.14294.088064.1494180.023934.0979434.1498020.04426
BetweennessBioGRID1.75347E-040.0014849812.439E-046.74666E-040.0015009160.12876.40797E-040.0015205570.07553
HPRD2.79325E-040.00257121.69E-060.0028823120.0024620860.81790.0023062370.0025114420.8878
ClosenessBioGRID0.28623410.2900580.22530.29196290.28977270.4242.92E-010.2897020.288
HPRD0.25576320.25894280.54140.25733450.258950.67720.25893720.25882280.9757
CCBioGRID0.21010710.18287420.083250.18158010.18395220.80990.18996670.18316930.448
HPRD0.10453420.10309160.89970.098257810.103520950.49490.098934910.103571380.5041
DegreeBioGRID18.3250820.278290.31629.6981419.47650.118531.1076719.135520.03128
HPRD7.8553857.8026950.93319.275667.6901150.0095519.0706967.6750230.008655
EccentricityBioGRID8.1981428.193280.90398.1316178.1982440.050128.1363128.1990933.91E-02
HPRD9.6153859.5739480.55999.6129039.5724550.44749.592939.5735770.7064
NCBioGRID126.1499119.29060.5021122.932119.2540.5943130.1599118.46640.08831
HPRD39.2293135.44780.0573137.1333835.456850.226536.6040435.472860.3295
RadialityBioGRID0.78548580.78218370.17090.78897040.78177515.68E-050.78940080.78159061.308E-6
HPRD0.77840530.77405640.088910.77866790.7738590.010690.77764150.77385460.0332
StressBioGRID305873.1450004.90.02379934659.5407185.90.25349.08E+05399436.90.174
HPRD310804.13571700.4041447976.2348391.20.1233417604349232.80.1987
TCBioGRID0.20219130.18659860.12790.17817810.18781230.16520.18212230.18761210.3843
HPRD0.20134620.19588960.60860.18934620.19660060.31030.19381950.19630810.7063
Similar results were also obtained in the mouse. For NANOG-targets, 6 measurements were found to differ significantly from those in non-NANOG-targets, including ASPL, Closeness, Degree, NC, Radiality and Stress. For SOX2-targets, 5 measurements in total were significantly different compared with non-SOX2-targets: ASPL, Closeness, Degree, NC and Radiality. For OCT4-targets, 5 measurements were found to differ from non-OCT4-targets, including BC, Degree, NC, Stress and TC. Taking these measurement results together, the target genes of “core” pluripotency transcription factors show higher centrality properties in mouse PPINs (Table 4).
Table 4

Mouse PPIN topological properties of TF-targets vs NON-TF-targets.

PropertyOCT4 vs NON-OCT4SOX2 vs NON-SOX2NANOG vs NON-NANOG
AverageAveragePAverageAveragePAverageAverageP
ASPL4.9223324.9181820.88764.8740964.9461490.010264.8829614.9432730.03216
BC0.001190.0006490.0035230.00113670.00096390.41230.00138650.00081270.06287
Closeness0.20841170.20850470.93320.21019890.20750070.011710.21004180.20750650.0171
CC0.09429130.10208570.36490.09509190.09753130.75530.09645190.0968160.9627
Degree4.0760542.9106381.95E-064.1964023.4633560.012874.3649893.3417360.00176
Eccentricity10.9401410.87660.064610.901810.930720.378810.8915710.93760.1595
NC41.778955.007624.25E-0641.1232248.354150.00348741.6206848.294180.006982
Radiality0.75485420.75511360.88760.7578690.75336570.010260.75731490.75354540.03216
Stress118484.9265591.250.002851117399.7294174.790.263141375.379456.660.03827
TC0.19170320.17307310.020490.18529460.18636030.88740.19031510.18346690.3597

Consistency analysis of multiple “core” pluripotency TF regulations

Through the analysis of the distributions of “core” pluripotency TF targets, we identified many genes regulated by at least two TFs (Figure 1). This result indicates that these TFs may be involved in complex interactions and execute similar functions synergistically as cells progress along the pathway of ESC development. To investigate this further, we continued to explore cooperation between the TF regulators through their topological properties. As expected, no difference in the centrality properties was found between 1TF-targtes, 2TF-targets and 3TF-targets in human PPINs, whichever protein-protein interaction datasets were used (Table 5).
Table 5

Human PPIN topological properties of 1TF (OCT4 or SOX2 or NANOG), 2TF (OCT4-SOX2 or OCT4-NANOG or SOX2-NANOG) and 3TF (OCT4-SOX2-NANOG) targets.

PropertyDataset1TF vs 2TF1TF vs 3TF2TF vs 3TF
AverageAveragePAverageAveragePAverageAverageP
ASPLBioGRID3.5259563.5259340.99953.5259563.5543690.48113.5259343.5543690.5418
HPRD4.1359294.0737810.19734.1359294.0680190.2054.0737814.0680190.9237
BCBioGRID0.00028380.00111670.28590.000283860.000198390.17220.001116730.000198390.2389
HPRD0.00341770.00244440.7380.003417720.000311960.15440.002444480.000311960.2699
ClosenessBioGRID0.29149320.29454550.53330.29149320.28597540.13320.29454550.28597540.08464
HPRD0.25493810.26132420.36070.25493810.25676220.81130.26132420.25676220.6061
CCBioGRID0.19036920.18010440.52890.19036920.20277730.58450.18010440.20277730.3524
HPRD0.10099590.09384280.58380.100995960.10546170.78440.093842830.10546170.5175
DegreeBioGRID25.5071538.220670.323625.5071518.896740.086738.2206718.896740.1302
HPRD8.23574710.0780350.087828.2357478.3149170.937610.0780358.3149170.1749
EccentricityBioGRID8.1351358.1256980.8768.1351358.1847830.3448.1256988.1847830.3633
HPRD9.6563949.5867050.46339.6563949.5635360.38449.5867059.5635360.8494
NCBioGRID129.8866117.06110.2598129.8866135.49830.7446117.0611135.49830.3063
HPRD37.4839435.625330.392337.4839439.035770.609135.6253339.035770.2953
RadialityBioGRID0.78917250.78892360.93630.78917250.78713590.54330.78892360.78713590.6429
HPRD0.77534470.77957250.20620.77534470.77967190.23820.77957250.77967190.9807
StressBioGRID4.84E+0514604382.79E-01484355352091.70.21631460438352091.70.2189
HPRD350199.8518785.50.1401350199.8343444.90.941518785.5343444.90.1942
TCBioGRID1.83E-010.17753356.09E-010.18348240.19442520.4690.17753350.19442520.3052
HPRD1.98E-010.17924371.33E-010.19792560.20637390.57570.17924370.20637390.105
Similar results were also found in the mouse. Among the ten centrality properties, none of them was found to differ between 1TF-targets and 2TF-targets. When compared with 3TF-targets, only ASPL, Closeness and Radiality were found to be different from those of 2TF-targets. The greatest diversity was found between the groups of the 1TF-targets and 3TF-targets, where we found differences in five measurements: ASPL, Closeness, Eccentricity, Radiality and Stress (Table 6).
Table 6

Mouse PPIN topological properties of 1TF (OCT4 or SOX2 or NANOG), 2TF (OCT4-SOX2 or OCT4-NANOG or SOX2-NANOG) and 3TF (OCT4-SOX2-NANOG) Targets.

Property1TF vs 2TF1TF vs 3TF2TF vs 3TF
AverageAveragePAverageAveragePAverageAverageP
ASPL4.9525084.9765650.5524.9525084.8294780.00059454.9765654.8294780.000944
BC0.00094760.00141710.43490.00094760.00129050.10840.00141710.00129050.8358
Closeness0.20699690.20649690.73740.20699690.21208130.00022440.20649690.21208130.0008233
CC0.09483010.0939840.93630.09483010.09712060.81940.0939840.09712060.792
Degree3.7870313.9025110.8223.7870314.5429640.067743.9025114.5429640.2474
Eccentricity10.9658710.963070.952810.9658710.869240.0210910.9630710.869240.06803
NC43.3136241.096610.498343.3136241.056020.467941.0966141.056020.991
Radiality0.75296820.75146470.5520.75296820.76065760.00059450.75146470.76065760.000944
Stress91310.31140625.70.388991310.31135029.880.04418140625.7135029.880.9252
TC0.19451230.1896620.64520.19451230.18726130.44930.1896620.18726130.8355

Modularity within the inner “core” pluripotency TFs with shortest path length analysis

When we investigated the regulation of multiply “core” pluripotency TFs during the development of ESCs in PPINs, one question was triggered about whether there are closer relationships between targets of these TFs. One hypothesis suggests that the connections within and between TF targets should be closer than between other genes in PPINs. In other words, module properties are expected within and between the “core” pluripotency TFs. In order to verify this hypothesis, we performed shortest path length (SPL) analysis across human and mouse species. We found that the background averages of PPIN SPLs were 4.227, 4.201 and 5.125 in HPRD, human BioGRID and mouse BioGRID respectively. First, the smaller SPLs were detected within TF targets compared with the background SPLs. As shown in Table 7, 8 and 9, the SPLs of TF targets, including OCT4, SOX2 and NANOG, were all smaller than those of other proteins in the PPINs, no matter which source of PPIN data was used. Second, the common targets of at least two TFs showed smaller SPLs compared with other proteins in the PPINs (Table 7, 8 and 9). When the number of TFs in combination was 2, the corresponding P values of the t tests were 4.99E-5, 0.001 and 3.90E-286; while when the number was 3, the P value was 7.98E-258 for values from the HPRD database (Table 7). Similar results were found for other databases (Table 8 and 9). Finally, we compared the distances between groups of different TF targets with SPLs and found that they were significantly different, especially when the group comprised the common targets of three TFs (Table 7, 8 and 9). From the HPRD and mouse BioGRID databases we found that the SPLs of three TFs were smaller than those of most other groups, but similar results were not observed in the PPIN of the human BioGRID. Summarizing the above results, we found module properties not only within the targets regulated by common or multiple “core” pluripotency TFs but also between the groups of targets regulated by different TFs.
Table 7

Analysis of shortest path length in HPRD.

only-OCT4only-SOX2only-NANOGOCT4-SOX2OCT4-NANOGSOX2-NANOGOCT4-SOX2-NANOG
SPL4.0504.0974.1704.0434.0984.0413.960
Others1.95E-202.25E-721.93E-534.99E-050.0013.90E-2867.98E-258
only-SOX20.017
only-NANOG2.93E-091.91E-18
OCT4-SOX20.8820.2210.007
OCT4-NANOG0.2400.9780.0620.349
SOX2-NANOG0.6277.05E-112.10E-880.9610.116
OCT4-SOX2-NANOG1.63E-062.31E-431.45E-1260.0465.28E-051.08E-19

(In this table, SPL indicates shortest path and the first line is the value of shortest path length of the targets; others is the p-value).

Table 8

Analysis of shortest path length in human BioGRID.

only-OCT4only-SOX2only-NANOGOCT4-SOX2OCT4-NANOGSOX2-NANOGOCT4-SOX2-NANOG
SPL4.2213.8653.9663.8624.5573.9443.989
Others4.37E-012.00E-3230.00E+002.59E-081.43E-170.00E+001.76E-111
only-SOX22.27E-46
only-NANOG7.13E-233.37E-26
OCT4-SOX29.52E-090.9580.080
OCT4-NANOG6.70E-107.71E-692.37E-462.51E-14
SOX2-NANOG4.46E-269.35E-153.45E-030.1711911.24E-48
OCT4-SOX2-NANOG1.97E-222.57E-282.49E-020.014331.92E-512.24E-05

(In this table, SPL indicates shortest path length and the first line is the value of shortest path length of the targets; others is the p-value).

Table 9

Analysis of shortest path length in mouse BioGRID.

only-OCT4only-SOX2only-NANOGOCT4-SOX2OCT4-NANOGSOX2-NANOGOCT4-SOX2-NANOG
SPL5.2144.8474.7054.9594.9524.8404.765
Others0.00E+005.35E-890.00E+002.44E-1312.93E-1555.94E-750.00E+00
only-SOX21.21E-155
only-NANOG0.00E+008.31E-24
OCT4-SOX23.17E-2832.33E-115.17E-116
OCT4-NANOG0.00E+002.14E-102.44E-1144.92E-01
SOX2-NANOG5.59E-1297.10E-016.51E-173.34E-102.01E-09
OCT4-SOX2-NANOG0.00E+002.86E-081.35E-112.04E-1392.40E-1396.23E-06

(In this table, SPL indicates shortest path length and the first line is the value of shortest path length of the targets; others is the p-value).

(In this table, SPL indicates shortest path and the first line is the value of shortest path length of the targets; others is the p-value). (In this table, SPL indicates shortest path length and the first line is the value of shortest path length of the targets; others is the p-value). (In this table, SPL indicates shortest path length and the first line is the value of shortest path length of the targets; others is the p-value).

Post-transcriptional regulation effects on “core” pluripotency TF targets

Based on the evidence that epigenetic regulation may play important roles in ESC development, we attempted to analyze the effects of post-transcriptional regulation on the targets of “core” pluripotency TFs in PPINs. As typical forms of post-transcriptional regulation, miRNA targets were obtained for further analysis from two different sources, including miRecords and TarBase. Results show that there were some significant different characteristics between TF-miRNA targets and TF-non-miRNA targets in the both two PPINs from human BioGRID and HPRD databases. The differences were found in both miRecords and TarBase. First, we found that OCT4-miRNA targets were different from OCT4-non-miRNA targets in several centrality properties, including Degree, Stress and TC (Figure 5). Second, observation of both SOX2-miRNA targets and NANOG-miRNA targets revealed that they differed from TF-non-miRNA targets in several measurements. The measurements that differed between SOX2-miRNA and SOX2-non-miRNA targets were ASPL, Closeness, Degree, Eccentricity, Radiality, Stress and TC (Figure 6). And the measurements, including ASPL, Clossness, Degree, Eccentricity, Radiality, Stress and TC, were different between NANOG-miRNA targets and NANOG-non-miRNA targets (Figure 7). Furthermore, these results were observed across both the BioGRID and HPRD protein-protein interaction databases and the miRNA target databases miRecords, TarBase.
Figure 5

Analysis of topological properties of OCT4-miRNA-targets in human PPINs.

(A) Comparison of OCT4-miRNA and OCT4-non-miRNA targets. (B) Comparison of OCT4-miRNA and non-OCT4-non-miRNA targets.

Figure 6

Analysis of topological properties of SOX2-miRNA-targets in human PPINs.

(A) Comparison of SOX2-miRNA and SOX2-non-miRNA targets. (B) Comparison of SOX2-miRNA and non-SOX2-non-miRNA targets.

Figure 7

Analysis of topological properties of NANOG-miRNA-targets in human PPINs.

(A) Comparison of NANOG-miRNA and NANOG-non-miRNA targets. (B) Comparison of NANOG-miRNA and non-NANOG-non-miRNA targets.

Analysis of topological properties of OCT4-miRNA-targets in human PPINs.

(A) Comparison of OCT4-miRNA and OCT4-non-miRNA targets. (B) Comparison of OCT4-miRNA and non-OCT4-non-miRNA targets.

Analysis of topological properties of SOX2-miRNA-targets in human PPINs.

(A) Comparison of SOX2-miRNA and SOX2-non-miRNA targets. (B) Comparison of SOX2-miRNA and non-SOX2-non-miRNA targets.

Analysis of topological properties of NANOG-miRNA-targets in human PPINs.

(A) Comparison of NANOG-miRNA and NANOG-non-miRNA targets. (B) Comparison of NANOG-miRNA and non-NANOG-non-miRNA targets. Further, in order to identify the more obvious effects of miRNA regulation on the development of ESCs, we compared TF-miRNA target genes to non-TF-non-miRNA target genes in the human. Interestingly, as expected we found a significant difference between TF-miRNA and non-TF-non-miRNA targets on the topological properties of components of PPINs. First, OCT4-miRNA targets were found to have higher centrality properties compared with non-OCT4-non-miRNA genes in PPINs, with higher values of ASPL, BC, Degree, Stress, Radiality and TC (Figure 5). Second, the measurements which were higher in SOX2-miRNA targets compared with non-SOX2-non-miRNA genes were ASPL, BC, Closeness, Degree, Eccentricity, Radiality, Stress and TC (Figure 6). Third, up to 9 measurements differed between NANOG-miRNA targets and non-NANOG-non-miRNA genes in PPINs, excluding CC (Figure 7). These results were supported by all the different data sources, including BioGRID, HPRD, miRecords and TarBase. In order to overcome species bias, we conducted the same experiments in mouse ESCs. Interestingly, the results showed the same tendency as above in data from all the dataset sources. Firstly, the TF-miRNA targets also showed higher centrality properties compared with TF-non-miRNA targets (Figure 8, 9 and 10). The numbers of measurements that were different between TF-miRNA targets and TF-non-miRNA targets were up to 7, 9 and 5 in OCT4, SOX2 and NANOG respectively. The centrality properties that were not found to differ between OCT4-miRNA and OCT4-non-miRNA targets were CC, NC and TC. For SOX2, only NC showed no difference. The higher measurements in NANOG-miRNA targets included ASPL, Closeness, Degree, Eccentricity and Radiality. Second, the TF-miRNA targets differed from the non-TF-non-miRNA target genes in PPINs (Figure 8, 9 and 10). The numbers of measurements that were different between TF-miRNA targets and non-TF-non-miRNA targets were up to 9, 10 and 7 in OCT4, SOX2 and NANOG respectively. The measurements that were not found to differ were CC and BC, CC and TC in OCT4 and NANOG respectively. In summary, miRNAs play important roles during the development of ESCs and participate in complex interactions with “core” pluripotency TFs from a biological system perspective.
Figure 8

Analysis of topological properties of OCT4-miRNA-targets in mouse PPIN.

(A) Comparison of OCT4-miRNA and OCT4-non-miRNA targets. (B) Comparison of OCT4-miRNA and non-OCT4-non-miRNA targets.

Figure 9

Analysis of topological properties of SOX2-miRNA-targets in mouse PPIN.

(A) Comparison of SOX2-miRNA and SOX2-non-miRNA targets. (B) Comparison of SOX2-miRNA and non-SOX2-non-miRNA targets.

Figure 10

Analysis of topological properties of NANOG-miRNA-targets in mouse PPIN.

(A) Comparison of NANOG-miRNA and NANOG-non-miRNA targets. (B) Comparison of NANOG-miRNA and non-NANOG-non-miRNA targets.

Analysis of topological properties of OCT4-miRNA-targets in mouse PPIN.

(A) Comparison of OCT4-miRNA and OCT4-non-miRNA targets. (B) Comparison of OCT4-miRNA and non-OCT4-non-miRNA targets.

Analysis of topological properties of SOX2-miRNA-targets in mouse PPIN.

(A) Comparison of SOX2-miRNA and SOX2-non-miRNA targets. (B) Comparison of SOX2-miRNA and non-SOX2-non-miRNA targets.

Analysis of topological properties of NANOG-miRNA-targets in mouse PPIN.

(A) Comparison of NANOG-miRNA and NANOG-non-miRNA targets. (B) Comparison of NANOG-miRNA and non-NANOG-non-miRNA targets.

Discussion

In this study, we globally analyzed the topological properties of targets of the “core” pluripotency TFs in PPINs, including OCT4, SOX2 and NANOG. In addition, the post-transciptional effects of miRNAs on these TFs were also analyzed in both human and mouse. Up to ten topological properties were included in the analysis process, including Shortest Path Length, Betweeness, Closeness, Cluster Coefficient, Degree, Eccentricity, Neighborhood Connectivity, Radiality, Stress and Topological Coefficient. All the above analyses were processed in three protein-protein interaction datasets (HPRD, human BioGRID and mouse BioGRID), two miRNA target datasets (miRecords and TarBase) and two species (human and mouse). Though there were different dataset scales of the three “core” TF targets between different databases and species, the common characteristic of these three “core” TFs in biological networks were still observed in our research. The use of several data sources and measurements in this study ensures the robust nature of the results obtained. Besides, all the above analysis was based on ESCs datasets. Consider the similar property of “stemness” with ESCs in many other types of stem cells, especially induced pluripotent stem cells, we infer that these three “core” TFs will have similar roles and characteristics in biological networks. With the increasing of high throughput datasets about targets of the “core” pluripotency TFs in other types of stem cells, similar research should be processed and compared with current results. We found significant differences in centrality properties between “core” pluripotency TF-targets and non-TF-targets in PPINs. These results were widespread in HPRD, human BioGRID and mouse BioGRID datasets. The numbers of centrality properties were 6 and 8 in human and mouse respectively. The former contained ASPL, BC, Degree, Eccentricity, Radiality and Stress, while the latter comprised ASPL, BC, Closeness, Degree, NC, Radiality, Stress and TC. Comparing the two results, we found that ASPL, BC, Degree, Radiality and Stress were robust and only CC among the 10 measurements could not be detected, which is used to judge the close link of node neighborhoods in biological networks. The reason why CC did not appear in the analysis was not clear. It may be because the targets of “core” pluripotency TFs perform their functions in relative isolation which may help to avoid harm to the complex environment in vivo during the development of ESCs [36], [37]. With the higher central properties, these results indicate that targets of these three “core” TFs play more important roles than random genes during the development of ESCs. We found synergistic regulation of multiple “core” pluripotency TFs during the development of ESCs. As we hypothesized, no difference in topological properties was found between 1TF-targets, 2TF-targets and 3TF-targets in human PPINs, including HPRD and human BioGRID. The same tendency was also found in mouse PPINs from BioGRID. Based on these results, it can be seen that although the number of TFs regulating common target genes increases from 1 to 3, their centrality properties do not increase accordingly. This indicates that pluripotency related genes may be regulated by 1 or 2 or even 3 TFs, but the genes are no different from the biological system viewpoint. In other words, the number of TFs that regulate the common pluripotency genes is not important. This means that the seemingly unnecessary TFs may provide compensatory regulation in the molecular pathways of ESC development [38]. Such compensatory regulation will conversely enhance the status of the common targets in a biological process. Through the consistent regulation of these TFs, the maintenance of the pluripotency of ESCs is rendered more reliable. As an example, the synergistic regulation of histone deacetylase 1 (HDAC1) by OCT4, SOX2 and NANOG plays important roles not only in the development of ESCs but also in the growth of tumor cells [39]-[41]. The module properties within the inner “core” pluripotency TFs were detected through the analysis of shortest path length in PPINs. Through these experiments, we found that the shortest path length of targets regulated by common TFs were smaller than those of randomly selected background values. This indicates that the genes regulated by common “core” pluripotency TFs are in close communication in PPINs and that this will be helpful in synergistically and quickly maintaining the pluripotency of ESCs [42]. Next, the shortest path lengths of target genes regulated by multiple TFs were further analyzed. Results show that these genes also have smaller shortest path lengths between each other. Third, we took the analysis of the extent of contact between groups of different TF targets and found that they were different, especially when the group comprised the common targets regulated by three TFs. In summary, we identified module properties not only within the targets regulated by common or multiple “core” pluripotency TFs but also between the groups of targets regulated by different TFs. The genes within module may have close and quickly information flowing which will help them to implement the common function of maintaining pluripotency during the development of ESCs. We also found that miRNAs play important roles in the regulation of “core” pluripotency TFs during the development of ESCs as a way of post-transcriptional regulation in PPINs. The difference of centrality properties observed between TF-miRNA targets and TF-non-miRNA targets was found in both human and mouse PPINs. Further, the differences in topological properties between TF-miRNA targets and non-TF-non-miRNA target genes were more obvious in PPINs. These different centrality properties show different close extent and different importance in computational biological network view. Considering the effects that miRNAs impose to the “core” TFs on these properties in PPINs, it is observed that there is synergistic regulation between themselves. The synergistic regulation of “core” pluripotency TFs and miRNAs will enhance the function of targets. In consideration of the drastic effects of TFs and the minor regulation of miRNAs, the synergism of these three “core” TFs and miRNAs may help to achieve their aims of regulation about pluripotency, like one machine which has many gear wheel of different size. As an example, myocyte enhancer factor2 (MEF2) which is a target of both NANOG and miRNA, is an important transcription factor regulating the survival and development of many types of cells [43], [44]. One of its prominent functions is the control of gene transcription in cell differentiation. All of the other genes that regulated by NANOG and miRNAs (including Mxi1, Arid3b, Kit, Hoxa11, Hoxa7, Mef2c, Gja1, Myc, Hdac4 and Tmsb4x) are known to be related with stem cells. To further verify our results, we performed analysis of function categories with these ten targets in PIR and found that they are development proteins and related with transcription regulation (Table 10). Our results clearly reveal the effects of epigenetic regulations on the development of ESCs in biological networks, findings which are consistent with previous studies performed using molecular and cell technology [25], [45]. This finding may provide candidate pathway for deep detection about ESCs molecular mechanism from post transcriptional level.
Table 10

Function categories analysis about targets of NANOG and miRNAs in PIR.

TermP Value
Transcription regulation8.59E-05
Transcription9.74E-05
Dna-binding7.75E-04
Proto-oncogene0.004849
Nucleus0.005432
DNA binding0.010331
Developmental protein0.048812
Sub-network of OCT4 in HPRD. (PDF) Click here for additional data file. Sub-network of SOX2 in HPRD. (PDF) Click here for additional data file. Sub-network of NANOG in HPRD. (PDF) Click here for additional data file. Sub-network of OCT4 in human BioGRID. (PDF) Click here for additional data file. Sub-network of SOX2 in human BioGRID. (PDF) Click here for additional data file. Sub-network of NANOG in human BioGRID. (PDF) Click here for additional data file. Sub-network of OCT4 in mouse BioGRID. (PDF) Click here for additional data file. Sub-network of SOX2 in mouse BioGRID. (PDF) Click here for additional data file. Sub-network of NANOG in mouse BioGRID. (PDF) Click here for additional data file. Targets of “core” pluripotency TFs in the human. (XLS) Click here for additional data file. Targets of “core” pluripotency TFs in the mouse. (XLS) Click here for additional data file.
  51 in total

Review 1.  Network biology: understanding the cell's functional organization.

Authors:  Albert-László Barabási; Zoltán N Oltvai
Journal:  Nat Rev Genet       Date:  2004-02       Impact factor: 53.242

2.  Topological analysis and interactive visualization of biological networks and protein structures.

Authors:  Nadezhda T Doncheva; Yassen Assenov; Francisco S Domingues; Mario Albrecht
Journal:  Nat Protoc       Date:  2012-03-15       Impact factor: 13.491

3.  Chipping away at the embryonic stem cell network.

Authors:  Stuart H Orkin
Journal:  Cell       Date:  2005-09-23       Impact factor: 41.582

4.  Core transcriptional regulatory circuitry in human embryonic stem cells.

Authors:  Laurie A Boyer; Tong Ihn Lee; Megan F Cole; Sarah E Johnstone; Stuart S Levine; Jacob P Zucker; Matthew G Guenther; Roshan M Kumar; Heather L Murray; Richard G Jenner; David K Gifford; Douglas A Melton; Rudolf Jaenisch; Richard A Young
Journal:  Cell       Date:  2005-09-23       Impact factor: 41.582

5.  Directly reprogrammed fibroblasts show global epigenetic remodeling and widespread tissue contribution.

Authors:  Nimet Maherali; Rupa Sridharan; Wei Xie; Jochen Utikal; Sarah Eminli; Katrin Arnold; Matthias Stadtfeld; Robin Yachechko; Jason Tchieu; Rudolf Jaenisch; Kathrin Plath; Konrad Hochedlinger
Journal:  Cell Stem Cell       Date:  2007-06-07       Impact factor: 24.633

Review 6.  Integrating post-transcriptional regulation into the embryonic stem cell gene regulatory network.

Authors:  Paul A Cassar; William L Stanford
Journal:  J Cell Physiol       Date:  2012-02       Impact factor: 6.384

7.  Highly efficient miRNA-mediated reprogramming of mouse and human somatic cells to pluripotency.

Authors:  Frederick Anokye-Danso; Chinmay M Trivedi; Denise Juhr; Mudit Gupta; Zheng Cui; Ying Tian; Yuzhen Zhang; Wenli Yang; Peter J Gruber; Jonathan A Epstein; Edward E Morrisey
Journal:  Cell Stem Cell       Date:  2011-04-08       Impact factor: 24.633

8.  MicroRNA regulation of cell lineages in mouse and human embryonic stem cells.

Authors:  Kathryn N Ivey; Alecia Muth; Joshua Arnold; Frank W King; Ru-Fang Yeh; Jason E Fish; Edward C Hsiao; Robert J Schwartz; Bruce R Conklin; Harold S Bernstein; Deepak Srivastava
Journal:  Cell Stem Cell       Date:  2008-03-06       Impact factor: 24.633

9.  Microarray analysis of LTR retrotransposon silencing identifies Hdac1 as a regulator of retrotransposon expression in mouse embryonic stem cells.

Authors:  Judith Reichmann; James H Crichton; Monika J Madej; Mary Taggart; Philippe Gautier; Jose Luis Garcia-Perez; Richard R Meehan; Ian R Adams
Journal:  PLoS Comput Biol       Date:  2012-04-26       Impact factor: 4.475

10.  Multipotent cell lineages in early mouse development depend on SOX2 function.

Authors:  Ariel A Avilion; Silvia K Nicolis; Larysa H Pevny; Lidia Perez; Nigel Vivian; Robin Lovell-Badge
Journal:  Genes Dev       Date:  2003-01-01       Impact factor: 11.361

View more
  1 in total

Review 1.  SALL4, the missing link between stem cells, development and cancer.

Authors:  Hiro Tatetsu; Nikki R Kong; Gao Chong; Giovanni Amabile; Daniel G Tenen; Li Chai
Journal:  Gene       Date:  2016-02-16       Impact factor: 3.688

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.