| Literature DB >> 28797047 |
Valerio Leone Sciabolazza1, Raffaele Vacca2, Therese Kennelly Okraku1, Christopher McCarty1.
Abstract
A growing body of evidence shows that collaborative teams and communities tend to produce the highest-impact scientific work. This paper proposes a new method to (1) Identify collaborative communities in longitudinal scientific networks, and (2) Evaluate the impact of specific research institutes, services or policies on the interdisciplinary collaboration between these communities. First, we apply community-detection algorithms to cross-sectional scientific collaboration networks and analyze different types of co-membership in the resulting subgroups over time. This analysis summarizes large amounts of longitudinal network data to extract sets of research communities whose members have consistently collaborated or shared collaborators over time. Second, we construct networks of cross-community interactions and estimate Exponential Random Graph Models to predict the formation of interdisciplinary collaborations between different communities. The method is applied to longitudinal data on publication and grant collaborations at the University of Florida. Results show that similar institutional affiliation, spatial proximity, transitivity effects, and use of the same research services predict higher degree of interdisciplinary collaboration between research communities. Our application also illustrates how the identification of research communities in longitudinal data and the analysis of cross-community network formation can be used to measure the growth of interdisciplinary team science at a research university, and to evaluate its association with research policies, services or institutes.Entities:
Mesh:
Year: 2017 PMID: 28797047 PMCID: PMC5552257 DOI: 10.1371/journal.pone.0182516
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Structural characteristics of union networks (G).
| Number of nodes | 4414 | 4038 | 3358 | 6414 |
| Number of edges | 11950 | 9783 | 8363 | 2.154 |
| Density | 0.0012 | 0.0012 | 0.0015 | 0.0010 |
| Modularity | 0.8411 | 0.8558 | 0.8466 | 0.7777 |
| Number of components | 237 | 264 | 260 | 220 |
| % of nodes in the giant component | 85.09 | 81.08 | 77.43 | 90.80 |
Notation summary.
| Approach | Graph | Edge weight |
|---|---|---|
| Cross-sectional | ||
| Inter-temporal | ||
Fig 1Extracting co-membership relationships from collaborative subgroups.
Red shaded areas are yearly collaborative subgroups. Red arrows indicate sequences identified as inter-temporal collaborative subgroups. Investigators A and B are in the same yearly collaborative subgroup for all the 3 years: g(A,B) = 3. However, only for 2 years are they in the same inter-temporal collaborative subgroup: g(A,B) = 1. Investigators C and D are in the same yearly collaborative subgroup for two consecutive years (g(C,D) = 2), but this is not an inter-temporal subgroup (g(C,D) = 0). Investigators D and E are in the same yearly collaborative subgroup for two non-consecutive years (g(D,E) = 2), but they are never in the same inter-temporal collaborative subgroup (g(D,E) = 0).
Main characteristics of collaborative subgroups in cross-sectional union networks.
| Mean (SD) | Mean (SD) | Mean (SD) | Mean (SD) | |
|---|---|---|---|---|
| Number of nodes (subgroup size) | 15.76 (43.16) | 13.19 (35.13) | 11.26 (28.84) | 25.05 (79.27) |
| Average degree in subgroup | 4.74 (4.65) | 4.31 (4.09) | 4.40 (4.31) | 1.96 (1.75) |
| Subgroup density | 0.78 (0.33) | 0.78 (0.32) | 0.80 (0.31) | 0.80 (0.32) |
| Subgroup diameter | 2.38 (2.89) | 2.32 (2.77) | 2.15 (2.49) | 2.32 (2.95) |
Fig 2Cross-sectional and inter-temporal co-membership networks.
Structural characteristics of co-membership networks.
| Cross-sectional co-membership networks | Inter-temporal co-membership networks | |||
|---|---|---|---|---|
| Number of nodes | 2919 | 1219 | 1672 | 535 |
| Number of edges | 11543 | 4186 | 5831 | 1627 |
| Density | 0.0027 | 0.0056 | 0.0042 | 0.0114 |
| Modularity | -0.0006 | 0.0020 | 0.7654 | 0.7828 |
| Number of components | 37 | 21 | 44 | 24 |
| % of nodes in the giant component | 95 | 94 | 89 | 82 |
Attributes of research communities.
| Cross-sectional co-membership communities | Inter-temporal co-membership communities | |||
|---|---|---|---|---|
| Mean (SD) | Mean (SD) | |||
| Size | 23.16 (52.70) | 6.44 (10.25) | 15.33 (29.14) | 10.91 (18.62) |
| Generalized variance of department affiliation | 0.50 (0.3099) | 0.5486 (0.23) | 0.58 (0.26) | 0.5575 (0.28) |
| Generalized variance of college affiliation | 0.33 (0.28) | 0.31 (0.26) | 0.37 (0.23) | 0.34 (0.26) |
| Within-community density | 0.60 (0.32) | 0.67 (0.24) | 0.64 (0.27) | 0.58 (0.30) |
| Bridging | 1.44 (3.64) | 1.51 (3.80) | 0.31 (0.60) | 1.31 (3.56) |
| N | 189 | 126 | 109 | 49 |
Structural characteristics of cross-community collaboration networks.
| Cross-sectional co-membership networks | Inter-temporal co-membership networks | |||
|---|---|---|---|---|
| Number of nodes | 189 | 126 | 109 | 49 |
| Number of edges | 580 | 356 | 230 | 52 |
| Average degree | 9.20 | 8.47 | 6.3303 | 3.18 |
| Density | 0.0016 | 0.0022 | 0.0002 | 0.0022 |
| Modularity | 0.60 | 0.39 | 0.39 | 0.66 |
| Number of components | 36 | 54 | 61 | 28 |
| % of nodes in the giant component | 78 | 55 | 42 | 30 |
| % of isolates | 16 | 40 | 52 | 48 |
Fig 3Networks of collaborations between research communities.
Each node is a community, an edge represents above-median density of collaborations between communities (). Node size represents the number of investigators in the community. Node colors represents the modal disciplinary area of investigators in the community (Blue = Health sciences, Red = Engineering, Green = Agricultural and Food Sciences, Black = College of Liberal Arts and Sciences, White = Other).
ERGM results.
| Dep. Var.: Link in network | ||||
|---|---|---|---|---|
| Cross-sectional co-membership networks | Inter-temporal co-membership networks | |||
| Edges | -4.7159 | -3.9486 | -3.6029 | -4.1178 |
| Building ( | -0.0093 | 0.0093 | 0.0058 | 0.056 |
| Floor ( | 0.1634 | 0.0026 | 0.0431 | 0.0237 |
| Department variance: difference | -0.3728 | -0.0842 | -1.3272 | -1.0146 |
| College variance: difference | -0.4484 | -0.2754 | -0.2292 | 0.959 |
| Density: | -0.3017 | -1.2022 | -0.7617 | -0.2674 |
| Bridging measure: difference | 0.0047 | -0.0309 | -0.0733 | -0.4779 |
| Number of nodes: difference | -0.0025 | 0.0057 | -0.0044 | -0.0354 |
| CTSI: both low | -0.2105 | -0.0236 | -0.0224 | 0.5112 |
| Isolates | 0.0917 | 1.3767 | 1.2412 | 0.4748 |
| AIC | 2433.3995 | 1101.6016 | 685.0428 | 216.6608 |
| N. Obs. | 189 | 126 | 109 | 49 |
Note: ERGM estimated coefficients and standard errors (in parentheses) are reported.
Coefficients are in log–odds.
*, **, *** indicate statistical significance at the 10, 5 and 1 percent levels.
Fig 4Goodness of fit.