| Literature DB >> 29921970 |
Heather Mattie1, Kenth Engø-Monsen2, Rich Ling3, Jukka-Pekka Onnela4.
Abstract
Understanding factors associated with tie strength in social networks is essential in a wide variety of settings. With the internet and cellular phones providing additional avenues of communication, measuring and inferring tie strength has become much more complex. We introduce the social bow tie framework, which consists of a focal tie and all actors connected to either or both of the two focal nodes on either side of the focal tie. We also define several intuitive and interpretable metrics that quantify properties of the bow tie which enable us to investigate associations between the strength of the "central" tie and properties of the bow tie. We combine the bow tie framework with machine learning to investigate what aspects of the bow tie are most predictive of tie strength in two very different types of social networks, a collection of medium-sized social networks from 75 rural villages in India and a nationwide call network of European mobile phone users. Our results show that tie strength depends not only on the properties of shared friends, but also on non-shared friends, those observable to only one person in the tie, hence introducing a fundamental asymmetry to social interaction.Entities:
Mesh:
Year: 2018 PMID: 29921970 PMCID: PMC6008360 DOI: 10.1038/s41598-018-27290-8
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1A simple example of the social bow tie G. The blue circle contains the nodes and edges that comprise the overlapping friendship circle of the focal nodes i and j, denoted g. The parts of the bow tie shaded in orange contain the individual (non-overlapping) social circles of the focal nodes, denoted g for node i and g for node j.
Descriptions of tie strength predictors.
| Predictor | Description |
|---|---|
|
| Sum of the degrees of |
|
| Absolute difference in the degrees of |
|
| Sum of the strengths of |
|
| Absolute difference in the strengths of |
|
| Sum of the clustering coefficients of |
|
| Absolute difference in the clustering coefficients of |
|
| Sum of the weighted clustering coefficients of |
|
| Absolute difference in the weighted clustering coefficients of |
|
| Sum of the ages of |
|
| Absolute difference in the ages of |
|
| Categorical variable indicating a male-male, female-female, or female-male tie |
|
| Indicator variable of a male-male tie |
|
| Indicator variable of a female-female tie |
|
| Indicator variable of a female-male tie |
|
| Indicator if i and j have the same billing zip code |
|
| Unweighted overlap of edge between |
|
| Weighted overlap of edge between |
|
| Number of common friends of |
|
| Number of edges among the common friends of |
|
| Sum of the number of nodes in |
|
| Absolute difference in the number of nodes in |
|
| Sum of the number of edges in |
|
| Absolute difference in the number of edges in |
Figure 2Accuracy and feature importance plots for the India social network. Accuracy, measured as the absolute difference between empirical tie strength (w) and predicted tie strength (), for Models 1–3 using both RF regression (R) and classification (C) after imputation is shown in (a). Feature importance using RF regression and classification after imputation are shown for Model 1 (b), Model 2 (c) and Model 3 (d). The horizontal bars represent how informative the predictor is with a longer bar meaning more informative. The black vertical line represents the value of an equilibrium or null importance if every predictor were equally informative.
Figure 3Accuracy and feature importance plots for the CDR call network with normalized (N) and averaged (A) tie strengths. Accuracy, measured as the absolute difference between empirical tie strength (y, z) and predicted tie strength (), for all three models using RF regression after imputation is shown in (a). Note that only one curve is visible for each strength measure since the accuracy of all three models is indistinguishable. Feature importance using RF regression after imputation are shown for Model 1 (b), Model 2 (c) and Model 3 (d).