| Literature DB >> 33191981 |
Michele Ianni1, Elio Masciari2, Giancarlo Sperlí2.
Abstract
The pervasive diffusion of Social Networks (SN) produced an unprecedented amount of heterogeneous data. Thus, traditional approaches quickly became unpractical for real life applications due their intrinsic properties: large amount of user-generated data (text, video, image and audio), data heterogeneity and high speed generation rate. More in detail, the analysis of user generated data by popular social networks (i.e Facebook (https://www.facebook.com/), Twitter (https://www.twitter.com/), Instagram (https://www.instagram.com/), LinkedIn (https://www.linkedin.com/)) poses quite intriguing challenges for both research and industry communities in the task of analyzing user behavior, user interactions, link evolution, opinion spreading and several other important aspects. This survey will focus on the analyses performed in last two decades on these kind of data w.r.t. the dimensions defined for Big Data paradigm (the so called Big Data 6 V's).Entities:
Keywords: Big Data; Centrality measure; Fake news; Social Network
Year: 2020 PMID: 33191981 PMCID: PMC7649712 DOI: 10.1007/s10844-020-00629-2
Source DB: PubMed Journal: J Intell Inf Syst ISSN: 0925-9902 Impact factor: 1.888
Fig. 1Network topologies
Fig. 2Degree centrality
Fig. 3Betwenness centrality
Fig. 4Closeness centrality
Fig. 5Eigen centrality
Fig. 6Page rank
Essential bibliography for every V
| V dimension | Bibliography | Main application field |
|---|---|---|
| Veracity | García Lozano et al. ( | Fake news detection |
| Bessi et al. ( | ||
| Lazer et al. ( | ||
| Sharma et al. ( | ||
| Shu et al. ( | ||
| Bondielli and Marcelloni ( | ||
| Castillo et al. ( | ||
| Mihalcea and Strapparava ( | ||
| Khan et al. ( | ||
| Kotteti et al. ( | ||
| Wu et al. ( | ||
| Kwon et al. ( | ||
| Wang and Terano ( | ||
| Reis et al. ( | ||
| Variety | Liu et al. ( | Influence analysis |
| Shen et al. ( | ||
| Corradini et al. ( | ||
| Fang et al. ( | ||
| Chen et al. ( | ||
| Tian et al. ( | ||
| Ianni et al. ( | ||
| Variability | Barbieri et al. ( | Behavioral analysis |
| Jacobs et al. ( | ||
| Clifton et al. ( | ||
| Ahmad et al. ( | ||
| Shi et al. ( | ||
| Zhang et al. ( | ||
| Cassavia et al. ( | ||
| Volume and velocity | Anagnostopoulos et al. ( | System design |
| Cassavia et al. ( | ||
| Value | Liu et al. ( | Artificial Intelligence |
| Bonchi et al. ( | ||
| Aral et al. ( | ||
| Myers et al. ( | ||
| Richardson and Domingos ( | ||
| Barbieri et al. ( | ||
| Aslay et al. ( | ||
| Lu et al. ( | ||
| Lou et al. ( | ||
| Li et al. ( | ||
| Cassavia et al. ( |
A snapshot of some SN statistics
| S.No | Source of data | No. of elements | Frequency |
|---|---|---|---|
| 1 | Tweets | More than 9,000 | Per Second |
| 2 | Facebook Updates | More than 41,000 | Per second |
| 3 | Emails | 2,398,54 Mails | Per second |
| 4 | Google Search | More than 40,000 | Per second |
| 5 | Youtube | 101,604 Videos | Per second |
| 6 | 2,000+ Photos | Per second | |
| 7 | Tumblr | More than 1964 Posts | Per second |
| 8 | Skype | More than 1,700 Posts | Per second |
Fig. 7A system for data collection
Fig. 8A system for data collection and transformation
Fig. 9A system for SN and Big Data collection for Healthcare
Fig. 10A snapshot of some SN statistics