| Literature DB >> 28690426 |
Abstract
The increasing size of Big Data is often heralded but how data are transformed and represented is also profoundly important to knowledge discovery, and this is exemplified in Big Graph analytics. Much attention has been placed on the scale of the input graph but the product of a graph algorithm can be many times larger than the input. This is true for many graph problems, such as listing all triangles in a graph. Enabling scalable graph exploration for Big Graphs requires new approaches to algorithms, architectures, and visual analytics. A brief tutorial is given to aid the argument for thoughtful representation of data in the context of graph analysis. Then a new algebraic method to reduce the arithmetic operations in counting and listing triangles in graphs is introduced. Additionally, a scalable triangle listing algorithm in the MapReduce model will be presented followed by a description of the experiments with that algorithm that led to the current largest and fastest triangle listing benchmarks to date. Finally, a method for identifying triangles in new visual graph exploration technologies is proposed.Entities:
Keywords: Graph; MapReduce; parallel programming; scalable algorithms; triangle counting; visual analytics
Year: 2016 PMID: 28690426 PMCID: PMC5480623 DOI: 10.1177/1473871616666393
Source DB: PubMed Journal: Inf Vis ISSN: 1473-8716 Impact factor: 0.956
Figure 1.Thing of Things. (a) Sub-network. (b) List of binary relationships.
Figure 2.The full Thing.
Figure 3.Graphviz “force-directed” layout.
Figure 4.Graphviz “scalexy” layout.
Figure 5.MATLAB Spy Plot.
Figure 6.Spectral Partitioning.
|
|
|---|
|
|
| |
| emit |
| |
| emit |
| |
|
|
| |
| emit |
| |
|
|
| |
| emit |
| |
|
|
| emit |
| |
| emit |
| |
|
|
|
|
| |
| emit |
| |
SNAP graphs (http://www.snap.stanford.edu).
|
| 2 |
| |
|---|---|---|---|
| AstroPh | 18,772 | 396,160 | 1,351,441 |
| AS-Skitter | 1,696,415 | 22,190,596 | 28,769,868 |
| LiveJournal | 3,997,962 | 69,362,386 | 177,820,130 |
| Orkut | 3,072,441 | 234,370,174 | 627,584,181 |
| Friendster | 65,608,366 | 3,612,134,278 | 4,173,724,142 |
Figure 7.Triangle listing results for SNAP datasets.
Graph500 RMAT graphs (, , ).
|
| 2 |
| |
|---|---|---|---|
| RMAT 24 | 16,777,216 | 268,435,456 | 2,127,854,845 |
| RMAT 26 | 67,108,864 | 1,073,741,824 | 9,838,965,401 |
| RMAT 28 | 268,435,456 | 4,294,967,296 | 44,970,850,296 |
| RMAT 30 | 1,073,741,824 | 17,179,869,184 | 203,333,933,183 |
| RMAT 32 | 4,294,967,296 | 68,719,476,736 | <11,068,947,204,458 |
Figure 8.Triangle listing results for Graph500 datasets.