| Literature DB >> 25866846 |
Zhiyuan Lin1, Minsuk Kahng1, Kaeser Md Sabrin1, Duen Horng Polo Chau1, Ho Lee2, U Kang2.
Abstract
Graph computation approaches such as GraphChi and TurboGraph recently demonstrated that a single PC can perform efficient computation on billion-node graphs. To achieve high speed and scalability, they often need sophisticated data structures and memory management strategies. We propose a minimalist approach that forgoes such requirements, by leveraging the fundamental memory mapping (MMap) capability found on operating systems. We contribute: (1) a new insight that MMap is a viable technique for creating fast and scalable graph algorithms that surpasses some of the best techniques; (2) the design and implementation of popular graph algorithms for billion-scale graphs with little code, thanks to memory mapping; (3) extensive experiments on real graphs, including the 6.6 billion edge YahooWeb graph, and show that this new approach is significantly faster or comparable to the highly-optimized methods (e.g., 9.5× faster than GraphChi for computing PageRank on 1.47B edge Twitter graph). We believe our work provides a new direction in the design and development of scalable algorithms. Our packaged code is available at http://poloclub.gatech.edu/mmap/.Entities:
Year: 2014 PMID: 25866846 PMCID: PMC4389765 DOI: 10.1109/BigData.2014.7004226
Source DB: PubMed Journal: Proc IEEE Int Conf Big Data