RBSEP: a reassignment and buffer based streaming edge partitioning approach

نویسندگانHamid Saadatfar
نشریهJournal of Big Data
شماره صفحات1-17
شماره سریال6
شماره مجلد1
نوع مقالهFull Paper
تاریخ انتشار2019
رتبه نشریهISI
نوع نشریهچاپی
کشور محل چاپایران
نمایه نشریهScopus

چکیده مقاله

In recent years, the rapid growth of the Internet has led to creation of massively large graphs. Since databases have become very large nowadays, they cannot be processed by a simple machine at an acceptable time anymore; therefore, traditional graph partitioning methods, which are often based on having a complete image of the entire graph, are not applicable to large datasets. This challenge has led to the appearance of a new approach called streaming graph partitioning. In streaming graph partitioning, a stream of input data is received by a partitioner, and partitioner decides which computational machine the data should be transferred to. Often, streaming partitioner does not have any information about the whole graph, and usually distributes the vertices based on some greedy heuristics which may not be optimal for incoming vertices. Hence, partitioner’s decision can be significantly improved if more information about the graph is utilized. In this paper, we present a new vertex-cut streaming graph partitioning approach. The proposed method uses the idea of postponing the decision for some of the edges (by means of an intelligent buffering) and corrects some of the past decisions to improve the quality of the graph partitioning. The proposed approach is evaluated using from real-world graphs. The experimental results show that the performance of the proposed method is superior in comparison with the previous HDRF method.

لینک ثابت مقاله

tags: Graph partitioning, Streaming partitioning, Edge partitioning, Big Data Processing