Show simple item record

AuthorZhang, Honglei
AuthorRaitoharju, Jenni
AuthorKiranyaz, Serkan
AuthorGabbouj, Moncef
Available date2021-03-25T11:33:36Z
Publication Date2016
Publication NameJournal of Big Data
ResourceScopus
URIhttp://dx.doi.org/10.1186/s40537-016-0060-5
URIhttp://hdl.handle.net/10576/18015
AbstractGraph clustering is an important technique to understand the relationships between the vertices in a big graph. In this paper, we propose a novel random-walk-based graph clustering method. The proposed method restricts the reach of the walking agent using an inflation function and a normalization function. We analyze the behavior of the limited random walk procedure and propose a novel algorithm for both global and local graph clustering problems. Previous random-walk-based algorithms depend on the chosen fitness function to find the clusters around a seed vertex. The proposed algorithm tackles the problem in an entirely different manner. We use the limited random walk procedure to find attractor vertices in a graph and use them as features to cluster the vertices. According to the experimental results on the simulated graph data and the real-world big graph data, the proposed method is superior to the state-of-the-art methods in solving graph clustering problems. Since the proposed method uses the embarrassingly parallel paradigm, it can be efficiently implemented and embedded in any parallel computing environment such as a MapReduce framework. Given enough computing resources, we are capable of clustering graphs with millions of vertices and hundreds millions of edges in a reasonable time.
Languageen
PublisherSpringerOpen
TitleLimited random walk algorithm for big graph data clustering
TypeArticle
Issue Number1
Volume Number3


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record