Accelerating Core Decomposition in Complex Network on GPUs

doi:10.13328/j.cnki.jos.005627

微信服务号

微信订阅号

Home > Archive>Volume 31, Issue 4, 2020 >1225-1239. DOI:10.13328/j.cnki.jos.005627

PDF HTML XML Export Cite reminder

Accelerating Core Decomposition in Complex Network on GPUs
DOI:
                        10.13328/j.cnki.jos.005627
                    
Author:
                        
                        
                    
Affiliation:
Clc Number:TP311
Fund Project:National Key R&D Program of China (2018YFB0803600); National Natural Science Foundation of China (61702488, 61732020)

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

To analysis the complex networks, core decomposition is a basic and efficient strategy to distinguish the relative importance of nodes and to discover a special family of core subgraphs in networks. After core decomposition, every node in each k-core subgraph connects to other k neighbor nodes internally. The core decomposition has been widely applied in several application scenarios, e.g., user behavior analysis in social networks, visualization of complex networks, static analysis in large system software project, etc. With the increasing scale and complexity of networks, existing works, which mostly focus on the multi-core CPU-based implementation of core decomposition, cannot satisfy the high performance of core decomposition in large-scale complex networks. Meanwhile, GPU provides not only massive parallelism degree (up to 10 000 threads) but also efficient memory I/O bandwidth (approximately 100 GB/s), which makes it an excellent hardware platform for large graph structure analytic, such as BFS (breadth first search), SSSP (single source shortest path) algorithms. This study proposes two strategies to enhance the parallel performance of core decomposition on GPU-based platform. An algorithm, RLCore, is first presented which exploits GPU-based bottom-up traverse approach and recursively distinguishes the core levels of nodes by considering their degree and edges. Then, a second optimal algorithm is proposed to improve performance and scalability, namely ESCore, based on the locality property of core decomposition. In ESCore, nodes gather and update their core level values from their neighbors, until there is no update among nodes. Compared to RLCore, ESCore strategy reduces the synchronization overhead from multi-thread contention when scaling to massive parallelism, whereas the iteration number of ESCore is depended on the convergence of nodes. From the evaluation results, two proposed acceleration algorithms achieve maximum 4.06 billion TEPS (traversed edges per second), which corresponds to up to 33.6X speedup compared to a single threaded CPU execution.

Reference

Cited by

Get Citation

张珩,崔强,侯朋朋,武延军,赵琛.面向GPU平台的复杂网络core分解方法研究.软件学报,2020,31(4):1225-1239

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:July 14,2017
Revised:September 01,2017
Adopted:
Online: April 16,2020
Published: April 06,2020

You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address：4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code：100190
Phone：010-62562563 Fax：010-62562533 Email：jos@iscas.ac.cn
Technical Support：Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063

微信服务号

微信订阅号

Get Citation

Share

Article Metrics

History