TY - GEN
T1 - Load balanced clustering coefficients
AU - Green, Oded
AU - Munguía, Lluís Miquel
AU - Bader, David A.
PY - 2014/2/16
Y1 - 2014/2/16
N2 - Clustering coefficients is a building block in network sciences that offers insights on how tightly bound vertices are in a network. Effective and scalable parallelization of clustering coefficients requires load balancing amongst the cores. This property is not easy to achieve since many real world networks are scale free, which leads to some vertices requiring more attention than others. In this work we show two scalable approaches that load balance clustering coefficients. The first method achieves optimal load balancing with an O(|E|) storage requirement. The second method has a lower storage requirement of O(|V |) at the cost of some imbalance. While both methods have a similar time complexity, they represent a tradeoff between maintaining a balanced workload and memory complexity. Using a 40-core system we show that our load balancing techniques outperform the widely used and simple parallel approach by a factor of 3X-7.5X for real graphs and 1.5X-4X for random graphs. Further, we achieve 25X - 35X speedup over the sequential algorithm for most of the graphs.
AB - Clustering coefficients is a building block in network sciences that offers insights on how tightly bound vertices are in a network. Effective and scalable parallelization of clustering coefficients requires load balancing amongst the cores. This property is not easy to achieve since many real world networks are scale free, which leads to some vertices requiring more attention than others. In this work we show two scalable approaches that load balance clustering coefficients. The first method achieves optimal load balancing with an O(|E|) storage requirement. The second method has a lower storage requirement of O(|V |) at the cost of some imbalance. While both methods have a similar time complexity, they represent a tradeoff between maintaining a balanced workload and memory complexity. Using a 40-core system we show that our load balancing techniques outperform the widely used and simple parallel approach by a factor of 3X-7.5X for real graphs and 1.5X-4X for random graphs. Further, we achieve 25X - 35X speedup over the sequential algorithm for most of the graphs.
KW - Graph algorithms
KW - Parallel algorithms
KW - Scalable programming
KW - Social network analysis
UR - https://www.scopus.com/pages/publications/84897481823
UR - https://www.scopus.com/pages/publications/84897481823#tab=citedBy
U2 - 10.1145/2567634.2567635
DO - 10.1145/2567634.2567635
M3 - Conference contribution
AN - SCOPUS:84897481823
SN - 9781450326544
T3 - PPAA 2014 - Proceedings of the 2014 Workshop on Parallel Programming for Analytics Applications
SP - 3
EP - 10
BT - PPAA 2014 - Proceedings of the 2014 Workshop on Parallel Programming for Analytics Applications
PB - Association for Computing Machinery
T2 - 1st Workshop on Parallel Programming for Analytics Applications, PPAA 2014
Y2 - 16 February 2014 through 16 February 2014
ER -