TY - GEN
T1 - High-Performance Truss Analytics in Arkouda
AU - Du, Zhihui
AU - Patchett, Joseph
AU - Rodriguez, Oliver Alvarado
AU - Li, Fuhuan
AU - Bader, David A.
N1 - Publisher Copyright:
© 2022 IEEE.
PY - 2022
Y1 - 2022
N2 - In graph analytics, a truss is a cohesive subgraph based on the number of triangles supporting each edge. It is widely used for community detection applications such as social networks and security analysis, and the performance of truss analytics highly depends on its triangle counting method. This paper proposes a novel triangle counting kernel named Minimum Search (MS). Minimum Search can select two smaller adjacency lists out of three and uses fine-grained parallelism to improve the performance of triangle counting. Then, two basic algorithms, MS-based triangle counting, and MS-based support updating are developed. Based on the novel triangle counting kernel and the two basic algorithms above, three fundamental parallel truss analytics algorithms are designed and implemented to enable different kinds of graph truss analysis. These truss algorithms include an optimized K-Truss algorithm, a Max-Truss algorithm, and a Truss Decomposition algorithm. Moreover, all proposed algorithms have been implemented in the parallel language Chapel and integrated into an open-source framework, Arkouda. Through Arkouda, data scientists can efficiently con-duct graph analysis through an easy-to-use Python interface and handle large-scale graph data in powerful back-end computing resources. Experimental results show that the proposed methods can significantly improve the performance of truss analysis on real-world graphs compared with the existing and widely adopted list intersection-based method. The implemented code is publicly available from GitHub (https://github.com/Bears-R-Us/arkouda-njit).
AB - In graph analytics, a truss is a cohesive subgraph based on the number of triangles supporting each edge. It is widely used for community detection applications such as social networks and security analysis, and the performance of truss analytics highly depends on its triangle counting method. This paper proposes a novel triangle counting kernel named Minimum Search (MS). Minimum Search can select two smaller adjacency lists out of three and uses fine-grained parallelism to improve the performance of triangle counting. Then, two basic algorithms, MS-based triangle counting, and MS-based support updating are developed. Based on the novel triangle counting kernel and the two basic algorithms above, three fundamental parallel truss analytics algorithms are designed and implemented to enable different kinds of graph truss analysis. These truss algorithms include an optimized K-Truss algorithm, a Max-Truss algorithm, and a Truss Decomposition algorithm. Moreover, all proposed algorithms have been implemented in the parallel language Chapel and integrated into an open-source framework, Arkouda. Through Arkouda, data scientists can efficiently con-duct graph analysis through an easy-to-use Python interface and handle large-scale graph data in powerful back-end computing resources. Experimental results show that the proposed methods can significantly improve the performance of truss analysis on real-world graphs compared with the existing and widely adopted list intersection-based method. The implemented code is publicly available from GitHub (https://github.com/Bears-R-Us/arkouda-njit).
KW - Graph Analytics
KW - K-Truss
KW - Parallel Algorithm
KW - Triangle Counting
UR - http://www.scopus.com/inward/record.url?scp=85158144691&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85158144691&partnerID=8YFLogxK
U2 - 10.1109/HiPC56025.2022.00026
DO - 10.1109/HiPC56025.2022.00026
M3 - Conference contribution
AN - SCOPUS:85158144691
T3 - Proceedings - 2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics, HiPC 2022
SP - 105
EP - 114
BT - Proceedings - 2022 IEEE 29th International Conference on High Performance Computing, Data, and Analytics, HiPC 2022
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 29th Annual IEEE International Conference on High Performance Computing, Data, and Analytics, HiPC 2022
Y2 - 18 December 2022 through 21 December 2022
ER -