TY - JOUR
T1 - A Density-Center-Based Automatic Clustering Algorithm for IoT Data Analysis
AU - Zhang, Tao
AU - Zhou, Meng Chu
AU - Guo, Xiwang
AU - Qi, Liang
AU - Abusorrah, Abdullah
N1 - Funding Information:
This work was s pported in part by the National Natural Science Foundation under Grant 61991413 and Grant 91948202; in part by the Science Fund for Creative Research Groups of the National Natural Science Foundation of China under Grant 61821005; and in part by the Deanship of Scientific Research (DSR) at King Abdulaziz University (KAU), Jeddah, Saudi Arabia, under Grant FP-52-43.
Publisher Copyright:
© 2014 IEEE.
PY - 2022/12/15
Y1 - 2022/12/15
N2 - With the rapid development of Internet of Things (IoT), much data has been produced, and new requirements have been posed for data mining. Clustering plays an essential role in discovering the underlying patterns of IoT data. It is widely used in health prognoses, pattern recognition, information retrieval, and computer vision. Density clustering is crucial to find arbitrary-shaped clusters and noise points without knowing the number of clusters in advance. However, its efficiency and applicability are reduced sharply when there exists mutual interference among parameters. In this article, a new algorithm called density-center-based automatic clustering (DAC) is proposed. First, this work presents a nonparametric density computing method. Second, it proposes to use an adaptive neighborhood whose radius is automatically calculated based on all the points in a data set. Finally, it selects appropriate density centers from a decision graph, which merge their surrounding points into the same groups. Experiments are conducted to show that DAC has higher accuracy than six classic and updated algorithms. Its effectiveness is shown via data from photovoltaic power and oil extraction systems. As an outstanding feature that its compared peers lack, it can determine parameters automatically. Thus this work greatly advances the state-of-the-art of clustering algorithms in the field of IoT data analysis.
AB - With the rapid development of Internet of Things (IoT), much data has been produced, and new requirements have been posed for data mining. Clustering plays an essential role in discovering the underlying patterns of IoT data. It is widely used in health prognoses, pattern recognition, information retrieval, and computer vision. Density clustering is crucial to find arbitrary-shaped clusters and noise points without knowing the number of clusters in advance. However, its efficiency and applicability are reduced sharply when there exists mutual interference among parameters. In this article, a new algorithm called density-center-based automatic clustering (DAC) is proposed. First, this work presents a nonparametric density computing method. Second, it proposes to use an adaptive neighborhood whose radius is automatically calculated based on all the points in a data set. Finally, it selects appropriate density centers from a decision graph, which merge their surrounding points into the same groups. Experiments are conducted to show that DAC has higher accuracy than six classic and updated algorithms. Its effectiveness is shown via data from photovoltaic power and oil extraction systems. As an outstanding feature that its compared peers lack, it can determine parameters automatically. Thus this work greatly advances the state-of-the-art of clustering algorithms in the field of IoT data analysis.
KW - Density clustering
KW - Internet of Things (IoT)
KW - density peaks
KW - neighborhood
KW - nonparametric
UR - http://www.scopus.com/inward/record.url?scp=85136122542&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85136122542&partnerID=8YFLogxK
U2 - 10.1109/JIOT.2022.3194886
DO - 10.1109/JIOT.2022.3194886
M3 - Article
AN - SCOPUS:85136122542
SN - 2327-4662
VL - 9
SP - 24682
EP - 24694
JO - IEEE Internet of Things Journal
JF - IEEE Internet of Things Journal
IS - 24
ER -