TY - GEN
T1 - UCK-means
T2 - 2011 8th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2011, Jointly with the 2011 7th International Conference on Natural Computation, ICNC'11
AU - Peng, Yu
AU - Luo, Qinghua
AU - Peng, Xiyuan
PY - 2011
Y1 - 2011
N2 - Due to some reasons such as transmitting error or outdated or imprecise measurement, data uncertainty is an inherent property in wireless sensor networks or in LXI test framework. When we apply data mining techniques to these uncertain data, we must consider the uncertainty to get better data mining results. At present, most of uncertain data clustering methods assume the probability density functions or probability distribution function of whole data is available. However, in many real applications, this piece of information is rarely available. Only limited uncertain information may be available, such as the standard deviation. In this paper, we adopt a more realistic assumption that the standard deviation of individual measurement data is available, and propose a new uncertain distance computing method between multi-dimensional uncertain data. In addition, we propose an uncertain customized data clustering algorithm based on the classical K-means to process the multi-dimensional uncertain data. Experiment results show that the uncertain clustering algorithm can produce better results with lower complexity.
AB - Due to some reasons such as transmitting error or outdated or imprecise measurement, data uncertainty is an inherent property in wireless sensor networks or in LXI test framework. When we apply data mining techniques to these uncertain data, we must consider the uncertainty to get better data mining results. At present, most of uncertain data clustering methods assume the probability density functions or probability distribution function of whole data is available. However, in many real applications, this piece of information is rarely available. Only limited uncertain information may be available, such as the standard deviation. In this paper, we adopt a more realistic assumption that the standard deviation of individual measurement data is available, and propose a new uncertain distance computing method between multi-dimensional uncertain data. In addition, we propose an uncertain customized data clustering algorithm based on the classical K-means to process the multi-dimensional uncertain data. Experiment results show that the uncertain clustering algorithm can produce better results with lower complexity.
KW - Wireless Sensor Network
KW - data clustering
KW - data mining
KW - uncertain data
UR - https://www.scopus.com/pages/publications/80053398713
U2 - 10.1109/FSKD.2011.6019639
DO - 10.1109/FSKD.2011.6019639
M3 - 会议稿件
AN - SCOPUS:80053398713
SN - 9781612841816
T3 - Proceedings - 2011 8th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2011
SP - 1196
EP - 1200
BT - Proceedings - 2011 8th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2011
Y2 - 26 July 2011 through 28 July 2011
ER -